Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI

被引:4754
作者
Barredo Arrieta, Alejandro [1 ]
Diaz-Rodriguez, Natalia [2 ,3 ]
Del Ser, Javier [1 ,4 ,5 ]
Bennetot, Adrien [2 ,3 ,6 ,7 ]
Tabik, Siham [8 ]
Barbado, Alberto [9 ]
Garcia, Salvador [8 ]
Gil-Lopez, Sergio [1 ]
Molina, Daniel [8 ]
Benjamins, Richard [9 ]
Chatila, Raja [7 ]
Herrera, Francisco [8 ]
机构
[1] TECNALIA, P Tecnol,Ed 700, Derio 48160, Bizkaia, Spain
[2] Inst Polytech Paris, ENSTA, Palaiseau, France
[3] INRIA, Flowers Team, Palaiseau, France
[4] Univ Basque Country, UPV EHU, Bilbao 48013, Spain
[5] Basque Ctr Appl Math, Bilbao 48009, Bizkaia, Spain
[6] Segula Technol, Parc Activite Pissaloup, Trappes, France
[7] Sorbonne Univ, Inst Syst Intelligents & Robot, Paris, France
[8] Univ Granada, DaSCI Andalusian Inst Data Sci & Computat Intelli, E-18071 Granada, Spain
[9] Telefonica, Madrid 28050, Spain
关键词
Explainable Artificial Intelligence; Machine Learning; Deep Learning; Data Fusion; Interpretability; Comprehensibility; Transparency; Privacy; Fairness; Accountability; Responsible Artificial Intelligence; GENERALIZED ADDITIVE-MODELS; TRAINED NEURAL-NETWORKS; SUPPORT VECTOR MACHINES; RULE EXTRACTION; LOGISTIC-REGRESSION; DATA FUSION; BLACK-BOX; DECISION TREES; BIG DATA; FEATURE-SELECTION;
D O I
10.1016/j.inffus.2019.12.012
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the last few years, Artificial Intelligence (AI) has achieved a notable momentum that, if harnessed appropriately, may deliver the best of expectations over many application sectors across the field. For this to occur shortly in Machine Learning, the entire community stands in front of the barrier of explainability, an inherent problem of the latest techniques brought by sub-symbolism (e.g. ensembles or Deep Neural Networks) that were not present in the last hype of AI (namely, expert systems and rule based models). Paradigms underlying this problem fall within the so-called eXplainable AI (XAI) field, which is widely acknowledged as a crucial feature for the practical deployment of AI models. The overview presented in this article examines the existing literature and contributions already done in the field of XAI, including a prospect toward what is yet to be reached. For this purpose we summarize previous efforts made to define explainability in Machine Learning, establishing a novel definition of explainable Machine Learning that covers such prior conceptual propositions with a major focus on the audience for which the explainability is sought. Departing from this definition, we propose and discuss about a taxonomy of recent contributions related to the explainability of different Machine Learning models, including those aimed at explaining Deep Learning methods for which a second dedicated taxonomy is built and examined in detail. This critical literature analysis serves as the motivating background for a series of challenges faced by XAI, such as the interesting crossroads of data fusion and explainability. Our prospects lead toward the concept of Responsible Artificial Intelligence, namely, a methodology for the large-scale implementation of AI methods in real organizations with fairness, model explainability and accountability at its core. Our ultimate goal is to provide newcomers to the field of XAI with a thorough taxonomy that can serve as reference material in order to stimulate future research advances, but also to encourage experts and professionals from other disciplines to embrace the benefits of AI in their activity sectors, without any prior bias for its lack of interpretability.
引用
收藏
页码:82 / 115
页数:34
相关论文
共 393 条
[41]   Network Dissection: Quantifying Interpretability of Deep Visual Representations [J].
Bau, David ;
Zhou, Bolei ;
Khosla, Aditya ;
Oliva, Aude ;
Torralba, Antonio .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :3319-3327
[42]   Visual Feature Attribution using Wasserstein GANs [J].
Baumgartner, Christian F. ;
Koch, Lisa M. ;
Tezcan, Kerem Can ;
Ang, Jia Xi ;
Konukoglu, Ender .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :8309-8319
[43]  
Belle V, 2017, PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P5116
[44]   Knowledge-aware Autoencoders for Explainable Recommender Systems [J].
Bellini, Vito ;
Schiavone, Angelo ;
Di Noia, Tommaso ;
Ragone, Azzurra ;
Di Sciascio, Eugenio .
PROCEEDINGS OF THE 3RD WORKSHOP ON DEEP LEARNING FOR RECOMMENDER SYSTEMS (DLRS), 2018, :24-31
[45]   Are artificial neural networks black boxes? [J].
Benitez, JM ;
Castro, JL ;
Requena, I .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1997, 8 (05) :1156-1164
[46]  
Benjamins R., 2019, RESPONSIBLE DESIGN
[47]  
Bennetot A., 2019, NESY WORKSHOP IJCAI
[48]   Bankruptcy prediction by generalized additive models [J].
Berg, Daniel .
APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2007, 23 (02) :129-143
[49]   Statistical Procedures for Forecasting Criminal Behavior A Comparative Assessment [J].
Berk, Richard A. ;
Bleich, Justin .
CRIMINOLOGY & PUBLIC POLICY, 2013, 12 (03) :513-+
[50]   Algorithms and applications for approximate nonnegative matrix factorization [J].
Berry, Michael W. ;
Browne, Murray ;
Langville, Amy N. ;
Pauca, V. Paul ;
Plemmons, Robert J. .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 52 (01) :155-173