Learning Credible Models

被引:31
作者
Wang, Jiaxuan [1 ]
Oh, Jeeheh [1 ]
Wang, Haozhu [1 ]
Wiens, Jenna [1 ]
机构
[1] Univ Michigan, Ann Arbor, MI 48109 USA
来源
KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING | 2018年
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
Model Interpretability; Regularization; CLOSTRIDIUM-DIFFICILE INFECTION; SELECTION CONSISTENCY;
D O I
10.1145/3219819.3220070
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In many settings, it is important that a model be capable of providing reasons for its predictions (i.e., the model must be interpretable). However, the model's reasoning may not conform with well-established knowledge. In such cases, while interpretable, the model lacks credibility. In this work, we formally define credibility in the linear setting and focus on techniques for learning models that are both accurate and credible. In particular, we propose a regularization penalty, expert yielded estimates (EYE), that incorporates expert knowledge about well-known relationships among covariates and the outcome of interest. We give both theoretical and empirical results comparing our proposed method to several other regularization techniques. Across a range of settings, experiments on both synthetic and real data show that models learned using the EYE penalty are significantly more credible than those learned using other penalties. Applied to two large-scale patient risk stratification task, our proposed technique results in a model whose top features overlap significantly with known clinical risk factors, while still achieving good predictive performance.
引用
收藏
页码:2417 / 2426
页数:10
相关论文
共 38 条
[1]  
Altendorf Eric E, 2012, ARXIV12071364
[2]  
[Anonymous], 2012, CoRR
[3]  
[Anonymous], 2014, ARXIV14094005
[4]   MONOTONICITY MAINTENANCE IN INFORMATION-THEORETIC MACHINE LEARNING ALGORITHMS [J].
BENDAVID, A .
MACHINE LEARNING, 1995, 19 (01) :29-43
[5]   Weighted Lasso with Data Integration [J].
Bergersen, Linn Cecilie ;
Glad, Ingrid K. ;
Lyng, Heidi .
STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2011, 10 (01)
[6]   The Convex Geometry of Linear Inverse Problems [J].
Chandrasekaran, Venkat ;
Recht, Benjamin ;
Parrilo, Pablo A. ;
Willsky, Alan S. .
FOUNDATIONS OF COMPUTATIONAL MATHEMATICS, 2012, 12 (06) :805-849
[7]   GRAM: Graph-based Attention Model for Healthcare Representation Learning [J].
Choi, Edward ;
Bahadori, Mohammad Taha ;
Song, Le ;
Stewart, Walter F. ;
Sun, Jimeng .
KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, :787-795
[8]   Development and Validation of a Clostridium difficile Infection Risk Prediction Model [J].
Dubberke, Erik R. ;
Yan, Yan ;
Reske, Kimberly A. ;
Butler, Anne M. ;
Doherty, Joshua ;
Pham, Victor ;
Fraser, Victoria J. .
INFECTION CONTROL AND HOSPITAL EPIDEMIOLOGY, 2011, 32 (04) :360-366
[9]   A clinical risk index for Clostridium difficile infection in hospitalised patients receiving broad-spectrum antibiotics [J].
Garey, K. W. ;
Dao-Tran, T. K. ;
Jiang, Z. D. ;
Price, M. P. ;
Gentry, L. O. ;
DuPont, H. L. .
JOURNAL OF HOSPITAL INFECTION, 2008, 70 (02) :142-147
[10]   PhysioBank, PhysioToolkit, and PhysioNet - Components of a new research resource for complex physiologic signals [J].
Goldberger, AL ;
Amaral, LAN ;
Glass, L ;
Hausdorff, JM ;
Ivanov, PC ;
Mark, RG ;
Mietus, JE ;
Moody, GB ;
Peng, CK ;
Stanley, HE .
CIRCULATION, 2000, 101 (23) :E215-E220