Incorporating expert knowledge when learning Bayesian network structure: A medical case study

被引:108
作者
Julia Flores, M. [1 ]
Nicholson, Ann E. [2 ]
Brunskill, Andrew [3 ]
Korb, Kevin B. [2 ]
Mascaro, Steven [4 ]
机构
[1] Univ Castilla La Mancha, Dept Sistemas Informat SIMD i3A, Albacete 02071, Spain
[2] Monash Univ, Fac Informat Technol, Clayton, Vic 3800, Australia
[3] Univ Washington, Dept Hlth Serv, Seattle, WA 98195 USA
[4] Bayesian Intelligence Pty Ltd, Clarinda, Vic 3169, Australia
关键词
Bayesian networks; Causal discovery; Structure learning; Expert priors; Medical datasets; Heart failure; CORONARY-HEART-DISEASE; PROBABILITIES; DISCOVERY; DIAGNOSIS;
D O I
10.1016/j.artmed.2011.08.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Objectives: Bayesian networks (BNs) are rapidly becoming a leading technology in applied Artificial Intelligence, with many applications in medicine. Both automated learning of BNs and expert elicitation have been Used to build these networks, but the potentially more useful combination of these two methods remains underexplored. In this paper we examine a number of approaches to their combination when learning structure and present new techniques for assessing their results. Methods and materials: Using public-domain medical data, we run an automated causal discovery system, CaMML, which allows the incorporation of multiple kinds of prior expert knowledge into its search, to test and compare unbiased discovery with discovery biased with different kinds of expert opinion. We use adjacency matrices enhanced with numerical and colour labels to assist with the interpretation of the results. We present an algorithm for generating a single BN from a set of learned BNs that incorporates user Preferences regarding complexity vs completeness. These techniques are presented as part of the first detailed workflow for hybrid structure learning within the broader knowledge engineering process. Results: The detailed knowledge engineering workflow is shown to be useful for structuring a complex iterative BN development process. The adjacency matrices make it clear that for our medical case study using the IOWA dataset, the simplest kind of prior information (partially sorting variables into tiers) was more effective in aiding model discovery than either using no prior information or using more sophisticated and detailed expert priors. The method for generating a single BN captures relationships that would be overlooked by other approaches in the literature. Conclusion: Hybrid causal learning of BNs is an important emerging technology. We present methods for incorporating it into the knowledge engineering process, including visualisation and analysis of the learned networks. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:181 / 204
页数:24
相关论文
共 52 条
[1]   A comparison of learning algorithms for Bayesian networks:: a case study based on data from an emergency medical service [J].
Acid, S ;
de Campos, LM ;
Fernández-Luna, JM ;
Rodríguez, S ;
Rodríguez, JM ;
Salcedo, JL .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2004, 30 (03) :215-232
[2]  
[Anonymous], 2005, Statistical and Inductive Inference by Minimum Message Length
[3]  
[Anonymous], 2002, The Delphi Method
[4]  
[Anonymous], 2012, DATA PREPROCESSING
[5]   Using literature and data to learn Bayesian networks as clinical models of ovarian tumors [J].
Antal, P ;
Fannes, G ;
Timmerman, D ;
Moreau, Y ;
De Moor, B .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2004, 30 (03) :257-281
[6]  
BEINLICH I, 1992, P 2 EUR C ART INT ME, P689
[7]   Matilda: A visual tool for modeling with Bayesian networks [J].
Boneh, T. ;
Nicholson, A. E. ;
Sonenberg, E. A. .
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2006, 21 (11) :1127-1150
[8]  
Boneh T., 2010, THESIS MONASH U
[9]  
BURNSIDE BE, 2000, P 2000 AMIA ANN S, P106
[10]   Priors on network structures. Biasing the search for Bayesian networks [J].
Castelo, R ;
Siebes, A .
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2000, 24 (01) :39-57