Decision diagrams in machine learning: an empirical study on real-life credit-risk data

被引:20
作者
Mues, C
Baesens, B
Files, CA
Vanthienen, J
机构
[1] Katholieke Univ Leuven, Dept Appl Econ Sci, B-3000 Louvain, Belgium
[2] Univ Southampton, Sch Management, Southampton SO17 1BJ, Hants, England
关键词
decision diagrams; data mining; credit scoring;
D O I
10.1016/j.eswa.2004.02.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Decision trees are a widely used knowledge representation in machine learning. However, one of their main drawbacks is the inherent replication of isomorphic subtrees, as a result of which the produced classifiers might become too large to be comprehensible by the human experts that have to validate them. Alternatively, decision diagrams, a generalization of decision trees taking on the form of a rooted, acyclic digraph instead of a tree, have occasionally been suggested as a potentially more compact representation. Their application in machine learning has nonetheless been criticized, because the theoretical size advantages of subgraph sharing did not always directly materialize in the relatively scarce reported experiments on real-world data. Therefore, in this paper, starting from a series of rule sets extracted from three real-life credit-scoring data sets, we will empirically assess to what extent decision diagrams are able to provide a compact visual description. Furthermore, we will investigate the practical impact of finding a good attribute ordering on the achieved size savings. (C) 2004 Elsevier Ltd. All rights reserved.
引用
收藏
页码:257 / 264
页数:8
相关论文
共 22 条
[1]  
[Anonymous], 1993, P 13 INT JOINT C ART
[2]   Benchmarking state-of-the-art classification algorithms for credit scoring [J].
Baesens, B ;
Van Gestel, T ;
Viaene, S ;
Stepanova, M ;
Suykens, J ;
Vanthienen, J .
JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2003, 54 (06) :627-635
[3]   Using neural network rule extraction and decision tables for credit-risk evaluation [J].
Baesens, B ;
Setiono, R ;
Mues, C ;
Vanthienen, J .
MANAGEMENT SCIENCE, 2003, 49 (03) :312-329
[4]  
BRYANT RE, 1986, IEEE T COMPUT, V35, P677, DOI 10.1109/TC.1986.1676819
[5]  
Craven MW, 1996, ADV NEUR IN, V8, P24
[6]  
DAVIS RH, 1992, IMA J MATH APPL BUSI, V4, P43
[7]   A comparison of neural networks and linear scoring models in the credit union environment [J].
Desai, VS ;
Crook, JN ;
Overstreet, GA .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1996, 95 (01) :24-37
[8]  
ELOMAA T, 2001, LECT NOTES ARTIF INT, V2167, P133
[9]  
FILES C, 2000, NEW FUNCTIONAL DECOM
[10]   FINDING THE OPTIMAL VARIABLE ORDERING FOR BINARY DECISION DIAGRAMS [J].
FRIEDMAN, SJ ;
SUPOWIT, KJ .
IEEE TRANSACTIONS ON COMPUTERS, 1990, 39 (05) :710-713