AN ITERATIVE GROWING AND PRUNING ALGORITHM FOR CLASSIFICATION TREE DESIGN

被引:110
作者
GELFAND, SB
RAVISHANKAR, CS
DELP, EJ
机构
[1] Computer Vision and Image Processing Laboratory, School of Electrical Engineering, Purdue University, West Lafayette, IN
关键词
CLASSIFICATION TREE; ERROR RATE ESTIMATION; GRAPH THEORY (TREES); PATTERN RECOGNITION; TREE GROWING; TREE PRUNING;
D O I
10.1109/34.67645
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A critical issue in classification tree design is obtaining right-sized trees, i.e., trees which neither underfit nor overfit the data. Instead of using stopping rules to halt partitioning, we follow the approach of growing a large tree with pure terminal nodes and selectively pruning it back. A new efficient iterative method is proposed to grow and prune classification trees. This method divides the data sample into two subsets and iteratively grows a tree with one subset and prunes it with the other subset, successively interchanging the roles of the two subsets. The convergence and other properties of the algorithm are established. Theoretical and practical considerations suggest that the iterative tree growing and pruning algorithm should perform better and require less computation than other widely used tree growing and pruning algorithms. Numerical results on a waveform recognition problem are presented to support this view.
引用
收藏
页码:163 / 174
页数:12
相关论文
共 33 条
[1]  
ANDERSON AC, 1979, TREE7931 PURD U TECH
[2]   AN AUTOMATED APPROACH TO THE DESIGN OF DECISION TREE CLASSIFIERS [J].
ARGENTIERO, P ;
CHIN, R ;
BEAUDET, P .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1982, 4 (01) :51-57
[3]  
ATLAS L, 1989, P IEEE C SYSTEMS MAN, P915
[4]   SELECTIVE RADIANT TEMPERATURE MAPPING USING A LAYERED CLASSIFIER [J].
BARTOLUCCI, LA ;
SWAIN, PH ;
WU, C .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 1976, 14 (02) :101-106
[5]  
Breiman L, 2017, CLASSIFICATION REGRE, P368, DOI 10.1201/9781315139470
[6]   DECISION TREE DESIGN USING A PROBABILISTIC MODEL [J].
CASEY, RG ;
NAGY, G .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1984, 30 (01) :93-99
[7]   OPTIMAL PRUNING WITH APPLICATIONS TO TREE-STRUCTURED SOURCE-CODING AND MODELING [J].
CHOU, PA ;
LOOKABAUGH, T ;
GRAY, RM .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1989, 35 (02) :299-315
[8]  
DATTATREYA GR, 1980, 5TH P INT C PATT REC, P1212
[9]  
FRIEDMAN JH, 1977, IEEE T COMPUT, V26, P404, DOI 10.1109/TC.1977.1674849
[10]   DECISION TREE DESIGN FROM A COMMUNICATION-THEORY STANDPOINT [J].
GOODMAN, RM ;
SMYTH, P .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1988, 34 (05) :979-994