On the use of neural network ensembles in QSAR and QSPR

被引:136
作者
Agrafiotis, DK [1 ]
Cedeño, W [1 ]
Lobanov, VS [1 ]
机构
[1] 3 Dimens Pharmaceut Inc, Exton, PA 19341 USA
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2002年 / 42卷 / 04期
关键词
D O I
10.1021/ci0203702
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Despite their growing popularity among neural network practitioners, ensemble methods have not been widely adopted in structure-activity and structure-property correlation. Neural networks are inherently unstable, in that small changes in the training set and/or training parameters can lead to large changes in their generalization performance. Recent research has shown that by capitalizing on the diversity of the individual models, ensemble techniques can minimize uncertainty and produce more stable and accurate predictors. In this work, we present a critical assessment of the most common ensemble technique known as bootstrap aggregation, or bagging, as applied to QSAR and QSPR. Although aggregation does offer definitive advantages, we demonstrate that bagging may not be the best possible choice and that simpler techniques such as retraining with the full sample can often produce superior results. These findings are rationalized using Krogh and Vedelsby's decomposition of the generalization error into a term that measures the average generalization performance of the individual networks and a term that measures the diversity among them. For networks that are designed to resist over-fitting. the benefits of aggregation are clear but not overwhelming.
引用
收藏
页码:903 / 911
页数:9
相关论文
共 51 条
[31]   VARIABLE SELECTION IN QSAR STUDIES .1. AN EVOLUTIONARY ALGORITHM [J].
KUBINYI, H .
QUANTITATIVE STRUCTURE-ACTIVITY RELATIONSHIPS, 1994, 13 (03) :285-294
[32]   Adaptive neuro-fuzzy inference system: An instant and architecture-free predictor for improved QSAR studies [J].
Loukas, YL .
JOURNAL OF MEDICINAL CHEMISTRY, 2001, 44 (17) :2772-2783
[33]   EVOLUTIONARY PROGRAMMING APPLIED TO THE DEVELOPMENT OF QUANTITATIVE STRUCTURE-ACTIVITY-RELATIONSHIPS AND QUANTITATIVE STRUCTURE-PROPERTY RELATIONSHIPS [J].
LUKE, BT .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1994, 34 (06) :1279-1287
[34]  
Maclin R., 1999, J. Artif. Int. Res., V11, P169, DOI DOI 10.1613/JAIR.614
[35]  
MACLIN R, 1997, P 14 NAT C ART INT P
[36]   PREDICTION OF RECEPTOR PROPERTIES AND BINDING-AFFINITY OF LIGANDS TO BENZODIAZEPINE/GABA(A) RECEPTORS USING ARTIFICIAL NEURAL NETWORKS [J].
MADDALENA, DJ ;
JOHNSTON, GAR .
JOURNAL OF MEDICINAL CHEMISTRY, 1995, 38 (04) :715-724
[37]   ANALYSIS OF LINEAR AND NONLINEAR QSAR DATA USING NEURAL NETWORKS [J].
MANALLACK, DT ;
ELLIS, DD ;
LIVINGSTONE, DJ .
JOURNAL OF MEDICINAL CHEMISTRY, 1994, 37 (22) :3758-3767
[38]  
Nilsson N. J., 1965, Series in Systems Science
[39]   NETWORKS FOR APPROXIMATION AND LEARNING [J].
POGGIO, T ;
GIROSI, F .
PROCEEDINGS OF THE IEEE, 1990, 78 (09) :1481-1497
[40]   APPLICATION OF GENETIC FUNCTION APPROXIMATION TO QUANTITATIVE STRUCTURE-ACTIVITY-RELATIONSHIPS AND QUANTITATIVE STRUCTURE-PROPERTY RELATIONSHIPS [J].
ROGERS, D ;
HOPFINGER, AJ .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1994, 34 (04) :854-866