A decision-theoretic generalization of on-line learning and an application to boosting

被引:13129
作者
Freund, Y
Schapire, RE
机构
[1] AT and T Labs, Florham Park, NJ 07932
关键词
D O I
10.1006/jcss.1997.1504
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In the first part of the paper we consider the problem of dynamically apportioning resources among a set of options in a worst-case on-line framework. The model we study can be interpreted as a broad, abstract extension of the well-studied on-line prediction model to a general decision-theoretic setting. We show that the multiplicative weight-update Littlestone-Warmuth rule can be adapted to this model, yielding bounds that are slightly weaker in some cases, but applicable to a considerably more general class of learning problems. We show how the resulting learning algorithm can be applied to a variety of problems, including gambling, multiple-outcome prediction, repeated games, and prediction of points in R-n. In the second part of the paper we apply the multiplicative weight-update technique to derive a new boosting algorithm. This boosting algorithm does not require any prior knowledge about the performance of the weak learning algorithm, We also study generalizations of the new boosting algorithm to the problem of learning functions whose range, rather than being binary, is an arbitrary finite set or a bounded segment of the real line. (C) 1997 Academic Press.
引用
收藏
页码:119 / 139
页数:21
相关论文
共 27 条
[1]  
[Anonymous], 1982, ESTIMATION DEPENDENC
[2]   What Size Net Gives Valid Generalization? [J].
Baum, Eric B. ;
Haussler, David .
NEURAL COMPUTATION, 1989, 1 (01) :151-160
[3]  
Blackwell D., 1956, PAC J MATH, V6, P1, DOI [DOI 10.2140/PJM.1956.6.1, 10.2140/pjm.1956.6.1]
[4]  
Breiman L, 1996, rapport technique n 460
[5]  
CESABIANCHI N, 1993, 25TH P ANN ACM S THE, P382
[6]  
Chung T. H., 1994, Proceedings of the Seventh Annual ACM Conference on Computational Learning Theory, COLT 94, P183, DOI 10.1145/180139.181097
[7]  
Cover T. M., 1991, Mathematical Finance, V1, P1, DOI [https://doi.org/10.1111/j.1467-9965.1991.tb00002.x, DOI 10.1111/J.1467-9965.1991.TB00002.X, 10.1111/j.1467-9965.1991.tb00002.x]
[8]  
Dietterich T. G., 1995, Journal of Artificial Intelligence Research, V2, P263
[9]  
Drucker H., 1993, International Journal of Pattern Recognition and Artificial Intelligence, V7, P705, DOI 10.1142/S0218001493000352
[10]  
DRUCKER H, 1996, ADV NEURAL INFORM PR, V8