Regularization and semi-supervised learning on large graphs

被引:296
作者
Belkin, M [1 ]
Matveeva, I [1 ]
Niyogi, P [1 ]
机构
[1] Univ Chicago, Dept Comp Sci, Chicago, IL 60637 USA
来源
LEARNING THEORY, PROCEEDINGS | 2004年 / 3120卷
关键词
D O I
10.1007/978-3-540-27819-1_43
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider the problem of labeling a partially labeled graph. This setting may arise in a number of situations from survey sampling to information retrieval to pattern recognition in manifold settings. It is also of potential practical importance, when the data is abundant, but labeling is expensive or requires human assistance. Our approach develops a framework for regularization on such graphs. The algorithms are very simple and involve solving a single, usually sparse, system of linear equations. Using the notion of algorithmic stability, we derive bounds on the generalization error and relate it to structural invariants of the graph. Some experimental results testing the performance of the regularization algorithm and the usefulness of the generalization bound are presented.
引用
收藏
页码:624 / 638
页数:15
相关论文
共 17 条
[1]  
[Anonymous], 2001, LEARNING LABELED UNL
[2]  
BELKIN M, 2003, ADV NEURAL INFORMATI, V15
[3]  
BOUSQUET O, 2001, ADV NEURAL INFORMATI, V13
[4]  
CHAPELLE O, ADV NEURAL INFORMATI, V15
[5]   DISTRIBUTION-FREE INEQUALITIES FOR THE DELETED AND HOLDOUT ERROR ESTIMATES [J].
DEVROYE, LP ;
WAGNER, TJ .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1979, 25 (02) :202-207
[6]  
Fan Chung R. K., 1997, REGIONAL C SERIES MA, V92
[7]  
FIEDLER M, 1973, CZECH MATH J, V23, P298
[8]  
Harville D.A., 1998, Matrix algebra from a statistician's perspective
[9]  
Joachims T, 1999, MACHINE LEARNING, PROCEEDINGS, P200
[10]   Approximation algorithms for classification problems with pairwise relationships:: Metric labeling and Markov random fields [J].
Kleinberg, J ;
Tardos, É .
JOURNAL OF THE ACM, 2002, 49 (05) :616-639