CBSA: Content-based soft annotation for multimodal image retrieval using Bayes point machines

被引:243
作者
Chang, E [1 ]
Goh, K [1 ]
Sychay, G [1 ]
Wu, G [1 ]
机构
[1] Univ Calif Santa Barbara, Dept Comp Sci, Santa Barbara, CA 93106 USA
基金
美国国家科学基金会;
关键词
Bayes point machines (BPMs); image annotation; multimodal image retrieval; support vector machines (SVMs);
D O I
10.1109/TCSVT.2002.808079
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We propose a content-based soft annotation (CBSA) procedure for providing images with semantical labels. The annotation procedure starts with labeling a small set of training images, each with one single semantical label (e.g., forest, animal, or sky). An ensemble of binary classifiers is then trained for predicting label membership for images. The trained ensemble is applied to each individual image to give the image multiple soft labels, and each label is associated with a label membership factor. To select a base binary-classifier for CBSA, we experiment with two learning methods, support vector machines (SVMs) and Bayes point machines (BPMs), and compare their class-prediction accuracy. Our empirical study on a 116-category 25K-image set shows that the BPM-based ensemble provides better annotation quality than the SVM-based ensemble for supporting multimodal image retrievals.
引用
收藏
页码:26 / 38
页数:13
相关论文
共 52 条
[1]  
AGGARWAL CC, 2001, P 8 INT C DAT THEOR, P420
[2]   Reducing multiclass to binary: A unifying approach for margin classifiers [J].
Allwein, EL ;
Schapire, RE ;
Singer, Y .
JOURNAL OF MACHINE LEARNING RESEARCH, 2001, 1 (02) :113-141
[3]  
Barnard K, 2001, PROC CVPR IEEE, P434
[4]  
BARNARD K, 2001, P INT C COMP VIS, V2, P408, DOI DOI 10.1109/ICCV.2001.937654
[5]  
BENITEZ AB, 2002, P IEEE INT C MULT
[6]   Bagging predictors [J].
Breiman, L .
MACHINE LEARNING, 1996, 24 (02) :123-140
[7]  
BREIMAN L, 1997, ANN STAT, P801
[8]   A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167
[9]  
CHANG E, 2000, MEGA MAXIMIZING EXPE
[10]   Toward perception-based image retrieval [J].
Chang, EY ;
Li, BT ;
Li, C .
IEEE WORKSHOP ON CONTENT-BASED ACCESS OF IMAGE AND VIDEO LIBRARIES, PROCEEDINGS, 2000, :101-105