Statistical modeling and conceptualization of natural images

被引：43

作者：

Fan, JP

Gao, YL

Luo, HZ

Xu, GY

机构：

[1] Univ N Carolina, Dept Comp Sci, Charlotte, NC 28223 USA

[2] Tsinghua Univ, Dept Comp Sci, Beijing, Peoples R China

来源：

PATTERN RECOGNITION | 2005年 / 38卷 / 06期

基金：

中国国家自然科学基金;

关键词：

semantic image classification; salient object detection; adaptive EM algorithm; SVM;

D O I：

10.1016/j.patcog.2004.07.011

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Multi-level annotation of images is a promising solution to enable semantic image retrieval by using various keywords at different semantic levels. In this paper, we propose a multi-level approach to interpret and annotate the semantics of natural images by using both the dominant image components and the relevant semantic image concepts. In contrast to the well-known image-based and region-based approaches, we use the concept-sensitive salient objects as the dominant image components to achieve automatic image annotation at the content level. By using the concept-sensitive salient objects for image content representation and feature extraction, a novel image classification technique is developed to achieve automatic image annotation at the concept level. To detect the concept-sensitive salient objects automatically, a set of detection functions are learned from the labeled image regions by using support vector machine (SVM) classifiers with an automatic scheme for searching the optimal model parameters. To generate the semantic image concepts, the finite mixture models are used to approximate the class distributions of the relevant concept-sensitive salient objects. An adaptive EM algorithm has been proposed to determine the optimal model structure and model parameters simultaneously. In addition, a large number of unlabeled samples have been integrated with a limited number of labeled samples to achieve more effective classifier training and knowledge discovery. We have also demonstrated that our algorithms are very effective to enable multi-level interpretation and annotation of natural images. (c) 2004 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.

引用

页码：865 / 885

页数：21

共 45 条

[1]

ADAMS WH, 2003, EURASIP JASP, V2, P170

[2]

[Anonymous], 2003, ACMMM

[3]

[Anonymous], P ICML

[4]

[Anonymous], 2002, ECCV

[5] Matching words and pictures [J].

Barnard, K ;

Duygulu, P ;

Forsyth, D ;

de Freitas, N ;

Blei, DM ;

Jordan, MI .

JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (06) :1107-1135

[6]

Barnard K, 2001, EIGHTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOL II, PROCEEDINGS, P408, DOI 10.1109/ICCV.2001.937654

[7]

BENITEZ AB, 2000, P SPIE, V4210

[8] Automatic segmentation and classification of outdoor images using neural networks [J].

Campbell, NW ;

Thomas, BT ;

Troscianko, T .

INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 1997, 8 (01) :137-144

[9]

Carson C., 1997, REGION BASED IMAGE Q

[10]

Chang, 1997, IEEE MULTIMEDIA

← 1 2 3 4 5 →