Learning Bayesian classifiers for scene classification with a visual grammar

被引:111
作者
Aksoy, S [1 ]
Koperski, K
Tusk, C
Marchisio, G
Tilton, JC
机构
[1] Bilkent Univ, Dept Comp Engn, TR-06800 Ankara, Turkey
[2] Insightful Corp, Seattle, WA 98109 USA
[3] NASA Goddard Space Flight Ctr, Greenbelt, MD USA
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2005年 / 43卷 / 03期
关键词
data fusion; image classification; image segmentation; spatial relationships; visual grammar;
D O I
10.1109/TGRS.2004.839547
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
A challenging problem in image content extraction and classification is building a system that automatically learns high-level semantic interpretations of images. We describe a Bayesian framework for a visual grammar that aims to reduce the gap between low-level features and high-level user semantics. Our approach includes modeling image pixels using automatic fusion of their spectral, textural, and other ancillary attributes; segmentation of image regions using an iterative split-and-merge algorithm; and representing scenes by decomposing them into prototype regions and modeling the interactions between these regions in terms of their spatial relationships. Naive Bayes classifiers are used in the learning of models for region segmentation an classification using positive and negative examples for user-defined semantic land cover labels. The system also automatically learns representative region groups that can distinguish different scenes and builds visual grammar models. Experiments using Landsat scenes show that the visual grammar enables creation of high-level classes that cannot be modeled by individual pixels or regions. Furthermore, learning of the classifiers requires only a few training examples.
引用
收藏
页码:581 / 589
页数:9
相关论文
共 21 条
[1]  
AKSOY S, 2003, P IEEE GRSS WORKSH A, P212
[2]  
AKSOY S, 2002, P IEEE INT GEOSC REM, V2, P1041
[3]  
Aksoy Selim, 2003, FRONTIERS REMOTE SEN, P35, DOI DOI 10.1142/9789812796752_0003
[4]  
BISOP CM, 1995, NEURAL NETWORKS PATT
[5]   Knowledge-based image retrieval with spatial and temporal constructs [J].
Chu, WW ;
Hsu, CC ;
Cárdenas, AF ;
Taira, RK .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1998, 10 (06) :872-888
[6]  
DeGroot M., 1970, OPTIMAL STAT DECISIO
[7]  
Geiger D, 1997, ANN STAT, V25, P1344
[8]   Rotation-invariant texture classification using a complete space-frequency model [J].
Haley, GM ;
Manjunath, BS .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 1999, 8 (02) :255-269
[9]  
Haralick R. M., 1992, COMPUTER ROBOT VISIO
[10]   TEXTURAL FEATURES FOR IMAGE CLASSIFICATION [J].
HARALICK, RM ;
SHANMUGAM, K ;
DINSTEIN, I .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1973, SMC3 (06) :610-621