Robust object recognition with cortex-like mechanisms

Cited by: 1057
Authors
Serre, Thomas
Wolf, Lior
Bileschi, Stanley
Riesenhuber, Maximilian
Poggio, Tomaso
Affiliations
[1] MIT, Ctr Biol & Computat Learning, McGovern Inst Brain Res, Cambridge, MA 02139 USA
[2] MIT, Brain & Cognit Sci Dept, Cambridge, MA 02139 USA
[3] Georgetown Univ, Med Ctr, Washington, DC 20007 USA
Keywords
object recognition; model; visual cortex; scene understanding; neural network
DOI
10.1109/TPAMI.2007.56
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We introduce a new general framework for the recognition of complex visual scenes, which is motivated by biology: we describe a hierarchical system that closely follows the organization of visual cortex and builds an increasingly complex and invariant feature representation by alternating between a template matching and a maximum pooling operation. We demonstrate the strength of the approach on a range of recognition tasks: from invariant single object recognition in clutter to multiclass categorization problems and complex scene understanding tasks that rely on the recognition of both shape-based and texture-based objects. Given the biological constraints that the system had to satisfy, the approach performs surprisingly well: it has the capability of learning from only a few training examples and competes with state-of-the-art systems. We also discuss the existence of a universal, redundant dictionary of features that could handle the recognition of most object categories. In addition to its relevance for computer vision, the success of this approach suggests a plausibility proof for a class of feedforward models of object recognition in cortex.
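The alternation between template matching and maximum pooling described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the Gaussian radial-basis response, the non-overlapping pooling grid, the toy 5x5 images, and the names `template_match`, `max_pool`, and `place` are all assumptions made for illustration only.

```python
import math


def template_match(image, template, sigma=1.0):
    """Template-matching stage (assumed Gaussian radial-basis response).

    Slides the template over the image and returns a map whose entries
    are 1.0 for a perfect match and decay toward 0 as the patch differs.
    """
    th, tw = len(template), len(template[0])
    out_h = len(image) - th + 1
    out_w = len(image[0]) - tw + 1
    response = []
    for r in range(out_h):
        row = []
        for c in range(out_w):
            dist2 = sum(
                (image[r + i][c + j] - template[i][j]) ** 2
                for i in range(th)
                for j in range(tw)
            )
            row.append(math.exp(-dist2 / (2.0 * sigma ** 2)))
        response.append(row)
    return response


def max_pool(response, size=2):
    """Max-pooling stage: maximum over non-overlapping size x size blocks."""
    pooled = []
    for r in range(0, len(response) - size + 1, size):
        row = []
        for c in range(0, len(response[0]) - size + 1, size):
            row.append(
                max(response[r + i][c + j] for i in range(size) for j in range(size))
            )
        pooled.append(row)
    return pooled


def place(template, top, left, size=5):
    """Hypothetical helper: paste the template into an all-zero square image."""
    image = [[0.0] * size for _ in range(size)]
    for i, row in enumerate(template):
        for j, value in enumerate(row):
            image[top + i][left + j] = value
    return image


# Toy template and two images containing it at slightly different positions.
template = [[1.0, 0.0], [0.0, 1.0]]
pooled_a = max_pool(template_match(place(template, 0, 0), template))
pooled_b = max_pool(template_match(place(template, 1, 1), template))

# Both shifts fall inside the same pooling block, so that block responds
# identically even though the raw response maps differ.
print(pooled_a[0][0], pooled_b[0][0])
```

In this toy setting the pooled unit covering the top-left block gives the same peak response for both placements, which is one way to read the "increasingly invariant" representation the abstract mentions. Stacking further match and pool stages on the pooled maps would follow the same pattern.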
Pages: 411-426
Page count: 16