A neural network architecture for visual selection

被引:17
作者
Amit, Y [1 ]
机构
[1] Univ Chicago, Dept Stat, Chicago, IL 60637 USA
关键词
D O I
10.1162/089976600300015538
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article describes a parallel neural net architecture for efficient and robust visual selection in generic gray-level images. Objects are represented through flexible star-type planar arrangements of binary local features which are in turn star-type planar arrangements of oriented edges. Candidate locations are detected over a range of scales and other deformations, using a generalized Hough transform. The flexibility of the arrangements provides the required invariance. Training involves selecting a small number of stable local features from a predefined pool, which are well localized on registered examples of the object. Training therefore requires only small data sets. The parallel architecture is constructed so that the Hough transform associated with any object can be implemented without creating ou modifying any connections. The different object representations are learned and stored in a central module. When one of these representations is evoked, it "primes" the appropriate layers in the network so that the corresponding Hough transform is computed. Analogies between the different layers in the network and those in the visual system are discussed. Furthermore, the model can be used to explain certain experiments on visual selection reported in the literature.
引用
收藏
页码:1141 / 1164
页数:24
相关论文
共 26 条
[1]   Task difficulty and the specificity of perceptual learning [J].
Ahissar, M ;
Hochstein, S .
NATURE, 1997, 387 (6631) :401-406
[2]   LEARNING IN NEURAL NETWORKS WITH MATERIAL SYNAPSES [J].
AMIT, DJ ;
FUSI, S .
NEURAL COMPUTATION, 1994, 6 (05) :957-982
[3]   A computational model for visual selection [J].
Amit, Y ;
Geman, D .
NEURAL COMPUTATION, 1999, 11 (07) :1691-1715
[4]  
AMIT Y, 1998, DEFORMABLE TEMPLATE
[5]   GENERALIZING THE HOUGH TRANSFORM TO DETECT ARBITRARY SHAPES [J].
BALLARD, DH .
PATTERN RECOGNITION, 1981, 13 (02) :111-122
[6]   Slow stochastic Hebbian learning of classes of stimuli in a recurrent neural network [J].
Brunel, N ;
Carusi, F ;
Fusi, S .
NETWORK-COMPUTATION IN NEURAL SYSTEMS, 1998, 9 (01) :123-152
[7]  
BURL MC, 1995, P INT WORKSH AUT FAC, P154
[8]   A NEURAL BASIS FOR VISUAL-SEARCH IN INFERIOR TEMPORAL CORTEX [J].
CHELAZZI, L ;
MILLER, EK ;
DUNCAN, J ;
DESIMONE, R .
NATURE, 1993, 363 (6427) :345-347
[9]   Locating faces using statistical feature detectors [J].
Cootes, TF ;
Taylor, CJ .
PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, 1996, :204-209
[10]  
Desimone Robert, 1995, P475