A neural network architecture for visual selection

被引：17

作者：

Amit, Y ^{[1
]}

机构：

[1] Univ Chicago, Dept Stat, Chicago, IL 60637 USA

来源：

NEURAL COMPUTATION | 2000年 / 12卷 / 05期

关键词：

D O I：

10.1162/089976600300015538

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This article describes a parallel neural net architecture for efficient and robust visual selection in generic gray-level images. Objects are represented through flexible star-type planar arrangements of binary local features which are in turn star-type planar arrangements of oriented edges. Candidate locations are detected over a range of scales and other deformations, using a generalized Hough transform. The flexibility of the arrangements provides the required invariance. Training involves selecting a small number of stable local features from a predefined pool, which are well localized on registered examples of the object. Training therefore requires only small data sets. The parallel architecture is constructed so that the Hough transform associated with any object can be implemented without creating ou modifying any connections. The different object representations are learned and stored in a central module. When one of these representations is evoked, it "primes" the appropriate layers in the network so that the corresponding Hough transform is computed. Analogies between the different layers in the network and those in the visual system are discussed. Furthermore, the model can be used to explain certain experiments on visual selection reported in the literature.

引用

页码：1141 / 1164

页数：24

共 26 条

[1] Task difficulty and the specificity of perceptual learning [J].