A NETWORK THAT LEARNS TO RECOGNIZE 3-DIMENSIONAL OBJECTS

Cited by: 617
Authors
POGGIO, T
EDELMAN, S
Affiliation
[1] Artificial Intelligence Laboratory, Center for Biological Information Processing, Massachusetts Institute of Technology, Cambridge
DOI
10.1038/343263a0
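Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification codes
07; 0710; 09
Abstract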
The visual recognition of three-dimensional (3-D) objects on the basis of their shape poses at least two difficult problems. First, there is the problem of variable illumination, which can be addressed by working with relatively stable features such as intensity edges rather than the raw intensity images (refs 1, 2). Second, there is the problem of the initially unknown pose of the object relative to the viewer. In one approach to this problem, a hypothesis is first made about the viewpoint, then the appearance of a model object from such a viewpoint is computed and compared with the actual image (refs 3-7). Such recognition schemes generally employ 3-D models of objects, but the automatic learning of 3-D models is itself a difficult problem (refs 8, 9). To address this problem in computational vision, we have developed a scheme, based on the theory of approximation of multivariate functions, that learns from a small set of perspective views a function mapping any viewpoint to a standard view. A network equivalent to this scheme will thus 'recognize' the object on which it was trained from any viewpoint. © 1990 Nature Publishing Group.
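The idea in the abstract, learning from a few views a function that maps any view to a standard view, can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not name the approximation scheme, so the Gaussian radial basis functions used here are an assumption, and the wireframe objects, training angles, kernel width `sigma`, and viewing distance `d` are all hypothetical values chosen for illustration.

```python
import math

def project(points, theta, d=5.0):
    """Rotate 3-D points about the vertical axis by theta (radians), then
    apply a pinhole perspective projection; return flattened 2-D coordinates."""
    c, s = math.cos(theta), math.sin(theta)
    view = []
    for x, y, z in points:
        xr, zr = c * x + s * z, -s * x + c * z
        scale = d / (d + zr)  # assumes every point stays in front of the camera
        view.extend([scale * xr, scale * y])
    return view

def gaussian(u, v, sigma):
    """Gaussian radial basis function of the distance between two views."""
    sq = sum((a - b) ** 2 for a, b in zip(u, v))
    return math.exp(-sq / (2.0 * sigma ** 2))

def solve(A, B):
    """Solve A X = B (B may have several columns) by Gaussian elimination
    with partial pivoting."""
    n = len(A)
    M = [list(A[i]) + list(B[i]) for i in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for k in range(col, len(M[r])):
                M[r][k] -= f * M[col][k]
    X = [None] * n
    for i in reversed(range(n)):
        X[i] = [(M[i][n + j] - sum(M[i][k] * X[k][j] for k in range(i + 1, n)))
                / M[i][i] for j in range(len(B[0]))]
    return X

def fit(train_views, standard, sigma):
    """One basis unit per training view; every training view is mapped
    to the same target, the standard view."""
    G = [[gaussian(u, v, sigma) for v in train_views] for u in train_views]
    targets = [standard for _ in train_views]
    return solve(G, targets)

def predict(view, centers, weights, sigma):
    """Network output: a weighted sum of basis-unit activations."""
    g = [gaussian(view, c, sigma) for c in centers]
    return [sum(g[i] * weights[i][j] for i in range(len(centers)))
            for j in range(len(weights[0]))]

def error(u, v):
    """Euclidean distance between two flattened views."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

# Hypothetical four-vertex objects (x, y, z); not data from the paper.
OBJECT = [(1.0, 0.0, 0.0), (0.0, 1.0, 0.5), (-0.5, -0.5, 1.0), (0.3, -0.8, -0.6)]
DISTRACTOR = [(-1.0, 0.8, 0.2), (0.9, -0.9, -0.3), (0.2, 0.9, -1.0), (-0.7, 0.1, 0.9)]

SIGMA = 0.5
standard = project(OBJECT, 0.0)
centers = [project(OBJECT, math.radians(a)) for a in (-60, -30, 0, 30, 60)]
weights = fit(centers, standard, SIGMA)

# A view of the trained object from an angle not seen during training,
# and a view of a different object, both compared against the standard view.
novel = project(OBJECT, math.radians(15))
err_novel = error(predict(novel, centers, weights, SIGMA), standard)
err_distractor = error(predict(project(DISTRACTOR, 0.0), centers, weights, SIGMA), standard)
print("novel view error:", err_novel)
print("distractor error:", err_distractor)
```

In this sketch, "recognition" amounts to thresholding the distance between the network output and the standard view: a new view of the trained object should land close to the standard view, while a view of a different object should not. The threshold itself would have to be chosen empirically.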
Pages: 263-266
Page count: 4
Related papers
30 in total
[1] Broomhead D. S., 1988, Complex Systems, V2, P321
[2] Edelman G.M., 1984, DYNAMIC ASPECTS NEOC, P635
[3] Edelman S., 1989, OPT NEWS, V15, P8
[4] EDELMAN S, 1989, MIT1138 ART INT LAB
[5] EDELMAN S, 1989, MIT1146 ART INT LAB
[6] Fan T. J., 1988, Second International Conference on Computer Vision (IEEE Cat. No.88CH2664-1), P474
[7] FISCHLER, MA; BOLLES, RC. RANDOM SAMPLE CONSENSUS - A PARADIGM FOR MODEL-FITTING WITH APPLICATIONS TO IMAGE-ANALYSIS AND AUTOMATED CARTOGRAPHY [J]. COMMUNICATIONS OF THE ACM, 1981, 24 (06): 381-395
[8] GRIMSON, WEL; LOZANO-PEREZ, T. LOCALIZING OVERLAPPING PARTS BY SEARCHING THE INTERPRETATION TREE [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1987, 9 (04): 469-482
[9] GROSS, CG; ROCHA-MIRANDA, CE; BENDER, DB. VISUAL PROPERTIES OF NEURONS IN INFEROTEMPORAL CORTEX OF MACAQUE [J]. JOURNAL OF NEUROPHYSIOLOGY, 1972, 35 (01): 96-&
[10] JENKINS W M, 1984, Society for Neuroscience Abstracts, V10, P665