Product Quantization for Nearest Neighbor Search

Cited by: 2018
Authors
Jegou, Herve [1]
Douze, Matthijs [2]
Schmid, Cordelia [2]
Affiliations
[1] INRIA Rennes, F-35042 Rennes, France
[2] INRIA Rhone Alpes, F-38334 Saint Ismier, France
Keywords
High-dimensional indexing; image indexing; very large databases; approximate search; OBJECT; SCENE;
DOI
10.1109/TPAMI.2010.57
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper introduces a product quantization-based approach for approximate nearest neighbor search. The idea is to decompose the space into a Cartesian product of low-dimensional subspaces and to quantize each subspace separately. A vector is represented by a short code composed of its subspace quantization indices. The Euclidean distance between two vectors can be efficiently estimated from their codes. An asymmetric version increases precision, as it computes the approximate distance between a vector and a code. Experimental results show that our approach searches for nearest neighbors efficiently, in particular in combination with an inverted file system. Results for SIFT and GIST image descriptors show excellent search accuracy, outperforming three state-of-the-art approaches. The scalability of our approach is validated on a data set of two billion vectors.
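The encoding and asymmetric distance steps described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes plain Lloyd's k-means as the per-subspace quantizer, splits vectors into equal contiguous chunks, and omits the inverted file system entirely. All function names (`train_pq`, `encode`, `adc_table`, `adc_distance`) and the parameters `m` (number of subspaces) and `k` (centroids per subspace) are hypothetical choices made for illustration.

```python
import random

def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length sequences."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def kmeans(points, k, iters=10, seed=0):
    """Plain Lloyd's k-means (an assumed quantizer); returns k centroids."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]
    for _ in range(iters):
        buckets = [[] for _ in range(k)]
        for p in points:
            j = min(range(k), key=lambda c: sq_dist(p, centroids[c]))
            buckets[j].append(p)
        for j, bucket in enumerate(buckets):
            if bucket:  # keep the old centroid if a cluster empties
                centroids[j] = [sum(col) / len(bucket) for col in zip(*bucket)]
    return centroids

def split(vec, m):
    """Cut a vector into m contiguous subvectors of equal length."""
    step = len(vec) // m
    return [vec[i * step:(i + 1) * step] for i in range(m)]

def train_pq(data, m, k):
    """Learn one codebook of k centroids for each of the m subspaces."""
    return [kmeans([split(v, m)[i] for v in data], k, seed=i) for i in range(m)]

def encode(vec, codebooks):
    """Short code: the index of the nearest centroid in each subspace."""
    m = len(codebooks)
    return [min(range(len(cb)), key=lambda c: sq_dist(sub, cb[c]))
            for sub, cb in zip(split(vec, m), codebooks)]

def adc_table(query, codebooks):
    """Squared distances from each query subvector to every centroid."""
    m = len(codebooks)
    return [[sq_dist(sub, c) for c in cb]
            for sub, cb in zip(split(query, m), codebooks)]

def adc_distance(table, code):
    """Asymmetric estimate: the query stays exact, the database vector
    is replaced by its code; the result is a sum of m table lookups."""
    return sum(table[i][idx] for i, idx in enumerate(code))

if __name__ == "__main__":
    rng = random.Random(42)
    data = [[rng.gauss(0, 1) for _ in range(8)] for _ in range(200)]
    codebooks = train_pq(data, m=4, k=16)
    codes = [encode(v, codebooks) for v in data]
    table = adc_table(data[0], codebooks)
    ranked = sorted(range(len(codes)),
                    key=lambda i: adc_distance(table, codes[i]))
    print("approximate nearest neighbors of vector 0:", ranked[:5])
```

In this sketch each database vector is stored as `m` small integers, and a query needs only one lookup table of `m * k` entries, after which every distance estimate costs `m` additions. The asymmetric variant leaves the query unquantized, which is why the abstract reports higher precision for it than for the code-to-code estimate.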
Pages: 117-128
Page count: 12