最近邻搜索
计算机科学
笛卡尔积
线性子空间
矢量量化
子空间拓扑
欧几里德距离
最佳垃圾箱优先
k-最近邻算法
量化(信号处理)
可扩展性
算法
点积
模式识别(心理学)
人工智能
数学
离散数学
几何学
数据库
作者
H. Jegou,Matthijs Douze,Calvin F. Schmid
标识
DOI:10.1109/tpami.2010.57
摘要
This paper introduces a product quantization-based approach for approximate nearest neighbor search. The idea is to decompose the space into a Cartesian product of low-dimensional subspaces and to quantize each subspace separately. A vector is represented by a short code composed of its subspace quantization indices. The euclidean distance between two vectors can be efficiently estimated from their codes. An asymmetric version increases precision, as it computes the approximate distance between a vector and a code. Experimental results show that our approach searches for nearest neighbors efficiently, in particular in combination with an inverted file system. Results for SIFT and GIST image descriptors show excellent search accuracy, outperforming three state-of-the-art approaches. The scalability of our approach is validated on a data set of two billion vectors.
科研通智能强力驱动
Strongly Powered by AbleSci AI