Product quantization for nearest neighbor search

IEEE Transactions on Pattern Analysis & Machine Intelligence, Volume 33, Number 1 - jan 2011
Download the publication : jegou_searching_with_quantization.pdf [592Ko]  
This paper introduces a product quantization based approach for approximate nearest neighbor search. The idea is to decomposes the space into a Cartesian product of low dimensional subspaces and to quantize each subspace separately. A vector is represented by a short code composed of its subspace quantization indices. The Euclidean distance between two vectors can be efficiently estimated from their codes. An asymmetric version increases precision, as it computes the approximate distance between a vector and a code. Experimental results show that our approach searches for nearest neighbors efficiently, in particular in combination with an inverted file system. Results for SIFT and GIST image descriptors show excellent search accuracy outperforming three state-of-the-art approaches. The scalability of our approach is validated on a dataset of two billion vectors.

Images and movies

 

See also

This article is an extended version of this technical report. A related method is described in this ICASSP article. Code available here.

BibTex references

@Article{JDS11,
  author       = "Herv\'e J\'egou and Matthijs Douze and Cordelia Schmid",
  title        = "Product quantization for nearest neighbor search",
  journal      = "IEEE Transactions on Pattern Analysis \& Machine Intelligence",
  number       = "1",
  volume       = "33",
  pages        = "117--128",
  month        = "jan",
  year         = "2011",
  note         = "to appear",
  url          = "http://lear.inrialpes.fr/pubs/2011/JDS11"
}

Other publications by...