Query-specific signature selection for efficient k -nearest neighbour approximation

Youngki Park, Heasoo Hwang, Sang Goo Lee

Research output: Contribution to journalArticlepeer-review

Abstract

Finding k-nearest neighbours (k-NN) is one of the most important primitives of many applications such as search engines and recommendation systems. However, its computational cost is extremely high when searching for k-NN points in a huge collection of high-dimensional points. Locality-sensitive hashing (LSH) has been introduced for an efficient k-NN approximation, but none of the existing LSH approaches clearly outperforms others. We propose a novel LSH approach, Signature Selection LSH (S2LSH), which finds approximate k-NN points very efficiently in various datasets. It first constructs a large pool of highly diversified signature regions with various sizes. Given a query point, it dynamically generates a query-specific signature region by merging highly effective signature regions selected from the signature pool. We also suggest S2LSH-M, a variant of S2LSH, which processes multiple queries more efficiently by using query-specific features and optimization techniques. Extensive experiments show the performance superiority of our approaches in diverse settings.

Original languageEnglish
Pages (from-to)440-457
Number of pages18
JournalJournal of Information Science
Volume43
Issue number4
DOIs
StatePublished - 1 Aug 2017

Keywords

  • k -nearest neighbour graph
  • k -nearest neighbour search
  • locality-sensitive hashing

Fingerprint

Dive into the research topics of 'Query-specific signature selection for efficient k -nearest neighbour approximation'. Together they form a unique fingerprint.

Cite this