Nearest neighbors and vector models – part 2 – algorithms and data structures

1 · Erik Bernhardsson · Oct. 1, 2015, 4 a.m.
This is a blog post rewritten from a presentation at NYC Machine Learning on Sep 17. It covers a library called Annoy that I have built that helps you do nearest neighbor queries in high dimensional spaces. In the first part, I went through some examples of why vector models are useful. In the second part I will be explaining the data structures and algorithms that Annoy uses to do approximate nearest neighbor queries. Let’s start by going back to our point set. The goal is to find nearest neigh...