
















































Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
LWR, Unsuperwised Learning, Kernel Density Estimate, Nearest Neighbor, Kernel Regression, Parametric, Locally Weighted Regression, Linear LWR, Complexity, Nearest Neighbor Search, Locality Sensitive Hashing, LSH, Hash Functions, Unsupervised Learning, Clustering, Gaussian Mixture, K-Means Clustering, K-Means, Optimization, Vector Quantization, Greg Shakhnarovich, Lecture Slides, Introduction to Machine Learning, Computer Science, Toyota Technological Institute at Chicago, United States of Americ
Typology: Lecture notes
1 / 56
This page cannot be seen from the preview
Don't miss anything!

















































TTIC 31020: Introduction to Machine Learning
Instructor: Greg Shakhnarovich
TTI–Chicago
November 12, 2010
Kernel density estimate:
−6^0 −5 −4 −3 −2 −1 0 1 2 3
Nearest neighbor
Kernel regression
Idea 2: bring back the parameters.
Fit a (simple) parametric model to the neighbors of x 0.
Idea 2: bring back the parameters.
Fit a (simple) parametric model to the neighbors of x 0.
Idea 2: bring back the parameters.
Fit a (simple) parametric model to the neighbors of x 0.
Implicit assumption: the target function is reasonably smooth.
from Atkeson et al.
Spring stiffness ⇔ K(x 0 , xi)
What kind of functions can we estimate with this model?
How similar is similar?
How similar is similar?
k nearest neighbors
How similar is similar?
k nearest neighbors
r-neighbors:
within radius r from x 0
(, r)-neighbors:
within radius (1 + )r
LSH [Indyk&Motwani]: very fast algorithm for finding a
(, r)-neighbor of x 0 :
dn^1 /(1+)
dn + n1+1/(1+)
LSH [Indyk&Motwani]: very fast algorithm for finding a
(, r)-neighbor of x 0 :
dn^1 /(1+)
dn + n1+1/(1+)
Practical meaning, with 10^6 examples ×10000 features, for
= 1:
LSH [Indyk&Motwani]: very fast algorithm for finding a
(, r)-neighbor of x 0 :
dn^1 /(1+)
dn + n1+1/(1+)
Practical meaning, with 10^6 examples ×10000 features, for
= 1:
Preprocessing: index the data by l hash tables
Hash functions are unlikely to separate close points
Preprocessing: index the data by l hash tables
Hash functions are unlikely to separate close points