


















Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Instance-based methods, specifically histograms, in machine learning. The concept of histograms in two-dimensional boxes, dividing boxes into bins, counting proportions of training instances, and estimating probabilities. The document also explores the use of histograms in classification, k-nearest neighbors, and kernel functions such as hypercube and gaussian. The curse of dimensionality is addressed as a caveat.
Typology: Study notes
1 / 26
This page cannot be seen from the preview
Don't miss anything!



















k where Nx = number of instances in k and N = total number of instances and point x in bin k and V k is volume bin N k
x V k
P ( x ) = 2
x
k - Nearest Neighbors “Density Estimator”
30 points
B
h = 1/ V h
P ( x ) = N B
h
B
h = 1/ V h
P ( x ) = N B
h
B
h = 1/ V h
P ( x ) = N B
h
x
1
2 Estimate P ( x ) = (1/2) * ( K 1
2
x Estimate P ( x ) = (1/2) * ( K 1
2
2
Don’t get confused – a Gaussian kernel density estimator does not assume the data is Gaussian! x