Prepare for your exams
Get points
Guidelines and tips
Sell on Docsity
Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Earn points to download

Earn points by helping other students or get them with a premium plan

Guidelines and tips

Sell on Docsity

Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Find documents

Prepare for your exams with the study notes shared by other students like you on Docsity

Search for your university

Find the specific documents for your university's exams

Docsity AINEW

Summarize your documents, ask them questions, convert them into quizzes and concept maps

Explore questions

Clear up your doubts by reading the answers to questions asked by your fellow students

Earn points to download

Earn points by helping other students or get them with a premium plan

Share documents

20 Points

For each uploaded document

Answer questions

5 Points

For each given answer (max 1 per day)

All the ways to get free points

Get points immediately

Choose a premium plan with all the points you need

Study Opportunities

Choose your next study program

Get in touch with the best universities in the world. Search through thousands of universities and official partners

Community

Ask the community

Ask the community for help and clear up your study doubts

Free resources

Our save-the-student-ebooks!

Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors

CS181 Lecture 21: Instance-Based Methods and Histograms in Machine Learning, Study notes of Artificial Intelligence

Harvard University Artificial Intelligence

Instance-based methods, specifically histograms, in machine learning. The concept of histograms in two-dimensional boxes, dividing boxes into bins, counting proportions of training instances, and estimating probabilities. The document also explores the use of histograms in classification, k-nearest neighbors, and kernel functions such as hypercube and gaussian. The curse of dimensionality is addressed as a caveat.

Typology: Study notes

2010/2011

Uploaded on 10/25/2011

thecoral 🇺🇸

4.5

(30)

395 documents

1 / 26

This page cannot be seen from the preview

Don't miss anything!

CS181 Lecture 21:

Instance-Based Methods

David C. Parkes

Discover Study notes of Artificial Intelligence Harvard University

Partial preview of the text

Download CS181 Lecture 21: Instance-Based Methods and Histograms in Machine Learning and more Study notes Artificial Intelligence in PDF only on Docsity!

CS181 Lecture 21:

Instance-Based Methods

David C. Parkes

Histograms

Consider M=2 dimensional box

Histograms

Count proportion of training instances in each bin
Estimate P ( x ) = N k

/ N £ V

k where Nx = number of instances in k and N = total number of instances and point x in bin k and V k is volume bin N k

/ N = 3/

x V k

P ( x ) = 2

Histograms: Classification

• Choose most common class in bin

k - Nearest Neighbors “Density Estimator”

Given an instance x
Find k points with smallest L 1 distance (max distance in any one direction).
Set cube side length h = 2d max (d max is furthest distance of k points) x k = 3 h = 2/ N = 6 V = 4/ P ( x ) = k /( NV h

Effect of k

30 points

Alternative View

Consider a hypercube with length h=1/ x

N

h = 1/ V h

N = 6

P ( x ) = N B

/ (N V

Alternative View

Consider a hypercube with length h=1/ x

N

h = 1/ V h

N = 6

P ( x ) = N B

/ (N V

Alternative View

Consider a hypercube with length h=1/ x

N

h = 1/ V h

N = 6

P ( x ) = N B

/ (N V

Gaussian Kernel Example

K

2 Estimate P ( x ) = (1/2) * ( K 1

+ K

Alternate View

x Estimate P ( x ) = (1/2) * ( K 1

+ K

K

Gaussian Kernel:

Bimodal Density Function

Don’t get confused – a Gaussian kernel density estimator does not assume the data is Gaussian! x

CS181 Lecture 21: Instance-Based Methods and Histograms in Machine Learning, Study notes of Artificial Intelligence

Related documents

Partial preview of the text

Download CS181 Lecture 21: Instance-Based Methods and Histograms in Machine Learning and more Study notes Artificial Intelligence in PDF only on Docsity!

CS181 Lecture 21:

Instance-Based Methods

David C. Parkes

Histograms

Histograms

/ N £ V

/ N = 3/

Histograms: Classification

• Choose most common class in bin

Effect of k

Alternative View

N

N = 6

/ (N V

Alternative View

N

N = 6

/ (N V

Alternative View

N

N = 6

/ (N V

Gaussian Kernel Example

K

K

+ K

Alternate View

+ K

K

Gaussian Kernel:

Bimodal Density Function