




























































































Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
The key points are:Learning and Generalization, Curve-Fitting, Minimum Description Length Principle, Problem-Based Intuition, Axis-Parallel Rectangle, Goal of Learning, Number of Elements, Uncountably Infinite Number, Finite Parameterization
Typology: Slides
1 / 145
This page cannot be seen from the preview
Don't miss anything!





























































































a general rule (a classifier or a function) that wouldpredict the ‘target’ or ‘output’
a general rule (a classifier or a function) that wouldpredict the ‘target’ or ‘output’
-^ Any learning algorithm takes training data as the inputand outputs a specific classifier/function. •^ For this, it searches over some chosen family offunctions to find one that optimizes a chosen criterionfunction.
-^ Any learning algorithm takes training data as the inputand outputs a specific classifier/function. •^ For this, it searches over some chosen family offunctions to find one that optimizes a chosen criterionfunction.^ {(X, y)} →ii
Learning Algorithm (searching over
-^ As discussed in the previous lecture, a simpleexample problem is ‘curve-fitting’.
-^ As discussed in the previous lecture, a simpleexample problem is ‘curve-fitting’. •^ Given training data
-^ As discussed in the previous lecture, a simpleexample problem is ‘curve-fitting’. •^ Given training data
-^ As we saw, the ‘data error’ is not a good measure forrating possible
-^ There are many different ways of formalizing this.
-^ For example, a generic approach is what is called^ Minimum Description Length
principle.
-^ For example, a generic approach is what is called^ Minimum Description Length
principle.
-^ Suppose we want to send the data over acommunication channel. •^ we can send the
number of bits.
-^ For example, a generic approach is what is called^ Minimum Description Length
principle.
-^ Suppose we want to send the data over acommunication channel. •^ we can send the
number of bits. • Or we can send
-^ If the fit is good, the errors
small range and we may be able to send them usingsmaller number of bits compared sending
-^ However, we also need to send
-^ If the fit is good, the errors
small range and we may be able to send them usingsmaller number of bits compared sending
-^ However, we also need to send
-^ If^ f^ is very complex, then what we save in bits bysending errors instead of
by the bits needed to send description of