



















































Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
An overview of statistical learning and modeling, focusing on supervised learning techniques within the realm of artificial intelligence. It covers linear models for classification, including parameter optimization via maximum likelihood and least squares. The document also delves into fisher's linear discriminant and the perceptron algorithm. A significant portion is dedicated to boosting, particularly adaboost, explaining its concept, ensemble examples, and its position among the top 10 algorithms in data mining. Useful for understanding the theoretical underpinnings and practical applications of these machine learning methods. It also touches on probabilistic generative and discriminative models, logistic regression, and their applications in classification problems. Structured as a lecture or study material, providing a comprehensive overview of the covered topics.
Typology: Exams
1 / 59
This page cannot be seen from the preview
Don't miss anything!




















































Outlines
Linear model for classification
Ada Boosting
Learning the parameters of Linear Discriminant Functions
Linear Basis Function Models
N ×M design matrix
Moore-Penrose pseudo-inverse Φ† = (Φ𝑇Φ)− 1 Φ𝑇
- : - group together: -
Maximum likelihood estimation method (MLE) The likelihood function indicates how likely the observed sample is as a function of possible parameter values. Therefore, maximizing the likelihood function determines the parameters that are most likely to produce the observed data. From a statistical point of view, MLE is usually recommended for large samples because it is versatile, applicable to most models and different types of data, and produces the most precise estimates.
Least squares estimation method (LSE) Least squares estimates are calculated by fitting a regression line to the points from a data set that has the minimal sum of the deviations squared (least square error). In reliability analysis, the line and the data are plotted on a probability plot.
In a linear model, if the errors belong to a normal distribution the least squaresestimators are also the maximum likelihood estimators.
Fisher‟s linear discriminant
Generalized Rayleigh quotient
Between-class covariance matrix Within-class covariance matrix
Fisher‟s linear discriminant
𝑆−^1 𝑆 𝒘^ =^ λ𝐰 𝑤 𝐵
Fisher‟s discriminant for multiple classes
covariance matrices defined in the original x-space
Fisher‟s discriminant for multiple classes
covariance matrices defined in the projected y-space
The perceptron algorithm
The perceptron algorithm