




































































Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Tutorial on Probability and Estimation, Bernoulli Distribution, MAP Estimates, Beta Conjugate Prior, Multivariate Gaussian Distributions, Gaussians, Central Limit Theorem, Univariate Gaussian Distribution, Moments, Multivariate Gaussian, Matrix, Mean of the Gaussian, Covariance, Correlation Vs. Covariance, Covariance Matrix, Covariance of the Gaussian, Geometry, Density Contours, Linear Functions, Gaussian RV, Conditional Marginal, ML for Parameters, Covariance Matrices, Greg Shakhnarovich, Lect
Typology: Lecture notes
1 / 76
This page cannot be seen from the preview
Don't miss anything!





































































TTIC 31020: Introduction to Machine Learning
Instructor: Greg Shakhnarovich, Lecture by Dhruv Batra
TTI–Chicago
September 29, 2010
Last time: Bernoulli distribution, ML and MAP estimates, beta conjugate prior.
Today: uni- and multivariate Gaussian distributions
Things you should have seen before
Things you should have seen before
Things you should have seen before
This refresher WILL revise these topics.
Gaussian distributions are widely used in machine learning:
Gaussian distributions are widely used in machine learning:
X¯n = X 1 + X 2 + · · · + Xn
Gaussian distributions are widely used in machine learning:
X¯n = X 1 + X 2 + · · · + Xn √ n X¯n −→ Nd
x; μ, σ^2
Gaussian distributions are widely used in machine learning:
Gaussian distributions are widely used in machine learning:
Gaussian distributions are widely used in machine learning:
N (x; μ, σ^2 ) =
(2πσ^2 )^1 /^2
exp
2 σ^2 (x − μ)^2
mean μ determines location variance σ^2 ; standard deviation
σ^2 determines the spread around μ
N (x|μ, σ^2 )
x
2 σ
μ
Reminder: expectation of a RV x is E [x] ,
xp(x)dx, so
E [x] =
−∞
xN (x; μ, σ^2 )dx = μ
Reminder: expectation of a RV x is E [x] ,
xp(x)dx, so
E [x] =
−∞
xN (x; μ, σ^2 )dx = μ
Variance of x is var x , E
(x − E [x])^2
, and