













Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Material Type: Assignment; Class: ST: Prog Analy &Mechanization; Subject: Computer Science; University: University of New Mexico; Term: Unknown 1989;
Typology: Assignments
1 / 21
This page cannot be seen from the preview
Don't miss anything!














Various homework bugs:
Due: Oct 12 (Tues) not 9 (Sat)
Problem 3 should read:
(duh)
(some) info on naive Bayes in Sec. 4.3 of text
d
i=
i
d
i=
− (x i −μ i,c ) 2 2 σ i,c
Joint probabilities
Given d different random vars,
The “joint” probability of them taking on the simultaneous values
given by
Or, for shorthand,
Closely related to the “joint PDF”
Independence:
Two random variables are statistically independent iff:
Or, equivalently (usually for discrete RVs):
For multivariate RVs:
d
i=
i
Given training data, [ X , Y ], w/ discrete labels Y
Break data out into sets , etc.
Want to come up with models, ,
Suppose the individual f() s are Gaussian, need the params μ and σ
How do you get the params?
Now, what if the f()s are something really funky you’ve never seen before in your life, with parameters
Principle of maximum likelihood:
Pick the parameters that make the data as probable (or, in general “likely”) as possible
Regard the probability function as a func of two variables: data and parameters:
Function L is the “likelihood function”
Want to pick the that maximizes L
0 2 4 6 8 10 0
x f(x) L(X,!) regarded as a fn of X
0 2 4 6 8 10 0
L(X,!) regarded as a fn of! ! L( ! )
In supervised learning, we usually assume that data points are sampled independently and from the same distribution
IID assumption: data are independent and identically distributed
In supervised learning, we usually assume that data points are sampled independently and from the same distribution
IID assumption: data are independent and identically distributed
joint PDF can be written as product of individual (marginal) PDFs:
N
i=
Find the maximum likelihood estimator of μ for the univariate Gaussian:
Find the maximum likelihood estimator of β for the degenerate gamma distribution:
Hint: consider the log of the likelihood fns in both cases
− (x−μ) 2 2 σ
3
2
− x β
complete training data
Conditional probabilities
Suppose you have a joint PDF, f ( H , W )
Now you get to see one of the values, e.g., H=“ 183cm ”
What’s your probability estimate of A , given this new knowledge?
Conditional probabilities
Suppose you have a joint PDF, f ( H , W )
Now you get to see one of the values, e.g., H=“ 183cm ”
What’s your probability estimate of A , given this new knowledge?
w