

Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Problem set 3 for cs 4803a/8803a: pattern recognition course. It includes five problems related to estimating probabilities using the multinomial distribution, maximum likelihood estimation, bayesian classification, and independence of random variables. Problem 5 involves a fishing scenario where two fish with given weights are classified as bass, but the optimal classification is to be determined.
Typology: Assignments
1 / 2
This page cannot be seen from the preview
Don't miss anything!


Problem Set 3
Date: Feb 20, 2003 Due: Feb 28th, 2003 - which is Friday before Spring Break. But will be accepted until Monday 10th.
P (y 1 ,... , y 4 ; P 1 ,... , P 4 ) =
y 1 !y 2! · · · y 4!
P 1 y 1 P 2 y 2 · · · P 4 y 4.
This distribution is called the multinomial distribution. Suppose exactly yi occurrences of i are observed in the N rolls, where i = 1,... , 4. Find the maximum likelihood estimates { Pˆi} for the probabilities {Pi}, i = 1,... , 4 in terms of the yi’s.
(a) The weight of a fish in the pond can be considered a random variable. If I draw two fish, the weights correspond to two random variables. Are these two variables independent, given your limited knowledge of the pond? Motivate your answer mathematically. (Re- member: independence is not a property of the fish but of your knowledge about them.)
(b) There are four possible classifications of the two fish: c 2 = carp c 2 = bass c 1 = carp c 1 = bass where c 1 is the class of fish 1 (which weighs w 1 = 1. 5 pounds) and c 2 is the class of fish 2 (which weighs w 2 = 5 pounds). Make a table like this and fill in the probability of each possible classification given the weights you observed. What is the most probable classification of the fish? What is the least probable classification of the fish? (c) What does this result imply in general about performing pattern recognition when you are uncertain about the class distributions (e.g. because you estimated them from training data)?