


Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Problem set 2 for the cs 4803-b/8803-b: pattern recognition course. The problem set includes various tasks related to hypothesis testing, bayes risk, decision rules, and error rates in pattern recognition. Students are required to find probabilities, calculate bayes risks, and determine decision regions using given data and formulas.
Typology: Assignments
1 / 4
This page cannot be seen from the preview
Don't miss anything!



Problem Set 2 Date: Jan 30, 2001 Due: start of class Feb 13, 2001
WARNING: Do not leave this for the last night before the PS is due. It takes some work....
py|ω 1 (ˆy|ω 1 ) = k 1 exp(−yˆ^2 /10).
Given class ω 2 , the conditional density is
py|ω 2 (ˆy|ω 2 ) = k 2 exp(−(ˆy − 2)^2 /2).
(a) Find k 1 and k 2 , and plot the two densities on a single graph using Matlab. (b) Assume that the prior probabilities of the two classes are equal, and that the cost for choosing correctly is zero. If the costs for choosing incorrectly are C 12 = 1 and C 21 =
5, what is the expression for the Bayes risk? (c) Find the decision regions which minimize the Bayes risk, and indicate them on the plot you made in part (a). (d) For the decision regions you found in part (c), what is the numerical value of the Bayes risk? Hint: use Matlab’s erf function but be careful - the erf function is a bit weird. Check help.
(a) Hans uses the latest model of the device, which we can call feature x 1 , to do the classification. The probability distribution of x 1 for each class is: x 1 = 1 x 1 = 2 x 1 = 3 p(x 1 |ω 1 ) 0.80 0.055 0. p(x 1 |ω 2 ) 0.15 0.05 0. Hans wants to use the decision rule which minimizes his error rate. What is the rule, and what is its error rate?
(b) Frans uses an older model of the device, which we can call feature x 2 , to do the classification. The probability distribution of x 2 for each class is: x 2 = 1 x 2 = 2 x 2 = 3 p(x 2 |ω 1 ) 0.26 0.73 0. p(x 2 |ω 2 ) 0.026 0.803 0. Frans also wants minimize his error rate. What rule should he use, and what is its error rate? (c) What is Hans’s confidence in his classification (how certain is he that he made the right choice), as a function of his measurement x 1? What is Frans’s confidence in his classification, as a function of his measurement x 2? (d) What does this tell you about the relationship between a classifier’s error rate and our confidence in its classification? Does Hans or does Frans have a better classifier? Why?
Discuss your results. Do you find that the eigenvectors that capture most of the en- ergy in the data (also known as the Most Expressive Feature [MEF]) also are the Most Discriminating Feature? If so, why? If not, why not?
when there is a mistake on the board, and
py|ω 2 (ˆy|ω 2 ) =
{ 3 e−^3 y^ if y > 0; 0 otherwise
when there is no mistake. Wilbur makes his decision whether or not to speak up based on the murmuring level y. Let PD be the probability that Wilbur speaks up correctly, i.e., when there is a mistake on the board, and let PF be the probability that he speaks up when there is no mistake on the board. Design a decision rule so that PD is maximized subject to the constraint that PF ≤ 0 .05. What is the resulting value of PD?
However, one expert says that the mean weight of a bass is 2 pounds, while the other expert says the mean weight of a bass is 4 pounds. You trust both experts equally well, and you believe one of them is right. There is an optimal Bayesian decision rule based on this information alone. What is the rule?