

Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Problem set 3 for cs 4803b/8803b: pattern recognition course. The set includes four problems related to estimating probabilities using the multinomial distribution and maximum likelihood estimation, and decision making based on normally distributed figure of merit from two different tests. Zaphod beeblebrox, the owner of an ion drive delta boat manufacturing business, is seeking help to minimize mistakes and operating costs using these tests.
Typology: Assignments
1 / 3
This page cannot be seen from the preview
Don't miss anything!


Problem Set 3 Date: Feb 22, 2001 Due: March 1, 2001
P (y 1 ,... , y 4 ; P 1 ,... , P 4 ) =
y 1 !y 2! · · · y 4!
P 1 y 1 P 2 y 2 · · · P 4 y 4.
This distribution is called the multinomial distribution. Suppose exactly yi occurrences of i are observed in the N rolls, where i = 1,... , 4. Find the maximum likelihood estimates { Pˆi} for the probabilities {Pi}, i = 1,... , 4 in terms of the yi’s.
(a) Zaphod is confused because these tests do not seem to give a steady answer and he is afraid that he will make mistakes. So, he calls you to help him make the lowest number of mistakes. You look at it and tell him that you’ll design a simple software program that looks at these numbers and tells whether the iddb should be classified as good or bad. Explain to Zaphod and make neat plots so that he can understand how the decision is being made. (b) Zaphod finds out that it costs him 25000DUs in liability suits for selling bad iddbs and it costs him 5000DUs to do corrective work on the iddb that was classified by the decision making software as bad. He wants to minimize the operating cost. So, he calls you once again. Explain your new decision making process to him. Once again use plots and graphs to clearly indicate the decision boundaries to Zaphod. (c) The government of Damogran (which sends the coast guard to rescue the people from damaged iddbs) wants to reduce the cost of its rescue operations and hence is pondering on new legislation to make the manufacturers more accountable. Zaphod finds that the new pending legislation in the government would make it necessary for him to sell no more than 0.01% bad iddbs. Make the necessary changes in your decision making software to handle the requirements of this new legislation. For this problem you may want to assume that Zaphod will do either the NDCR or the NDIR test but not both.
For the following part, either both the tests (NDCR and NDIR) are to be performed in parallel or only one of them is performed. Do not perform sequential tests.
(d) Zaphod discovers that it costs him 1000DUs to run the NDIR test per iddb and 500DUs to run the NDCR per iddb. Assume the costs of repair and field failure from part (b). If he wants to minimize his entire cost of operation, what should he do:
We return to the fishing scenario in Problem Set 2, Problem 7. By now you’ve constructed the optimal Bayesian classifier to decide if a fish is carp or bass, based on the uncertain class information you received from the two experts. You draw two fish from the pond which weigh 1.5 pounds and 5 pounds, respectively. If you applied your classifier inde- pendently to each fish, they would both be labeled bass. However, this may not seem right, and indeed it isn’t (remember: only one expert is right about the pond). In this problem we will figure out what the optimal classification of the two fish should be.
(a) The weight of a fish in the pond can be considered a random variable. If I draw two fish, the weights correspond to two random variables. Are these two variables