Prepare for your exams
Get points
Guidelines and tips
Sell on Docsity
Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Earn points to download

Earn points by helping other students or get them with a premium plan

Guidelines and tips

Sell on Docsity

Docsity AI

Log in Sign up

Prepare for your exams

Study with the several resources on Docsity

Find documents

Prepare for your exams with the study notes shared by other students like you on Docsity

Search for your university

Find the specific documents for your university's exams

Docsity AINEW

Summarize your documents, ask them questions, convert them into quizzes and concept maps

Explore questions

Clear up your doubts by reading the answers to questions asked by your fellow students

Earn points to download

Earn points by helping other students or get them with a premium plan

Share documents

20 Points

For each uploaded document

Answer questions

5 Points

For each given answer (max 1 per day)

All the ways to get free points

Get points immediately

Choose a premium plan with all the points you need

Study Opportunities

Choose your next study program

Get in touch with the best universities in the world. Search through thousands of universities and official partners

Community

Ask the community

Ask the community for help and clear up your study doubts

Free resources

Our save-the-student-ebooks!

Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors

CS 4803-B/8801-B: Pattern Recognition - Problem Set 2 - Prof. Michael Best, Assignments of Computer Science

Georgia Institute of Technology - Main Campus Computer Science

Prof. Michael Best

Problem set 2 for the cs 4803-b/8803-b: pattern recognition course. The problem set includes various tasks related to hypothesis testing, bayes risk, decision rules, and error rates in pattern recognition. Students are required to find probabilities, calculate bayes risks, and determine decision regions using given data and formulas.

Typology: Assignments

Pre 2010

Uploaded on 08/05/2009

koofers-user-n3f 🇺🇸

10 documents

1 / 4

This page cannot be seen from the preview

Don't miss anything!

CS 4803-B/8803-B: Pattern Recognition

Problem Set 2

Date: Jan 30, 2001 Due: start of class Feb 13, 2001

WARNING: Do not leave this for the last night before the PS is due. It takes

some work....

1. In a particular binary hypothesis testing application, the conditional density for a scalar

feature ygiven class ω1is

py|ω1(ˆy|ω1)=k

1exp(−ˆy2/10).

Given class ω2, the conditional density is

py|ω2(ˆy|ω2)=k

2exp(−(ˆy−2)2/2).

(a) Find k1and k2, and plot the two densities on a single graph using Matlab.

(b) Assume that the prior probabilities of the two classes are equal, and that the cost

for choosing correctly is zero. If the costs for choosing incorrectly are C12 = 1 and

C21 =√5, what is the expression for the Bayes risk?

(c) Find the decision regions which minimize the Bayes risk, and indicate them on the

plot you made in part (a).

(d) For the decision regions you found in part (c), what is the numerical value of the

Bayes risk? Hint: use Matlab’s erf function but be careful - the erf function is a

bit weird. Check help.

2. Hans and Frans each want to classify patterns into two classes ω1and ω2, whose prior

probabilities are equal. They both measure quantity x, but with different measuring

devices.

(a) Hans uses the latest model of the device, which we can call feature x1,todothe

classification. The probability distribution of x1for each class is:

x1=1 x

1=2 x

1=3

p(x

1

|ω

1

)0.80 0.055 0.145

p(x1|ω2)0.15 0.05 0.80

Hans wants to use the decision rule which minimizes his error rate. What is the rule,

and what is its error rate?

1

Discover Assignments of Computer Science Georgia Institute of Technology - Main Campus

Partial preview of the text

Download CS 4803-B/8801-B: Pattern Recognition - Problem Set 2 - Prof. Michael Best and more Assignments Computer Science in PDF only on Docsity!

CS 4803-B/8803-B: Pattern Recognition

Problem Set 2 Date: Jan 30, 2001 Due: start of class Feb 13, 2001

WARNING: Do not leave this for the last night before the PS is due. It takes some work....

In a particular binary hypothesis testing application, the conditional density for a scalar feature y given class ω 1 is

py|ω 1 (ˆy|ω 1 ) = k 1 exp(−yˆ^2 /10).

Given class ω 2 , the conditional density is

py|ω 2 (ˆy|ω 2 ) = k 2 exp(−(ˆy − 2)^2 /2).

(a) Find k 1 and k 2 , and plot the two densities on a single graph using Matlab. (b) Assume that the prior probabilities of the two classes are equal, and that the cost for choosing correctly is zero. If the costs for choosing incorrectly are C 12 = 1 and C 21 =

5, what is the expression for the Bayes risk? (c) Find the decision regions which minimize the Bayes risk, and indicate them on the plot you made in part (a). (d) For the decision regions you found in part (c), what is the numerical value of the Bayes risk? Hint: use Matlab’s erf function but be careful - the erf function is a bit weird. Check help.

Hans and Frans each want to classify patterns into two classes ω 1 and ω 2 , whose prior probabilities are equal. They both measure quantity x, but with different measuring devices.

(a) Hans uses the latest model of the device, which we can call feature x 1 , to do the classification. The probability distribution of x 1 for each class is: x 1 = 1 x 1 = 2 x 1 = 3 p(x 1 |ω 1 ) 0.80 0.055 0. p(x 1 |ω 2 ) 0.15 0.05 0. Hans wants to use the decision rule which minimizes his error rate. What is the rule, and what is its error rate?

(b) Frans uses an older model of the device, which we can call feature x 2 , to do the classification. The probability distribution of x 2 for each class is: x 2 = 1 x 2 = 2 x 2 = 3 p(x 2 |ω 1 ) 0.26 0.73 0. p(x 2 |ω 2 ) 0.026 0.803 0. Frans also wants minimize his error rate. What rule should he use, and what is its error rate? (c) What is Hans’s confidence in his classification (how certain is he that he made the right choice), as a function of his measurement x 1? What is Frans’s confidence in his classification, as a function of his measurement x 2? (d) What does this tell you about the relationship between a classifier’s error rate and our confidence in its classification? Does Hans or does Frans have a better classifier? Why?

You are designing software for an airport radar system. The airport runway constantly emits a radar signal which is reflected when an aircraft is landing. The radar return is a time series of discrete measurements x[t]. If there is no aircraft, the return is zero. If there is an aircraft, the return is a deterministic signal s[t] of length n. Furthermore, the return always has noise added to it which we can model as zero-mean Gaussian with variance σ^2. You can assume that the noise at different times is independent. To detect aircraft, you will inspect every window of length n to see if it contains the characteristic signal s[t]. Each window can be considered a fixed-length vector x. Show that an optimal decision rule, regardless of the priors and costs, has the form xTy < t for some vector y and threshold t. Give one possible y.
You are given two datasets (find them by clicking on “datasets” on the course home page). Each dataset contains 100 examples of a 3D random vector. Your task is to discriminate between these two datasets. To do that, you will do the following in Matlab: - Calculate the sample covariance matrices of the datasets. If we assume that the classes have the same covariance, then the best estimate of the common covariance is the average of the two sample covariance matrices. - Compute the eigenvectors of your estimate of the common covariance matrix. - And then answer the following questions: - Which eigenvectors capture most of the energy in the data? - Which eigenvectors permit you to discriminate most easily?

Discuss your results. Do you find that the eigenvectors that capture most of the en- ergy in the data (also known as the Most Expressive Feature [MEF]) also are the Most Discriminating Feature? If so, why? If not, why not?

Billy’s Beetle Repainting of LA is a shop specializing in repainting VW Beetle automo- biles. When the car is white (about 30% of the cars), they must do a quick measurement

when there is a mistake on the board, and

py|ω 2 (ˆy|ω 2 ) =

{ 3 e−^3 y^ if y > 0; 0 otherwise

when there is no mistake. Wilbur makes his decision whether or not to speak up based on the murmuring level y. Let PD be the probability that Wilbur speaks up correctly, i.e., when there is a mistake on the board, and let PF be the probability that he speaks up when there is no mistake on the board. Design a decision rule so that PD is maximized subject to the constraint that PF ≤ 0 .05. What is the resulting value of PD?

This problem addresses the issue of making decisions when the class distributions are themselves uncertain. Perhaps surprisingly, this situation can be handled without any extra machinery beyond what you have been given so far. The problem: You want to determine if a fish taken from a given pond is a carp or bass, based on its weight in pounds. You consult two experts on fish who agree on some things and disagree on others. They both agree that: - Carp and bass are equally likely to be drawn from the pond. - The weight of either type of fish is Gaussian-distributed with a standard deviation of one pound. - The mean weight of a carp is 3 pounds.

However, one expert says that the mean weight of a bass is 2 pounds, while the other expert says the mean weight of a bass is 4 pounds. You trust both experts equally well, and you believe one of them is right. There is an optimal Bayesian decision rule based on this information alone. What is the rule?

CS 4803-B/8801-B: Pattern Recognition - Problem Set 2 - Prof. Michael Best, Assignments of Computer Science

Related documents

Partial preview of the text

Download CS 4803-B/8801-B: Pattern Recognition - Problem Set 2 - Prof. Michael Best and more Assignments Computer Science in PDF only on Docsity!

CS 4803-B/8803-B: Pattern Recognition