Pattern Recognition and Machine Learning - Assignment 2 | CS 446 | Assignments Computer Science

CS446: Pattern Recognition and Machine Learning Fall 2008

Problem Set 2

Handed Out: September 11, 2007 Due: September 25, 2007

• Feel free to talk to your classmates about the homework. I am more concerned that you learn how to solve the problem than that you demonstrate that you solved it entirely on your own. You should, however, write down your solution yourself. Please try to keep the solution brief and clear.

• Please, no handwritten solutions. Be sure your name appears at the top of each page.

• Please present your algorithms in both pseudocode and English. That is, give a precise formulation of your algorithm as pseudocode and also explain in one or two concise paragraphs what your algorithm does. Be aware that pseudocode is much simpler and more abstract than real code. Take a look at the textbook pseudocode (e.g. Table 2.5 on page 33) to get an idea of the appropriate level of abstraction.

• The homework is due at 4:00 pm on the due date. Email your write-up and your code to the TA. Please do NOT hand in a hard copy of your write-up. Please put “<userid>CS446 hw2 submission” as the subject line of the email when you submit your homework to [email protected].

1. [Representing Boolean Functions - 10 points] (Based on Mitchell, exercise 3.1)

Give decision trees to represent the following Boolean functions:

a. ¬A ∨ B ∧ C [3 points]

b. (A ∧ ¬B) ∨ ¬(C ∧ D) [3 points]

c. (A ∨ B) ⊕ C ∨ A ⊕ (¬B ∧ C) [4 points]
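
One way to sanity-check a hand-drawn tree is to compare it against the formula on every truth assignment. The sketch below is only an illustration: the tuple encoding of a tree, the helper names `evaluate_tree` and `tree_matches`, and the example function A ∧ ¬B (which is not one of the assigned functions) are all assumptions made for this example.

```python
from itertools import product

def evaluate_tree(tree, assignment):
    """Walk a tree encoded as a bool leaf or a (variable, if_false, if_true) tuple."""
    while not isinstance(tree, bool):
        variable, if_false, if_true = tree
        tree = if_true if assignment[variable] else if_false
    return tree

def tree_matches(tree, formula, variables):
    """Return True if the tree agrees with the formula on every truth assignment."""
    for values in product([False, True], repeat=len(variables)):
        assignment = dict(zip(variables, values))
        if evaluate_tree(tree, assignment) != formula(assignment):
            return False
    return True

# Hypothetical example (not an assigned function): A AND (NOT B).
def example_formula(a):
    return a["A"] and not a["B"]

# Root tests A; if A is false the answer is False, otherwise test B.
example_tree = ("A", False, ("B", True, False))

print(tree_matches(example_tree, example_formula, ["A", "B"]))
```

When encoding the assigned formulas this way, the operator precedence you assume should be made explicit with parentheses in the Python expression.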

2. [Space Complexity of Decision Trees - 15 points]

Let x be a vector of n Boolean variables and let k represent the number of relevant variables in the target function (k ≤ n).

a. Let D_k be the class of k-disjunctions (a disjunction of k of the n variables or their negations) over (x_1, x_2, ···, x_n). State the size of the smallest possible consistent decision tree for D_k in terms of n and k. Describe the shape of the resulting tree. [3 points]

b. Let C_k be the class of k-conjunctions (a conjunction of k of the n variables or their negations) over (x_1, x_2, ···, x_n). State the size of the smallest possible consistent decision tree for C_k in terms of n and k. Describe the shape of the resulting tree. [3 points]

c. Let P_k be the class of k-parity functions (a parity function on k of the n variables) over (x_1, x_2, ···, x_n). The (even) parity function evaluates to 1 if there is an even number of 1's in the feature vector and evaluates to 0 if there is an odd number of 1's in the feature vector. State the size of the smallest possible consistent decision tree for P_k in terms of n and k. [3 points]

d. What do these results imply about the application of decision tree learning for learning functions in D_k, C_k, and P_k? [6 points]
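
To make the three function classes concrete, here is a minimal Python sketch of one member of each. The function names, the `(index, negated)` encoding of literals, and the 0-based indexing (so `x[0]` plays the role of x_1) are assumptions made for illustration, and the parity function is taken to count 1's over the k relevant variables only.

```python
def literal(x, index, negated):
    """Value of x[index], flipped if the literal is negated."""
    return 1 - x[index] if negated else x[index]

def k_disjunction(x, literals):
    """1 if at least one of the k literals is satisfied, else 0."""
    return int(any(literal(x, i, neg) for i, neg in literals))

def k_conjunction(x, literals):
    """1 if all k literals are satisfied, else 0."""
    return int(all(literal(x, i, neg) for i, neg in literals))

def even_parity(x, relevant):
    """1 if the relevant coordinates contain an even number of 1's, else 0."""
    return int(sum(x[i] for i in relevant) % 2 == 0)

x = (1, 0, 1, 1)  # a feature vector with n = 4

print(k_disjunction(x, [(0, False), (1, True)]))   # x_1 OR (NOT x_2), k = 2
print(k_conjunction(x, [(0, False), (1, False)]))  # x_1 AND x_2, k = 2
print(even_parity(x, [0, 2]))                      # parity over x_1 and x_3, k = 2
```

Evaluating such functions on all 2^n inputs gives a labeled data set against which a candidate tree can be checked for consistency.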
