



Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Material Type: Assignment; Professor: Roth; Class: Machine Learning; Subject: Computer Science; University: University of Illinois - Urbana-Champaign; Term: Fall 2007;
Typology: Assignments
1 / 5
This page cannot be seen from the preview
Don't miss anything!




CS446: Pattern Recognition and Machine Learning Fall 2008
Handed Out: September 11, 2007 Due: September 25, 2007
a. ¬A ∨ B ∧ C [3 points] b. (A ∧ ¬B) ∨ ¬(C ∧ D) [3 points] c. (A ∨ B) ⊕ C ∨ A ⊕ (¬B ∧ C)[4 points]
a. Let Dk be the class of k-disjunctions (disjunction on k of the n variables or their negation) over (x 1 , x 2 , · · · , xn). State the size of the smallest possible consistent decision tree for Dk in terms of n and k. Describe the shape of the resulting tree. [3 points] b. Let Ck be the class of k-conjunctions (conjunction on k of the n variables or their negation) over (x 1 , x 2 , · · · , xn). State the size of the smallest possible consistent decision tree for Ck in terms of n and k. Describe the shape of the resulting tree. [3 points] c. Let Pk be the class of k-parity functions (parity function on k of the n variables) over (x 1 , x 2 , · · · , xn). The (even) parity function evaluates to 1 if there are an even number of 1’s in the feature and evaluates to 0 if there are an odd number of 1’s in the feature vector. State the size of the smallest possible consistent decision tree for Pk in terms of n and k. [3 points] d. What do these results imply about the application of decision tree learning for learning functions in Dk, Ck, and Pk? [6 points]
Your program should perform the items listed below. Please note that your actual implementation of the decision tree algorithm should be independent from the feature extraction mechanism, as we may use it in other assignments. In particular, we may be requiring you to reuse this generic decision tree code for rules extraction and boosting later this semester.
This says that:
Before you run your program on the data, you may wish to test it on a small set of examples for which you can construct the tree yourself (e.g., the data from Mitchell, exercise 3.2) for debugging purposes. You may also consider testing out your decision tree on the original Badges Game data, which is found in the handouts section of the website.
Once your program can perform the operations detailed above, evaluate its performance using 5-fold cross validation, as described below.
A ≡ (splittingHeuristic, depth, f eatureSet)
Remember, this is the minimum. Feel free to experiment with more tree depths and feature sets. You don’t need to try every combination of splitting criteria, depth and feature set, but you need to have the 4 algorithms mentioned above.
mkdir jdoe-hw mv *.java *.tree README jdoe-hw gtar zcvf jdoe-hw2.tar.gz jdoe-hw
Submit this file on via email to the TA.