

Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
An assignment for dr. Eick's cosc 6342 machine learning course in spring 2009. Students are required to develop a binary classifier called mtg (mixture of two gaussians) that uses a parametric approach with a mixture of two bivariate gaussians to approximate the density of each class. The classifier will be evaluated and enhanced on an arsenic dataset and an artificial dataset using two-fold cross validation. Topics covered include the differences between reinforcement learning and supervised learning, deriving solutions for weights, and computing mahalanobis distances.
Typology: Assignments
1 / 3
This page cannot be seen from the preview
Don't miss anything!


Dr. Eick
Due: Tuesday February, 24, 11p (electronic Submission) or 5 days after the topic was discussed in the lecture, whichever comes later; problem 5 is due on March 10, 11p (submit report and software!). Remark: Problem 6 was updated on March 1, 2008.
Because the classifier operates in ^2 , G,C=(xG,C, yG,C) and G,C={11,G,C ,12,G,C , 22,G,C}), called MTG (“ The mixture density function for each class is defined as follows: C:= CG1,C + (1-C)G2,C C’:= C’G1,C’ + (1-C’)G2,C’ with C ,C’ being “mixture” parameters in [0,1]. Finally, the MTG classifier assigns a class as follows to an example y ^2 : IF *C (y)>C’(y) THEN C ELSE C’ with being a parameter (default = |C|/|C’|).C|C|/|C’|)./|C|/|C’|).C’|C|/|C’|).). In summary, a MTG classifiers has 23 parameters (4 Gaussians each of which is characterized by 5 parameters, C, C’ and ). The goal of the project to develop procedures and techniques to derive a MTG-classifier from a training dataset and to evaluate its performance using two-fold cross validation. For the purpose of the project each dataset D is subdivided (using class-stratified sampling) into 3 sets: D 1 , D 2 , and DP. : D 1 and D 2 serve as training and test sets for learning the classifier and for two-fold cross-validation; DP serves as a optional validation set in case that your methods require a validation set. Write and submit a report that describes your approach to derive a MTG-classifier for a dataset, reports the results of the experimental evaluation of your methods, and gives a brief history of the project. Moreover, be prepared to demo your system!