






Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Various feature selection techniques used in machine learning, including information gain, average relative entropy, markov blanket, and approximate markology blanket. Information gain measures the information provided by each term about the class, while average relative entropy determines the relevance of a set of features using the average relative entropy. Markov blanket identifies features that make a term conditionally independent of all other features, and approximate markov blanket uses correlation factors and average cross entropy to select the most relevant features. The document also covers measures of performance, such as the confusion matrix, precision, recall, and precision-recall curve.
Typology: Slides
1 / 10
This page cannot be seen from the preview
Don't miss anything!







l(C, Wj) between the class
C^ and the term
Wj
-^ For feature selection^ –^ compute the information gain for each unique term^ –^ remove terms whose information gain is less than some predefined threshold •^ Limitations^ –^ relevance assessment of each term is done separately^ –^ effect of term co-occurrences is not considered
K = ∑ ∑=^ = c^
j j j j^ j^
cPc PcP 1 PWG 10
),( log) )()( ,() (^ ω
ω ω^ ω
Wj
-^ If^ Wj^ is conditionally independent of all features in
V –
M - {Wj}, given
M^ ⊂^ V,^ Wj
∉M
-^ class^ C^ is conditionally independent of
Wj, given^ M
-^ Feature selection is performed by^ –^ removing features for which the Markov blanket isfound
Wj^ in^ G,
-^ compute the co-relation factor of
Wj^ with^ Wi
-^ obtain a set
M^ of^ k^ terms, that have highest co- relation with
Wj
-^ find the average cross entropy
-^ select the term for which the average relativeentropy is minimum • Repeat steps until a predefined number ofterms are eliminated from the set
G
-^ TN - irrelevant values not retrieved •^ TP - relevant values retrieved •^ FP - irrelevant values retrieved •^ FN - relevant values not retrieved PredictedActual CategoryCategory-^ •^ Total retrieved terms = TP + FP •^ Total relevant terms = TP + FN
-^ TN
FN +^ FP
TP
A = (TP+TN) / |D|
-^ classification error,
-^ For unbalanced domain^ –^ precision and recall characterize performance
TP = π^ FPTP^ +
TP = ρ FNTP^ +