Information Gain, Schemes and Mind Maps of Aerodynamics

University of California - Los Angeles (UCLA), Aerodynamics

Typology: Schemes and Mind Maps

2022/2023

Uploaded on 02/28/2023

charlene 🇺🇸


Information Gain

Which test is more informative?

Split over whether Balance exceeds 50K: branches "Less or equal 50K" and "Over 50K"

Split over whether applicant is employed: branches "Unemployed" and "Employed"

Information Gain

Impurity/Entropy (informal)

Measures the level of impurity in a group of examples

Entropy: a common way to measure impurity

Entropy $= -\sum_i p_i \log_2 p_i$

$p_i$ is the probability of class $i$. Compute it as the proportion of class $i$ in the set.

Entropy comes from information theory. The higher the entropy, the more the information content.

What does that mean for learning from examples?

16/30 are green circles; 14/30 are pink crosses.

$\log_2(16/30) \approx -0.9$; $\log_2(14/30) \approx -1.1$

Entropy $= -(16/30)(-0.9) - (14/30)(-1.1) \approx 0.99$
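The worked example above can be checked numerically. The following is a minimal Python sketch, assuming a group is described by its per-class counts; the function name `entropy` is illustrative and does not come from the slides.

```python
import math

def entropy(counts):
    """Base-2 entropy of a group, given the number of examples in each class."""
    total = sum(counts)
    result = 0.0
    for count in counts:
        if count == 0:
            continue  # convention: an empty class contributes 0 (0 * log2(0) := 0)
        p = count / total  # proportion of this class in the group
        result -= p * math.log2(p)
    return result

# 16 green circles and 14 pink crosses, as in the example above
print(round(entropy([16, 14]), 3))
```

With unrounded logarithms the result is slightly higher than the hand calculation, which rounds each logarithm to one decimal place.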

2-Class Cases:

What is the entropy of a group in which all examples belong to the same class?

entropy $= -1 \log_2 1 = 0$

Minimum impurity: not a good training set for learning

What is the entropy of a group with 50% in either class?

entropy $= -0.5 \log_2 0.5 - 0.5 \log_2 0.5 = 1$

Maximum impurity: good training set for learning
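For two classes, these two cases are the endpoints and the peak of a single curve. As a sketch, write $p$ for the proportion of one class (a symbol introduced here for illustration, not used in the slides) and adopt the usual convention $0 \log_2 0 = 0$:

```latex
\[
H(p) = -p \log_2 p - (1 - p) \log_2 (1 - p), \qquad
H(0) = H(1) = 0, \qquad
H(0.5) = 1 .
\]
```

$H(p)$ rises from 0 at $p = 0$ to its maximum of 1 at $p = 0.5$ and falls back to 0 at $p = 1$.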

Calculating Information Gain

Entire population (30 instances), parent entropy:

$-\left(\frac{14}{30} \cdot \log_2 \frac{14}{30}\right) - \left(\frac{16}{30} \cdot \log_2 \frac{16}{30}\right) = 0.996$

Child with 17 instances, child entropy:

$-\left(\frac{13}{17} \cdot \log_2 \frac{13}{17}\right) - \left(\frac{4}{17} \cdot \log_2 \frac{4}{17}\right) = 0.787$

Child with 13 instances, child entropy:

$-\left(\frac{1}{13} \cdot \log_2 \frac{1}{13}\right) - \left(\frac{12}{13} \cdot \log_2 \frac{12}{13}\right) = 0.391$

(Weighted) Average Entropy of Children $= \left(\frac{17}{30} \cdot 0.787\right) + \left(\frac{13}{30} \cdot 0.391\right) = 0.615$

Information Gain = entropy(parent) – [average entropy(children)]

Information Gain = 0.996 - 0.615 = 0.38 for this split
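The calculation for this split can be reproduced in a few lines. This is a minimal sketch, assuming each node is given as a list of per-class counts; `entropy` and `information_gain` are illustrative names, not part of the slides.

```python
import math

def entropy(counts):
    """Base-2 entropy of a node, given its per-class counts."""
    total = sum(counts)
    return -sum((c / total) * math.log2(c / total) for c in counts if c > 0)

def information_gain(parent_counts, children_counts):
    """entropy(parent) minus the size-weighted average entropy of the children."""
    n = sum(parent_counts)
    weighted = sum(sum(child) / n * entropy(child) for child in children_counts)
    return entropy(parent_counts) - weighted

# Parent: 14 vs 16 instances.
# Children: 13 vs 4 (17 instances) and 1 vs 12 (13 instances).
gain = information_gain([14, 16], [[13, 4], [1, 12]])
print(round(gain, 2))  # 0.38
```

Weighting each child by its share of the parent's instances keeps a small, pure child from dominating the average.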

Entropy-Based Automatic Decision Tree Construction

Node 1: What feature should be used? What values?

Training Set S:

$x_1 = (f_{11}, f_{12}, \ldots, f_{1m})$

$x_2 = (f_{21}, f_{22}, \ldots, f_{2m})$

...

$x_n = (f_{n1}, f_{n2}, \ldots, f_{nm})$

Quinlan suggested information gain in his ID3 system and later the gain ratio, both based on entropy.
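The slides name the gain ratio without defining it. As a sketch of the standard definition, with notation introduced here for illustration ($S$ the set of examples at a node, $A$ a candidate feature, $S_v$ the subset of $S$ where $A$ takes value $v$, and $H$ the entropy defined earlier):

```latex
\[
\mathrm{Gain}(S, A) = H(S) - \sum_{v} \frac{|S_v|}{|S|} \, H(S_v)
\]
\[
\mathrm{SplitInfo}(S, A) = -\sum_{v} \frac{|S_v|}{|S|} \log_2 \frac{|S_v|}{|S|},
\qquad
\mathrm{GainRatio}(S, A) = \frac{\mathrm{Gain}(S, A)}{\mathrm{SplitInfo}(S, A)}
\]
```

Dividing by the split information penalizes features that break the set into many small subsets, which plain information gain tends to favor.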


Simple Example

Training set:

X  Y  Z  C
1  1  1  I
1  1  0  I
0  0  1  II
1  0  0  II

[Figure: split on X. Parent node: I, I, II, II. Child X=1: I, I, II. Child X=0: II.]

[Figure: split on Z. Parent node: I, I, II, II. Child Z=1: I, II. Child Z=0: I, II.]
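To compare the candidate splits in this example, the information gain of each feature can be computed from the training table. The following is a minimal sketch, assuming the four rows shown above; the helper names and the tuple encoding of rows are illustrative choices, not part of the slides.

```python
import math
from collections import Counter

def entropy(labels):
    """Base-2 entropy of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())

def information_gain(rows, labels, feature_index):
    """Gain from splitting the rows on the feature at feature_index."""
    groups = {}
    for row, label in zip(rows, labels):
        groups.setdefault(row[feature_index], []).append(label)
    weighted = sum(len(g) / len(labels) * entropy(g) for g in groups.values())
    return entropy(labels) - weighted

# Training table from the example: columns X, Y, Z and class C
rows = [(1, 1, 1), (1, 1, 0), (0, 0, 1), (1, 0, 0)]
labels = ["I", "I", "II", "II"]
names = ["X", "Y", "Z"]

gains = {name: information_gain(rows, labels, i) for i, name in enumerate(names)}
best = max(gains, key=gains.get)
for name in names:
    print(name, round(gains[name], 3))
print("best split:", best)
```

On this table, Y separates the two classes perfectly, Z leaves both children as mixed as the parent, and X falls in between, so an entropy-based builder would split on Y at the root.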