Analyzing Child's Outdoor Play Based on Weather, Temperature, Humidity, and Wind, Assignments of Machine Learning

A step-by-step example of how to build a decision tree to predict whether a child will go out to play based on various weather, temperature, humidity, and wind conditions. The example uses information gain to determine the best feature to split on at each level of the tree. The final decision tree is presented, and the concept of boosting is briefly introduced.

Typology: Assignments

2023/2024

Uploaded on 02/23/2024

1967-amrita-sinha
1967-amrita-sinha ๐Ÿ‡ฎ๐Ÿ‡ณ

1 document

1 / 10

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
Decision Trees Example Problem
Consider the following data, where the Y label is whether or not the child goes out to play.
Day
Weather
Temperature
Humidity
Wind
Play?
1
Sunny
Hot
High
Weak
No
2
Cloudy
Hot
High
Weak
Yes
3
Sunny
Mild
Normal
Strong
Yes
4
Cloudy
Mild
High
Strong
Yes
5
Rainy
Mild
High
Strong
No
6
Rainy
Cool
Normal
Strong
No
7
Rainy
Mild
High
Weak
Yes
8
Sunny
Hot
High
Strong
No
9
Cloudy
Hot
Normal
Weak
Yes
10
Rainy
Mild
High
Strong
No
pf3
pf4
pf5
pf8
pf9
pfa

Partial preview of the text

Download Analyzing Child's Outdoor Play Based on Weather, Temperature, Humidity, and Wind and more Assignments Machine Learning in PDF only on Docsity!

Decision Trees Example Problem

Consider the following data, where the Y label is whether or not the child goes out to play.

Day Weather Temperature Humidity Wind Play?

1 Sunny Hot High Weak No

2 Cloudy Hot High Weak Yes

3 Sunny Mild Normal Strong Yes

4 Cloudy Mild High Strong Yes

5 Rainy Mild High Strong No

6 Rainy Cool Normal Strong No

7 Rainy Mild High Weak Yes

8 Sunny Hot High Strong No

9 Cloudy Hot Normal Weak Yes

10 Rainy Mild High Strong No

Step 1: Calculate the IG (information gain) for each attribute (feature)

Initial entropy = ๐ป(๐‘Œ) = โˆ’ โˆ‘ ๐‘ƒ(๐‘Œ = ๐‘ฆ) log 2

๐‘ฆ

log

2

log

2

= โˆ’( 0. 5 ) log

2

( 0. 5 ) โˆ’ ( 0. 5 ) log

2

Temperature:

Total entropy of this division is:

๐ป(๐‘Œ | ๐‘ก๐‘’๐‘š๐‘) = โˆ’ โˆ‘ ๐‘ƒ(๐‘ก๐‘’๐‘š๐‘ = ๐‘ฅ) โˆ‘ ๐‘ƒ(๐‘Œ = ๐‘ฆ | ๐‘ก๐‘’๐‘š๐‘ = ๐‘ฅ) log

2

๐‘ฅ ๐‘ฆ

= โˆ’(๐‘ƒ(๐‘ก๐‘’๐‘š๐‘ = ๐ป) โˆ‘ ๐‘ƒ(๐‘Œ = ๐‘ฆ |๐‘ก๐‘’๐‘š๐‘ = ๐ป) log

2

๐‘ฆ

๐‘ƒ(๐‘ก๐‘’๐‘š๐‘ = ๐‘€) โˆ‘ ๐‘ƒ(๐‘Œ = ๐‘ฆ |๐‘ก๐‘’๐‘š๐‘ = ๐‘€) log

2

๐‘ฆ

log

2

๐‘ฆ

1

2

) log

2

1

2

1

2

) log

2

1

2

3

5

) log

2

3

5

2

5

) log

2

2

5

log

2

log

2

IG(Y, temp) = 1 โ€“ 0.7884 = 0.

Temperature

HOT

N, Y, N, Y

MILD

Y, Y, N, Y, N

COLD

N

Humidity:

Total entropy of this division is:

๐ป(๐‘Œ | โ„Ž๐‘ข๐‘š) = โˆ’ โˆ‘ ๐‘ƒ(โ„Ž๐‘ข๐‘š = ๐‘ฅ) โˆ‘ ๐‘ƒ(๐‘Œ = ๐‘ฆ | โ„Ž๐‘ข๐‘š = ๐‘ฅ) log

2

๐‘ฅ ๐‘ฆ

= โˆ’(๐‘ƒ(โ„Ž๐‘ข๐‘š = ๐ป) โˆ‘ ๐‘ƒ(๐‘Œ = ๐‘ฆ |โ„Ž๐‘ข๐‘š = ๐ป) log

2

๐‘ฆ

๐‘ƒ(โ„Ž๐‘ข๐‘š = ๐‘) โˆ‘ ๐‘ƒ(๐‘Œ = ๐‘ฆ |โ„Ž๐‘ข๐‘š = ๐‘) log

2

๐‘ฆ

3

7

) log

2

3

7

4

7

) log

2

4

7

2

3

) log

2

2

3

1

3

) log

2

1

3

IG(Y, hum) = 1 โ€“ 0.8651 = 0.

Humidity

STRONG

Y, Y, Y, N, N, N, N

WEAK

Y, N, Y

Wind:

Total entropy of this division is:

๐ป(๐‘Œ | ๐‘ค๐‘–๐‘›๐‘‘) = โˆ’ โˆ‘ ๐‘ƒ(๐‘ค๐‘–๐‘›๐‘‘ = ๐‘ฅ) โˆ‘ ๐‘ƒ(๐‘Œ = ๐‘ฆ | ๐‘ค๐‘–๐‘›๐‘‘ = ๐‘ฅ) log

2

๐‘ฅ ๐‘ฆ

= โˆ’(๐‘ƒ(๐‘ค๐‘–๐‘›๐‘‘ = ๐‘†) โˆ‘ ๐‘ƒ(๐‘Œ = ๐‘ฆ |๐‘ค๐‘–๐‘›๐‘‘ = ๐‘†) log

2

๐‘ฆ

๐‘ƒ(๐‘ค๐‘–๐‘›๐‘‘ = ๐‘Š) โˆ‘ ๐‘ƒ(๐‘Œ = ๐‘ฆ |๐‘ค๐‘–๐‘›๐‘‘ = ๐‘Š) log

2

๐‘ฆ

2

6

) log

2

2

6

4

6

) log

2

4

6

1

4

) log

2

1

4

3

4

) log

2

3

4

IG(Y, wind) = 1 โ€“ 0.8755 = 0.

Step 2: Choose which feature to split with!

IG(Y, wind) = 0.

IG(Y, hum) = 0.

IG(Y, weather) = 0.

IG(Y, temp) = 0.

Wind

STRONG

Y, Y, N, N, N, N

WEAK

N, Y, Y, Y

Humidity

Entropy of โ€œSunnyโ€ node = โˆ’((

1

3

) log

2

1

3

2

3

) log

2

2

3

Entropy of its children = 0

IG = 0.

Entropy of โ€œRainyโ€ node = โˆ’((

1

4

) log

2

1

4

3

4

) log

2

3

4

Entropy of children = โˆ’(

3

4

1

3

) log

2

1

3

2

3

) log

2

2

3

IG = 0.

SUNNY

N, Y, N

Humidity

HIGH

N, N

NORMAL

Y

CLOUDY

Y, Y, Y

RAINY

N, N, Y, N

Humidity

HIGH

N, Y, N

NORMAL

N

Wind

Entropy of โ€œSunnyโ€ node = โˆ’((

1

3

) log

2

1

3

2

3

) log

2

2

3

Entropy of its children = โˆ’(

2

3

1

2

) log

2

1

2

1

2

) log

2

1

2

IG = 0.

Entropy of โ€œRainyโ€ node = โˆ’((

1

4

) log

2

1

4

3

4

) log

2

3

4

Entropy of children = 0

IG = 0.

SUNNY

N, Y, N

Wind

STRONG

N, Y

WEAK

N

CLOUDY

Y, Y, Y

RAINY

N, N, Y, N

Wind

STRONG

N, N, N

WEAK

Y

Boosting

(https://www.ccs.neu.edu/home/vip/teach/MLcourse/4_boosting/slides/boosting.pdf)