Dummy Variables - Econometrics - Lecture Notes, Study notes of Econometrics and Mathematical Economics

Dummy variables, Dependent variable, Regression output, Regression equation, Intercepts of two types, Regression equation for ISO certified, Intercept dummy are points you can learn about Econometric in this lecture.

Typology: Study notes

2011/2012

Uploaded on 11/10/2012

uzman
uzman ๐Ÿ‡ฎ๐Ÿ‡ณ

4.8

(12)

144 documents

1 / 4

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
Chapter 07
The dummy variables
Some time we have a variable involved in a regression which is not directly interpretable in numbers.
For example, suppose we want to see the effect of gender on the educational performance. Along with
other variables that determine the educational performance, gender is also a variable. But this variable
is not numerical. Gender can take only two values i.e. male or female, which has no direct quantitative
representation. In this case, the dummy variables are used to see the effect of such dummy variable on
any dependent variable. Consider the following data set
CGPA
Study Time
Gender
3.3
5
Male
3.7
6
Female
2.5
5
Female
1.9
4
Male
3.9
6
Female
It has been observed in many educational institutions that the female students secure higher grades in
the exams. We want to see whether there is some effect of gender on educational performance or the
observed difference is just due to the fact that female spend more time for education. To see this we
want to use gender as a variable in the regression. We cannot directly use gender in the regression, let
me introduce a dummy variable as follows:
Let ๐‘‘๐‘š denote the dummy for male which is defined as follows:
๐‘‘๐‘š =๏ฟฝ1๐‘–๐‘“ ๐‘ ๐‘ก๐‘ข๐‘‘๐‘’๐‘›๐‘ก ๐‘ค๐‘Ž๐‘  ๐‘š๐‘Ž๐‘™๐‘’
0๐‘œ๐‘กโ„Ž๐‘’๐‘Ÿ๐‘ค๐‘–๐‘ ๐‘’
We can use this variable instead of the original variable. The data will become
CGPA
Study Time
Dm
3.3
5
1
3.7
6
0
2.5
5
0
1.9
4
1
3.9
6
0
Docsity.com
pf3
pf4

Partial preview of the text

Download Dummy Variables - Econometrics - Lecture Notes and more Study notes Econometrics and Mathematical Economics in PDF only on Docsity!

Chapter 07

The dummy variables

Some time we have a variable involved in a regression which is not directly interpretable in numbers. For example, suppose we want to see the effect of gender on the educational performance. Along with other variables that determine the educational performance, gender is also a variable. But this variable is not numerical. Gender can take only two values i.e. male or female, which has no direct quantitative representation. In this case, the dummy variables are used to see the effect of such dummy variable on any dependent variable. Consider the following data set

CGPA Study Time Gender 3.3 5 Male 3.7 6 Female 2.5 5 Female 1.9 4 Male 3.9 6 Female

It has been observed in many educational institutions that the female students secure higher grades in the exams. We want to see whether there is some effect of gender on educational performance or the observed difference is just due to the fact that female spend more time for education. To see this we want to use gender as a variable in the regression. We cannot directly use gender in the regression, let me introduce a dummy variable as follows:

Let ๐‘‘๐‘š denote the dummy for male which is defined as follows:

๐‘‘๐‘š = ๏ฟฝ^1 ๐‘–๐‘“^ ๐‘ ๐‘ก๐‘ข๐‘‘๐‘’๐‘›๐‘ก^ ๐‘ค๐‘Ž๐‘ ^ ๐‘š๐‘Ž๐‘™๐‘’ 0 ๐‘œ๐‘กโ„Ž๐‘’๐‘Ÿ๐‘ค๐‘–๐‘ ๐‘’

We can use this variable instead of the original variable. The data will become

CGPA Study Time Dm 3.3 5 1 3.7 6 0 2.5 5 0 1.9 4 1 3.9 6 0

Note that we were having a variable with two categories, and we have introduced only one dummy. This is because if we have only two possible options, knowledge of one option verifies whether other option exists or not. Technically it is not possible to compute regression with both dummies. Similarly if we have a variable with k-categories, we will introduce k-1 dummies.

Now we can calculate the regression in usual way. The calculated regression output is as follows:

SUMMARY OUTPUT

Regression Statistics Multiple R 0. R Square 0. Adjusted R Square 0. Standard Error 0. Observations 5

ANOVA df SS MS F Regression 2 2.809143 1.404571 122. Residual 2 0.022857 0. Total 4 2.

Coefficients

Standard Error t Stat P-value Intercept -4.24286 0.564241 -7.51959 0. Study Time 1.342857 0.098974 13.56773 0. Dm 0.8 0.151186 5.291503 0.

On basis of this regression output, we can write the regression output as follows;

CGPA=-4.24+1.34Studytime+0.8dm+e

This regression equation can give rise to two regression equations in following way.

๏‚ง Regression equation for male

If the student was male, dm=1, and we have

CGPA=-4.24+1.34Studytime+0.8+e=-3.44+1.34studytime+e

๏‚ง Regression equation for female

๏‚ง If the company is ISO certified, Diso=1 and we get

P= P=0.62+0.26V+0.35Adv+e

๏‚ง If company is not certified than Diso=0 and regression equation is

P=0.45+0.26V+0.35Adv+e

The intercept for ISO certified is higher; however the difference in the intercepts of two types is not significant on statistical grounds.