











Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
The use of F-tests for joint hypothesis tests when dealing with multiple coefficients in regression analysis. It covers the concept of joint hypotheses, the role of F-tests in model selection, and the correlation between estimators. The document also includes examples and formulas for calculating the F-test statistic.
Typology: Lecture notes
1 / 19
This page cannot be seen from the preview
Don't miss anything!












Now that we have multiple โ X โ variables, and multiple ฮฒ s, our
hypotheses might also involve more than one ฮฒ.
summary(lm(wage ~ education + gender + age + experience))
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) - 1.9574 6.8350 - 0.286 0.
education 1.3073 1.1201 1.167 0.
genderfemale - 2.3442 0.3889 - 6.028 3.12e- 09 ***
age - 0.3675 1.1195 - 0.328 0.
experience 0.4811 1.1205 0.429 0.
Signif. codes: 0 โโ 0.001 โโ 0.01 โโ 0.05 โ.โ 0.1 โ โ 1
Residual standard error: 4.458 on 529 degrees of freedom
Multiple R-squared: 0.2533, Adjusted R-squared: 0.
F-statistic: 44.86 on 4 and 529 DF, p-value: < 2.2e- 16
The results of the above regression make me want to drop age and
experience.
This corresponds to the hypothesis:
0
: ฮฒ
3
= 0 and ฮฒ
4
A
: either ฮฒ
3
โ 0 or ฮฒ
4
โ 0 or both
Why would we want to drop variables?
3
4
In the model:
0
1
1
2
2
3
3
4
4
3
and ๐
4
are not independent (e.g. they are
correlated)
3
and b
4
will be correlated - the
formula for b
3
(etc.) involves all of the โXโ variables
(remember OVB)
3
and t
4
will be correlated!
Suppose that ๐
3
and ๐
4
are positively correlated. Consider the
null:
0
: ฮฒ
3
= 0 and ฮฒ
4
3
and b
4
are both positive (or negative), itโs not that big of a
deal
Iโm going to estimate two models:
unrestricted model (the ฮฒ s are allowed to be anything)
model. I get this model by taking the null hypothesis to heart.
That is, substitute in the values ฮฒ
3
= 0 and ฮฒ
4
= 0 into the full
model
Unrestricted model (under H A
unrestricted <- lm(wage ~ education + gender
Restricted model (under H 0
restricted <- lm(wage ~ education + gender)
estimators that are involved in the test
than the restricted model, we should reject the null.
the model under the alternative leads to a formulation of the F -
test statistic, for testing joint hypotheses.
The RSS is a measure of fit:
๐
2
๐
๐= 1
where
e
๐
๐
๐
The F-test statistic may be written as:
๐๐๐ ๐ก๐๐๐๐ก๐๐
๐ข๐๐๐๐ ๐ก๐๐๐๐ก๐๐
๐ข๐๐๐๐ ๐ก๐๐๐๐ก๐๐
๐ข๐๐๐๐ ๐ก๐๐๐๐ก๐๐
where ๐ = # of restrictions, k = # of โ X โs
2 2
2
unrestricted restricted
unrestricted unrestricted
where:
2
restricted
2
for the restricted regression
2
unrestricted
2
for the unrestricted regression
q = the number of restrictions under the null
k unrestricted
= the number of regressors in the unrestricted regression.
The bigger the difference between the restricted and unrestricted
2
โs โ the greater the improvement in fit by adding the variables in
question โ the larger is the F statistic.
2
in the
restricted model ( H
0
model) and the unrestricted model ( H
A
model).
exceeds the (5%) critical value:
q 5% critical value
assume this)
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) - 1.9574 6.8350 - 0.286 0.
education 1.3073 1.12 01 1.167 0.
genderfemale - 2.3442 0.3889 - 6.028 3.12e- 09 ***
age - 0.3675 1.1195 - 0.328 0.
experience 0.4811 1.1205 0.429 0.
Signif. codes: 0 โโ 0.001 โโ 0.01 โโ 0.05 โ.โ 0.1 โ โ 1
Residual standard error: 4.458 on 529 degrees of freedom
Multiple R-squared: 0.2533, Adjusted R-squared: 0.
F-statistic: 44.86 on 4 and 529 DF, p-value: < 2.2e- 16
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 0.21783 1.03632 0.210 0.
education 0.75128 0.07682 9.779 < 2e- 16 ***
genderfemale - 2.12406 0.40283 - 5.273 1.96e- 07 ***
Signif. codes: 0 โโ 0.001 โโ 0.01 โโ 0.05 โ.โ 0.1 โ โ 1
Residual standard error: 4.639 on 531 degrees of freedom
Multiple R-squared: 0.1884, Adjusted R-squared: 0.
F-statistic: 61.62 on 2 and 531 DF, p-value: < 2.2e- 16