



Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Multiple Regression Excel Lab, Excel Setup, Multiple Regression, Analysis, Regression Results, Estimate of the Intercept, Estimate of the Slopes, Standard Error, Regression Statistics, Summary Output are some points from this lecture, Econometrics handout.
Typology: Study notes
1 / 6
This page cannot be seen from the preview
Don't miss anything!




Multiple Regression Excel Lab
Part I. Excel Setup A. Be sure you’ve got the data analysis tool pack added in Under “tools” in the main tool bar at the top of the screen see if the option “data analysis” is available (it should be at the bottom of the list). If data analysis is not there do the following:
B. Purpose of Today’s lab In today’s lab we’re going to learn how to get Excel to perform multiple regression, examine dummy variables, and take a “model-building” approach to regression.
Part II. Multiple Regression Here we’re going to estimate the linear relationship between ATTENDANCE (Y) and POP (X 1 ) and CURNTWIN (X 2 ).
A. Sheet Set-up
B. Analysis
Examine the regression results. Locate each of the following:
0
1
2
c. The standard error of the estimate (SER)
0
1
2
f. The 95% CI’s for β 0 , β 1 and β 2
g. The t-statistics for Ho(1): β 0 = 0 , Ho(2):β 1 = 0 , and Ho(3): β 2 = 0
h. The value of R-square (recall that it shows the percentage of the variation in Y that we explained with our regression).
Questions:
**1. Write out and interpret the regression equation you’ve estimated.
SUMMARY OUTPUT (Y = attendance, X = POP)
Regression Statistics Multiple R 0. R Square 0. Adjusted R Square 0. Standard Error 542. Observations 78
ANOVA
df SS MS F Significance F Regression 1 5180060.737 5180061 17.61062 7.28E- Residual 76 22354952.16 294144. Total 77 27535012.
Coefficients Standard Error t Stat P-value Lower 95%
Upper 95% Intercept 1393.433079 111.2778705 12.5221 3.7E-20 1171.804 1615. POP 0.109927776 0.026195102 4.196501 7.28E-05 0.057756 0.
SUMMARY OUTPUT (Y = attendance, X = CURNTWIN)
Regression Statistics Multiple R 0. R Square 0. Adjusted R Square 0. Standard Error 440. Observations 78
ANOVA
df SS MS F Significance F Regression 1 12819510.34 12819510 66.20792 6.02E- 12 Residual 76 14715502.55 193625 Total 77 27535012.
Coefficients Standard Error t Stat P-value Lower 95%
Upper 95% Intercept -1340.9213 387.1268058 -3.46378 0.000878 -2111.95 -569. CURNTWIN 38.6325231 4.74786222 8.136825 6.02E-12 29.17632 48.
Part III. Multiple regression with dummy variables: C. Import the data for this section The data set for this section is titled “salary, age, Ph.D, evaluation, pubs, and gender for 100 university professors”, and is on my webpage (“professor data”).
1 st, we’re going calculate the correlation coefficients for all combinations of these 6 variables.
Your output should consist of a 6 x 6 matrix. The correlation coefficient between two variables is found in the corresponding off-diagonal element of the matrix.
→ Rank age, Ph.D, evaluation, pubs, and gender in order of their highest correlation with salary.
Questions: → Given that “gender” and “PhD” are dummy variables, how would you interpret the mean of these variables?
→ Given that “gender” and “PhD” are dummy variables, how do you interpret correlation coefficients between these variables and other (quantitative) variables?
Regressions:
→ What is the interpretation of the coefficient on “gender”?
→ What is the interpretation of the coefficient on “Ph.D”?
→ What would you conclude about the following null hypotheses in the final regression?
1. Ho(1): β **0 = 0
→ Based on the above, what can you therefore conclude about the significance of the true regression coefficients in the context of the regression? Explain.
2. How has the value of the coefficient of determination (R^2 ) and adjusted coefficient of determination changed as you add variables to the model? (fill-in the table below)
Regression R-squared Adjusted R-squared Y = f(X 1 ) Y = f(X 1 , X 2 ) Y = f(X 1 , X 2 , X 3 ) Y = f(X 1 , X 2 , X 3 , X 4 ) Y = f(X 1 , X 2 , X 3 , X 4 , X 5 )
Based on the changes in R^2 and adjusted R^2 , what can you say about the contribution of each of these X variables towards explaining the variation in Y?