
















Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Material Type: Notes; Class: INTRO BUSINESS STAT; Subject: Statistics; University: University of Pennsylvania; Term: Unknown 1989;
Typology: Study notes
1 / 24
This page cannot be seen from the preview
Don't miss anything!

















Score
50
60
70
80
90
100
110
120
130
Score
18
19
5 10 15 20 25 30 35 40 45
Age
Bivariate Fit of Score By Age
70
80
90
100
110
120
130
Score
5 10 15 20 25
Age
Linear Fit
Score = 105.63 - 0.78Age
Summary of Fit
RSquare
Analysis of Variance
Parameter Estimates
50
60
70
80
90
100
110
120
130
Score
5 10 15 20 25 30 35 40 45
Age
Linear Fit
Score = 109.87 - 1.13Age
Summary of Fit
RSquare
Analysis of Variance
Source DF Sum of Squares Mean Square F Ratio
Model 1 1604.0809 1604.08 13.
Error 19 2308.5858 121.50 Prob > F
C. Total 20 3912.
Parameter Estimates
Term Estimate Std Error t Ratio Prob>|t|
Intercept 109.87384 5.067802 21.68 <.
Age - 1.126989 0.310172 - 3.63 0.
Source DF Sum of Squares Mean Square F Ratio
Model 1 280.5195 280.519 2.
Error 18 2220.4805 123.360 Prob > F
C. Total 19 2501.
Term Estimate Std Error t Ratio Prob>|t|
Intercept 105.62987 7.161928 14.75 <.
Age - 0.779221 0.516733 - 1.
Conclusion: It is not clear at all that scores and ages are
related for normal children
Based on the previous study we will use PRECIP, EDUC,
NONWHITE, Log(NOX) and SO2 to predict MORT.
a) We will use stepwise selection guided by the effect tests to
add (or delete) predictors into the model.
b) Under Analyze > Fit Model >
MORT > Y
Add PRECIP, EDUC, NONWHITE, Log(NOX) and
SO2 into
Construct Model Effects
c) Choose Stepwise under Personality > Run Model.
We will check (or uncheck) each variable according to the āF-
ratioā statistics. The final model is chosen based on R-squares
and the p-values. Usually, only variables which are significant
should stay in the final model.
Stepwise Fit
Response:
Stepwise Regression Control
Prob to Enter 0.
Prob to Leave 0.
Direction:
Current Estimates
SSE DFE MSE RSquare RSquare Adj Cp AIC
Enter ed
Step History
Step Parameter Action "Sig
Prob"
Seq SS RSquare Cp p
1 NONWHITE Entered 0.0000 94595.56 0.4144 43.366 2
2 EDUC Entered 0.0000 33848.33 0.5627 20.206 3
3 SO2 Entered 0.0030 14603.66 0.6267 11.35 4
4 PRECIP Entered 0.0138 8965.8 0.6659 6.6858 5
Parameter Estimate nDF SS "F Ratio" "Prob>F"
Intercept 999.316169 1 0 0.000 1.
Log(NOX). 1 3613.212 2.686 0.
e
ļ· Residual plot of MORT vs. PRECIP, EDUC, NONWHITE and SO
ļ· Four places shown on the plot show some large residuals.
ļ· Notice that residual plots for multiple regression are using residuals vs. predicted values.
0
50
100
750 850 950 1050
j:
j
j
(both axes are recentered at their means)
j
Pollution data: the final model is
Summary of Fit
RSquare 0.
Analysis of Variance
Source DF Sum of Squares Mean Square F Ratio
Model 4 152013.34 38003.3 27.
Error 55 76259.74 1386.5 Prob > F
C. Total 59 228273.08 <.
Parameter Estimates
Term Estimate Std Error t Ratio Prob>|t|
Intercept 999.31617 92.07861 10.85 <.
Effect Tests
Source Nparm DF Sum of Squares F Ratio Prob > F
Residual by Predicted Plot
0
50
100
MORT Residual
750 800 850 900 950 1050 1150
MORT Predicted
Whole Model
Actual by Predicted Plot
Summary of Fit
RSquare 0.
RSquare Adj 0.
Root Mean Square Error 37.
Mean of Response 940.
Observations (or Sum Wgts) 60
Analysis of Variance
Source DF Sum of Squares Mean Square F Ratio
Model 4 152013.34 38003.3 27.
Error 55 76259.74 1386.5 (^) Prob > F
C. Total 59 228273.08 <.
Parameter Estimates
Term Estimate Std Error t Ratio Prob>|t|
Intercept 999.31617 92.07861 10.85 <.
PRECIP 1.6111235 0.633579 2.54 0.
EDUC - 15.773 37 6.992113 - 2.26 0.
NONWHITE 3.0609258 0.614004 4.99 <.
SO2 0.3271823 0.083867 3.90 0.
Effect Tests
Source Nparm DF Sum of
Squares
F
Ratio
Prob >
F
PRECIP 1 1 8965.800 6.4663 0.
EDUC 1 1 7056.097 5.0890 0.
NONWHITE 1 1 34458 .462 24.8521 <.
SO2 1 1 21102.285 15.2194 0.
Leverage Plot
MORT Leverage Residuals
New Orleans, LA
NONWHITE Leverage, P<.
Bivariate Fit of Y Leverage of NONWHITE for MORT
By X Leverage of NONWH
800
850
900
950
1000
1050
1100
Y Leverage of NONWHITE for MORT
New Orleans, LA
-5 0 5 10 15 20 25 30 35
X Leverage of NONWHITE for MORT
Linear Fit
Y Leverage of NONWHITE for MORT =
904.02358 + 3.0609258 X Leverage of NONWHITE for MORT
Summary of Fit
RSquare 0.
RSquare Adj 0.
Root Mean Square Error 36.
Mean of Response 940.
Observations (or Sum Wgts) 60
Analysis of Variance
Source DF Sum of Squares Mean Square F Ratio
Model 1 34458.46 34458.5 26.
Error 58 76259.74 1314.8 (^) Prob > F
C. Total 59 110718.20 <.
Parameter Estimates
Term Estimate Std Error t Ratio Prob>|t|
Intercept 904.02358 8.502028 106.33 <.
X Leverage of NONWHITE for MORT 3.0609258 0.597914 5.12 <.
ļ·The output from the
whole model fit is on the
left together with the
Leverage plot for
NONWHITE
ļ·We can reproduce the
leverage plot by
Analyze > Fit Model >
Save Columns > Effect
Leverage Pairs.
Then fit Y leverage to X
leverage in a simple
regression , shown on the
right.
ļ· Notice the coefficients
for NONWHITE are the
same from both outputs.