



















Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Lecture 10. Multiple Linear Regression ... 10-2. Topic Overview. • Multiple Linear Regression Model ... Note formulas are same as before, with hat matrix:.
Typology: Study notes
1 / 27
This page cannot be seen from the preview
Don't miss anything!




















STAT 512Spring 2011
Background Reading
Multiple Linear Regression Model
Predictor variables are often correlated to
each other.
If predictor variables are highly correlated,
they will be “fighting” to explain the samepart of the variation in the responsevariable.
Caution
: Using highly correlated predictor
variables in the same model will not leadto useful parameter estimates. Want to becareful of this.
0
1
1
2
2
1
,^
1
i^
i^
i^
p^
i p
i
β
β
β
β
ε
−
−
i^
n
observations
Assumptions exactly as before:
iid i^
ε
σ
i Y
is the value of the response variable for the
i
th
case.
ik X
is the value of the
k
th
explanatory
variable for the
i
th
case.
p
β^ β p β
^
^
^
^
=
^
^
^
^
^
⋮
β
Minimize distances between point and
response surface
Find b to minimize
Xb
Xb
Obtain normal equations as before:
X Xb = X Y
Least Squares Solution as before:
1 −
′^
b
The term
linear
here refers to the
parameters
, not the predictor variables.
We can use
linear
regression models to deal
with almost any “function” of a predictorvariable (e.g.
(
)
2 , log X
, etc.)
We cannot use
linear
regression models to
deal with nonlinear functions of theparameters (unless we can find atransformation that makes them linear).
Continuous Predictors – we are used to
these.
Qualitative Predictors
^
Two possible outcomes (e.g. male/female)represented by 0 or 1
Polynomial Regression
^
Use squared or higher-ordered terms in regressionmodel. ^
Typically always include lower order terms. ^
2
1
0
1
2
1
p
i^
i^
i^
p^
i^
i
Y
X
X
X
β
β
β
β
ε
−
−
=
⋯
Formulas for sums of squares(in matrix terms)are the same as before
(
)
(
)
(^
) 2 2
2
i i^
i
i
n
^ n
∑^ ∑
∑
b X Y
e e
b X Y
Degrees of Freedom depend on the model
Always
n – 1
total degrees of freedom
Model degrees of freedom is equal to the
number of terms in the model
p
Each variable has at least one term
May be additional terms for squares,interactions, etc.
Error degrees of freedom is difference
between total and model degrees offreedom
n
p −
Source
df
Regression
(Model)
p-
(^
(^2) )
ˆY^ i
∑
R SSR^ df
MSR MSE
Error
n-p
(
(^2) ) ˆ
i^
i
∑
E SSE^ df
Total
n-
(^
(^2) )
i Y
∑
T SSTO
df
10-
The ratio F = MSR / MSE is again used to
test for a regression relationship.
Difference from SLR
Null Hyp:
0
1
2
1
p
β
β
β
−
Alt Hyp:
at least one
a^
k
β
Tests model significance, not individual
variables; gives no indication of whichvariable(s) in particular are important