












Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Material Type: Notes; Professor: Cui; Class: STATISTCS FOR BIOLOGICL SCIENCES; Subject: Statistics; University: University of California-Riverside; Term: Unknown 1989;
Typology: Study notes
1 / 20
This page cannot be seen from the preview
Don't miss anything!













i 0 1 i 1 i
i
th
i
i
i
i
j
Response/Regression function and an example:
i i 1
i 0 1 i 1
The response Y i
, given the level of X in the i
th
trial Xi, comes from a probability distribution
whose mean is 9.5+2.1 Xi. Therefore,
Response/Regression function relates the
means of the probability distributions of Y (for
given X ) to the level of X.
o We want to find the pair (b 0
, b 1
) that
minimizes
SSE= ^
2
i
e = ^
2
i 0 1 i
(Y b bX)
o We set the partial derivatives of SSE with
respect to b 0 , b 1 equal to zero:
X(Y b bX) 0
( X)( 2 )(Y b bX) 0
Normalequation(2) :
(Y b bX) 0
( 1 )( 2 )(Y b bX) 0
Normalequation(1) :
i i 0 1 i
i i 0 1 i
1
i 0 1 i
i 0 1 i
0
Then the solution is (derivation on board);
2
i
i i
1
0 1
b
Y b X
b
Properties of the residuals
o
e 0. i
since the regression line goes
through the point
( X,Y) .
o
Ye 0
X e 0 and i i i i The residuals are
uncorrelated with the independent variables
X i
and with the fitted values i
. (prove it on
board)
Why
e 0. i
,
Ye 0?
X e 0 and i i i i
In fact, from normal equation (1) and (2), we
can immediately tell
e 0 i
(^) and
X e 0 i i
(^). Since
i i 0 1 i i 0 i 1 i i
Ye (b bX)e b e b Xe
, we can easily
know that
Ye 0?
i i
o Least square estimates are uniquely defined
as long as the values of the independent
variable are not all identical. In that case the
numerator
2
i
(^) (draw figure).
Point Estimation of Error Terms Variance
o Single Population: Unbiased sample
variance estimator of the population
variance
2 2
2
i 2
E s
n 1
s
Estimating the Mean Value at Xh:
Estimator
Mean
Variance
Estimated variance
Analysis of Variance Approach to Regression
Analysis
XX
h
h
XX
h
h
h h
h h
h h
2
2
2
2 2
0 1
0 1
2
i
2
i
2
i i i
2
i
i i i i
TOT
Basic Table
Source of
Variation SS df MS E { MS }
Regression
SSR Y Y i
2 1
MSR
SSR
1
2
1
2
2
i
Error
SSE (^) Y Y i i
2 n - 2
Total SS (^) Y Y TOT i
2
n - 1
o Test Statistic
o F Distribution
o Numerator
Degrees of Freedom dfR=
o Denominator
Degrees of Freedom dfR=n-
o Hypothesis:
o Decision Rule:
Fitting a regression in SAS
data Toluca;
infile 'C:\stat231B06\ch01ta01.txt';
input lotsize workhrs;
proc reg ; /*least square estimation of regression
coefficient*/
model workhrs=lotsize;
output out=results p=yhat r=residual;/*yhat denotes for fitted values
and residual denotes for residual values*/
1
0
1 1
0 1
If 1 ; 1 , 2
If 1 ; 1 , 2
For levelofsignifican ce
F F n H
F F n H
(y y) 0 i
, only
calculated from (^)
n 1
i 1
n i
(2) SSE= (^)
2
i
e
p 1 pp p 1
(3) SSR= (^)
2
i
(y ˆ y)
i
( yˆ y) (yˆ y y y) (yˆ y) (y y) 0 i i i i i i i
. Hence SSR has