
Statistics 333 Assignment 1 Due Sept. 19, 2003
1. For the simple linear regression model Yi=
β
0+
β
1Xi+
ε
i,i=1, . . . , n,consider the least
squares fits ˆ
Yi=b0+b1Xiand residuals givenbye
i=Y
i−ˆ
Y
i=Y
i−(b
0+b
1
X
i
),i=1, . . . , n.
Since b0=Y−b1X,itiseasy to see (1/n)Σn
i=1ˆ
Yi=Yand hence also that Σn
i=1(Yi−ˆ
Yi)=0.
Similarly,also showthat the residuals eisatisfy the additional ‘orthogonality’ constraint
Σn
i=1Xi(Yi−ˆ
Yi)=0.
2. For the simple linear regression model in Exercise 1, under the standard model assump-
tions for the random errors
ε
i,showthat the least squares estimator b1=Sxy /Sxx and
Y=(1/n)Σn
i=1Yihave zero covariance, i.e., Cov( b1,Y)=0,where Sxy =Σn
i=1(Xi−X)Yi.
3. For the simple linear regression model, the estimator of the error variance
σ
2=Var(
ε
i)is
givenby
S
2=1
n−2
n
i=1
Σ(Yi−ˆ
Yi)2≡1
n−2SSE .
Showthat this estimator S2is unbiased for
σ
2,i.e., prove that E(S2)=
σ
2.
Note: Toprove this result, use the following approach. First, verify the identity
Yi−(
β
0+
β
1Xi)=[Yi−(b0+b1Xi)]+[Y−(
β
0+
β
1X)]+[(b1−
β
1)(X
i−X)].
Then verify that the sum of squares of elements on the left-hand side,
Σn
i=1[Yi−(
β
0+
β
1Xi)]2,isequal to the sum of the 3 sums of squares of individual elements
on the right-hand side, due to ‘orthogonality’ in cross-terms (Exer.1). Hence, verify that
Σn
i=1[Yi−(
β
0+
β
1Xi)]2=SSE +n[Y−(
β
0+
β
1X)]2+Sxx (b1−
β
1)2.
Finally,equate the expected values of both sides of the sums of squares relation, to get
n
σ
2=E[SSE ] +nE[Y−(
β
0+
β
1X)]2+Sxx E[(b1−
β
1)2], ‘evaluate’ the other expected
value terms, and solvefor E[SSE ].Also use definitions and known results for Var( b1)
etc., for instance, E{[Yi−(
β
0+
β
1Xi)]2}=Var(Yi)≡
σ
2by definition of variance.
4. Consider the zero or no intercept model givenbyY
i=
β
1
X
i+
ε
i
,i=1, . . . , n,with the errors
ε
ibeing independent, normal r.v.’ s with mean 0 and variance
σ
2.
i) Derive the least squares estimator b1of
β
1,and also derive the variance of this estimator.
ii) For an arbitrary fixed value X0of X,establish that a 100(1 −
α
)%confidence interval for
the mean response value E(Y|X0)=
β
1X0is givenby
b
1
X
0±t
(
α
/2)
n−1S√
X2
0/Σn
i=1X2
i,
where S2=Σn
i=1(Yi−b1Xi)2/(n−1) provides an unbiased estimator of
σ
2with n−1df.
Note: You can conclude that (b1−
β
1)/se( b1)has the tn−1distribution.
5. An experiment was conducted to study the mass of a tracer material exchanged between
the main flowofanopen channel and the "dead zone" caused by a sudden open channel
expansion. Researchers need this information to improve the water quality modeling capa-
bility of a river. Itisimportant to determine the exchange constant Kfor varying flow