Completely Randomized Design - Lecture Notes | STAT 507, Study notes of Statistics

Material Type: Notes; Class: Experimental Design; Subject: Statistics; University: University of Idaho; Term: Unknown 1989;

Typology: Study notes

Pre 2010

Uploaded on 08/19/2009

koofers-user-0zj
koofers-user-0zj 🇺🇸

9 documents

1 / 2

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
1 Completely Randomized Design (CRD) Part I
It is the simplest of designs, but even planning for it requires scientific and statistical decisions (Acid rain
example).
1.1 Exploratory Data Analysis (EDA)
Essential for any statistical analysis. For the CRD we often use boxplots for initial examination of the data.
From the boxplot we can look for evidence of a treatment difference, and possible outliers or problems with
homogeneity of variance.
1.2 ANOVA as a choice of the best fitting model for the mean
Let yij (i= 1, ..., g;j= 1, ..., ni) be the jth observation in group i. In ANOVA from a CRD we consider two
models for yij . The first model, yij =µi+εij , specifies that each group has a different mean value µi. This
is also called the full model for yij . The second model, yij =µ+εij , specifies that all groups have identical
mean values µ. This is also called the reduced model for yij . Note that the reduced model is a special
case (or subset) of the full model. Both models make the assumption that the εij are independent, have
mean zero, and variance σ2. To conduct statistical inference (tests, confidence intervals, etc.) we make the
further assumption that the εij have a normal distribution.
An alternative way to express the models above is by letting µi=µ+αi, where µis the overall mean
and αiis the treatment effect of group i. Then the full model is yij =µ+αi+εij . This new formulation
of the full model generalizes well to more complicated models, but introduces a complication because there
are now more parameters than groups. For both the full and reduced models, we can develop estimators
for the parameters µand σ2(reduced) or µ, µi, αi,and σ2(full), shown in Display 3.1 in the text. We can
also calculate confidence intervals for our parameters.
To choose between the full and reduced models, we compare their sum of squared residuals (SSR). A
residual ris the error in predicting an observation, r=yby, where byis the predicted value of y. For the full
model, byij =yi·(the sample group mean), and for the reduced model, byij =y·· (the sample overall mean).
SSR for the full model can never exceed SSR for the reduced model, so we wish to decide if SSR for the full
model has been reduced enough to account for the extra parameters (µi) in the full model.
1.3 Analysis of Variance mechanics
Analysis of variance involves a partition of the total sum of squares for the observations yij. Using our
notation from above, yij y·· = (yij yi·)+(yi·
y··), which equals the residual (from the full model) plus
the ith treatment effect, or rij +bαi.By squaring and summing these terms and cancelling the cross product
we obtain:
g
X
i=1
ni
X
j=1
(yij y··)2=
g
X
i=1
ni
X
j=1
(yij yi·)2+
g
X
i=1
ni
X
j=1
(yi·
y··)2,or SST= SSE+SST rt .
As an example, consider three groups with the following data: Group 1 has y1jvalues of 1, 2, and 3,
Group 2 has y2jvalues of 5, 3, and 4, and Group 3 has y3jvalues of 6, 7, and 5. The overall sample mean
is y·· = (Pg
i=1 P3
j=1 yij)/3g= 36/9 = 4.Then SSTis
1
pf2

Partial preview of the text

Download Completely Randomized Design - Lecture Notes | STAT 507 and more Study notes Statistics in PDF only on Docsity!

1 Completely Randomized Design (CRD) Part I

It is the simplest of designs, but even planning for it requires scientific and statistical decisions (Acid rain example).

1.1 Exploratory Data Analysis (EDA)

Essential for any statistical analysis. For the CRD we often use boxplots for initial examination of the data. From the boxplot we can look for evidence of a treatment difference, and possible outliers or problems with homogeneity of variance.

1.2 ANOVA as a choice of the best fitting model for the mean

Let yij (i = 1, ..., g; j = 1, ..., ni) be the jth observation in group i. In ANOVA from a CRD we consider two models for yij. The first model, yij = μi + εij , specifies that each group has a different mean value μi. This is also called the full model for yij. The second model, yij = μ + εij , specifies that all groups have identical mean values μ. This is also called the reduced model for yij. Note that the reduced model is a special case (or subset) of the full model. Both models make the assumption that the εij are independent, have mean zero, and variance σ^2. To conduct statistical inference (tests, confidence intervals, etc.) we make the further assumption that the εij have a normal distribution. An alternative way to express the models above is by letting μi = μ∗^ + αi , where μ∗^ is the overall mean and αi is the treatment effect of group i. Then the full model is yij = μ∗^ + αi + εij. This new formulation of the full model generalizes well to more complicated models, but introduces a complication because there are now more parameters than groups. For both the full and reduced models, we can develop estimators for the parameters μ and σ^2 (reduced) or μ, μi, αi, and σ^2 (full), shown in Display 3.1 in the text. We can also calculate confidence intervals for our parameters. To choose between the full and reduced models, we compare their sum of squared residuals (SSR). A residual r is the error in predicting an observation, r = y − y,̂ where ̂y is the predicted value of y. For the full model, ̂yij = yi· (the sample group mean), and for the reduced model, ̂yij = y·· (the sample overall mean). SSR for the full model can never exceed SSR for the reduced model, so we wish to decide if SSR for the full model has been reduced enough to account for the extra parameters (μi) in the full model.

1.3 Analysis of Variance mechanics

Analysis of variance involves a partition of the total sum of squares for the observations yij. Using our notation from above, yij − y·· = (yij − yi·) + (yi· − y··), which equals the residual (from the full model) plus the ith treatment effect, or rij + ̂αi. By squaring and summing these terms and cancelling the cross product we obtain:

∑^ g

i=

∑^ ni

j=

(yij − y··)^2 =

∑^ g

i=

∑^ ni

j=

(yij − yi·)^2 +

∑^ g

i=

∑^ ni

j=

(yi· − y··)^2 , or SST = SSE +SST rt.

As an example, consider three groups with the following data: Group 1 has y 1 j values of 1, 2, and 3, Group 2 has y 2 j values of 5, 3, and 4, and Group 3 has y 3 j values of 6, 7, and 5. The overall sample mean is y·· = (

∑g i=

j=1 yij^ )/^3 g^ = 36/9 = 4.^ Then SST^ is

∑g

i=

∑^ ni

j=

(yij − y··)^2 = (1 − 4)^2 + (2 − 4)^2 + ... + (5 − 4)^2 = 30.

The group means are y 1 · = 2, y 2 · = 4, and y 3 · = 6, so SSE and SST rt are

SSE =

∑^ g

i=

∑^ ni

j=

(yij − yi·)^2 = (1 − 2)^2 + (2 − 2)^2 + (3 − 2)^2 + (5 − 4)^2 + ... + (5 − 6)^2 = 6, and

SST rt =

∑^ g

i=

∑^ ni

j=

(yi· − y··)^2 =

∑^ g

i=

ni(yi· − y··)^2 = 3(2 − 4)^2 + 3(4 − 4)^2 + 3(6 − 4)^2 = 24.

Thus SST = SSE +SST rt or 30 = 6 + 24 partitions the total sum of squares about the overall mean into two parts, one within groups (due to error, or effects not accounted by the model) and one between groups (due to the treatment, measuring the difference between sample means). Since each group here has ni observations, each group contributes ni − 1 degrees of freedom for the error sum of squares, for a total of

(ni − 1) degrees of freedom for SSE. SST rt is calculating the sum of squares of g sample means about their (overall) mean, so it has g − 1 degrees of freedom. For the example data above,

(ni − 1) = g(n − 1) = 3(2) = 6, and g − 1 = 3 - 1 = 2. We can summarize this information in an analysis of variance table:

Source SS df MS F Between groups 24 2 12 12 Within groups 6 6 1 Total sum of squares 30 8

Three distinct ways to consider these calculations are: 1) Focus attention only on the partition of SST , 30 = 6 + 24, 2) To test the null hypothesis H 0 : μ 1 = μ 2 = μ 3 against the alternative hypothesis Ha : some μi’s differ, using an F test, or 3) View it as a model selection problem, comparing SST (= SSR for reduced model) to SSE (= SSR for the full model) to decide the best fitting model.