Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Community
Ask the community for help and clear up your study doubts
Discover the best universities in your country according to Docsity users
Free resources
Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors
An overview of the single factor analysis of variance (anova) model used in quantitative business analysis. The model's components, the partition of sums of squares, degrees of freedom, and the test statistic for equality of treatment means. It also covers interval estimation of treatment means and pairwise comparisons.
Typology: Exams
1 / 12
dependent variable and one or more independent variables.
or the differences or similarities in the response variable between treatments.
usually called one way ANOVA, and two factor ANOVA, usually called two way ANOVA.
mean , and e are independent error terms.
j
ij
with a mean
j
2
2
2 Y ij eij
effect and no further analysis is required.
Y ij − the i th observation in the j th treatment or factor level.
Y (^) j − the sample treatment mean, i.e ., the sample mean, for the j th treatment or factor
level.
Y − the overall sample mean or grand mean.
n (^) j − the sample size, i.e ., the number of observations, in the j th treatment.
n T − the grand sample size , i.e ., the total number of observations, in all the treatments.
k − number of treatments or factor levels.
1
nj
j i j (^) i
n (^) =
= (^) ∑ j , , 1
k
T j j
= (^) ∑ 1 1
k nj
ij T (^) j i
n (^) = =
= (^) ∑∑
same.
Partition of Sum of Squares
2
1 1
k n j
ij j i
= =
= (^) ∑∑ −
2 2
1 1 1
k n j k
j j j j i j
SSB Y Y n Y Y = = =
= (^) ∑∑ − = (^) ∑ −
2
1 1
k n j
ij j j i
= =
= (^) ∑∑ −
Yij − Y = ( Y (^) ij − Y (^) j ) + ( Y (^) j − Y )
Squaring both sides, summing over all observations and combining terms, we obtain
2 2
1 1 1 1 1 1
k n^ j k n^ j k nj
ij ij j j j i j i j i
SST SSW SSB
= = = = = =
∑∑ −^ =^ ∑∑ −^ +^ ∑∑ −
2 ).
Therefore, we have
A kitchen utensils manufacturer selected 24 similar department stores to try out four different
promotional displays for a newly designed rice cooker. The display that generates the
highest sales in the study is to be used in the manufacturer’s national promotion program.
Each display was assigned at random to six stores. Sales in units for the store during the two
week observation period are given in the following table.
Store^ Display (^ j )
( i ) (^1 2 3 )
Total
Total 162 132 114 192 600
Mean Y 1 = 27 Y 2 = 22 Y 3^ =^19 Y 4 = 32
n (^) j n 1 (^) = 6 n 2 (^) = 6 n 3 (^) = (^6) n 4 (^) = 6 nT = 24
In this example, the dependent variable is sales Y , i.e ., the number of units sold. The factor,
or the independent variable, is the display. The treatments, or the factor levels, are the
different displays. We assume every thing else is the same. Only the effects of display are
studied.
We also have k = 4 and 1 1
k n j
ij T (^) j i
n (^) = =
= (^) ∑∑ = =.
The following table lists the Yij − Y and the
2 ( Y (^) ij − Y ) values
Yi (^) 1 − Y Yi (^) 2 − Y Yi (^) 3 − Y Yi (^) 4 − Y
2 ( Yi (^) 1 − Y )
2 ( Yi (^) 2 − Y )
2 ( Yi (^) 3 − Y )
2 ( Yi (^) 4 − Y )
Sum 140 130 348 340
From the table, we obtain
6 2 1 1
( (^) i ) 140 i
=
∑ −^ = ,
6 2 2 1
( (^) i ) 130 i
=
∑ −^ = ,
6 2 3 1
( (^) i ) 348 i
=
∑ −^ = ,
6 2 4 1
( (^) i ) 340 i
=
∑ −^ =
2
1 1
k n j
ij j i
= =
= (^) ∑∑ − = + + + =.
The following table lists the Yij − Yj and the
2 ( Y (^) ij − Yj ) values
Yi (^) 1 − Y 1 Yi (^) 2 − Y 2 Yi (^) 3 − Y 3 Yi (^) 4 − Y 4
2 ( Yi (^) 1 − Y 1 )
2 ( Yi (^) 2 − Y 2 )
2 ( Y (^) i (^) 3 − Y 3 )
2 ( Y (^) i (^) 4 − Y 4 )
Sum 116 76 132 46
From this table, we obtain
6 2 1 1 1
( (^) i ) 116 i
=
∑ −^ = ,
6 2 2 2 1
( (^) i ) i
=
∑ −^ =^76 ,
6 2 3 3 1
( (^) i ) 132 i
=
∑ −^ = ,
6 2 4 4 1
( (^) i ) i
=
∑ −^ =^46
2
1 1
k n j
ij j j i
= =
= (^) ∑∑ − = + + + =.
Furthermore, we have
2 2 2 2 2
1
k
j j j
= (^) ∑ − = + − + + =.
Computational Formula
2
2 2
1 1 1 1 1 1
k n^ j k n^ j k nj
ij ij ij T j i T j i j i
SST Y Y Y n Y = = n = = = =
∑∑ ∑∑ ∑∑
2
2 2 2 2
1 1 1 1 1 1
k n^ j k n^ j k nj
ij ij ij T j j^ i T^ j i j j i
SSB Y Y Y n Y = n^ = n^ = = = n =
∑ ∑ ∑∑ ∑ ∑
2 2
1 1 1 1
k n^ j k nj
ij ij j i j j i
= = = n =
∑∑ ∑ ∑ or^ SSW^ =^ SST^ − SSB
The Y values are given in the following table
2 ij
j i 1 2 3 4
Sum 4490 2980 2298 6190
From this table, we obtain
6 2 1 1
i^4490 i
=
∑ = ,^ ∑ ,^ ∑ ,^ ∑
6 2 2 1
i^2980 i
=
6 2 3 1
i^2298 i
=
6 2 4 1
i^6190 i
=
Therefore
2
1 1
k n j
ij j i
= =
∑∑ =^ +^ +^ +^ =
From the first table, we have
2 2
1 1
k n j
ij T (^) j i
n (^) = =
∑∑
Also,
1
2 2
1 (^1 )
n
i i
n (^) =
∑ ,^
2
2 2
2 (^2 )
n
i i
n (^) =
∑ ,^
3
2 2
3 (^3 )
n
i i
n (^) =
∑ ,
4
2 2
4 4 1
n
i i
n (^) =
∑ ,
therefore,
2
1 1
k n j
ij j j i
= n =
∑ ∑.
Using the computational formulas,
2 2
1 1 1 1
k n^ j k nj
ij ij j i T j i
= = n = =
∑∑ ∑∑
2 2
1 1 1 1
k n^ j k nj
ij ij j j^ i T j i
= n^ = n = =
∑ ∑ ∑∑ =
2 2
1 1 1 1
k n^ j k nj
ij ij j i j j i
= = = n =
∑∑ ∑ ∑
or SSW = SST − SSB = 958 − 588 = 370
Partition of Degrees of Freedom
for for for
df SST df^ SSB df SSW
n − = k − + n − k
Mean Squares
k
T
n k
k
n k
ANOVA Table
Source of Variation SS df^ MS (^) F
Between SSB k − 1 MSB F^ = MSB MSW
Within SSW nT^ −^ k MSW
Total SST nT − 1
Source of Variation SS df^ MS (^) F
Between 588 3 196 F^ =^ 196 18.5^ =10.
Within 370 20 18.
Total 958 23
= has a (^) 1, k nT k F (^) − − distribution.
denominator degrees of freedom.
Test of Equality of Treatment Means
Ha : not all treatment means are equal
Do not reject H 0 if , 1, k nT k F ≤ F α − −
Reject H 0^ if^ , 1, , k nT k F > F α − −
Where, (^) , 1, is such a value that k nT k F α − − ( (^) 1, , 1, ) k nT k k nT k
between the treatment means. If is rejected, the treatment means may be all different or
only some of them are different.
For the example above, test the equality of the treatment means at a significance level
Do not reject H 0 if F ≤4.
Reject H 0 if F >4.51.
say that the treatment means are significantly different.
MINITAB Result
(unstacked).
One-way Analysis of Variance
Analysis of Variance for Sales Source DF SS MS F P Display 3 588.0 196.0 10.59 0. Error 20 370.0 18. Total 23 958. Individual 95% CIs For Mean Based on Pooled StDev Level N Mean StDev -----+---------+---------+---------+- 1 6 27.000 4.817 (----------) 2 6 22.000 3.899 (----------) 3 6 19.000 5.138 (----------) 4 6 32.000 3.033 (----------) -----+---------+---------+---------+- Pooled StDev = 4.301 18.0 24.0 30.0 36.
Interval Estimation of Treatment Means
2 Y j j
s n
j
j
Y
s
j t
− μ = has a t distribution with nT − k degrees of freedom.
j (^) 2 , nT kYj L = Y − t (^) α (^) − s
j (^) 2 , nT kYj R = Y + t (^) α (^) − s
For the example above,
j 6 Y j
s n
= = and
j 6
sY = = ,
because all n (^) j are the same, i.e. , n (^) j = 6 for all j = 1, 2,3, 4. If the n (^) j are different,
2 Y j s will
be different.
Now construct 95% confidence intervals for the treatment means. Then α = 0.05and
2 ,^ 0.025,
n T k t (^) α (^) − = t =.
For μ 1 ,
(^1 2) , (^1)
n T k Y L = Y − t (^) α (^) − s = − =
(^1 2) , (^1)
n T kY
R Y t (^) α s −
With 95% confidence we can say that μ 1 is somewhere between 23.337 and 30.663.
For μ 2 ,
(^2 2) , (^2)
n T kY L = Y − t (^) α (^) − s = − =
(^2 2) , (^2)
n T kY R = Y + t (^) α (^) − s = + =
With 95% confidence we can say that μ 2 is somewhere between 18.337 and 25.663.
(^3 2) , (^3)
n T k Y L = Y − t (^) α (^) − s = − =
(^3 2) , (^3)
n T k Y R = Y + t (^) α (^) − s = + =
For μ 4 ,
(^4 2) , (^4)
n T kY
L Y t (^) α s −
(^4 2) , (^4)
n T kY R = Y + t (^) α (^) − s = + =
With 95% confidence we can say that μ 4 is somewhere between 28.337 and 35.663.
Comparison of Two Treatment Means
pairwise comparisons of the following form
j 1 (^) j 2 μ −μ
j 1 (^) j 2
1 2 1 2 1
Y j Yj j j j
s MSW n n n n
−
j 2
1 2 1 2
1 2
j j
j j j j
Y Y
s
t
μ μ
−
= has a distribution with degrees of
freedom.
t nT − k
2 ,^ T j 1^ j 2
L = Yj − Y (^) j − t (^) α (^) n − ksY − Y
2 ,^ T j 1^ j 2
R = Y (^) j − Y (^) j + t (^) α (^) n − ksY − Y
this problem, 2 ,^ 0.05,
n T k t α (^) (^) − = t =.
1 3
2
1 3
sY (^) Y MSW n n
−
and 1 3
sY (^) − Y = =
2 3
2
2 3
sY (^) Y MSW n n
−
and 3 4
sY (^) − Y = =
2 3 0.05,20 (^2 ) L = Y − Y − t sY (^) − Y = (22 − 19) − 1.725(2.483) = −1.
2 3 0.05,20 (^2 ) R = Y − Y + t sY (^) − Y = (22 − 19) + 1.725(2.483) =7.
different at a significance level α. Otherwise, they are significantly different.