





Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Variance, Source of variance, analysis of variance, sum of square analysis Dunnett test, Tukey test, confidence interval of means, ANOVA table, T values, F test
Typology: Study notes
1 / 9
This page cannot be seen from the preview
Don't miss anything!






total
2
1
2
2
2
k
2
Since there is error in any measurement, it’s not
surprising that the means are different.
We want to know if the difference is due to variance in
the method or real sample differences.
Simple two level model
between = variance of sample material
within = variance of analytical method
total
between
within
Level 1
Level 2
Level 1 gives us an idea as to sample variability
Level 2 tells use about the method variability
1
2
3
4
1
1
1
2
2
2
3
3
3
4
4
4
T
2
T
2
x T
= Grand Mean
mean of all the points
s
T
2
MS between =
df (^) s
ss between
Next, calculate the between sample variance
Then the mean square for the samples
x s
= mean of each sample
df (^) s =# samples - 1
nr =# replicates per sample
Since you know SST and SSbetween, you can find the
within sample variance by:
The mean sum of squares for our replicates is then:
1 15.9 16.1 16.
2 14.9 15.2 15.
3 14.8 15.8 15.
4 16.2 16.0 15.
Source df SS MS
Total
T
Sample
between
Replicate
within
OK. We’ve done several
calculations. Now what?
We can now use the F test to
determine if there is a significant
difference between the two
sources of variance.
F is then compared to F c to see if
the difference is significant. This
will be covered in a bit.
small
2
big
2
Using XLStat
Note: XLstat does not report Fc values - just the P value
Data must be ordered in a single column.
Two methods are available for calculating sum of
squares for your groups - Type I and III. These are only
useful for more complex multivariable ANOVA
Might as well review them at this point.
Sum of squares analysis.
Type I (Sequential)
The Sums of Squares obtained by fitting effects in the order
specified in the model. Type I SS for each effect will change
if the order of the effects in the model is changed.
Type III (Marginal)
The Sums of Squares obtained by fitting each effect after all
the other terms in the model. The Type III SS do not depend
upon the order in which effects are specified in the model.
Sum of squares analysis.
Type I SS - Useful to explore unbalanced experimental data - where
some effects are measured more than others. Can also show flaws in
an experimental design (next chapter)
Type III Sums of Squares are preferable in most cases since they
correspond to the variation attributable to an effect after correcting for
any other effects in the model. They are unaffected by the frequency
of observations.
With a balanced experiment (all combinations measured with equal
frequency), Type I and III give the same results.
Analysis of variance:Analysis of variance:Analysis of variance:Analysis of variance:Analysis of variance:Analysis of variance:
Source DF
Sum of
squares
Mean
squares
F Pr > F
Model 11 438.943 39.904 40.264 < 0.
Error 36 35.678 0.
Corrected
Total
Fcrit =
There is a difference. Can we tell what it is?
In this example, there is <0.01% chance of there NOT
being a difference.
XLStat results.
Lead / Standardized
coefficients
Chemist-A1 Chemist-A
Chemist-A
Chemist-A4 Chemist-A
Chemist-A
Chemist-A
Chemist-A
Chemist-A
Chemist-A
Chemist-A
Chemist-A
-0.
0
1
Variable
Standardized coefficients
This plot shows
how each chemist
performed.
While results have
been normalized,
you’d get the
same basic plot
with the raw data.
Using XLStat
The Dunnett test is used to
compare samples (your chemists)
to a control.
There actually is no control but
the test provides a useful way of
comparing results.
In this case, choose Chemist A
because his/her results were the
lowest, causing the results to be
positive for the others.
Dunnett test
Compares group means.
Each is pitted against one control or
reference group.
Calculate a t test values for each group
comparison.
Test typically can only be used when all
groups are of equal size.
Dunnett test
Category Difference
Standardize
d difference
Critical
value
Critical
difference
Pr > Diff Significant
A1 vs A12 -10.798 -15.339 2.890 2.034 0.000 Yes
A1 vs A9 -8.078 -11.475 2.890 2.034 0.000 Yes
A1 vs A5 -6.345 -9.014 2.890 2.034 0.000 Yes
A1 vs A4 -6.328 -8.989 2.890 2.034 0.000 Yes
A1 vs A7 -5.838 -8.293 2.890 2.034 0.000 Yes
A1 vs A10 -5.227 -7.426 2.890 2.034 0.000 Yes
A1 vs A8 -4.903 -6.964 2.890 2.034 0.000 Yes
A1 vs A6 -4.475 -6.357 2.890 2.034 0.000 Yes
A1 vs A3 -2.575 -3.658 2.890 2.034 0.007 Yes
A1 vs A11 -2.165 -3.076 2.890 2.034 0.032 Yes
A1 vs A2 -0.105 -0.149 2.890 2.034 1.000 No
Tukey Test
“Honestly Significantly Different (HSD) test.
Based on pairwise comparison among means.
Mi - Mj = difference between pair means
MSE = mean square error
nh = the harmonized mean
Harmonized mean is the weighted
arithmetic mean, with each value's weight
being the reciprocal of the value.
Harmonized
mean
i = 1
n /
Tukey test
Contrast Difference
Standardized difference
Critical value Pr > Diff Significant
A12 vs A1 10.798 15.339 3.490 < 0.0001 Yes
A12 vs A2 10.693 15.190 3.490 < 0.0001 Yes
A12 vs A11 8.633 12.263 3.490 < 0.0001 Yes
A12 vs A3 8.223 11.681 3.490 < 0.0001 Yes
A12 vs A6 6.323 8.982 3.490 < 0.0001 Yes
A12 vs A8 5.895 8.374 3.490 < 0.0001 Yes
A12 vs A10 5.570 7.913 3.490 < 0.0001 Yes
A12 vs A7 4.960 7.046 3.490 < 0.0001 Yes
Compares each
chemist’s results
to see if there is a
significant
difference.
Tukey test
Provides grouping
of chemists with
statistically similar
results (95%
confidence.)
Chemist Means GroupsGroupsGroupsGroupsGroupsGroups
A12 45.170 A
A9 42.450 B
A5 40.718 B C
A4 40.700 B C
A7 40.210 B C
A10 39.600 C
A8 39.275 C D
A6 38.848 C D E
A3 36.948 D E
A11 36.538 E F
A2 34.478 F
A1 34.373 F
t values
Confidence level
Degrees of 90% 95% 99%
freedom t . t . t .
Example
Example
2.13 x 0.
5
1/
2.78 x 0.
5
1/
t test example
Beyond the mean
You can have samples that are considered significantly
different and still have the same mean.
In both examples, the populations would be considered to
be different - even though the means, medians and modes
are identical in example on the right.
The F test
The F test
F =
S
2 larger
S
2 smaller
A - mean = 50 mg/l, s = 2.0 mg/l, n = 5, df = 4
B - mean = 45 mg/l, s = 1.5 mg/l, n = 6, df = 5
c is 5.19 at 95% confidence
The variance values are essentially the same so the
means must really differ.
Comparison of the methods
Comparison of the methods
Here, the means are identical but
the distributions look different.
However, the lower curve is for a
much smaller data set.
The F test would show them to be
the same.
It accounts for the variations in
sample size - using df.