Analysis of Variance (ANOVA) for Different Scenarios | Exercises Mathematical Statistics

Math 143 – ANOVA

Analysis of Variance (ANOVA)

Recall, when we wanted to compare two population means, we used the 2-sample tprocedures .

Now let’s expand this to compare k≥3 population means. As with the t-test, we can graphically get an

idea of what is going on by looking at side-by-side boxplots. (See Example 12.3, p. 748, along with Figure

12.3, p. 749.)

1 Basic ANOVA concepts

1.1 The Setting

Generally, we are considering a quantitative response variable as it relates to one or more explanatory

variables, usually categorical. Questions which fit this setting:

(i) Which academic department in the sciences gives out the lowest average grades? (Explanatory vari-

able: department; Response variable: student GPA’s for individual courses)

(ii) Which kind of promotional campaign leads to greatest store income at Christmas time? (Explanatory

variable: promotion type; Response variable: daily store income)

(iii) How do the type of career and marital status of a person relate to the total cost in annual claims

she/he is likely to make on her health insurance. (Explanatory variables: career and marital status;

Response variable: health insurance payouts)

Each value of the explanatory variable (or value-pair, if there is more than one explanatory variable) repre-

sents a population or group. In the Physicians’ Health Study of Example 3.3, p. 238, there are two factors

(explanatory variables): aspirin (values are “taking it” or “not taking it”) and beta carotene (values again are

“taking it” or “not taking it”), and this divides the subjects into four groups corresponding to the four cells

of Figure 3.1 (p. 239). Had the response variable for this study been quantitative—like systolic blood pres-

sure level—rather than categorical, it would have been an appropriate scenario in which to apply (2-way)

ANOVA.

1.2 Hypotheses of ANOVA

These are always the same.

H0: The (population) means of all groups under consideration are equal.

Ha: The (pop.) means are not all equal. (Note: This is different than saying “they are all unequal ”!)

1.3 Basic Idea of ANOVA

Analysis of variance is a perfectly descriptive name of what is actually done to analyze sample data ac-

quired to answer problems such as those described in Section 1.1. Take a look at Figures 12.2(a) and 12.2(b)

(p. 746) in your text. Side-by-side boxplots like these in both figures reveal differences between samples

taken from three populations. However, variations like those depicted in 12.2(a) are much less convincing

that the population means for the three populations are different than if the variations are as in 12.2(b). The

reason is because the ratio of variation between groups to variation within groups is much

smaller for 12.2(a) than it is for 12.2(b).

Analysis of Variance (ANOVA) for Different Scenarios, Exercises of Mathematical Statistics

Related documents

Partial preview of the text

Download Analysis of Variance (ANOVA) for Different Scenarios and more Exercises Mathematical Statistics in PDF only on Docsity!

Recall, when we wanted to compare two population means, we used the 2-sample t procedures.

1 Basic ANOVA concepts

1.1 The Setting

1.2 Hypotheses of ANOVA

Ha : The (pop.) means are not all equal. (Note: This is different than saying “they are all unequal ”!)

1.3 Basic Idea of ANOVA

reason is because the ratio of variation between groups to variation within groups is much

1.4 Assumptions of ANOVA

Fortunately, ANOVA is somewhat robust (i.e., results remain fairly trustworthy despite mild violations of

2 One-Way ANOVA

2.1 Notation

n ∑ i j

2.2 Splitting the Total Variability into Parts

MSE =

P ( F 3,6 > 9.78) = 0.

P ( F 2,20 > 5 ) = between 0.01 and 0.

The X-ed out P -value is between 0.001 and 0..

Brand 5 55961.5 11192.3 2.

Error 54 254539.26 4,713.

Total 59 310,500.

2.4 Multiple Comparisons

Comparisons of Two-way to One-Factor-at-a-Time

Two-way ANOVA table

1. the variation explained by factor A

2. the variation explained by factor B

3. the variation explained by the interaction of A and B

4. the variation explained by randomness

The Two-way ANOVA model

and the ratio of the largest group standard deviation to the smallest group standard deviation is at most 2.

Main Effects

Interaction Effects