Chapter 6 Comparing Two Proportions | Slides Law

6.1 Introduction 113

Chapter 6

Comparing Two Proportions

6.1 Introduction

In this chapter we consider inferential methods for comparing two population propor-

tions p1and p2. More specifically, we consider methods for making inferences about the

difference p1−p2between two population proportions p1and p2. The inferential methods

for a single proportion pdiscussed in Chapter 5 are based on a large sample size normal

approximation to the sampling distribution of ˆp. The inferential methods we will discuss

in this chapter are based on an analogous large sample size normal approximation to the

sampling distribution of ˆp1−ˆp2. Sections 6.2 and 6.3 deal with inferential methods appro-

priate when the data consist of independent random samples. The modifications needed

for dependent (paired) samples are discussed in Section 6.4.

6.2 Estimation for two proportions (independent samples)

In some applications there are two actual physical dichotomous populations so that

p1denotes the population success proportion for population one and p2denotes the pop-

ulation success proportion for population two. In other applications, such as randomized

comparative experiments p1and p2denote hypothetical population success probabilities

corresponding to two treatments. We will assume that the data correspond to two inde-

pendent sequences of Bernoulli trials: a sequence of n1Bernoulli trials with population

success probability p1and an independent sequence of n2Bernoulli trials with population

success probability p2. The assumption that these are independent sequences of Bernoulli

trials means that the outcomes of all n1+n2trials are independent. When sampling from

physical populations these assumptions are equivalent to assuming that the data consist

of two independent simple random samples (of sizes n1and n2) selected with replacement

from dichotomous populations with population success proportions p1and p2. In this con-

text the assumption of independence basically means that the method used to select the

random sample from the first population is not influenced by the method used to select

the random sample from the second population, and vice versa.

The observed success proportions ˆp1and ˆp2are the obvious estimates of the two pop-

ulation success proportions p1and p2; and the difference ˆp1−ˆp2between these observed

success proportions is the obvious estimate of difference p1−p2between the two population

success proportions. The behavior of ˆp1−ˆp2as an estimator of p1−p2can be determined

from its sampling distribution. As you might expect, since ˆp1and ˆp2are unbiased es-

timators of p1and p2, ˆp1−ˆp2is an unbiased estimator of p1−p2. Thus the sampling

Chapter 6 Comparing Two Proportions, Slides of Law