






Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
A portion of lecture notes from a statistics 431 course focusing on paired samples and two-sample inference. The basics of paired samples, the difference between means for paired data, and the benefits of using paired data over unpaired data. It also introduces the concept of inference about two population proportions and provides large-sample tests and confidence intervals.
Typology: Study notes
1 / 11
This page cannot be seen from the preview
Don't miss anything!







there’s no correspondence between observations in the first sample and observations in the second. (The number of Xi ’s is not even the same as the number of Yi ’s, in general.)
one Yi , to form the n pairs (Xi , Yi ).
data, we should focus on comparing the pairs (Xi , Yi ) to each other.
Time by Patient
Drowsiness
+1h +2h
89
1011
12 l l
P
+1h +2h
l l
P
+1h +2h
l l
P
+1h +2h
l l
P
+1h +2h
l l
P
+1h +2h
l l
P
+1h +2h
l l
P
+1h +2h
l l
P
+1h +2h
l l
P
l l
P
l l
P
l l
P
l l
P l l
P l l
P l l
P
l l
P
89
1011
12 l l
P
89
1011
12 l l
P19 (^) l P20l l l
P
l l
P
l l
P l l
P
l l
P
l l
P l l
P
l l
P l l
P
l l
P l l
P
l l
P
l l
P
l l
P
l l
P
89
1011
12 l l
P
89
1011
12
l l
P
l l
P
l l
P
l l
P l l
P
l l
P
l l
P
l l
P
l l
P
Xi ’s are independent of the Yi ’s as we did with unpaired data.
in the sample variance, we get
n
for large samples. For small samples, T ∼ tn− 1 , once we add the assumption that each Di is normal.
HA : μD > 1 0 ⇒ reject when T > cα HA : μD < 1 0 ⇒ reject when T < cα.
they are uncorrelated.
paired analysis, the positive correlation is reflected in S^2 D, which usually turns out smaller than S^2 X + S Y^2.
increased testing power via a larger T statistic, and narrower CIs.
do a paired analysis, but you would have been better off without pairing!
population, i.e. the fraction of the population possessing some characteristic.
and in “B” it is p 2.
of successes in the sample. Similarly, we make n independent draws from population “B”, and call Y the number of successes.
drawn independently, so X is independent of Y.
conducted to study Salk’s polio vaccine. Randomizing a child to the control (placebo) group is like drawing from a population having probability p 1 of “success” (contracting polio). Randomizing a child to the treatment (vaccination) group is like drawing from a population with probability p 2 of contracting polio.
then X + Y ∼ Bin(m + n, p).
p ˆ = (X + Y )/(m + n).
pˆ 1 − ˆp 2 √ pˆ( 1 − ˆp)( 1 /m + 1 /n)
and the procedures HA : p 1 − p 2 6 = 0 ⇒ reject when |Z | > zα/ 2 HA : p 1 − p 2 > 0 ⇒ reject when Z > zα HA : p 1 − p 2 < 0 ⇒ reject when Z < −zα.
denominator,
pˆ 1 − ˆp 2 − ( p 1 − p 2 ) √ pˆ 1 ( 1 − ˆp 1 )/m + ˆp 2 ( 1 − ˆp 2 )/n
for p 1 − p 2 , using Z. Rearrange it to obtain the confidence interval
p ˆ 1 − ˆp 2 ± zα/ 2
pˆ 1 ( 1 − ˆp 1 ) m
pˆ 2 ( 1 − ˆp 2 ) n
p ˜ 1 = (X + 1 )/(m + 2 ) and pˆ 2 with the analogous p˜ 2. This can improve the quality of the normal approximation, hence the correctness of the CI.