Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Community
Ask the community for help and clear up your study doubts
Discover the best universities in your country according to Docsity users
Free resources
Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors
Answers to practice questions for a statistics final exam focusing on two-sample hypothesis testing and confidence intervals. It includes calculations for various statistical tests such as t-tests, z-tests, and chi-square tests, as well as discussions on concepts like standard errors, degrees of freedom, and confidence intervals.
Typology: Exams
1 / 4
Stat 20 Fall 06 A. Adhikari
The box has 155 tickets. Each ticket has two parts: the left side shows the 1/0 that will be the result if the patient gets assigned to the treatment group, and the right side shows the 1/0 that will be the result if the patient gets assigned to the control group. We see a simple random sample of 78 of the left sides, and the remaining 77 right sides. The null hypothesis states that the proportion of 1’s on the left side of all 155 tickets is equal to the proportion of 1’s on the right side of all 155 tickets (that is, the treatment has no effect).
z = 3.46. This is the usual “two sample” calculation in the context of the randomized experiment. The SE for difference = 7.21%
(ii)
( 100 30
)
7.97%. Expect 30 blue tickets, SE 4.58, z = 0.11.
(i). The chance of hitting the theoretical probability exactly will decrease.
(ii) At least 79. This is the 80th percentile of the midterm distribution.
85.8.
78.82%. The rms error is 8.43.
(ii) There’s a perfectly good random sample but the two sets of responses are dependent because they are obtained from the same people. There is no information about the nature of the dependence.
The box has 15,000 tickets, one for each patient. Each ticket shows blood pressure. The average and SD of the box are unknown. The null hypothesis says that the average of the box is equal to
The average of the box is less than 120.
t (degrees of freedom = 5) = − 1 .2.
The data support the investigators’ belief. (Or, the data do not support the conclusion that the population average is lower than 120.)
94.21%. Binomial n = 5, p = 0.2, k = 0, 1 , 2. Add the three terms.
8.075%. I expect to lose $0 give or take $35.36.
(ii) is approximately 25%. This is an “Are you awake?” question. With hundreds of throws, the distribution of the number of hits is roughly normal and centered at 20%. Therefore on a single day the chance of “more than 20% hits” is about 50%. The throws are independent, so the answer is 0. 5 × 0 .5 = 0.25.
(ii) equal to 15.5%. It’s the center of the interval; recall the method of construction. This is another “Are you awake?” question.
(ii) goes from 12.25% to 18.75%. The z is 2.6. The SE for the percent is 1.25%, because the
distance between the center and each end of the 95%-CI must be 2 times the SE for the percent. [See if you can find the sample size (at least to a pretty good approximation) and the number of senior citizens in the sample. Those numbers are not necessary for the problems here, but it’s instructive to find them.]
400 × 0. 6 × 0 .4 = 9.8. Convert 179.5 and 220.5 to standard units to get z = − 6 .17 and − 1 .99 respectively. The area in that region is essentially equal to the area to the left of −2, which is 2.275%. We have shown that 2.275% of the students will conclude that the coin is fair, which is the wrong conclusion for this coin. But the remaining 100% − 2 .275% = 97.725% of the students will conclude, correctly, that the coin is not fair.
If you used 95% instead of 95.45% as the area in the range ±2, that’s OK. You should still get 293 as your answer.
for the percent in both cases; they’re almost equal, even if you use the correction factor. Those are estimates based on the sample percents so the exact SEs will be slightly different, and we’ll never know what they are. But for them to differ by a factor of 2, something hugely unlikely has to have happened: namely that two very similar random samples have come out of two hugely different populations. Don’t bet on it. By the way, the square root law works on sample sizes, not population sizes.
The regression effect tells you, without any calculation, that the answer must be bigger than 50%. Here are the calculations.
The z corresponding to the 40th percentile of the standard normal curve is − 0 .25. So the 40th percentile of final scores is − 0. 25 × 12 + 70 = 67. The given midterm score is z = − 0 .25 in standard units and the corresponding regression estimate of the final score is 68.2. The r.m.s. error is 9.6. Use the normal curve to find the percent over 67: now z is − 0 .125, and 0.125 is halfway in between 0.1 and 0.15 on your table. Hence the range of answers; not surprisingly, the actual answer is 55%.
Contrast this with our usual statement that “successive rolls of a die are independent”. They are, if you know which die you’re rolling. But if the die itself is unknown because it was picked randomly, then information about the first few rolls can provide information about the die, as above, which in turn can affect the probabilities of events in future rolls.
You can’t use the “SE for the difference” formula because the proportion “for” and the proportion “against”are seriously dependent. After all, if you know one then you can find the other; the correlation is −1. You have to look more carefully at what is being estimated.
Let p be the population proportion of voters for the proposition. Then the margin of victory is p − (1 − p) = 2p − 1. So its estimate is 2ˆp − 1 where ˆp is the sample proportion of “for” voters. In our sample the observed value of ˆp is 0.53 and the estimate is (2 × .53) − 1 = .06 as stated in the problem. Now think of properties of standard deviation: the −1 will not affect the SE, but the factor of 2 will. So the SE of our estimate is 2 times the SE of the sample proportion “for”, that is, 2 ×
√
. 53 ×. 47 /400 = 0.05.
You can see this in another way, as follows. Construct the 68%-confidence interval for the proportion “for”. That’s 0. 53 ±
√
. 53 ×. 47 /400 which is (50.5%, 55 .5%). The margins of error corresponding to the two endpoints are respectively 1% and 11%. So (1%, 11%) is a 68%-confidence interval for the margin of error. The SE for the margin of error must be half the width of the interval, which is 5%.