Docsity
Docsity

Prepare for your exams
Prepare for your exams

Study with the several resources on Docsity


Earn points to download
Earn points to download

Earn points by helping other students or get them with a premium plan


Guidelines and tips
Guidelines and tips

STAT303 Exam #3, Fall 2008: Hypothesis Testing and Confidence Intervals - Prof. J. Carroll, Exams of Data Analysis & Statistical Methods

The questions and answers for exam #3 of the stat303 course, which covers hypothesis testing and confidence intervals. Students are required to apply various statistical tests to determine if there is a significant difference between population means or proportions based on given data.

Typology: Exams

Pre 2010

Uploaded on 02/13/2009

koofers-user-ue8
koofers-user-ue8 🇺🇸

5

(1)

10 documents

1 / 4

Toggle sidebar

Related documents


Partial preview of the text

Download STAT303 Exam #3, Fall 2008: Hypothesis Testing and Confidence Intervals - Prof. J. Carroll and more Exams Data Analysis & Statistical Methods in PDF only on Docsity!

STAT303 Sec 508-

Fall 2008

Exam

Form A

Instructor: Julie Hagen Carroll

Name:

  1. Don’t even open this until you are told to do so.
  2. There are 20 multiple-choice questions on this exam, each worth 5 points. There is partial credit. Please mark your answers clearly. Multiple marks will be counted wrong.
  3. You will have 60 minutes to finish this exam.
  4. If you have questions, please write out what you are thinking on the back of the page so that we can discuss it after I return it to you.
  5. If you are caught cheating or helping someone to cheat on this exam, you both will receive a grade of zero on the exam. You must work alone.
  6. When you are finished please make sure you have marked your CORRECT section (Tuesday 12:45 is 508, 2:20 is 509, and 3:55 is 510) and FORM and 20 answers, then turn in JUST your scantron.
  7. Good luck!
  1. Suppose I wanted to know if there were more purple Skittles ©R^ than green ones. One sample of fifty had 13 purples and the other had only 8 greens out of 50. My test statistic was 1.23. What is my p-value and correct conclusion?

A. Since it’s obvious that there are more purple than green, I can make that conclusion without running a hypothesis test. B. My p-value is 0.2186, so I would be unable to con- clude there are more purple than green. C. My p-value is 0.1093, so I would be unable to con- clude there are more purple than green. D. My p-value is 0.8907, so I would be unable to con- clude there are more purple than green. E. My p-value is 0.8907, so there are 89.07% more purples than greens.

  1. Which of the following is an invalid z-test?

A. H 0 : μ = 50 vs. HA : μ < 50 with a non-normal sample of size 50 B. H 0 : π = 0.5 vs. HA : π < 0 .5 with a sample of size 50 C. H 0 : π = 0.85 vs. HA : π < 0 .85 with a sample of size 50 D. H 0 : π 1 = π 2 vs. HA : π 1 < π 2 with independent samples of size 50 E. All of the above are valid z-tests.

  1. When can we use the paired t-test?

A. when we want the most powerful 2-sample test B. when we have dependent samples C. when we have similar variances and equal samples sizes D. when we have equal sample sizes (we don’t need the variances to be the same) E. Two of the above are correct.

  1. The hypotheses H 0 : μ = 350 vs HA : μ < 350 are examined using a sample of size n = 20 with mean = 345 and standard deviation = 13. What is the p-value of this test if we assume that the data is normal?

A. 0. B. 0. C. 0. 10 > p-value > 0. 05 D. 0. 20 > p-value > 0. 10 E. We don’t have a t-table for negative numbers, so we can’t say.

  1. After once again losing a football game to the college’s archrival, the alumni association conducted a survey to see if alumni were in favor of firing the coach. A simple random sample of 100 alumni from the population of all living alumni was taken. Sixty-five of the alumni in the sample were in favor of firing the coach. Suppose the alumni association wished to see if the majority of alumni are in favor of firing the coach. To do this they test the hypotheses H 0 : π = 0.50 vs HA : π > 0 .50. What is the test statistic and p-value for this hypothesis test?

A. 3 and 0. B. 3 and 0. C. 3.14 and 0. D. 3.14 and 0. E. 3.14 and 0.

  1. Researchers have selected a simple random sample of size n = 12 from a population for the purpose of test- ing the null hypothesis H 0 : μ = μ 0 against the alter- native hypothesis HA : μ > μ 0. Studies indicate that a substantial degree of skewness can be expected in the population from which the sample was obtained. What might the researchers consider doing in this situation?

A. There really isn’t anything they can do; they sim- ply have to proceed with what the data give them. B. They might proceed as usual but alter the p-value later to compensate for the skewness in their data. C. They might use a non-parametric (NP) test pro- cedure. D. They might try a transform to the data so that it’s not skewed. E. Each of C and D above are possible approaches they might consider using.

  1. A sociologist is studying the effect of having children within the first two years of marriage on the divorce rate. Using hospital birth records, she selects a simple random sample of 200 couples who had children within the first two years of marriage. She checks to see how many couples are divorced within five years. What type of hypothesis test should she run?

A. a 1-sample test of means to see if the average di- vorce rate is different from the national average B. a 2-sample test of means to see if the average di- vorce rate is different for childless couples C. a paired t-test comparing husbands and wives D. a 1-sample test of proportions comparing her pro- portion of divorces with the national percent after five years of marriage E. a 2-sample test of proportions comparing the pro- portions of divorced couples with the proportion of those still married

  1. Suppose we were not sure if our sample of 30 is normal. In which of the following circumstances would we NOT be safe using a t procedure?

A. The mean and median of the data are not exactly equal. B. A histogram of the data is moderately skewed. C. The sample standard deviation is large. D. A boxplot of the data shows the presence of a large outlier. E. Thirty is always large enough for the sample mean to be approximately normal.

  1. Using the three confidence intervals below, what is the correct range of the p-value when testing H 0 : π 1 = π 2 vs. HA : π 1 6 = π 2?

90% (-0.222, 0.042) 95% (-0.247, 0.067) 99% (-0.230, 0.117)

A. p-value > 0. 10 B. 0. 10 > p-value > 0. 05 C. 0. 05 > p-value > 0. 01 D. p-value< 0. 01 E. You need a hypothesized value to determine the p-value

  1. Which of the following are always necessary assump- tions for any of the 2-sample tests?

A. The two samples must always have at least 30 ob- servations unless the data is normal. B. The two samples must always be independent whether testing means or proportions. C. The two samples must always have similar vari- ances when testing means. D. All of the above are necessary. E. None of the above are always necessary.

  1. Suppose we do not believe that students tend to improve their SAT-M score the second time they take the test. Based on a 90% confidence interval, (− 22. 56 , 72 .56), we wish to test H 0 : μ = 0 vs HA : μ 6 = 0 at the 5% significance level. Which of the follow- ing statements is true?

A. We cannot make a decision since the confidence level we used to calculate the confidence interval is 90%, and we would need a 95% confidence interval. B. We reject H 0 at the 10% level, since the value 0 falls in the 90% confidence interval, but we fail to reject at the 5% level. C. We reject H 0 at the 5 and 10% levels, since the value 0 falls in the 90% confidence interval. D. We fail to reject H 0 at the 5 and 10% levels, since the value 0 falls in the 90% confidence interval and would therefore also fall in the 95% confidence interval. E. We reject H 0 at the 5% level, but we fail to reject at the 10% level.

  1. A simple random sample of 60 households in the city of Greenville is taken. In the sample, there are 45 house- holds that decorate their houses with lights for the holi- days. A simple random sample of 50 households is also taken from the neighboring town of Brownsboro. In the sample, there are 40 households that decorate their houses. We want to know if there is a difference in pro- portions of households that decorate their houses with lights for the holidays. Given a 95% confidence interval, (− 0. 206 , 0 .106), what conclusions can be made?

A. At the 5 and 10% levels we can conclude there is a difference in the true proportions of decorated houses for the two cities. B. At the 1% we can conclude that the true propor- tions of decorated houses for the two cities is the same. C. Brownsboro has a higher percentage of decorated houses than Greenville. D. All of the above are true. E. None of the above are true.

  1. What are be an example of a Type II error in the prob- lem above?

A. concluding that Brownsboro has a higher percent- age of decorated houses than Greenville when ac- tually they are the same B. failing to conclude that Brownsboro has a higher percentage of decorated houses than Greenville when it does C. failing to prove there is a difference in the percent of decorated houses in the two cities when one ex- ists D. failing to prove the percent of decorated houses in the two cities is the same when they are the same E. failing to prove the percent of decorated houses in the two cities is the same when they are the different

  1. A study was to be undertaken to determine if a partic- ular training program would improve physical fitness. An SRS of 31 university students was selected to be enrolled in the fitness program. One important mea- sure of fitness is maximum oxygen uptake. Measure- ments of oxygen uptake in untrained individuals are known to follow a normal distribution with a mean of 45 ml/kg/min. The researchers wished to determine if there was evidence that their sample of students dif- fered from the general population of untrained subjects. The measurements made on the subjects coming into this study produced a mean of 47.4 with a standard deviation of 5.3. If the 98% confidence interval were determined to be (45. 2 , 49 .6), an interval which may or not be correct, what would the researchers conclude?

A. The sample is not statistically significant at the 0.02 level. We cannot say the true mean is not 45. B. The sample mean is significantly different from μ = 45 at the 0.04 level since this is a two-sided test. C. There would be no reason to believe the selected students come from a different population. D. This suggests, at the 0.02 level, that there is ev- idence to conclude that the sample comes from a population with a mean different from μ = 45. E. This suggests, at the 0.01 level, that there is ev- idence to conclude that the sample comes from a population with a mean different from μ = 45.

  1. Referring to the previous question, if we had used a 95% confidence interval instead of a 98%, what would happen to the confidence interval and the resulting p- value?

A. The confidence interval would be longer and the p-value would decrease. B. The confidence interval would be shorter and the p-value would increase. C. The confidence interval would not change and the p-value also would not change. D. The confidence interval would be shorter and the p-value would decrease. E. The confidence interval would be shorter but the p-value would not change.

  1. When is increasing the sample size NOT a good idea?

A. when it leads to non-practical significance B. when its greater than the sample size necessary for a given margin of error C. when it’s cost prohibitive D. All of the above are correct. E. more is always better

  1. Assume that sample data, based on two independent samples of size 25, give us x 1 = 514.5, x 2 = 505, s 1 = 23, and s 2 = 28.2. If we wanted to test if the true means are equal or not, what would be the appropriate range of the p-value if the test statistic value is 1.305?

A. 0. 20 > p-value > 0. 10 B. 0. 10 > p-value > 0. 05 C. 0. 15 > p-value > 0. 10 D. 0. 30 > p-value > 0. 20 E. 0. 20 > p-value > 0. 15

  1. Which of the following is false for the data above? Note: all possible choices for the p-value are greater than 5%.

A. At the 5% significance level, we would fail to prove that there is a difference between the two popula- tion means, μ 2 and μ 1. B. The margin of error for the difference between the two sample means would be smaller if we were to take larger samples. C. If a 99% confidence interval were calculated in- stead of the 95% interval, it would include more values for the difference between the two popula- tion means. D. The difference in the true means would be in the 95% confidence interval. E. All of the above are true statements about the data.

  1. What would be a Type I error for the previous test?

A. concluding that the true means were the same when they were actually different B. concluding that the true means were different when they were actually the same C. failing to conclude that the true means were the same when they were actually different D. failing to conclude that the true means were dif- ferent when they were actually the same E. failing to conclude that the true means were dif- ferent when they were actually were different

  1. A public health worker suspects that pipes in one of the older A&M dorms is causing students living there to have excess lead in their blood, i.e., over 100ppb (parts per billion). What set of hypotheses should they test to insure the health of the students in the dorm?

A. H 0 : μ ≤ 100 vs. HA : μ > 100 B. H 0 : μ ≥ 100 vs. HA : μ < 100 C. H 0 : μ = 100 vs. HA : μ 6 = 100 D. H 0 : μ < 100 vs. HA : μ ≥ 100 E. H 0 : μ > 100 vs. HA : μ ≤ 100

1C,2C,3B,4C,5A,6E,7D,8D,9A,10E,11D 12E,13C,14D,15E,16D,17A,18D,19B,20B