Statistical Hypothesis Testing and One-Way ANOVA, Exams of Data Analysis & Statistical Methods

Various questions and scenarios related to statistical hypothesis testing, including t-tests, z-tests, ANOVA, and chi-square tests. It also covers the interpretation of p-values, residual plots, and regression inference. One specific example focuses on comparing the strain readings of four machines.

Typology: Exams

2019/2020

Uploaded on 11/25/2020

koofers-user-1rn
koofers-user-1rn 🇺🇸

9 documents

1 / 5

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
STAT303: Sections 508 510
Spring 2004
Exam #3
Form A
1. Don’t EVEN open this until you are told to do so.
2. Be sure to mark your section number and your test form (A, B or C) on the scantron and sign your
name where indicatred on your scantron.
3. There are 20 multiple-choice questions, each worth 5 points. There is partial credit you should
only include the answers you are sure of to maximize your points. Be sure to mark your scantron
CLEARLY. Multiple answers will be wrong!
4. You will have 60 MINUTES to complete this exam.
5. This exam is worth 100 points and constitutes 20% of your course grade.
6. If you are caught cheating or helping someone else cheat, you will both be given a ZERO for this
exam. This includes discussing this exam with another student before next Thursday, April 29th!!!
7. GOOD LUCK! If you have any questions or comments, please include them on this exam. Turn in
your scantron, help sheets and this exam when you are finished.
8. You may have the Review guide, flowchart and 3 another sheets of notes.
pf3
pf4
pf5

Partial preview of the text

Download Statistical Hypothesis Testing and One-Way ANOVA and more Exams Data Analysis & Statistical Methods in PDF only on Docsity!

STAT303: Sections 508 − 510

Spring 2004

Exam

Form A

1. Don’t EVEN open this until you are told to do so.

2. Be sure to mark your section number and your test form (A, B or C) on the scantron and sign your

name where indicatred on your scantron.

3. There are 20 multiple-choice questions, each worth 5 points. There is partial credit − you should

only include the answers you are sure of to maximize your points. Be sure to mark your scantron

CLEARLY. Multiple answers will be wrong!

4. You will have 60 MINUTES to complete this exam.

5. This exam is worth 100 points and constitutes 20% of your course grade.

6. If you are caught cheating or helping someone else cheat, you will both be given a ZERO for this

exam. This includes discussing this exam with another student before next Thursday, April 29th!!!

7. GOOD LUCK! If you have any questions or comments, please include them on this exam. Turn in

your scantron, help sheets and this exam when you are finished.

8. You may have the Review guide, flowchart and 3 another sheets of notes.

Coe fficie ntsa 1928.206 888.680 2.170. 1.909 .047 40.28. (Constant) Beginning Salary B Std. Error Unstandardized Coefficients t Sig a. Dependent Variable: Current Salary

  1. Are avid readers more likely to wear glasses than those who read less frequently? Three hundred men in the Korean army were selected at random and characterized as to whether they wore glasses and whether the amount of reading they did was above average, average, or below average. What type of test should we run to answer the question? A. a χ^2 test for multiple proportions B. an ANOVA F test for multiple means C. a 2-sample t test for the mean of those wearing glasses vs. the mean of those who don't D. a 2-sample z test for the proportion of those wearing glasses vs. the proportion of those who don't E. A non-parametric test since we don't know if the data is normal or not Or
  2. In the computer science department of a large university, many students change their major after the first year. A detailed study of the 256 students enrolled as first-year computer science majors in one year was undertaken to help understand this phenomenon. Students were classified on the basis of their status at the beginning of their second year, and several variables measured at the same time of their entrance to the university were obtained. If they decided that the students’ high school math grades were the best variable to use, what type of test should they use? Assume they divided the students into three second- year majors: computer science, other engineering or science, other, and all the necessary assumptions are met. A. A χ^2 test of independence if they used A, B, C, D or F as the grade categories. B. A One-way ANOVA using numeric grades. C. A paired t test since it has the most power. D. Any of the above would be ok. E. Only two of the above would work.
  3. What is the best conclusion given the output below? A. Since the p -value is 0.031, we can conclude at the 5% level that Beginning Salary is a good predictor for Current Salary. B. Since the p -value is 0.031, we can conclude at the 5% level that Beginning Salary is NOT a good predictor for Current Salary. C. Since the p -value is 0.000, we can conclude at the 5% level that Beginning Salary is a good predictor for Current Salary. D. Since the p -value is 0.000, we can conclude at the 5% level that Beginning Salary is NOT a good predictor for Current Salary. E. Since the p -value is 0.031, we can conclude at the 1% level that Beginning Salary is a good predictor for Current Salary.
  4. Suppose I need to know whether the true test score is under 70, so I want to test H 0 : μ = 70 vs. HA: μ <
  5. If I sample the population 20 times and reject (conclude the true mean is under 70) twice (2 out of 20 times), what does this tell me? A. The true mean really is under 70 since I rejected twice. B. The true mean is probably not under 70. The 2 out 20 rejections, 10%, is just my sample estimate of α, the chance of making a Type I error. C. The true mean is probably not under 70. The 2 out 20 rejections, 10%, is just my sample estimate of β, the chance of making a Type II error. D. The true mean is under 70. The 2 out 20 rejections, 10%, is just my sample estimate of α, the chance of making a Type I error. E. The true mean is under 70. The 2 out 20 rejections, 10%, is just my sample estimate of β, the chance of making a Type II error.
  6. Which of the following is NOT an assumption of One-way ANOVA? A. Each sample must be random. B. The samples must be independent. C. The samples must be normal. D. The samples must have equal means. E. All of the above are valid assumptions.
  7. Which of the following is an example of a matched pairs design? A. A teacher compares the pre-test and post-test scores of students. B. A teacher compares the scores of students using a computer-based method of instruction with the scores of other students using a traditional method. 2

D. The plot of the means does not tell us whether the means are statistically different or not. E. None of the above statements are true for this plot.

  1. What is the purpose of residual plots? A. to determine if x and y are linearly related B. to determine if the assumptions for One-Way ANOVA test of equal means are valid C. to determine if the true slope of the line, β 1 , is 0 or not D. Both A. and C. are correct. E. None of the above are correct. Or
  2. The US Golf Association regularly tests golf equipment to ensure that it conforms to USGA standards. Suppose it wishes to compare the mean distance traveled by four different brands of golf balls when struck by a driver. How should they gather their data? A. get the data from the 4 different companies and run an ANOVA test to compare the means B. poll golfers at a randomly selected golf course and record their brand and average drive (distance the ball is driven) C. take a random sample of each brand and have a golfer hit all of them, in random order D. take a random sample of golf courses and record which brands are used E. take a random sample of each brand and have four golfers hit all of them, in random order
  3. Rejecting H 0 : β 1 = 0 in the regression inference procedure says A. the x ’s are not useful in predicting the y ’s. B. the slope of the regression line is large ( i.e. , a big number). C. the probability of making a Type II error is NOT
  4. D. Exactly two of the above are correct. E. None of the above are correct.
  5. The desired percentage of silicon dioxide in a certain type of cement is 5.0. A random sample of n =36 cement specimens gave had an average percentage of 5.21 with a standard deviation of 0.38. If we wanted to test whether the cement specimens came from a population that had a true mean equal to the desired silicon dioxide percentage, which type test should we do? A. Case 6: the 1-sample test of a proportion using the desired percentage as the null value. B. Case 1: the 1-sample test of a mean using the desired percentage as the true mean null value and 0.38 as the true standard deviation. C. Case 3: the 1-sample test of a mean using the desired percentage as the true mean null value and 0.38 as the sample standard deviation. D. Case 11: the 2-sample test for 2 proportions using the desired percentage as one and the sample proportion as the other. E. Case 9: the 2-sample test for 2 means using the desired percentage as the true mean and the sample mean as the other.
  6. Which of the following would lead us to believe that the t procedures were NOT safe to use for small samples? A. The sample means and medians for the two groups were slightly different. B. The distributions of each group were moderately skewed. C. The sample standard deviations were markedly different. D. One of the samples had a few large outliers. E. Any of the above would indicate we should use a non-parametric procedure.
  7. An SRS of 100 flights of a large airline (call this airline 1) showed that 64 were on time. An SRS of 100 flights of another large airline (call this airline 2) showed that 80 were on time. Let π 1 and π 2 be the proportion of all flights that are on time for these two airlines. 90, 95 and 99% confidence intervals for the difference π 1 − π 2 are: (−0.259, −0.055), (−0.279, − 0.035) and (−0.317, 0.003) respectively. What is the range of the p -value for testing H 0 : π 1 = π 2 vs. HA: π 1 ≠ π 2? A. p -value > 0. B. 0.10 > p -value > 0. C. 0.05 > p -value > 0. D. p -value < 0. E. You need a hypothesized value to know whether it’s in the confidence interval or not. We don’t have one, so we can’t determine the p -value. And
  8. Which of the following are necessary assumptions for a χ^2 test of independence? A. We must have at least 30 in each cell. B. If we don’t have 30, then the data must be normal. C. The standard deviations (variances) must be equal. D. All of the above are necessary. E. None of the above are necessary. Or 4

Chi-Square Te sts Chi-Square 2.714 1. Value df Asymp. Sig. (2-sided) M inority Classification * Ge nde r Crosstabulation 176 194 370 40 64 104 216 258 474 Count Count Count No Yes Female Male Gender Total

  1. A Simple Random Sample of 25 male faculty members at a large university found that 10 felt that the university was supportive of female and minority faculty. An independent SRS of 20 female facutly found that only 5 felt that the university was supportive of female and minority faculty. A 95% confidence interval for the difference in the true proportions of male, πM, and female, πF, faculty at the university who felt that the university was supportive of female and minority faculty at the time of the survey is: 0.15 ± 0.270. If we want to test H 0 : πM = π F vs. HA:^ πM ≠^ πF, what conclusion should we make at the 5% significance level? A. We cannot decide without running a test of hypotheses. B. Since both sample proportions (0.4 for males and 0.25 for females) are not less than 0.05, we fail to reject H 0 and conclude we could not prove that the true proportions are different. C. Since the sample proportions (0.4 for males and 0.25 for females) are not equal, we reject H 0 and conclude that the true proportions are not equal. D. Since the confidence includes 0, we fail to reject H 0 and conclude that we could not prove the true proportions are different. E. Since the confidence does not include 0, we reject H 0 and conclude that the true proportions are different. And
  2. Which of the following is/are necessary assumptions for the data and test procedure above? A. We must have at least 30 in each sample. B. Since the sample sizes are less than 30, the data must be normal. C. The standard deviations (variances) must be equal. D. All of the above are necessary. E. None of the above are necessary.
  3. Suppose we have two SRS’s from two distinct populations and the samples are independent. We measure the same variable for both samples. Suppose both populations of the values of these variables are normally distributed but the means and standard deviations are unknown. For the purposes of comparing the two means we can use the pooled t -test procedure if A. the means for the two populations are actually equal. B. the standard deviations for the two populations are actually equal. C. the sample means for the two populations are equal. D. the sample sizes are the same. E. Two of the above are correct.
  4. A SRS of size 100 is taken from a population having proportion 0.8 of successes. An independent SRS of size 400 is taken from a population having proportion 0.5 of successes. The sampling distribution for the difference in the sample proportions, p 1 − p 2 , has mean A. 0. B. 0. C. 0. D. 0. E. 0.
  5. What does it means when we say a hypothesis test is ‘statistically significant’? A. It means that we rejected the null hypothesis. B. In One-way ANOVA, it means that the effect on the means is significant. C. In regression, it means that the line is a significant prediction equation for y. D. All of the above are true. E. None of the above are true.
  6. What conclusion can be made from the output below? A. There are less minority females than males at the 10% level of significance. B. There is not a linear relationship between minority classification and gender in the data above. C. Gender has no effect on minority classification in the data above. D. There is not a statistically significant relationship between minority classification and gender in the data above at the 5% level of significance. E. There is a statistically significant relationship between minority classification and gender in the data above at the 5% level of significance. 1A (or 1E), 2C, 3B, 4D, 5A, 6C, 7B, 8E, 9B, 10B (or 10C), 11E (or 11C), 12E, 13C, 14D, 15C, 16E (or 15D, 16E), 17B, 18A, 19D, 20D 5