STAT 303 Exam 1 - Fall 2006 - Section 507, Exams of Data Analysis & Statistical Methods

The instructions and questions for a statistics exam taken by students in a university class. The exam covers topics such as linear regression, correlation, five-number summary, outliers, z-scores, and probability. Students are required to answer multiple-choice questions and work alone.

Typology: Exams

Pre 2010

Uploaded on 02/10/2009

koofers-user-y6z
koofers-user-y6z 🇺🇸

10 documents

1 / 6

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
Name: __________________________ UIN #:____________________________
STAT 303 Sections 507
Fall 2006
EXAM #1
Form A
Instructor: Lian Liu
Sept. 21, 2006
1. Don’t EVEN open this until you are told to do so.
2. There are 20 multiple-choice questions on this exam, each worth 5 points. Please
mark your answers clearly on the scantron. Multiple marks will be counted wrong.
3. You will have 60 minutes to finish this exam.
4. If you are caught cheating or helping someone to cheat on this exam, you both will
receive a grade of zero on the exam. You must work alone.
5. Good luck!
pf3
pf4
pf5

Partial preview of the text

Download STAT 303 Exam 1 - Fall 2006 - Section 507 and more Exams Data Analysis & Statistical Methods in PDF only on Docsity!

Name: __________________________ UIN #:____________________________

STAT 303 Sections 507

Fall 2006

EXAM

Form A

Instructor: Lian Liu

Sept. 21, 2006

1. Don’t EVEN open this until you are told to do so.

2. There are 20 multiple-choice questions on this exam, each worth 5 points. Please

mark your answers clearly on the scantron. Multiple marks will be counted wrong.

3. You will have 60 minutes to finish this exam.

4. If you are caught cheating or helping someone to cheat on this exam, you both will

receive a grade of zero on the exam. You must work alone.

5. Good luck!

  1. Suppose the average weight of a baby (in pounds) can be predicted based on age (in months) by a linear regression. A researcher studied a dataset of 245 babies, and reported his results by giving the least- squares equation: weight = 6.9 + 1.2 ( age ), along with the fact that r² = 0.56. Then which of the following is true? A. On average, babies gain 1.2 pounds per month. B. The predicted weight of a baby at birth is 6.9 pounds. C. 56% of the variation in weight is explained by the regression model. D. Exactly two of the above. E. All of the above.
  2. Which of the following statements is true? A. Correlation (r) is a value between 0 and 1 B. If r = 0.92, it shows a stronger linear relationship than r = - 0. C. Positive correlation means all of the data values are positive D. If r = 1, it means the explanatory variable X causes the response variable Y E. None of above
  3. What is the five-number summary? A. min, first quartile, median, second quartile, max B. min, first quartile, second quartile, third quartile, max C. min, first quartile, second quartile, third quartile, IQR D. min, mean, median, IQR, max E. mean, median, IQR, standard deviation, range
  4. Which of the following is/are greatly affected by outliers? A. Sample mean B. Sample median C. Sample standard deviation D. Two of the above are greatly affected by outliers. E. All of the above are greatly affected by outliers.
  5. Suppose after this exam I calculate everybody’s z-scores. What does it mean to get a z-score of +3.0 on this exam? A. It means that you missed 3 questions on the exam. B. It means that you got 3 times as many questions on this exam correct as the average student. C. It means that your grade was 3 standard deviations above the average on this exam. D. It means that your grade on this exam was in the upper 3% of the class. E. It means that your grade was 3 points higher than the average on this exam.
  6. The distribution of high speed is bell shaped and symmetric. Approximately 99.7% of high speed’s are between 110 and 200. From this information, what is probability that a car has a high speed less than 170? A. 95% B. 84% C. 68% D. 65% E. 50%
  1. It is likely that more education is a cause of higher income – many highly paid professions require advanced education. However, people who have high ability are more likely to get many years of education than people who are less able. Of course, people with outstanding ability are more likely to have high earnings even without much education. The ability here is A. A population parameter B. The response variable C. A confounding variable D. Undercoverage E. None of above
  2. What is the best description of the data having those graphs below? A. Symmetric, normal and unimodal B. Symmetric, non-normal and bimodal C. Skewed to the left, non-normal and unimodal D. Skewed to the right, normal and bimodal E. Skewed to the right, non-normal and unimodal
  3. Here are the stemplot of the numbers of home runs that Babe Ruth hit in each of his 15 years with the New York Yankees, 1920 to 1934: The interquartile range is A. 4 B.19 C. 22 D. 38 E. 46
  4. Which of the following is TRUE? A. Zip code is a discrete numeric variable, but gender is a categorical variable. B. Zip code is a categorical variable, but social security number is a discrete numeric variable. C. Zip code is a discrete numeric variable, and phone number is also a discrete numeric variable. D. Area code is a categorical variable, but social security number is a discrete numeric variable. E. Area code is a categorical variable, but height is a continuous numeric variable.
  1. A simple random sample of 60 undergraduates at Johns Hopkins University found that 50% of those sampled felt that drinking was a problem among college students. A simple random sample of 30 undergraduates at Ohio State University found that 70% felt that drinking was a problem among college students. The number of undergraduates at Johns Hopkins University is approximately 2000, while the number at Ohio State is approximately 40,000. We conclude A. The sample from Johns Hopkins has less sampling variability than that from Ohio State B. The sample from Johns Hopkins has more sampling variability than that from Ohio State C. The sample from Johns Hopkins has almost the same sampling variability as that from Ohio State D. It is impossible to make any statements about the sampling variability of the two samples because the students surveyed were different E. None of the above
  2. Suppose you have calculated that the correlation between the numbers of hours spent studying for a statistics test and the grade on the statistics test. The resulting correlation is 1.25. Which of the following do you know to be true? A. People who studied longer got lower grades on the test. B. Studying longer caused people to do better on the test. C. People who studied longer got higher grades on the test. D. A calculation error was made. E. You should not calculate the correlation between two variables since both variables are categorical.
  3. The following is a two-way table for a survey that compared whether men and women were binge drinkers or not. What’s the percentage that a male turns out to be a binge drinker? Female Male Binge drinker 1699 1850 Not a binge drinker 7302 4076 A. 31.22% B. 12.39% C. 52.13% D. 47.87% E. 11.38%
  4. Assuming nothing about the shape of the distribution of a sample, which of the following is true? A. Approximately 95% of the observations fall within 3 standard deviation of the mean. B. Approximately 95% of the observations fall within 2 standard deviations of the mean. C. Approximately 68% of the observations fall within 1 standard deviations of the mean. D. Two of the above are true. E. None of the above is true.
  5. In order to take a sample of 1200 people from a population, the population is first divided into men and women. Then a simple random sample of 500 men and a separate simple random sample of 700 women are obtained. This is an example of A. Simple random sampling B. Stratified random sampling C. Cluster sampling D. Voluntary response E. Multi-stage sampling