Midterm Exam Questions for Statistical Methods | STAT 2000, Exams of Data Analysis & Statistical Methods

Material Type: Exam; Class: Statistical Methods; Subject: Statistics; University: Utah State University; Term: Spring 2000;

Typology: Exams

Pre 2010

Uploaded on 07/30/2009

koofers-user-fui
koofers-user-fui ๐Ÿ‡บ๐Ÿ‡ธ

4.3

(1)

10 documents

1 / 14

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
1
Stat 2000 International โ€“ Sample Midterm
The Midterm consists of 35 questions: 20 multiple-choice questions (with exactly 1
correct answer) and 15 text-based questions where you have to provide a verbal
explanation or calculate one or multiple numerical values. Some of the questions require
you to use WebStat or any of the interactivities from within CyberStats.
The exam is worth a total of 320 points. The number of points for each question is
indicated in parentheses at the beginning of each question. You have exactly 90 minutes
to complete the exam. Try to correctly answer as many questions as possible during this
time period. You are allowed to answer questions in any order. Start with a question
that seems the easiest for you. If you cannot answer a question within a short time, move
to another question, and come back to the previously unanswered questions toward the
end of the exam.
Obviously, you are allowed to correct your answers. However, only your last submitted
answer will be graded. If you change a previously correct answer and your last submitted
answer is incorrect, you will obtain 0 points for your last submitted answer.
Note:
The actual exam will be fully given within CyberStats. This means you will have to mark
your choices of multiple-choice questions and fill in answers to text-based questions
within CyberStats. In the actual exam, interactivities will be directly linked to the
questions. Make sure to memorize your CyberStats password for the exam.
pf3
pf4
pf5
pf8
pf9
pfa
pfd
pfe

Partial preview of the text

Download Midterm Exam Questions for Statistical Methods | STAT 2000 and more Exams Data Analysis & Statistical Methods in PDF only on Docsity!

Stat 2000 International โ€“ Sample Midterm

The Midterm consists of 35 questions: 20 multiple-choice questions (with exactly 1 correct answer) and 15 text-based questions where you have to provide a verbal explanation or calculate one or multiple numerical values. Some of the questions require you to use WebStat or any of the interactivities from within CyberStats.

The exam is worth a total of 320 points. The number of points for each question is indicated in parentheses at the beginning of each question. You have exactly 90 minutes to complete the exam. Try to correctly answer as many questions as possible during this time period. You are allowed to answer questions in any order. Start with a question that seems the easiest for you. If you cannot answer a question within a short time, move to another question, and come back to the previously unanswered questions toward the end of the exam.

Obviously, you are allowed to correct your answers. However, only your last submitted answer will be graded. If you change a previously correct answer and your last submitted answer is incorrect, you will obtain 0 points for your last submitted answer.

Note:

The actual exam will be fully given within CyberStats. This means you will have to mark your choices of multiple-choice questions and fill in answers to text-based questions within CyberStats. In the actual exam, interactivities will be directly linked to the questions. Make sure to memorize your CyberStats password for the exam.

1. (11 Points) Experiment with the interactivity used in Questions 17-19 in Exercises 2 in Unit A-8: Correlation โ€“ Describing Bivariate Data before answering this question.

Which of these is easiest?

a. Determining the difference between a plot with r = 1 and r =. b. Determining the difference between a plot with r = .85 and r =. c. Determining the difference between a plot with r = .50 and r =. d. Determining the difference between a plot with r = 0 and r =.

2. (7 Points) A study is done to compare the extent of heart disease in people who drink 1 to 2 alcoholic drinks per day to the extent of heart disease in non-drinkers. The researcher is able to study 200 individuals of each type.

Other factors that might affect the extent of heart disease are smoking habits and exercise habits. The smoking habits of the two groups of people are similar, but those who drank generally exercised less than the non-drinkers. In this study, the response variable is:

a. exercise b. heart disease c. smoking d. drinking status

3. (7 Points) Suppose you read that a study has found that 38% of college students engage in 'risk-taking behavior'.

Which question should be asked about this study?

a. How many students were surveyed? b. How were these students selected to be in the sample? c. What is the definition of "risk-taking behavior"? d. All of the above

6. (11 Points) Use WebStat. Load from " Data > Sample data " the data set "Left-right feet lengths.dat" and compute the correlation coefficient for the variables "Left" and "Right".

The correlation is closest to

a. zero b. -. c.. d..

7. (7 Points) The following temperatures were recorded on a February day for 25 cities:

25 31 34 35 36 41 42 42 44 45 45 46 47 48 48 48 49 50 50 52 52 53 53 53 54

The median of these 25 numbers is closest to

a. 25 b. 41 c. 47 d. 51

8. (7 Points) Two events are dependent if when one occurs:

a. The probability of the other is unchanged b. The other event will always occur c. The other will never occur d. The probability of the other is changed

9. (11 Points) Annually, about 25% of the U.S. population gets a cold and between 35 -50% gets the flu, but both illnesses are highly age-related. In 1994, for the 5-24 age group, 32.7 million of 74.8 million got the flu. For people 45 years of age and older, 18.8 million of 81.7 million got the flu.

Calculate the probability of getting the flu for the 45-and-older group.

10. (7 Points) For eye color, brown is dominant and blue is recessive. Suppose both parents have one gene for brown eyes and one for blue. A Punnett square is used to determine the probability of an offspring having brown eyes. This probability is found to be 3/4.

Is this estimate an example of experimental, theoretical or personal probability?

a. This is an experimental probability, since a Punnett square was used to determine the probabilities. b. This is a theoretical probability, since a Punnett square was used to determine the probabilities. c. This is a personal probability, since a Punnett square was used to determine the probabilities. d. This is an experimental, as well as personal probability, since a Punnett square was used to determine the probabilities.

11. (7 Points) A researcher selects a sample from a list of all patients at one of five large hospitals in the following manner. A patient is chosen from the first 25 on the list, then every 25th patient from that point forward is selected.

This is an example of a:

a. simple random sample b. systematic sample c. stratified sample d. cluster sample

12. (7 Points) A screening test for high blood pressure (a diastolic blood pressure of 90mm Hg or higher) produced the following results:

Test Result Hypertension Present

Hypertension Absent

Total

Positive Test 477 163 640 Negative Test 173 4687 4860 Total 650 4850 5500

What is the probability that a person who tests positive has hypertension? a. 74.5% b. 25.4% c. 9.8% d. 8.7%

15. (11 Points) Suppose that 30 students will participate in an experiment in which the effectiveness using a web-based approach to teaching statistics is compared to the effectiveness of a textbook-based approach.

Describe how the researcher could assign participants to the two different approaches.

16. (7 Points) This is a scatterplot showing X = grade in English 101 versus Y = grade in Psychology 101 for 120 U. of I. students.

How many people have a B in English 101 and a C in Psychology 101?

a. 0 b. 1 c. 2 d. There is not enough information to answer the question

17. (11 Points) There is a problem with the situation in this question. Describe the problem.

A list of registered automobile owners is used to select a random sample for a survey about whether people think homeowners should pay a surtax to support public parks.

18. (12 Points) The interactive histogram associated with Questions 50-56 in Exercises 5 in Unit A-5: Describing Data Graphically exhibits the responses of 136 statistics students when asked, "How many dogs plus cats have you and your family had in your lifetime?"

Do you see skewness? If so, in which direction?

19. (11 Points) The daily high temperatures for the first week in January, 1996, at Berkeley are 67, 67, 57, 57, 57, 61, 60. The mean of these values is 60.8571, and the standard deviation is 4.488.

What are the quartiles of the temperatures?

20. (7 Points) Heights for 100 individuals range from 55 inches to 75 inches. Half the individuals have heights around 65 inches, a quarter have heights near 55 inches, and the rest have heights near 75 inches.

The SD of heights is about:

a. 10 inches or less b. 20 inches c. 40 inches d. 400 inches or more

24. (7 Points) A random sample of individuals in a county has been selected for a survey. You have been hired to conduct the survey and decide to use random digit dialing. The first individual you contact tells you that she has no free time and you should call someone else.

What should you do and why?

a. Call someone else; the next number is in the same county so it does not matter which individual is included b. Try to get the person to respond anyway; if you substitute someone else the sample will be biased in favor of people with more time c. Ignore this individual and reduce the sample size by one; people who do not have a lot for free time probably will not respond truthfully anyway d. Replace this individual with another one chosen randomly from the county; one randomly chosen individual is as good as another

25. (12 Points) The interactivity associated with Questions 10-13 in Exercises 2 in Unit A-7: Scatter Plots is based on 11 observations with 6 variables. Use it to construct some plots. For the next question, choose among the following choices for the best description of each plot.

โˆ’ Linear โˆ’ Linear, but with an outlier โˆ’ Curved โˆ’ None of the above

Best description for X = X1 and Y = Y3?

26. (7 Points) A community council wishes to determine the probability that a randomly chosen driver comes to a complete stop at an intersection before proceeding through the intersection. They estimate this probability by positioning workers at several key intersections during rush hour to record whether drivers come to a complete stop or not.

The number of drivers that come to a complete stop is an example of a

a. continuous random variable b. uniform distribution c. normal distribution d. discrete random variable

27. (11 Points) A business currently has 5 telephone lines. Suppose the number of lines in use at any one time has the following distribution:

Phone lines Probability

0 lines. 1 line. 2 lines. 3 lines. 4 lines. 5 lines.

What is the probability that at least 2 lines are in use?

28. (7 Points) A fair coin is repeatedly flipped, where the player wins $4.00 if a head shows, otherwise the player loses $4.00. You win $4 when a 'head' is flipped; you lose $4 when a 'tail' is flipped. The expected value of the amount won or lost is $0.

X -$4 +$

Pr( X ) 0.5 0.

What is the standard deviation for the random variable x = amount won or lost on a single play?

a. 0 b. 2 c. 4 d. 6

32. (11 Points) The U.S. Census Bureau projected that in the year 2000, 8% of the residents of California will be black and 30% of Hispanic origin. An opinion poll taken in the year 2000 is based on a random sample of 700 California residents.

Find the expected (mean) number of blacks in the poll and the standard deviation of the number of blacks in such polls.

33. (12 Points) A university realized that about 80% of its students successfully participate in the introductory statistics class. Assume that the binomial settings hold.

In a class of 50 students, how many students do we expect to pass? And what is the probability that at least 45 students from this class pass?

Use a binomial probability calculator of your choice to answer this question.

34. (12 Points) Use WebStat. Load from " Data > Sample data " the data set "Chicago 42 tallest building.dat". Compute (and report) the mean and median of the variable "Heights"

Based on a comparison of just these 2 numbers, is there skewness towards the higher values, skewness towards the lower values, or can we assume the data is reasonably symmetric?

35. (11 Points) Use WebStat. Load from " Data > Sample data " the data "Age first wedding.dat". Construct boxplots for the variables "Female" and "Male", using fences to identify any outliers.

Which of these answers is correct?

a. The oldest female in this data set is older than the oldest male. b. More than 75% of the men are 20 years or older. c. More than 50% of the woman are 25 years or younger. d. All of the above are correct.