Random Sample - Introduction to Statistics - Exam, Exams of Statistics

This is the Past Exam of Introduction to Statistics which includes Variables, Description, Different Houses O, Quantitative, Square Footage, Monthly Gas Bill, Monthly Electric Bill, Heights, Histogram etc. Key important points are: Random Sample, Commissioned, Survey, Prefer Brand, Majority, Significance, Situation, Value, Coffee Drinkers, Evidence

Typology: Exams

2012/2013

Uploaded on 02/26/2013

aparna
aparna 🇮🇳

4.2

(10)

112 documents

1 / 17

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
University of Toronto Scarborough
STAB22 Final Examination
April 2009
For this examination, you are allowed two handwritten letter-sized
sheets of notes (both sides) prepared by you, a non-programmable,
non-communicating calculator, and writing implements.
This question paper has 17 numbered pages. Before you start,
check to see that you have all the pages. You should also
have a Scantron sheet on which to enter your answers, and a set
of statistical tables. If any of this is missing, speak to an invigilator.
This examination is multiple choice. Each question has equal
weight, and there is no penalty for guessing. To ensure that
you receive credit for your work on the exam, fill in the bubbles
on the Scantron sheet for your correct student number (under
“Identification”), your last name, and as much of your first name
as fits.
Mark in each case the best answer out of the alternatives given
(which means the numerically closest answer if the answer is a
number and the answer you obtained is not given.)
If you need paper for rough work, use the back of the sheets of this
question paper. The question paper will be collected at the end of
the examination, but any writing on it will not be read or marked.
Before you begin, two more things:
Check that the colour printed on your Scantron sheet matches
the colour of your question paper. If it does not, get a new
Scantron from an invigilator.
Complete the signature sheet, but sign it only when the in-
vigilator collects it. The signature sheet shows that you were
present at the exam.
1
pf3
pf4
pf5
pf8
pf9
pfa
pfd
pfe
pff

Partial preview of the text

Download Random Sample - Introduction to Statistics - Exam and more Exams Statistics in PDF only on Docsity!

University of Toronto Scarborough

STAB22 Final Examination

April 2009

For this examination, you are allowed two handwritten letter-sized sheets of notes (both sides) prepared by you, a non-programmable, non-communicating calculator, and writing implements.

This question paper has 17 numbered pages. Before you start, check to see that you have all the pages. You should also have a Scantron sheet on which to enter your answers, and a set of statistical tables. If any of this is missing, speak to an invigilator.

This examination is multiple choice. Each question has equal weight, and there is no penalty for guessing. To ensure that you receive credit for your work on the exam, fill in the bubbles on the Scantron sheet for your correct student number (under “Identification”), your last name, and as much of your first name as fits.

Mark in each case the best answer out of the alternatives given (which means the numerically closest answer if the answer is a number and the answer you obtained is not given.)

If you need paper for rough work, use the back of the sheets of this question paper. The question paper will be collected at the end of the examination, but any writing on it will not be read or marked.

Before you begin, two more things:

  • Check that the colour printed on your Scantron sheet matches the colour of your question paper. If it does not, get a new Scantron from an invigilator.
  • Complete the signature sheet, but sign it only when the in- vigilator collects it. The signature sheet shows that you were present at the exam.
  1. A simple random sample of 70 coffee drinkers was taken. Each sampled coffee drinker was asked to taste two unmarked cups of coffee, one of which is actually Brand A and the other is Brand B, and was asked which one they preferred. 44 coffee drinkers preferred Brand A, and the other 26 preferred Brand B. Use this information to answer this question and the next one. The people who commissioned the survey are trying to find out whether a majority of all coffee-drinkers (that is, more than 50% of them) prefer brand A. Carry out a suitable test of significance to assess the evidence. What is the P-value of your test? (a) between 0.05 and 0. (b) between 0.025 and 0. (c) greater than 0. (d) between 0.01 and 0. (e) less than 0.
  2. In the situation of Question 1, it turned out that the coffee drinkers had always been given an unmarked cup containing Brand A coffee first, and Brand B coffee second. How do you react to this knowledge? (a) The P-value was small, so there will still be good evidence that Brand A is preferred. (b) The coffee drinkers received their coffee in unmarked cups, so it doesn’t matter which brand is actually tasted first. (c) Brand A must have an advantage by being tasted first, so there cannot be a significant difference between brands A and B. (d) The P-value was large, so there will still be no evidence that Brand A is preferred. (e) A better approach would have been to toss a coin to decide whether each drinker gets Brand A first or Brand B first.
  3. A university financial aid office took a simple random sample of students to see how many of them were employed the previous summer. Of the 750 men sampled, 703 had been employed the previous summer; of the 650 women sampled, 592 had been employed the previous summer. Use this information for this question and the next one. Test whether the proportion of all male students employed last summer is different from the proportion of all female students employed last summer. What is the P-value of your test? (a) between 0.05 and 0. (b) between 0.01 and 0. (c) between 0.025 and 0. (d) less than 0. (e) greater than 0.
  4. From the information given in Question 3, what is the upper limit of a 95% confidence interval for the difference between the proportion of men employed the previous summer and the proportion of women employed the previous summer? (Take the difference as men minus women.) (a) 0. (b) -0. (c) 0. (d) 0. (e) -0.
  1. A student’s mark on a Psychology exam has a normal distribution has a normal distribution with mean 65 and SD 10. The same student’s mark on a Chemistry exam has a normal distribution with mean 60 and SD 15. The two marks are independent of each other. Use this information for this question and the next two. What is the probability that the student scores over 80 on both exams? (a) between 0.01 and 0. (b) between 0.20 and 0. (c) between 0.10 and 0. (d) less than 0. (e) greater than 0.
  2. What is the probability that the student scores over 80 on exactly one of the two exams?

(a) between 0.01 and 0. (b) between 0.20 and 0. (c) between 0.10 and 0. (d) less than 0. (e) greater than 0.

  1. What is the probability that the student’s mean mark in the two courses is over 80?

(a) between 0.10 and 0. (b) greater than 0. (c) between 0.01 and 0. (d) between 0.20 and 0. (e) less than 0.

  1. The time it takes (for any student) to complete a STAB22 final exam is a random variable having a normal distribution with mean 160 minutes and standard deviation 15 minutes. Use this information for this question and the next one. Anne and Bob are two friends writing this exam. What is the probability that Anne will complete the exam at least ten minutes before Bob completes the exam? (Assume that all students start the exam at the same time and that their completion times are independent.) (a) this probability is greater than 0.45 but less than 0. (b) this probability is less than 0. (c) this probability is greater than 0.35 but less than 0. (d) this probability is greater than 0.01 but less than 0. (e) this probability is greater than 0.
  1. Using the information given in Question 12, if 300 students are writing the exam, what is the ap- proximate probability that more than 275 students will complete the exam in less than 180 minutes? Choose the closest value from the options below. (a) 0. (b) 0. (c) 1. (d) 0. (e) 0.
  2. In a certain city, the probability that a call to a randomly-chosen telephone number will reach a live person is 0.3. Suppose 10 such calls are made. What is the probability that at least one of them reaches a live person? (a) less than 0. (b) between 0.50 and 0. (c) greater than 0. (d) between 0.30 and 0. (e) between 0.10 and 0.
  3. An experiment was conducted to compare a new drug with a standard drug with respect to mean recovery time. It is known that the mean recovery time for the standard drug is 26 days. Following the experiment, which involved 65 patients, a 95% confidence interval estimate was constructed for the mean recovery time (in days) for patients on the new drug. The 95% confidence interval was found to be from 24.6 to 27.8. Which of the following statements is a valid conclusion based on this information? (a) The experimenter should reject the claim that the new drug is the same as the standard drug with respect to mean recovery time. (b) There is evidence, at the 5% level of significance, that the new drug is better than the standard drug. (c) There is insufficient evidence, at the 1% level of significance, to reject the claim that there is no difference between the new drug and the standard drug with respect to mean recovery time. (d) There is sufficient evidence, at the 5% level of significance, to reject the claim that there is no difference between the new drug and the standard drug with respect to mean recovery time. (e) None of the other statements is a valid conclusion based on the information from this study.
  4. Following the analysis of a well-designed completely randomized experiment it was reported that the observed effect was “statistically significant”. Which of the following statements best explains the meaning of the phrase “statistically significant”? (a) The observed result made sense to the experimenter since it was what was hoped would happen. (b) The observed effect happened because the experiment was properly designed and carried out without bias. (c) The experimenter carefully employed the basic principles of experimental design in conducting the study. (d) The observed effect was sufficiently large so that it would rarely occur simply by chance. (e) The laws of probability say that this observed result would be expected to happen by chance.
  1. The distribution of the heights of students in a large class is approximately normal. Moreover, the average height is 68 inches, and approximately 16% of the students were taller than 71 inches. Based on the 68-95-99.7 rule, approximately what percentage of students in this class are taller than 65 inches? (a) 2.5% (b) 84% (c) 5% (d) 16% (e) 95%
  2. A company producing a certain item has two machines: Machine A and Machine B. Machine A is an old machine and 10% of the items produced by this machine are defectives. Machine B is a new machine and 5% of the items produced by this machine are defectives. A random sample of two items was selected from each machine for quality control purposes. What is the probability that the sample from machine B will have MORE defective items than the sample from machine A? Choose the closest answer from the options below. (a) 0. (b) 0. (c) 0. (d) 0. (e) 0.
  3. In an experiment, the soles of boys’ shoes are made of two different synthetic materials, A and B. To see which material is better (lasts longer), measurements were made on the amount of wear of the soles of shoes worn by 10 boys. Each boy wore a special pair of shoes - the left sole was made with material A and the right sole with material B. Based on this information, which one of the following statements is true? (a) The decision as to whether the left or the right sole is to be made with material A or B should be determined by using a table of the standard normal distribution. (b) The type of material is the response variable, with two levels, A and B. (c) None of the above other statements is true. (d) This is an example of a completely randomized design. (e) Increasing the sample size (number of replications) is one way to eliminate the need for any randomization in assigning the materials.
  4. The amount of money spent by a customer at a discount store is a random variable with mean $ and standard deviation $30. What is the approximate probability that a randomly selected group of 50 shoppers will spend a total of more than $5300? Choose the number closest to the answer from the options given below. (a) 0. (b) 0. (c) 0. (d) 0. (e) 0.
  1. According to a report, the mean salary for mayors in all Canadian cities is $120,000. A newspaper believes that this figure is not correct, and wishes to assess the evidence against it. A reporter at the newspaper takes a simple random sample of 10 Canadian cities, and finds that the mean salary for mayors in those cities is $132,200. Use this information for this question and the following one. What would a suitable null hypothesis for a test of significance say? (a) The sample mean salary is equal to $132,200. (b) The mean salary of all mayors is equal to $120,000. (c) The mean salary of all mayors is not equal to $120,000. (d) The sample mean salary is not equal to $132,200. (e) The mean salary of all mayors is greater than $120,000.
  2. Using the information in Question 24, what would a suitable alternative hypothesis for a test of signif- icance say? (a) The mean salary of all mayors is not equal to $120,000. (b) The sample mean salary is not equal to $132,000. (c) The mean salary of all mayors is equal to $120,000. (d) The sample mean salary is equal to $132,200. (e) The mean salary of all mayors is greater than $120,000.
  3. A fair 10-sided die has faces numbered from 1 to 10. The probability of rolling a 1 on any roll is 0.1. If the die is rolled 12 times, what is the value of r such that the probability of rolling r 1’s or less is 0.8891? (a) there is no value of r that works (b) 2. (c) 2 (d) 5 (e) 1
  4. In a population, 45% of people have type O blood, 40% have type A, 11% have type B, and 4% have type AB. Use this information for this question and the next one. Consider an accident victim with type B blood. She can only receive a transfusion from a person with type B or type O blood. There are three people from this population who are willing to donate blood, but they don’t know what their blood type is. Assuming that the blood types of the three individuals are independent, what is the probability that at least one of them will be a suitable donor? Choose the closest answer from the following options. (a) 0. (b) 0. (c) 0. (d) 0. (e) 0.

Regression Analysis: CR versus HSC The regression equation is CR = 0.0952 - 0.000630 HSC

Predictor Coef SE Coef T P Constant 0.09521 0.01155 8.24 0. HSC -0.0006303 0.0001442 -4.37 0.

S = 0.0104614 R-Sq = omitted R-Sq(adj) = omitted

Use this information for this question and the next one. Which of the following statements is true? (a) The linear regression of CR on PVTY explains more than 50% of the variability in CR. (b) Using the regression of CR on PVTY, the residual for an observation with PVTY = 3.7 and CR = 0.05 is greater than 0.005. (c) The linear regression of CR on HSC explains more than 25% of the variability in CR. (d) None of the other four statements is true. (e) The MINITAB output for the regression analysis of CR versus PVTY above indicates the need for a higher order (curved) model rather than a linear model.

  1. Using the information in Question 29 above, calculate the slope of the regression line for predicting PVTY from HSC). Choose the closest value from the options below. (a) − 0. 5 (b) − 0. 3 (c) − 0. 2 (d) − 0. 4 (e) − 0. 1
  1. A car magazine commissions a study to determine whether single drivers do more driving for pleasure than married drivers. The magazine collects samples of 35 single drivers and 35 married drivers, and records how many kilometres per week of pleasure driving each driver does. Some Minitab output from an analysis is shown below:

Two-Sample T-Test and CI: single, married Two-sample T for single vs married N Mean StDev SE Mean single 35 193.3 27.0 4. married 35 190.4 24.7 4.

Difference = mu (single) - mu (married) Estimate for difference: 2. 95% CI for difference: (-9.50924, 15.16639) T-Test of difference = 0 (vs not =): T-Value = 0.46 P-Value = 0.649 DF = 67

Using the output as given, what conclusion should the car magazine draw from its study, using α = 0. 05 if necessary? (a) There is evidence that the amount of pleasure driving done by single and married people is different on average. (b) There is no evidence that single people do more pleasure driving than married people, because the P-value is about 0.325. (c) There is evidence that single people do more pleasure driving than married people, on average. (d) Single people do about 15 kilometres per week of pleasure driving more than married people do. (e) There is no evidence that single people do more pleasure driving than married people, because the P-value is about 0.649.

  1. A carnival game offers a $100 cash prize for anyone who can break a balloon by throwing a dart at it. It costs $15 for each throw. John is willing to spend up to $45 trying to win. That means he is going to stop if he breaks the balloon on or before the 3rd throw and stop at the 3rd throw even if he failed to break it on the 3rd throw. The chance that John will break the balloon in any throw is 0.1. Assume that breaking the balloon on any attempt is independent of what happens on the other attempts. John’s net gain is the amount of money he won minus the total cost. Denote this net gain by X. Which of the following numbers is the closest to the mean of X? (a) $ (b) $ (c) $ (d) −$ (e) −$
  1. Using the information in Question 33 above, which of the following statements is true?

(a) Consider the 90% confidence interval for μEmp, the population mean GPA of students who were employed. The margin of error of this confidence interval is greater than 0.11. (b) Based on the on the information given in Question 33, we see that t-procedures cannot be applied for this data set. (c) Consider the t-test for testing the null hypothesis μEmp = 3.0 against the alternative hypothesis μEmp < 3 .0. The P-value of this test is less than 0.01. (d) None of the other four statements is true. (e) Consider the t-statistic for testing the null hypothesis H 0 : μEmp = 3.0 against the alternative hypothesis Ha : μEmp < 3 .0. This test statistic is between − 2 .00 and 2.00.

  1. Consider the information given in Question 33 above. If we were interested in testing the null hypothesis that the population mean GPAs of the two groups were equal, against the alternative that the means were different, what could we say about the P-value for this test? It must be (a) between 0.02 and 0. (b) between 0.05 and 0. (c) between 0.01 and 0. (d) greater than 0. (e) less than.
  2. A random variable X might have one of two probability distributions. The null hypothesis is that X has this distribution: Value 0 1 2 3 Probability 0.50 0.40 0.08 0. The alternative hypothesis is that X has this distribution: Value 0 1 2 3 Probability 0.10 0.30 0.35 0. A test of significance is carried out by observing one value of X, and rejecting the null hypothesis in favour of the alternative if the observed value is 2 or larger. Use this information for this question and the following one. What is the probability of a Type I error for this test? (a) 0. (b) cannot determine it, because we need to know the exact value that was observed. (c) 0. (d) 0. (e) 0.
  3. In Question 36, a test of significance was described for the distribution of a random variable X. What is the probability of a Type II error for this test? (a) 0. (b) 0. (c) 0. (d) 0. (e) 0.
  1. When would you prefer to use t procedures (confidence interval, test of significance) instead of z procedures? (a) When you want to obtain a smaller confidence interval for the same sample size. (b) When you have a small sample. (c) When the population standard deviation is not known. (d) When the population distribution is approximately normal. (e) When you have a large sample.
  2. The costs of major surgery can vary substantially from one place to another. A study of the costs involved in a particular surgery was done in California and Montana. The 95% confidence intervals for the mean costs in the two states were reported to be from $5826.76 to $6173.24 in Montana and from $6061.41 to $6338.59 in California. No other information was given in the report. Assume that these intervals were calculated based on the normal distribution (i. e. assume that both the sample sizes were very large). Calculate the upper limit of the 95% confidence interval for the difference between the two population means (i.e. mean cost in California minus mean cost in Montana). Choose the closest answer from the following options. (a) $ (b) $ (c) $ (d) $ (e) $
  3. A 90% confidence interval for a population mean, based on a sample of size 30, is from 40 to 65. This confidence interval turned out to be too big. Which of the following is a way of making the confidence interval shorter? (a) Make the sample size bigger. (b) Use the median instead of the mean. (c) Make the sample size smaller. (d) Choose a higher level of confidence.
  4. A magazine is considering the launch of an online edition. In a small survey, the magazine contacted a random sample of 500 current subscribers and asked whether they would be interested in an online edition. Some of these subscribers showed an interest and some did not. Based on the information from this sample, the investigators tested the null hypothesis H 0 : p = 0.3 against Ha : p > 0 .3 , where p is the population proportion of subscribers who will be interested in an online edition. The value of the Z-statistic for testing H 0 : p = 0.3 against Ha : p > 0 .3 was 5.86. If we were interested in testing H 0 : p = 0.4 against Ha : p > 0 .4 based on the same sample used above, what could we say about the value of the Z-statistic for this test? This value is: (a) between 1.5 and 3.0. (b) between 3.0 and 4.5. (c) greater than 4.5. (d) between 0.5 and 1. (e) less than 0.5.
  1. A nutrition laboratory tested a random sample of 50 “reduced sodium” hot dogs. The mean sodium content of the sample was 309mg. Let μ denote the population mean sodium content. A test was carried out to test the null hypothesis H 0 : μ = 300 against Ha : μ > 300. The P-value of this test was 0.038. A test was also carried out of the null hypothesis H 0 : μ = 298 against Ha : μ > 298 (using the same sample); the P-value was 0.015. Assume that the data satisfy all assumptions required for the tests. If we calculate a 95% confidence interval for μ using this sample, what can we say about its margin of error? Choose the correct range for this margin of error from the following alternatives. (a) between 12.00mg and 16.00mg (b) between 8.00mg and 12.00mg (c) greater than 16.00mg (d) between 4.00mg and 8.00mg (e) less than 4.00mg
  2. A random variable X has this probability distribution: Value 1 3 5 Probability 0.1 0.3 0. What is the variance of X? (a) 2. (b) 3. (c) 1. (d) 1. (e) 4.
  3. Let Z be a standard normal random variable and let T be a random variable with a t distribution with 5 degrees of freedom. Which of the following events is most likely (i.e. has the highest probability) to occur? (a) Z and T are both greater than 2. (b) Z is less than − 2. 0 (c) T is greater than 2. (d) Z is greater than 2. (e) Z is greater than or equal to 2.
  4. A 99% confidence interval for a population mean goes from 27 to 33. The interval was based on a sample size of 50. The interval was calculated using a known population standard deviation, but unfortunately the value has been lost. What is the population standard deviation? (a) 10. (b) 24. (c) 8. (d) 1. (e) 16.
  1. An insurance company is trying to estimate the mean number of sick days that food service workers take per year. The company wishes to use a 90% confidence interval. The standard deviation of the number of sick days is known to be 2.5. How big a sample does the insurance company need to take if they wish to estimate the mean to within 0.5 days? (a) 42 (b) 30 (c) 8 (d) 68
  2. An association of Christmas tree growers in Indiana commissioned a survey, in which a simple random sample of all Indiana residents was taken. One of the questions in the survey was “Did you have a Christmas tree during the last holiday season”? 400 people were surveyed, and 331 answered “yes”. What is the upper limit of a 99% confidence interval for the proportion of all Indiana residents who had a Christmas tree during the last holiday season? (a) 0. (b) 0. (c) 0. (d) 0. (e) 0.
  3. A regression was carried out to predict a variable y from a variable x. The residuals from the regression were plotted against x, as shown below:

What do you learn from the plot? (a) A curve should be fitted rather than a straight line. (b) The residuals have a straight-line relationship with x. (c) A straight-line relationship describes the relationship well. (d) The predictions become less accurate as x increases. (e) The correlation between x and y is low.