Download Exam B: Statistical Inference and Hypothesis Testing and more Exams Economic statistics in PDF only on Docsity!
Stuff I Need to Know for Econ 15A-B
These questions represent everything you need to know for the upcoming exam (only as far as we have covered in lecture, of course). Bulleted items are important and useful, but they will not be questions on any exams; only the numbered items will be relevant to the exam. ========================================================= Exam B3 things to know Whenever possible, show all your work. If the problem allows you to calculate a particular number, or some expression shown in class, you must calculate that number/expression, and reduce all fractions. Unless specified otherwise, questions about confidence intervals refer to confidence intervals for the mean, with the variance known. Use the T-distribution values in this table whenever they are given; if the appropriate T- distribution is not here, use the normal distribution instead. For this exam only, when reporting the results of a test, you do not need to report the p-value. p = .900 .905 .910 .915 .920 .925 .930 .935 .940 .945. @qnorm(p) = 1.28 1.31 1.34 1.37 1.41 1.44 1.48 1.51 1.55 1.60 1. @tdist(p, 4) = 1.53 1.58 1.62 1.67 1.72 1.78 1.84 1.90 1.97 2.05 2. @tdist(p, 5) = 1.48 1.52 1.56 1.60 1.65 1.70 1.75 1.81 1.87 1.94 2. @tdist(p, 6) = 1.44 1.48 1.52 1.56 1.60 1.65 1.70 1.75 1.81 1.87 1. p = .955 .960 .965 .970 .975 .980 .985 .990. @qnorm(p) = 1.70 1.75 1.81 1.88 1.96 2.05 2.17 2.33 2. @tdist(p, 4) = 2.23 2.33 2.46 2.60 2.78 3.00 3.30 3.75 4. @tdist(p, 5) = 2.10 2.19 2.30 2.42 2.57 2.76 3.00 3.36 4. @tdist(p, 6) = 2.02 2.10 2.20 2.31 2.45 2.61 2.83 3.14 3.
- What is a Type I error, written in probability theory notation?
- What is a Type II error, written in probability theory notation?
- Describe, in ordinary English, what a Type I error is.
- Describe, in ordinary English, what a Type II error is.
- is directly related to one of the two Types of errors; what Type, and how?
- is directly related to one of the two Types of errors; what Type, and how?
- In what fundamental way is the null hypothesis different from the alternative hypothesis?
- In a one-sided hypothesis test, what role, if any, does H 1 play in the calculations?
- Suppose a quantile q has been determined for a one-sided hypothesis test where H 1 : . Show how to calculate from the appropriate inequality involving the t statistic and q to an inequality involving the sample mean estimator and the critical boundary.
- What are the components of a statistical test, as given in lecture and the slides?
- Run and report on a statistical test for the mean of a N(, ^2 ), with data {2, 1, 4, 7, 3}, = .05, H 0 : = –1, H 1 : = 3.
- Give the general formula for the t-statistic.
- Give a formula for the p-value in probability theory notation for a one-sided test, where H 1 : .
- Describe, in ordinary English, what the p-value tells you.
- In a two-sided hypothesis test, what role, if any, does H 1 play in the calculations?
- What things are included in a report of a statistical test?
- Run and report on a statistical test for the mean of N(, ^2 ), with data {6, 1, 0, 7, 3}, = .05, H 0 : = –1, H 1 : > –1.
- If you ran a statistical test in Eviews, where H 0 : = 60, and H 1 : < 60, Eviews would report a larger p-value than is in fact correct. Why?
- If you ran a statistical test in Eviews, where H 0 : = 60, and H 1 : > 60, Eviews would report a larger p-value than is in fact correct. Why?
- Describe how to run a hypothesis test for the mean where H 0 : = 45 and H 1 : < 45; i.e., list the basic quantities to be determined, and how to put them together, and what to compare, and how to conclude.
- Describe how to run a hypothesis test for the mean where H 0 : = 45 and H 1 : > 45; i.e., list the basic quantities to be determined, and how to put them together, and what to compare, and how to conclude..
- Describe how to run a hypothesis test for the mean where H 0 : = 45 and H 1 : ≠ 45; i.e., list the basic quantities to be determined, and how to put them together, and what to compare, and how to conclude.
- What is the difference between a one-sided and a two-sided hypothesis test?
- For a test of the equality of means, what is H 0?
- Explain how increasing does (or doesn’t) change the chances of a Type I error.
- Explain how increasing does (or doesn’t) change the chances of a Type II error.
- Explain how increasing does (or doesn’t) change the chances of a Type I error.
- Explain how increasing does (or doesn’t) change the chances of a Type II error.
- Explain how increasing n does (or doesn’t) change the chances of a Type I error.
- Explain how increasing n does (or doesn’t) change the chances of a Type II error.
- If every other aspect of a statistical test is the same, which has more power, a one- sided or two-sided hypothesis test? Explain. (Assume that all the relevant assumptions behind both tests are correct.) Consider the probability formulas for a confidence interval, and for a two-sided hypothesis test. How are they alike? How are they different?
- “A two-sided test is more powerful, because you test both sides of the distribution.” Explain what is wrong with this claim.
- What is the primary advantage and primary disadvantage of a two-sided test, as compared with a one-sided test?
- Give the mathematical formula for the estimate of the variance of the mean in a test for equality of means.
- If there are 30 observations in the first group, and 38 observations in the second group, how many degrees of freedom are there in a test for the equality of means?
- Suppose you read that in a survey of 20 UCI students, the average number of hours spent online per day is 6.2, with a standard deviation of 1.4, and that in a survey of 30 UCSD students, the average was 4.8, with a standard deviation of 1.7. Use this data to conduct a test for the equality of means at the .05 level.
- A cranky comment on your blog complains about “kids these days” spending too much time on “frivolous” activities like Twittering, Facebooking, etc. She cites a study that found that persons aged 20 – 25 spend on average about 3.2 hours per day on these activities, and declares that there is a problem because anything more than 1.5 hours per day is too much. You follow the link to this study and find that the poll used 18 responses and had a standard deviation of 1.9. Conduct a one-sided test, at the .05 level, to see if these data supports this critic’s comments.
- In a given sample of size 45, if the mean and standard deviation are 6 and 3, respectively, what is the t-statistic for the null hypothesis that the population mean is 7?
- ** [Not required] Do problem 11.1 in the book.
- Do problem 11.2 in the book.
- Do problem 11.3 in the book.
- Do problem 11.4 in the book.
- Do problem 11.5 in the book.
- Do problem 11.6 in the book.
- Do problem 11.7 in the book.
- Do problem 11.8 in the book.
- Do problem 11.11 in the book.
- Do problem 11.13 in the book.
- Do problem 11.14 in the book.
- Do problem 11.15 in the book.
- Do problem 11.16 in the book.
- We studied four different types of statistical tests; name them.
- Describe the hypotheses involved in a significance test between two simple hypotheses; i.e., what is general form of H 0 and H 1?
- Describe the hypotheses involved in a one-sided significance test; i.e., what is general form of H 0 and H 1?
- Describe the hypotheses involved in a two-sided significance test; i.e., what is general form of H 0 and H 1?
- Describe the hypotheses involved in a test for the equality of means; i.e., what is general form of H 0 and H 1?
- In actual research, one of the background assumptions behind statistical testing is frequently violated. What is this assumption, and why is this violation often not a serious concern?
- Give a “real life” example of when you might want to use a significance test between two simple hypotheses.
- Give a “real life” example of when you might want to use a one-sided significance test.
- Give a “real life” example of when you might want to use a two-sided significance test.
- Give a “real life” example of when you might want to use a test for the equality of means.
- Which, if either, is larger, the cumulative distribution of T(8) at -3.2 or the cumulative distribution of T(27) at -3.2. Explain your answer.
- Which, if either, is larger, the cumulative distribution of T(8) at -3.2 or the cumulative distribution of T(8) at 3.2. Explain your answer.
- Which, if either, is larger, the cumulative distribution of N(0, 1) at -3.2 or the cumulative distribution of T(8) at -3.2. Explain your answer.
- Give a formula for the p-value in probability theory notation for a two-sided test.
- Give a formula for the p-value in probability theory notation for a one-sided test, where H 1 : .
- Suppose x^ = 10, sx = 7.1, nx = 12, and y^ = 14, sy = 3.2, ny = 15. You want to conduct a hypothesis test for the equality of the means. What are the sum of squared deviations from the mean for x and y?
- Describe how to run a hypothesis test for the equality of means; i.e., list the basic quantities to be determined, and how to put them together, and what to compare, and how to conclude.
- Why do we typically use a T-distribution to conduct statistical tests rather than a Normal distribution?
- Give the specific formula for the t-statistic for a test for equality of means.
- When conducting a two-sided hypothesis test, why do we divide s by n?
- When conducting a two-sided hypothesis test, why do we subtract H0 from x , and then divide the result by sx^?
- If the p-value of a test is .04, is the result significant at = .05? Why or why not?
- If the p-value of a test is .06, is the result significant at = .05? Why or why not?
- “If we wanted to be really certain of a statistical test, then we would set = .000000000001.” Explain why this is typically unwise practice.
- When conducting a two-sided test, why do we use instead of ?
- When conducting a two-sided test, why do we double the p-value that we would get if our test was one-sided?
- Run and report on a statistical test for the mean of N(, ^2 ), with data {6, 1, 0, 7, 3}, = .05, H 0 : = –1, H 1 : ≠ –1.
- Run and report on a statistical test for the equality of means, with data x = {6, 1, 0, 7, 3}, y = {4, 1, 5, 0}, = .01.
- What are the null and alternative hypotheses for a test of the equality of means?
- Run and report on a statistical test for the mean where = .01, H 0 : = 17, H 1 : = 22, x^ = 19.7, s = 9.1, n = 24.
- Run and report on a statistical test for the mean where = .01, H 0 : = 17, H 1 : < 17, x^ = 15.4, s = 9.1, n = 28.
- Run and report on a statistical test for the mean where = .03, H 0 : = 17, H 1 : > 17, x^ = 19.7, s = 4.1, n = 28.
- Run and report on a statistical test for the mean where = .03, H 0 : = 17, H 1 : ≠ 17, x^ = 19.7, s = 4.1, n = 28.
- Run and report on a statistical test for the equality of means where = .05, H 0 : x = y, H 1 : x ≠ y,, x = 19.7, s = 4.1, nx = 38, y^ = 19.1, s = 2.9, ny = 49. ============================================================== Exam B2 things to know p = .900 .905 .910 .915 .920 .925 .930 .935 .940 .945. @qnorm(p) = 1.28 1.31 1.34 1.37 1.41 1.44 1.48 1.51 1.55 1.60 1. @tdist(p, 4) = 1.53 1.58 1.62 1.67 1.72 1.78 1.84 1.90 1.97 2.05 2. @tdist(p, 12) = 1.36 1.39 1.42 1.46 1.50 1.54 1.58 1.63 1.67 1.73 1. p = .955 .960 .965 .970 .975 .980 .985 .990. @qnorm(p) = 1.70 1.75 1.81 1.88 1.96 2.05 2.17 2.33 2. @tdist(p, 4) = 2.23 2.33 2.46 2.60 2.78 3.00 3.30 3.75 4. @tdist(p, 12) = 1.84 1.91 1.99 2.08 2.18 2.30 2.46 2.68 3. Unless specified otherwise, questions about confidence intervals refer to confidence intervals for the mean, with the variance known. Chap 10
- Give the definition of a consistent estimator.
- What is the formula for the Law of Large Numbers?
- In words, what does the Law of Large Numbers say?
- Explain why the Law of Large Numbers is true
- Give the definition of a biased estimator.
- Show that X is an unbiased estimator of .
7. Is ^
n i X i X n (^) 1
an unbiased estimator of ^2? If not, give its bias. Give the definition of the mean squared error of an estimator. Use the definition of the MSE of an estimator to calculate that the MSE is equal to the estimator’s variance plus its squared bias.
- Give the formula for S^2. Why do we use this estimator?
- What is a point estimate?
- What information does a confidence interval supply that point estimates do not?
- Why are confidence intervals useful?
- What are the two reasons why we often only need the N(0, 1) distribution when we constructed confidence intervals for X , when the variance was known?
- Describe what the quantile function does (i.e, it takes you from what to what?).
- What is the relation between the quantile function and the cdf?
- Suppose X ~ N(0, 1). Explain how we use the quantile function to translate our choice of a confidence level into a confidence interval for X.
- What is the formula for the lower bound of a confidence interval when the variance is known? Label the parts.
- What is the formula for the upper bound of a confidence interval when the variance is known? Label the parts.
- What is the formula for the lower bound of a confidence interval when the variance is unknown? Label the parts.
- What is the formula for the upper bound of a confidence interval when the variance is unknown? Label the parts.
- When the variance is known, what distribution do we refer to?
- When the variance is unknown, what distribution do we refer to?
- Practically speaking, what does the t-distribution do that the Normal does not?
- From which distribution, N(0,1) or T(8), are you more likely to produce a number larger than 3? Explain why.
- What are the boundaries of a confidence interval at the .99 confidence level if the variance in the 14 samples is known to be 6, and the sample mean was 8?
- The firm you work for has a rather poorly-thought-out policy that any estimations of the cost of a certain activity must be within a margin of error (interpreted as a confidence interval at some confidence level.) of no more than ± 10% of the estimated cost. A project that you’re working on has an average cost of $10,000, which was derived from a sample of all 11 known observations with a known standard deviation of 1,869. To comply with your company’s policy, what is the highest confidence level you may use? (Estimate this by using the chart above to give the two points which it must fall in between, or the extreme point if it is greater/less than all those given.)
- You work in an IT firm that specializes in a certain sort of information retrieval for corporations. Your boss has assigned you the task of estimating how long a certain sort of advanced retrieval takes on average. Since a major business is seriously considering an account with your company, your boss wants you to be “99 percent sure” that your estimate falls within a 10-point total spread that you will give him. The variance of these sorts of retrievals is known to be 4800. What is the least number of trials of the system that you would have to run to complete your task (using the techniques that this test covers)?
- Your colleague at work was gathering some data for you, and you noticed that she accidentally estimated the standard deviation as the square root of the average of the sum of the squared deviations from the sample mean. Her estimate was 270 for the 15 observations she collected. You wish to use an unbiased estimate of the sample variance. What is it (show your work)?
- Calculate a 99% confidence interval for a data set of 14 observations, whose mean is 6.2, and whose variance is known to be 7.
- Calculate a 99% confidence interval for a data set of 14 observations, whose mean is 6.2, and whose standard deviation is known to be 7.
- When constructing confidence intervals using the normal distribution, the book discusses two reasons for choosing a lower bound that is the negative value of the upper bound. What are these reasons?
- If you are using student’s T distribution with 11 degrees of freedom, how large was your sample?
- Calculate a 99% confidence interval for a data set of 14 observations, whose mean is 6.2, and whose variance is estimated to be 7.
- Calculate a 99% confidence interval for a data set of 14 observations, whose mean is 6.2, and whose standard deviation is estimated to be 7.
- Calculate a 95% confidence interval for the data set {3, 5, 2, 1, 3}. Treat your estimate of the variance as if it were known to be the true variance.
- Calculate a 95% confidence interval for the data set {3, 5, 2, 1, 3}. Recognize that your estimate of the variance is only an estimate.
- For a given data set, which is larger, the length of a 95% confidence interval, or the length of a 99% confidence interval? Explain your answer.
- Suppose two data sets x and y have identical means and variances, but x is made up of 17 observations, and y contains 20. Which will have a larger confidence interval? Explain your answer.
- For a given data set you construct 99% confidence intervals using first the standard normal distribution, and then using the appropriate t-distribution. Which is larger, and why?
- Draw the graph of the quantile function for N(0,1); include a numeric scale for both axes, and label the points on the graph for the quantiles corresponding to the probabilities: .005, .025, .05, .95, .975, and .995.
- If you were constructing a confidence interval for a collection of 17 data points (where the variance is unknown), what distribution exactly would you use?
- If you were constructing a 99% confidence interval for a collection of 17 data points (where the variance is unknown), what quantile would you have Eviews calculate for you? (Don’t just give the command; say in English what you would be asking Eviews to calculate.)
- How many times larger will your margin of error be if you switch from a 95% to 99% confidence level (show your work)?
- How many times larger will the length of your confidence interval be if you switch from a 90% to a 95% confidence level (show your work)?
- If your sample size is n, how much larger must you make your sample size if you want your margin of error to be cut in half? (Assume all other quantities remain fixed.)
- Since the quantile function is related to the cdf, it is a little unclear how it can be used to determine the upper quantile. Explain how this works.
- Suppose you constructed 200 95% confidence intervals for the mean from one population, using a sample of size 100. What is the expected number of intervals that would contain ? What is the expected number that would not? What is the formula for a confidence interval for a proportion? In a recent study of 3,000 Crackle.com visitors, 2,016 visitors watched Starship Troopers. Construct a 98% confidence interval for the true proportion of Crackle’s Starship Troopers viewers. Give two examples of where there would be great practical benefit in estimating a proportion and calculating a confidence interval, and explain why your examples are good ones.
- Give two examples of where there would be great practical benefit in estimating a mean (not a proportion) and calculating a confidence interval, and explain why your examples are good ones.
- Give an example of where there would be great practical benefit in estimating a mean (not a proportion) and calculating a confidence interval using a T-distribution, and explain why your examples are good ones.
- Describe a study (real or fictional) where there are at least three sources of error or uncertainty that are not accounted for by our methods for constructing confidence intervals.
- Sketch the graphs of the pdfs of N(0, 1) and T(5) on one set of axes. Label each graph, and numerically label the two axes.
- Give the formula for the margin of error (when the variance is known).
- Give the formula for the margin of error (when the variance is unknown).
- What happens to the length of a confidence interval if all quantities remain the same except that x is doubled? Calculate/explain your answer.
- What happens to the size of the margin of error if all quantities remain the same except that x^ is doubled? Calculate/explain your answer.
- Your 94% margin of error had a length of 7. What is the size of the margin of error if you change the confidence level to 98%?
- Your 94% confidence interval had a length of 7. What is the length of the confidence interval if you change the confidence level to 98%?
- Do exercise 10.3 in the book.
- Do exercise 10.7 in the book.
- Do exercise 10.10 in the book.
- Do exercise 10.11 in the book.
- Do exercise 10.12 in the book.
- Do exercise 10.13 in the book.
- Do exercise 10.14 in the book.
- Do exercise 10.15 in the book.
- Do exercise 10.16 in the book.
- Do exercise 10.17 in the book.
- Do exercise 10.22 in the book.
- Do exercise 10.23 in the book.
- Do exercise 10.24 in the book.
- The firm you work for has a rather poorly-thought-out policy that any estimations of the cost of a certain activity must be within a margin of error (interpreted as a confidence interval at some confidence level.) of no more than ± 15% of the estimated cost. A project that you’re working on has an average cost of $15,000, which was derived from a sample of all 13 known observations where the standard deviation was estimated to be 5,668. To comply with your company’s policy, what is the highest confidence level you may use? (Estimate this by using the chart above to give the two points which it must fall in between, or the extreme point if it is greater/less than all those given.)
- Suppose you have decided to gather some fixed number n of observations, and create a confidence interval at some confidence level c, where the variance of the underlying
population is known to be ^2. Before you even gather your n numbers, you already know how long the confidence interval will be. Explain why.
- Suppose you have decided to gather some fixed number n of observations, and create a confidence interval at some confidence level c, where the variance of the underlying population will be estimated. Before you gather your n numbers, don’t know how long your confidence interval will be. Explain why.
- Suppose you have decided to gather some fixed number n of observations, and create a confidence interval at some confidence level c, where the variance of the underlying population is known to be ^2. Before you even gather your n numbers, you already know how big your margin of error will be. Explain why.
- Suppose you have decided to gather some fixed number n of observations, and create a confidence interval at some confidence level c, where the variance of the underlying population will be estimated. Before you gather your n numbers, don’t know how big your margin of error will be. Explain why.
- If you have two data sets that yield identical estimates of the variance, but for the first one you used 20 observations and for the second you used 30, which one will have the larger margin of error? Why?
- Describe in words what a consistent estimator is.
- Give the mathematical formula that characterizes a consistent estimator.
- Describe in words what a biased estimator is.
- Describe in words what an unbiased estimator is.
- Give the mathematical formula that characterizes an unbiased estimator.
- Describe the role that the Central Limit Theorem plays in our construction of confidence intervals.
- Give the Eviews command for the quantile appropriate for a 98% confidence interval using 34 observations, when the variance is only estimated.
- Give the Eviews command for the quantile appropriate for a 98% confidence interval using 34 observations, when the variance is known. ========================================================= Exam B1 things to know Chap 08
- What do pdf and pmf stand for? What kinds of distributions do they respectively apply to?
- What conditions must a function f meet in order to be a pdf?
- For a given distribution, give the formula for expressing the cdf in terms of the pdf. Write a sentence about what this formula means.
- For a given distribution, give the formula for expressing the pdf in terms of the cdf. Write a sentence about what this formula means.
- Suppose X has a continuous distribution. What is pr(X = 0)? Explain your answer.
- Express pr(X < 6.2) in terms of the cdf F.
- Express pr(X > 6.2) in terms of the cdf F.
- Express pr(5.1 < X < 6.2) in terms of the cdf F.
- Express pr(X < 6.2) in terms of the pdf f.
- Express pr(X > 6.2) in terms of the pdf f.
- Express pr(5.1 < X < 6.2) in terms of the pdf f.
- Draw the and label (both axes of) the graphs of the pdf (together, on one set of axes) of U(0,1) and U(-1, 2).
- Draw and label (both axes of) the graphs of the cdf (together, on one set of axes) of U(0,1) and U(-1, 2).
- What is the pdf for the Uniform distribution on (a, b)?
- What is the cdf for the Uniform distribution on (a, b) (do not express this using integrals)?
- What is the pdf for N(, ^2 )?
- Draw and label the graphs of the pdf (together, on one set of axes) of N(0,1) and N(- 1, 2).
- Draw and label the graphs of the cdf (together, on one set of axes) of N(0,1) and N(-1, 2).
- Explain what the Central Limit Theorem says. Be sure to include the background conditions necessary for the theorem to hold, and the “limit statement” as presented in the book that gives the mathematical essence of the theorem.
- Give the limit formula in the Central Limit Theorem, not as it is presented in the book, but as a limit formula of a standardization of a random variable. In this formulation, what random variable is being standardized?
- If X ~ U(2, 7), calculate pr(X ≤ 4) using only the definitions of U(2, 7) and pr(X ≤ 4).
- If X ~ U(2, 7), calculate pr(X ≥ 4) using only the definitions of U(2, 7) and pr(X ≥ 4).
- If X ~ U(2, 7), calculate pr(3 ≤ X ≤ 4) using only the definitions of U(2, 7) and pr(3 ≤ X ≤ 4).
- Starting from the definitions of a uniform distribution and expectation, calculate E[X], if X ~ U(2, 7).
- Starting from the definition of a uniform distribution, calculate E[X] to the form we saw in class, if X ~ U(a, b). Starting from the definition of a uniform distribution, calculate the variance of X to the form we saw in class, if X ~ U(a, b).
- What command would you give if you wanted Eviews to calculate the probability that X < .2, if X ~ U(0, 1)?
- Suppose Y is the standardization of X (Y = X , where and are the mean and standard deviation of X). Calculate the mean of Y.
- Suppose Y is the standardization of X (Y = X , where and are the mean and standard deviation of X). Calculate the standard deviation of Y.
- If X is a continuous probability distribution, why doesn’t it matter whether you determine pr(X < 5.1) or pr(X ≤ 5.1)?
- What is the standardized skew of N(, ^2 )?
- What is the standardized kurtosis of N(, ^2 )?
- How much of the probability of an N(, ^2 ) distribution is within 1.96 standard deviations of the mean?
- If X ~N(, ^2 ), what is what is the probability that X ≤ ( + 1.64)?
- Why does the normal distribution occur frequently in nature?
- Give three examples of populations which are probably (nearly) normally distributed; for each one, write a sentence or two about why they probably are normally distributed.
- For N(, ^2 ), calculate the largest value of its pdf. Show also that any other value will be less than this number.
- What command would you use if you wanted Eviews to compute the probability that X < 3, if X~N(0, 1)?
- Do exercises (1) – (4) 8.3a in the book.
- Do exercise 8.4 in the book.
- Do exercise 8.5 in the book.
- Do exercise 8.6 in the book.
- Do exercise 8.10 in the book.
- If X ~ N(0, 1), then what is the distribution of Y = a + bX (b ≠ 0)?
- If X ~ N(, ^2 ), how would you transform it so that it is distributed as N(0, 1)?
- If X ~ N(, ^2 ), then what is the distribution of Y = a + bX (b ≠ 0)?
- If X ~ N(0, 1), then what is the distribution of Y = 2 + 3X?
- If X ~ N(, ), how would you transform it so that it is distributed as N(0, 1)?
- If X ~ N(, ), then what is the distribution of Y = 2 + 3X?
- For what values is the pdf of N(6, 4) greater than 0?
- Write a couple sentences explaining the differences between x , (^) X , and
- Calculate the value of E^ (^ X ) in terms of the mean of Xi, (i.e., like we did in class).
- Calculate the variance of (^) X in terms of the variance ^2 of Xi (i.e., like we did in class).
- If {X 1 ,…, X 25 } is a random sample from N(100, 25), then what is the variance of (^) X?
- If {X 1 ,…, X 25 } is a random sample from N(100, 25), then what is the standard deviation of X?
- Suppose X ~ U(0, 1). Which, if either, is more likely: .3 ≤ X ≤ .4, or .7 ≤ X ≤ .8? Explain your answer.
- Suppose X ~ U(0, 1). Which, if either, is more likely: .3 < X < .4, or .7 ≤ X ≤ .8? Explain your answer.
- Suppose X ~ U(0, 1). Which, if either, is more likely: .3 ≤ X ≤ .41, or .7 ≤ X ≤ .8? Explain your answer.
- Suppose X ~ N(0, 1). Which, if either, is more likely: .3 ≤ X ≤ .4, or .7 ≤ X ≤ .8? Explain your answer.
- Suppose X ~ N(0, 1). Which, if either, is more likely: .3 < X < .4, or .7 ≤ X ≤ .8? Explain your answer.
- Suppose X ~ N(0, 1). Which, if either, is more likely: –1 ≤ X ≤ 1, or 5 ≤ X ≤ 8? Explain your answer.
- What is the total area under the curve of the pdf of N(0, 1)?
- What is the total area under the curve of the pdf of U(2, 4)?
- What is E[X] if X ~ U(a, b)?
- What is E[(X – E[X])^2 ] if X ~ U(a, b)?
- What is E[X] if X ~ U(2, 7)?
- What is E[(X – E[X])^2 ] if X ~ U(2, 7)?
- What is E[X] if X ~ N(, ^2 )?
- What is E[(X – E[X])^2 ] if X ~ N(, )?
- What is E[X] if X ~ N(, ^2 )?
- What is E[(X – E[X])^2 ] if X ~ N(, )?
- The book gives two names for N(, ^2 ). What are they? Chap 09 Do exercise 9.2 in the book. Do exercise 9.4 in the book. Do exercise 9.13 in the book. Do exercise 9.16 in the book. Do exercise 9.17 in the book. ========================================================= 15 A Stuff Chapter 02
- What kind of thing is a random variable?
- Know what the four scales of measurement that we cover are (ratio, interval, ordinal, and categorical), and how to describe them. I.e., what is distinctive about each?
- How are the four scales of measurement that we cover related to one another (i.e., is one sort of scale always an instance of another sort of scale)?
- Give two examples of something that would be measured on a ratio scale, and write a sentence or two about why these are good examples of this scale.
- Give two examples of something that would be measured on an interval scale, and write a sentence or two about why these are good examples of this scale.
- Give two examples of something that would be measured on an ordinal scale, and write a sentence or two about why these are good examples of this scale.
- Give two examples of something that would be measured on a categorical scale, and write a sentence or two about why these are good examples of this scale.
- Do the same for indexical and time series scales. Chapter 03
- How would you determine the median of a data set?
- When is the median an element of a data set? When is it not?
- Describe how the box of a box-and-whisker plot is constructed.
- How far out do the whiskers extend?
- What are the outliers (if any) in a box-and-whisker plot?
- Know how to determine the three quartile points and the four quartiles of a box-and- whisker plot
- Know how to construct a box-and-whiskers plot (by hand) for a relatively small data set, e.g., {3, 4, 3, 2, 1, -8, 4, 3, 2, 0}.
- What kind of information does a box-and-whisker plot visually display?
- Give an example of when and how you might use a box-and-whisker plot.
- How are relative frequency plots different from cumulative frequency plots?
- Know how to sketch a relative frequency plot and a cumulative distribution plot (by hand) for a relatively small data set, e.g., 2% of students are under 18, 10% are 18, 20% are 19, 25% are 20, 20% are 21, 15% are 22, and 8% are over 22.
- What kind of information does a relative frequency plot visually display?
- Give an example of when and how you might use a relative frequency plot.
- Know how to sketch a cumulative distribution plot (by hand) for a relatively small data set, e.g., 2% of students are under 18, 10% are 18, 20% are 19, 25% are 20, 20% are 21, 15% are 22, and 8% are over 22.
- What kind of information does a cumulative distribution plot visually display?
- Give an example of when you might use a cumulative distribution plot.
- How are histograms constructed?
- What is the total area of a histogram?
- Know how to sketch a histogram (by hand) for a relatively small data set, e.g., {1, 1, 3, 2, 5, 5, 9, 2, 4, 2}
- What kind of information does a histogram visually display?
- Give an example of when you might use a histogram.
- How do histograms and relative frequency plots differ? What different kinds of information do they display? Chapter 04
- For m’ 1 , give: Its name
- For the mean, give: Its symbol or symbols, if there is more than one
- For m’ 1 , give: Its mathematical formula (using sigma notation)
- For m’ 1 , give: A one sentence characterization of what it measures
- For m’ 1 , give: A brief description of how you could make it larger by altering the data set.
- For m’ 1 , give: A brief description of how you could make it smaller by altering the data set.
- For m’ 1 , give: Using a calculator, how to calculate its value, in the way we do it in class (i.e., showing your work at each step), for a small data set, e.g., {3, 4, -5, 8, 4}, {12, 3, -7, 2}
- For m’ 1 , give: At least two “real life” examples where knowing that moment would provide you with valuable information. (Be sure to be able to explain why your examples are good ones.)
- When data (e.g., household incomes, annual profits) are grouped into categories, what is the cell mark of each category?
- For m 2 , give: Its name
- For the variance, give: Its symbol or symbols, if there is more than one
- For m 2 , give: Its mathematical formula (using sigma notation)
- For m 2 , give: A one sentence characterization of what it measures
- For m 2 , give: A brief description of how you could make it larger by altering the data set.
- For m 2 , give: A brief description of how you could make it smaller by altering the data set.
- What does a large value of m 2 indicate about a data set?
- What does a small value of m 2 indicate about a data set?
- For m 2 , give: Using a calculator, how to calculate its value, in the way we do it in class (i.e., showing your work at each step), for a small data set, e.g., {-3, 4, 5, 8, 4}, {12, 3, 7, -2}
- For m 2 , give: At least two “real life” examples where knowing that moment would provide you with valuable information. (Be sure to be able to explain why your examples are good ones.)
- For m 2^ , give: Its name
- For the standard deviation, give: Its symbol or symbols, if there is more than one
- For m 2^ , give: Its mathematical formula (using sigma notation)
- For m 2^ , give: A one sentence characterization of what it measures
- For m 2^ , give: A brief description of how you could make it larger by altering the data set.
- For m 2 , give: A brief description of how you could make it smaller by altering the data set.
- For m 2^ , give: Using a calculator, how to calculate its value, in the way we do it in class (i.e., showing your work at each step), for a small data set, e.g., {3, -4, 5, 8, 4}, {12, 3, -7, 2}
- For m 2^ , give: At least two “real life” examples where knowing that moment would provide you with valuable information. (Be sure to be able to explain why your examples are good ones.)
- For m 3 , give: Its name
- For the unstandardized skew, give: Its symbol or symbols, if there is more than one
- For m 3 , give: Its mathematical formula (using sigma notation)
- For m 3 , give: A one sentence characterization of what it measures
- For m 3 , give: A brief description of how you could make it larger by altering the data set.
- For m 3 , give: A brief description of how you could make it smaller by altering the data set.
- What does a large value of m 3 indicate about a data set?
- What does a small value of m 3 indicate about a data set?
- For m 3 , give: Using a calculator, how to calculate its value, in the way we do it in class (i.e., showing your work at each step), for a small data set, e.g., {3, 4, -5, 8, 4}, {12, 3, -7, 2}
- For m 3 , give: At least two “real life” examples where knowing that moment would provide you with valuable information. (Be sure to be able to explain why your examples are good ones.)
- Why do we often standardize m 3?
- For m 4 , give: Its name
- For the unstandardized kurtosis, give: Its symbol or symbols, if there is more than one
- For m 4 , give: Its mathematical formula (using sigma notation)
- For m 4 , give: A one sentence characterization of what it measures
- For m 4 , give: A brief description of how you could make it larger by altering the data set.
- For m 4 , give: A brief description of how you could make it smaller by altering the data set.
- What does a large value of m 4 indicate about a data set?
- What does a small value of m 4 indicate about a data set?
- For m 4 , give: Using a calculator, how to calculate its value, in the way we do it in class (i.e., showing your work at each step), for a small data set, e.g., {3, 4, -5, 8, 4}, {12, 3, -7, 2}
- For m 4 , give: At least two “real life” examples where knowing that moment would provide you with valuable information. (Be sure to be able to explain why your examples are good ones.)
- Why do we often standardize m 4?
- For ^ ˆ 1 , give: Its name
- For the standardized skew, give: Its symbol or symbols, if there is more than one
- For ^ ˆ 1 , give: Its mathematical formula (using sigma notation)
- For ^ ˆ 1 , give: A one sentence characterization of what it measures
- For ^ ˆ 1 , give: A brief description of how you could make it larger by altering the data set.
- For ^ ˆ 1 , give: A brief description of how you could make it smaller by altering the data set.
- For ^ ˆ 1 , give: Using a calculator, how to calculate its value, in the way we do it in class (i.e., showing your work at each step), for a small data set, e.g., {3, 4, 5, 8, 4}, {12, 3, 7, 2}
- For ^ ˆ 1 , give: At least two “real life” examples where knowing that moment would provide you with valuable information. (Be sure to be able to explain why your examples are good ones.)
- For ^ ˆ^2 , give: Its name
- For the standardized kurtosis, give: Its symbol or symbols, if there is more than one
- For ^ ˆ^2 , give: Its mathematical formula (using sigma notation)
- For ^ ˆ^2 , give: A one sentence characterization of what it measures
- For ^ ˆ^2 , give: A brief description of how you could make it larger by altering the data set.
- For ^ ˆ^2 , give: A brief description of how you could make it smaller by altering the data set.
- For ^ ˆ^2 , give: Using a calculator, how to calculate its value, in the way we do it in class (i.e., showing your work at each step), for a small data set, e.g., {3, 4, 5, 8, 4}, {12, 3, 7, 2}
- For ^ ˆ^2 , give: At least two “real life” examples where knowing that moment would provide you with valuable information. (Be sure to be able to explain why your examples are good ones.)
- How do you construct standardized scores?
- Know how to calculate the mean of a data set whose entries have been standardized.
- Know how to calculate the variance (here, m 2 ) of a data set whose entries have been standardized.
- What is the relation between ^ ˆ 1 and m 3?
- What is the relation between ^ ˆ^2 and m 4? Eviews
- Describe how you would use Eviews to construct a histogram.
- Describe how you would use Eviews to construct a box-and-whiskers plot.
- Describe how you would use Eviews to construct a cumulative distribution plot.
- Describe how you would use Eviews to construct a scatterplot.
- Describe how you would use Eviews to construct a correlation matrix between 3 series.
- Describe how you would use Eviews to construct a covariance matrix between 3 series.
- What would you write the white syntax bar if you wanted to restrict your sample to those observations for which the variable (series) x was greater than or equal to 6 but less than 12?
- What would you write in the white syntax bar if you wanted to create a new variable y that was related to an existing variable (series) by: y = 200 - .1x?
- What would you write in the white syntax bar if you wanted to know the mean of x?
- What would you write in the white syntax bar if you wanted to know the standard deviation of x? Chapter 05
- Calculate the covariance of the data set {(1, 0), (0, 1), (2, -1), (3, 1)}.
- Calculate the correlation of the data set {(1, 0), (3, 1), (0, -1), (1, 1)}.
- Standardize the data set {(1, 0), (3, 1), (0, -1), (1, 1)} and then calculate the correlation.
- If the standard deviation of x and y are 4 and 7 respectively, then what is the correlation between x and y if m 11 = 12?
- What are the possible values of the covariance?
- What are the possible values of r?
- What does it mean if r is large and positive?
- What does it mean if r is nearly zero?
- What does it mean if r is a large negative number (i.e., not close to zero)?
- What is the symbol for the correlation coefficient?
- Express the correlation coefficient in terms of moments.
- What is the symbol for the covariance?
- Using sigma notation, give the mathematical formula for the correlation coefficient.
- Using sigma notation, give the mathematical formula for the covariance.
- What precisely does the correlation coefficient measure?
- Give two examples where you would want to know the correlation between two variables.
- Explain the relationship between covariance and correlation in terms of standardized variables.
- Explain the relationship between covariance and correlation in terms of moments.
- Give an example of why “correlation does not imply causation”.
- Why can’t you take a sample of Californians and a sample of New Yorkers, and measure the correlation between the two groups’ salaries?
- Know how to do exercise 5.12 in the book.
- Know how to do exercise 5.13 in the book.
- Know how to do exercise 5.20 in the book.
- Know how to do exercise 5.22 in the book.
- Know how to do exercise 5.29 in the book. X=1 X = 2 Y = 1 5 2 Y = 2 4 1 Y = 3 1 7
- Calculate the joint probability that both X and Y are 2.
- Calculate the marginal probability that Y = 3.
- Calculate the probability that Y = 1 given that X = 2. X=1 X = 2 Y = 1 5 2 Y = 2 or 3 5 8
- Calculate the correlation between X and Y.
- Give the mathematical formula for^ ˆ^. Chapter 06
- What is the definition of a sample space?
- In a given trial or experiment, how many of the possible outcomes in the sample space can occur? How many must occur?
- What are the three axioms of probability?
- What is the general formula for the probability of event A or B occurring?
- What is the formula for the conditional probability of event A occurring, given that event B occurs?
- Give the definition of statistical independence of two events.
- Give the definition of statistical independence of two random variables.
- Give the definition of statistical independence of n random variables.
- Prove that if both A and B have non-zero probabilities, and if A is statistically independent of B, then B is statistically independent of A.
- What does i.i.d. stand for?
- Which, if any, of the purported assignment of probabilities to the outcomes in a sample space are legitimate? Why or why not? Table 1 Outcomes in the sample space E1 E2 E3 E4 E5 E6 E7 E pr 1 .2 .2 .1 .05 .05 .1 .3 0 pr 2 .3 .1 .1 .2 .1 .1 .1. pr 3 .3 .05 .04 .05 .1 .1 .1. pr 4 .2 .2 .3 .1 –.1 .1 .1. Table 2 Outcomes in the sample space E1 E2 E3 E4 E5 E6 E pr .2 .3 .1 .05 .05 .1. A = {E1, E2}, B = {E2, E3, E4}, C = {E6}, D = {E2, E7, E6}, F = {E1, E7}
- Using Table 2, show your work as you calculate pr(A or B) and pr(B or C).
- Using Table 2, show your work as you calculate pr(A and B) and pr(B and C).
- Using Table 2, show your work as you calculate pr(A|B) and pr(B|C).
- Using Table 2, show your work as you determine whether A and D are independent.
- Using Table 2, show your work as you determine whether A and F are independent.
- Do exercise 6.3 in the book.
- Do exercise 6.6 in the book.
- Do exercise 6.7 in the book.
- Do exercise 6.8 in the book.
- Do exercise 6.9 in the book.
- Do exercise 6.18 in the book.
- Do exercise 6.25 in the book.
- Do exercise 6.26 in the book.
- Do exercise 6.27 in the book.
- Do exercise 6.28 in the book.
- Construct a numerical example that shows a case where pr(A) < pr(B|A).
- Construct a numerical example that shows a case where pr(A) = pr(B|A).
- Construct a numerical example that shows a case where pr(A) > pr(B|A).
- Draw a diagram that displays a case where pr(A) < pr(A | B).
- Draw a diagram that displays a case where pr(A) = pr(A | B).
- Draw a diagram that displays a case where pr(A) > pr(A | B).
- Construct a numerical example that shows a case where pr(A|B) < pr(B|A).
- Construct a numerical example that shows a case where pr(A|B) = pr(B|A).
- Construct a numerical example that shows a case where pr(A|B) > pr(B|A).
- Draw a diagram that displays a case where pr(A|B) < pr(B|A).
- Draw a diagram that displays a case where pr(A|B) = pr(B|A).
- Draw a diagram that displays a case where pr(A|B) > pr(B|A).
- Write a sentence describing what Simpson’s paradox is (do not just give an example).
- Give a numerical example of Simpson’s paradox, and calculate the relevant values to show that it is an example.
- Explain the conditions that make Simpson’s paradox occur.
- Do exercise 6.36 in the book. Chapter 07
- Be able to calculate n! e.g., what is 12!?
- How many ways are there to order 10 of 10 objects?
- How many ways are there to order k of n objects (k < n)?
- How many ways are there to order 4 of 10 objects?
- How many ways are there to pick out (unordered) 6 objects from a group of 12?
- Give the mathematical formula for (^) k n .
- Calculate (^) 4 9 by hand (do the calculations explicitly).
- What does (^) k n express?
- Explain why, or simply calculate that: (^) ^ n k n k n .
- The next few questions refer to this set of (actual) students. {Ryan, Enrique, Mallory, Muhammad, Arsen, Kristine, Tevan, Adir, Mohammed}
- How many students are there?
- How many ways can these students be ordered?
- How many groups of 3 students are there?
- How many groups of 5 students are there that contain Arsen and Tevan?
- What is the probability of randomly selecting 4 students, and getting Muhammad and Mohammed in your group?
- Suppose you identify each student by the first letter of their first name (i.e., you don’t distinguish students whose first name starts with the same first letter). How many ways can these students be ordered now?
- There is a class of 20 with 13 boys and 7 girls. How many ways are there to form a committee of 3 boys and 3 girls?
- There is a class of 20 with 13 boys and 7 girls. A committee will be formed by randomly selecting 6 persons. What is the probability that the committee will have 3 boys and 3 girls?
- Give a real-life example where you would want to use the binomial coefficient (just to find a number, not to calculate a probability).
- Give a real-life example where you would want to calculate a probability using binomial coefficients (not in the context of a binomial distribution).
- Give the mathematical formula for the binomial distribution.
- In the formula for the binomial distribution, what are the possible values for p?
- In the formula for the binomial distribution, what are the possible values for n?
- In the formula for the binomial distribution, what are the possible values for k?
- Give two real-life examples where you would want to use the binomial distribution to calculate a probability.
- What are the three background assumptions regarding the underlying sample for the binomial distribution?
- What is the mathematical formula for the cumulative distribution function of the binomial distribution?
- What is the symbol for the mean of a probability distribution?
- What is the symbol for the variance of a probability distribution?
- What is the formula for the mean of a probability distribution?
- What is the formula for the variance of a probability distribution?
- Suppose pr(X = 1) = p, and pr(X = 0) = (1 – p). Calculate the mean of X.
- Suppose pr(X = 1) = p, and pr(X = 0) = (1 – p). Calculate the variance of X.
- For any given person, there is a 25% chance of having a certain inherited genetic trait. There are five unrelated persons in a room. Use the formula for the binomial distribution to calculate the probability that three of them have the trait.
- For any given person, there is a 25% chance of having a certain inherited genetic trait. There are five siblings in a room. Explain why you can’t use the binomial distribution to calculate the probability that three of them have the trait.
- What is the mean of the binomial distribution with parameters n and p?
- What is the mean of the binomial distribution with parameters 4 and .8?
- What is the variance of the binomial distribution with parameters n and p?
- What is the variance of the binomial distribution with parameters 4 and .8?
- Do exercise 7.3 in the book.
- Do exercise 7.4 in the book.
- Do exercise 7.9 in the book.
- Do exercise 7.10 in the book.
- Do exercise 7.18 in the book (assume all trials are independent).
- Do exercise 7.21 in the book
- Do exercise 7.22 in the book.
- Do exercise 7.23 in the book.
- What would you type into the white syntax bar on Eviews if you wanted to know the probability of getting 12 successes from a binomial distribution with parameters n = 30 and p = .47?
- What would you type into the white syntax bar on Eviews if you wanted to know the cumulative probability for 12 successes from a binomial distribution with parameters n = 30 and p = .47?
- Explain the difference between a probability mass function and a cumulative distribution function.
- Give the definition of the expectation of a (discrete) random variable X.
- Give the mathematical formula for the mean of a (discrete) random variable X.
- Give the mathematical formula for the variance of a (discrete) random variable X.
- What is the linearity of expectations?
- Use the linearity of expectations to calculate the mean of n B (^) p (i.e., solve for the expression of the mean given in class).
- Use the linearity of expectations to calculate the mean of 7 B. (^) 4.
- Use the linearity of expectations to calculate the variance of n B (^) p (i.e., solve for the expression of the mean given in class).
- Use the linearity of expectations to calculate the variance of 7 B. (^) 4.
- Calculate the mean of 3 B (^) p directly (do not use the linearity of expectations; solve for the expression of the mean given in class).
- Calculate the mean of 3 B. (^) 3 directly (do not use the linearity of expectations).
- Calculate the variance of 3 B (^) p directly (do not use the linearity of expectations; solve for the expression of the variance given in class).
- Calculate the variance of 3 B. (^) 3 directly (do not use the linearity of expectations).
- Use F, the cumulative probability distribution for a binomial distribution, to express the probability of getting k or fewer successes.
- Use F, the cumulative probability distribution for a binomial distribution, to express the probability of getting k or more successes.
- Explain how it is that (^) k n expresses the number of ways of selecting k objects (unordered) from a set of n objects.
- What question does (the mass function of the) binomial distribution address? X = 1 3 4 5
probability .3 .4 .2 .1
- Referring to the table above, calculate the mean of X.
- Referring to the table above, calculate the variance of X.
- If X, Y, and Z are i.i.d., and E[X] = 5, E[Y] = 7 and E[Z] = 2, then if W = 4 + 2X +3Y +Z, then what is for W?
- If X, Y, and Z are i.i.d., and E[(X – E[X])^2 ] = 2, E[(X – E[X])^2 ] = 1 and E[(X – E[X])^2 ] = 5, then if W = 4 + 2X +3Y +Z, then what is ^ for W?
- Let F(x) be the cumulative probability function for the distribution 8 B. (^) 4. Calculate F(3).
- Let F(x) be the cumulative probability function for the distribution 8 B. (^) 4. Calculate the probability of getting a 6 or higher.