


Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Solutions to the stat 571 - second midterm exam held on november 20, 2007. The exam covers topics such as levene's test, t-test, mann-whitney test, sample size calculation, and hypothesis testing. Students are expected to understand concepts related to statistical analysis, variance, mean, and confidence intervals.
Typology: Exams
1 / 4
This page cannot be seen from the preview
Don't miss anything!



Name:
For instructorās use:
Total 100
(a) As a first step in the analysis, the researcher performs Leveneās test to determine whether the variances of the two groups might be assumed equal. His computed statistic works out to be 2.367. Determine the appropriate degrees of freedom for the test, and compute the p-value associated with this statistic. What would you conclude about the variances of the two groups?
(b) Based on your conclusion from (a), perform an appropriate two-sided t-test to compare the mean sizes of the two groups. The summary statistics are as follows: n 1 = 10, n 2 = 13, s 1 = 8.06, s 2 = 2.05, x¯ 1 = 86.65, x¯ 2 = 90.30. Compute the statistic, the p-value, and make a conclusion using α = 0.01. Note: the adjusted degree of freedom is 9.90, in case you wish to use it.
(a) The figure below shows two data sets, with boxplots and dotplots superposed. With data set 1 (top panel) the Mann-Whitney gives a p-value p = 0.028 and the t-test test gives p = 0.006. With data set 2 (bottom panel), the Mann-Whitney test p-value is smaller than 0. 028 0. 028 larger than 0. 028 The t-test p-value is smaller than 0. 006 0. 006 larger than 0. 006 Justify briefly.
a
b
l l l l
l l l l
data set 1
a
b
10.0 10.5 11.0 11.5 12.0 12.
l l l l
l l l l
data set 2
(b) Suppose now that the Mann-Whitney test returns p < 0 .05 while the t-test returns p > 0 .05. Which of ādata set 3ā (top panel) and ādata set 4ā (bottom panel) would give such results? data set 3 (top) data set 4 (bottom). a
b
l l l l
l l l l
data set 3
a
b
10.0 10.5 11.0 11.5 12.0 12.5 13.
l l l l
l l l l
data set 4
(c) For data set 3, which test would be most appropriate: Mann-Whitney test t-test? Why?
For data set 4, which test would be most appropriate: Mann-Whitney test t-test? Why?
(b) RNA expression of gene āpolluxā is measured in 20 lotus plants. From each plant, two RNA expres- sions are measured: one from a root extract and one from a leaf extract. Which specific test would you use to know if āpolluxā is expressed at a different level in roots and in leaves? Explain your choice, but donāt do any calculation. You may make assumptions, but say what they are.
(a) If you are able to assume the data is normally distributed, then a confidence interval will always be computed using an appropriate quantile from the standard normal distribution. true false
(b) As α increases, the length of a 100 ā (1 ā α)% confidence interval will decrease. true false
(c) If you assume the variance is known, the confidence interval will have a shorter length than if you assume the variance is unknown, in the case where Ļ happens to be equal to s. true false
For the following three questions, additionally assume normality, independence, and known vari- ance. The ānot-rejection regionā is defined as those values of the sample mean that would not cause rejection of the null hypothesis. (d) The not-rejection region for an α = 0.05 level test will contain exactly the same numbers as a 95% confidence interval based on a particular set of observed data. true false
(e) If you perform an hypothesis test by creating a 100 ā (1 ā α)% CI around the sample mean of a particular set of observed data, and reject if the value of μ under the null hypothesis is not inside the interval, this would give exactly the same result (in terms of reject/not-reject) as performing an hypothesis test by computing a test statistic and p-value, and rejecting if the p-value < α. true false
(f) The length of the not-rejection region for an α-level test is the same as the length of a 100 ā (1 ā α)% confidence interval based on a particular set of observed data. true false