Practice Homework 9 - Statistical Methods for Research | STAT 401, Assignments of Statistics

Material Type: Assignment; Class: STAT METH FOR RSRCH; Subject: STATISTICS; University: Iowa State University; Term: Unknown 1989;

Typology: Assignments

Pre 2010

Uploaded on 09/02/2009

koofers-user-0xg
koofers-user-0xg 🇺🇸

9 documents

1 / 2

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
Stat 401, Section F Homework 9
Due Date: Wednesday, October 31
1. The rowan (Sorbus aucuparia) is a tree that grows in a wide range of altitudes. To study how the tree
adapts to its varying habitats, researchers collected twigs with attached buds from 12 trees growing at
various altitudes in North Angus, Scotland. The buds were brought back to the laboratory and measure-
ments were made on the dark respiration rate. The average altitude from the origin for the 12 trees is
433.33 meters. The average respiration rate computed for the 12 trees is 0.21 µl oxygen per hour per
mg dry weight of tissue. The standard deviation of the altitude for the 12 trees is 214.62 meters and for
respiration rate is 0.077 µl/hr-mg. The correlation between altitude from the origin and dark respiration
rate was 0.887.
(a) We will use linear regression to describe the relationship between altitude and dark respiration.
Which of the two variables would most naturally be considered the explanatory variable and which
would be the response variable?
(b) Compute the equation of the least-squares regression line.
(c) Compute a 95% confidence interval for the slope of the regression line.
(d) Estimate the mean respiration rate for a tree growing at 300 meters above the origin.
(e) Do you think your estimate in part (d) involved interpolation or extrapolation? Explain.
(f) Estimate the standard deviation of mean dark respiration rate for trees growing 300 meters above
the origin.
2. It is generally thought that the percentage of fruits attacked by codling moth larvae is greater on apple
trees bearing a small crop. Apparently the density of the flying moths is unrelated to the size of the crop
on a tree, so the chance of attack for any particular fruit is increased if few fruits are on the tree. Data
on a sample of 10 trees gives a sample linear correlation coefficient of -0.8. Other summary statistics
obtained from the sample of 10 trees are provided below.
Variable Sample Mean Sample Standard Deviation
Crop Size (number of fruit) 110 40
Percentage of Wormy Fruits 45 12
(a) Find the equation of the least-squares regression line for predicting percentage of wormy fruits from
crop size.
(b) Write down the ANOVA table for the simple linear regression of percentage of wormy fruit on crop
size. Compute the Fstatistic and find a p-value.
(c) Based on the Fstatistic and p-value that you computed in part (b), would you say the slope of the
regression line is significantly different from 0?
(d) Find a 99% confidence interval for β1, the slope of the least squares regression line for predicting
percentage of wormy fruit from crop size.
(e) What proportion of the variability in percentage of wormy fruit is explained by the regression of
percentage of wormy fruit on crop size?
(f) Estimate the mean percentage of wormy fruits for trees with a crop size of 150 fruit.
(g) Provide a 95% confidence interval for the mean percentage of wormy fruits for trees with a crop
size of 150 fruit.
1
pf2

Partial preview of the text

Download Practice Homework 9 - Statistical Methods for Research | STAT 401 and more Assignments Statistics in PDF only on Docsity!

Stat 401, Section F Homework 9 Due Date: Wednesday, October 31

  1. The rowan ( Sorbus aucuparia ) is a tree that grows in a wide range of altitudes. To study how the tree adapts to its varying habitats, researchers collected twigs with attached buds from 12 trees growing at various altitudes in North Angus, Scotland. The buds were brought back to the laboratory and measure- ments were made on the dark respiration rate. The average altitude from the origin for the 12 trees is 433.33 meters. The average respiration rate computed for the 12 trees is 0.21 μl oxygen per hour per mg dry weight of tissue. The standard deviation of the altitude for the 12 trees is 214.62 meters and for respiration rate is 0.077 μl/hr-mg. The correlation between altitude from the origin and dark respiration rate was 0.887.

(a) We will use linear regression to describe the relationship between altitude and dark respiration. Which of the two variables would most naturally be considered the explanatory variable and which would be the response variable? (b) Compute the equation of the least-squares regression line. (c) Compute a 95% confidence interval for the slope of the regression line. (d) Estimate the mean respiration rate for a tree growing at 300 meters above the origin. (e) Do you think your estimate in part (d) involved interpolation or extrapolation? Explain. (f) Estimate the standard deviation of mean dark respiration rate for trees growing 300 meters above the origin.

  1. It is generally thought that the percentage of fruits attacked by codling moth larvae is greater on apple trees bearing a small crop. Apparently the density of the flying moths is unrelated to the size of the crop on a tree, so the chance of attack for any particular fruit is increased if few fruits are on the tree. Data on a sample of 10 trees gives a sample linear correlation coefficient of -0.8. Other summary statistics obtained from the sample of 10 trees are provided below.

Variable Sample Mean Sample Standard Deviation Crop Size (number of fruit) 110 40 Percentage of Wormy Fruits 45 12

(a) Find the equation of the least-squares regression line for predicting percentage of wormy fruits from crop size. (b) Write down the ANOVA table for the simple linear regression of percentage of wormy fruit on crop size. Compute the F statistic and find a p-value. (c) Based on the F statistic and p-value that you computed in part (b), would you say the slope of the regression line is significantly different from 0? (d) Find a 99% confidence interval for β 1 , the slope of the least squares regression line for predicting percentage of wormy fruit from crop size. (e) What proportion of the variability in percentage of wormy fruit is explained by the regression of percentage of wormy fruit on crop size? (f) Estimate the mean percentage of wormy fruits for trees with a crop size of 150 fruit. (g) Provide a 95% confidence interval for the mean percentage of wormy fruits for trees with a crop size of 150 fruit.

(h) Suppose a tree with 150 fruit is randomly selected from all trees with 150 fruit. Provide an interval that will contain the percentage of wormy fruit for this tree with 90% confidence.

  1. Consider the data described in problem 23 of Chapter 8 on wine consumption and heart disease mortality in 18 countries. Let X denote the wine consumption in liters per person per year. Let Y denote the heart disease mortality in deaths per 1000 persons. You can find the data set wine.txt on the course web site.

(a) Examine scatter plots of Y on X, log(Y ) on X, Y on log(X), and log(Y ) on log(X). For each plot, state whether a linear relationship looks appropriate or inappropriate. (b) Give the equation of the least-squares regression line for the most appropriate of these four regres- sions relating Y to X. (There are several good choices possible here, you will receive full credit so long as you justify your choice correctly.) (c) Write a statement that provides an interpretation of the slope of the most appropriate least-squares regression line.

  1. Suppose a researcher is studying the effect of a chemical on the concentration of a certain bacteria. She prepares 20 identical bacteria cultures in 20 petri dishes. She randomly assigns each of 5 chemical concentrations (1,2,4,8,16 mg/L) to 4 petri dishes. A bacteria count is obtained for each dish after 24 hours of exposure to the chemical. The researchers notices that the log of the bacteria count seems to be linearly associated with the log of the chemical concentration. The equation of the least-squares regression line for predicting log bacteria count from log chemical concentration is determined to be log(̂ Y ) = 12 − 1 .7 log(X). (Note the least squares regression line gives predictions log(̂ Y ) of log(Y ) on the log scale, and not predictionsof Y_. Also, “_ log ” here refers to the natural log, which is ”ln” on your calculator.) The 95% confidence interval for β 1 was determined to be -0.2 to -3.2.

(a) Fill in the blanks in the following sentence. A doubling of the chemical concentration was estimated to cause a multiplicative change of (95% confidence interval to ) in the median bacteria count. (b) Estimate the median bacteria counts for chemical concentrations of 6 and 12 mg/L, respectively. Show that these estimates are consistent with your statement in part (a). Hint: Transform estimates on the log(Y ) back to the original Y scale using e_._