Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Community
Ask the community for help and clear up your study doubts
Discover the best universities in your country according to Docsity users
Free resources
Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors
practice Material Type: Notes; Professor: Unger; Class: Methods of Applied Statistics; Subject: Statistics; University: University of Illinois - Urbana-Champaign; Term: Spring 2012;
Typology: Study notes
1 / 17
STAT 420 Fall 2012
Alex obtains a random sample of seven students from STAT 100 class he taught in
Fall Semester of 2007 and wants to use it to see if there is a relationship between
the number of absences and students’ final grade. The data and the scatterplot are
given below.
Number of
Final Grade
Σ x^ = 56,^ Σ y^ = 504,^ Σ x^
= 556, (^) Σ y
= 38,458, (^) Σ x y = 3,600,
Σ (^ x^ –^ x^ )^
= 108, (^) Σ ( y – y )
= 2,170, (^) Σ ( x – x ) ( y – y ) = (^) Σ ( x – x ) y = – 432.
i
0
1
i
i
i
2
a) Find the equation of the least-squares regression line.
b) What proportion of observed variation in final grade percentage is explained by a
straight-line relationship with the number of absences?
regression line?
d) Use the F-test to test the hypothesis that absences from class do not affect the final grade
0
1
1
1
at a 5% significance level. Report the value of the test statistic, critical value(s), and
decision.
e) Use the t-test to test the hypothesis that absences from class do not affect the final grade
0
1
1
1
at a 5% significance level. Report the value of the test statistic, critical value(s), and
decision.
f) Alex claims that each absence lowers the student’s final grade percentage by at least 5
5 percentage points (on average). Test his claim at a 5% significance level. That is, test
0
1
1
1
– 5 at a 5% significance level. Report the value of the
test statistic, critical value(s), and decision.
g) A student claims that the average final grade percentage for the students without absences
0
0
1
0
90 at a 5%
significance level. Report the value of the test statistic, critical value(s), and decision.
h) Construct a 90% confidence interval for the average final grade percentage for a
student who missed 7classes.
sample of equal-size plots. The amount of fertilizer and the yields are:
Plot
Amount of
Fertilizer
(tons)
Yield
(hundreds
of bushels)
2
2
( )
2 ∑ x i −^ x =^ 12.5,^ ∑ (^ xi^ −^ x )^ (^ yi − y )^ =∑(^ xi − x )^ yi =^8 ,^ (^ )^
2
a) The agronomist is interested in predicting yield. What is the dependent variable?
The independent variable?
b) Draw a scatter plot.
c) Find the equation of the least-squares regression
line. Add the regression line to the scatter plot.
d)* Find the sample correlation coefficient.
Consider the model
i
0
1
i
i
where Y is yield,
i
2
f) What proportion of the observed variation in yield is explained by a straight-line
relationship with the amount of fertilizer?
g) Use the regression line to predict the yield for a plot of land with 7 tons of
fertilizer.
h) Use the regression line to predict the yield for a plot of land with 3.5 tons of
fertilizer.
i) Which prediction ( part (g) or part (h) ) is more reliable? Explain.
j) Compute
2
1
l) Is there enough evidence to claim that each additional ton of fertilizer increases the
0
1
1
1
(i) Use a 10% level of significance. (ii) Use a 5% level of significance.
0
1
significance.
0
0
1
0 ≠ 5 at a 10% level of significance.
the number of television commercials broadcast and the sales of its product. The data,
obtained from 5 different cities, are shown in the following table.
Number of TV
Commercials
Sales Units
Σ x^ = 30,^ Σ y^ = 65,^ Σ x^
= 200, (^) Σ y
= 925, (^) Σ x y = 420,
Σ (^ x^ –^ x^ )^
= 20, (^) Σ ( y – y )
= 80, (^) Σ ( x – x ) ( y – y ) = (^) Σ ( x – x ) y = 30.
2
a) Find the equation of the least-squares regression line. Add the least-squares regression
line to the scatter plot.
b) In Anytown, 20 commercials aired. What is your prediction of the sales? Why is it
regression line?
d) What proportion of the observed variation in the sales is explained by a straight-line
relationship with the number of television commercials for the product?
f) Test for the significance of the regression at a 5% level of significance. That is, test
0
1
commercials.
0
1
0
1
Alex obtains a random sample of seven students from STAT 100 class he taught in
Fall Semester of 2007 and wants to use it to see if there is a relationship between
the number of absences and students’ final grade. The data and the scatterplot are
given below.
Number of
Final Grade
Σ x^ = 56,^ Σ y^ = 504,^ Σ x^
= 556, (^) Σ y
= 38,458, (^) Σ x y = 3,600,
Σ (^ x^ –^ x^ )^
= 108, (^) Σ ( y – y )
= 2,170, (^) Σ ( x – x ) ( y – y ) = (^) Σ ( x – x ) y = – 432.
i
0
1
i
i
i
2
a) Find the equation of the least-squares regression line.
1
0 1
b) What proportion of observed variation in final grade percentage is explained by a
straight-line relationship with the number of absences?
0 1
2
Sum: 0 442
RSS
SSRegression = SXX
2
2
2 = − = − SYY
regression line?
e
2 = = −
d) Use the F-test to test the hypothesis that absences from class do not affect the final grade
0
1
1
1
at a 5% significance level. Report the value of the test statistic, critical value(s), and
decision.
0
1
1
1
Regression ∑ (^ −^ )
2
i
∑ (^ −^ )
2
Total ∑ (^ −^ )
2
α
e) Use the t-test to test the hypothesis that absences from class do not affect the final grade
0
1
1
1
at a 5% significance level. Report the value of the test statistic, critical value(s), and
decision.
0
1
1
1
α / 2
α / 2
f) Alex claims that each absence lowers the student’s final grade percentage by at least 5
5 percentage points (on average). Test his claim at a 5% significance level. That is, test
0
1
1
1
– 5 at a 5% significance level. Report the value of the
test statistic, critical value(s), and decision.
0
1
1
1
α
g) A student claims that the average final grade percentage for the students without absences
0
0
1
0
90 at a 5%
significance level. Report the value of the test statistic, critical value(s), and decision.
0
0
1
0
( )
108
2 2
0 00
α
h) Construct a 90% confidence interval for the average final grade percentage for a
student who missed 7classes.
y | x
( )
2
2
α
2 −
( )
2
2
α
2 −
e) (^) Plot A B C D E F G H I
Res. – 0.22 – 0.36 0.64 0.32 – 1 0.68 – 0.64 0.86 – 0.
2
j)
2
0
0
0
0
Commercials
Sales Units
Σ x^ = 30,^ Σ y^ = 65,^ Σ x^
= 200, (^) Σ y
= 925, (^) Σ x y = 420,
Σ (^ x^ –^ x^ )^
= 20, (^) Σ ( y – y )
= 80, (^) Σ ( x – x ) ( y – y ) = (^) Σ ( x – x ) y = 30.
a)
( )( )
( )
2
( )( )
( ) 100
2 2 2
⋅
⋅ ⋅
The least-squares regression line: y ˆ^ = 4 + 1.5 ⋅ x.
c)
2
⇒ (^) Σ ( y – y ˆ )
2
SSRegr = (^ )^
2 2 2
i
⇒ (^) Σ ( y – y ˆ )
2
( )
2 2
2
or ( )
5
2 =? (coefficient of determination)
( )
( )
2
2
∑ (^ − )
2 2
f) Test Statistic:
( )
2
e
i
Rejection Region:
0
Do NOT Reject H
Regression ∑ (^ −^ )
2
Residuals ∑^ (^ −^ )
2
Total ∑ (^ −^ )
2
Rejection Region:
0 if F > F
Do NOT Reject H
( )
∑ (^ − )
2
2
2
i
e
2 −
h) x = 8. ⇒ y ˆ^ = 4 + 1.5 ⋅ x = 4 + 1.5 ⋅ 8 = 16.
Test Statistic:
( )
( )
2
2
2
e
i
Rejection Region:
0
Reject H
i) Test Statistic:
( )
( )
2
2
2
i
e
Rejection Region:
0
Do NOT Reject H