


















































































Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
A set of lecture notes from math 243, covering topics such as scatterplots, regression lines, correlation, and the relationship between explanatory and response variables. The notes include examples and calculations, as well as explanations of key concepts.
Typology: Study notes
1 / 90
This page cannot be seen from the preview
Don't miss anything!



















































































N. Christopher Phillips
9 April 2009
The regression line depends on which variable is the explanatory variable. See Example 5.3 in the book for an example with real data. Here is a more dramatic example with fictitious data.
Data: (2, 4), (5, 10), (8, 4). (Again, just three points.)
0 2 4 6 8 10
2
4
6
8
10
12
Data: (2, 4), (5, 10), (8, 4). The correlation is r = 0.
Data: (2, 4), (5, 10), (8, 4). The correlation is r = 0. So the regression line has slope zero, and turns out to be
̂ y = 6.
Exchange the explanatory and response variables. The data was: (2, 4), (5, 10), (8, 4). It is now: (4, 2), (10, 5), (4, 8).
0 2 4 6 8 10 12
2
4
6
8
10
Data with explanatory and response variables switched: (4, 2), (10, 5), (4, 8).
Data with explanatory and response variables switched: (4, 2), (10, 5), (4, 8). The correlation is still r = 0. So the regression line again has slope zero, and turns out to be ̂ y = 5.
0 2 4 6 8 10 12
2
4
6
8
10
Data: (2, 4), (5, 10), (8, 4).
Both regression lines on the same plot, with the original choice of explanatory variable:
0 2 4 6 8 10
2
4
6
8
10
12
Here are the scatterplots from the lecture of 7 April, with regression lines added. (Example 9, with its poor choice of scale on the vertical axis, has been omitted.)
Here are the scatterplots from the lecture of 7 April, with regression lines added. (Example 9, with its poor choice of scale on the vertical axis, has been omitted.)
Observe that sometimes the line has little to do with the pattern in the scatterplot, and other times it is closely related.
The regression line fits the data reasonably well.
Correlation r ≈ 0. 675563
r 2 ≈ 0 .456386: The change in height explains about 46% of the change in weight.
Scatterplot: height is x, GPA is y.
60 65 70 75 80
1
2
3
4
Regression line: ̂y ≈ 3. 84956 − 0. 0105858 x.
2 4 6 8 10 12 14
2
4
6
8
10
12
14
Clear positive association, roughly linear. Regression line: ̂y ≈ 1 .63308 + 0. 789497 x. Correlation r ≈ 0. 915883 r 2 ≈ 0 .838842: The change in x explains about 84% of the change in y.
2 4 6 8 10 12 14
2
4
6
8
10
12
14
Clear positive association, roughly linear, one outlier. (The other points are as in Example 1.)