Simple Linear Regression: Probabilistic Models and Least Squares Approach, Study notes of Data Analysis & Statistical Methods

An introduction to simple linear regression, focusing on probabilistic models and the least squares approach. It covers the general form of probabilistic models, the first-order (straight-line) probabilistic model, and the five steps of simple linear regression. The document also includes an explanation of fitting the model using the least squares approach, preliminary computations, and comparisons of observed and predicted values. Assumptions and an estimator of σ2 are also discussed.

Typology: Study notes

Pre 2010

Uploaded on 09/02/2009

koofers-user-k5v
koofers-user-k5v 🇺🇸

10 documents

1 / 2

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
11/26/2007
1
Chapter 11
Simple Linear Regression
Probabilistic Models
General form of Probabilistic Models
Y = Deterministic Com
p
onent + Random
2
p
Error
where
E(y) = Deterministic Component
Probabilistic Models
First Order (Straight-Line) Probabilistic Model
ε
β
β
++= xy 10
3
Probabilistic Models
5 steps of Simple Linear Regression
1. Hypothesize the deterministic component
2. Use sample data to estimate unknown model
4
parameters
3. Specify probability distribution of , estimate
standard deviation of the distribution
4. Statistically evaluate model usefulness
5. Use for prediction, estimatation, once model is
useful
ε
Fitting the Model: The Least
Squares Approach
Reaction Time versus Drug Percentage
Subject Amount of Drug x(%) Reaction Time y (second s)
11 1
22 1
33 2
4
4
2
5
4
4
2
55 4
Fitting the Model: The Least
Squares Approach
Least Squares Line has:
•Sum of errors (SE) = 0
•Sum of Squared errors (SSE) is smallest of all straight line
models
xy 10 ˆˆ
ˆ
ββ
+=
6
models
Formulas:
Slope: y-intercept
()()
()() ii
xy i i i i
x
y
SS x x y y x y n
=−=
∑∑
∑∑
1
ˆxy
xx
SS
SS
β
=01
ˆˆ
yx
β
β
=−
2
22
()
() i
xx i i
SS x x x n
=−=
∑∑
pf2

Partial preview of the text

Download Simple Linear Regression: Probabilistic Models and Least Squares Approach and more Study notes Data Analysis & Statistical Methods in PDF only on Docsity!

Chapter 11

Simple Linear Regression

Probabilistic Models

General form of Probabilistic Models

Y = Deterministic Component + Random

2

p

Error

where

E(y) = Deterministic Component

Probabilistic Models

First Order (Straight-Line) Probabilistic Model

y = β + β x + ε

3

Probabilistic Models

5 steps of Simple Linear Regression

  1. Hypothesize the deterministic component
  2. Use sample data to estimate unknown model

4

parameters

  1. Specify probability distribution of , estimate

standard deviation of the distribution

  1. Statistically evaluate model usefulness
  2. Use for prediction, estimatation, once model is

useful

Fitting the Model: The Least

Squares Approach

Reaction Time versus Drug Percentage

Subject Amount of Drug x (%) Reaction Time y (seconds) 1 1 1 2 2 1 3 3 2 4 4 2

5

4 4 2 5 5 4

Fitting the Model: The Least

Squares Approach

Least Squares Line has:

•Sum of errors (SE) = 0

•Sum of Squared errors (SSE) is smallest of all straight line

models

y x 0 1

6

models

Formulas:

Slope: y-intercept

i i xy i i i i

x y SS x x y y x y n

1

ˆ xy

xx

SS

SS

0 1

βˆ = y −βˆ x

2 2 2 (^ ) ( )

i xx i i

x SS x x x n

11/26/

2

Fitting the Model: The Least

Squares Approach

Preliminary Computations x i yi x^2 i x yi i 1 1 1 1 2 1 4 2 3 2 9 6 4 2 16 8 5 4 25 20 ∑ ∑

2 ∑ ∑

7

Totals (^) ∑ xi = (^15) ∑ yi = (^10) ∑ x 2 i^ = (^55) ∑ x yi i = 37

Comparing Observed and Predicted Values for the Least Squares Prediction Equation x y^ ˆ y^ = −.1 +.7 x ( y − ˆ y ) ( yy ˆ)^2 1 1 .6 .4. 2 1 1.3 -.3. 3 2 2.0 0.0. 4 2 2.7 -.7. 5 4 3.4 .6. Sum of Errors = 0 SSE = 1.

Model Assumptions

  1. Mean of the probability distribution of ε is 0
  2. Variance of the probability distribution of ε is constant

for all values of x

  1. Probability distribution of ε is normal

8

y

  1. Values of ε are independent of each other

An Estimator of σ

2

Estimator of σ

for a straight-line model

2

2

SSE SSE s Degrees of freedom for error n

= = −

9

eg ees of f eedom fo e o n

( )

( )

1

2 (^2 )

2

yy xy

i yy i i

SSE SS SS

y

SS y y y

n

s s Estimated Standard Errorof the Regression Model

= − β

∑ ∑

Assessing the Utility of the Model:

Making Inferences about the Slope β 1

Sampling Distribution of (^1) βˆ

σ

10

1

ˆ

xx

SS

β

σ =

Assessing the Utility of the Model: Making

Inferences about the Slope β 1

A Test of Model Utility: Simple Linear Regression

One-Tailed Test Two-Tailed Test

H 0 : β 1 =0 H 0 : β 1 =

11

H (^) a : β 1 <0 (or H (^) a : β 1 >0) H (^) a : β 1 ≠ 0

Rejection region: t< -tα

(or t< -tα when H (^) a : β 1 >0)

Rejection region: |t|> tα/

Where tα and tα/2 are based on (n-2) degrees of freedom

1

1 1

ˆ

xx

Test statistic t

s s SS

β

Assessing the Utility of the Model: Making

Inferences about the Slope β 1

A 100(1-α)% Confidence Interval for β 1

where 1 2 ˆ

ˆ β ± t s

12

1

t s α (^) β

β ±

1

ˆ

xx

s s

SS

β

=