Classical Multiple Linear Regression Model: Assumptions and Notations - Prof. David M. Aad, Study notes of Econometrics and Mathematical Economics

Class notes on chapter 2 of econ 5340, covering the classical multiple linear regression model. The model is introduced with its mathematical representation, and six data-generating assumptions are discussed. These assumptions include linearity, full rank, mean-zero errors, spherical disturbances, nonstochastic regressors, and normality.

Typology: Study notes

Pre 2010

Uploaded on 08/19/2009

koofers-user-mnp
koofers-user-mnp 🇺🇸

10 documents

1 / 3

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
ECON 5340 Class Notes
Chapter 2. The Classical Multiple Linear Regression Model
1Introduction
The multiple linear regression model can be written as
yi=β1xi1+β2xi2+... +βkxik +ii=1, ..., n (1)
where yiis the dependent variable, xij is the jth explanatory variable and iis the error term.
There are kexplanatory variables.
There are nobservations.
xi1is often set equal to one, i=1,...,n,soβ1is an intercept.
The βsarecoecients or parameters to be estimated.
Using matrices, model (1) can be written more compactly as
Y= +(2)
where Yis an n×1column vector of dependent variables, Xis an n×kmatrix of explanatory variables, β
is a k×1column vector of parameters and is an n×1column vector of errors.
For e xa mp le :
Y=
y1
y2
.
.
.
yn
X=
x11 x12 ··· x1k
x21 x22 ··· x2k
.
.
..
.
..
.
.
xn1xn2··· xnk
β=
β1
β2
.
.
.
βk
.
2 Data-Generating Assumptions
There are six data-generating assumptions associated with model (1) or (2).
1
pf3

Partial preview of the text

Download Classical Multiple Linear Regression Model: Assumptions and Notations - Prof. David M. Aad and more Study notes Econometrics and Mathematical Economics in PDF only on Docsity!

ECON 5340 Class Notes

Chapter 2. The Classical Multiple Linear Regression Model

1 Introduction

The multiple linear regression model can be written as

yi = β 1 xi 1 + β 2 xi 2 + ... + β k xik + i i = 1, ..., n (1)

where yi is the dependent variable, xij is the j th explanatory variable and i is the error term.

  • There are k explanatory variables.
  • There are n observations.
  • xi 1 is often set equal to one, i = 1, ..., n, so β 1 is an intercept.
  • The βs are coefficients or parameters to be estimated.

Using matrices, model (1) can be written more compactly as

Y = Xβ +  (2)

where Y is an n × 1 column vector of dependent variables, X is an n × k matrix of explanatory variables, β

is a k × 1 column vector of parameters and  is an n × 1 column vector of errors.

For example:

Y =

y 1

y 2

yn

X =

x 11 x 12 · · · x 1 k

x 21 x 22 · · · x 2 k

xn 1 xn 2 · · · xnk

β =

β 1

β 2

. . .

β k

2 Data-Generating Assumptions

There are six data-generating assumptions associated with model (1) or (2).

2.1 Linearity

The model must take the form of (1) so that it is linear in the parameters (β) and the error term ().

  • The model need not be linear in the Xs or Y s. However, it must be transformable into a form such

as (1).

  • For example, after taking natural logs, yi = exp(β 1 xi + i) can be transformed into ln(yi) = β 1 xi + i,

which is in the form of (1) with an appropriate redefining of y.

2.2 Full Rank

The columns of X need to be linearly independent and there must be at least k observations.

  • In other words, Rank(X) = k.

2.3 Mean-Zero Errors

Conditional on X, the error terms are mean zero.

  • In other words, E[|X] = 0.
  • This implies that E[Y |X] = Xβ, (i.e., the regression of Y on X is the conditional mean Xβ).
  • Including a constant term will guarantee this assumption holds. Assume yi = β 0 + β 1 xi + i has the

property E[i] = μ 6 = 0. By redefining the constant term, 

∗ i =^ i^ −^ μ, and intercept^ β

∗ 0 =^ β 0 +^ μ, the

model can be written with mean-zero errors, yi = β

∗ 0 +^ β 1 xi^ +^ 

∗ i.

2.4 Spherical Disturbances

The error terms should display homoscedasticity (i.e., error variances are constant across observations)

and no autocorrelation (i.e., errors are uncorrelated across observations).

  • Homoscedasticity: V ar[i] = σ

2 for all i = 1, ..., n.

  • No autocorrelation: Cov[i, j ] = 0 for all i 6 = j.
  • Matrix representation: V ar[] = E[ 0 ] =

V ar[ 1 ] Cov[ 2 ,  1 ] · · · Cov[n,  1 ]

Cov[ 1 ,  2 ] V ar[ 2 ] · · · Cov[n,  2 ]

Cov[ 1 , n] Cov[ 2 , n] · · · V ar[n]

= σ 2 In.