





Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Concepts of Multiple Linear Regression.
Typology: Lecture notes
1 / 9
This page cannot be seen from the preview
Don't miss anything!






Professor
School of Industrial and Systems Engineering
Learning Objectives:
The model parameters are: !!, !", … , !#, σ^2
Data : !!,!, … , !!,# , $! , … , !$,!, … , !$,# , $$ Model : $% = && + &!!%,! + &'!%,' + ⋯ + &#!%,# + )%, * = 1 , … , , Assumptions :
Response Y=
Error ; =
Coefficients : =
Data : !!,!, … , !!,# , $! , … , !$,!, … , !$,# , $$ Model : $% = && + &!!%,! + &'!%,' + ⋯ + &#!%,# + )%, * = 1 , … , ,
1 st^ Order Interaction Model: & = ($ + (%% + (&& + ('%& + + Model with Interactions: Response Surface 2 nd^ Order Interaction Model: & = ($ + (%% + (&& + ('%& + ((%^ &^ + ()&^ &^ + + Simple Linear Regression : Linear regression with one quantitative predicting variable ANOVA : Linear regression with one or more qualitative predicting variables Multiple Linear Regression : Multiple quantitative and qualitative predicting variables
Multiple Linear Regression : Multiple quantitative/qualitative predicting variables x 1 quantitative x 2 qualitative with three levels: D 1 , D 2 , and D 3 dummy variables Model: > = :( + :)?) + :@) + :+@ + ; Intercept varies
If x 1 x 2 interaction: Nonparallel regression lines
Advertisement
The response variable is: Y = State average SAT score (verbal and quantitative combined) The predicting variables are: X 1 = % of total eligible high school seniors in the state who took the exam X 2 = Median income of families of test takers, in hundreds of dollars X 3 = Average number of years that test takers had in social sciences, natural sciences, and humanities X 4 = % of test takers who attended public schools X 5 = State expenditure on secondary schools, in hundreds of dollars per student X 6 = Median percentile of ranking of test takers within their secondary school classes
Bike sharing systems are of great interest due to their important role in traffic management. Dataset: Historical data for years 2011 - 2012 for the bike sharing system in Washington D.C.
The response variable is: Y = Hourly count rentals of bikes Qualitative predicting variables: X 1 = Day of the week X 2 = Month of the year X 3 = Hour of the day (ranging 0-23) X 4 = Year (2011, 2012) X 5 = Holiday Indicator X 6 = Weather condition (with four levels from good weather for level 1 to severe condition for level 4) Quantitative predicting variables: X 7 = Normalized temperature X 8 = Normalized humidity X 9 = Wind speed
Year: A quantitative or a qualitative predicting variable?
Qualitative predicting variables: X 1 = Day of the week X 2 = Month of the year X 3 = Hour of the day (ranging 0-23) X 4 = Year (2011, 2012) X 5 = Holiday Indicator X 6 = Weather condition (with four levels from good weather for level 1 to severe condition for level 4) Quantitative predicting variables: X 7 = Normalized temperature X 8 = Normalized humidity X 9 = Wind speed