


Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
An assignment for a geosciences course (geos 585a) focusing on multiple linear regression. Students are required to use the geosa11.m program to select time series, preprocess data, and perform stepwise entry of predictors. The assignment includes analyzing model statistics, residuals, and assessing assumptions such as distribution of residuals and autocorrelation. Students must identify significant predictors and interpret results.
Typology: Assignments
1 / 4
This page cannot be seen from the preview
Don't miss anything!



Menus appear allowing you 1) select time series to be used as predictand and predictors ,
You will then begin an interactive modeling process that consists of stepwise entry of predictors one-by-one, with examination of model statistics and residuals analysis at each step. Note that at each step you must press “being modeling or add another predictor” , followed by “review results.” For this exercise, run the model out to 4 steps. Then press “Quit.” You should have 4 predictors are in your final model.
You will have four Figure windows. Answer the following questions and turn in your answers with printouts of the figure windows.
regression residuals? Does your analysis of residuals suggest that the assumption is satisfied? What can you say about the distribution of residuals for the extreme case of zero explanatory power of regression? (A “trick” question)
residuals violate this assumption. Refer the Portmanteau statistic, Durbin-Watson statistic, and visual appearance of acf of residuals in your answer.
Fig 4. Residuals analysis 2: autocorrelation of residuals. The top plot is a time series plot of residual. This plot is useful in pointing out possible trend in residuals over time, as well as tendency of large residuals to cluster. At lower left is a scatterplot of residuals at time t against residuals at time t-1, Ideally this scatterplot shows no dependence. A linear pattern might indicate first-order autocorrelation of residuals. At lower right is the acf of the residuals. Ideally, the acf is close to zero at lags. Annotated below the plots are the Portmanteau statistic and the Durbin-Watson test results.
geosa11.m relies heavily user-written functions, including:
armawht1 -- prewhitens time series with AR model crospul2 – builds pointer to rows of time series matrix for cross-validation lagyr3 – builds a matrix of lagged predictors menudm1 – miscellaneous menu function dwstat –Durbin-Watson statistic acf – autocorrelation function portmant – Portmanteau statistic rederr – reduction-or-error statistic stepvbl1 —stepwise entry of variables based on ability to reduce residual variance durbinwt.mat – lookup table for significance of D-W statistic sepred2 – standard error of prediction hatmtx – “hat matrix” mce1 – minimum coverage ellipsoid
Many of the above functions are not used until assignment 12, which brings cross-validation into the regression model.