
Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Instructions for completing problem set 6 in a statistics 512 course, which involves using sas software to perform regression analysis on given data sets. The problems cover creating a new variable, running a regression, selecting the best subset of variables using the cp criterion, checking assumptions, and comparing regression lines for two different populations.
Typology: Assignments
1 / 1
This page cannot be seen from the preview
Don't miss anything!

Statistics 512: Problem Set No. 6 Due October 17, 2008
For the following 3 problems use the computer science data that we have been dis- cussing in class. You can get a copy of the data set csdata.dat from the class website. The variables are: id, a numerical identifier for each student; GPA, the grade point average after three semesters; HSM; HSS; HSE; SATM; SATV, which were all explained in class; and GENDER, coded as 1 for men and 2 for women.
(a) Give the equation of the fitted regression line using all six explanatory variables. (b) Give the fitted regression line for women (use part a). (c) Give the fitted regression line for men (use part a).
DO NOT attempt to run proc reg on a subset of the data to answer this question.
A testing laboratory with equipemtn that simulates highway driving studies for two makes (A, B) of a certain type of truck tire the relation between operating cost per mile (Y ) and cruising speed (X 1 ). The observations are given in CH11PR15.DAT, where the columns are ordered (Yi, Xi, 1 , Xi, 2 ) where Xi, 2 = 1 for Make A and Xi, 2 = 0 for Make B. An engineer now wishes to decide whether or not the regression of operating cost on cruising speed is the same for the two makes of tires. Assume the error variances for the two makes are the same and that an interaction-effect regression model is appropriate.