Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Community
Ask the community for help and clear up your study doubts
Discover the best universities in your country according to Docsity users
Free resources
Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors
Material Type: Assignment; Professor: Jennings; Class: Applied Regression Analysis; Subject: STAT-Statistics; University: Purdue University - Main Campus; Term: Fall 2008;
Typology: Assignments
1 / 2
Statistics 512: Problem Set No. 3 Due September 19, 2008
1 V
θ 1
θ 2 θ 1
Concentration Rate 0.02 76 0.02 47 0.06 97 0.06 107 0.11 123 0.11 139
Concentration Rate 0.22 159 0.22 152 0.56 191 0.56 201 1.10 207 1.10 200
(a) Generate a scatterplot of V vs C. Comment on the shape. (b) Define new variables for (^) V^1 and (^) C^1 in SAS, and generate a scatterplot of the new variables. Does the fit appear linear? Do any assumptions appear to be violated? The new variables can be defined as follows (if the dataset original contains the raw data): data reaction; set original; vinv = 1/v; cinv = 1/c; (c) How is the distribution of (^) C^1 different from the distribution of C? Are there any points that may be more influential in determining the fit? (d) Determine the least squares regression line for (^) V^1 vs (^) C^1. Save the residuals and predicted values. Does the residual plot suggest any problems? (e) Convert this regression line back into the original nonlinear model and plot the predicted curve on a scatterplot of V vs C. Comment on the fit. To generate the predicted curve, simply take the predicted values from the regression model and “re-invert” them. For example, suppose results is the data set containing the residuals and predicted values (variable pred). data invert; set results; predv = 1/pred; symbol1 v = circle i = none c = black; symbol2 v = plus i = sm5 c = red; proc gplot data = invert; plot vc predvc / overlay;
For the next 3 questions, use the grade point average data described in the text with Problem 1..
(a) Verify that the sum of the residuals is zero by running proc univariate with the output from the regression. (b) Plot the residuals versus the explanatory variable and briefly describe the plot noting any unusual patterns or points. (c) Plot the residuals versus the order in which the data appear in the data file and briefly describe the plot noting any unusual patterns or points. (d) Examine the distribution of the residuals by getting a histogram and a normal prob- ability plot of the residuals by using the histogram and qqplot statements in proc univariate. What do you conclude?
(a) Make a table comparing the results of this analysis with the results of the analysis of the original data. Include in the table the following: fitted equation, t-test for the slope, with standard error and p-value, R^2 , and the estimate of σ^2. Summarize the differences. (b) Repeat parts (b), (c), and (d) from the previous problem and explain how these plots help you to detect the unusual observation.