Prepare for your exams
Get points
Guidelines and tips
Sell on Docsity
Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Earn points to download

Earn points by helping other students or get them with a premium plan

Guidelines and tips

Sell on Docsity

Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Find documents

Prepare for your exams with the study notes shared by other students like you on Docsity

Search for your university

Find the specific documents for your university's exams

Docsity AINEW

Summarize your documents, ask them questions, convert them into quizzes and concept maps

Explore questions

Clear up your doubts by reading the answers to questions asked by your fellow students

Earn points to download

Earn points by helping other students or get them with a premium plan

Share documents

20 Points

For each uploaded document

Answer questions

5 Points

For each given answer (max 1 per day)

All the ways to get free points

Get points immediately

Choose a premium plan with all the points you need

Study Opportunities

Choose your next study program

Get in touch with the best universities in the world. Search through thousands of universities and official partners

Community

Ask the community

Ask the community for help and clear up your study doubts

Free resources

Our save-the-student-ebooks!

Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors

Exam Applied Statistical Regression Good Luck!, Lecture notes of Statistics

International University Institute of Luxembourg (IUIL)Statistics

alternative hypothesis, the test statistics or the critical values. ... A multiple regression model of the following form is fitted to a data set.

Typology: Lecture notes

2021/2022

Uploaded on 08/05/2022

char_s67 🇱🇺

4.5

(116)

1.9K documents

1 / 11

This page cannot be seen from the preview

Don't miss anything!

Dr. M. Dettling Summer 2011

Exam Applied Statistical Regression

Approved: Any written material, calculator (without communication facility).

Tables: Attached.

Note: All tests have to be done at the 5%-level.

If the question concerns the significance of a factor (or similar) and

if nothing else is indicated, you don’t need to give the null- and

alternative hypothesis, the test statistics or the critical values.

Exercise 1 is a multiple-choice exercise. In each sub-exercise, exactly

one answer is correct. A correct answer adds 1 plus-point and a

wrong answer 1

2minus-point. You get a minimum of 0 points for

the whole multiple-choice exercise. Tick the correct answer to the

multiple choice exercises in the separately added answer sheet. Do

not stay too long at a part where you experience a lot of difficulties.

Good Luck!

Discover Lecture notes of Statistics International University Institute of Luxembourg (IUIL)

Partial preview of the text

Download Exam Applied Statistical Regression Good Luck! and more Lecture notes Statistics in PDF only on Docsity!

Dr. M. Dettling Summer 2011

Exam Applied Statistical Regression

Approved: Any written material, calculator (without communication facility). Tables: Attached. Note: All tests have to be done at the 5%-level. If the question concerns the significance of a factor (or similar) and if nothing else is indicated, you don’t need to give the null- and alternative hypothesis, the test statistics or the critical values. Exercise 1 is a multiple-choice exercise. In each sub-exercise, exactly one answer is correct. A correct answer adds 1 plus-point and a wrong answer 12 minus-point. You get a minimum of 0 points for the whole multiple-choice exercise. Tick the correct answer to the multiple choice exercises in the separately added answer sheet. Do not stay too long at a part where you experience a lot of difficulties.

Good Luck!

(8 points)

A multiple regression model of the following form is fitted to a data set.

Yi = β 0 + β 1 · xi, 1 + β 2 · xi, 2 + β 3 · xi, 3 + β 4 · xi, 4 + εi, εi ∼ N (0, σ^2 ) i.i.d.

The model is fitted using the software R and the following summary output is obtained.

Coefficients: Estimate Std. Error t value Pr(>|t|) (Intercept) ??? 0.1960 8.438 3.57e- x1 5.3036 2.5316 ??? 0. x2 4.0336 2.4796 1.627 0. x3 -9.3153 2.4657 -3.778 0. x4 0.5884 2.2852 0.257 0.

Residual standard error: 1.892 on 95 degrees of freedom Multiple R-squared: 0.1948,Adjusted R-squared: ??? F-statistic: 5.745 on 4 and 95 DF, p-value: 0.

What is the value of the t-statistics of βˆ 1? a) 0.099 b) 13.43 c) 2.095 d) 0.
How many observations are in the data set? a) 100 b) 99 c) 96 d) 95
Has the null hypothesis H 0 : β 3 = 0 to be rejected on a 5% level? a) Yes b) No c) No answer possible.
What is the estimate of the intercept βˆ 0? a) 1.654 b) 0.324 c) 43.051 d) 1.
What is the estimate of V ar(i). a) 1.892 b) 3.579 c) 1.375 d) 9.
Which of the following intervals is a two-sided 95% confidence interval for β 3? a) − 9. 315 ± 1. 99 · 0. 00028 b) − 9. 315 ± 1. 99 · 2 √.^46695 c) − 9. 315 ± 1. 99 · 0.^00028 √ 95 d) − 9. 315 ± 1. 99 · 2. 466
Have a look at the residual plots. Are the model assumptions on the i fullfilled and if not, what is the main problem? a) Yes. b) No, since leverage points exist. c) No, since the assumption of constant variance of the εi is violated. d) No, since the εi are dependent.

(6 points)

Consider the following scatterplot:

0.0 0.5 1.0 1.

z= z=

The different symbols in the plot correspond to the values of two different groups. The response variable y and the covariable x are continuous, the indicator variable z ∈ { 0 , 1 } encodes the respective group membership.

a) The covariables x and z are interacting. Explain! b) Are x and z correlated? Explain! c) What model would you fit to these data? Write down a model equation? A linear model has been fit to the above data. The R-output is given as follows:

Coefficients: Estimate Std. Error t value Pr(>|t|) (Intercept) 1.31215 0.08485 15.464 < 2e- x 1.09296 0.14606 7.483 9.37e- z -1.25344 0.21241 -5.901 8.84e- x:z -0.35241 0.20656 -1.706 0.

Residual standard error: 0.2766 on 78 degrees of freedom Multiple R-squared: 0.7755,Adjusted R-squared: 0. F-statistic: 89.82 on 3 and 78 DF, p-value: < 2.2e-

d) What are the estimated regression lines for the two groups? e) Is it statistically nesessery to fit two regression lines with different slopes? Motivate your answer.

We repeat the regression analysis but without interaction of x and z.

Coefficients: Estimate Std. Error t value Pr(>|t|) (Intercept) 1.40026 0.06814 20.550 < 2e- x 0.91675 0.10452 8.771 2.72e- z -1.57061 0.10400 -15.103 < 2e-

Residual standard error: 0.28 on 79 degrees of freedom Multiple R-squared: 0.7671,Adjusted R-squared: 0. F-statistic: 130.1 on 2 and 79 DF, p-value: < 2.2e-

f ) Which quantities in the R-Output can be used to compare the two models?

(11 points)

The swiss military carried out a study in order to analyze which soldiers are fit enough to join the special force team AAD10. In this regard, the dependent binary variable (y) reflects state of fitness of a soldier. y = 1 means that the soldier is fit enough for the special force team AAD10, whereas y = 0 indicates that the soldier is not fit enough. The following predictor variables were used for the analysis:

x1: The soldiers age (in years older than 18)
x2: The body mass index
x3: The average amount of sport/exercise per week (in hours)

a) Write down the logistic regression model for this case. b) Look at the following R-Output. Formally, which predictors have a significant influence on the response? Coefficients: Estimate Std. Error z value Pr(>|z|) (Intercept) -15.5543 7.2946 -2.132 0. X1 -0.5859 0.3569 ??? ??? X2 0.5643 0.3317 ??? ??? X3 1.9639 0.8800 ??? ???

(Dispersion parameter for binomial family taken to be 1)

Null deviance: 27.526 on ??? degrees of freedom Residual deviance: 14.177 on 16 degrees of freedom AIC: ???

Number of Fisher Scoring iterations: 6 c) How many observations were used in this logistic regression? d) What are the odds for y = 1 if x 2 is increased by 1 and the other predictors remain the same? e) Estimate the probability for y = 1 with x 1 = 3, x 2 = 25 and x 3 = 2. What would be your prediction for y in this case? f ) We have x 1 = 5 and x 2 = 25. Which value do we have to choose for x 3 in order to get a probability of 50% for y = 1? g) Now we calculate the logistic regression without the predictor variable x 1. Coefficients: Estimate Std. Error z value Pr(>|z|) (Intercept) -7.60061 4.43762 -1.713 0. X2 0.08727 0.14484 0.603 ??? X3 1.53255 0.68010 2.253 ???

(Dispersion parameter for binomial family taken to be 1)

Null deviance: 27.526 on ??? degrees of freedom Residual deviance: 18.158 on 17 degrees of freedom AIC: ???

Which of the two models from above would you prefere concerning AIC? Motivate your answer.

age80+ 2.82207 0.11372 24.816 < 2e-16 *** smokeyes 0.41044 0.04096 10.021 < 2e-16 ***

(Dispersion parameter for poisson family taken to be 1)

Null deviance: 1789.071 on 17 degrees of freedom Residual deviance: 12.661 on 8 degrees of freedom AIC: 153.

Number of Fisher Scoring iterations: 4

d) Compute the fitted value of the first observation. e) The effect of smoke is significant. According to the fitted model, how much more likely is it that a randomly chosen smoking pearson dies from lung cancer in comparison to a randomly chosen non-smoking pearson, given that both belong to the same age group? f ) If there are 436 person in the age group 75–79 with smoke status “yes”, how many of them do you expect to die from lung cancer according to the model? g) Consider the interval [191. 01 , 240 .16]. Is it plausible that this interval is a 95% prediction interval for the number in f )? Explain. (Hint: A Poisson distribution with parameter λ > 100 is well approximated by a normal distribution.)

Table of the cumulative Normal distribution Φ(z) = P [Z ≤ z] , Z ∼ N (0, 1)

 - Bsp.: P [Z ≤ 1 .96] = 0.

z | .00 .01 .02 .03 .04 .05 .06 .07 .08.
.0 | 0.5000 0.5040 0.5080 0.5120 0.5160 0.5199 0.5239 0.5279 0.5319 0. --------------------------------------------------------------------------------------
.1 | 0.5398 0.5438 0.5478 0.5517 0.5557 0.5596 0.5636 0.5675 0.5714 0.
.2 | 0.5793 0.5832 0.5871 0.5910 0.5948 0.5987 0.6026 0.6064 0.6103 0.
.3 | 0.6179 0.6217 0.6255 0.6293 0.6331 0.6368 0.6406 0.6443 0.6480 0.
.4 | 0.6554 0.6591 0.6628 0.6664 0.6700 0.6736 0.6772 0.6808 0.6844 0.
.5 | 0.6915 0.6950 0.6985 0.7019 0.7054 0.7088 0.7123 0.7157 0.7190 0.
.6 | 0.7257 0.7291 0.7324 0.7357 0.7389 0.7422 0.7454 0.7486 0.7517 0.
.7 | 0.7580 0.7611 0.7642 0.7673 0.7704 0.7734 0.7764 0.7794 0.7823 0.
.8 | 0.7881 0.7910 0.7939 0.7967 0.7995 0.8023 0.8051 0.8078 0.8106 0.
.9 | 0.8159 0.8186 0.8212 0.8238 0.8264 0.8289 0.8315 0.8340 0.8365 0.
1.0 | 0.8413 0.8438 0.8461 0.8485 0.8508 0.8531 0.8554 0.8577 0.8599 0.
1.1 | 0.8643 0.8665 0.8686 0.8708 0.8729 0.8749 0.8770 0.8790 0.8810 0.
1.2 | 0.8849 0.8869 0.8888 0.8907 0.8925 0.8944 0.8962 0.8980 0.8997 0.
1.3 | 0.9032 0.9049 0.9066 0.9082 0.9099 0.9115 0.9131 0.9147 0.9162 0.
1.4 | 0.9192 0.9207 0.9222 0.9236 0.9251 0.9265 0.9279 0.9292 0.9306 0.
1.5 | 0.9332 0.9345 0.9357 0.9370 0.9382 0.9394 0.9406 0.9418 0.9429 0.
1.6 | 0.9452 0.9463 0.9474 0.9484 0.9495 0.9505 0.9515 0.9525 0.9535 0.
1.7 | 0.9554 0.9564 0.9573 0.9582 0.9591 0.9599 0.9608 0.9616 0.9625 0.
1.8 | 0.9641 0.9649 0.9656 0.9664 0.9671 0.9678 0.9686 0.9693 0.9699 0.
1.9 | 0.9713 0.9719 0.9726 0.9732 0.9738 0.9744 0.9750 0.9756 0.9761 0.
2.0 | 0.9772 0.9778 0.9783 0.9788 0.9793 0.9798 0.9803 0.9808 0.9812 0.
2.1 | 0.9821 0.9826 0.9830 0.9834 0.9838 0.9842 0.9846 0.9850 0.9854 0.
2.2 | 0.9861 0.9864 0.9868 0.9871 0.9875 0.9878 0.9881 0.9884 0.9887 0.
2.3 | 0.9893 0.9896 0.9898 0.9901 0.9904 0.9906 0.9909 0.9911 0.9913 0.
2.4 | 0.9918 0.9920 0.9922 0.9925 0.9927 0.9929 0.9931 0.9932 0.9934 0.
2.5 | 0.9938 0.9940 0.9941 0.9943 0.9945 0.9946 0.9948 0.9949 0.9951 0.
2.6 | 0.9953 0.9955 0.9956 0.9957 0.9959 0.9960 0.9961 0.9962 0.9963 0.
2.7 | 0.9965 0.9966 0.9967 0.9968 0.9969 0.9970 0.9971 0.9972 0.9973 0.
2.8 | 0.9974 0.9975 0.9976 0.9977 0.9977 0.9978 0.9979 0.9979 0.9980 0.
2.9 | 0.9981 0.9982 0.9982 0.9983 0.9984 0.9984 0.9985 0.9985 0.9986 0.
3.0 | 0.9987 0.9987 0.9987 0.9988 0.9988 0.9989 0.9989 0.9989 0.9990 0.
3.1 | 0.9990 0.9991 0.9991 0.9991 0.9992 0.9992 0.9992 0.9992 0.9993 0.
3.2 | 0.9993 0.9993 0.9994 0.9994 0.9994 0.9994 0.9994 0.9995 0.9995 0.
3.3 | 0.9995 0.9995 0.9995 0.9996 0.9996 0.9996 0.9996 0.9996 0.9996 0.
3.4 | 0.9997 0.9997 0.9997 0.9997 0.9997 0.9997 0.9997 0.9997 0.9997 0.

Exam Applied Statistical Regression Good Luck!, Lecture notes of Statistics

Related documents

Partial preview of the text

Download Exam Applied Statistical Regression Good Luck! and more Lecture notes Statistics in PDF only on Docsity!

Exam Applied Statistical Regression

Good Luck!

Table of the cumulative Normal distribution Φ(z) = P [Z ≤ z] , Z ∼ N (0, 1)