


Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
A university homework assignment focused on estimating with dags in advanced data analysis. Students are required to identify parents and children in a graphical model, list smallest collections of marginal and conditional distributions, determine associations between variables, and use conditional independence to specify regressions. The assignment includes multiple-choice questions and data analysis tasks.
Typology: Exercises
1 / 4
This page cannot be seen from the preview
Don't miss anything!



This homework will illustrate some of the advantages of using a known DAG structure. You will need to read the lectures on graphical models carefully in order to do it. Figure 1 is an elaboration of the graph used in lectures. All problems refer to it, unless otherwise specified. The file fake-smoke.csv contains some (synthetic) data, for use in problem
Parents and children (10 points)
(a) (5 points) For each variable in the model, list its parents; or, if it has no parents, say so. (b) (5 points) For each variable in the model, list its children. (Some variables have no children.)
(a) Yellowing of teeth and cancer? (b) Yellowing of teeth and cancer, controlling for smoking? (c) Yellowing of teeth and cancer, controlling for occupational prestige? (d) Yellowing of teeth and cancer, controlling for smoking and exposure to asbestos? (e) Smoking and cancer, controlling for the amount of tar in the lungs? (f) Asbestos and cancer, controlling for cellular damage? (g) Smoking and cancer, controlling for asbestos? (h) Smoking and asbestos, controlling for cellular damage? (i) Tar in lungs and cancer, controlling for asbestos, smoking, and yel- lowing of teeth?
Figure 1: Graphical model for use in all problems, except part of the last. Signs on arrows indicate the sign of the associations (not necessarily linear) between parents and children.
Figure 2: An alternative DAG for the same variables.