

Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Material Type: Assignment; Class: + Dis >4; Subject: Mathematics; University: University of Oregon; Term: Unknown 1989;
Typology: Assignments
1 / 3
This page cannot be seen from the preview
Don't miss anything!


Fact 1 (Simpson’s paradox). It is possible for one individual to outperform another in every category measured yet to not perform as well in the aggregate.
Example 2. Let us look at flight delays for two of our local carriers, Alaska Airlines and America West Airlines, the former of which has a hub in Seattle, the latter in Phoenix. At their hubs we have the following data:
It is simple to see how Simpson’s paradox works if we look at a simple enough example. Suppose Dick and Jane both take MA 243 and (somehow!) negotiate negotiate different weightings to compute their final grades. Dick has HW count 10% and the final exam 90%, and Jane has HW count 90% and the final exam count 10%. They get A and A-, respectively, on their HW, and C and C- on their final exams, respectively. So Dick has scored better on both. But he ends up with a C+ and Jane with a B+. Some useful terminology, if thinking in terms of percentages: there are two ways to take an average, a weighted average which depends on the sample sizes, and a “straight” average of percentages (which really is not so straight). The weighted average is the one which calculates the true percentage, but it is susceptible to Simpson’s paradox. A “straight” average behaves predictably in this way, but the final answer depends on the categories by which the data has been broken down.
3.1. Gathering data. There are two basic methods of gathering data:
Definition 3. In an observational study one observes individuals and measures variables of those indi- viduals.
If the study is to lead to conclusions about the overall population there are a two things that must be considered:
Example 4. Phoning 1000 randomly chosen residential phone numbers during the workday, one asks for the answer to two variables, age, and how many hours of TV the subject watches per day. If the phone is not answered, one calls the next number. What are possible problems?
Understanding what problems may arise in collecting data is an artform best learned in the discipline or setting in which you are working.
Definition 5. In an experimental study one treats a group of individuals in a particular way with the goal of discovering the effect of that treatment.
Experimental studies can be fraught with difficulties. If this study is to lead to conclusions about the efficacy of a treatment one must
Example 6. The “placebo effect” and other bias in medical studies, and the need for “double-blind” protocols.
We will discuss this further later. Summarizing our discussions of data collection: To get good conclusions, one must be careful about how one is gathering data and what might be BIAS ing the sample
Example 7. A research firm phones 100 clients of acupuncturists to ask if their treatment has improved their health. What can one learn from this experiment about the efficacy of acupuncture? (Efficacy vs. satisfaction)
If you really want to measure acupuncture’s efficacy, you need to take a group of people and randomly assign half of them to get treated by acupuncture, and half to be untreated (as a control group if you want to compare acupuncture to no treatment) or treated by western medicine (as a control group if you wanted to compare acupuncture to western medicine). This study would be even better if the patients didn’t know whether they were being treated via acupuncture or in the control group.
Example 8. A local Eugene TV station asks callers to phone in during the news to say if they favor the Whole Foods development deal. What can one learn from this poll?