







Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
An overview of the fundamentals of experimental design in statistics, focusing on topics such as randomization, blocking, and permutation tests. It covers the importance of experimental design, desirable criteria, common challenges, and the role of randomization in decreasing uncertainty and protecting against unforeseen error patterns. The document also includes examples and explanations of randomization principles and methods.
Typology: Study notes
1 / 13
This page cannot be seen from the preview
Don't miss anything!








This topic will cover
Treatments, units, and assignment methods specify an experimental design.
controllable factors x 1 x 2... xp ↓ ↓ · · · ↓
(fixed) inputs −→ Process or System (Black Box) −→ response(s)
↑ ↑ · · · ↑ z 1 z 2... zp uncontrollable factors nuisance factors/inherent noise
Because X 1 = X 2 does not imply Y 1 = Y 2 May be:
Puzzler: which model is more general/more useful?
A good design must
Statistical expertise can help by fixing up some common mistakes, chiefly confounding (more later).
Replication – Each treatment is applied to experimental units that are repre- sentative of the population of units to which the conclusions of the experiment will apply. Repetition – Like replication, except that measurement is done on the same experimental unit.
Blinding – Evaluators of a response do not know which treatment was given to which unit. Double-blinding – Both evaluators of the response and the experimental units do not know the assignment of treatment to units.
Control – Treatment is a “standard” treatment that is used as a baseline or base of comparison for other treatments. Placebo – A null treatment used when the act of applying a treatment – any treatment – has an effect.
Good design lets you estimate amount of variability due to each source.
Two influences on a response are confounded if the design makes it impossible to isolate the effects of one from the effects of another.
Selection bias occurs in observational studies when the process of selecting groups to be compared confounds the effects of interest with other effects.
How long does it take for a car’s brakes to stop it, from say, 50 miles per hour?
Blatant confounding
Compare Mercedes and minivans. Do 10 Mercedes trials (on wet pavement). Do 10 minivan trials (on dry pavement). May see differences, but can’t tell why.
Subtle confounding
Compare wet and dry pavement, for minivans. While one driver does 10 trials on wet pavement, another does 10 trials on dry pavement.
More subtle confounding
Compare wet and dry pavement, for minivans, on driver. First do 10 trials on dry pavement, then do 10 trials on wet pavement. Could be confounded with run order.
Common Example: Cannot randomly assign people to smoke or not. Thus, there is little strictly valid evidence that smoking is harmful.
.. .
Example
Raw Trt Ui Rank Run Trt 1 1 0.1398928 1 1 1 2 1 0.4903066 6 2 2 3 1 0.8459779 9 3 3 4 1 0.8692369 11 4 3 5 2 0.6389887 8 5 2 6 2 0.3783782 4 6 1 7 2 0.4057894 5 7 2 8 2 0.8906754 12 8 3 9 3 0.6366516 7 9 2 10 3 0.3087094 3 10 1 11 3 0.8491306 10 11 3 12 3 0.2690837 2 12 1
Obs 1 2 3 4 5 6 7 8 9 10 Des1 A A A A A B B B B B Des2 A B B B A A B A A B
Y¯ 1. − Y¯ 2. − (μ 1 − μ 2 ) =
Des1: 15 (μ 1 + e 1 + μ 1 + e 2 +... + μ 1 + e 5 ) − 15 (μ 2 + e 6 +... + μ 2 + e 10 ) − (μ 1 − μ 2 ) =
e 1 + e 2 + e 3 + e 4 + e 5 − e 6 − e 7 − e 8 − e 9 − e 10
Des2: e 1 − e 2 − e 3 − e 4 + e 5 + e 6 − e 7 + e 8 + e 9 − e 10
Trend: If E(ei) = C × (i − 5 .5) (linear trend), Trend adds − 25 C under Des1; Trend adds 3C under Des2.
Optimal versus linear trend A B B A A B B A A B
Autocorrelations
Cor(ei, ej ) =
1 i = j ρ |i − j| = 1 0 |i − j| > 1.
Var((e 1 + e 2 + e 3 + e 4 + e 5 − e 6 − e 7 − e 8 − e 9 − e 10 )/5) = 25 σ^2 + 1425 ρσ^2 Var((e 1 − e 2 − e 3 − e 4 + e 5 + e 6 − e 7 + e 8 + e 9 − e 10 )/5) = 25 σ^2 + 252 ρσ^2
ρ 6 = 0 changes Var( Y¯ 1. − Y¯ 2 .). Estimates of s^2 i don’t capture this. A random design mitigates.
4 treatment/16 units
HAPHAZARD
Treatment A is assigned to the first four units we happen to encounter, treatment B to the next four units, and so on.
As each unit is encountered, we assign treatment A, B, C, and D based on whether the “seconds” reading on the clock is between 1 and 15, 16 and 30, 31 and 45, or 46 and 60.
Example
Paired t-test/Randomization Paired Test
In a study of egg cell maturation, the eggs from each of four female frogs were divided into two batches, and one batch was exposed to progesterone. After two minutes, the cAMP content was measured. It is believed that cAMP is a substance that can mediate cellular response to hormones.
FROG cAMP Content Control Progesterone Diff 1 6 4 2 2 4 5 - 3 5 2 3 4 4 2 2
|
d| # of occurrences 8 2 6 2 4 4 2 6 0 2
From the table, there are four of sixteen outcomes as “unlikely” or more, simply due to chance. Thus, the p-value is 0.25.
What do we do if we get
Obs 1 2 3 4 5 6 7 8 9 10 Des1 A A A A A B B B B B or Obs 1 2 3 4 5 6 7 8 9 10 Des2 A B A B A B A B A B
by chance?
http://www.thislife.org/Radio Episode.aspx?sched=
or Search on “Meet the Pros” at http://www.thislife.org/
Story times: 20:45 - 47:
In a trial on newborn infants with respiratory failure, the new treatment T was highly invasive: extracorporeal membrane oxygenation (EMCO), while the control treatment C was conventional medical management. A randomized trial was set up which saw a binary response: success or failure of the treatment.
First Trial
Trial Treatment Outcome 1 T survival 2 C death 3 T survival 4 T survival 5 T survival 6 T survival
Trial Treatment Outcome 7 T survival 8 T survival 9 T survival 10 T survival 11 T survival 12 T survival
Example
Goal: Study the effect of vitamin B6 on premenstrual syndrome. Units: Human volunteers, sorted into pairs. (One got B6; the other got a placebo.) Grouping: Severity of symptoms (as evaluated by a questionnaire).
Another nuisance influence: stress at different times of year
“For many students, the beginning of a semester tends to be less stressful than the end, when there are exams to take and papers to write.” “For many people, major holidays are often stressful.”
As a result...
December and January should be treated as different blocks of time in the study.
Block what you can, and randomize what you cannot.