Understanding Confidence Intervals and Standard Errors in Applied Biostatistics

Health Sciences M.Sc. Programme

Applied Biostatistics

Week 5: Standard Error and Confidence Intervals

Sampling

Most research data come from subjects we think of as samples drawn from a larger

population. The sample tells us something about the population. The notion of

sampling is a familiar one in health care. For example, if I want to measure a

subject’s blood glucose, I do not take all the blood. I draw a sample. One drop of

blood is then used to represent all the blood in the body. I did this three times, from

the same subject (myself) and got three measurements: 6.0, 5.9, and 5.8 mmol/L.

Which of these was correct? The answer is that none of them were; they were all

estimates of the same quantity. We do not know which of them was actually closest.

In research, we collect data on our research subjects so we can draw conclusions

about some larger population. For example, in a randomised controlled trial

comparing two obstetric regimes, the proportion of women in the active management

of labour group who had a Caesarean section was 0.97 times the proportion of women

in the routine management group who had sections (Sadler et al., 2000). (We call this

ratio the relative risk.) This trial was carried out in one obstetric unit in New Zealand,

but we are not specifically interested in this unit or in these patients. We are

interested in what they can tell us about what would happen if we treated future

patients with active management of labour rather than routine management. We want

to know, not the relative risk for these particular women, but the relative risk for all

women.

The trial subjects form a sample, which we use to draw some conclusions about the

population of such patients in other clinical centres, in New Zealand and other

countries, now and in the future. The observed relative risk of Caesarean section,

0.97, provides an estimate of the relative risk we would expect to see in this wider

population. If we were to repeat the trial, we would not get exactly the same point

estimate. Other similar trials cited by Sadler et al. (2000) have reported different

relative risks: 0.75, 1.01, and 0.64. Each of these trials represents a different sample

of patients and clinicians and there is bound to be some variation between samples.

Hence we cannot conclude that the relative risk in the population will be the same as

that found in our particular trial sample. The relative risk which we get in any

particular sample would be compatible with a range of possible relative risks in the

population.
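
The variation between repeated trials can be illustrated with a small simulation. This is only a sketch under stated assumptions: the true Caesarean section proportions (0.10 in both arms, so a true relative risk of 1.0), the arm size of 300, and the function name simulate_relative_risk are all hypothetical choices made for illustration and are not taken from Sadler et al. (2000).

```python
import random


def simulate_relative_risk(p_active, p_routine, n_per_arm, rng):
    """Simulate one two-arm trial and return the observed relative risk.

    All arguments are hypothetical inputs chosen for illustration.
    """
    # Count simulated Caesarean sections in each arm.
    events_active = sum(rng.random() < p_active for _ in range(n_per_arm))
    events_routine = sum(rng.random() < p_routine for _ in range(n_per_arm))
    if events_routine == 0:
        # The ratio is undefined when the comparison arm has no events.
        return float("nan")
    # Relative risk = proportion in active arm / proportion in routine arm.
    return (events_active / n_per_arm) / (events_routine / n_per_arm)


rng = random.Random(1)  # fixed seed so the run is repeatable

# Ten simulated trials from the SAME hypothetical population (true RR = 1.0).
observed = [
    simulate_relative_risk(p_active=0.10, p_routine=0.10, n_per_arm=300, rng=rng)
    for _ in range(10)
]

for rr in observed:
    print(round(rr, 2))
```

Even though every simulated trial samples from the same population, the printed relative risks differ from one another, which is the point made above: a single observed relative risk is an estimate, not the population value.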

When we draw a sample from a population, it is just one of the many samples we

could take. If we calculate a statistic from the sample, such as a mean or proportion,

this will vary from sample to sample. The means or proportions from all the possible

samples form the sampling distribution. To illustrate this with a simple example, we

could put lots numbered 1 to 9 into a hat and sample by drawing one out, replacing it,

drawing another out, and so on. Each number would have the same chance of being

chosen each time and the sampling distribution would be as in Figure 1(a). Now we

change the procedure, draw out two lots at a time and calculate the average. There are

36 possible pairs, and some pairs will have the same average (e.g. 1 and 9, 4 and 6

both have average 5.0). The sampling distribution of this average is shown in Figure

1(b).
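
The two sampling distributions in this example can be enumerated directly. The sketch below assumes that the two lots are drawn together without replacement and that order is ignored, which is the reading consistent with the 36 possible pairs stated above; the variable names are illustrative only.

```python
from collections import Counter
from fractions import Fraction
from itertools import combinations

lots = range(1, 10)  # lots numbered 1 to 9

# (a) One lot drawn at a time, with replacement: each digit is equally likely.
single = {digit: Fraction(1, 9) for digit in lots}

# (b) Two lots drawn together (no replacement, order ignored).
pairs = list(combinations(lots, 2))

# Count how many pairs share each average, then convert to relative frequency.
mean_counts = Counter(Fraction(a + b, 2) for a, b in pairs)
pair_means = {
    mean: Fraction(count, len(pairs))
    for mean, count in sorted(mean_counts.items())
}

print("number of pairs:", len(pairs))
for mean, rel_freq in pair_means.items():
    print(float(mean), rel_freq)
```

The distribution of single digits is flat, whereas the averages of pairs pile up around the middle: four pairs (1 and 9, 2 and 8, 3 and 7, 4 and 6) share the average 5.0, while only one pair gives the smallest average, 1.5, and only one gives the largest, 8.5.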

[Figure 1. Sampling distributions for lots numbered 1 to 9. (a) Single digit: relative frequency against digits 1 to 9. (b) Mean of two digits: relative frequency against the mean of two digits.]

Standard error

Significance tests and confidence intervals