Inference about means

Methods for inference about means

Statistical inference is the process of drawing conclusions from data, for example by

confidence intervals and significance tests. In this lecture we shall look how we can

draw conclusions from samples about the means of populations.

We shall first look at large samples, and at how we can make inferences about a

single mean, means in paired data, and the difference between the means of two

samples. For each of these we shall use a large sample Normal method or z method.

We shall then look at the same problems for small samples. For a single mean we

shall describe the one sample t method, for paired data the paired t method, and for

the means of two samples the two sample t method, also called the independent

samples t method, or two group t method. For t methods there are strong assumptions

about the distribution of the observations. I shall describe how we can use graphical

methods to investigate these.

We shall not discuss what to do if we have means of more than two samples. The

usual method for any size samples is one-way analysis of variance (anova), the

assumptions of which are as for the two sample t method.

The mean of a large sample

We can find confidence intervals and carry out significance tests for the means of

large samples using the Normal distribution. We make use of two properties of large

samples. First, the means of large samples drawn in the same way will follow a

Normal distribution quite closely, as described in Week 2. Second, the standard

deviation estimated from a large sample will be close to that for the whole population.

This means that the standard error estimated from the sample will be a good estimate.

We find confidence intervals for means of large samples using the Normal

distribution. We first estimate the standard error of the mean of the sample. This is

easy to do from the standard deviation of the observations, it is the standard deviation

divided by the square root of the sample size. Then the 95% confidence interval is the

mean minus 1.96 standard errors to the mean plus 1.96 standard errors.

For example, Figure 1 shows the distribution of birthweight in 1749 singleton

pregnancies to Caucasian mothers in South London. This is clearly negatively skew,

unlike the distribution of birthweight for term births, which is approximately Normal.

These birthweights have mean = 3296.0 g and standard deviation = 563.2 g. The

standard error of the mean is 13.5 g. Because the sample is large, the mean

birthweight will be from a Normal distribution with mean equal to the mean

birthweight in the population and standard deviation very close to the estimated

standard error of the mean, 13.5 g. Hence the 95% confidence interval for the

population mean birthweight will be 3296.0 – 1.96 × 13.5 g to 3296.0 + 1.96 × 13.5 g,

which gives 3270 g to 3322 g. Hence we estimate that the mean birthweight in this

population to be between 3270 and 3322 g.

Inference about means, Schemes and Mind Maps of Pre-Calculus

Related documents

Partial preview of the text

Download Inference about means and more Schemes and Mind Maps Pre-Calculus in PDF only on Docsity!

Methods for inference about means

The mean of a large sample

Comparing the means of two independent large samples

The t distribution

The one sample t method

Checking the assumption of a Normal distribution

The paired t method

The two sample t method

Controls Ulcerated patients

References