









Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
The bootstrap is a method for estimating the variance of an estimator and for finding approximate confidence intervals for parameters.
Typology: Study notes
1 / 15
This page cannot be seen from the preview
Don't miss anything!










This chapter covers the following topics:
Most of this volume is devoted to parametric inference. In this chapter we depart from
the parametric framework and discuss a nonparametric technique called the bootstrap.
The bootstrap is a method for estimating the variance of an estimator and for finding
approximate confidence intervals for parameters. Although the method is nonparametric,
it can be used for inference about parameters in parametric and nonparametric models
which is why we include it in this volume.
We begin by broadening what we mean by a parameter. Let us begin with a few examples.
n
⇠ P where P 2 (P ✓
: ✓ 2 ⇥). Let
b ✓ n
be the maximum likelihood
estimator of ✓. We would like to estimate the variance of
b ✓ (^) n and we want a 1 ↵
confidence interval for ✓.
n
⇠ P and let ✓ = T (P ) denote the mean of P. Hence, ✓ = E[X i
xdP (x). Let
b ✓ n
1
n
n
i=
i
. Again, we would like to estimate the variance of
b ✓ n
and we want a 1 ↵ confidence interval for ✓.
i
✓) = 1/ 2. Let
b ✓ n
denote the sample median. Yet again, we would like to
estimate the variance of
b ✓ n
and we want a 1 ↵ confidence interval for ✓.
In the first example, ✓ denotes the parameter of a parametic model. In the second and third
example, we are in a nonparametric situation; in these cases we think of a “parameter” as
a function of the distribution P and we write ✓ = T (P ). The bootstrap can be used in both
the parametric and nonparametric settings.
Let P n
be the empirical distribution. This is the discrete distribution that puts mass 1 /n at
each datapoint X i
. Hence,
P (^) n (A) =
n
n X
i=
I(X (^) i 2 A). (11.1)
In the nonparametric case, we will estimate the parameter ✓ = T (P ) by
b ✓ (^) n = T (P (^) n ) which
is called the plug-in estimator. For example, when ✓ = T (P ) =
xdP (x) is the mean, the
plug-in estmator is
b ✓ n
n
xdP n
(x) =
n
i=
i
which is the sample mean.
A sample of size n drawn from P n
is called a bootstrap sample, denoted by
⇤
1
⇤
n
n
Bootstrap samples play an important role in what follows. Note that drawing an iid sample
⇤
1
⇤
n
from P n
is equivalent to drawing n observations, with replacement, from the
original data {X 1
n
}. Thus, bootstrap sampling is often described as “resampling the
data.” This can be a bit confusing and we think it is much clearer to think of a bootstrap
sample X
⇤
1
⇤
n
as n draws from the empirical distribution P n
Now we give the bootstrap algorithms for estimating the variance of
b ✓ n
and for construct-
ing confidence intervals. The explanation of why (and when) the bootstrap gives valid
estimates, is deferred until Section 11.5. Let
b ✓ n
= g(X 1
n
) denotes some estimator.
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ● ●
●
●
●
●
0.0 0.5 1.0 1.5 2.
−
−
Figure 11.1: 50 points drawn from the model Y i
i