Statistics for Economics: Simple Random Sampling and Parameter Estimation, Study notes of Economics

The concept of statistical inference in economics, focusing on simple random sampling and parameter estimation. It explains how to learn about a population by taking a sample, the concept of a population distribution and random variable, and the calculation of statistics from a sample to estimate unknown population parameters. The document also touches upon sampling variability and the construction of a histogram to examine the sampling distribution of a statistic.

Typology: Study notes

Pre 2010

Uploaded on 09/02/2009

koofers-user-e1t
koofers-user-e1t 🇺🇸

4

(1)

10 documents

1 / 2

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
ECON 413
Fall 2004
Toward Statistical Inference
Statistics is ultimately concerned with drawing inferences about some population of
interest. We can learn about a population by taking a sample from it and by using the
information in the sample to learn about the population. There are many different
methods of sampling from a population.
In simple random sampling, all possible samples from the population have an equal
chance of being the actual sample selected. Simple random sampling also gives each
member of the population the same chance of being included in the sample. Random
sampling can be done either with replacement or without replacement.
Suppose we are interested in learning about a variable
X
, say, e.g. the rate of return on
the stock of companies that trade on the New York Stock Exchange (NYSE). The
distribution of the values of
X
for all companies that trade on the NYSE is called the
population distribution. If we pick a stock at random from the population of all NYSE
stocks, the value of
X
for that stock is a random variable. The distribution of this
random variable is also the population distribution.
A random sample of size n is a sequence of independent observations 12
(, ,, )
n
X
XX,
where i
X
represents the value of
X
for the ith individual and each i
X
has the
distribution of the population being sampled. We can write the joint p.d.f. (or p.f.) of the
observations in a random sample as
12 1 2
( , ,..., ) ( ) ( )... ( )
nX X Xn
f
xx x f xf x f x=
where X
f
denotes the population p.d.f. (or p.f.).
We are usually interested in learning about certain characteristics of populations that are
called parameters. For example, the mean
µ
and the standard deviation
σ
of a
population distribution are examples of parameters. A parameter is a fixed number.
Because it is based on all the observations in the population, its value is almost always
unknown.
On the other hand, a statistic is a numerical descriptive measure of a sample. Sample
mean
X
, sample median m, and sample standard deviation s are examples of statistics.
Statistics are calculated from the observations in the sample. A statistic can be used to
estimate an unknown parameter.
Example: A recent survey asked a nationwide random sample of 2500 adults if they
agreed or disagreed with the following statement: “I like buying new clothes, but
shopping is often frustrating and time-consuming.” 66% of he respondents (1650 people
out of 2500) agreed.
pf2

Partial preview of the text

Download Statistics for Economics: Simple Random Sampling and Parameter Estimation and more Study notes Economics in PDF only on Docsity!

ECON 413

Fall 2004

Toward Statistical Inference

Statistics is ultimately concerned with drawing inferences about some population of interest. We can learn about a population by taking a sample from it and by using the information in the sample to learn about the population. There are many different methods of sampling from a population.

In simple random sampling , all possible samples from the population have an equal chance of being the actual sample selected. Simple random sampling also gives each member of the population the same chance of being included in the sample. Random sampling can be done either with replacement or without replacement.

Suppose we are interested in learning about a variable X , say, e.g. the rate of return on the stock of companies that trade on the New York Stock Exchange (NYSE). The distribution of the values of X for all companies that trade on the NYSE is called the population distribution. If we pick a stock at random from the population of all NYSE stocks, the value of X for that stock is a random variable. The distribution of this random variable is also the population distribution.

A random sample of size n is a sequence of independent observations ( X (^) 1 , X (^) 2 , …, Xn ),

where X (^) i represents the value of X for the i th individual and each X (^) i has the

distribution of the population being sampled. We can write the joint p.d.f. (or p.f.) of the observations in a random sample as

f ( x 1 (^) , x 2 (^) ,..., xn ) = f (^) X ( x 1 (^) ) f (^) X ( x 2 )... f (^) X ( xn )

where f (^) X denotes the population p.d.f. (or p.f.).

We are usually interested in learning about certain characteristics of populations that are

called parameters. For example, the mean μ and the standard deviation σ of a

population distribution are examples of parameters. A parameter is a fixed number. Because it is based on all the observations in the population, its value is almost always unknown.

On the other hand, a statistic is a numerical descriptive measure of a sample. Sample

mean X , sample median m, and sample standard deviation s are examples of statistics. Statistics are calculated from the observations in the sample. A statistic can be used to estimate an unknown parameter.

Example : A recent survey asked a nationwide random sample of 2500 adults if they agreed or disagreed with the following statement: “I like buying new clothes, but shopping is often frustrating and time-consuming.” 66% of he respondents (1650 people out of 2500) agreed.

ECON 413

Fall 2004

Question : What was the population this study aimed to learn about? What was the sample? What was the parameter of interest? What does the number 66% signify?

Let’s denote the proportion of the sample frustrated with shopping by p ˆ. We can use

p^ ˆ to estimate the proportion of the population that find shopping for clothes frustrating,

which we denote by p.

Question : What would happen if we took repeated samples from the population and calculated the sample proportion p ˆ for each sample? Would the value of the statistic

remain the same?

Answer : No. The value of a statistic changes from sample to sample. This is called sampling variability. We can examine the values taken by a statistic by constructing a histogram. This histogram gives the sampling distribution of the statistic, that is the distribution of values taken by the statistic in all possible samples of the same size.