

Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
The concept of statistical inference in economics, focusing on simple random sampling and parameter estimation. It explains how to learn about a population by taking a sample, the concept of a population distribution and random variable, and the calculation of statistics from a sample to estimate unknown population parameters. The document also touches upon sampling variability and the construction of a histogram to examine the sampling distribution of a statistic.
Typology: Study notes
1 / 2
This page cannot be seen from the preview
Don't miss anything!


Fall 2004
Toward Statistical Inference
Statistics is ultimately concerned with drawing inferences about some population of interest. We can learn about a population by taking a sample from it and by using the information in the sample to learn about the population. There are many different methods of sampling from a population.
In simple random sampling , all possible samples from the population have an equal chance of being the actual sample selected. Simple random sampling also gives each member of the population the same chance of being included in the sample. Random sampling can be done either with replacement or without replacement.
Suppose we are interested in learning about a variable X , say, e.g. the rate of return on the stock of companies that trade on the New York Stock Exchange (NYSE). The distribution of the values of X for all companies that trade on the NYSE is called the population distribution. If we pick a stock at random from the population of all NYSE stocks, the value of X for that stock is a random variable. The distribution of this random variable is also the population distribution.
A random sample of size n is a sequence of independent observations ( X (^) 1 , X (^) 2 , …, Xn ),
where X (^) i represents the value of X for the i th individual and each X (^) i has the
distribution of the population being sampled. We can write the joint p.d.f. (or p.f.) of the observations in a random sample as
f ( x 1 (^) , x 2 (^) ,..., xn ) = f (^) X ( x 1 (^) ) f (^) X ( x 2 )... f (^) X ( xn )
where f (^) X denotes the population p.d.f. (or p.f.).
We are usually interested in learning about certain characteristics of populations that are
population distribution are examples of parameters. A parameter is a fixed number. Because it is based on all the observations in the population, its value is almost always unknown.
On the other hand, a statistic is a numerical descriptive measure of a sample. Sample
mean X , sample median m, and sample standard deviation s are examples of statistics. Statistics are calculated from the observations in the sample. A statistic can be used to estimate an unknown parameter.
Example : A recent survey asked a nationwide random sample of 2500 adults if they agreed or disagreed with the following statement: “I like buying new clothes, but shopping is often frustrating and time-consuming.” 66% of he respondents (1650 people out of 2500) agreed.
Fall 2004
Question : What was the population this study aimed to learn about? What was the sample? What was the parameter of interest? What does the number 66% signify?
Let’s denote the proportion of the sample frustrated with shopping by p ˆ. We can use
p^ ˆ to estimate the proportion of the population that find shopping for clothes frustrating,
which we denote by p.
Question : What would happen if we took repeated samples from the population and calculated the sample proportion p ˆ for each sample? Would the value of the statistic
remain the same?
Answer : No. The value of a statistic changes from sample to sample. This is called sampling variability. We can examine the values taken by a statistic by constructing a histogram. This histogram gives the sampling distribution of the statistic, that is the distribution of values taken by the statistic in all possible samples of the same size.