Statistics for Sociologists: Module 5 - Sampling Distributions and Binomial Distribution, Slides of Statistics

A part of the soci708 course notes at the university of north carolina, chapel hill. It covers the concepts of sampling distributions, the binomial distribution, and the law of large numbers. Examples, simulations, and formulas for calculating binomial probabilities.

Typology: Slides

2011/2012

Uploaded on 12/29/2012

sankait
sankait 🇮🇳

4.2

(13)

113 documents

1 / 25

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
......
Soci708 Statistics for Sociologists
Module 5 Sampling Distributions
1
François Nielsen
University of North Carolina
Chapel Hill
Fall 2009
1
Adapted in part from slides for the course Quantitative Methods in Sociology
(Sociology 6Z3) taught at McMaster University by Robert Andersen (now at
University of Toronto)
1
/
25
pf3
pf4
pf5
pf8
pf9
pfa
pfd
pfe
pff
pf12
pf13
pf14
pf15
pf16
pf17
pf18
pf19

Partial preview of the text

Download Statistics for Sociologists: Module 5 - Sampling Distributions and Binomial Distribution and more Slides Statistics in PDF only on Docsity!

......

Soci708 – Statistics for Sociologists

Module 5 – Sampling Distributions^1

François Nielsen

University of North Carolina Chapel Hill

Fall 2009

(^1) Adapted in part from slides for the course Quantitative Methods in Sociology

(Sociology 6Z3) taught at McMaster University by Robert Andersen (now at University of Toronto)

......

Jacob Bernouilli 1st (Basel 1654–1705)

Family Were Refugees from Antwerp; Ars Conjectandi published 1713)

......

Jacob Bernouilli 1st (stamp)

Note implausible sequence of proportions (see later)

......

Law of Large Numbers (1)

… (^) Jacob Bernouilli contributed to the discovery of a major phenomenon of probability, the Law of Large Numbers: … (^) In the long run, the proportion of a certain outcome of a random trial (say, head turns up when tossing a coin) will tend to stabilize to a stable value … (^) But outcome of one trial is independent of previous outcomes … (^) This is counterintuitive: … (^) People naturally tend to believe in a sort of Law of Small Numbers … (^) People do not normally expect the long “runs” of the same outcome (say, heads in tossing a coin) that occur in true random processes

......

Sampling Distributions Revisited

Population & Sample

… (^) The population distribution of a variable is:

  1. the distribution of its values for all members of the population … (^) E.g., the distribution of IQ test scores in the Belgian population
  2. the probability distribution of the variable when choosing one individual at random from the population. … (^) E.g., choose one Belgian randomly and record the IQ

… (^) A statistic (e.g., x, ˆp, b 1 ) calculated from a random sample or randomized experimental group is a random variable … (^) The probability distribution of a statistic is its sampling distribution … (^) In remainder of Module 6 we look at the sampling distributions of: … (^) counts & proportions … (^) sample means

......

Binomial Distributions

Count X & Proportion ˆp

… (^) In general X is a count of the occurrence of some outcome in a xed number of observations n … (^) E.g., in an agricultural experiment n plants are treated for a fungus; the number X of plants with the fungus is a random variable … (^) The sample proportion is ˆp = X / n … (^) E.g., in the experiment X = 9 out of n = 32 plants have the fungus. The sample proportion is

ˆp =

9 32

… (^) The binomial setting is:

  1. There are a xed number n of observations
  2. The n observations are all independent
  3. Each observation can be classi ed as “success” (1) or “failure” (0)
  4. The probability p of a success is the same for each observation

......

Binomial Distributions

Binomial Distribution

… (^) Choosing an SRS (without replacement) from a population with proportion p of successes is not exactly a binomial setting … (^) E.g., draw 10 cards from a deck, with “red card” a success. Then probability of red on second card is not independent of color of rst card … (^) However, if the population is much larger than the sample – say, 20 times as large – the count X of successes in an SRS of size n has approximately the binomial distribution B(n, p) … (^) E.g., draw sample with n = 200 from about 8,000 graduate students at UNC. “Success” is: student is female. Suppose p = 0.57. Then number of females X is distributed (almost exactly) as B(200, 0.57)

......

Binomial Distributions

Finding Binomial Probabilities (1)

  1. Calculator on the Web … (^) http://rockem.stat.sc.edu/prototype/calculators/index.php
  2. Table of Binomial Probabilities … (^) E.g., Table C in Moore & McCabe (2006)
  3. Software – R … (^) Finding P(X = x) > # P(exactly 2 children out of 5 with type O blood) > dbinom(2,5,0.25) [1] 0. … (^) Finding P(X ≤ x) > # P(2 or fewer children out of 5 with type O blood) > pbinom(2,5,0.25) [1] 0.

......

Binomial Distributions

Finding Binomial Probabilities (3)

  1. Using the Binomial Formulas (Optional; see Moore & McCabe 2006, pp.348–350) … (^) Binomial Coef cient – The number of ways of arranging k successes among n observations is given by the binomial coef cient (^)  n k

n! k!(n − k)! for k = 0, 1,... , n. In the formula the factorial n! for any positive integer is de ned as

n! = n × (n − 1 ) × (n − 2 ) ×... × 2 × 1

and also 0! = 1. … (^) Binomial Probability – If X has distribution B(n, p), the binomial probability that X = k (for k = 0, 1,... , n) is

P(X = k) =

n k

pk( 1 − p)n−k

......

Binomial Distributions

Origin of the Binomial Formula

… (^) Origin of the binomial formula

......

Binomial Distributions

Mean & Standard Deviation of Count & Proportion

… (^) If a count X has the binomial distribution B(n, p), then

μ X = np

σ X =

p np( 1 − p)

… (^) Our estimator of the proportion p of “successes” in the population is the sample proportion

ˆp =

count of successes in sample size of sample

X

n … (^) If ˆp is the sample proportion of successes in an SRS of size n from a large population with proportion p of successes^2

μ ˆp = p

σ ˆp =

r p( 1 − p) n (^2) Check this follows from the rules for linear functions of random variables.

......

Binomial Distribution

Normal Approximation of Counts & Proportions

… (^) Implications of mean and standard deviation of ˆp

  1. μ ˆp = p implies ˆp is unbiased
  2. σ ˆp =

q p( 1 −p) n implies that to divide the standard deviation of^ ˆp by half one must multiply n by four … (^) Normal approximation for counts & proportions: … (^) In an SRS of size n from a large population, when n is large

X is approximately N

np,

p np( 1 − p)

ˆp is approximately N p,

r p( 1 − p) n

where p is the proportion of successes in the population, and X and ˆp = X / n are the count & proportion of successes in the sample, respectively

......

Binomial Distribution

Normal Approximation of Counts & Proportions

… (^) E.g., SRS of n = 200 from population of 8,000 UNC graduate students with proportion females p = .57. What is P(ˆp ≤ 0.5) (i.e., the sample has fewer females than males)? … (^) np = 200 × 0.57 = 114 > 10 and n( 1 − p) = 200 × 0.43 = 86 > 10 so rule of thumb is satis ed … (^) Using binomial probabilities: X is distributed as B(200, 0.57). ˆp = 0.5 correspond to X = 100. P(X ≤ 100 ) = 0.02734091 or .027. … (^) Using the normal approximation: μ ˆp = p = 0.57;

σ ˆp =

q p( 1 −p) n =^ 0.03500714;

P(ˆp ≤ 0.5) = P

ˆp − 0.

= P(Z ≤ −1.999592) = 0.

......

Sampling Distribution of the Sample mean

Experimental Study of the Sampling Distribution of x with n = 3, n = 10, n = 100

… (^) (a) Population distribution of X (income) … (^) Distribution of x for 600 samples: … (^) (b) n = 3 … (^) (c) n = 10 … (^) (d) n = 100