Maximum Likelihood and Maximum Likelihood Estimation - Prof. Manfredi | Statistics Lecture Notes

13/11/2023

Department of Economics & Management

Master of Science in Economics, 2023-2024

Advanced statistics

Lecture 18, THU 09/11/2023. Introduction to the «inferential model» & likelihood-based inference.

Prof. Piero Manfredi

Dep Economics & Management

University of Pisa

The three main problems of statistical estimation

Remarks on the quality control example. The choice of the sample proportion phat (=8%) as an estimate of the population proportion was just intuitive: we estimated the population proportion by the sample proportion (“analogy” argument or «plug-in» principle). A number of questions arise:

POINT ESTIMATION (me).

INTERVAL ESTIMATION (prof. Giusti)

PROPERTIES OF ESTIMATORS (prof. Giusti)

Keywords of parametric inference

•Population

•(Unknown) parameters.

•Sampling & Bernoulli sample. Sample distribution.

•Chance variability of sampling («sampling variability»)

•(parametric) statistical model.

•Estimator/estimate.

•Estimation & testing.

•Likelihood: the unifying tool of inferential procedures.

Likelihood theory

•In what follows we introduce the core approach to (point) parameter estimation, namely the maximum likelihood approach.

•This is the single most important concept of statistical theory. It provides a flexible and powerful approach to point estimation.

•The intuitive idea is to focus on the available observed sample datum and to choose as best estimate of an unknown parameter its most likely value, i.e., the value that maximises the probability of observing that datum.

•This probability is called the likelihood. It is the unifying tool of all inferential procedures in the classical approach to statistical theory.

•We start by introducing the concept of likelihood function.

•We continue by presenting the maximum likelihood approach to (parametric) estimation.

•We finally define the concepts of maximum likelihood estimate and of maximum likelihood estimator (MLE).

(Refreshing) the distribution of the sample (Lect 15)

•Let (X1,…,Xn) be a sample (from population X) which can take on values (x1,…,xn) (the «possible realisations»).

•The distribution of the sample (DS), i.e., the joint probability mass function/joint density function of the sample observations in the order of selection, is

\[
DS = \begin{cases}
p_{X_1,..,X_n}(x_1,..,x_n) = \Pr(X_1 = x_1, X_2 = x_2, ..., X_n = x_n) & X \text{ discrete with PMF } p_X(x) \\
f_{X_1,..,X_n}(x_1,..,x_n) & X \text{ continuous with PDF } f_X(x)
\end{cases}
\]

•For Bernoulli samples (i.e., IID)

\[
DS = \begin{cases}
p_X(x_1)\, p_X(x_2) \cdots p_X(x_n) & X \text{ discrete with PMF } p_X(x) \\
f_X(x_1)\, f_X(x_2) \cdots f_X(x_n) & X \text{ continuous with PDF } f_X(x)
\end{cases}
\]

Advanced Statistics 2023-2024: Mid-term 31/10/2022 (prof P. Manfredi, Duration: 100 min)

Name…..………………………….………………..Surname……………….………………………….Student number

(matricola)………………….;

……

3. State the following definitions related to sampling theory: (0) the population of interest (X); (1) simple random sample; (2) distribution of a sample; (3) Bernoulli sample. Then, given a population X, clarify the difference between the concept of (Bernoulli) sample (X1,X2,..,Xn) from this population and its realization (x1,x2,..,xn). Finally, postulating that X is a Poisson population (of given parameter 𝜗) for the number of car accidents per day on the roads of a certain country, assume you draw a Bernoulli sample of n roads and let (x1,x2,..,xn) be the resulting numbers of daily accidents on each selected road. Provide the probability distribution of the sample. Last, assuming that 𝜗 = 2.4/day, find the probability of observing the following sample of size 3: (x1=0, x2=0, x3=4).
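The last part of the exercise can be sketched numerically. The Python snippet below is only an illustration, not the official solution: it assumes the sample is IID Poisson, so the distribution of the sample is the product of the individual PMFs, and the function names `poisson_pmf` and `sample_probability` are hypothetical helpers introduced here.

```python
import math

def poisson_pmf(x, rate):
    """Poisson PMF: exp(-rate) * rate**x / x! for x = 0, 1, 2, ..."""
    return math.exp(-rate) * rate ** x / math.factorial(x)

def sample_probability(sample, rate):
    """Probability of an IID (Bernoulli) sample: product of the PMFs."""
    prob = 1.0
    for x in sample:
        prob *= poisson_pmf(x, rate)
    return prob

sample = (0, 0, 4)   # observed numbers of daily accidents
rate = 2.4           # parameter value given in the exercise

prob = sample_probability(sample, rate)

# Same quantity from the closed form exp(-n*rate) * rate**sum(x) / prod(x_i!)
closed_form = (
    math.exp(-len(sample) * rate)
    * rate ** sum(sample)
    / math.prod(math.factorial(x) for x in sample)
)

print(prob, closed_form)
```

Both expressions give the same value, roughly one in a thousand, which shows that the sample probability depends on the data only through the sample sum and the product of the factorials.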


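The «quality control» problem: the population and its (unknown) parameters

Population X: Bernoulli, with unknown parameter θ (the population proportion).

The «quality control» problem: Random sample vs its realization

\[
p_{X_i}(x_i) = \begin{cases}
\theta^{x_i}(1-\theta)^{1-x_i} & x_i = 0, 1 \\
0 & \text{elsewhere}
\end{cases}
\]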

The «quality control» problem: the probability of the observed sample (the datum…)

\[
P(X_1 = x_1, .., X_{50} = x_{50}) = \theta^{4}(1-\theta)^{46}
\]

The «quality control» problem: remarks about the probability of the observed sample (the datum…)

❑ Unknown: it depends on the unknown parameter θ.

❑ It actually represents the likelihood of any Bernoulli sample with a sample sum s=4 successes over n=50 drawings.

❑ Since the sample is “Bernoulli”, i.e., IID, we computed the probability of the sample as if the drawing was “with replacement”.
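The second remark can be checked numerically. The Python sketch below assumes a Bernoulli population and uses an arbitrary trial value `theta = 0.1` (not taken from the slides). It compares two different orderings of a sample with s=4 successes over n=50 drawings; the helper name `bernoulli_sample_probability` is hypothetical.

```python
import math
import random

def bernoulli_sample_probability(sample, theta):
    """Probability of an IID 0/1 sample: product of theta or (1 - theta)."""
    prob = 1.0
    for x in sample:
        prob *= theta if x == 1 else 1.0 - theta
    return prob

n, s = 50, 4
theta = 0.1  # arbitrary trial value of the unknown parameter (assumption)

# Two realisations with the same sample sum but a different order of selection
sample_a = [1] * s + [0] * (n - s)
sample_b = sample_a[:]
random.Random(0).shuffle(sample_b)

prob_a = bernoulli_sample_probability(sample_a, theta)
prob_b = bernoulli_sample_probability(sample_b, theta)
closed_form = theta ** s * (1.0 - theta) ** (n - s)

print(prob_a, prob_b, closed_form)
```

Up to floating-point rounding the three numbers agree: the probability of the observed sample depends on the data only through the sample sum s.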

… (after Fisher) «likelihood» of the data.

… statistical inference.

… («design of experiments») is critical to allow statistical analyses to provide good results.

The likelihood function

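Definition. Given a Bernoulli sample (X1,X2,…,Xn) with realisation (x1,x2,…,xn) from population X with density f(x,θ) / probability mass function p(x,θ), depending on a vector of unknown parameters θ, the likelihood function is:

\[
L(\theta) = L(\theta; x_1,..,x_n) = \begin{cases}
\prod_{i=1}^{n} p(x_i,\theta) & X \text{ discrete} \\
\prod_{i=1}^{n} f(x_i,\theta) & X \text{ continuous}
\end{cases}
\]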

Exercise. Confirm that θ = 0.08 is a maximum point of the likelihood function.

We say that the realized sample proportion p̂ is a maximum likelihood estimate (MLE) of the unknown population proportion.
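A numerical check of the exercise can be sketched in Python. The snippet assumes the quality-control likelihood with s=4 successes over n=50 drawings and works with the log-likelihood, which has the same maximum point because the logarithm is increasing. The grid step of 0.001 and the function names are choices made here for illustration, and the check does not replace the analytical argument.

```python
import math

def log_likelihood(theta, s=4, n=50):
    """Log of theta**s * (1 - theta)**(n - s), for 0 < theta < 1."""
    return s * math.log(theta) + (n - s) * math.log(1.0 - theta)

def score(theta, s=4, n=50):
    """First derivative of the log-likelihood with respect to theta."""
    return s / theta - (n - s) / (1.0 - theta)

def second_derivative(theta, s=4, n=50):
    """Second derivative of the log-likelihood with respect to theta."""
    return -s / theta ** 2 - (n - s) / (1.0 - theta) ** 2

# Grid search over the open interval (0, 1)
grid = [k / 1000 for k in range(1, 1000)]
theta_hat = max(grid, key=log_likelihood)

print(theta_hat)                 # grid maximiser
print(score(0.08))               # first-order condition at 0.08
print(second_derivative(0.08))   # sign of the curvature at 0.08
```

The grid maximiser coincides with s/n = 0.08, the score vanishes there up to rounding, and the second derivative is negative, which is what a maximum point requires.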

Bernoulli populations: general treatment

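\[
L(\theta) = L(\theta; x_1,..,x_n) = \prod_{i=1}^{n} p(x_i,\theta) = \theta^{s}(1-\theta)^{n-s}, \qquad s = \sum_{i=1}^{n} x_i
\]

b) Using the observed sample mean i.e., the sample proportion p̂ = s/n:

\[
L(\theta) = \theta^{n\hat{p}}(1-\theta)^{n(1-\hat{p})}
\]

s=0 («only failures» observed):

\[
L(\theta) = (1-\theta)^{n}
\]

s=n («only successes» observed). Then:

\[
L(\theta) = \theta^{n}
\]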
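The two boundary cases can be explored with a short Python sketch. It assumes the Bernoulli likelihood θ^s (1−θ)^(n−s) and evaluates it on a grid that includes the endpoints 0 and 1. The sample size n=50 and the grid step 0.01 are illustrative choices, and `grid_mle` is a hypothetical helper name.

```python
def likelihood(theta, s, n):
    """Bernoulli likelihood theta**s * (1 - theta)**(n - s) on [0, 1]."""
    return theta ** s * (1.0 - theta) ** (n - s)

def grid_mle(s, n, steps=100):
    """Grid point in [0, 1] (endpoints included) with the largest likelihood."""
    grid = [k / steps for k in range(steps + 1)]
    return max(grid, key=lambda theta: likelihood(theta, s, n))

n = 50  # illustrative sample size (assumption)

mle_only_failures = grid_mle(s=0, n=n)   # likelihood is (1 - theta)**n
mle_only_successes = grid_mle(s=n, n=n)  # likelihood is theta**n

print(mle_only_failures, mle_only_successes)
```

With s=0 the likelihood is strictly decreasing in θ, and with s=n it is strictly increasing, so the maximum sits at the boundary of the parameter space (0 and 1 respectively). In both cases the first-order condition s/θ − (n−s)/(1−θ) = 0 has no solution inside (0, 1), so the maximum has to be found by inspecting the shape of the function.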

Remark («regular» vs «non regular» likelihood problems).

The examples s=n, s=0, where the MLE was detected by investigating the