Bayesian Analysis: Obtaining Posterior Marginal Distributions of Parameters - Prof. Mary K, Study notes of Statistics

Bayesian analysis for multiparameter models, where real-world statistical problems often involve more than one unknown quantity. The focus is on obtaining the posterior marginal distribution of the parameter(s) of interest, such as the population mean healing rate µ, while treating other parameters like variance σ2 as nuisance parameters. The joint posterior distribution, the marginal posterior distribution of σ2, and the conditional posterior distribution of µ given σ2. It also introduces the concept of semi-conjugate priors and the use of markov chain monte carlo methods for fitting bayesian models.

Typology: Study notes

Pre 2010

Uploaded on 03/10/2009

koofers-user-yla-1
koofers-user-yla-1 šŸ‡ŗšŸ‡ø

5

(1)

10 documents

1 / 4

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
1
22S:138
Bayesian Statistics
Introduction to Multi-Parameter
Models
Lecture 9
Sept. 19, 2003
Kate Cowles
374 SH, 335-0727
2
Multiparameter models
•Real problems in statistics nearly always in-
volve more than one unknown quantity.
•However, usually only one, or a few, parame-
ters or predictions are of substantive interest.
•Example: newt healing rates
–We may be primarily interested in the pop-
ulation mean healing rate µ, but of course
we don’t really know the value of the pop-
ulation variance σ2.
–So in a realistic model, we must also treat
σ2as an unknown parameter.
ā€¢ā€œnuisance parametersā€
3
•In cases of this kind, the aim of Bayesian
analysis is to obtain the posterior marginal
distribution of the parameter(s) of interest,
e.g.
p(µ|y)
•The general approach is to estimate the joint
posterior distribution of all unknown quanti-
ties in the model, and then integrate out the
one(s) we aren’t interested in.
•Example: in normal means example, we will
find
p(µ, σ2|y)
then
p(µ|y) = Zp(µ, σ2|y)dσ2
4
Example: normal data with both µand
σ2unknown
•Need joint prior on both unknown parame-
ters.
•Consider first the conventional noninforma-
tive prior for this problem
p(µ, σ2)āˆ1
σ2
•This arises by considering µand σ2a priori
independent and taking the product of the
standard noninformative priors for each.
–A priori independence may be a reason-
able assumption here; it says that if we
knew something about one of the unknown
parameters, that wouldn’t give us infor-
mation about the distribution of the other
one.
pf3
pf4

Partial preview of the text

Download Bayesian Analysis: Obtaining Posterior Marginal Distributions of Parameters - Prof. Mary K and more Study notes Statistics in PDF only on Docsity!

22S:

Bayesian Statistics

Introduction to Multi-Parameter Models

Lecture 9 Sept. 19, 2003

Kate Cowles 374 SH, 335- [email protected]

Multiparameter models

  • Real problems in statistics nearly always in- volve more than one unknown quantity.
  • However, usually only one, or a few, parame- ters or predictions are of substantive interest.
  • Example: newt healing rates
    • We may be primarily interested in the pop- ulation mean healing rate μ, but of course we don’t really know the value of the pop- ulation variance σ^2.
    • So in a realistic model, we must also treat σ^2 as an unknown parameter.
  • ā€œnuisance parametersā€

3

  • In cases of this kind, the aim of Bayesian analysis is to obtain the posterior marginal distribution of the parameter(s) of interest, e.g. p(μ|y)
  • The general approach is to estimate the joint posterior distribution of all unknown quanti- ties in the model, and then integrate out the one(s) we aren’t interested in.
  • Example: in normal means example, we will find p(μ, σ^2 |y) then p(μ | y) =

∫ p(μ, σ^2 |y) dσ^2

4 Example: normal data with both μ and σ^2 unknown

  • Need joint prior on both unknown parame- ters.
  • Consider first the conventional noninforma- tive prior for this problem

p(μ, σ^2 ) āˆ

σ^2

  • This arises by considering μ and σ^2 a priori independent and taking the product of the standard noninformative priors for each. - A priori independence may be a reason- able assumption here; it says that if we knew something about one of the unknown parameters, that wouldn’t give us infor- mation about the distribution of the other one.
  • Recall standard noninformative priors for μ when σ^2 is assumed known, and for σ^2 when μ is assumed known.
  • This is not quite a conjugate prior; we will see that the posterior distribution does not factor like this into an inverse gamma times an independent normal.
  • Note that this prior is improper, and the joint posterior is improper if there are fewer than two observations in the current data.

Joint posterior distribution with con- ventional noninformative prior

  • The joint posterior is p(μ, σ^2 ) āˆ (^) σ^12 Ɨ (^) (σ^12 )n 2 exp(

 ļ£¬ļ£­āˆ’ 1 2 σ^2

āˆ‘^ n i=1(yi^ āˆ’^ μ)

2

 

= (^) (σ (^21) )n 2 +1 exp

 ļ£¬ļ£­āˆ’ 1 2 σ^2

 ļ£Æļ£°āˆ‘^ n i=1(yi^ āˆ’^ yĀÆ)

(^2) + n(ĀÆy āˆ’ μ) 2

 

 

= (^) (σ (^21) )n 2 +1 exp

 ļ£¬ļ£­āˆ’ 1 2 σ^2

[ (n āˆ’ 1) s^2 + n(ĀÆy āˆ’ μ)^2

]

where s^2 is the sample variance of the yis:

s^2 =

n āˆ’ 1

āˆ‘^ n i=

(yi āˆ’ yĀÆ)^2

  • yĀÆ and s^2 are the sufficient statistics for μ and σ^2.

7

Steps to the marginal posterior distri- bution of μ

  • We will use these identities from conditional probability p(μ | y) =

∫ p(μ, σ^2 |y) dσ^2 =

∫ p(μ | σ^2 , y)p(σ^2 |y)dσ^2

  • It can be shown by direct integration (GCSR p. 67-68) that the marginal posterior distri- bution of σ^2 is

p(σ^2 |y) āˆ

(σ^2 )

n+ 2

exp

  ļ£¬ļ£¬ļ£­āˆ’(n^ āˆ’^ 1)s

2 2 σ^2

  

  • What parametric density is this?

8 The conditional posterior distribution of μ given σ^2

  • Use what we already know about the poste- rior mean of μ with known variance and a uniform prior on μ:

p(μ | σ^2 , y) = N (¯y,

σ^2 n

  • Again, it can be shown by direct integration (GCSR p. 68-69) that the marginal poste- rior distribution of μ

p(μ | y) =

∫ p(μ | σ^2 , y)p(σ^2 |y)dσ^2

is a Student’s t distribution with

  • mean ĀÆy
  • scale parameter s

2 n

  • degrees of freedom n āˆ’ 1

distributionsā€ of each unknown given all the other model quantities

  • generates a sample path from the Markov chain - at each iteration, generates a realization of each unknown

What does the WinBUGS user have to input?

  • model specification in terms of the distribu- tional relationships between observables and parameters - distributions of observables as functions of parameters (likelihood) - prior distributions of parameters
  • auxiliary files containing
    • data
    • initial values for unknowns

WinBUGS output is samples.

  • correlated
  • of quantities user has requested WinBUGS to ā€monitorā€ - parameters - missing data - functions of either of these