


















Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Bayes’ theorem tells how our beliefs should be updated in light of the data. Statistical vs Systematic Errors, Nuisance, Parameters, Profile Likelihood, Bayesian, Bayes Theorem, Markov Chain Monte Carlo, Metropolis Hastings Algorithm, Cousins Highland Method, Integrated Likelihoods, Statistical Data Analysis, Lecture Slides, Glen Cowan, Physics Department, University of London, United Kingdom.
Typology: Slides
1 / 26
This page cannot be seen from the preview
Don't miss anything!



















1 Probability, Bayes’ theorem 2 Random variables and probability densities 3 Expectation values, error propagation 4 Catalogue of pdfs 5 The Monte Carlo method 6 Statistical tests: general concepts 7 Test statistics, multivariate methods 8 Goodness-of-fit tests 9 Parameter estimation, maximum likelihood 10 More maximum likelihood 11 Method of least squares 12 Interval estimation, setting limits 13 Nuisance parameters, systematic uncertainties 14 Examples of Bayesian approach
Statistical errors: How much would the result fluctuate upon repetition of the measurement? Implies some set of assumptions to define probability of outcome of the measurement. Systematic errors: What is the uncertainty in my result due to uncertainty in my assumptions, e.g., model (theoretical) uncertainty; modelling of measurement apparatus. The sources of error do not vary upon repetition of the measurement. Often result from uncertain value of, e.g., calibration constants, efficiencies, etc.
Suppose the outcome of the experiment is some set of data values x (here shorthand for e.g. x 1 , ..., x n
We want to determine a parameter θ , (could be a vector of parameters θ 1 , ..., θ n
The probability law for the data x depends on θ : L ( x | θ ) (the likelihood function) E.g. maximize L to find estimator Now suppose, however, that the vector of parameters: contains some that are of interest, and others that are not of interest: Symbolically: The are called nuisance parameters.
Data: Model: measured y i independent, Gaussian: assume x i
i known.
0
1
Correlation between causes errors to increase. Standard deviations from tangent lines to contour
0
1
1 improves accuracy of
1
1
In Bayesian statistics we can associate a probability with
Bayes’ theorem tells how our beliefs should be updated in light of the data x :
0
1 , e.g., Putting this into Bayes’ theorem gives: posterior likelihood × prior ← based on previous measurement reflects ‘prior ignorance’, in any case much broader than !
Bayesian computations involve integrals like often high dimensionality and impossible in closed form, also impossible with ‘normal’ acceptance-rejection Monte Carlo. Markov Chain Monte Carlo (MCMC) has revolutionized Bayesian computation. Google for ‘MCMC’, ‘Metropolis’, ‘Bayesian computation’, ... MCMC generates correlated sequence of random numbers: cannot use for many applications, e.g., detector MC; effective stat. error greater than √ n. Basic idea: sample multidimensional look, e.g., only at distribution of parameters of interest.
Goal: given an n -dimensional pdf generate a sequence of points
Actually one can only prove that the sequence of points follows the desired pdf in the limit where it runs forever. There may be a “burn-in” period where the sequence does not initially follow Unfortunately there are few useful theorems to tell us when the sequence has converged. Look at trace plots, autocorrelation. Check result with different proposal density. If you think it’s converged, try it again starting from 10 different initial points and see if you find same result.
Although numerical values of answer here same as in frequentist case, interpretation is different (sometimes unimportant?)
Sample the posterior pdf from previous example with MCMC: Summarize pdf of parameter of interest with, e.g., mean, median, standard deviation, etc.
Vary π ( θ ) to explore how extreme your prior beliefs would have to be to justify various conclusions (sensitivity analysis). Try exponential with different mean values... Try different functional forms...
Count n events, e.g., in fixed time or integrated luminosity. s = expected number of signal events b = expected number of background events n ~ Poisson( s + b ): Sometimes b known, other times it is in some way uncertain. Goal: measure or place limits on s , taking into consideration the uncertainty in b.