Prepare for your exams
Get points
Guidelines and tips
Sell on Docsity
Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Earn points to download

Earn points by helping other students or get them with a premium plan

Guidelines and tips

Sell on Docsity

Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Find documents

Prepare for your exams with the study notes shared by other students like you on Docsity

Search for your university

Find the specific documents for your university's exams

Docsity AINEW

Summarize your documents, ask them questions, convert them into quizzes and concept maps

Explore questions

Clear up your doubts by reading the answers to questions asked by your fellow students

Earn points to download

Earn points by helping other students or get them with a premium plan

Share documents

20 Points

For each uploaded document

Answer questions

5 Points

For each given answer (max 1 per day)

All the ways to get free points

Get points immediately

Choose a premium plan with all the points you need

Study Opportunities

Choose your next study program

Get in touch with the best universities in the world. Search through thousands of universities and official partners

Community

Ask the community

Ask the community for help and clear up your study doubts

Free resources

Our save-the-student-ebooks!

Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors

Statistical Tests for Parametric Models: Wald, Likelihood Ratio, and Score Tests, Study notes of Biostatistics

University of Washington (UW) - Seattle Biostatistics

An overview of three widely used statistical tests for a hypothesis that fixes the values of some parameters in a parametric model. The tests discussed are the wald test, likelihood ratio test, and score test. The differences between these tests, their assumptions, and their applications. It also covers the use of these tests in linear and generalized linear models, as well as their implementation in r.

Typology: Study notes

Pre 2010

Uploaded on 03/18/2009

koofers-user-k4j 🇺🇸

(1)

10 documents

1 / 19

This page cannot be seen from the preview

Don't miss anything!

Testing

BIOST 570

2005-10-19

Discover Study notes of Biostatistics University of Washington (UW) - Seattle

Partial preview of the text

Download Statistical Tests for Parametric Models: Wald, Likelihood Ratio, and Score Tests and more Study notes Biostatistics in PDF only on Docsity!

Testing

BIOST 570

2005-10-

Three tests

Given a well-behaved parametric model there are three widely used types of test for a hypothesis that fixes the values of some parameters, ie, θ = (β, η) = (β 0 , η). Write p for the dimension of β.

Wald test: Estimate (ˆβ, ηˆ) and compare (ˆβ − β 0 )V (ˆθ)−^1 (ˆβ − β 0 ) to a χ^2 p distribution, where V is the estimated variance matrix of β
Likelihood ratio test: Compare −2((β 0 ) −(ˆβ)) to a χ^2 p distribution
Score test: Compare U (β 0 )I−^1 (β 0 )U (β 0 ) to a χ^2 p distribution, where U is the score and I−^1 is the β submatrix of the inverse of the Fisher information (not the other way around).

The Wald test requires estimation in the full model, the score test requires estimation in the reduced model with β = β 0 and the LRT requires both.

Three tests

β=β 0 β=β

score

Three tests

The Wald test is very easy to use for confidence interval generation, the LRT is slightly harder and the score test is much harder.

The Wald confidence interval is always symmetric around β 0 , which is its main defect. It can often be improved by creating the confidence interval on a transformed parameter where symmetry is more appropriate.

The confint function in the MASS package gives likelihood ratio confidence intervals. The R package hoa provides higher-order approximations to the likelihood for some regression models. If the distributional assumptions are correct, the confidence intervals will be substantially more accurate in small samples.

Generalized linear models

For generalized linear models with V (μ) known exactly the likelihood ratio test has a χ^2 distribution.

When V (μ) is known only up to a dispersion parameter, as in overdispersed Poisson regression, it is usual to use an F or t distribution, but this is not particularly well justified by theory.

By analogy with the residual sum of squares in linear models, the log likelihood is usually reported in terms of the deviance, the difference in − 2 ×loglikelihood between the fitted model and a model with a separate mean parameter for each observation.

The LRT statistic comparing two nested models is simply the difference in deviance

Dispersion parameter

When V (μ) is known only up to a dispersion parameter k it is necessary to estimate k. There are two popular estimators

The variance of the Pearson residuals 1 n − p

∑ i

(Yi − μˆi)^2 V (ˆμi)

The deviance divided by (n − p).

The Pearson residual estimator is probably better. It is valid even when the data do not come from the assumed exponential family model. Also, the deviance estimator can be badly biased, eg for Poisson regression with small means.

A small simulation for data really from a Poisson distribution (so k = 1) gave mean residual deviance of ˆk = 0.77 for μ = 0.5 and ˆk = 0.49 for μ = 0.25.

Implementation: Wald

The Wald test for a single coefficient is printed by summary on a glm object. Most statistical software will automatically display all these single-term Wald tests.

The function regTermTest on the class web page (and taken from the ‘survey’ package) performs Wald tests for specific regression terms.

data(airquality) a<-glm(Ozone~Solar.R+cut(Temp,3)cut(Wind,3),data=airquality, family=Gamma("log")) regTermTest(a, ~Solar.R) Wald test for Solar.R in glm(formula = Ozone ~ Solar.R + cut(Temp, 3) * cut(Wind, 3), family = Gamma("log"), data = airquality) Chisq = 17.48064 on 1 df: p= 2.9025e- regTermTest(a, ~cut(Temp, 3):cut(Wind, 3)) Wald test for cut(Temp, 3):cut(Wind, 3) in glm(formula = Ozone ~ Solar.R + cut(Temp, 3) * cut(Wind, 3), family = Gamma("log"), data = airquality) Chisq = 1.610642 on 4 df: p= 0. regTermTest(a, ~cut(Temp, 3)cut(Wind, 3)) Wald test for cut(Temp, 3) cut(Wind, 3) cut(Temp, 3):cut(Wind, 3) in glm(formula = Ozone ~ Solar.R + cut(Temp, 3) * cut(Wind, 3), family = Gamma("log"), data = airquality) Chisq = 122.8512 on 8 df: p= < 2.22e-

Implementation: LRT

Likelihood ratio tests are done with the anova function. It is necessary to specify which test you want (χ^2 or F ). anova applied to a single model gives sequential tests, applied to two models it compares them.

anova(a, test="F") Analysis of Deviance Table

Model: Gamma, link: log Response: Ozone

Terms added sequentially (first to last) Df Deviance Resid. Df Resid. Dev F Pr(>F) NULL 110 71. Solar.R 1 13.040 109 58.910 48.5287 3.383e- cut(Temp, 3) 2 25.238 107 33.672 46.9616 3.803e- cut(Wind, 3) 2 6.766 105 26.906 12.5894 1.313e- cut(Temp, 3):cut(Wind, 3) 4 0.447 101 26.459 0.4155 0.

Implementation: score test

There is no automatic user-friendly function for score tests in R (or in other software AFAIK), but they are not hard to do from first principles.

The primary difficulty is computing the information matrix for the full parameter under the reduced model. This is not part of the standard model output from either the full or the reduced model.

Implementation: score test

A sneaky trick is that the score test can be computed as the Wald test based on a single iteration of IWLS. If we start IWLS at the fitted μ from the null-hypothesis model and take one iteration, then

βˆ − β 0 =

 ∑ i

∂Ui(β 0 ) ∂β

 

− 1  ∑ i

Ui(β 0 )

 

and the sandwich estimator is  ∑ i

∂Ui(β 0 ) ∂β

 

− 1  ∑ i

Ui(β 0 )Ui(β 0 )T

 

 ∑ i

∂Ui(β 0 ) ∂β

 

− 1

so the ‘Wald test’ formula gives the score test if we are using the sandwich estimator.

If we are using the model-based estimator and estimating a dispersion parameter then we need to make sure that ˆσ^2 is computed at the null hypothesis not at the fitted values.

Implementation: score test

null<-glm(Ozone~cut(Temp,3)cut(Wind,3),data=na.omit(airquality), family=Gamma("log")) onestep<-glm(Ozone~Solar.R+cut(Temp,3)cut(Wind,3),data=na.omit(airquality), family=Gamma("log"), Warning message: algorithm did not converge in: glm.fit(x = X, y = Y, weights = weights, start = start, etastart = etastart, regTermTest(onestep, ~Solar.R) Wald test for Solar.R in glm(formula = Ozone ~ Solar.R + cut(Temp, 3) * cut(Wind, 3), family = Gamma("log"), data = na.omit(airquality), mustart = fitted(null), maxit = 1) Chisq = 14.29433 on 1 df: p= 0.

Implementation: score test

Z<-predict(null,type="link")+resid(null,"working") W<-null$weights iwls1<-lm(Z~Solar.R + cut(Temp, 3) * cut(Wind, 3), weight=W,data=airquality) Error in model.frame(formula, rownames, variables, varnames, extras, extranames, : variable lengths differ iwls1<-lm(Z~Solar.R + cut(Temp, 3) * cut(Wind, 3), weight=W,data=na.omit(airquality)) regTermTest(iwls1,~Solar.R) Wald test for Solar.R in lm(formula = Z ~ Solar.R + cut(Temp, 3) * cut(Wind, 3), data = na.omit(airquality), weights = W) Chisq = 14.29433 on 1 df: p= 0.

Small example

A famous experiment by Fisher tested a woman’s claim that she could tell if milk had been added first or last in a cup of tea. The data are

TeaTasting Truth Guess Milk Tea Milk 3 1 Tea 1 3

Fitting a logistic regression we obtain

Wald CI for log odds ratio, exponentiated: (0. 367 , 220)
Likelihood ratio CI for log odds ratio, exponentiated: (0. 478 , 419)
Wald CI for odds ratio (− 19. 4 , 37 .3)
Likelihood ratio CI for odds ratio: (0. 478 , 419)

Statistical Tests for Parametric Models: Wald, Likelihood Ratio, and Score Tests, Study notes of Biostatistics

Related documents

Partial preview of the text

Download Statistical Tests for Parametric Models: Wald, Likelihood Ratio, and Score Tests and more Study notes Biostatistics in PDF only on Docsity!

Testing

BIOST 570

Three tests

Three tests

Three tests

Generalized linear models

Dispersion parameter

Implementation: Wald

Implementation: LRT

Implementation: score test

Implementation: score test

Implementation: score test

Implementation: score test

Small example