

Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
This is the Past Exam of Generalised Linear Models which includes Values, Fixed, Integer, Expectation, Variance, Probability Mass Function, Standard Notation, Logit Functions, Logistic etc. Key important points are: Distribution, Exponential Family, Distribution, Random Variable, Probability Density, Quantities Appearing, Likelihood Function, Second Derivatives, Single Observation, Distribution
Typology: Exams
1 / 3
This page cannot be seen from the preview
Don't miss anything!


PART II (Third or Fourth Year)
MATHEMATICS & STATISTICS 2 Hours
Math 352: Generalized Linear Models
You should answer ALL Section A questions and ONE Section B question. In Section A there are questions worth a total of 50 marks, but the maximum mark that you can gain there is capped at 40.
SECTION A
A1. (a) Define an exponential family (EF) distribution of a random variable Y with canonical parameter θ based on a probability density or mass function q. You must clearly define different quantities appearing in the definition. [7] (b) Evaluate the log likelihood function, (θ), based on a single observation y from an EF and find expressions for its first and second derivatives, θ(θ) and θθ(θ). [3] (c) Use these results to find the mean and variance of Y following the EF distribution in terms of the derivatives of the cumulative generating function of q; you should state carefully any general result about the score function that you use. [4] (d) Show that the EF based on the probability mass function (pmf)
q(y) =^12 , y = 0, 1 ,
is given by f (y|θ) = e
θy (1 + eθ^ ) ,^ y^ = 0,^1.^
(e) Use (c) to find the relationship between the mean parameter and the canonical parameter in this example. [1] (f) Hence or otherwise derive the mean function and the variance function. [5]
please turn over
SECTION A continued
A2. The following table shows the proportion of carrots showing insect damage in a trial with three blocks (fields) and four dose levels of an insecticide. dose Block 1 Block 2 Block 3 1.52 10/35 17/35 10/ 1.88 6/40 8/40 3/ 2.12 9/42 7/42 1/ 2.36 2/30 14/30 2/ (a) Identify mathematically a GLM that might fit this data. You should carefully state the model, link function and the explanatory variables dose and block (a factor) with additive main effects. Do not include an interaction term. [10] (b) Construct the design matrix for this model. [7] (c) Construct the design matrix for the model again by taking into account the interactions of dose and block. [5] (d) Briefly indicate how would you form the likelihood to estimate parameters for the addi- tive main effect model. [5]
B1. The following table gives the total number of typographical errors (typos) in eight project reports along with the corresponding number of words (rounded to nearest multiple of 250) in those reports.
Word counts 4250 4500 4750 5000 5250 5500 5750 6000 Typos 1 1 2 3 2 3 5 9 (a) Plot the total number of typos versus the word counts to understand their relation. [3] (b) Give two reasons why it would not be sensible to fit a simple normal linear model to this data. [4] (c) Identify mathematically a GLM that might fit this data. You should carefully state the model, link function and covariates. [8] (d) Construct the design matrix for this model. [5] (e) Write down an expression for the log-likelihood function. [5] (f) Write down the equations that must be solved for estimating the parameters. [5] please turn over