
Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
This document from stat613: intermediate theory of statistics course discusses the concept of identifiability in statistical models. Three examples of statistical models (single index model, regression model with collinearity, and factor analysis) and asks students to determine if these models are identifiable. If not, they are asked to suggest a re-parameterization to make the model identifiable. Identifiability is a crucial concept in statistics as it ensures that the model parameters are uniquely determined by the observed data, allowing for meaningful interpretation.
Typology: Assignments
1 / 1
This page cannot be seen from the preview
Don't miss anything!

Stat613: Intermediate Theory of Statistics
Instructor: Jianhua Huang Student: TA: Seokho Lee
This is Problem 2 of Homework 0. Since no perfect solution has been provided, I post the problem again.
Identifiability of Parametrization
A statistical model X ∼ P (x, θ) is called identifiable if and only if P (x, θ) = P (x, θ′) implies that θ = θ′ or, equivalently, if and only if θ 6 = θ′^ implies that P (x, θ) 6 = P (x, θ′). Identifiability means that the model parameters are totally determined by the observed data. Only when the model is identifiable, interpretation of the model parameters is meaningful. If a model is not identified, it can be made identifiable by imposing constraints on the model parameter θ.
The notation used above is generic. The X could represent a random variable, a vector, or even more complicated object. Similarly, the θ could be a number, a vector, or a function, or more complicated object.
Discuss if the following models are identifiable. If not, suggest a re-parameterization to make the model identifiable.
(a) Single index model. Suppose we have data (Xi, Yi) independent identically distributed as (X, Y ). The (X, Y ) ∈ Rd^ × R satisfies Y = f (XT^ β) + ǫ, ǫ ∼ N (0, σ^2 ) (1)
(b) Regression model with collinearity. Suppose the data (X, Y), with X a n × p matrix and Y a n-vector, are from the following model: Y = Xβ + ǫ, E(ǫ) = 0. (2)
The rank of the X matrix satisfies rank(X) = r < p.
(c) Factor analysis. Let Y be a random vector in Rd. Consider the model
Y = Fα + ǫ, ǫ ∼ N (0, σ^2 I), (3)
where F is a d × k unknown matrix and α is a k dimensional zero mean Normal random vector. The data are i.i.d. copies of Y. Usually k is much smaller than d so this model is useful for dimension reduction.