Identifiability of Statistical Models in Stat613: Intermediate Theory of Statistics - Prof, Assignments of Statistics

This document from stat613: intermediate theory of statistics course discusses the concept of identifiability in statistical models. Three examples of statistical models (single index model, regression model with collinearity, and factor analysis) and asks students to determine if these models are identifiable. If not, they are asked to suggest a re-parameterization to make the model identifiable. Identifiability is a crucial concept in statistics as it ensures that the model parameters are uniquely determined by the observed data, allowing for meaningful interpretation.

Typology: Assignments

Pre 2010

Uploaded on 02/10/2009

koofers-user-tc4
koofers-user-tc4 🇺🇸

10 documents

1 / 1

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
Stat613: Intermediate Theory of Statistics
Bonus Problem II (10/13/07)
Instructor: Jianhua Huang Student:
TA: Seokho Lee
This is Problem 2 of Homework 0. Since no perfect solution has been provided, I post the problem again.
Identifiability of Parametrization
A statistical model XP(x, θ) is called identifiable if and only if P(x, θ ) = P(x, θ) implies that θ=θ
or, equivalently, if and only if θ6=θimplies that P(x, θ)6=P(x, θ). Identifiability means that the model
parameters are totally determined by the observed data. Only when the model is identifiable, interpretation
of the model parameters is meaningful. If a model is not identified, it can be made identifiable by imposing
constraints on the model parameter θ.
The notation used above is generic. The Xcould represent a random variable, a vector, or even more
complicated object. Similarly, the θcould be a number, a vector, or a function, or more complicated object.
Discuss if the following models are identifiable. If not, suggest a re-parameterization to make the model
identifiable.
(a) Single index model. Suppose we have data (Xi, Yi) independent identically distributed as (X,Y ). The
(X, Y )Rd×Rsatisfies
Y=f(XTβ) + ǫ, ǫ N(0, σ2) (1)
(b) Regression model with collinearity. Suppose the data (X,Y), with Xan×pmatrix and Yan-vector,
are from the following model:
Y=Xβ+ǫ, E(ǫ) = 0.(2)
The rank of the Xmatrix satisfies rank(X) = r < p.
(c) Factor analysis. Let Ybe a random vector in Rd. Consider the model
Y=Fα+ǫ,ǫN(0, σ2I),(3)
where Fis a d×kunknown matrix and αis a kdimensional zero mean Normal random vector. The data
are i.i.d. copies of Y. Usually kis much smaller than dso this model is useful for dimension reduction.
1

Partial preview of the text

Download Identifiability of Statistical Models in Stat613: Intermediate Theory of Statistics - Prof and more Assignments Statistics in PDF only on Docsity!

Stat613: Intermediate Theory of Statistics

Bonus Problem II (10/13/07)

Instructor: Jianhua Huang Student: TA: Seokho Lee

This is Problem 2 of Homework 0. Since no perfect solution has been provided, I post the problem again.

Identifiability of Parametrization

A statistical model X ∼ P (x, θ) is called identifiable if and only if P (x, θ) = P (x, θ′) implies that θ = θ′ or, equivalently, if and only if θ 6 = θ′^ implies that P (x, θ) 6 = P (x, θ′). Identifiability means that the model parameters are totally determined by the observed data. Only when the model is identifiable, interpretation of the model parameters is meaningful. If a model is not identified, it can be made identifiable by imposing constraints on the model parameter θ.

The notation used above is generic. The X could represent a random variable, a vector, or even more complicated object. Similarly, the θ could be a number, a vector, or a function, or more complicated object.

Discuss if the following models are identifiable. If not, suggest a re-parameterization to make the model identifiable.

(a) Single index model. Suppose we have data (Xi, Yi) independent identically distributed as (X, Y ). The (X, Y ) ∈ Rd^ × R satisfies Y = f (XT^ β) + ǫ, ǫ ∼ N (0, σ^2 ) (1)

(b) Regression model with collinearity. Suppose the data (X, Y), with X a n × p matrix and Y a n-vector, are from the following model: Y = Xβ + ǫ, E(ǫ) = 0. (2)

The rank of the X matrix satisfies rank(X) = r < p.

(c) Factor analysis. Let Y be a random vector in Rd. Consider the model

Y = Fα + ǫ, ǫ ∼ N (0, σ^2 I), (3)

where F is a d × k unknown matrix and α is a k dimensional zero mean Normal random vector. The data are i.i.d. copies of Y. Usually k is much smaller than d so this model is useful for dimension reduction.