

























Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Material Type: Exam; Class: Data Analysis II; Subject: Statistics; University: University of Missouri - Columbia; Term: Unknown 1989;
Typology: Exams
1 / 33
This page cannot be seen from the preview
Don't miss anything!


























Introduction:
Consider an experiment in swine breeding where we will measure several traits:
Suppose that we have three genetically different types of pigs and we will measure the above variables on each of n = 30 mother pigs from each breed.
Question: How can the joint variability of the p = 6 traits be described?
MANOVA:
Note: We will usually require that Σ be positive definite. This implies that all eigenvalues are positive, and Σ has an inverse Σ−^1 , such that Σ−^1 Σ = Ip×p = ΣΣ−^1.
Correlations:
Define the correlation ρij and the correlation matrix by
ρij =
σij √σ iiσjj
and R =
ρ 11 ρ 12 · · · ρ 1 p ρ 21 ρ 22 · · · ρ 2 p ... ...... ... ρp 1 ρp 2 · · · ρpp
Notice that we can also write the correlation matrix as R = [diag(Σ)]−^1 /^2 Σ[diag(Σ)]−^1 /^2 , where diag(Σ) is just the matrix which has the σii’s on the diagonal and 0’s elsewhere. Additionally, the square root of a symmetric matrix A, denoted A^1 /^2 , is a symmetric matrix such that A = A^1 /^2 A^1 /^2.
Some useful equalities:
Let x and y be random vectors with means μx and μy and variance-covariance matrices Σx and Σy. Let A and B be matrices of constants and c and d be vectors of constants. Then
Let’s show the third of these below:
y 1 |y 2 ∼ Np 1 (μ 1 + Σ 12 Σ− 221 (y 2 − μ 2 ), Σ 11 − Σ 12 Σ− 221 Σ 21 ).
Note: there are analogous results for y 2 |y 1. Ad- ditionally, y 1 and y 2 are independently distributed only if Σ 12 = 0.
(y − μ)′Σ−^1 (y − μ) ∼ χ^2 (p).
We will call this quantity the squared Mahalanobis distance between y and μ.
∑^ k
i=
Aiyi ∼ Nm(
∑^ k
i=
Aiμi,
∑^ k
i=
AiΣiA′ i)
Suppose that ( y 1 y 2
If y 2 = 3, what is the conditional distribution of y 1?
Suppose we have a random sample of size n, y 1 , · · · , yn
from Np(μ, Σ). Then,
n − 1
∑^ n
i=
(yi − ¯y)(yi − ¯y)′^ =
n − 1
{ (^) n ∑
i=
yiy′ i − n¯y¯y′
Note that S is symmetric and contains p(p + 1)/ 2 different random variables. Further, S is an unbi- ased estimator of Σ.
Suppose that we have an independent random sample y 1 , · · · , yn ∼ Np(μ, Σ). The we can write the likelihood function for the data as
L(μ, Σ) =
∏^ n
j=
(2π)p/^2 |Σ|^1 /^2
e−
(^12) (yj −μ)′Σ− (^1) (yj −μ)
(2π)np/^2 |Σ|n/^2
e−
12 ∑n j=1(yj^ −μ)
′Σ− (^1) (yj −μ) .
Then, we can write the m.l.e’s as
ˆμ = ¯y and ˆΣ =
n − 1 n
We can show this by taking the log of the likelihood function and taking derivatives with respect to μ and Σ.
Properties of MLEs
Properties of MLEs, cont.
∂^2 `(θ) ∂θi∂θj
maxH 0 (μ, Σ|Y ) max(μ, Σ|Y )
∂trXA ∂X
= A + A′^ − Diag(A).
∂xij
−X−^1 JiiX−^1 , i = j −X−^1 (Jij + Jji)X−^1 , i 6 = j
Use these rules to find the derivative ∂x′Aa/∂x.
Now, find the derivative ∂trX/∂X.
Suppose that we collect the following sets of two num- bers based upon some process: (18. 4 , 25 .4), (20. 8 , 22 .1), (21. 8 , 27 .8), (19. 3 , 23 .7), (18. 9 , 27 .2), (19. 0 , 26 .6), (21. 7 , 29 .2), (22. 7 , 24 .2), (20. 6 , 24 .8), (18. 0 , 21 .9), (18. 4 , 26 .7), and (22. 6 , 21 .8). Find ¯y.
Next, find S. For simplicity, we will just find the co- variance here in class. (So that you can check them, s 11 = 2.96 and s 22 = 6.04.)
What is the mle of Σ? of μ?
Finally, suppose that we wish to test the null hypothesis that μ 1 = μ 2 = 22.5. NOTE: Under restrictions, the mle of the mean and variance-covariance matrix may not have the same forms that we have discussed previously. First, what will be the value of ν?
Next, suppose that the mles under the restriction are σˆ 11 = 8.82, σˆ 12 = − 6 .73, and σˆ 22 = 13.51. What is the form of the numerator for the likelihood ratio test?
What is the form of the denominator for this test?
Note: it would not be appropriate to assume that 12 observations constitutes a large sample. This is for demonstration purposes only!
We will discuss three different options for assessing mul- tivariate normality. When analyzing data we may wish to look at one or more of these possible methods.
β 1 ,p = E
(y − μ)′Σ−^1 (x − μ)
where x and y are independent, but have the same distribution.
β 2 ,p = E
(y − μ)′Σ−^1 (y − μ)
0 2 4 6 8 10 12
0
2
4
6
8
Chi−Square Quantiles
Mahalanobis Distance
Recall that for a univariate normal distribution, we could test the hypothesis H 0 : μ = μ 0 by using the test statistic
y − μ 0 s/
n
which has a tn− 1 distribution under the null hypothesis. We would reject this null hypothesis if |T | is large relative to t(1−α/ 2 ,n−1) because this indicates that seeing a value as large as ours is rare if the null is true.
This is equivalent to rejecting the null hypothesis if
(y − μ 0 )^2 s^2 /n
= n(y − μ 0 )(s^2 )−^1 (y − μ 0 )
is large. Note that the T 2 statistic has an f(1,n−1) distri- bution under the null hypothesis.
Natural Multivariate Generalization
Suppose that we wish to test H 0 : μ = μ 0 vs. Ha : μ 6 =
μ 0. Define Hotelling’s T 2 by
T 2 = n(y − μ 0 )′S−^1 (y − μ 0 ).
Hotelling’s T 2 can be viewed as a generalized distance between y and μ 0.