Principal Component Analysis - Basic Statistics for Behavioral Sciences - Lecture Notes, Study notes of Statistics for Psychologists

Principal Component Analysis, Variable Reduction Technique, Orthogonal Matrix, Diagonal Elements, Correlation Matrix, Principal Component Analysis, Factor Analysis are learning points of this lecture.

Typology: Study notes

2011/2012

Uploaded on 11/21/2012

ashakiran
ashakiran 🇮🇳

4.5

(27)

261 documents

1 / 2

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
Ch. 12: Principal component analysis
I. Situation
A. For a given set of variables, we want to find a linear
combination of variables which explains the maximum of the
variance.
B. It is a variable-reduction technique.
C. It is an interdependence model (No IV, DV distinction).
A. All extracted principal components are orthogonal to each
other.
II. Model
A. PC1 = z1 = a1y1 + a2y2 + . . + apyp = a’y
PC2 = z2 = b1y1 + b2y2 + . . + bpyp = b’y
B. If a’a =1, then a is said to be normalized. The a can always
be normalized by a pre-multiplication of C where, c =
aa
a
'
.
C. Thus, z = PCi = Ay, where A = orthogonal matrix (C) such
that A’A = I and z’z = (Ay)’(Ay) = y’A’Ay = y’y.
D. The orthogonal matrix, A, transforms y to z which has the
same distance from the origin and the axes are rotated.
E. Thus,
Sz = ASA’ =
2
22
21
...00
0...
00
0.
..0
0
0.
.00
0.
.00
zp
z
z
s
s
s
F. The diagonal elements of Sz (
2
Zp
s
) are eigenvalues (λi) of S,
which are the variances of the principal components of
z = a’y.
III. Procedure
A. Start with a R (correlation matrix) to standardize the
measurement unit (S will give us a different solution from
that of R).
B. Each variable has its own variance (eigenvalue) of 1.
C. The first selection can be made by excluding any PC whose
eigenvalue is smaller than 1.
D. Utilizing the parsimony rule, select a minimum # of PCs with
the maximum variance explained (e.g. 80%).
B. A scree graph of eigenvalues can help in determining the
number of PCs.
Docsity.com
pf2

Partial preview of the text

Download Principal Component Analysis - Basic Statistics for Behavioral Sciences - Lecture Notes and more Study notes Statistics for Psychologists in PDF only on Docsity!

Ch. 12: Principal component analysis

I. Situation A. For a given set of variables, we want to find a linear combination of variables which explains the maximum of the variance. B. It is a variable-reduction technique. C. It is an interdependence model (No IV, DV distinction). A. All extracted principal components are orthogonal to each other.

II. Model A. PC 1 = z 1 = a 1 y 1 + a 2 y 2 +.. + apyp = a’y PC 2 = z 2 = b 1 y 1 + b 2 y 2 +.. + bpyp = b’y B. If a’a =1, then a is said to be normalized. The a can always

be normalized by a pre-multiplication of C where, c = aa

a '

C. Thus, z = PCi = Ay , where A = orthogonal matrix ( C ) such that A’A = I and z’z = ( Ay )’( Ay ) = y’A’Ay = y’y. D. The orthogonal matrix, A , transforms y to z which has the same distance from the origin and the axes are rotated. E. Thus,

Sz = ASA’ =

2

2 2

2 1

zp

z

z

s

s

s

F. The diagonal elements of Sz ( s (^) Zp^2 ) are eigenvalues (λi) of S , which are the variances of the principal components of z = a’y.

III. Procedure A. Start with a R (correlation matrix) to standardize the measurement unit ( S will give us a different solution from that of R ). B. Each variable has its own variance (eigenvalue) of 1. C. The first selection can be made by excluding any PC whose eigenvalue is smaller than 1. D. Utilizing the parsimony rule, select a minimum # of PCs with the maximum variance explained (e.g. 80%). B. A scree graph of eigenvalues can help in determining the number of PCs.

Docsity.com

IV. Difference between Principal Component Analysis and Factor Analysis. A. Principal components are linear combinations of original observed variables with an orthogonal matrix ( z = Ay ). B. Factor Analysis is designed to find hidden(latent) factors to express the observed variables, y = Λf + ε. C. In Principal Component Analysis we try to explain the total variance of the original variables while in Factor Analysis we try to explain the covariance or correlation among the observed variables.

Docsity.com