Factor Analysis - Basic Statistics for Behavioral Sciences - Lecture Notes, Study notes of Statistics for Psychologists

Factor Analysis, Variable Reduction Technique, Interdependence Model, Factor Loading, Factor Pattern, Standardized Regression Coefficient, Common Factors, Eigenvalue are some points from this helpful lecture notes.

Typology: Study notes

2011/2012

Uploaded on 11/21/2012

ashakiran
ashakiran 🇮🇳

4.5

(27)

261 documents

1 / 7

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
Ch. 13: Factor Analysis
I. Situation
A. For a given set of correlated observed variables, we try to
extract hidden (latent) common factors which can explain the
correlated variables.
B. A variable-reduction technique.
C. An interdependence model.
II. Terminology
A. Factor Loading (Factor Pattern): Standardized regression
coefficient to predict original (observed) variables using
common factors.
zy1 = λ11f1 + λ12f2 + . . + λ1mfm + ε1
zy2 = λ21f1 + λ22f2 + . . + λ2mfm + ε2
.
.
zyp = λp1f1 + λp2f2 + . . + λpmfm + εp
* λ = factor loadings, not eigenvalues.
m = # of factors
p = # of variables (m<p).
B. Eigenvalue:
1. The sum of squared factor loadings between a factor and
all variables.
2. The amount of variance of standardized variables
accounted for by a factor.
C. Communality
1. The sum of squared factor loadings between a variable and
all factors.
2. The amount of variance of a standardized variable
accounted for by all factors.
3. Theoretically, it should be smaller than 1, but in
reality, it can be 1 (Heywood case) or greater than 1
(Ultra Heywood case).
4. Reasons for Heywood or Ultra Heywood cases.
a) Too many or too few common factors.
b) Bad prior communality estimates.
c) Not enough subjects for stable estimates (n>5v).
d) Inappropriate models.
D. Factor score: individual subjects’ score on each factor.
III. Mathematical Model
A. Spearman (1904) model (single g-factor model)
1. y = λg + ε
where
y (pX1) = a vector of random variable y,
Docsity.com
pf3
pf4
pf5

Partial preview of the text

Download Factor Analysis - Basic Statistics for Behavioral Sciences - Lecture Notes and more Study notes Statistics for Psychologists in PDF only on Docsity!

Ch. 13: Factor Analysis I. Situation A. For a given set of correlated observed variables, we try to extract hidden (latent) common factors which can explain the correlated variables. B. A variable-reduction technique. C. An interdependence model.

II. Terminology A. Factor Loading (Factor Pattern): Standardized regression coefficient to predict original (observed) variables using common factors.

zy1 = λ 11 f 1 + λ 12 f 2 +.. + λ1mfm + ε 1

zy2 = λ 21 f 1 + λ 22 f 2 +.. + λ2mfm + ε 2 . . zyp = λp1f 1 + λp2f 2 +.. + λpmfm + εp

  • λ = factor loadings, not eigenvalues. m = # of factors p = # of variables (m5v). d) Inappropriate models. D. Factor score: individual subjects’ score on each factor. III. Mathematical Model A. Spearman (1904) model (single g-factor model)
  1. y = λ g + ε where y (pX1) = a vector of random variable y,

λ (pX1) = a vector of parameters (unknown factor loading), g = latent general factor (unknown), and ε (pX1) = a vector of random error.

  1. Assumption a) y and g have unit variance (σ^2 y = 1, σ^2 g = 1). b) ε and g are uncorrelated [ρ(εg) = 0  σ(εg) = 0].
  2. Consequence a) Correlation of yy’ , R (pXp) = Σ (pXp) when standardized, R = E( yy’ ) = E[( λ g + ε )( λ g + ε )’] = E[( λ gg’ λ’ ) +( λ g ε ’) + ( ε g’ λ ’) + ( εε ’)] = λ E(gg’) λ ’ + λ E(g ε’ ) + E( ε g’) λ ’ + E( εε ’)] = λλ’ + E( εε ’) = λλ’ + ψ  Fundamental theorem of the Spearman’s (pX1)(1Xp)^ pXp^ common factor analysis. where

λλ’ =

λ p

λ

λ

2

1

[λ 1 , λ 2 ,.. λ p ] =

2 1 2

2

2 21 2

12 1

2 1

p p p

p

p

2

2 2

2 1

ε p

ε

ε

= Diag(σ^2 ε)

b) Example given

R =

R can be decomposed into λλ’ and ψ , where

  1. Solution a) Decompose the error component ( ε ) into unique factor ( U ) and true error ( ν ). y = Λθ + pX1 (pXm)(mX1) pXp)(pX1) b) Given R (pXp) R = E( yy ’) = E[( Λθ + )( Λθ + )’] = E[( Λθθ’Λ’ ) + ( Λθν’U’ ) + ( Uνθ’Λ’ ) + ( Uνν’U’ )] = Λ E( θθ’ ) Λ’ + Λ E( θν’ ) U’ + U E( νθ’ ) Λ’ + U E( νν’ ) U’

I 0 0 I = ΛΛ’ + UU’ = ΛΛ’ + Ψ = ΛΛ’ + U^2

*Fundamental theorem of Thurstone’s common factor analysis.

c) Example (y 1 y 2 y 3 y 4 )

y 1 y 2 y 3 y 4

R =

θ 1 θ 2 y 1 y 2 y 3 y 4

R = ΛΛ’ + U^2

d) Estimating Λ is the main concern of FA.

  1. Problems a) Unidentifiability: No unique solution for Λ.

b) Indeterminancy: No unique solution for θ  rotation.

IV. Estimation methods A. Principal component method

  1. Correlation matrix (population) R = ΛΛ’ + Ψ (= U^2 )
  2. Covariance matrix (population) Σ = ΛΛ’** + Ψ
  3. Sample covariance matrix S = Λ^Λ^’ + Ψ^  ignore Ψ^ S = Λ^Λ^ ’ = CDC ’ (spectral decomposition) (pXm)(mXp) (pXp)(pXp)(pXp) = CD1/2D1/2C ’ = ( CD1/2 )( CD1/2 )’  Λ ^ = CD1/2^ to fit the dimension pXm (pXm)(mXm) where C = orthogonal matrix with normalized eigenvectors ( c’c = 1) of S , D = diagonal matrix of eigenvalues (θj) of S.
  4. Λ ^ is a pXm matrix

Λ ^ =

p p pm

m

m

1 2

21 22 2

11 12 1

( )( )

1 / 2 pXm mXm

CD =

p p pm

m

m

c c c

c c c

c c c

1 2

21 22 2

11 12 1

θ m

2

1

p p m pm

m m

m m

c c c

c c c

c c c

1 1 2 2

1 21 2 22 2

1 11 2 12 1

  1. Eigenvalue, θˆ i^ = (^) ∑

p

j

ij 1

λˆ^2 , for each factor.

method which includes Varimax as an initial rotation method.