Understanding Construct Validity in Psychological Tests: A Historical Perspective | Study Guides, Projects, Research Psychology

-

----L.

J.

CRONBACH

and

P.

E.

MEEID..-----

Construct

Validity

in

Psychological

Tests

V

AL

IDATION

of psychological tests has

not

yet been adequately concep·

tua1ized,

as

the

APA

Committee

on

Psychological Tests learned when

it

undertook (1950-54)

to

specify what qualities should

be

investigated

before a te

st

is

published. In order to make coherent recommendations

the

Committee found it necessary

to

distinguish four types of validity,

established by different types of research

and

requiring different interpre·

tation.

The

chief

inn

ovation in the

Com

mittee's report was the term

constmct validity.*

Th

is idea

was

fir

st formulated hy a subcommittee

{Mee

hl

and

R.

C.

Cballman) studyi

ng

how proposed recommendations

would apply to proje

ct

ive techniques, and later modified

and

clarified

by the entire Committee {Bordin, Challman, Conrad,

Hu

mphre

ys

,

Super, and

the

present writers).

The

statements agreed

upon

by

tbe

Committee (and by committees of two other associa tions) were pub-

lished

in

the Technical Recommendations ( 59).

The

present interpre-

tation

of

construct validity

is

not

"official" and deals with some areas

in which

the

Committee would probably

not

be

unanimous.

Th

e present

writers are solely responsible for this att

em

pt to explain the concept

and

elaborate its implications.

Identification

of

construct validity

was

not

an

isolated development.

Writers

on

validity during

the

preceding

decade

had shown a great

deal of dissatisfaction

with

conventional notions of validity, and intro·

dnced new terms

and

ideas,

hut

the resulting aggregation

of

types of

* Referred to in a preliminary report (

58)

as

cougrue

nt

validity.

NOTE:

Th

e second a

uthor

worked on this problem in co

nn

ection with his appoint·

mcnt to

the

Minnesota

Cente

r for Philosophy of

Sci

ence. \

Ve

are indebted to

the

o

th

er members o f

the

C

enter

(Herb

e

rt

Feigl,

l'vli

chael Scriven, \.Vilfricl Sellars), and

h>

D. L.

Thi

stlcth

waitc

of

the

Univ

er

sity

of

Illinois, for

th

eir

m;1jor

c:o11trilmtions

to

our

thi11king

a11c1

their suggestions for improving this paper. T he paper

li

rst appc;ll'

l'<I

i

11

/

'

.

~>

·

dwlogie:rl

l311llcti11

,

Jul

y 1955,

an

d

is

repri

nt

ed

here,

wi

th minor :ilti;rations.

hy

p

l'

1111

issio

11

of

Ilic

editor

:i

nd of

the

authors.

174

CONSTRUCT VALIDITY I N PSYCHOLOGICAL TF.STS

validity seems only

to

ha

ve

stirred the

mudd

y waters. Portions of

the

distinctions we shall discuss are implicit

in

Jenkins' paper, "Validity for

'What?" { 33), Gulliksen's "I

nt

rinsic Validity" (27),

Goo<le

nough's di

s-

tinction between tests as "signs"

and

"samples" (22), Cronbach's sepa·

ration of "logical"

and

"empirical" validity (

11

), Guilford's "factorial

va

lidity" (25),

and

Mo

sier's pape

rs

on

"face validity" and "validity gen-

eralization" (

49,

50). Hel

en

Pea

k (

52)

comes close to

an

explic

it

stat

e-

ment

of construct validity as we shall pre

sent

it.

Four

Type

s of Validation

TI1e

categories

into

which the Recommendations divide

va

lidity

studies are: predictive validity, concurrent validity, c

ontent

validity, and

constrnct vali

di

ty.

Th

e

fir

st two of these may

be

considered together

as

criterion-oriented validation procedures.

TI1e

pattern of a criterion-oriented study

is

familiar.

The

investigator

is

primarily intere

st

ed in some criterion whi

ch

he

w

is

hes

to

predict.

li

e

administers the test, obtains

an

independent criterion measure on the

same subjects, and computes a correlation.

If

the criterion

is

obtained

some time after tile test

is

given, h e is studying predictive validity.

If

th

e

test score

and

criterion score are

de

termined

at

essentially

th

e same

time,

he

is

studying concurrent validity. Concurrent validity is studied

when

one

test

is

proposed as a s

ub

sti

tute

for another (for exampl

e,

when

a multiple-choice form

of

spelling test is substi

tuted

for taking dicta-

tion),

or

a test

is

shown to correlate

with

some contemporary criterion

(e.g

.,

psychiatric diagnosis).

Conte

nt

validity is established by showing that the test items arc

a sample

of

a universe in

whic11

the

investigator is interested.

Content

va

lidity is ordinarily

to

be

es

tablished deductivel

y,

by defining a un

i·

verse of items

and

sampling systematically within this universe to

establish

the

test.

Construct validation

is

in

vo

lved whenever a test

is

to

be

int

erpreted

as a

m0<1s

ure of some attribute or quality which is

not

"operationally

defined."

TI1

e problem faced by the investigator is, '

'\Vhat

constn1cts

accou

nt

for ·variance in test

pe

rformance?" Construct

va

lidity calls for

110 new scientific

ap

proach.

Mu

ch curre

nt

research on tests of per-

sonn

lity (9) is consl'ruct

va

lidation, usually without the

ben

efi

t of a

clear formulation of

!'hi

s process.

C

on

struct validity is not to he identified sol

ely

by particular investi-

17

5

Understanding Construct Validity in Psychological Tests: A Historical Perspective, Study Guides, Projects, Research of Psychology

Related documents

Partial preview of the text

Download Understanding Construct Validity in Psychological Tests: A Historical Perspective and more Study Guides, Projects, Research Psychology in PDF only on Docsity!

- ----L. J. CRONBACH and P. E. MEEID..-----

Construct Validity in Psychological Tests

V ALIDATION of psychological tests has not yet been adequately concep·

it undertook (1950-54) to specify what qualities should be investigated

constmct validity.* Th is idea was fir st formulated hy a subcommittee

lished in the Technical Recommendations ( 59). The present interpre-

writers are solely responsible for this attem pt to explain the concept

Writers on validity during the preceding decade had shown a great

* Referred to in a preliminary report ( 58) as cougruent validity.

tinction between tests as "signs" and "samples" (22), Cronbach's sepa·

ration of "logical" and "empirical" validity ( 11 ), Guilford's "factorial

eralization" ( 49, 50). Hel en Pea k ( 52) comes close to an explic it stat e-

Four Types of Validation

TI1e pattern of a criterion-oriented study is familiar. The investigator

same subjects, and computes a correlation. If the criterion is obtained

a multiple-choice form of spelling test is substi tuted for taking dicta-

Conte nt validity is established by showing th at the test items arc

validity is ordinarily to be es tablished deductivel y, by defining a un i·

as a m0<1s ure of some attribute or quality which is not "operationally

defined." TI1 e problem faced by the investigator is, ' '\Vhat constn1cts

accou nt for ·variance in test performance?" Construct va lidity calls for

sonn lity (9) is consl'ruct va lidation, usually without the ben efi t of a

clear formulation of !'hi s process.

oriented validity, as Bechtoldt emphasizes ( 3, p. 1245), "involves the

in empirical validation studies (cf. 46, pp. 49-50, 110-11).

each involves a different emphasis on the criterion. In predictive or con-

behavior or the scores on the criteria ( 59, p. 14).

relates .50 with Y, the amount of palmar sweating induced when we

of X for Y is adequately described by the coefficient, and a statement

ferent results if we induce palmar sweating by economic threat. It is

ternity brothers' ratings on "tenseness." Test X correlates .55 with

criterion variables would be justified only if it had already been shown

Inadequacy of Validation in Terms of Specific Criteria

pletely by itself, they allow two variables to collapse into one whenever

the properties of the operationally defined measures are the same: "If

test may be used for the new one." But accurate inferences are poss ible

construct, but a counselor is more likely to be asked to forecast behavior

around the generalized construct of anxiety. The Techni cal Recom-

to identify any one cntenon m eas ur e or any composite as the criterion

aimed at is, however, usually unwarranted (59, pp. 14-15).

nent at places in the testing literature. Thus Anastasi (2) makes many

To cJa im that a test measures anything over and above its cri terion is

pure speculation" (p. 67 ). Yet elsewhere this article supports construct

arises because some objects feel hotter to the touch than others. The

tween expansion and sensed temperature; ( b) ob servers employ the

the relation of mercury expansion to heat. 111is whole process of con-

criterion has now been relegated to a peripheral position. \Ve have lifted

tended to agree with judgme nts by schoolteachers. If it had no t shown

Expcrjmentation to In vestigate Co ns tru ct Validity

Croup <liffcrcnces. If our understanding of a construct leads us to

expect two gronps to cliffer on the test, this expectation may be tested

Correlation matrices and factor analysis. If two tests are presumed to

external mea sure of either the first or the second variable exists.) If the

psychologist wishes to know 'why his tests are valid.' He can place tests

it is economical in explanatio n; it leads to the creation of pure tests

define <] hy all behaviors of a given lype. \Vhi eh set of fa ctors from a

sponses will be given on some questionnaire by a subject he bas

ing." If "creativity" is defined as something independent of knowledge,

then a correlation of .40 between a presumed t es t of creativity and a

test of arithmetic knowledge would indicate that at least 16 per cent

of th e reliable test variance is irrelevant to creativity as defined. Labora·

It shon ld be particularly noted that rej ec ting the nuH hypothesis does

not finish tl1 e job of cons truct val idation ( 35, p. 28 4). The problem

v:1ri:1hlc. Th e msk is to st·atc as definitely :i s possi ble the degree of

validily the l es t is presn rn cd to have.

to th e works by Braithwaite (6, especially Chapter III), Carnap (7; 8,

pp. 56-69), Pap (5 1), Sellars (SS, 56), Feigl (19, 20), Beck (4), Kneale

(37, pp. 92- 11 0), Hempe] (29; 30, §7).

work.

2. The laws in a nomo]ogical network may relate (a) observable

properties or quantities to each other; or (b) theoretical constructs to

observables; or ( c) different theoretical constructs to one another. TI1ese

3. A necessary condition for a construct to be scientifically ad missible

observables. Admissible constructs may be r emote from observation, i.e.,

a long derivation may intervene between the nomologicals which im-

4. "Learning more about" a theoretical construct is a matter of elaho·