Reliability - Tests and Measurements - Lecture Notes, Study notes of Psychological Data

Reliability, General Information, Field of Psychological Testing, Consistency in Measurement, Particular Sample, Classical Test Theory, True Score, Intellectual Ability, Intelligence Test, Depression Scale. You might have found many different lecture notes on Tests and Measurements regarding psychological disorder over internet but this set I am uploading is best you can find.

Typology: Study notes

2011/2012

Uploaded on 12/11/2012

saikumar
saikumar 🇮🇳

4.2

(44)

141 documents

1 / 6

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
RELIABILITY
OBJECTIVE:
To increase understanding of the concept of reliability and to provide first hand
experience with the calculation of interscorer and test-retest reliabilities.
GENERAL INFORMATION - RELIABILITY
The term reliability as used in the field of psychological testing refers to consistency in
measurement. Classical Test theory or "true score" theory proposes that a score on a
test consists of two parts. The first part is the "true" amount of whatever is being
measured (e.g., intellectual ability on an intelligence test, or depression on a depression
scale). The second part consists of unsystematic error of measurement (e.g. time of
day the test was given, the particular sample of questions chosen for the test, the
lighting in the room, a faulty pen, etc.). All of these extraneous variables may affect test
takers differentially. This part of the score is referred to as the error portion of the score.
A reliability coefficient is a numerical index of reliability that expresses the ratio between
the "true" score on a test and the total variance and basically gives an indication of how
free the test is from errors in measurement. In test theory, a "true score" on a test can
be thought of as the average (hypothetical) score that would be obtained if an individual
were to take the test an infinite number of times. In other words, the true score is the
remaining part of the score once it has been stripped of random error.
In testing, one goal is to determine true score difference. What we know is:
X = T + e
where: X represents an observed score,
T represents a true score, and
e represents an error score (the part of the score due to random,
nonsystematic influences on the test).
We only know an individual's observed score (X), which is their measured score on a
test. With one equation and two unknowns, we cannot directly compute their true score,
which is what we really want to know. Stated another way, we only know a person's
measured or observed score; we want to know their true score. But, by definition, we
cannot ever know a true score. So we must somehow estimate error and then estimate
a true score.
To estimate true scores, we will work with the variances of observed scores, true
scores, and error scores with the resulting formula:

X
T
e
222

Docsity.com
pf3
pf4
pf5

Partial preview of the text

Download Reliability - Tests and Measurements - Lecture Notes and more Study notes Psychological Data in PDF only on Docsity!

RELIABILITY

OBJECTIVE:

To increase understanding of the concept of reliability and to provide first hand experience with the calculation of interscorer and test-retest reliabilities.

GENERAL INFORMATION - RELIABILITY

The term reliability as used in the field of psychological testing refers to consistency in measurement. Classical Test theory or "true score" theory proposes that a score on a test consists of two parts. The first part is the "true" amount of whatever is being measured (e.g., intellectual ability on an intelligence test, or depression on a depression scale). The second part consists of unsystematic error of measurement (e.g. time of day the test was given, the particular sample of questions chosen for the test, the lighting in the room, a faulty pen, etc.). All of these extraneous variables may affect test takers differentially. This part of the score is referred to as the error portion of the score.

A reliability coefficient is a numerical index of reliability that expresses the ratio between the "true" score on a test and the total variance and basically gives an indication of how free the test is from errors in measurement. In test theory, a "true score" on a test can be thought of as the average (hypothetical) score that would be obtained if an individual were to take the test an infinite number of times. In other words, the true score is the remaining part of the score once it has been stripped of random error.

In testing, one goal is to determine true score difference. What we know is:

X = T + e

where: X represents an observed score, T represents a true score, and e represents an error score (the part of the score due to random, nonsystematic influences on the test).

We only know an individual's observed score (X), which is their measured score on a test. With one equation and two unknowns, we cannot directly compute their true score, which is what we really want to know. Stated another way, we only know a person's measured or observed score; we want to know their true score. But, by definition, we cannot ever know a true score. So we must somehow estimate error and then estimate a true score.

To estimate true scores, we will work with the variances of observed scores, true scores, and error scores with the resulting formula:

 (^) X  (^) Te (^2 2 )

This formula indicates that the total variance in a distribution of observed scores is equal to the sum of the true variance plus the error variance.

Test reliability then gets defined as the ratio of true score variance (actual differences) to observed score variance (obtained differences); in other words, the proportion of variance in the observed scores that can be accounted for by variance in true scores.

The reliability of a test is designated r 11 to indicate that the same construct is being measured twice with the same instrument:

r T X

11

2  2

If there were no error in the scores, and all of the scores were equal to the true scores, then the calculated r 11 would be equal to 1. If all of the observed scores in a distribution contained equal parts of error and true ability (or whatever is being measured), then the value of r 11 would be .5. In other words, if all the variance is accounted for by true variance, then r 11 would be 1.00; if none of it is ,then the value of r 11 would be 0.

If r 11 is the proportion of the observed score variance that is accounted for by true score variance, then 1 - r 11 is the proportion of observed score variance accounted for by error. We can then rewrite the equation as follows:

1  re

X

11

2

2

Since we will never know the true score variance, we are left with estimating a test's reliability. If we can obtain an estimate of the variance in the error, then we can estimate a person's true score.

The error variance can be estimated with the following formula:

e^2^  X^2  1  r 11 

We have just explained Classical test theory. Now we'll see how we obtain reliability estimates to evaluate a test.

The reliability of a test is expressed as a positive decimal number ranging from 0.00 to 1.00: r 11 = 1.00 indicates perfect reliability, and r 11 = 0.00 indicates the complete absence of reliability. Reliability is usually estimated by analyzing the effects of variations in administration conditions (test-retest) and test content (parallel or alternate forms) on test taker's scores. Reliability is not affected by systematic changes in scores

Apply this formula to the data above letting X equal the first score on the exam and Y equal the score received on retest. Similarly for the interscorer data, X can equal the first judge's rating and Y the second judge's rating.

Step by step instructions for calculating the Pearson r follow:

Step 1 Setup a table with the 5 headings below listed across the top:

X, X^2 , Y, Y^2 , and XY

Step 2 List the values for X next to the corresponding values for Y in the table where they belong. Add all values for X, placing that sum at the bottom of the X column. Add all values for Y and place that sum at the bottom of the Y column.

Step 3 Multiply each value of X by its corresponding Y value and place that product in the corresponding XY column. Add all the XY products and place that sum at the bottom of the XY column.

Step 4 Square all the values of X and add all these values, placing the sum at the bottom of the X

2 column. Square all the values of Y and add these values together, placing the sum at the bottom of the Y

2 column.

Step 5 Insert the numbers into the formula and solve for r.

YOUR HOMEWORK:

In this assignment, you will practice calculating and interpreting reliability estimates. Use Pearson's r for the correlation coefficient. You are asked to make a decision about the admissions procedures based on your results.

Name__________________________________________________________

RELIABILITY- WORKSHEET

  1. Calculate the correlation coefficient ( r ) between the first and second administration of the test given to the applicants. This constitutes test-retest reliability. This assignment can be completed by computer provided you show all the required information (see below).

X X^2 Y Y^2 XY

Correlation =

  1. Calculate the correlation coefficient ( r ) between the judges’ ratings. This constitutes inter-rater reliability.

X X^2 Y Y^2 XY

Correlation =