Using the Wilcoxon Test for Analyzing Data from Repeated-Measures Designs | Exams Psychology

Graham Hole Research Skills, version 1.0

The Wilcoxon test:

Use this when the same participants perform both conditions of your study:

i.e., it is appropriate for analysing the data from a repeated-measures design with two

conditions. Use it when the data do not meet the requirements for a parametric test

(i.e. if the data are not normally distributed; if the variances for the two conditions are

markedly different; or if the data are measurements on an ordinal scale). Otherwise, if

the data meet the requirements for a parametric test, it is better to use a repeated-

measures t-test (also known as a "dependent means" or "matched pairs" t-test).

The logic behind the Wilcoxon test is quite simple. The data are ranked to

produce two rank totals, one for each condition. If there is a systematic difference

between the two conditions, then most of the high ranks will belong to one condition

and most of the low ranks will belong to the other one. As a result, the rank totals will

be quite different and one of the rank totals will be quite small. On the other hand, if

the two conditions are similar, then high and low ranks will be distributed fairly

evenly between the two conditions and the rank totals will be fairly similar and quite

large. The Wilcoxon test statistic "W" is simply the smaller of the rank totals. The

SMALLER it is (taking into account how many participants you have) then the less

likely it is to have occurred by chance. A table of critical values of W shows you how

likely it is to obtain your particular value of W purely by chance. Note that the

Wilcoxon test is unusual in this respect: normally, the BIGGER the test statistic, the

less likely it is to have occurred by chance).

This handout deals with using Wilcoxon with small sample sizes. If you have

a large number of participants, you can convert W into a z-score and look this up

instead. The same is true for the Mann-Whitney test. There is a handout on my

website that explains how to do this, for both tests.

Step by step example of the Wilcoxon test:

Suppose we wanted to know if people's ability to report words accurately was

affected by which ear they heard them in. To investigate this, we performed a

dichotic listening task. Each participant heard a series of words, presented randomly

to either their left or right ear, and reported the words if they could. Each participant

Using the Wilcoxon Test for Analyzing Data from Repeated-Measures Designs, Exams of Psychology