Statistical Methods for Psychology

was motivated by a question sent to me by Jennifer Mahon at the University of Leicester, England, who has graciously allowed me to use her data for this example. Ms Mahon was interested in the question of whether the likelihood of dropping out of a study on eating disorders was related to the number of traumatic events the participants had experienced in childhood. The data from this study are shown below. I have taken the liberty of altering them very slightly so that I don’t have to deal with the problem of small expected frequencies at the same time that I am trying to show how to make use of the ordinal nature of the data. The altered data are still a faithful representation of the effects that she found.

Number of Traumatic Events 012341 Total Dropout 25 13 9 10 6 63 Remain 31 21 6 2 3 63

Total 56 34 15 12 9 126

At first glance we might be tempted to apply a standard chi-square test to these data, testing the null hypothesis that dropping out of treatment is independent of the number of traumatic events the person experienced during childhood. If we do that we find a chi- square of 9.459 on 4 df,which has an associated probability of .051. Strictly speaking, this result does not allow us to reject the null hypothesis, and we might conclude that traumatic events are not associated with dropping out of treatment. However, that answer is a bit too simplistic. Notice that Trauma represents an ordered variable. Four traumatic events are more than 3, 3 traumatic events are more than 2, and so on. If we look at the percentage of participants who dropped out of treatment as a function of the number of traumatic events they had experienced as children, we see that there is a general, though not a monotonic, increase in dropouts as we increase the number of traumatic events. However, this trend was not allowed to play any role in our calculated chi-square. What we want is a statistic that does take order into account.

A Correlational Approach

There are several ways we can accomplish what we want, but they all come down to as- signing some kind of ordered metric to our independent variables. Dropout is not a problem because it is a dichotomy. We could code dropout as 1 and remain as 2, or dropout as 1 and remain as 0, or any other two values we like. The result will not be affected by our choice of values. When it comes to the number of traumatic events, we could simply use the numbers 0, 1, 2, 3, and 4. Alternatively, if we thought that 3 or 4 traumatic events would be much more important than 1 or 2, we might use 0, 1, 2, 4, 6. In practice, as long as we chose numbers that are monotonically increasing, and are not very extreme, the result will not change much as a function of our choice. I will choose to use 0, 1, 2, 3, and 4. Now that we have established a metric for each independent variable, there are several different ways that we could go. We’ll start with one that has good intuitive appeal. We will simply correlate our two variables.^3 Each participant will have a score of 0 or 1 on Dropout, and a score between 0 and 4 on Trauma. The standard Pearson correlation between those

Section 10.4 Analysis of Contingency Tables with Ordered Variables 307

(^3) Many articles in the literature refer to Maxwell (1961) as a source for dealing with ordinal data. With one minor
exception, Maxwell’s approach is the one advocated here, though it is difficult to tell that from his description
because his formulae were selected for computational ease.

Statistical Methods for Psychology

Get our desktop app

Company

Features

Documentation

Resources