Analysis of Contingency Tables Contingency Tables ... | Schemes and Mind Maps Design

Newsom

Psy 525/625 Categorical Data Analysis, Spring 2021 1

Analysis of Contingency Tables

Contingency Tables

Contingency tables, sometimes called cross-classification or crosstab tables, involve two categorical

variables. More generally, we will refer to the two variables as each having I or J levels. For simplicity, we

will start by assuming two binary variables, forming a 2 × 2 table, in which I = 2 and J = 2. The four cells

of the design can contain counts, nij, or proportions, pij. The first subscript is an index of which level of

the first variable is referred to. In the 2 × 2 case, i = 1 for the first row or i = 2 for the second row, and j =

1 for the first column or j = 2 for the second column. For example, the count in the cell for the

intersection of the first column and first row will be n11. And the count in the cell in the intersection of the

first row and second column is n12 and so on. Marginal values are referred to with a “+” symbol to

designate that either i or j have been combined. So, for example, n1+ is used for the first row marginal

total count, and n+1 is used for the first column marginal total count. Cell counts and proportions with the

notation are presented below.

Cell counts/frequencies

n11

n12

n1+

n21

n22

n2+

n+1

n+2

n++

Proportions

p11

p12

p1+

p21

p22

p2+

p+1

p+2

p++

The proportions outside of the cells of each table (e.g., p+1, p+2, p1+, p2+) are marginal proportions that

relate to the marginal distribution for that variable. Each marginal proportion involves only the counts for

one of the variables and the total sample size, pi+ = ni+/n++ and p+j = n+j/n++. (Note that either n++ or n may

be used). The proportions inside the cells of the table (p11, p12, p21, p21) are joint proportions that relate to

the joint distribution of the two variables. Each joint proportion is the count for that cell divided by the total

count, pij = nij/n++. Another distinction is the conditional proportion, which relates to the conditional

distribution. Conditional proportions represent estimates of the conditional probability that P(Yi = 1) given

the value of Yj, written as P(Yi = 1|Yj).1 The conditional probability that event A occurs given event B is the

same as the joint probability of A and B occurring relative to the probability of B occurring (or the B

sample space).

( ) ( )

( )

|PA B

PAB PB

∩

Similarly, the sample conditional proportion estimates the

conditional probability, such at

i j ij i ij i

p p p nn

= =

In contingency tables, this is a row proportion (percentage),

because it is the proportion of all cases in one row (ni+ ) that

appear in one column, nij (e.g., proportion of males that say

“no”).

Andrew Batishchev

Bayes’ Theorem

It is well worth a brief digression to discuss the famous theorem called Bayes’ theorem, proposed by the

eighteenth-century mathematician/clergyman Thomas Bayes, because of its very widespread application

to statistics. Bayes’ theorem is a simple method of computing the conditional probability of one event

given another event if the probability of both events and the other conditional probability is known.

1 The subscripts for the Y variable are potentially confusing, as Y here generally is assumed to be an individual score, with Yi representing the

individual score for the row variable with a particular value (e.g., i = 2) and Yj representing the individual score on the column variable with a

particular value (e.g., j = 1).

Analysis of Contingency Tables Contingency Tables ..., Schemes and Mind Maps of Design

Related documents

Partial preview of the text

Download Analysis of Contingency Tables Contingency Tables ... and more Schemes and Mind Maps Design in PDF only on Docsity!

∩

−

is often printed along with the Pearson χ^2. It is not so much a modification of the chi-square test as an

studies (e.g., Camilli & Hopkins, 1978) suggest that Pearson’s χ^2 has nominal alpha values with

Yates suggested a correction to the Pearson’s χ^2 based on the notion that a test of discrete variables

same way. Planned follow-up analyses to an omnibus Pearson χ^2 in complex contingency tables are

χ − χ

Below I illustrate SPSS, R, and SAS procedures for the Pearson’s χ^2 test to independents on their

Joint and Marginal Frequencies

Cell Proportions within Each Column

independents. This difference was not significant, χ^2 (1) = 1.12, p = .29 The phi coefficient, φ = .03,