Identifying & Controlling Confounders in Epidemiology: A Guide, Study Guides, Projects, Research of Psychology

An in-depth explanation of confounding in epidemiological studies. It discusses how confounding occurs, its impact on the measurement of association between an exposure and health outcome, and methods for assessing and controlling for confounders. examples using rate ratios and odds ratios, as well as practical exercises. It is a valuable resource for students and researchers in public health and epidemiology.

Typology: Study Guides, Projects, Research

2021/2022

Uploaded on 09/27/2022

karthur
karthur 🇺🇸

4.8

(8)

230 documents

1 / 5

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
ERIC at the UNC CH Department of Epidemiology Medical Center
Confounding Bias, Part I
E R I C N O T E B OO K S E R I ES
Confounding is one type of
systematic error that can occur in
epidemiologic studies. Other types of
systematic error such as information
bias or selection bias are discussed
in other ERIC notebook issues.
Confounding is an important concept
in epidemiology, because, if present,
it can cause an over- or under-
estimate of the observed association
between exposure and health
outcome. The distortion introduced
by a confounding factor can be large,
and it can even change the apparent
direction of an effect. However,
unlike selection and information
bias, it can be adjusted for in the
analysis.
What is confounding?
Confounding is the distortion of the
association between an exposure
and health outcome by an
extraneous, third variable called a
confounder. Since the exposure of
interest is rarely the only factor that
differs between exposed and
unexposed groups, and that also
affects the health outcome or
disease frequency, confounding is a
common occurrence in etiologic
studies.
Confounding is also a form a
bias. Confounding is a bias because it
can result in a distortion in the
measure of association between an
exposure and health outcome.
Confounding may be present in any
study design (i.e., cohort, case-control,
observational, ecological), primarily
because it's not a result of the study
design. However, of all study designs,
ecological studies are the most
susceptible to confounding, because it
is more difficult to control for
confounders at the aggregate level of
data. In all other cases, as long as
there are available data on potential
confounders, they can be adjusted for
during analysis.
Confounding should be of concern
under the following conditions:
1. Evaluating an exposure-health
outcome association.
2. Quantifying the degree of
association between an exposure
and health outcome. For example,
you might want to quantify how
being overweight increases the
risk of cardiovascular disease
(CVD). If you were concerned
about age as a confounder, you
would “control for” the effect of
age in your statistical modeling.
Second Edition Authors:
Lorraine K. Alexander, DrPH
Brettania Lopes, MPH
Kristen Ricchetti-Masterson, MSPH
Karin B. Yeatts, PhD, MS
Second Edition
pf3
pf4
pf5

Partial preview of the text

Download Identifying & Controlling Confounders in Epidemiology: A Guide and more Study Guides, Projects, Research Psychology in PDF only on Docsity!

Confounding Bias, Part I

E R I C N O T E B O O K S E R I E S

Confounding is one type of systematic error that can occur in epidemiologic studies. Other types of systematic error such as information bias or selection bias are discussed in other ERIC notebook issues. Confounding is an important concept in epidemiology, because, if present, it can cause an over- or under- estimate of the observed association between exposure and health outcome. The distortion introduced by a confounding factor can be large, and it can even change the apparent direction of an effect. However, unlike selection and information bias, it can be adjusted for in the analysis. What is confounding? Confounding is the distortion of the association between an exposure and health outcome by an extraneous, third variable called a confounder. Since the exposure of interest is rarely the only factor that differs between exposed and unexposed groups, and that also affects the health outcome or disease frequency, confounding is a common occurrence in etiologic studies.

Confounding is also a form a bias. Confounding is a bias because it can result in a distortion in the measure of association between an exposure and health outcome. Confounding may be present in any study design (i.e., cohort, case-control, observational, ecological), primarily because it's not a result of the study design. However, of all study designs, ecological studies are the most susceptible to confounding, because it is more difficult to control for confounders at the aggregate level of data. In all other cases, as long as there are available data on potential confounders, they can be adjusted for during analysis. Confounding should be of concern under the following conditions:

  1. Evaluating an exposure-health outcome association.
  2. Quantifying the degree of association between an exposure and health outcome. For example, you might want to quantify how being overweight increases the risk of cardiovascular disease (CVD). If you were concerned about age as a confounder, you would “control for” the effect of age in your statistical modeling.

Second Edition Authors:

Lorraine K. Alexander, DrPH

Brettania Lopes, MPH

Kristen Ricchetti-Masterson, MSPH

Karin B. Yeatts, PhD, MS

Second Edition

In one study, the rate ratio might change from 4.0 to 3.7 when controlling for age, whereas in another study, a rate ratio of 4 may change to 1.2 after controlling for age.

  1. Multiple causal pathways may lead to the health outcome. If there is only one way to contract the health outcome or disease, confounding cannot occur. This criterion is almost always met as health outcomes can inevitably be caused by different agents, different transmission routes, or different biological or social mechanisms.

A few examples of research questions in which you would want to consider confounding are listed below:

  1. Does being overweight increase the risk of coronary heart disease (CHD) -- independently of cholesterol, hypertension, and diabetes?
  2. Does tobacco advertising entice adolescents to experiment with tobacco independently of whether or not their parents smoke?

Assessing confounding

Each potential confounder has to meet two criteria before they can be confounders: Criterion 1 is that the potential confounder must be a known risk factor for the health outcome or disease.

Broadly speaking, a risk factor is any variable that is:

  1. Already known to be "causally related" to the health outcome or disease (though not necessarily a direct cause) AND
  2. Antecedent to the health outcome or disease on the basis of substantive knowledge or theory, and/or on previous research findings.

The confounding factor must be predictive of the health outcome or disease occurrence apart from its association with exposure; that is, among unexposed (reference) individuals, the potentially confounding factor should be related to the health outcome or disease.

With an epidemiological data set, one can calculate whether or not a potential confounder is a risk factor using the following mathematical formula: Criterion 1 for confounding: mathematical formula Criterion 1 for confounding is the following: among the unexposed, there should be an association between the confounder and the health outcome. To convert this to a mathematical equation, the first thing to realize is that Criterion 1 involves calculating a measure of association ("there should be an association between the confounder and the health outcome"). Examples of measures of association are: risk ratios, rate ratios, odds ratios, and risk differences

  • the type of measure depends on the type of data available, and the scale on which the measure of association is assessed (additive or multiplicative scale). This measure of association will be calculated among the unexposed population only. For a prospective cohort study where we want to measure the association on a multiplicative scale, we will calculate the following rate ratio (RR): RR CD/E- = risk ratio confounder in unexposed Rate of new cases among population A Rate of new cases among population B where the rate of new cases = the number of new cases divided by the total number of susceptible individuals. Population A is comprised of all individuals who have the confounder (C+) but who are unexposed (E-), and population B is comprised of all individuals who don't have the confounder (C-) or the exposure (E-).

To decide whether a variable is working independently of the association of interest, there must be a biological or social mechanism to causally link the exposure of interest to the disease or health outcome. Such decisions should be made on the basis of the best available information, including non-epidemiological (i.e., clinical, sociological, psychological, or basic science) data. This criterion is obviously satisfied if the confounding factor precedes the exposure and health outcome or disease.

For instance, if interested in assessing the association between physical inactivity and cardiovascular disease (CVD), body weight should not be controlled for if being overweight may be an intermediary step in the causal pathway between physical inactivity and CVD.

In contrast, if the proposed causal pathway is independent of body weight, then body weight can be considered a potential confounder. If intervening variables are controlled for in the analysis, it may reduce or eliminate any indications in the data of a true association between disease and exposure.

ERIC Notebook Confounding Bias Part II and Effect Measure Modification, discuss control of confounders in epidemiological studies.

are unexposed (E-). Note the additional inclusion crite- ria for case-control studies: the individuals included in this calculation must include only those who have the potential to be cases (the control group). Now that the risk ratio, rate ratio, or odds ratio for the association between the confounder and exposure has been calculated, how is it interpreted? For the con- founder associated with the exposure, this association has to be greater than 1 (for a harmful association) or less than 1 (for a protective association).

Terminology

Confounding bias: A systematic distortion in the measure of association between exposure and the health outcome caused by mixing the effect of the exposure of primary interest with extraneous risk factors.

Practice Questions Answers are at the end of this notebook Researchers have conducted a cohort study in country A to examine the association between a diet high in fat and the risk of colon cancer. The researchers believe that vitamin use may be a confounder. Use the 2x2 tables below to determine if vitamin use is a confounder in the high fat diet- colon cancer association.

Among people exposed to a high fat diet (n=2474):

Among people not exposed to a high fat diet (n=1650):

  1. Is vitamin use an independent risk factor or protective factor for colon cancer?

Colon cancer

No colon cancer

Total Exposed to a high fat diet

254 2220 2474

Not ex- posed to a high fat diet

150 1500 1650

Colon cancer

No colon cancer

Total Takes daily vita- min

150 1830 1980

Does not take daily vitamin

104 390 494

Colon can- cer

No colon cancer

Total Takes daily vitamin

50 800 850

Does not take daily vitamin

100 700 800

  1. Is vitamin use differentially distributed between the high fat diet and low fat diet groups?

  2. Compare the crude risk ratio with the risk ratios stratified by vitamin use.

References

Dr. Carl M. Shy, Epidemiology 160/600 Introduction to Epidemiology for Public Health course lectures, 1994- 2001, The University of North Carolina at Chapel Hill, Department of Epidemiology

Rothman KJ, Greenland S. Modern Epidemiology. Second Edition. Philadelphia: Lippincott Williams and Wilkins,

The University of North Carolina at Chapel Hill, Department of Epidemiology Courses: Epidemiology 710, Fundamentals of Epidemiology course lectures, 2009- 2013, and Epidemiology 718, Epidemiologic Analysis of Binary Data course lectures, 2009-2013.

Answers to Practice Questions

  1. Risk ratio of vitamin users getting colon cancer among the non-exposed group: (50/850) / (100/800)= 0. A risk ratio of 0.47 shows that vitamin use is a moderate inverse predictor of colon cancer. In this study population, vitamin use was protective for colon cancer. 2 Among people who eat a high fat diet there are 1980/2474= 80% vitamin users Among people who do not eat a high fat diet there are 850/1650= 52% vitamin users So vitamin use is differentially distributed among the high fat and low fat diet exposure groups.
  2. The crude risk ratio (not stratified by vitamin use) is the risk of colon cancer from high fat diet exposure / the risk of colon cancer from low fat diet exposure. Crude risk ratio = (254/2474) / (150/1650) = 1. The risk ratio for colon cancer among vitamin users with a high fat diet is: Risk ratio = (150/1980) / (50/850)= 1.

The risk ratio for colon cancer among non-vitamin users with a high fat diet is:

Risk ratio = (104/494) / (100/800) =1.

The crude risk ratio of 1.13 and the vitamin-specific risk ratio of 0.47 (from question 1) are not in between the stratified risk ratios, they are both lower than the stratified risk ratios. Thus, the crude risk ratio is confounded by vitamin use.

Acknowledgement The authors of the Second Edition of the ERIC Notebook would like to acknowledge the authors of the ERIC Notebook, First Edition: Michel Ibrahim, MD, PhD, Lorraine Alexander, DrPH, Carl Shy, MD, DrPH, Gayle Shim okura, MSPH and Sherry Farr, GRA, Departm ent of Epidem iology at the University of North Carolina at Chapel Hill. The First Edition of the ERIC Notebook was produced by the Educational Arm of the Epidem iologic Research and Information Center at Durham, NC. The funding for the ERIC Notebook First Edition was provided by the Department of Veterans Affairs (DVA), Veterans Health Administration (VHA), Cooperative Studies Program (CSP) to prom ote the strategic growth of the epidemiologic capacity of the