Cohort Studies

Design, Applications, Strengths & Weaknesses of Cohort Studies

Cohort studies evaluate a possible association between exposure and outcome by following a group of exposed individuals over a period of time (often years) to see whether they develop the disease or outcome of interest. A cohort is a group of individuals who share a common characteristic, such as workers from the same factory. Participants are selected on the basis of exposure, and should be free of the outcome under investigation at the start of the study.

The incidence of disease in the exposed group is then compared to the incidence of disease in an unexposed group, and a relative risk (incidence risk or incidence rate) is calculated to assess whether the exposure and disease are causally linked.

Cohort studies may be prospective or retrospective, but both types define the cohort on the basis of exposure, not outcome.

Prospective cohort studies – participants are identified and followed up over time until the outcome of interest has occurred, or the time limit for the study has been reached. A temporal relationship between exposure and outcome can thus be established.

Retrospective cohort studies – exposure and outcome have already occurred at the start of the study. Pre-existing data, such as medical notes, can be used to assess any causal links, so lengthy follow-up is not required. This type of cohort study is therefore less time consuming and costly, but it is also more susceptible to the effects of bias. For example, the exposure may have occurred some years previously and adequate reliable data on exposure may be unavailable or incomplete. In addition information on confounding variables may be unavailable, inadequate or difficult to collect.

1. Issues in the design of cohort studies

  • Selection of study groups
  • The aim of a cohort study is to select study participants who are identical with the exception of their exposure status. All study participants must be free of the outcome under investigation and have the potential to develop the outcome under investigation.

    If the exposure is common, the study population can be selected before classifying individuals as exposed or unexposed, for example a selection of the general population. An occupational cohort includes individuals who share the same workplace, which may make it easier to follow the population up. If the exposure is rare, the study population may be chosen on the basis of exposure, to ensure sufficient exposed individuals are enrolled. For example, this may be workers at a particular factory who regularly handle a chemical of interest. The comparison group might be workers at the same factory whose roles do not bring them into contact with the chemical. If a control group is chosen from the general population, there is a risk of bias due to the ‘healthy worker effect’: the general population is usually less healthy than the workforce because it includes those unable to work due to illness.1

  • Measuring exposure
  • Levels of exposure (e.g. packs of cigarettes smoked per year) are measured for each individual at baseline at the beginning of study and assessed at intervals during the period of follow-up. When several exposures are being considered simultaneously, the non-exposed group should comprise all those with none of the risk factors under investigation.

    A particular problem occurring in cohort studies is whether individuals in the control group are truly unexposed. For example study participants may start smoking or they may fail to correctly recall past exposure. Similarly, those in the exposed group may change their behaviour in relation to the exposure such as diet, smoking or alcohol consumption.

    Exposure data may be obtained from a number of sources including medical or employment records, standardized questionnaires, interviews and by physical examination.

  • Measuring outcome
  • Outcome measures may be obtained from various sources including routine surveillance of cancer registry data, death certificates, medical records or directly from the participant. Note that the method used to ascertain outcome must be identical for both exposed and unexposed groups to avoid the risk of measurement bias. Information on outcomes should be collected by members of the study team who are blind to participants’ exposure status, to reduce the risk of observer bias.

  • Methods of follow up
  • The follow up of study participants in a cohort study is a major challenge, and it may take many years for a sufficient proportion of participants to have reached an outcome. A great deal of cost and time is required to ensure adequate follow-up of cohort members and to update measures of exposures and confounders, as well as monitoring participants’ health outcomes. Failure to collect outcome data for all members of the cohort will affect the validity of study results.

2. Potential sources of bias in cohort studies

A major source of potential bias in cohort studies is due to losses to follow up. Cohort members may die, migrate, change jobs or refuse to continue to participate in the study. In addition losses to follow up may be related to the exposure, outcome or both. For example individuals who develop the outcome may be less likely to continue to participate in the study. The degree to which losses to follow up are correlated with either exposure or outcome can lead to serious bias in the measurement of effect of exposure and outcome.2

Another major source of potential bias in cohort studies arises from the degree of accuracy with which subjects have been classified with respect to their exposure or disease status. Differential misclassification, when one group of participants is more likely to have been misclassified than the other, can lead to an over- or underestimate of the effect between exposure and outcome.2

3. Analysis of cohort studies

Analysis of a cohort study uses the ratio of either the risk or rate of disease in the exposed cohort, compared with the rate or risk in the unexposed cohort.

If follow-up times differ markedly between participants, a rate may be more appropriate.The risk ratio uses as a denominator the entire group recruited at the start of the study while the rate ratio uses as a denominator the person years which takes account of losses to follow-up.

Table 1. Calculation of the rate ratio from a hypothetical cohort study of smoking and cancer of the pancreas followed for 1 year.

From the data in table 1 taken from a hypothetical cohort study to investigate the association between smoking and cancer of the pancreas the relative and attributable risk can be calculated as follows;

The relative risk of 15 indicates that the risk of cancer of the pancreas is 15 times higher among smokers than non-smokers.

The attributable risk of cancer of the pancreas due to smoking is 1.4 cases per 1000 per year

This can be interpreted as: smoking accounts for 93% of all cases of cancer of the pancreas among smokers.

&nbs;&nbs;&nbs;&nbs;&nbs;&nbs;Standardised mortality or morbidity ratio (SMR)

Standardised mortality and morbidity ratios are another commonly used method of presenting results in a cohort study.

4. Strengths and weaknesses of cohort studies

Strengths

  • Multiple outcomes can be measured for any one exposure.
  • Can look at multiple exposures.
  • Exposure is measured before the onset of disease (in prospective cohort studies).
  • Good for measuring rare exposures, for example among different occupations.
  • Demonstrate direction of causality.
  • Can measure incidence and prevalence.

Weaknesses

  1. Costly and time consuming.
  2. Prone to bias due to loss to follow-up.
  3. Knowledge of exposure status may bias classification of the outcome.
  4. Being in the study may alter participant's behaviour.
  5. Poor choice for the study of a rare disease.
  6. Classification of individuals (exposure or outcome status) can be affected by changes in diagnostic procedures.

5. Examples of cohort studies3

One of the most famous examples of a cohort study is Sir Richard Doll’s study of the hazards of cigarette smoking in a cohort of nearly 35,000 British doctors. Baseline information about their smoking habits was obtained in 1951, and periodically thereafter. Cause-specific mortality was then monitored for 50 years and the results showed an excess mortality associated with smoking, chiefly due to vascular, neoplastic, and respiratory diseases.

Other cohorts include the Framingham cardiovascular studies, following people living in the town of Framingham, Massachusetts. Much of our current knowledge about heart disease, such as the effects of diet, exercise, and common medications such as aspirin, is based on this longitudinal cohort study. In the UK, the Whitehall studies have followed cohorts of British civil servants, demonstrating how groups in the cohort with differing levels of a characteristic such as cholesterol subsequently have different rates of ischaemic heart disease.

Cohort studies are also useful to study what happens if something unusual happens - a rare exposure - to a group of people. The most famous example of this is the cohort of people who were survived the atomic bomb explosions at Hiroshima and Nagasaki.

Doll R et al. Mortality in relation to smoking: 50 years' observations on male British doctors. BMJ 2004; 328 (7455): 1519

E G Rael et al. Sickness absence in the Whitehall II study, London: the role of social support and material problems. Journal of Epidemiology and Community Health 1995:49 (5): 474-81

Cohort studies are addressed by the following MFPH Part A past paper questions:

  1. January 2003 – Paper IA, Question 1
  2. June 2004 – Paper IA, Question 2
  3. Jan 2006 – Paper IA, Question 2

References

  1. Bailey L, Vardulaki K, Langham J, Chandramohan D. Introduction to Epidemiology. Open University Press, 2005.
  2. Hennekens CH, Buring JE. Epidemiology in Medicine, Lippincott Williams & Wilkins, 1987.
  3. http://www.edmundjessop.org.uk/fulltext.doc - Accessed 10/01/09

© Helen Barratt, Maria Kirwan 2009