Use of routine vital and health statistics to describe the distribution of disease in time and place and by person

1. Sources of routine data

Various sources of routinely collected data are available for use in epidemiological studies. These include:

  • Demographic data from census and population registers
  • Death certificates
  • Cancer registrations
  • Birth registrations
  • Congenital malformations registrations
  • Infectious disease notifications
  • Hospital episode data
  • Health surveys
  • Royal College of General Practitioners weekly returns

Some of these data are collected routinely, on a national or regional basis, such as deaths, cancer registrations, births, fertility and infectious disease notifications. However the availability and quality of routine health statistics that can be utilized for epidemiological investigations may vary considerably:

  • Data may be collected on a systematic and ongoing basis or only intermittently (such as in health surveys)
  • They may be collected for the entire population or for a sample of the population
  • They may be available nationally, regionally or locally
  • They may be cohort or cross-sectional in concept
  • They may be person or episode based
  • Much of the mortality and morbidity data in the UK is collected separately for England, Wales, Scotland and Northern Ireland, making comparisons between countries more problematic, particularly as data collection may not be standardized between countries
  • The ability to compare mortality or morbidity, for example cancer rates or cancer survival, between countries may prove problematic because coverage, as well as both the methods of data collection and the classification of certain cancers; may vary significantly between countries.

In England most health statistics are published by the Office for National Statistics (ONS) and a large amount is now available electronically.


2. Demographic data

Demography is the scientific study of populations, including their size, structure, dispersal and development.

Basic demographic data are an essential part of any epidemiological investigation as they form the baseline count of the total population being studied. Reliable denominators are necessary for the calculation of the various measures of disease frequency in epidemiological studies.

The primary source of denominator data in the UK is derived from the national census which is carried out every 10 years. The last UK census was held in 2001 and the next one is scheduled for 2011. In addition to population counts by age and sex, the UK census also collects data on a number of socio-economic variables including ethnicity (since 1991), country of birth, accommodation, education, employment and long-term illness (since 1991).

There is no central agency that collates statistics for all the four home nations in the UK, so separate censuses are undertaken simultaneously by each of the three government census agencies in England and Wales, Scotland and Northern Ireland. Census data has become increasingly available in electronic format much of which can be downloaded from the relevant census office.

Issues in the use of census data

In the UK, the census is conducted every 10 years. As a result population data can become quickly out of date and may lead to spurious measures of disease frequency and effect. In addition data may be incomplete for some population sub-groups, for example those in hard to reach communities.

However, annual mid-year population estimates based on updates from the most recent census, taking into account births, deaths, net migration, and ageing of the population, are produced by the Office for National Statistics. Mid-year population estimates are also available for Scotland and Northern Ireland. In some, particularly urban, settings, there may be a significant issue with migration rates which can lead to potential inaccuracy in some local population projection estimates.


3. Mortality Data

Sources of routine mortality data in the United Kingdom

Complete and continuous mortality data are readily available in the UK. In England and Wales the information is derived from the registration of deaths certified by an attending medical practitioner or coroner (Procurator Fiscal in Scotland). Death certificates include information on the immediate and underlying causes of death, age at death, sex, address and occupation.

Death certificates are sent to the Office for National Statistics (ONS) where the underlying causes of death is classified according to the Tenth Revision of the International Classification of Diseases (ICD_10).

Mortality statistics for England and Wales are compiled by the Office for National Statistics (ONS) and published annually:

Mortality data for Scotland and Northern Ireland are available from the relevant national statistical offices.

Mortality data are more complete and reliable than many other routine sources of morbidity data, but it is still subject to some level of misclassification and can be influenced by a number of factors including:

  • Diagnostic accuracy
  • The certifying experience of the attending medical practitioner
  • Correct classification and coding of the death certificate


4. Morbidity Data

Sources of routine morbidity data

Routine data on morbidity are less readily available than data relating to mortality. However, a range of morbidity data is available that may be utilized for use in epidemiological studies and include:

  • Cancer registrations
  • Statutory notifications of infectious disease
  • Laboratory reporting of microbiological data
  • General practitioner reporting of clinical data
  • Hospital episode data
  • Data from health surveys
  • Morbidity surveys from general practice
  • Royal College of General Practitioner weekly returns

Cancer Statistics - England

Cancer registrations are conducted by nine independent regional registries that collect data on cancer cases in their regions. Regional registries supply cancer data to the ONS for the provision of cancer statistics

These data are published annually by the ONS (3-4 years after the year in which the cancer was diagnosed) in:

  • Cancer Statistics: Registration Series MB1
  • National and regional information on cancer incidence. Statistical tables are presented on numbers and rates of newly diagnosed cases of cancer by site, age-group and sex. Age standardized rates (using the European standard population) for the latest ten year period by cancer site and sex are also included.

    Available online at; http://www.statistics.gov.uk/statbase/
  • Cancer Registrations Wales
  • Cancer registrations in Wales are co-coordinated by the Welsh Cancer Intelligence and Surveillance Unit (WCIS). 

    http://www.wales.nhs.uk/sites3home.cfm?OrgID=242
  • Cancer Registrations Scotland
  • The cancer registration system in Scotland is coordinated by the information and statistics division (ISD) of the NHS in Scotland.

    http://www.isdscotland.org

    International Agency for Research on Cancer (IARC)
    International cancer incidence data including IARC Scientific Publication No.155: Cancer Incidence in Five Continents Vol. III

    Available at: http://www.iarc.fr/en/publications/pdfs-online/epi/sp155/index.php

Care is needed when examining trends over time or between different regions. For example cancer registries, may differ with respect to methods of data collection, completeness of registrations or recording of data items. Submission of data to the registries is voluntary. In addition, misclassification of cancer cases or changes in coding systems over time may affect the reliability of the data particularly when examining trends over time.

Hospital Episode Statistics (HES)

The Department of Health’s Hospital Episodes Statistics (HES) provide a wide range of information about inpatient treatment delivered by NHS hospitals in England. HES data has been collected since 1989 and comprises patient-based records of finished consultant episodes (ordinary admissions and day cases) by diagnosis, operation and specialty from NHS hospitals in England.1

More than 12 million new records are generated each year. Each contains a range of information about an individual patient’s hospital admission including;

  • Clinical information about diagnoses and operations
  • Data on age-group, sex and ethnicity
  • Geographical information on where the patient was treated and the area in which they live.
  • Date of admission, length of stay and time waited.
  • Whether the patient was on a waiting list or was admitted as an emergency

Prior to the introduction of the HES in 1989, the hospital in-patient enquiry (HIPE) service collected data on discharges and deaths, although this is not directly comparable to HES data.

Detailed information regarding the format and analysis of HES data is available online at

http://www.hesonline.nhs.uk

Statutory Notifiable Diseases

In England and Wales cases of certain infectious diseases must be notified to the proper officer of the relevant local authority. These data are analysed and published by the Health Protection Agency’s Centre for Infections (CfI). In addition to notifications a wide range of microbiological data is also collected.

Health Protection Agency: http://www.hpa.org.uk

Vital Statistics for England and Wales

Vital statistics for England and Wales are produced by the ONS and include data for the number of conceptions, live births, stillbirths and cause of death for persons in England and Wales by age and sex. Individual studies for specific diseases are also presented. Data is presented by local authorities, health authorities and wards.

Available from: http://www.data-archive.ac.uk/findingData/

Health Surveys

  • The UK General Household Survey

    The General Household Survey (GHS) started in 1971 and is carried out annually by the Social Service Division of the ONS. It comprises a random sample of 13,000 households in the UK. Data is collected for a number of socio-economic factors including health, use of health services, the prevalence of smoking and drinking, alcohol consumption, fertility history, employment, education and income.

    Available from: http://www.statistics.gov.uk/lib2002/default.asp

     

  • The Health Survey for England

    The Health Survey for England (HSE) was established in 1991 by the Department of Health and comprises a series of annual surveys about the nation's health. The HSE is designed to be representative of the general population in England and aims to provide a measure of the health status of the population.1 Annual surveys cover the adult population aged 16 and older living in private households. Children have been included in the survey since 1995.

    Each year the survey focuses on different demographic groups or diseases and their risk factors and looks at health indicators including cardiovascular disease, physical activity and eating habits. In addition to completing a health questionnaire, those surveyed are followed up by a nurse visit during which various physical measurements including blood pressure, lung function tests, blood and saliva are collected.1

    The Health Survey for England is available online from the Department of Health at:

    http://www.dh.gov.uk/PublicationsAndStatistics


5. Uses of routine health statistics

Routine health statistics may be utilised in a variety of epidemiological investigations including descriptive studies (cross sectional surveys or comparative mortality or morbidity studies) and disease surveillance. For example, cancer registration data may be used to examine trends in cancer incidence over time or variations in the incidence of specific cancers by age, sex, socio-economic status or geographic area. Cancer registration data may also be used to evaluate the effectiveness of cancer screening programmes or cancer care.

Routine or vital health statistics may also be used in analytical studies. However their usefulness will depend on their completeness, accuracy, coverage and comparability between different regions or countries.


6. Strengths and weaknesses of routine data

Routinely collected data may be subject to incompleteness, bias and measurement error.

Strengths

Weaknesses

Readily available Lack of completeness, with potential bias
Low cost Often poorly presented and analyzed
Useful for establishing baseline characteristics Where there are small numbers of cases, it may be possible to identify individuals, threatening confidentiality
Identify cases in a case control study Data may not be collected in a uniform way across the entire population
Generating aetiological hypotheses Techniques of data collection may vary geographically, e.g. recording data, coding
Derive expected numbers in a cohort study or as a source for ascertaining outcome in a cohort study Equivalent data not always available for all countries
Useful for examining trends of disease over time and by place Delay between collection and publication


7.  Using routine statistics in epidemiological studies2

Descriptive studies, exploring patterns of disease distribution in populations often make use of routinely available health data. The results can be used for a variety of purposes, including generating hypotheses about the possible determinants of disease and for service planning.

Data derived from routine statistics are usually analysed with respect to three main categories:

  • Time – when?
  • Place – where?
  • Person – who?

a) Time

There are three broad patterns of variation in disease incidence with time:

  • Secular (long-term) trends – changes in disease incidence over a number of years that do not conform to an identifiable cyclical pattern. For example, the secular trend in mortality from TB in England shows a steady decline over many years. However, this does not give any indication of the cause of the decline.
  • Periodic changes including seasonality – regular or cyclical changes in incidence, for example in infectious diseases. Cases of influenza typically reach a peak in the winter months.
  • Epidemics – strictly speaking, an epidemic is a temporary increase in the incidence of a disease in a population

b) Place

Variations in disease incidence by place also fall under three main headings:

  • Broad geographical differences – sometimes related to factors such as climate, or social and cultural habits. Some cancers show marked geographical differences in incidence.
  • Local differences – distribution of a disease may be limited by the localisation of the cases, for example a contaminated water supply.
  • Variations within a single institution – variations in attack rates by hospital ward, for example, may help identify possible sources or routes of spread of a gastrointestinal infection.

c) Person

The chances of an individual developing a disease may be affected by their personal characteristics, so exploring this area may provide useful clues about possible causes. Personal characteristics can be classified in several ways, but important features include:

  • Intrinsic factors such as age, gender, ethnic group and marital status – these are not amenable to change
  • Personal habits and lifestyle

 

References

  1. Leadbeter D, Harnessing Official Statistics, Radcliffe, 2000.
  2. Farmer, R., Miller, D. and Lawrenson, R., 1996. Epidemiology and Public Health

    Medicine
    (4th ed.), Blackwell Science, Oxford.

© Helen Barratt, Maria Kirwan 2009