Epidemiology A Manual For South Africa

EPIDEMIOLOGY
A MANUAL FOR SOUTH AFRICA
ann aren een em enee
Edited by ee
J]M Katzenellenbogen
G Joubert
SS Abdool Karim
Neen eneen
neem!
OREGON HEALTH SCIENCES UNIVERSITY
LIBRARY
SEP 24 1999
PORTLAND, OREGON
) Epidemiology :
A manual for South Africa
Editedby JM Katzenellenbogen
G Joubert
SS Abdool Karim
Cape Town
OXFORD UNIVERSITY PRESS
SOUTHERN AFRICA
Ugo
Oxford University Press Walton Street, Oxford ox2 6 pp, United Kingdom
Offices in
Oxford, New York
Athens, Bangkok, Calcutta
Cape Town, Chennai, Dar es Salaam, Delhi
Florence, Hong Kong, Istanbul, Karachi, Kuala Lumpur
Madrid, Melbourne, Mexico City, Mumbai
Nairobi, Paris, Singapore, Taipei, Tokyo, Toronto
and associated companies in
Berlin, Ibadan
Oxford is a trade mark of Oxford University Press
Epidemiology: A Manual for South Africa

ISBN 0 19 571308 7
First published 1991 by the Medical Research Council as
Introductory Manual for Epidemiology edited by J M
Katzenellenbogen, G Joubert, and D Yach. This expanded edition
first published 1997
© Oxford University Press Southern Africa 1997
All rights reserved. No part of this publication may be reproduced,

stored in a retrieval system, or transmitted, in any form or by any
means, without the prior permission in writing of Oxford
University Press. Enquiries concerning reproduction should be
sent to the Rights Department, Oxford University Press Southern
Africa, at the address listed below.
This book is sold subject to the condition that it shall not, by way of
trade or otherwise, be lent, re-sold, hired out or otherwise circulated
without the publisher’s prior consent in any form of binding or
cover other than that in which it is published and without a similar
condition including this condition being imposed on the subsequent
purchaser.
Published by Oxford University Press Southern Africa,

Harrington House, 37 Barrack Street, Cape Town 8001, South Africa
Text set in 8,5 pt on 10,5 pt Utopia by Photoprint and Unifoto
Reproduction by Photoprint and Unifoto
Cover reproduction by RJH Graphic Reproductions
Printed and bound by Creda Press, (Pty)Ltd, Eliot Avenue, Eppindust II
contents
List of contributors
Acknowledgements
How to use this book
Foreword
Preface
Section A Contextualizing epidemiology

Introduction
N Key concepts in epidemiology 10
General philosophical issues in epidemiology 25
Section B Epidemiological research methods and

protocol development
Planning a research project 49
Literature review 94
Setting objectives for research 56
Study design 64
Sampling 74
Data collection and measurement 82
Other issues
Seaman
oua to consider in the protocol 95
Section C Data presentation, analysis, and interpretation

11 An introduction to data presentation, analysis,
and interpretation 101
12 Integrating epidemiological concepts 124
Section D Common approaches in epidemiology
13 The use of routinely available data in
epidemiological studies 133
14 Disease surveillance 140
15 Health systems research (HSR) 147
16 Rapid epidemiological assessment 158
17 Community studies 164
18 Knowledge, attitude, belief, and practice
(KABP) surveys 169
19 Qualitative methodology: an introduction 176
Section E Epidemiology applied to content areas

20 Mortality studies 187
21 Outbreak investigations 196
22 Estimating immunization coverage 205
23 Environmental epidemiology 211
24 Occupational epidemiology 218
25 Disability studies 224
26 Psychiatric epidemiology 230
27 Anthropometric studies 243
28 Assessment of dietary intake 248
29 Measurements in dental epidemiology 254
30 Health economics 261
APPENDIX 1: Standardization (adjustment) of rates 213

APPENDIX 2: Standardization of observers and
instruments Zee
APPENDIX 3: Derivation of the 30 x 7 EPI sample 243
APPENDIX 4: Random number table 276
Bibliography Zot
Index 289
list of contributors
Max Bachmann, MB BCh, DOH, MSc, FFCH, Dale McMurchy, ma, Health Economics Unit,
Department of Community Health, University of Department of Community Health, University of
Cape Town Cape Town
Donald Berry, ms Bch, University of the Carol A Metcalf, mB BCh, BSc (Hons), MPH, Department
Witwatersrand of Epidemiology and Human Papillomavirus
David Bourne, Bsc, B Phil, Department of Research Group, University of Washington, Seattle
Community Health, University of Cape Town Mohamed H Moola, BDs, LDSRCS, DDPH, MSc,
Debbie Bradshaw, Bsc (Hons), MSc, D Phil, Faculty of Dentistry, University of the Western
CERSA, Medical Research Council Cape
David Coetzee, BA, MB BCh, DTM&H, DPH, FFCH, Jonathan E Myers, Bsc, MB BCh, DIM&H, MD,
Department of Community Health, University of Department of Community Health, University of
the Witwatersrand Cape Town
Nicol Coetzee, MB BCh, FFCH (SA), DIM&H, DA, Charles D H Parry, B Soc Sci, BSc Hons, MSc, MA, PhD,
Department of Community Health, University of National Urbanisation & Health Research
Cape Town Programme, Medical Research Council
Judy M Dick, Ba, B Soc Sc, MSc Med, PhD, National Linda M Richter, php, CERSA, Medical Research
Programme for TB Research, Medical Research Council
Council Leslie Swartz, MSc, PhD, Child Guidance Clinic,
Athmanundh Dilraj, Bsc, BSc (Hons), M Med Sc, Cape Town
CERSA, Medical Research Council Donald H Skinner, B Bus Sc, B Soc Sc (Hons), MA,
John S S Gear, BSc, MB BCh, DTM&H, DPH, D Phil, FCP, Trauma Centre for Victims of Violence & Torture
Wits Rural Faculty, University of the ’ Elmar Thomas, MB BCh, B Soc Sc (Hons), MA,
- Witwatersrand Department of Medical Anthropology, University
Margaret N Hoffman, MB BCh, BSc (Hons), of California, Berkeley
Department of Community Health, University of Stephen M Tollman, Bsc, MB BCh, MA, MPH, Health
Cape Town Systems Development Unit, Department of
CarelB IJsselmuiden, MD, FFCH, MPH, DPH, DTM&H, Community Health, University of the
Department of Community Health, University of Witwatersrand
Pretoria, CERSA, Medical Research Council Yasmin E R von Schirnding, BSc, BSc (Hons), MSc,
Georgina Joubert, Ba, MSc, Department of PhD, Greater Johannesburg Transitional
Biostatistics, University of the Orange Free State Metropolitan Council, Department of
Salim S Abdool Karim, MB BCh, MSc, Dip Data, FFCH, Environmental Health, Johannesburg
M Med, CERSA, Medical Research Council Hester van der Walt, Dip Nursing, Midwifery,
Judith M Katzenellenbogen, Bsc, BSc (Hons), MSc, Paediatric Nursing, Community Health, mea,
CERSA, Medical Research Council CERSA, Medical Research Council
Louise Kuhn, MA, MPH, PhD, Gertrude H Sergievsky Petro Wolmarans, Msc, National Research
Center, Columbia University, New York Programme for Nutritional Intervention, Medical
Marietjie L Langenhoven, Msc, National Research Research Council
Programme for Nutritional Intervention, Medical Derek Yach, MBBCh, BSc (Hons), MPH, Policy Advisory
Research Council Coordinating Team, World Health Organization
Catherine Mathews, msc, CERSA, Medical Merrick Zwarenstein, MB BCh, MSc(Med), MSc,
Research Council, Department of Community Health Systems Division, CERSA, Medical
Health, University of Cape Town Research Council
acknowledgements how to use this book
We acknowledge the substantial contribution of This book is divided into sections, each of which
Derek Yach both as co-editor of the 1991 edition addresses a particular aspect of epidemiology.
of the Introductory Manual for Epidemiology in Sections A, B, and C explain the theory and
Southern Africa, as well as for conceptualizing the principles of epidemiology. Here the basic
idea of producing such a manual. abstract ideas and skills of epidemiology are
We thank Neil Cameron, Margaret Hoffman, and described in a way that is designed to be clear and
Marian Jacobs for contributing to the epidemiologi- accessible not only to medical students and
cal manual used in the 1987 workshop at the annual professionals, but also to others who are interest-
Epidemiological Society of Southern Africa con- ed in epidemiology. You will need to read all three
ference; Debbie Bradshaw, Yusuf Chikte, Nicol of these sections, preferably in order, to under-
Coetzee, Di Cooper, Rudi Eggers, Margaret Hoffman, stand how epidemiology works, and before you
Sarie Human, Rod Jackson, Marian Jacobs, Trefor can go on to make full use of Sections D and E.
Jenkins, Neil Myburgh, Sue Naidoo, David Rees, Sections D and E constitute the ‘applied’ part of
Robert Schall, Donald Skinner, Malcolm Steinberg, this manual; they show how the theory and prin-
Krisela Steyn, Mary Lou Thompson, and Ruth ciples explained in the first three sections can be
Watson for reviewing parts of the manual, Tryphina applied in different subject areas of health. So
Masuku, Amelia Mulder, Wynda van Eyssen, and these last two sections demonstrate what
Nazneen Ally for typing, administration, and co- happens when epidemiology is used in specific
ordination, Jenny Altschuler and Sharoni Cohen for disciplines and ‘in the field’. Here you can
selected graphics, Helen Moffett for her valuable browse, choosing to focus on whatever particu-
editorial contribution and guidance, and finally the larly interests you. Or you can look up various
many students who attended our various basic subject areas to get an idea of how epidemiology
epidemiology courses and urged us to produce this can be used in the various disciplines that you
book. study or practice.
Copyright acknowledgements can be found on This manual also introduces and explains the
p. 288. technical terms used in epidemiological study
This book was made possible by the financial and practice. The first time that a term appears in
support of the Medical Research Council. the text, it is defined. If you are browsing and
would like to check the definition of a term, or if
you need to look up a term, turn to the index. The
first page number that is listed for a term will have
MEDICAL RESEARCH COUNCIL the definition of the term (which will be marked
CERSA in bold for easy identification) on that page.
The bibliography contains both references and
further reading for those who wish to explore
topics in more detail.
foreword
‘Catch-up growth’ describes the accelerated the apartheid state began to break down, a
response of a stunted child once the adverse number of gifted young epidemiologists sought
circumstances that caused the stunting are training both within the country and abroad. A
removed. This phenomenon serves as a metaphor second and simultaneous need is to transmit to
for what must happen in the new South Africa ifa all public health workers the basic elements of the
new public health is to break out of the caked and discipline needed to work in the field. To meet
soiled shell that encased the country in the this need, the new generation of epidemiologists
apartheid era. While the effects on the population must spread the knowledge they have acquired
will be felt for generations to come, much that is beyond the narrow bounds of those with whom
new will have to be done to ensure long-term they can make personal contact. With this book,
good health for all South Africans. they can begin the process.
Public health and its basic sciences were among This book takes an essentially practical
the fields most starved during the apartheid era. approach that should make it accessible to
This impoverishment of South African epidemiol- professional health workers. It incorporates
ogy and public health must be dealt with in this substantial applications of epidemiology in the
new and hopeful era. More than that, it must be primary health care arena, reinforcing the fact
quickly remedied. Public health is a key element if that the discipline must make a significant contri-
the historical impoverishment of the whole bution to the planning and evaluation of public
society is to be undone. Public health in South health care programmes. Furthermore, it takes a
Africa therefore has a mission, which can not be step towards raising the level of understanding of
effectively pursued without rebuilding and the results of epidemiological studies so central to
_ deploying the academic disciplines on which the policy analysis.
field must rely. Not only does South Africa need a new cadre of
Epidemiology, according to a recognised defin- epidemiologists, but the incorporation of epide-
ition, is the study of the distribution and the miology training for every kind of health worker.
determinants of states of health in populations, This book represents an important beginning in
with the object of preventing and controlling ill the education that is essential to create a healthier
health. There can be no rational and effective South African future.
public health without a foundation of epidemio-
logicai understanding. It is therefore important to Mervyn Susser
grasp the scope of epidemiological knowledge: Editor: American Journal of Public Health
how it is acquired and evaluated, what uncertain- New York
ties and likelihoods attend it, and how it is tested
and advanced. Rachel Gumbi
A first need is thus to build a new cadre of Chief Director: Human Resources Directorate
epidemiologists. South Africa is fortunate that in Department of Health
the years since the mid-1980s, as the isolation of Pretoria
* ** *
preface
Over the last decade and a half, epidemiology has A chance meeting in August 1994 with a repre-
undergone a sustained resurgence in South Africa. sentative from Oxford University Press resulted in
Increasingly, there has been a recognition of the the production of a fresh version of the manual.
need for epidemiology to provide a research base The new editorial team took note of the criticisms
that can inform policies to transform health in the and comments received from users of the manual.
country. With the co-operation of the contributors, the text
Epidemiological skills in South Africa, however, was revised and extended, and the examples and
were few and the Epidemiological Society of information updated to produce a new book. We
Southern Africa, recognising the broad education- are grateful to all the contributors who, despite
al needs for the discipline, organised training their busy schedules, made the effort to improve
workshops both regionally and nationally to their sections or write new sections.
address this need. Workshops, organised in In this book, as with the first manual, we have
tandem with annual Epidemiological Conferences, aimed to give a comprehensive introduction to
have been particularly successful and have epidemiology, with a specific South African
reached a wide range of health professionals inter- flavour, using local examples and highlighting
ested in epidemiology. The demand for copies and local issues. An effort has been made to further
reprints of the manuals and booklets produced for simplify the language to make the text accessible
participants of these workshops prompted us to to those whose first language is not English. We
produce a more comprehensive manual which have avoided complex formulae as much as
incorporated aspects of earlier material, but with a possible, noting that the manual does not replace
broader epidemiological scope, to coincide with a good introductory statistical text. We have dedi-
the 10th Annual Epidemiological Conference held cated a chapter to qualitative research as well as
in Cape Town in July 1991. citing uses of such research throughout the book
A wide range of contributors ably provided the to highlight how both qualitative and quantitative
material for publication and the technical produc- research are useful and often complementary
tion was done by staff of the Medical Research tools to investigate public health issues.
Council. The manual was launched at the confer- This manual targets the community of health
ence, and complimentary copies were sent to workers — and interested members of the public
institutions where epidemiology was being — who have little or no experience of epide-
taught. Lecturers and other users of the manual miology. We felt that it was important that this
were asked to evaluate it. book meet the needs of all health workers (such as
Increasingly, the manual was being prescribed doctors, nurses, and physiotherapists) who are
for epidemiology courses at universities, nursing starting their first research projects. In order to
colleges, technikons, and since 1992 at the newly meet this need, this book stands as a comprehen-
developing Schools of Public Health. By mid- sive text on epidemiology while retaining a pract-
1994, the 1 000 copies had run out and the editors ical, down-to-earth approach.
had dispersed geographically. The future of the
manual was uncertain, in spite of an obvious Judy Katzenellenbogen
need for a basic local epidemiological text and the Gina Joubert
very positive response by users of the manual. Salim Abdool Karim
* K K *
SECTION A
Contextualizing epidemiology
Introduction
J pay eter negen, JGear, STollman
Key concepts in epidemiology

Causation of disease
J Katzenellenbogen, M Hoffman
Levels of prevention
J Katzenellenbogen, M Hoffman
Measures of the frequency ofhealth evenis _
G Joubert |
Demography
D Bourne, G Joubert
Health indicators
J Katzenellenbogen
General philosophical iissues in

epidemiology
Ethical aspects of epidemiolagical research —
C IJsselmuiden
Community participation
J Katzenellenbogen, C Mathews, H van derWalt
Planning for the implementation of research —
J Dick
Introduction
When you study epidemiology you will find the own ‘bag of research tools’ which can be used to
textbooks are full of surprising facts. Do you know further knowledge in the field.
who described the benefits of fresh fruit in the The research methods and findings have to be
prevention of scurvy? It was Andrew Lind, an open to scrutiny by others and peer review of
officer in the British Royal Navy. What has the research is an essential aspect of the research
Broad Street pump got to do with cholera? It was process.
the source of contaminated water during Broadly speaking, there are two types of
London’s cholera epidemic of the mid-19th research: basic (fundamental) and applied
century. John Snow was the man who recognized research. While both types of research lead to the
the hazard and took the handle off the pump. creation of new knowledge, basic research gener-
Birth and death statistics have been collected for ates fundamentally new knowledge and tech-
centuries and recorded in church records. It was nologies requiring the involvement of highly spe-
Florence Nightingale in Britain who promoted cialized academics from the basic sciences. Basic
their use as a planning tool. Today, birth and mor- scientists are less interested in what can be done
tality data are indicators of changing circum- with the new knowledge: they really pursue the
stances in countriés and communities, and are knowledge for its own sake and may spend years
important weapons in the epidemiologist’s fight working before their research efforts bear fruit.
against disease. Applied research, on the other hand, investigates
South Africa too, had its early pioneers who current problems, and uses existing knowledge
contributed significantly to the control of disease and technology to address the problems.
through critical observation and analysis. It is In general, knowledge and understanding are
essential to develop these characteristics if you advanced by attempts to formally test a proposed
want to become an epidemiologist. It is like being theory. The way theories/hypotheses have been
a detective; the clues are always there. Your chal- tested has changed over time. Ways of establish-
lenge is first to find them and then to interpret ing causal relationships have been debated
what they mean. throughout the history of scientific philosophy.
So what is the essence of epidemiology and Currently, there are two main schools of thought
how can it make a difference to people’s health? on the growth of knowledge. Verificationists
To answer these questions we need to start by support the concept of new research findings
considering what research is and why it is done. being confirmed through repeated studies, that
is, new information is accepted when new
research findings are ‘sufficiently’ verified. Falsifi-
Research cationists, on the other hand, support the con-
Research is the systematic search or inquiry (‘re- cept that all knowledge is provisional until
search’) for knowledge. Scientifically acceptable refuted through research. This means that infor-
methods are used to investigate issues in order to mation, like existing knowledge, cannot be
arrive at valid conclusions. Every discipline has its proven conclusively but can become provisional
3
a contextualizing epidemiology
knowledge if repeated attempts to refute it fail. Public health (community health)

Knowledge, in this instance, is regarded as provi-
sional until a better explanation replaces current Public health is defined in Last’s Dictionary of
explanations. Epidemiology as ‘one of the efforts organised by
These philosophical differences have implica- society to protect, promote and restore the health
tions when it comes to conducting a study. Is your of the population. It is a combination of sciences,
study trying to refute a hypothesis or confirm new skills and beliefs that are directed to the mainte-
information? Falsificationists (also referred to as nance and improvement of the health of the
refutationists) have drawn from the philosophies people through collective and social action. The
propounded by Popper, while verificationists programmes, services and institutions involved
have drawn from the philosophical approach emphasise the prevention of disease and the
promoted by Bradford Hill and Susser (see health needs of the population as a whole. Public
Chapter 2: Causation of disease). health activities change with changing tech-
Generally, the public health ethic has de- nology and social values but the goals remain the
manded action to prevent and control disease same: to reduce the amount of disease, pre-
even if information is incomplete. Whether new mature death, discomfort and disability in the
information is derived through the approach of population. Public health is thus a social institu-
verificationists or falsificationists, public health tion, a discipline and a practice’ (Last 1988).
practitioners will view the finding no differently. Thus in contrast to clinical medicine, which
This highlights the need for public health practi- focuses largely on medical care of ill individuals
tioners to implement control programmes versus who present themselves for examination, diagno-
the need of scientists to ensure that a new finding sis and treatment, public health focuses on the
is valid. group or the community at large.
The combination of theory and observation Public health has its origins in attempts to
is still challenged because it is impossible to control epidemics through the segregation and
prove absolutely that the conclusions reached quarantine of affected individuals. During the
reflect ‘the truth’. Thus, philosophically, a funda- industrial revolution, methods of environmental
mental connection between cause and effect can control, including occupational health, were
never be proved, and the search for ‘the truth’ developed, and sanitary reform promoted.
continues. Further impetus was provided by the discovery
of vaccinations, of which smallpox was the first.
Why is research done? At the turn of the century, high-risk groups (for
People have many reasons for conducting example, children) were targeted for health
research. For some individuals research is merely promotion activities — introducing an aspect of
a means to an end, such as the attainment of personal health to public health practice.
further qualifications or public recognition. Some In Britain, public health practice was modelled
have entered the ‘paper chase’ rat race and focus around the concept of the Medical Officer of
solely on publications which emerge from the Health (MOH), who co-ordinated and directed
study. Sometimes the survival of the institute or health activities at district level, carving a domi-
department even relies on the income generated nant role for medically trained people in this field.
by publications. In the United States, students and practitioners
However, most people pursue an investigation were from the outset drawn from a wide variety of
out of an interest in the topic or a perceived need disciplines, and the field developed a strong
for information in their work situation. They use multi-disciplinary tradition.
the results of the research to further scientific Worldwide, the public health movement has
knowledge and/or as a rational basis for their been marginalized since the 1940s by the increas-
work. This may mean that the results can assist ing curative focus in health. The saying ‘Preven-
when making decisions, motivating for resources tion is better than cure’ has not won political
from funders and/or government agencies, or dominance, and public health programmes —
monitoring progress and evaluating programmes. which have the potential to improve the health of
Research can also be used to empower the sub- large groups — have had relatively low priorities
jects of the research or to gather information on national health budgets.
which can be used to advocate changes in policy The terminology used to describe the public
and resource allocation. health field has changed over time. In recent
4
1_sintroduction
times, the field has been referred to as ‘commu- providers; and communities. The idea is not that
nity medicine’ or ‘community health’ in some priorities will be set purely at national level, but
countries. In South Africa the term ‘community rather that national priorities will be forrned from
health’ is generally used and the field is still an ‘upward synthesis’ of local, district and provin-
largely under the control of the medical profés- cial concerns.
sion. However, people involved in a wide variety In South Africa, the ENHR framework has been
of disciplines are centrally involved in the field, endorsed by a wide range of organizations and
including nurses, health inspectors, sociologists, institutions as an organizing framework well
psychologists, biostatisticians and epidemiolo- suited to local needs. An ENHR strategy can be
gists. expected to strengthen the capacity of the provin-
Public health programmes require a variety of cial and national departments of health to
skills to ensure adequate planning, implementa- address important yet previously neglected prob-
tion and evaluation of the effort. Core disciplines lenis.
are epidemiology and _ biostatistics; health
services evaluation and administration; beha-
vioural and biological sciences; occupational and
Epidemiology
environmental health; health education and Epidemiology is ‘the study of the distribution and
health economics. determinants of health-related conditions and
events in populations, and the application of this
Public health research study to the control of health problems’ (Last
The findings of the Commission on Health 1988).
Research for Development (1990) indicate the key Descriptive epidemiology describes the occur-
role of research in improving health, especially rence of disease and other health-related charac-
the health of disadvantaged groups. The commis- teristics in human populations in terms of person
sion concluded that health research is a vital (the who: age, sex, social class), place (the where:
investment in the future and that the seeds of geographic location) and time (the when: season,
research can yield a wealth of effective action. time of day/week).
The commission sees health research as serving Analytic epidemiologic studies investigate the
two main purposes. Firstly, country-specific association between a given disease or health
research is essential to determine health prob- status and possible causative or protective fac-
lems, analyse measures for dealing with them and tors. An attempt is made to explain why health-
help decide on actions to achieve health improve- related outcomes occur ‘and whether interven-
ments with limited resources. The second area, tions work.
global health research, relates to the development Epidemiology forms the research arm of public
of new knowledge and technology to cope with health, providing the scientific basis upon which
the major unsolved problems worldwide. This public health policy decisions are made. It pro-
category of research requires global co-operation. vides methodological research skills in order to
The commission called these two purposes — manage public health problems in a more effec-
country-specific and global health research — tive way.
‘essential national health research’ (ENHR): An important distinction to make is the differ-
‘essential’ because they address a country’s prio- ence between the terms ‘the epidemiology of a
rity health problems, and ‘national’ referring to particular disease’ and ‘epidemiology the disci-
the importance of research that is directly tar- pline’. The epidemiology of a particular disease
geted at local or national level problems. While describes the distribution and determinants of
every country should develop a strong capacity to that disease. So for example, the epidemiology of
undertake country-specific research, each coun- HIV infection demonstrates that young adults
try needs to consciously decide for itself the aged 16-30 years have the highest incidence rate
resources it can contribute to the global health of HIV infection, women have a threefold higher
agenda. risk of HIV infection and that approximately 25%
ENHR is, in effect, a strategy for setting prior- of children born to HIV positive women are
ities, and managing and developing a country’s infected with HIV.
resources for health research. It depends crucially Epidemiology the discipline describes particu-
on working partnerships between researchers; lar methodological tools/approaches used to
health service policy-makers, managers and gather information about the prevalence, distri-
5
bution, determinants and course of any condi- are surprised to learn that 50 years ago the
tion. It is epidemiology the discipline that forms country embarked on a series of PHC experi-
the content of this book. ments which gained the attention of practitioners
and researchers worldwide.
The role of epidemiologists This started in 1938 with a nutrition survey
Epidemiologists function at two levels, often which documented, for the first time, the health
simultaneously. On the first level, as ‘pure’ scien- status of South Africa’s African majority. The
tists, they try to find ‘truth’ or the real cause(s) of study, initiated by Harry Gear (then Deputy Chief
disease, encouraging sceptical debate about Medical Officer of the Union), Sidney Kark and
causal inference. At a second level they function Harding Le Riche, concluded that: ‘Diet defi-
as public health officials, where they advise others ciency diseases, syphilis, malaria, bilharzia,
involved in making public health decisions. In tuberculosis, scabies and impetigo, and prevent-
these public health roles, they and the public they able crippling ... form no small array of factors
serve must recognize the crucial need to act in contrary to the maintenance of good health ... No
such a way as to err on the side of public health. amount of juggling can succeed in separating the
This often means taking action in the presence of influence of one as opposed to the others ... The
scientific uncertainty and it also means that in- outstanding fact is that they are all preventable
correct decisions may be made. Epidemiologists ...’ (Kark 1994).
have a key role in ensuring that decisions Faced with this reality, and without a ready-
involving policies and practices are continuously made solution, in 1940 the government initiated a
evaluated and redefined in the light of new rural health unit staffed by Kark, his colleague and
evidence. wife Emily, and Edward and Amelia Jali. Within
five years, work at Pholela provided evidence for a
revolutionary model of health care provision
Epidemiology in South Africa (Slome 1962; Tollman 1994).
For centuries South Africans have been cared for What, then, was so extraordinary about
by herbalists. These skilled practitioners had, by Pholela? The work is impressively documented
observation and analysis, identified plants with and well worth reading (Kark 1962), but we single
therapeutic properties. If we were to learn more out three innovations:
of their ways and techniques, there can be no analysis of the health and health-related issues
doubt that we would describe some of them as the of groups or communities (community diagno-
ancestors of modern epidemiology. sis) parallel with individual patient assessment;
In modernizing, South Africa became exposed use of epidemiology as a tool for daily use, to
to diseases of overcrowding and urbanization. monitor, evaluate and inform health actions;
The mining industry heralded an explosion in recognition that social, cultural and environ-
respiratory disease. Pneumonia was a huge mental factors are crucial determinants of
problem; Sir Spencer Lister, one of the early direc- community health status.
tors of the South African Institute for Medical Re-
search, was a pioneer in pneumococcal vaccine We take the last point for granted, but have yet to
development. The 1918 flu epidemic led the fully incorporate our understanding of these
health planners of the day to lay down strict na- determinants to strengthen our health promotion
tional health measures to control and monitor the efforts, whether the problem is diarrhoea and the
spread of infectious disease. Their rules were use of oral rehydration therapy (ORT) or ischae-
guided by epidemiological principles. Siegfried mic heart disease and exercise programmes.
Annecke helped to dramatically reduce malaria in Using tuberculosis as an example, the near-
South Africa through his observation of the effi- legendary John Cassel gives insight to the under-
cacy of DDT in mosquito control. standing of local customs and values needed if
In the realm of primary health care epidemio- new ideas are to fit with existing cultural frame-
logy, South Africa was a world leader by the mid- works (Cassel 1955).
1940s.
From model to movement: Gluckman, the
Pholela and the roots of primary health care IFCH, and COPC
(PHC) Henry Gluckman, later Minister of Health,
Given South Africa’s painful history, many people chaired the National Health Service Commission.
6
1 _sintroduction
In 1944, this commission recommended a coun- on the epidemiology of rheumatic heart disease, a
trywide network of health centres as the basis for major cause of morbidity and mortality in urban
fundamental health reform in South Africa blacks, was limited. However, the definitive study
(Gluckman 1945). Their recommendation was was conducted by Margie McLaren, who per-
based on the Pholela experience, the pilot study. suaded the clinical cardiologists of Johannesburg
Forty health centres were established. Eight in to join her in a massive survey of Soweto school-
and around Durban formed practice-research children. This survey revealed many cases of
sites for an Institute of Family and Community advanced and previously undetected rheumatic
Health (IFCH). Linked to Natal University, the heart disease. Her work stimulated a number of
Institute trained staff for health centre teams, major attempts at rheumatic heart disease
including a new category — ‘health recorders’ — control.
to maintain community, household and family What about South Africa’s major killers; gastro-
records. enteritis, pneumonia, malnutrition, violence,
Although abruptly and prematurely curtailed tuberculosis, cervical cancer, cardiovascular
by the political events following the change of disease? Did epidemiologists pay enough atten-
government in 1948, such efforts caught the inter- tion to these priorities? Turning the pages of the
national imagination. They showed the link South African Medical Journal during this period
between community epidemiology, primary care tells us that they did not; information about the
and community health. Based on adaptations distribution, causation and control of many of
from many countries — the USA, Israel, Spain, these disorders remained inadequate. We have
Nicaragua, Britain — this approach became virtually no useful baseline information about
known as ‘community oriented primary care’ or how effective our interventions have been.
COPC. As an important research base for the Epidemiology and epidemiologists have let us
World Health Organization and Unicef’s ‘Health down — certainly in the mid-70s, priorities were
for All’ strategies, COPC continues as one of still with the exotic and the intellectually novel.
several broad strands making up the fabric of There are a few notable exceptions: in the areas of
contemporary primary health care (Tollman nutrition and iron metabolism, to mention just
1991). l two, epidemiologists were doing work which
‘Failure to build upon [such] early models cost would make an impact on South Africa’s health
the country dear, not only in financial terms but problems.
also,'and more importantly, in terms of prevent-
able death, disease and disability’ (Yach, Tollman Epidemiology in the 1980s
1993). After all this time, the work done in places Until the 1980s, much of South African epidemio-
such as Alexandra (Alexandra Health Centre logy had been targeted at disease description, had
1991), Mamre (Mamre Community Health been conventional in its adherence to a biomed-
Project 1988), Gelukspan (Bac 1990) and else- ical paradigm, and had steered away from
where is once again inspiring South African confronting South Africa’s greatest disease, apart-
efforts in district health development. heid.
All of this changed in the 1980s. A ‘new’ genera-
South African epidemiology in the 50s, 60s tion of epidemiologists emerged. The thrust of
and 70s this new wave of epidemiology spread across the
From the 1950s to the mid-1970s, the science of country. It was essentially a multi-disciplinary
epidemiology was developed as acritical tool movement, but was somewhat under-repre-
both in the description of disease and in monitor- sented by nurses and environmental health offi-
ing the control of disease. Many diseases were cers. Centres of endeavour were found at univer-
notifiable and research scientists used this sities, research organizations and rural hospitals.
routinely collected data to develop hypotheses Notable activity in the training of nurses in field
about causation and distribution of disease. A few epidemiology took place in the small epidemio-
examples from Gauteng illustrate the kind of logy unit in the Department of Health. This unit,
epidemiological research done during this with limited resources, also conscientiously
period. analysed the routinely collected national data,
Gear’s work on the Coxsackie virus, poliomye- reporting findings monthly in the publication
litis and rickettsial diseases such as tickbite fever, Epidemiological Comments.
had a strong epidemiological foundation. Work The notable thing about the ‘new’ epidemio-
7
logy was its content. It began to address quality and epidemiology in South Africa is being
and provision of care. It confronted unequivocally strengthened by this. Indeed, international orga-
the horrendous inequalities in mortality amongst nizations such as the World Health Organization
South Africa’s population groups. It revealed in- are increasingly utilizing the epidemiological
equalities in per capita expenditure on health expertise in South Africa to strengthen their pro-
between the ‘homelands’ and white South Africa. grammes and efforts.
It pointed to the horrific working conditions
experienced by much of South Africa’s labour The practice of epidemiology in the
force. Cholera epidemics were revealed to be
almost exclusively epidemics within ‘homelands’.
rest of Africa
Immunization rates in white cities were high; in The practice of epidemiology in Africa is, on the
black townships, they were abysmally low. whole, weak. A large proportion of epidemio-
Epidemiology’s torch shone into the dark, logical research in Africa is done by organizations
excluded and forgotten corners of South Africa’s based in Europe and the United States, without
disease cave and what was hidden in these always involving indigenous Africans. There is
corners was disgraceful. Epidemiology brought concern about foreign agencies ‘using’ Africa as a
legitimacy and factual support to the rhetoric of laboratory for research. This can result in the
the politically oppressed. The struggles for health development of undue dependency and failure to
and political rights had been irresistibly joined. increase epidemiological expertise on the conti-
The evidence would not go away, although many nent.
tried to suppress the emerging information. However, some organizations, such as the
Ulma group in Sweden (which has been active in
Coming of age Tanzania, Ethiopia and Somalia in particular)
It is clear that much of the basis for a vibrant have a long-term commitment to their projects.
culture of epidemiology now exists in South In these cases, development of local expertise is
Africa. already evident. The French INSERM group,
The Epidemiology Society of Southern Africa London School of Hygiene and Tropical Medi-
(ESSA), founded in 1982, has become a meeting cine, and the Tropical Disease Research Pro-
place for the learned, the learning and anyone gramme of the WHO have been focusing strongly
else remotely interested in public health research on developing epidemiological expertise in parts
and the measurement sciences. of Africa. The Centers for Disease Control (CDC)
Epidemiology is now being taught in courses have initiated the global Field Epidemiology
dedicated to the subject. These courses, first Training Programme on the African continent
developed in the Cape, are forming the founda- and have shown substantial involvement in 11
tion upon which epidemiology is being taught in countries through the Combating Childhood
the schools of public health which emerged in the Communicable Diseases programme.
mid-1990s. There is a commitment to strengthen- There is also a great need for further formal
ing epidemiological training and practice among training courses in epidemiology. The WHO has
nurses. Occupational health epidemiology flour- been instrumental in instituting two Master of
ishes in Cape Town, Johannesburg and Durban. Public Health (MPH) programmes in Nairobi and
The Centre for Epidemiological Research in Dar-es-Salaam that have a strong epidemio-
Southern Africa (CERSA) has played a unique logical bias. In Southern Africa, the Blair Research
role, providing first a training ground, and then Laboratory focuses on tropical disease research in
ample opportunity for methodological and Zimbabwe, and the WHO Ndola group fulfils a
thematic applications: from urbanization to in- similar role in Zambia. Most countries, however,
fectious diseases to chronic diseases of lifestyle. still lack a core group inside the country to
And health policy and health systems research develop epidemiological expertise.
and development have been identified as critical
to the thrust of health reform. Epidemiological
skills are being drawn upon extensively in the
The 1990s and beyond
development of health policy on the national and This overview presents a chequered history: origi-
regional levels. nal contributions in the 1940s and 50s; consider-
International contact and recognition, minimal able achievement accompanied by a turning
in the pre-democracy years, are now abundant away from harsh reality in the 60s and 70s; and
8
1_sintroduction
highly motivated re-engagement during the 80s For epidemiologists, to nationalize their exper-
and 90s, accompanied by the laying of a strong — tise. There are structures helping to achieve
though still incomplete — foundation. closer collaboration. The emerging Essential
A host of challenges lie ahead. Some of those National Health Research framework, the pro-
that stand out are the following. gressive Primary Health Care Network and the
For departments of community health, nursing Community Health Association of Southern
colleges and the new schools of public health, Africa are suitable structures.
to strengthen their expertise and ensure that @ For CERSA, to become a truly national resource,
essential epidemiological skills are soundly better able to respond to geographically
acquired by the new generation of public peripheral needs, and with its expertise more
health practitioners and researchers. accessible to universities.
¢ For clinical departments and medical faculties, @ For us all, to extend our working relationships
through the tools of clinical epidemiology, to with our neighbours in Southern Africa, as well
improve clinical judgement, understand costs as colleagues in the rest of Africa.
and consequences, and appreciate that quality
of care is both science and art, technique and Epidemiologists have risen above the polarizing
judgement. differences of the past. Today we look forward to
@ For policy-makers, managers and providers, to building on the expertise and the capacity for crit-
cultivate the critical outlook and open-minded- ical analysis we already have, and on a new gene-
ness that recognizes policy analysis and pro- ration of energetic people, whose commitment to
gramme evaluation as integral to sound health justice and equity will ensure they pursue the
practice. truth as best they can in the interests of a healthy
For the epidemiological method, to embrace new South Africa.
more analysis and less description. We know
much about the problems; we still have to get to
grips with what works, what does not and why.
Key concepts in
epidemiology
Causation of disease ¢ age of onset of hepatitis B virus infection below

the age of one year;
The definition of epidemiology reflects the cen-
current age over 30 years
tral role that the identification of factors involved
can constitute a necessary cause of hepatocellular
in the causation of disease has in the discipline.
cancer.
For example, epidemiological (and other) re-
A necessary cause can thus be a combination of
search has given us insight into factors causing
risk factors that must be present in order for the
tuberculosis. This enables us to say that tuber-
disease to occur.
culosis is an infectious disease caused by the my-
A sufficient cause, on the other hand, will in-
cobacterium Tuberculosis found especially in
variably lead to disease if it is present. A sufficient
malnourished or immuno-compromised individ-
cause is distinct from a necessary cause: for
uals, often from socio-economically deprived
example, the poliovirus is a necessary cause for
environments. Clearly, this description is an over-
paralytic polio, but it is not a sufficient cause
simplification, but it does highlight the nature of
because only about one out of 50-100 people who
disease causation where multiple factors are
become infected with poliovirus will develop
involved.
paralytic polio. The rabies virus is one of the very
few risk factors which is both a necessary and a
Necessary and sufficient causes of disease sufficient cause. Clinical rabies can only occur if
Causes of disease are rarely a single entity. Several infection with the rabies virus occurs (necessary
factors often need to be present before the cause) and clinical rabies will always occur if in-
disease process occurs and becomes clinically fection with rabies virus occurs (sufficient cause).
evident. These factors are referred to as ‘risk Most single risk factors are neither necessary
factors’, ‘agents’, ‘causes’, ‘causative factors’, and nor sufficient causes. For example, lung cancer
a variety of other terms. For the purpose of this occurs among non-smokers (smoking is not a
section, the term risk factors will be used. necessary cause) and many smokers do not get
There are two central concepts in the causation lung cancer (smoking is not a sufficient cause).
of disease. The first comprises the risk factor or Smoking, however, is an important risk factor for
group of risk factors that is essential before a lung cancer (as well as many other diseases) and
disease occurs. For example, malaria cannot is widely accepted as a cause of lung cancer. The
occur unless the malaria parasite, plasmodium concept of cause is not synonymous with a neces-
species, is present. A risk factor that must be sary or a sufficient cause, since many causes of
present before a disease occurs is called a neces- disease are neither necessary nor sufficient.
sary cause. A necessary cause need not be a single The organisms which cause infectious diseases
risk factor. are necessary causes and are sometimes referred
The following combination of three risk factors, to as ‘agents’. This concept of ‘agent’ is useful to
namely: distinguish the role of the organism from that of
¢ hepatitis B virus carrier state; the environment and the host (human beings).
10
2 = keyconcepts in epidemiology
Other factors related to the host and the environ- does not hinge on the agent alone. Tuberculosis
ment may need to be present for the disease to specifically, and disease in general, cannot be
occur. For example, infection with the TB bacillus attributed to one factor only — hence the accep-
is a necessary but not sufficient cause of tuber- tance of the concept of multiple causation. This
culosis disease, that is, even if the agent is multi-factorial nature of disease can be under-
present, the disease will not definitely occur. stood in terms of the interaction, often complex,
Other factors must be present as well. between three groups of factors: those related to
Factors related to the host which commonly the agent, the host and the environment. The
influence occurrence of disease include age, sex, concept applies as much to infectious diseases as
social class, status, lifestyle, and psychological it does to non-infectious ones, although, as men-
character. tioned above, in non-infectious diseases there
Environmental (extrinsic) factors include, for may be no single agent.
example, the presence of disease-transmitting The interaction of these three groups of factors
vectors and the availability of food (biological largely determines an individual’s state of health
environment); social customs related to living at a given point. The health status of a person may
habits, interpersonal behaviour and attitudes be seen as an equilibrium between the agent-
(social environment); the economic organization host-environment with the host and the agent
of the society and the level and distribution of balancing on a fulcrum of the environment — the
wealth (economic environment); and climatic epidemiological triangle (Figure 2.1). Harmful
conditions and the presence of industrial waste change in any of the factors could lead to a
products (physical environment). change in equilibrium, causing disease. Small
In non-communicable diseases, there is no negative changes can, at times, be compensated
‘agent’ and various concepts have been devised to for, if the existing balance is not too precarious.
understand the interplay of a multitude of risk The epidemiological triangle is particularly
factors which eventually lead to disease. useful for understanding the causation of infec-
tious diseases. Since the application of epidemi-
Models depicting multiple causation ology to chronic and non-infectious diseases,
An understanding of the disease and its causation new models have developed which de-emphasize
are wee Mee acm
Agent(s) Human host

(biological, nutrient, chemical, (age, Sex, race, genetic
physical, mechanical) factors, personality)
At equilibrium
Balance upset by Balance upset by

change in fulcrum po- changes in host and/or
sition Environment environment
(physical, social, eco-
nomic, biologic)
FULCRUM
Adapted from: Leavell H R, Clark E G, 1965.
11
the agent or organism (for example, there is no Establish whether the risk factor or exposure
known ‘organism’ for heart disease) by regarding occurred before the disease/outcome. That
it as one component of the environment. The idea is, the temporal sequence of the exposure
of a ‘web of causation’ was developed to depict and disease must be compatible with causa-
the extremely complex chain of events/factors tion.
that contribute to the occurrence of disease @ The stronger the association between the expo-
(MacMahon, Pugh 1970). Another model which sure and the disease, the more likely the expo-
puts less emphasis on the agent is that of the sure is to be a cause of the disease.
‘wheel’, where the biological, social and physical ¢ Ifastudy shows that an increased exposure
environment impinge on the hub or, core of the results in an increased occurrence of the
wheel corresponding to the individual’s genetic disease (for example, if the lung cancer rate
constitution. For different diseases, the different among heavy smokers is higher than among
components may play a more or less prominent light smokers), it meets the dose-response cri-
role in the causation of the disease (Mausner, terion.
Kramer 1985). Results from different studies using different
approaches in different settings should give
Evaluating research evidence to establish similar results (consistency). The reason for in-
causation consistent results should be explained.
While the above description of the factors associ- A plausible explanation of the mechanism by
ated with tuberculosis sounds straightforward, which the causative agent causes the disease
many laboratory, epidemiological, and clinical should be put forward to fulfill the criterion of
investigations had to be undertaken in order to biological plausibility.
arrive at this understanding. In general, an under-
standing of what causes a particular disease More recently, Susser has suggested a hierarchical
depends on an accumulated body of knowledge approach to applying the causal criteria. Rather
based on sound research. than a mechanical application of the use of causal
In the case of a living organism being an agent criteria, the hierarchical approach proposes a
or necessary cause of a disease, the following rational approach which begins with the strength
criteria were postulated by Koch: of association. If an association does indeed exist,
the organism must be present in every case of the next step is to determine the temporal relation-
the disease ship between cause and outcome. Only if the cause
the organism must be isolated and grown in occurs conclusively before the outcome are the
pure culture other criteria applied (Susser 1991).
the organism must, when inoculated into a
susceptible animal, cause the specific disease Conclusion
the organism must then be recovered from the It is essential to know about the different causes
animal and identified. of diseases and their interaction in order to
develop appropriate intervention programmes to
Koch’s postulates are of limited use because prevent or control disease. Health programmes
several diseases do not have animal models, and can have a major impact on disease directly at the
behavioural and environmental causes of disease level of the agent (for example, improved water
cannot fulfil his criteria. supplies decrease agents for diarrhoea), the host
More recently, guidelines have been developed (for example, immunization against measles pre-
for deciding whether a factor causes a disease or vents disease even when overcrowding occurs),
not. The emphasis is on strong research design and/or the environment (for example, improved
with evidence from experimental designs having road design decreases traffic accidents). Health
the most influence (see Chapter 7: Study design). policy and services (curative and preventive) thus
However, because such trials are rarely used to play an important role in determining the final
establish causes of disease in humans, strict crite- health status of the population.
ria are used to evaluate non-experimental (obser-
vational) studies investigating causation.
Sackett, Bradford Hill (Bradford Hill 1965; Sack-
Levels of prevention
ett, Haynes, Tugwell 1985) and others published The following anonymous poem, while by no
guidelines which can be summarized as follows. means a major literary work, contains an import-
12
2 keyconcepts in epidemiology
rs Cer eRe lem ll oeg
TELA 7J
SZ
LWW
y
SS
sy
PSssSe
YSSS
j
SOSsS
RSS
Seawe:
So
a) No prevention b) Secondary and c) Primary prevention

tertiary prevention
ant message for those interested in public health. While there seems to be agreement that pre-
vention is better than cure, traditionally very little
It was a dangerous cliff as they freely confessed, is done to actively promote prevention. This can
Though the walk near its crest was so pleasant; be understood when examining the historical
But over its terrible edge there had slipped development of medicine, where orientation
A duke and full many a peasant. toward treatment has dominated. Over the years,
So people said something would have to be activities have been restricted to treating the
done, injured and the sick who present themselves to
But their projects did not tally; health services. Only more recently has there
Some said ‘Put a fence around the edge of the been an appreciation that health responsibilities
cliff, need to be more extensive.
Some ‘An ambulance in the valley’. Before being able to take fuller responsibility in
the control and management of a disease, it is
This poem describes one of the major dilemmas important to understand both the multiple
found in modern health care. Much time and causes of the disease and the natural history of
money is spent providing ambulance and emer- the disease.
gency services at the bottom of the proverbial cliff
(Figure 2.2). Thought is seldom given to whether Natural history of disease
it might be better to go to the top of the cliff, build The natural history of disease refers to the sys-
a fence and put up notices warning people of the tematic description of the course of disease over
dangers of going near the edge — thus identifying time, unaffected by treatment. Several stages
the primary preventive needs of the community. form part of the natural history.
HS
During the susceptibility phase, risk factors disease and the lifestyle associated with this
only are present. should be targeted for prevention. In South
During the preclinical phase, the biological Africa, there are communities who are currently
process of the disease has started. There are undergoing urbanization and cultural transition,
some physical manifestations, but no disease is which should be targeted for primordial preven-
present. tion programmes. Government and legislative
¢ In the’clinical disease phase, signs and symp- support should protect the population from the
toms of disease are present. promotion and sale of unhealthy products, while
Recovery, the presence of chronic disease, dis- encouraging activities in all sectors of the
ablement, and death are all possible outcomes community to promote health (Steyn, Buch
of the disease process. 1992).
Once these stages have been documented, a com- Primary prevention: At this level, interventions
prehensive approach can be planned for the are aimed at healthy people — individuals or
management and control ofa disease. groups. Measures are taken to promote optimum
Epidemiology, through the use of its different health or provide specific protection of target
study designs, contributes substantially to the groups against disease and injury. This can be
generation of knowledge concerning the natural done on a non-personal level, where the risk of
history of diseases. This is done by identifying risk the condition is reduced by engineering out the
factors (stage of susceptibility), evaluating screen- risk (for example, banning asbestos products),
ing programmes (presymptomatic stage) and or on the personal level by prevention (for
describing consequences of disease and treat- example, providing of measles vaccination and
ment (stage of clinical disease and disability). On promotion.
the basis of this information, interventions can be
planned and promoted. Secondary prevention: At this level, measures are
aimed at people who have a disease which has not
Intervention at different stages of disease yet produced symptoms. Thus secondary preven-
Interventions must be on as many levels as pos- tion measures include early diagnosis and
sible. Each level of action is seen as a level of prompt treatment to prevent the development of
prevention. Thus there should be an attempt to clinical disease and complications from the
prevent the disease from occurring at all. Once disease. For example, secondary prevention of
the disease has occurred, it should be identified tuberculosis can take place through screening in
as early as possible and prevented from getting the workplace (for early diagnosis) and prompt
worse. If residual effects of the disease remain, treatment by means of medication.
these should be minimized, and _ disability
prevented wherever possible. The effectiveness of Tertiary prevention: At this level, measures
interventions differs according to the level tar- involve the treatment of the disease/condition in
geted. its later stages and rehabilitation to optimize
Essentially, a framework is needed for prevent- function, thus preventing/minimizing impair-
ative endeavours. A common approach to pre- ment and disability. For example, a tuberculosis
vention divides the process into four parts repre- patient may be treated after going to a doctor with
senting all potential levels of intervention, as clinical symptoms. In most cases, the patient can
applicable. be treated with medication and return to normal
function. In some cases, there may be residual
Primordial prevention: This level aims to curb respiratory problems and rehabilitation may be
the development of unhealthy lifestyle patterns needed, including occupational training in a new
among groups/populations which have not yet type of job.
developed these unhealthy patterns. Health Figure 2.3 provides the outline of the frame-
promotion is thus aimed at populations and work for levels of prevention and gives examples
groups, and attempts to deal with anticipated of each type of activity.
health risks which are not yet present. The classi-
cal example of the need for primordial prevention Conclusion
is in the cardiovascular disease area, where com- Epidemiology provides the findings which are’
munities which have low levels of cardiovascular used to plan interventions on all levels of preven-
14
Figure 2.3 Framework for levels of prevention
ONSET OF DISEASE DISEASE OUTCOME
Recovery
Healthy state py Preclinical > Clinical Chronic disease
: ae
(pre-disease) disease disease Disablement
Death
PRIMORDIAL PRIMARY PREVENTION SECONDARY PREVENTION TERTIARY PREVENTION

PREVENTION
Health Health Specific Early Prompt Treatment Rehabilitation
promotion promotion protection diagnosis treatment
target lifestyle @ health @ immunization @ screening @ arrest of prevention ¢@ retrain

factors with education @ personal @ case-finding disease orlimitation remaining
highdisease | nutrition hygiene process of disability capacities
risk personal @ environmental maximally
@ target groups development controls @ retrain
where risks education @ protection from maximal
are not yet @ housing occupational independence,
high @ recreation hazards and including
accidents employment
tion of a disease. In doing so it forms one of the lation at risk and the period of observation) (Table
pillars of public health, which aims to improve 2.1). Rates express the frequency of some charac-
the health of the population as a whole. teristic per 1 000 (or 10 000 or 100 000) persons in
the population per unit of time. Some key rates
are listed in Table 2.2. For some rates the numera-
Measures of the frequency of health
tor is not actually part of the denominator, for
events example, in the case of the infant mortality rate,
Most epidemiological studies focus on the occur- where the numerator is the number of deaths of
rence or frequency of disease or risk factors. It is children under the age of one ina given year, and
useful to know the numbers of people with the denominator is the number of live births in
certain conditions to determine the case load on the same year. (A child of three months who dies
the health services. However, to be able to in January will contribute to the numerator of that
compare the frequency of disease in different year’s IMR despite having been born in the
groups or in a group over time, one has to calcu- previous year and thus not contributing to the
late rates, since they relate the number of cases to denominator.) Strictly speaking such quanti-
the size of the population at risk in the specified ties are ratios, but the terminology ‘rate’ is
group in the specified period. If one determines firmly entrenched (see Table 2.1). A critical part
that in AreaA there are 50 cases of a disease and in of a rate is the period of observation/study
Area B 500, one cannot conclude that the disease expressed, for example, as ‘per year’ or ‘per
is a bigger problem in Area B. What if there are month’.
only 100 people in Area A compared to 10 000 in
Area B? For comparison, one has to take into Incidence and prevalence
account the size of the populations (and popula- Disease frequency can be measured by incidence
tion structure with respect to age, for example) in or prevalence.
the two areas. Incident cases are the number of new cases
reported during a specified period in a defined
Rates population (Figure 2.4). The epidemiologist can
A rate is calculated by dividing the numerator determine the number of new cases by making
(number of cases) by the denominator (the popu- use of routine surveillance systems (such as noti-
15
Table 2.1 Components of a rate: numerator and denominator
If there are two sick people in one year in a population of size 100,
the disease rate = 2 i ———— numerator

100 x 1 year —<_——____——_ denommator
= 20 per 1000 per year

(or 200 per 10 O00 or 2 000 per 100 OOO per year)
The numerator is a sub-part of the denominator. In the case of a ratio, however, the numerator is not
necessarily part of the denominator. For example, if 20 boys and 25 girls are born, the ratio of boys to
girls is 20:25 or 0,80.
Table 2.2 Key Ce
Birth rate
Number of live births during the year x 1.000
Total population
Fertility rate
Number of live births to women 15-49 years of age during the year x 1000
Number of women 15-49 years of age in the population
Death rates
Crude death rate
Total number of deaths during the year
x 1 000
Total population
Cause-specific death rate
Number of deaths due to a specific cause during the year
x 100 000
Total population
Maternal mortality rate
Number of deaths of women due to pregnancy or childbirth during year
x 10 000
Number of births (live and stillbirths) during the year
Infant mortality rate (IMR)
Number of deaths of children under 1 year of age during the year
x 1000
Number of live births during the year
Neonatal death rate
Number of deaths of children under 28 days of age during the year
x 1 000
Number of live births during the year
Perinatal mortality rate
Number of fetal deaths (28+ weeks) and deaths of children under 7 days
x 1 000
Number of fetal deaths (28+ weeks) and live births during the year
fications) but has to take into account the quality late the cumulative incidence or the incidence
of the surveillance system, for example, in terms rate. The cumulative incidence is the proportion
of completeness. Studies which follow individuals of the at-risk population who become diseased in
up for a specified time can also be conducted to a specified period. The at-risk population im this
determine the number of incident cases. case only consists of individuals who are at risk of
To calculatea rate for incidence, one can calcu- becoming new cases, that is, it excludes those
16
Table 2.3 Difference pee rates of incidence and prevalence
Incidence Prevalence
Numerator = new cases Numerator = all cases
Denominator = healthy population at risk Denominator = total population at risk to be a case

at start of time period (cumulative incidence)
OR
Sum of person-time of ppeeneian

(incidence rate)
Measured over a specified time period Measured at a point in time
who have the disease at the start of the time numerator. If person 2, on the other hand, is fol-
period. For example, to calculate the cumulative lowed for 3 months, and then develops disease,
incidence of measles in the unvaccinated, the de- this person contributes 0,25.years to the denomi-
nominator excludes all old cases and those who nator and one disease case to the numerator.
have been vaccinated. Person 3, who is followed for 6 months before
Cumulative incidence is: being lost to follow-up, being healthy at that time,
contributes 0,5 years to the denominator and
Number of new cases in a specified time period nothing to the numerator.
The number in the at-risk population at the start The incidence rate is:
of the time period
Number of new cases of a disease during a speci-
However, when cumulative incidences are calcu- fied time period
lated‘ for large populations, for example, for a Total person-time of observation
country as a whole, the denominator consists of
the total population, since excluding those who The incidence rate is thus expressed as the
are already diseased will change the resulting cu- number of cases per person-time.
mulative incidence very little. Prevalent cases, in contrast with incident
The cumulative incidence assumes that the cases, are the number of people who have a
entire population at risk at the beginning of the particular disease at a specific time (Figure 2.4).
time period is followed up for the specified period Prevalence thus focuses on whether a disease is
(for example, one year). However, some individ- present at a specific time and not on when it
uals may be lost to follow-up and thus different occurred. It is also known as point prevalence,
members of the at-risk population may have dif- since it refers to the number of cases at a point
fering lengths of follow-up. in time. Prevalent cases include cases which
The incidence rate (sometimes referred to as developed some time ago and new cases (inci-
incidence density) takes into account the length dent cases), but excludes the cases which have
of time each individual in the at-risk population died or recovered (Figure 2.5).
remained under observation. The numerator is To calculate the prevalence, the numerator is
the number of new cases and the denominator is the number of prevalent cases (that is, all cases at
the sum of the individual time periods of observa- a given time) and the denominator all at-risk
tion till disease onset or loss to follow-up, in terms people in the group investigated (that is, all indi-
of person-time (person-days, person-months or viduals at risk of being a case, thus also including
person-years). For example, if person 1 is follow- old cases) (Table 2.3). For example, to calculate
ed for the whole period of one year, without uterine cancer prevalence, the denominator
developing disease, this person contributes one would be all women, excluding those who have
year to the denominator, and nothing to the had hysterectomies.
ia
Prevalence is:
Figure 2.4 Counting incident and
prevalent cases Number of existing cases
The number in the at-risk population (at a spec-
ific point in time)
Prevalence is commonly referred to as prevalence

rate; this is incorrect, because it is not expressed
per unit of time.
The longer the duration of a disease, the higher
its prevalence. Thus, if a disease is rapidly fatal, it
may have a low prevalence in spite of a high inci-
dence. On the other hand, if treatment success
improves so that more cases survive, but with
some residual disease, prevalence would increase
even if incidence remains unchanged.
Primary prevention strategies aim to reduce
new cases of disease, that is, incidence. Secondary
DAY 1 DAY 30 DAY 60 prevention strategies aim to decrease prevalence
The day on which healthy individuals develop a disease by decreasing the severity and the duration of
is indicated by the point where the line under the case disease (thus increasing the number of cases who
number starts. The duration of the disease is indicated become cured).
by the length of the line. The number of incident cases in Table 2.3 summarizes the differences between
the period Day 1 to Day 30 is 9 cases (cases 1 to 9) and prevalence and incidence.
in the period Day 30 to Day 60 4 cases (cases 10 to 13).
If a cross-sectional study is done on Day 30 to Crude and specific rates
determine prevalence, prevalence will be 2 (cases 8 and 9). If the total number of cases in a population is
divided by the size of the population at risk, a
crude (overall) rate is calculated. So, for example,
one can determine the crude death rate in a
Featc= We BE) =e)
population. However, the epidemiologist is more
Pete eee ieemait interested in what the rates are for subgroups, for
and prevalence example, for the sexes or for different age groups.
These are called specific rates and they enable the
researcher to study the variation of the condition
of interest between subgroups of a population.
Thus one could have disease rates which are sex-,
age-, and cause-specific, for example, heart
disease death rates for men aged 55-64 years.
Comparison of rates
Epidemiologists often wish to compare disease or
death rates between groups, for example, be-
tween countries, sexes or age groups, or in the
same group over time. To be able to meaningfully
Prevalence: compare such rates, one needs to be sure that any
existing cases observed differences in rates are real and not due
to differences between the groups in the distribu-
tion of other factors. For example, what conclu-
sions can one reach if the overall mortality rate in
Group A is higher than that in Group B, but it is
known that Group A has a larger proportion of
Source: Kibel M A, Wagstaff LA, 1995. elderly people than Group B? Because death rates
increase rapidly with age, older populations have
18
higher crude mortality rates. If the mortality rate which lead to changes in the number of people
of a population has increased over the last and composition of a population are fertility,
century, is it due to a real change in mortality mortality and migration.
experience, or to changes in the age structure of Population-based rates are essential in epi-
the population? If two populations are similar demiology. While great care is often taken in
with respect to factors related to the disease, one establishing the numerator, too often in the South
could feel confident about comparing crude African context, less attention is given to obtain-
rates. If they differ, however, a comparison of ing an accurate estimate of the denominator —
crude rates may give misleading results. the population size. The primary source of demo-
Standardization (also called adjustment) is a graphic data is the census, which endeavours to
way of removing the effect of differences in the obtain basic data for the whole population. Cen-
composition of various populations and so over- suses are usually conducted once every ten years,
coming the problem of making erroneous com- but since 1980 there has been a South African
parisons based on crude rates. If, for example, age census approximately once every five years,
is related to the outcome of interest (for example, because of large-scale migratory population
heart disease), and the two populations differ movements. In using census data, it is important
with respect to age distribution, one needs to to realize that there is often an undercount, and it
adjust the rates. The standardized rates provide a must be established whether the published
new summary rate which takes into account the census figures have been adjusted or not. Census
differences between the groups. (Details are given data is often supplemented by routine surveys.
in Appendix 1.) It is, however, important to always
examine the subgroup-specific rates (for exam- Population composition
ple, age-specific rates) prior to standardization. If The composition of a population can be
the rates differ markedly between subgroups, described in terms of factors such as age, sex,
standardization will obscure these differences. race/ethnicity, occupation, income, education
Comparing the rates of two groups in a study and literacy. The population composition deter-
requires a different approach to the one discussed mines to a large extent the types of health prob-
here and is dealt with in Chapter 11: An introduc- lems experienced, and which health services have
tion to data presentation, analysis, and interpreta- to be delivered. An important measure of popula-
tion. tion composition is the dependency ratio, which
describes the relationship of the dependent part
of the population to the potentially productive
Demography part of the population. In Table 2.4, the depen-
From the previous section on measuring health dency ratios for South Africa as a whole, as well as
events, it is clear that knowledge of the size and for specified subgroups, are given. These figures
characteristics of a population or community is were calculated by Mazur (1995) from the house-
necessary to be able to interpret health informa- hold survey conducted during the second half of
tion meaningfully. The composition of the popu- 1994 by the University of Cape Town’s Southern
lation also has implications for its pattern of African Labour and Development Research Unit,
health. To plan and deliver health services effect- for the Project for Statistics on Living Standards
ively it is necessary to know what the community and Development.
looks like. For these reasons, there is a close rela- A population pyramid is a useful summary
tionship between public health, epidemiology representation of certain aspects of population
and demography. composition. It is a graphical display of the per-
Demography is the scientific description of the centage composition of a population in terms of
characteristics of populations, and embraces all age and sex. Age is usually grouped in five-year
aspects of population structure and changes intervals. The percentages are represented by
which can be measured numerically. This proportionately drawn horizontal bars. By con-
involves primarily the measurement of the size of vention, the percentages relating to the males are
a population (how many people there are), com- drawn on the left of the pyramid and the females
position (what the characteristics, such as sex and on the right. The age groups are drawn from the
age, of the people are), distribution (where the youngest at the bottom of the pyramid to the
people in the region are), and changes in num- oldest at the top.
bers of people. The main demographic processes The influence of births, deaths and migration
19
on the population composition can be seen from mothers in each age group (for example, 15-19)
the shape of the pyramid. A developing country by the number of women in that age group. In
with high birth rates and low life expectancy will Table 2.4, the percentage of teenagers (15-19)
have a high proportion of young people and thus ever pregnant is given for South Africa as a whole,
a population pyramid with a triangular shape and as well as for various subgroups (Mazur 1995).
a broad base. On the other hand, a developed This figure is an approximation of the 15-19 year
country with higher life expectancy will have a age-specific fertility rate.
higher percentage of people in the older age
groups and thus a population pyramid with a Total fertility rate:
narrower base and steeper sides. Events such as The best single summary index of fertility (if the
famines, wars, or mass migration in the recent age-specific data are available) is the total fertility
past will be reflected by irregularities in the shape rate. It is calculated by adding the age-specific
of the pyramid. Population pyramids of the rural fertility rates together over the reproductive ages.
and urban populations of South Africa are shown If five-year age intervals are used, each rate is first
in Figure 2.6 (Mazur 1995). The rural population multiplied by 5. This represents the number of
clearly has a broader base than the urban popula- children (disregarding mortality) who would be
tion. born to a group of women over the reproductive
period, as they experience current age-specific
Demographic processes fertility rates. It incorporates the age structure of
Three processes determine the changing size of a the reproductive female population, which the
population: fertility (births), mortality (deaths) general fertility rate does not. In Table 2.4, the
and migration. The balance between these three number of children ever born alive to women
determines whether a population increases, de- aged 45-49 is given (Mazur 1995). This is an
creases or remains stable in size. approximation of the total fertility rate.
The growth rate of a population is given by the
following equation: Mortality
Mortality is of great importance in demography
(births-deaths) + (immigration-emigration) as well as epidemiology. In epidemiology, it is
total population often used as a substitute (although a biased one)
for morbidity (disease), since mortality informa-
Fertility tion is collected routinely and is more easily
The determinants of fertility are complex. We will obtainable than morbidity information.
restrict ourselves to some basic measures of fertil- The crude death rate expresses the number of
ity. deaths per 1000 population. As pointed out
earlier, other more specific death rates, for
Crude birth rate: example, by cause or age, may be more useful.
The number of births per 1 000 persons in the Important death rates are listed in Table 2.2. The
population. This measure requires only total infant mortality rate (IMR) is considered to be of
births and total population. It is a crude measure, great importance for public health. It is not only
since it does not take into account the age or sex an indicator of health among the young, but the
structure of the population. WHO has used the IMR as an indicator of the
quality of the social, economic and physical
General fertility rate: environment of the whole community, as well as
The number of births per 1 000 women aged 15- the effectiveness of health services. In South
44 years. In this case, the denominator consists Africa, the IMR varies widely between population
only of the females of child-bearing age in the groups, but decreases in the IMR have occurred
population. Sometimes the age group used is because of improvements in socio-economic
15-49 years. conditions, nutrition and access to health care.
In recent years, the under-5 year mortality rates
Age-specific fertility rate: have received increasing attention. These values
If the numbers of the female population and for South Africa are indicated in Table 2.4 (Mazur
births are known by age of the mother, an age- 1995). These values were obtained using indirect
specific fertility rate can be calculated. It is methods (see Chapter 20: Mortality studies).
obtained by dividing the total number of births to A useful index of overall longevity is life
20
Sr re CCR Rec mc Cea

Tw Vine ye
Rural population
e
&}/o}+
Male Do
oO}
Female
oa ao oO
a woa4
wo
kf
Nm
| nm|<
a)o
ASI o|s/o|s
|wola|ala
oO
Percent
Urban population
Male
-8 a, -4 -2
Percent
Source: Mazur RE, 1995
expectancy. It is easily understood, as it is have a longer expectation of life than men. This
expressed in years. Life expectancy at birth is, on index is often represented separately for males
average, how many years a person just born can and females.
be expected to live for. It is a theoretical concept An increase in life expectancy can be due to
which imagines a cohort of people just born expe- people living longer, or less people dying in the
riencing the current age-specific mortality rates young age groups. In most countries, increases in
throughout their lives. Because of the differences expectation of life has been brought about mainly
in mortality rates of males and females, women by the reduction of infant mortality.
21
Table 2.4 Selected demographic indicators for South Africa
Depend- % Female Children Mortality rates life

ency ratio teenagers ever born per 1000 expectancy
pregnant* alive to at birth
mothers IMR Child Under 5
45-49
years**
SA Total 66,7 22 4,30 81 38 116 57,8
Race
African 71,8 25 5,01 86 43 125 56.5
Coloured 62,6 17 4A7
Indian 48,8 2 3,37
White 44,7 1 2,56
Residence
Rural 25 5.37 94 50 139 54,5
Urban 20 3,80 69 30 96 60,9
Metro 49,5 15 3,37 57 22 78 64,2
Education
0-6 24 4,95 95 51 140 54.2
7s 22 3,51 62 25 85 62,9
9+ 15 2,62 58 23 79 64,0
Expenditure
(monthly, per person)-
<R150 O27 27 5,91 — 89 45 130 55,7
R150-R499 63,1 © 22 4,36 79 Ee 113 58,3
R450+ 3/,9° 4 2,94 63 26 87 62,6
*15-19 year old age specific fertility rate

** an approximation of the total fertilityrate
Source: Mazur R E, 1995.
Life expectancies are usually calculated during tion for which services need to be provided is a
census years, when reliable information on popu- major public health priority. Urbanization is also
lation size is available, and reliable mortality rates important to health in that it is often associated
can be calculated. with lifestyle or environmental changes which
impact on health (Yach, Mathews, Buch 1990).
Migration and urbanization Defining a migrant is difficult, as many people do
Migration refers to human movement in a geo- not necessarily remain in the urban area they
graphical sense. Socio-economic and political move to, but shift back and forth between rural
factors lead to the migration of people. Migration and urban regions cyclically.
consists of immigration (people moving into a
specific region) and emigration (people moving Demographic transition
out ofa specific region). Populations are not static. Over time, societies
In recent years, there has been rapid urbaniza- tend to experience decreases in the birth and
tion in South Africa caused by migration to cities. mortality rates. The drop in mortality tends to
The counting of the ever-changing urban popula- occur first, followed by a drop in fertility some
ae
time later. There is thus a rise in the population in the accessibility of health services.
the intermediate period. At the beginning of such Although the value of a single indicator is lim-
a period of change, the majority of deaths are due ited, the use of a standard set of indicators
to infectious diseases, whereas at the end, the enables public health professionals to compare
majority are due to the chronic diseases of later key health outcomes between areas and over
life. This change in disease pattern is sometimes time. Such routine monitoring of the population’s
called the ‘epidemiological transition’ (Omran health status requires standard, easily collectable
1971). Chronic diseases, often referred to as ‘con- and interpretable measures.
ditions of affluence’, now contribute substantially Health indicators are different from health
to the burden of disease in developing countries, indices, which are composite measures combin-
especially among the poor. South Africa and other ing several individual indicators. An example of a
developing countries may well experience a ‘pro- newly developed health index to determine the
tracted transition’ where the mixture of chronic national burden of disease is the Disability
disease and infection could co-exist for some Adjusted Life Years (DALY). Demographic data is
time, as happened in Mexico (Soberon, Frenk, used in conjunction with information collected
Sepulveda 1986). from community studies of disablement to deter-
mine this index. It is calculated by adding the
Conclusion years lived with a disability (weighted according
Without accurate demographic information and to severity of the disability) to the years lost due to
a knowledge of how to interpret demographic premature mortality. The DALY is thus a compre-
data, an epidemiologist will not be able to evalu- hensive measure of ill health, reflecting both non-
ate the extent or patterns of health problems, or fatal outcomes and mortality, that can be used to
foresee any changes which are likely to take place quantify the extent of disease.
in these. Health indices may initially appear to provide a
more adequate assessment of the health of popu-
lations. However, not all indices are particularly
Health indicators valuable. They can never truly summarize health
in its totality, and can at best address some of its
Measuring the health of populations components. If a single composite measure is
Health is a difficult concept to define because of used, the specific influence of the contributory
its complex and multifaceted nature. For the pur- measures is often lost, and composite measures
poses of determining the health of populations, may be more open to abuse than simple
an important aspect of any definition is its measures.
measurability. Given the holistic nature of health,
no single measure would do justice to the con- Uses of health indicators
cept, although measurement of single com- Health indicators have several uses. They help to
ponents can provide useful information. The identify and quantify health problems. Since
researcher interested in single or multiple com- health status depends on economic factors,
ponents of health can choose from a variety of health indicators may provide insight beyond the
health statistics. health status of the populations. For example,
There are health measurements selected from a
larger pool of health statistics which summarize
Table 2.5 Common sources of
or represent several statistics, or serve as useful
proxy (substitute) measures for the relevant in- data for health indicators
formation. These measurements, called health
indicators, reflect health situations, and are vari- @ Vital events registers, e.g. death tapes
ables which ‘help to measure changes’, especially @ Census
when these changes cannot be measured directly. @ Routine health services records
They usually reflect only partial components of @ Epidemiological surveillance
health, for example, the infant mortality rate @ Sample surveys
focuses only on the health of children under the Disease registers
age of one year. However, the WHO does view this ¢ Alternative sources of data, e.g. church
indicator as also being an indicator of the health records
status of the community as a whole, as well as of
23
a_contextualizing epidemiology
they often reflect socio-economic development,

Table 2.6 Health indicators exposures to specific risks, health services access
sheetbe A ae) and utilization, and other factors related to health
in the population. By identifying the magnitude
of health problems, such indicators help in
1Healthpoiley Indicators
setting priorities and allocating resources. This
— ee commitment to health for all
often provides the necessary stimulus for new,
improved strategies. Indicators may also help to
evaluate national and local health activities by
being used as markers of progress (or failure).
_The usefulness of an indicator is judged by its
validity, reliability, and ability to provide
adequate information relative to the cost of
collecting information. Indicators should also be
well understood and accepted by the providers of
the services and the users of the information.
Selection of health indicators for monitoring

population health
Indicators rarely comply with all the above crite-
ria. The decision to use a health indicator
depends to a large extent on the organization or
technical and financial feasibility of data collect-
ing and analysis. Common sources of data for
health indicators are included in Table 2.5.
The WHO compiled a list which — while by no
means complete — provides an outline of the
basic types of indicators that can be used to
monitor health in the context of the ‘Health for all
by the year 2000’ programme. The different types
of health indicators include health policy indica-
tors, social and economic indicators related to
health, indicators of the provision of health care,
and health status indicators (Table 2.6). Most
health indicators in categories 3 and 4 in Table 2.6
are covered in detail in sections D and E.
The particular indicators recommended by the
WHO are the simpler and cheaper ones that
would be accessible to health departments in
developing countries. It is recommended that
each country compile its own list of indicators to
monitor the health ofits population.
24
General —
philosophical
issues in
epidemiology
Ethical aspects of epidemiological The increased awareness of the importance of

research ethical considerations in research followed the
experimentation on prisoners by Nazi doctors
Introduction during the Second World War. In this case help-
The purpose of this section of the chapter is to less people were forcibly exposed to medical
make ethical reasoning and its application to re- experiments of dubious scientific value, the
search more accessible to the readers of this book. results of which would not have been made avail-
This is not a comprehensive treatise on research able to them in any case. The post-war Nurem-
ethics, but rather an introduction to the nature berg trials (in which Nazi war criminals were
and processes of ethical reasoning in research. tried) highlighted these abuses and pronounced
Specifically, the aims are: : that no experimentation could be done on a
- @ to make ethical considerations accessible to all person without their specific permission. The
who conduct, read, or commission research, Nuremberg Code of 1947, and subsequently the
specifically epidemiological research Declaration of Helsinki adopted by the World
@ to highlight the major ethical issues surround- Medical Association in 1964, made voluntary
ing the proper conduct of research informed consent a central requirement of ethic-
to indicate some common controversies in ally conducted research. In spite of this, the prac-
research ethics, and to suggest ways of dealing tice of obtaining informed consent or refusal from
with them. research subjects/participants did not become
routine until well into the 1970s in most Western
Ethics is the branch of philosophy dealing with countries.
morality. That is, ethics attempts to distinguish Since then, the ethics of medical and health
between what is right and what is wrong in order research have been the subject of intense debate
to give people guidance on how to behave. Over and scrutiny by governmental agencies, research
time, most of the the ethical aspects of our beha- institutions, the public at large, and — especially
viour have been clarified, and most of us know in the United States — the courts. Because of this,
what is good and what is bad. However, if a new there is now a much deeper understanding of
situation arises, it is not always clear what action ethical and moral values related to research, and a
is ethically good and what is not. In research, far greater awareness of these issues among the
because we almost always deal with new situa- general public (the potential research subjects/
tions, ethical reasoning is prominent. The appli- participants). A series of general and institution-
cation of ethics to medical and health practice specific reports have been drawn up to outline to
and research is called bio-ethics. This chapter is prospective researchers what requirements need
about bio-ethics, but for the sake of simplicity, to be met for their research to be considered
the words ‘ethics’ and ‘ethical reasoning’ rather ethical, and what procedures research proposals
than ‘bio-ethics’ and ‘bio-ethical reasoning’ are must undergo to ensure that future research is
used throughout. ethically sound. One of the most important of
25
these is an independent ethical review. positive responses before the drugs are tested on
To understand the need for an independent large groups of patients — be short-circuited so
ethical review related to research, it is important that AIDS sufferers do not have to wait for poten-
to understand some of the major motives behind tially beneficial drugs until the lengthy clinical
research. To illustrate how these motives may trial process has been completed.
operate in preparing or conducting research,
three examples are given of cases where well- These three examples provide a good insight into
intentioned research was done in an ethically un- the context of research ethics. In the first two
acceptable way. examples, the main function of ethical review
would have been to protect research subjects
EXAMPLE A: from being exposed to risks related to research. In
In a now classic article on bio-medical ethics, example A, risky procedures were being used
Beecher reviewed 22 studies in the New England without subjects’ permission. In example B,
Journal of Medicine in 1966 which in his view en- denial of existing treatment in the first instance,
dangered the health or the life of the study sub- and denial of more effective treatment when it
jects without informing them of the risks or became available during the course of the study,
obtaining their permission. He concluded that posed risks to the study subjects. In example C,
‘unethical or questionably ethical procedures’ the usual ethical requirements that are put in
were not uncommon among researchers. He place to protect people from research risks have
further concluded that the motivations for such come under pressure because of perceived bene-
procedures could be found in pressures on young fits of this research. Demands for the reform of
researchers for personal advancement (Beecher the ethical review process during the 1980s
1966). His article stimulated increased regulation included the suggestion that not only should
of research, including the establishment of com- research subjects be protected from research
pulsory review procedures by institutional ethics risks, but the potential benefits of research should
committees. also be considered in deciding on the ethics of a
proposal. Several trials were halted midway
EXAMPLE B: because the benefits of a new treatment had been
In a similarly classic case, researchers from the shown beyond doubt. This was done so that the
United States Public Health Service conducted an benefit could be applied widely. Protection from
experiment involving over 400 black male resi- research risks is therefore only one reason for
dents of Tuskegee County in the southern state of ethical review. The second major reason is to
Alabama. Between 1932 and 1972, when the study ensure the fair distribution of potential benefits.
was terminated, these men were observed for the Although we focus below on more detailed
effects of untreated syphilis. This study was ini- issues in ethical aspects of research, it is these two
tiated at a time when syphilis treatment was rela- facets that are the real focus of ethical review: the
tively ineffective. Nevertheless, while the men protection from risks related to research and the
were told that they were being ‘tested’, many equitable distribution of potential benefits of
thought this was for ‘rheumatism’ or ‘bad research.
stomach’. They were not told that syphilis treat-
ment was being withheld. Even after penicillin Ethics and ethical reasoning
became available during the 1940s, and was Acomprehensive description of ethics and ethical
found to be effective against syphilis, the men reasoning is given by Beauchamp in his book
were not given this information. They remained Principles of Biomedical Ethics (1989). Here,
under observation without receiving penicillin ‘to we will provide only a brief insight into the area
allow the important scientific process to conti- of ethical reasoning to enable prospective
nue’ (Jones 1993). researchers to understand the processes that
should be followed by review committees to
EXAMPLE C: which research protocols are submitted.
AIDS activists in the United States are demanding Most people have a good grasp of the moral
that the normal route of Phase I, II and III clinical values in society. There is no difficulty, for exam-
trials — which involve animal experimentation ple, in making judgements about corruption,
followed by testing of drugs on small groups of deception, killing, telling the truth, and ensuring
healthy human volunteers to test for toxicity and that research results are made available to the
26
3 general philosophical issues in epidemiology
public. But there are two instances when the use and constructively, they can contribute towards
of ‘common sense morality’ is not sufficient. the development of more stable moral values that
Firstly, new challenges are being provided by are accepted by large sections of society.
each new research protocol, by new therapies, by Ethical reasoning takes place whenever there
new interventions, or by the use of new techno- is a need to provide a moral justification for
logies. Although existing morality can help guide a particular action, for example a particular
us towards appropriate action in known situa- research project or procedure. Such actions may
tions, the new situation may not fit existing moral be justified on the basis of ethical rules, which, in
rules or behaviour. In the recent context, for turn, are based on ethical principles derived from
example, there has been an intense debate in an ethical theory. Telling the truth is a generally
South Africa on whether or not abortion should accepted moral value, and lying, therefore, is
be legalized. We have the technology and medical morally wrong. A researcher who decides to
skills to ensure that abortion can be done in a way deceive research subjects must therefore justify
that guarantees the health of the pregnant his or her decision to engage in an immoral
woman. Given this background, there is a moral action. The researcher may argue that obtaining
dilemma. The dilemma is that there are solid the truth is an accepted moral rule, and that
moral reasons why abortion should be legalized deception must take place, otherwise he or she
(such as a doctor’s duty to optimize the health of cannot obtain the truth. (For example, if we want
the pregnant woman, or society’s duty to uphold to research the quality of care given by practition-
the right of a pregnant women to self-determina- ers to patients or clients, telling the practitioners
tion) while, at the same time, there are equally what is being studied may change the way they
solid reasons why abortion should be prohibited deliver care. We will therefore not be able to find
(such as society’s duty to uphold the right to life of out the truth.) This rule of obtaining the truth can
unborn babies, or society’s duty to protect those be based on one or more ethical principles such
who cannot protect themselves). We have to go as the principle of beneficence (interventions or
beyond ‘common sense’ moral reasoning, and actions, including research, must, on balance, do
to develop a comprehensive understanding of more good than harm). This principle, in turn, is
morality, ethics and ethical reasoning, in order to derived from a more complex body of principles
solve these difficult moral problems. and rules that make up ethical theory (Figure 3.1).
A second series of challenges to ethical reason- The two most prominent types of ethical theory
ing is‘ provided by the differences in morality are the utilitarian and the deontological theories.
between cultures, religions, and ideologies. These In utilitarian theory, the most important under-
challenges are compounded by the absence of a lying value for judging whether an action is ethic-
common basis for making moral judgements. ally right or wrong is the amount of ‘good’ it
Instead, we are slowly developing our own set of produces. Thus this approach prioritizes the
moral values and virtues in a process that is slow ‘greatest benefit to the greatest number of people’
and painful. For example, the decision by the as the main criterion for judging an action.
Constitutional Court to abolish capital punish- Deontological theories hold that actions are to be
ment is a first attempt to emphasize one particu- judged on the basis of certain values, irrespective
lar moral value, namely the right of persons to life. of the consequences for the ‘general good’ they
Yet this decision has elicited extremely emotional produce. All religious moral reasoning falls in this
reactions, especially from those who feel that group, as it generally holds that certain actions
another moral value, namely the right to a safe are right (or wrong), not because of the results
and crime-free environment, should have a they produce, but because the religion’s manifest
higher priority than the right to life. The role of (such as the Bible, Torah, or Koran) states that
the Constitutional Court in this case is similar to such actions are required and, therefore, good.
that of an ethical review committee in the context
of research: to attempt to develop a shared basis Ethical principles in medical and health-
of ethical arguments that can be applied to the related research
individual research proposal and to similar pro- In spite of these differences, there is often agree-
posals in the future. Given the multi-cultural ment between different theories on the moral
heritage of our country, it is likely that there will value of particular actions. These agreements
often be disagreement between reviewers. How- have been reached through ethical review, court
ever, if these differences are discussed publicly judgements, popular debates, and in many other
27
ways. As a consequence, ethical review of medical providing information that allows individuals to
and health-related research has distilled three make autonomous decisions.
main ethical principles against which almost any Secondly, the.principle of respect for persons
proposal can be assessed from an ethical perspec- calls for special measures to be taken to maximize
tive. These principles have provided a very valu- the autonomy of and protection for incompetent
able tool to assist in the ethical review of medical persons. These include people with an intellec-
practice and research. Theyare outlined in Figure tual disability (permanent incompetency) or
3.2 and discussed below. persons under the influence of alcohol or pre-
medication for surgery, or in a state of emotional
The principle of respect for persons shock (temporal incompetency), and persons
Respect for persons has two major components. with reduced autonomy (such as children, pris-
The first one, often called the principle of oners or refugees).
autonomy, relates to the notion that a competent The principle of autonomy is the main principle
individual has the right to self-determination. underlying the process of obtaining informed
Other people, including health and medical prac- consent for medical procedures or for research. To
titioners and researchers, must respect this right allow an autonomous person to make an auton-
to self-determination and have an obligation to omous choice about whether or not to participate
enable the autonomy of persons by, for example, in research, five conditions have to be met.
Figure 3.1 Process ofethical reer y4
General approach Example: Practical use of approach
Ethical theory Utilitarian approach

it
Ethical principles Principle of beneficence
Ethical rules Obtain ietruth
fed fson how to act with moral justification Refrain sa disclosing to health workers why the
research is being done
Figure 3.2 Application of ethical principles to health research
Ethical principles
Respect for persons Beneficence Justice
Autonomy Maximization of Non-malevolence Beneficence Right of person Duty of other

autonomy and to demand certain person to provide
protection of things these
incompetent persons
@ assessment of competence minimize negative effects on @ benefits of the research must

to make decisions individuals/groups being be made available to persons/
disclosure of information studied populations being studied
@ understanding of information @ research subjects should benefit @ all who stand to benefit from
@ voluntary decision-making ¢ consultation if conflict concerning research should contribute to
specific authorization when ‘benefit’ risks or discomfort endured
participating make benefits of research
widely available
28
1 The person must be competent to make deci- ensure not only disclosure, but also understand-
sions ing of what is being disclosed.
As outlined above, the competence to make deci- 4 Any decision, whether consent or refusal, needs
sion may be impaired both permanently or tem- to be voluntary
porarily. In general, any adult is regarded as com-
petent unless there is evidence to the contrary. In In other words, there should be no undue influ- -
addition, although children are often declared ence on the prospective participant to consent (or
‘legally competent’ at a certain age (18 in South refuse) to participate. This includes the duty of
Africa), it is obvious that between childhood and researchers not to coerce or manipulate subjects.
adulthood, the child becomes progressively more For example, prospective study subjects taken
competent. This means that there are certain from a health care setting must be told specific-
decisions about research participation or medical ally that refusal to participate in research will not
interventions that a child can make before being lead to a reduction in the health care they will
legally competent. Researchers are therefore receive. If incentives to participate are being con-
encouraged to elicit consent (or refusal) from templated, it would constitute an ‘undue incen-
children who are old enough to be able to make tive’ to offer an unemployed or poor person a
decisions about participation, in addition to significant cash benefit for participation. Simi-
obtaining consent from their parents. larly, research on persons with restricted auton-
omy, for example, prisoners or students (if the
2 There has to be disclosure of information researcher is also their professor), needs to make
special provisions to ensure that their decisions
An autonomous person can only make an auton- are not influenced by coercion. Lastly, manipu-
omous decision if all the relevant facts needed to lating people to participate by over-emphasizing
make a decision are provided. The researcher is, the potential social benefits of the research would
therefore, under an obligation to inform prospec- also infringe on the voluntariness of any decision
tive study participants about: . taken.
what the study is for
@ what will be done to or with the participant 5 Aspecific authorization needs to be given by
¢ what potential risks or discomforts the partici- the participant
pant faces.
After disclosure has taken place, the researcher
The disclosure requirement also explicitly places has an obligation to ask for explicit consent or
researchers under the obligation to inform all refusal to participate, whether written or oral.
study participants of innovations in medicine
that could make participants change their mind The principle of beneficence
about participation. This requirement was In general, this principle requires that the actions
ignored in the Tuskegee study. of health and medical practitioners or researchers
be directed at improving the well-being of
3 There has to be understanding of the patients or study participants, and they should
information not cause harm to the patient or participant.
While this may sound obvious, there are several
If the information being provided in the process important issues in research that violate or may
of full disclosure is phrased in a language or in a potentially violate this principle.
manner that does not allow the person to under- This principle is often divided into two com-
stand the information, then there can be no in- ponents: firstly, a principle of non-malevolence
formed consent. This is especially important for _ (the obligation not to do harm), and secondly, a
researchers in a cross-cultural situation, as is principle of beneficence (the obligation to pre-
often the case in South Africa. Researchers often vent harm, to remove harm, and to do good). For
come from a different social class and cultural purposes of this text, both are considered under
background to study participants. They also come one ethical principle of beneficence.
from a specific discipline with its own profes- Firstly, even a non-invasive questionnaire or
sional jargon. Special care needs to be taken to interview may have detrimental consequences
29
for study subjects. Participants offer time which in less dramatic areas (for example, effectiveness
may result in other priorities not being met (for of a breast-feeding education campaign).
example, interviewing rural women may reduce Lastly, research risks in case of research on
the time they have for water or wood collection or minors (‘legally incompetent persons’) should
child care). Confidentiality of answers may be always be minimal. This is based on the principle
broken, and the participant may suffer as a conse- that only competent adults can make an in-
quence (for example, an opinion about legaliz- formed decision to subject themselves to severe
ation of prostitution may be counted against a risk, and that no one has the right to make such a
church member). Results of a study may detri- decision on behalf of others.
mentally affect the group to which the particip-
ants belong (for example, the initial findings that The principle of justice
HIV was predominantly transmitted through The notion of justice means that people receive
homosexual contact reinforced public stigmati- what is due to them. It includes, therefore, both
zation of homosexual behaviour. Later recogni- the right of the person to demand certain goods,
tion of heterosexual transmission has to some and the duty of others, such as researchers, to
extent countered this). provide these. This principle is extremely com-
Secondly, research may be done for the benefit plex in that it has an application in almost every
of the researcher, not for the benefit of the partici- field of life. In the context of research ethics, this
pants or for society in general. Typically, many principle is probably the least well developed of
students are required to conduct a research the three principles discussed.
project for degree purposes, while in other set- In research, the concept of justice is usually
tings, research is required for promotion. In such interpreted to mean afair distribution of benefits
cases, it is not always clear how the study partici- of the research (‘distributive justice’). For exam-
pants will benefit. ple, although most AIDS/HIV vaccine research
Thirdly, there is often a tension between the being prepared will be conducted in the develop-
researcher’s opinion of what is good for partici- ing world, developing countries (and the popula-
pants, and the participants’ opinion of what is tions originally researched) may not be able to
good for themselves. This is typically the case in afford these very vaccines and will thus not get
situations where foreign researchers come to the benefit of the research.
Africa to conduct studies, on HIV/AIDS for exam- The principle of justice also leads to the inter-
ple. Whereas African priorities, at least in the short pretation that all who stand to benefit from the
term, are usually to develop appropriate preven- research should contribute to its risks and dis-
tive education programmes or to investigate comforts. In other words, research that may
optimal home-care mechanisms, foreign research benefit both the rich and the poor should be con-
often focuses on the biological properties of HIV ducted among both groups, not only the poor.
infection. In such a case, extensive consultation Similarly, although medical students are an easy
with the participants or their representatives is target group for medical researchers, they should
essential to develop a study that will satisfy the not always be the study subjects for research that
needs of both participants and researchers. may have applications for a wider audience. The
Research always demands some inputs, costs, same applies to communities that tend to be-
or discomfort from study participants. For come ‘over-surveyed’ because they happen to be
example, in the case of more invasive research, co-operative, or because they have a well-orga-
such as taking blood specimens, discomfort and nized clinic.
possibly more serious side effects are expected. There are now demands that research be dis-
The ‘do no harm’ interpretation of the benefi- tributed more fairly to make the potential benefits
cence principle must therefore be restated as the more widely available. For example, in the pro-
notion that, on balance, the research must do cess of setting up research, both facilities and
more good than harm. This leads to another human resource development need to be pro-
important notion, namely that the research risks vided. This provides a direct benefit to the group
must be commensurate with the expected bene- being researched. Similarly, in many areas of the
fits. In other words, in the case of research into a world, the only opportunity poor people may
life-threatening condition (for example, cancer have to obtain good quality medical care is if they
treatment) the research risks may be more severe participate in a research project. In South Africa,
than in case of research that pursues knowledge for example, anti-retro-viral medication against
30
HIV infection is not provided through the public obtained from the prospective study participant.
health services because of its costs. Conse- In the case of minors, the legal parents or carers
quently, only the rich and those in clinical trials are allowed to consent on their behalf. For
testing these drugs will have access to them. research that carries no risk or only minimal risk,
this is usually adequate. Where research is con-
Some specific ethical issues and controversies templated in which a minor may be exposed to
in epidemiological research more than minimal risk, a debate arises as to
The substance and components of ‘informed whether or not proxy consent is acceptable.
consent’ have been discussed under the principle Those who find it unacceptable believe that
of respect for persons. There are, however, some research with more than minimal risks on child-
areas that need to be dealt with in more detail. ren can never be ethical. Because of this extreme
conclusion, many believe that the appropriate
Communal versus individual consent way to approach this dilemma is to ask whether
Informed consent requirements usually refer to the child would consent if he or she were a
obtaining consent from the individuals who are ‘reasonable adult’.
prospective research participants. However, in Similar issues arise in the case of research to be
South Africa and elsewhere, when conducting epi- conducted on temporarily or permanently in-
demiological research, researchers often ask for competent persons, such as people with an intel-
consent from traditional or other local leaders to lectual disability, aged people with neuro-cere-
research ‘their’ communities. This is a difficult bral diseases, or unconscious persons.
issue. Insofar as some cultures are more commu- A special situation arises when an otherwise
nally than individually oriented, it seems appro- competent adult refers the researcher to a third
priate. Yet it clearly reduces the principle of auto- person. This may happen, for example, in the
nomy. If a communal leader gives permission, the case of a married woman who states that ‘her
researcher continues and obtains consent from husband or her children must decide’.
individuals. However, if the leader refuses for Typically, the ‘Western’ approach to this situation
whatever positive or negative reasons, the indi- is to demand first-person informed consent in
viduals concerned can no longer be asked whether spite of the expressed wish. A more culturally
or not they wish to participate. In this case, poten- sensitive way might be to consider that the
tial benefits of research may not accrue to them prospective study participant has exercised the
because of someone else’s decision. right to self-determination by making the
This presents a dilemma, as obtaining consent decision that a ‘significant other person’ should
from leaders appears to violate the right to self- decide.
determination of individuals in the community.
On the other hand, studying a community or Partial disclosure
group without the consent of those in authority The requirement for full disclosure as part of the
may either put participants at risk, or make it im- informed consent process argues against delib-
possible to undertake the research. The imple- erate omission of relevant information and de-
mentation of later health interventions may also ception. Yet there are situations in which full dis-
be compromised. closure will influence the subject of study. For
Where research or interventions deal with example, in a study of the attitudes of health
communities rather than with individuals, it is workers towards HIV-positive patients, the
advisable to seek consent from the ‘community’ results of a questionnaire survey would probably
rather than the individual. In such cases, it makes differ widely according to how the aim of the
sense to ask for consent from legitimate commu- study was explained. Rather than fully explaining
nity leaders. (Examples of such situations could the aim of the study before asking the questions,
include studies on the fluoridation of drinking the participants could be asked to take part in a
water, mass education campaigns on HIV/ general questionnaire on hospital or health
AIDS prevention, or on the influence of infra- matters, and subsequently be asked attitude
structural projects.) questions interspersed with questions on
different matters. In such a case, omission or
Proxy consent as opposed to ‘first person’ apparent deception may be the only way of
informed consent obtaining a true insight, and may therefore be
In general, informed consent refers to consent considered ethical.
3
Written versus oral consent research requires a long relationship between the
The emphasis on written consent, sometimes wit- participants and the researchers, it is more
nessed written consent, stems from the need of appropriate to consider informed consent as an
health workers to protect themselves from the ongoing process. In longitudinal studies and
possibility of future court proceedings instituted long-term trials, for example, it would be wise to
by participants who may claim not to have been continue the process of information provision
adequately informed. A second motivation is that and obtaining consent. The case of Tuskegee is a
a clearly presented consent form will assist the clear example of the need to consider informed
researcher in ensuring that all relevant topics consent as a process rather than as an occasion.
have been covered. A third reason for emphasiz-
ing written consent is that the process of signing Confidentiality and the use of databases or
the consent form will help the participant to medical records
realize that he or she is now aresearch partici- Study participants can usually expect the results
pant. In other words, placing their signature on obtained from their participation to remain
the form brings home the reality of the decision confidential. Efforts to ensure this are essential,
made by the participants. such as not obtaining personal information
The first reason, protection of health workers, is unless it is absolutely necessary, and deleting per-
only a relative one, in that no court will accept a sonal information as soon as there is no longer a
consent form as proof of adequate disclosure. The need for it. However, in studies where large
second motivation to use written consent is not a numbers of participant characteristics are
real motivation, in that one can have a clearly pre- collected, it may be possible to identify persons
pared guideline on what to say to participants even after removal of personal information. For
without the need for a signature. The third moti- example, in a study of a village in a rural area,
vation may be the most important reason for there will be very few male teachers of 35 years old
obtaining written informed consent. Yet it is this who are member of the church choir. In such
very motivation which often causes people such cases, additional measures must be taken to
as illegal immigrants, refugees, sex workers and ensure that individuals cannot be identified.
those with any fear of authority to refuse to Patients entering hospitals or clinics may rea-
participate. This is not because they do not wish sonably expect their records to remain confiden-
to participate in the research project, but because tial. Medical researchers should ensure this. Simi-
they fear they will be harmed by unintended con- larly, use of other databases must guarantee con-
sequences of having their names listed. These fidentiality of data. In South Africa, the study of
fears may be justified, if research records should medical records and other databases is allowed,
be made public by negligence, break of confiden- provided adequate steps are taken to keep data
tiality, or court orders. In cultures where signing confidential. In some European countries,
is considered superfluous once someone has however, use of records and databases is only
given his or her word, insisting on a signature allowed after obtaining written informed consent
could offend participants. from the persons involved.
In general, ethics review committees in South
Africa demand written informed consent even in Use of specimens for other purposes
the above cases, and even if the population There are many laboratories that store speci-
studied is illiterate. This seems irrational, and mens, especially blood specimens, that have
may be based on a routine application of North served their purpose. Originally, consent would
American procedures rather than on well-consid- have been obtained from the research partici-
ered ethical reasoning. Should a researcher there- pants to take the specimen for a specific purpose.
fore not wish to obtain written consent, it is wise Once that has been done, can other research be
to provide a full motivation, including ethical undertaken on these specimens without obtain-
arguments where possible, to the reviewers. ing further consent? It appears that current think-
ing considers this practice to be acceptable, pro-
Informed consent: an occasion versus a process vided the additional research has been submitted
The procedure of ‘getting informed consent’ is once more to an ethical review committee, and
often seen as a one-off action. This may be appro- provided that the results remain unlinked to the
priate if the research is of a one-off nature, such as individuals concerned. Nevertheless, this is not
a cross-sectional survey. However, when the an easy issue. Even if precautions are taken
32
regarding confidentiality, the research may have but direct feedback to individuals and communi-
negative results for groups of people. ties, or publication through the media, or to rele-
vant government departments, is an essential
Quality of research part of an ethical study proposal.
Low quality research is unethical in that it
exposes participants to the risks or costs of Ethical review of research
research without the chance of future benefit. In the United States, all academic and research
Similarly, simple duplication of research done institutions are required by law to have ethical
elsewhere (for example, to obtain a degree) may review committees. These are called Institutional
be unethical because it will not lead to any new Review Boards, or IRBs, whose members include
knowledge or action, even if the research was health professionals, ethicists, and ‘lay’ commu-
conducted perfectly. However, conducting a nity members. No such legislation exists in South
research project that was done elsewhere because Aftica, but probably all academic and research
there may be sincere doubt as to whether or not it institutions, and even some of the large service-
would work in the local situation can be defended rendering health institutions, do have a commit-
on the basis that it might lead to new knowledge tee charged with the ethical review of research
or action, and therefore costs or risks on the part proposals. Most committees do not, however,
of the participants can be justified. It is good to include ethicists or community members.
distinguish between these two types of repeating Researchers working in institutions without a
research by naming the first ‘duplication’ and the specifically appointed ethics review committee,
second ‘replication’. Duplication of research is or practitioners working on their own who want
not justifiable, and therefore not ethical, whereas to carry out research, can send their proposals to
replication is justifiable, and therefore ethically the review committees of nearby institutions. In
acceptable. addition, the Medical Research Council and the
Medical Association of South Africa provide
Conflicts of interest ethical review services in such cases.
As elsewhere in life, conflicts of interest may From the previous sections of this chapter, it is
occur in research. Often this takes the form of immediately apparent that there are many vague
sponsorship of research by a company that has an areas in the field of bio-ethics. For example, how
interest in a particular outcome of the study, such can we balance research risks and benefits? Who
as in ‘drug trials. It is important to ensure absolute balances them? What is ‘minimal risk’? When is a
independence between researcher and sponsor- person temporarily or permanently incompetent?
ship for the study to be considered ethically At what age can children be expected to make
acceptable. A not-so-obvious conflict of interest decisions concerning non-invasive research?
occurs when research is done for degree and pro- What constitutes full disclosure? When is decep-
motional purposes. The double motives in such tion appropriate?
research may at times be at odds with each other. The purpose of ethics review committees is to
Whereas the research should, for example, be of apply ethical theory and ethical principles to
benefit to those participating, pressures from particular (research) situations and proposals,
promoters or deadlines may force the research and to assist in the development of ethical
process to be modified to serve the student’s aims research. It should be clear that many situations
first and those of the participants second or not at preclude easy answers, and that deliberations
all. should often be extensive. For purposes of trans-
parency and of empowering the researchers (and
Obligation to feedback and publish communities), it would be beneficial if ethics
A last point, often forgotten in the rush to produce review committees would open their delibera-
research reports towards a degree, is that degree tions to at least the researchers, yet this is often
manuscripts tend to gather dust in libraries with- not done. As a researcher, however, one should
out their contents becoming known. Most ethics insist on a full report of the ethical reasoning that
review committees will nowadays insist on an was applied if the research proposal was rejected,
acceptable plan for the dissemination of informa- or substantial modifications were suggested. For-
tion gained from research before approving tunately, most ethics review committees will
research proposals. Publication in relevant peer- enter into debate with prospective researchers in
reviewed journals is an obvious way of doing this, such instances.
33
Conclusion tries, where communities were simply passive

It should be clear from-this section that it is not recipients of assistance. More importantly, it re-
always easy to come up with the ‘correct answers’ flected a commitment to involve previously mar-
to ethical dilemmas. The ethical dilemmas out- ginalized communities in the social, political and
lined here attest to this fact. To ensure ethical economic processes that affected them.
conduct of research, it is essential to use ethical Community participation plays an important
guidelines (Medical Research Council 1993; role in epidemiology, since many epidemiological
CIOMS 1991) and to speak to colleagues, mem- studies take place in the community or involve
bers of groups among whom the research will be the evaluation of community health projects. It is
conducted, and ethical review committees. Even hoped that by involving non-researchers in the
then, the research proposal may still be rejected. research process, they will learn skills and gain
The only way open is to then either accept this confidence. The community will be more likely to
judgement, or to elicit the opinion of another, accept the project and it will be easier to imple-
credible, ethics review committee. ment the results of the study.
Note that there is no accreditation mechanism
for ethics review committees in South Africa. The community
Although most research organizations and aca- The ‘community’ is usually referred to in geo-
demic institutions, and even some health service graphic terms as a group of people living in the
departments, have committees that deal with the same defined area, sharing the same basic organi-
ethical review of research proposals, the compo- zation. A community can also be thought of as a
sition, expertise, continuous education, and group of people sharing the same basic interests,
reporting requirements of such committees has with membership changing as interests change.
not been regulated by law or otherwise. This is Target populations or risk groups may also be
unlike the situation in the United States, where considered as communities, for example the
Institutional Review Boards are established in community of people with disabilities ina village.
terms of legislation and must comply with regula-
tions concerning important characteristics such Community participation
as membership, including non-professionals and Community participation is a social process
community members, composition in terms of whereby specific groups are actively involved in
disciplines (such as ethicists, philosophers, law- decision-making and the implementation of
yers), and reports to be produced. It is hoped that development, health or research projects, as well as
such regulations will be forthcoming in South sharing in the benefits of the project/ programme.
Africa in the near future. Until then, it is not wise The concept of ‘empowerment’ has become
to ‘shop around’ until you find an ethics commit- central to the practice of community participa-
tee that will approve your proposal. Rather stick tion in projects. The term refers to a liberating
with a review committee from one of the larger grassroots process of the acquisition of power by
research or academic institutions. oppressed or marginalized people. Participants
Whatever the choice, it is not ethical to go are favourably influenced by their involvement in
ahead with the research without credible ethical the programme/activity. They increasingly take
approval. The system may be slow and sometimes responsibility, initiative and decisions, thereby
incorrect, but it is the best system available to increasing self-reliance and self-determination
protect society against researchers with divine on an individual, project and community level.
aspirations!
Application of community participation
Community participation It is useful to differentiate between community
participation in projects or programmes (which
Introduction focus on an ongoing community activity or
During the last three decades there has been service), and community participation in re-
increasing emphasis on the concept and practice search, also referred to as participatory research
of ‘participation by the people’ in a wide variety of (which involves an investigation or evaluation).
activities related to health, education, socio-
economic development and research. This Community participation in programmes
change in emphasis was a result of the poor There is no single view on the role of the commu-
success of projects, mainly in developing coun- nity in educational, social service and health pro-
34
Table 3.1 Common approaches to community participation in health

projects
Approach Initiation Role of Role of Strategies to Orientation

professional lay-people achieve goals to power
structures
Public Existing Full responsib- None, passive Environmental Completely

Health services ility for design recipients infrastructural part of
and execution and mass
of programme programmes to
promote health
Health Professionals, Initiate and Used as addi- Projects which Work closely
planning academics control tional manpower include members with power
and to gain of the community structures
public support in order to be
for projects successful
Community Community- Limited role: Dominant, self- Process goals Question the
development elected worker part of group help, voluntary very important commitment of
facilitates co-operation involvement in power struc-
initiation by and develop- economic, health tures to real
community ment of indi- and social issues progress
genous leader-
ship
Activists and Dominant and Mobilization of Demand redis-

disadvantaged serving the disadvantaged tribution
groups interests of the sectors. Demands of power,
disadvantaged for redistribution resources and
of power decision-
making
grammes. Approaches to participation differ Participatory research

regarding the process of initiating the activities, Participatory research is a co-operative enquiry
the role of professionals and lay people in such between researchers and the community being
activities, the strategies used to achieve project * researched, aimed at the identification and crit-
goals and the orientation towards the established ical analysis of the community’s problems and
power structures. needs, and the search for solutions to these prob-
Four common approaches to community lems (Le Boterf 1994). Participatory research
involvement in community programmes are essentially consists of three interrelated and
contrasted in Table 3.1 (see also Rothman 1987; traditionally distinct activities: research, educa-
Rifkin 1981). It is important to choose the appro- tion and collective action (usually for develop-
priate approach for a particular community and ment or social transformation). The combination
project. A flexible approach should be used. Com- of the creation of knowledge about a commu-
munity participation is not an all-or-nothing ap- nity’s problems and needs with collective, con-
proach, so failure to achieve complete participa- crete action to change and improve that commu-
tion in all aspects of the project does not mean nity’s situation is perhaps the most unique aspect
that meaningful participation did not occur at of participatory research (De Koning 1994).
some levels, involving some people. In participatory research ‘the community’
35
a
refers in general to potential users, beneficiaries motivations for participatory research could vary
or subjects of research. The varying motivations from obtaining more reliable data, to encourag-
for participatory research will determine whether ing people to cooperate in a programme, to giving
‘the community’ refers to the most oppressed and people more control over facilities (Lammerink
disadvantaged in society, the clients of a health 1994).
service, or the health workers in a health project The action component of participatory re-
or service. search could, for example, have as its goal the em-
The concept of participatory research evolved powerment of the community concerned (a
in the early 1970s as an alternative social science process), or the more efficient service delivery for
research method which began to challenge the that community (a product).
notions of neutrality and objectivity on which The extent of community participation in
social science research methodology was based. research can vary from involvement in every step
Various important historical trends contributed through to the other extreme, where researchers
to the development of participatory research use community participation to facilitate data
(Tandon 1994). One of these trends came from collection and the implementation of their own
the practice of adult educators in Third World results. In any one research project there are a
countries, who developed an approach to educa- range of activities in which a variety of partici-
tion in which the learner began to control his or pants can be involved. This has been termed the
her own learning process. These adult educators participatory continuum (Maquire 1994).
then began to develop a similar approach to Many epidemiological studies are carried out
research, in which the researched individuals or with very little involvement from the commu-
community participate in the research process nities or health workers that are the subjects of
and have some control over it. Another influence research. Their involvement seldom extends
was the work of Paulo Freire and Ivan Illich, who beyond being interviewed, answering question-
placed participatory research as an educational naires, occasionally being recruited as field-
process within the framework of an ‘alternative workers, or utilizing some of the results. In con-
pedagogy’ or popular education. During the mid- trast, a participatory research approach could
1970s, with the failures of top-down, expert- mean that:
designed development projects and programmes, the research question and the research priori-
the debate on development began to view the ties are based on the needs of the community as
question of participation as a critical variable in seen and formulated by the community itself;
human development (Tandon 1994). This further the methods of data collection are designed
influenced the evolution of participatory re- with community members and are under-
search. standable and acceptable to the community;
The main aim of participatory research is to @ the data collection is undertaken by commu-
provide a forum for the voiceless and powerless to nity members;
be creative actors and subjects, not merely @ there is community participation in the ana-
‘objects’ in the process of research and know- lysis;
ledge creation. The intention is to provide ways @ the results are distributed in a way that makes it
for ordinary and oppressed people to produce the possible for people from outside the scientific
information they require to understand and to profession to participate in the discussion.
deal with problems in their lives and larger insti-
tutional structures. Full participation of all the people in the com-
However, motivations for participatory re- munity, or health workers in a project, in all
search will vary, in ways similar to those aspects of the research may be clumsy, cumber-
described for community participation in pro- some and not always appropriate. The extent of
grammes in Table 3.1. Where community partici- participation can be decided on by the commu-
pation is seen as a process in which disadvan- nity and may depend on the project’s aims and
taged people work together to overcome prob- resources, and whether there is time for all parti-
lems, challenge the distribution of power and cipants to take part in each stage of the research
resources in society, and gain more control over process. The level of motivation of the group is an
their lives, participatory research is used to important consideration.
enhance their awareness and confidence, and to Full participation may slow down the research
empower their actions (Lammerink 1994). Other process to such an extent that all impetus and
36
enthusiasm is lost. Progress must be evident to cation of tasks and the process of accountability
maintain the interest and commitment of the need to be clearly and openly communicated
group, while not compromising the process. The during the research.
balance is a delicate one. For specialized tasks, The participatory research process has advan-
the group may need to consider delegating tages which go beyond the research itself. It can
responsibility. Each participant remains account- strengthen the democratic processes in a health
able to the group, and the value of the participa- project or community; it can lead to greater self-
tory approach need not be lost. reliance in the participating group and greater
Participatory methodology and participatory involvement in decision-making; it can encour-
research can be qualitative or quantitative. There age trust, responsibility and equality among all
are no typical participatory research methods. concerned; and it can equip project and com-
Conventional research methods such as struc- munity members with the knowledge, skill and
tured questionnaires can be used in a participa- attitudes needed for working together to tackle
tory way. Other unconventional, accessible their problems and meet their needs. While the
methods of data collection, analysis and report- research process can be empowering, the re-
ing results may need to be created in response to search product itself may also benefit from
the problem, the context, and in response to community involvement, although this is not a
‘where the participants are at’ (Meulenberg- primary motivation for participatory research. It
Buskens 1994). may be a more accurate reflection of local real-
Examples of these are mapping exercises in ities, problems and needs, thus providing a better
which community members create diagrams and grounding for more culturally relevant and thus
pictures (maps) visually illustrating information more effective and meaningful local action
about their community, group discussions, a ‘pic- (Maquire 1994). Thus any action, policy or social
tionnaire’ (see Figure 3.6), group tallying exer- programme emerging from such research is less
cises to collate results, the use of visual images, likely to marginalize these communities, and
radio, photographs, exhibitions and popular more likely to truly serve them (Khanna 1994).
theatre to report results. The participatory One of the requirements for engaging in partici-
researcher needs flexibility and creativity -to patory research is time. One of its major dis-
design innovative, appropriate methods. advantages is that it is often a long, very time-
A participatory approach to research does not consuming process. In participatory research, the
mean that the scientific merit of the methods or researchers may relinquish control over the identi-
the validity of the results needs to be compro- fication of the research question, as well as over re-
mised. The researchers need to facilitate an search funds, and they may experience this as a dis-
educational process in which the community advantage. A further difficulty researchers may ex-
participants are introduced to previously un- perience relates to the process of establishing the
familiar research methods, technologies and involvement of a community, or accountability to a
processes in accessible language, free of scientific community. In deciding which individuals or orga-
jargon. Thus the researcher needs to take on the nizations represent the community’s interests, re-
role of educator or trainer. Time must be set aside searchers may become embroiled in that commu-
for the learning and the participation processes. nity’s political conflicts and power struggles.
In the research process, the researcher too
becomes a learner and may find him- or herself Evaluation of community participation in
changed and challenged in sometimes profound projects
and unsettling ways through the process of Many health projects explicitly state community
engaging in collective investigation, education participation as a goal of the project. Indeed,
and perhaps action (Maquire 1994). community participation is seen as an important
Respect and clear communication are crucial component of the primary health care approach
qualities in the relationship between the re- which has been adopted by many countries,
searchers and the community in the research including South Africa. However, it is usually
process. The resources and experience brought difficult to evaluate how well this aspect of a
by each party into the research partnership are service or project is being implemented.
different. This should be acknowledged, and each A variety of criteria for judging the quality of
group should appreciate and respect what they participation in projects have been put forward
can learn from and share with the other. The allo- by different authors. Cohen and Uphoff (1977)
37
suggest three main criteria for evaluation. The

first considers the kind of participation which Pet ge ee Ce framework to
takes place in the project/service, and suggests assess participation in projects
that evaluators reflect on participation in deci-
sion-making, implementation, benefits and eval-
uation. The second criterion considers the kind of
people participating (who?) — local residents, Z
&o
local leaders, government personnel or foreign %,oD
personnel. The last criterion considers how patti- f
.
cipation takes place. A suggested adaptation to %2
this framework is to include an autonomy/ ’ Management
a
dependency scale with each item to incorporate

the issue of empowerment.
A framework proposed by Rifkin, Muller and
Bichmann (1988) assesses how wide participation
in projects is on a continuum developed for each
of five important aspects of project development:
needs assessment, management, resource mobil-
ization, organization, and leadership.
Values are plotted along the five axes to give a
visual presentation of the width of participation Source: Rifkin S B, Muller F, Bichmann W, 1988.
(see Figure 3.3). These values are not ‘absolute’ or
‘correct’, but provide a value against which re-
peated assessments can be compared. Results of munity participation is something that develops
repeated assessments can be superimposed on gradually, and programmes need a tool which can
the baseline to see if participation in the project provide an assessment of current situations while
has broadened. This is important because com- also allowing for comparisons over time.
Sree CCRC Ce Rm Ce |
community participation
Needs and skills assessment Organization of health committee
Initial needs assessment Formation of committee

Skills identification Decision making of committee
Ongoing research and evaluation Accountability of committee
Management
Management by committee
Management by staff
Skills development
Leadership of health committee Resource mobilization
How leader was chosen Raising resources

Role of leader in allowing participation Resources from the community
Role of other committee members Control over allocation
Source: Chetty K, Owen P, 1994.
38
siaquiow
39
general philosophical issues in epidemiology
yj -NUWWOD
}efoid
sowwelsojd
Aju
puoheg
pua}xe
siaquiew
0}
BuldojaAap
Aywnoe
ou 1A
uoNnueu}
‘s|}!4s
seyIWWod pue eis
se
e}s joafoid 0} poy tui)
-dojarep
JUS
juawdojeaap
ynq
0efoid 0} payuuy ue Si[Iys ale Sj|jIys dojaAep S||D¥S dojaaap
sjiI4S
aen!wwoo
dojanep 0] sowiwessold 0] SowWWeIsO1d 0} suiWeJso1d ON SIIIMS
jo
Ayyige
4unoode
0}
auly
senuoyjne senuoyine
sejngaiJo
“yes
yoRq
WOd
jyeys UUM-yodai
01 SI Ayyiqejyunoooe
yoofoudSISQUOU
0} Aqiiqequnoooe
aeiwwoo
saljigisuodsas
Be}IWWOO Ayigeyunoooe Jo oul| Jo aur] “Sanss! pe}oajas JO aur] “aayWwW0S

UO SdPIWLUOD 8} O} 0} AjiqejuNosNe
pauinsse
Jeajoup ‘seyiuwos
Ajjeonew
01 Aijiqejunoooe
‘aayIwWwodD
juaweseuew
JO aury “seyIWWOO QU} 0} yOoRg-Woda OU yoeq suodai eis yels yNOUUM He}js |euoissa}
0} Aq
Aigisuodsai
juowaseue|\
peyesajap
ay} 0} yoeq-uodei YM Ing Hers yoa[oid 0} -oid Aq payeuiwop
Aq
-oyne
uonesejeq
jnq
90Y pe UUM seyIWWOD Aq eenwwoo Aq peyesajep Aja}e|dwo9 jng ‘ye}s
jou Aq
eS
palesejep senijiqisuodsal AjjeussO} SaNiiqisuodses 0} payesajep Ajjeuuo} eis yoofoid
0}
3
Juowaseue|| Jusweseue|| jou JuUsWaseUe|| Aq Juaweseuey

Jo
jusweseuew
PsAlOAUl
YUM
JOU SI da;IWWWOO
aun nq ‘Ayiqisuodsed
syedxa Jo eis yoofoid/spodxe Juowaeseuew
uonezijnn /Nuoujne yyeoy yyM awos seyiepes] | jyeis eafoid/syedxe
ayeudoiddy juowesgeuew poueus si yse] “yes yoofoid/syedxe /Nuoyine yyeoy
*Aiqisuodsas Juaw /Nuoyne yyWyeoy yi 10 Ayjiqisuodsai
INANOdWOO9
"peseueU-}|as Jo ped Joye0/8 oy}
Aja}e|dwo0o 404 Ayjiqisuodsey Sulye} -eseuew awos aAey ay} 40 Ayjiqisuodsa 8y} S| swweJsoid BOTIWLWOD ay}
seywwo) | ‘peseuew-sjes seyiww0D | Jeped| pue sayIWWOD au} Ajurew JuaWeseue|| au} jo juaweseue; | Aq JUswegeue/\
LIND
Vida
j00foid e jo JUOWOSeUeU OY} UI UO}edIDI}Aed APUNUIWIOD SulyUues 10} PLIOIUD ZS 9/qGeL
Chetty and Owen (1994) adapted and expanded written-up, or if written-up, not read, or if read,
Rifkin’s criteria to allow assessment of participa- not used or acted upon. Only a minuscule propor-
tion using a more detailed method of scoring. tion, if any, of the findings affect policy and they
Rifkin’s five axes were each divided into three are usually a few simple totals’ (Chambers 1989).
components, assessed, and ranked from 1 to 5 As the costs of research escalate, there is grow-
according to predetermined guidelines. This was ing concern about the value society receives in
developed in the South African setting, recogniz- return for the investment it makes in research.
ing the diverse contexts in which community The traditional view that the pursuit of knowledge
participation takes place. for its own sake is the mark of a civilized society,
Figure 3.4 gives the components into which and should be supported as an act of faith, is no
each of Rifkin’s axes were divided and provides a longer unquestioningly accepted. A limited
useful guideline for projects in the initial plan- research budget demands that choices have to be
ning stage, as well as later in the project when made. The current trend in research funding
participation needs to be improved. Table 3.2 focuses positively on areas that are most likely to
describes criteria for ranking each of the three yield a return to society (Twiss 1988).
components along the ‘management’ axis, show- The purpose of epidemiological research is to
ing some of the qualities that allow a project to be improve public health by identifying and redefin-
ranked high on community participation. ing solutions to problems through the generation
When evaluating community participation it is of new knowledge. The generation of new know-
important to identify exactly what aspects and ledge or technologies alone is not an adequate
degree of community participation the specific goal in such an applied discipline.
project or service aims at. Once these have been Most researchers hope their results will be read
identified, qualitative and quantitative methods and acted upon by policy-makers, and assume
can be used to see if the goals have been reached. that the utilization of research information
Chetty and Owen propose the use of semi-struc- depends primarily on the intrinsic worth of that
tured interviews with as many different role information. This assumption has been proved
players as possible in a given project. Qualitative empirically to be false (Weiss, Bucuvalas 1977).
methods in the form of focus groups and in-depth For a research project to make a contribution to
interviews can also be extremely useful. health problems, the researcher needs to be inno-
vative not only in generating new knowledge, but
Conclusion also in disseminating it to potential users (stake-
An approach embodying community participa- holders). Training in epidemiological research
tion can be applied to many public health skills rarely emphasizes any aspects of the imple-
endeavours, including epidemiological studies. mentation of research recommendations, result-
Such an approach demands that researchers ing in limited practical application of research
acknowledge, respect, understand and involve findings in the public health field.
the ‘populations’ they study. A participatory
approach, while not appropriate in all epidemio- The ‘implementation of research’
logical studies, can enhance both the research To ‘implement’ means to ‘bring into effect’. The
results and the implementation of its findings and definition must be interpreted in a flexible way if
recommendations. Public health has been it is to be a useful concept in research. It may be
defined as efforts ‘directed to the maintenance used in a situation when the extension of know-
and improvement of the health of the people ledge leads to changes in policy, behaviour, deci-
through collective or social action’ (Last 1988). sion-making rules or the allocation of resources.
Community participation provides a way of Research may yield a wide range of changes. At
ensuring that epidemiological findings can facil- the simplest level, the change may be in the level
itate such action. of knowledge or understanding of a problem. This
knowledge may precipitate the development of
Planning for the implementation of new skills or lead to a change in the behaviour of
research individuals or groups. The new information may
lead to changes in organizational policy or, at the
Introduction most complex level, a change in individual
‘Much of the material remains unprocessed, or assumptions, values and beliefs. These changes
if processed, unanalysed, or -if analysed, not may infiltrate from the individual to the social
40
group. The research objective of changing the

level of knowledge of individuals is simple to Figure 3.5 Policy development
achieve. Developing new skills, attitudes and cycle
values is increasingly difficult to achieve.
The researcher has a role to play in all four steps Identify problem
of the policy development cycle (see Figure 3.5). tie eggs
Researchers can help health providers clarify Evaluate strategy Plan strategy
loosely identified problems and develop an
appropriate research approach. The major input Implement strategy
of research tends to consist of describing the
nature, extent and determinants of the problem.
This information is useful to health providers who global rather than specific recommendations
are planning an intervention designed to remedy may confuse the policy-makers. Policy-makers
the problem. The researcher may have a limited complain that research recommendations are
role to play in the actual implementation of the frequently too impractical to implement.
intervention strategy. During the evaluation of The information generated from research is
the strategy, the researcher makes a major contri- often published in scientific journals. These jour-
bution by assessing outcomes related to interven- nals may not be easily accessible or of interest to
tions introduced. Frequently, the outcome eval- potential research users. If research information
uation will proceed to an economic evaluation to were published in more appropriate ways, then
determine the cost-effectiveness of health care decision-makers might be more likely to use it
interventions. (Gouws 1988). This would require a careful ana-
lysis of the target population and the optimal way
Obstacles to the utilization of research in which the information could affect their deci-
information sion-making.
Effective communication between the researcher While there is a gap between policy-makers/
and the research user facilitates the fuller use of providers and researchers, there is an even
research results. Effective communication is nota greater gap between the public and researchers. It
one-way process, but a dialogue that takes place is thus important to consider not only the imple-
over. time. Good communication allows both menters of change but also the would-be reci-
sides to share their concerns, and enables the pients of that change.
message to be reshaped to suit the needs, con-
cerns and circumstances of the receiver. The fact Facilitating the implementation of research
that the researcher and the decision-maker Closing the communication gap between the
seldom form part of the same health team world of research and routine organizational
hampers communication. Researchers may lack practice helps to ensure better use of research
understanding of the decision-making process results. Participation in the knowledge-building
and may frame the research question in a way process has proved to be a successful way of
that has little relevance to decision-makers. The ensuring that research-based information is
research process takes time to execute, and this appropriately implemented.
may result in a delay in providing the research Researchers need to:
user with essential information for decision- co-operate with decision-makers in establish-
making. Policy-makers may want information in ing their information needs. Decision-makers
days while researchers may need months to should be involved in the development of
provide research results. research objectives. Consultation with people
Decision-making in health care services is a in the field of application — clinicians and
complex process which may be affected by vari- managers alike — will ensure that the research
ables other than research-based information, will be relevant outside the strictly research/
such as economic, political or cultural agendas. academic environment.
Researchers tend to communicate their results @ identify all potential users and benefactors
in highly specialized terminology which may not (stakeholders) of the research information at
be easily interpreted by policy-makers who are the inception of the project. These may include
not trained in biostatistics and research method- governmental and non-governmental service
ology. The tendency for researchers to make providers, and community-based organiza-
4l
tions. It is very important that the stakeholders sented by HIV vaccine trials is the extent to which
closest to the unit of study be involved; for researchers must educate the study subjects
example, village structures and not regional about ‘safer sex’ practices. The research needs
structures, if the focus is on village level. ‘unsafe sex’ to prove the value of the vaccine, yet
¢ define the implementation objectives at the the principle of beneficence demands that we go
protocol development stage. These differ from out of our way to encourage ‘safer sex’ practices.
general scientific objectives in that they state If you were asked to take part in this trial as a re-
the purpose of the study and what is to be done searcher, how would you solve this dilemma?
with the results. For example, a study of the
prevalence of malnutrition (scientific objec- 2. A research project involving prostitutes in
tive) is only one step towards the true imple- inner-cities needs their co-operation over a
mentation objective — reduction of malnutri- period of time. The researcher has to obtain their
tion in the community. personal details to ensure follow-up, and devel-
@ involve the stakeholders in the research ops an intricate system of recording information
process and keep their interest in the methods so that only she knows the identity of the persons.
and analysis of results. The moral dilemma in this case centres around
@ give feedback regarding the results directly, the researcher. The only way to help improve the
accessibly and promptly to the stakeholders. health of the prostitutes is through the research,
for which personal details are needed. Yet, prosti-
Conclusion tution is illegal, and if a court of law were to order
The issue of how intimately the researcher should the researcher to hand over all records, she would
be involved in facilitating the implementation of then be required to do so, thus breaking her pro-
research recommendations is a controversial mise of confidentiality.
one. In the past, there has been a tendency for What would you do in this case?
epidemiological research to be practised inde-
pendent of any health practice application. There 3. A researcher wishes to investigate the quality of
is a need to link research to policy and many re- care given by pharmacists to children with infan-
search organizations and funders are now insist- tile gastro-enteritis. The questions relate both to
ing that implementation objectives are stated the diagnosis reached and to the therapy pro-
explicitly in epidemiological protocols and fund- vided. Ideally, the researcher would like to com-
ing proposals. Many of the issues involved in pare the quality of care given by pharmacists to
implementing research recommendations are the care given by general practitioners. However,
difficult to negotiate. In attempting to define the this will not be possible for methodological
issues, researchers may take the first step in reasons, as the two professions will require en-
developing a strategy to improve the utilization of tirely different assessment procedures. On asking
research-based information. the opinion of a pharmacist about the study, she
suggests that the study not be conducted at this
time, as it could prejudice the tense debate
between medical practitioners and pharmacists
EXAMPLE 3.1 about providing, and charging, for primary care.
Some ethical dilemmas to stimulate In other words, any negative results of the study
could be used by medical practitioners to argue
ethical reasoning their case for not allowing pharmacists to engage
1. The development of an effective preventive HIV in primary care activities.
vaccine would constitute one of the great medical Clearly, the information on the quality of care
miracles of this or the next century. HIV vaccine delivered by any health worker is pertinent to a
trials are in the process of beginning, and many good health service, and is therefore morally jus-
have proposed developing countries as trial sites. tified on the basis of beneficence. On the other
In any HIV trial, researchers will have to compare hand, the results may lead to an unfair advantage
a vaccinated group with an unvaccinated (or (at least perceived as such by pharmacists) in the
placebo-vaccinated) group for incidence of infec- dispute between the two groups, and may there-
tion with HIV. To become infected with HIV, fore not be justifiable on the basis of the principle
however, one needs to engage in ‘unsafe sexual of justice.
practices’. One of the great moral dilemmas pre- How would you deal with this?
42
EXAMPLE 3.2 the ‘organized’ sector of the community. Addition-

Community participation: Mamre ally, a community-elected committee works
closely with the academic arm of the project in
Community Health Project planning and running activities. Initially, the com-
The Mamre Community Health Project was mittee’s absence in the day-to-day management
launched by the Medical Research Council and of the project and its lack of access to the funds
the Community Health Department of the which were administered by the university threat-
University of Cape Town (UCT) primarily as a ened to undermine its full participation in the
public health research initiative in 1986. project. Restructuring of the project and redefini-
Community participation was a working princi- tion of the role of the committee has addressed
ple of the project from its inception. Access to the these problems. Currently, the committee has the
community was first negotiated by members of the power to veto research, teaching, and services, and
research team together with the major groupings cettain of the funds have been placed under their
in Mamre: the church, the Village Management direct control. The most recent development, in
Board, and the school. A public meeting attempted keeping with the ultimate goal of integrating the
to get broader support for the project from Mamre project into the district health plan, is that certain
residents. A steering committee, comprising services have been incorporated into the local
mainly professional Mamre residents, volunteered health services and funded by the local authority.
to assist the project in its research, thus function- The Mamre Community Health Project has had
ing as a reference group for research efforts. By the an influence on the broader development of the
time the baseline research was undertaken, con- community. The politically non-aligned position
sent had been obtained and limited participation of the project and the fact that health is an issue
of the community had been achieved through its around which people of differing political persu-
representation on the steering committee, and asions are prepared to cooperate, created the
involvement of local women as interviewers. space for members of this politically divided com-
Efforts by the research team to popularize the munity to co-operate in a project for the common
research findings included posters at community good. The policy of broad consultation and
events, slide-tape shows facilitated by the inter- accountability was in itself an important process
viewers in the homes of Mamre residents, and the for the (then) disenfranchised and marginalized
delivery of a simple, illustrated booklet of results community, and contributed in a substantial way
to each household in Mamre. to the community’s increasing sense of determin-
The completion of the baseline study describing its own development. The community is able
ing the community and its health stimulated inte- to participate in research, understand the results,
rest in the project from different quarters. More and use them for advocating changes to improve
clinically orientated departments at UCT became the quality of life for the community at large.
involved in the project, and the community itself On a less visible level, the project has been
expressed the need to see something ‘done’ about responsible for human resource development
the problems identified. Thus the project focus through the training of project staff, as well as
broadened to include substantial service and through its specific activities. These include life-
training components. skills training and career guidance among the
The introduction of new types of activities in youth, involvement of members of the commun-
the project provided the opportunity to increase ity as facilitators for the lifestyle modification pro-
community participation and control. The aim grammes, and the training of families to cope with
was to involve the community as much as pos- the disabled or chronically ill in the community.
sible in deciding on the policy and activities of the In terms of the evaluation of community parti-
project. Academics, clinicians, and researchers cipation using the Chetty and Owen adaptation of
from UCT and the Medical Research Council Rifkin’s participation criteria of the management
remained involved as technical advisors or con- component, the project ranks 3 on management
sultants on the project, and staff employed by the by committee, 5 on management by project staff,
project (who were members of the community) and 5 on skills development.
ran the day-to-day activities of the project. The exposure and access to health, welfare
A quarterly meeting of representatives from all and community development agencies outside
organizations and clubs in Mamre provides an Mamre have increased substantially as a result of
opportunity for consultation with and feedback to the policy of promoting the development of net-
43
works to support and inform activities in Mamre. EXAMPLE 3.4

Close working relationships have been developed Community participation: Respira-
with people in the community, and communal tory health survey
efforts, now largely independent of the Mamre
Community Health Project, continue to flourish. A union identified a particular health problem
facing its workers and asked the Health Inform-
References: ation Centre (HIC) in Johannesburg to help inves-
Katzenellenbogen J, Pick W, Hoffman M, Weir G. ‘Commun- tigate the problem. A respiratory health survey was
ity participation in the Mamre Community Health Project.’ undertaken to determine whether a certain dust in
South African Medical Journal 1988; 74:335-8. the workplace could cause respiratory disease,
Katzenellenbogen J, Swartz L, Hoffman M. ‘From research to and a participatory approach was adopted.
service provision: the Mamre Community Health Project — The request came from a democratic trade
seven years later’ (editorial). South African Medical Journal union representing the interests of the people
1995; 85(9): 843-4. who were to be researched. Information about
Louw], Katzenellenbogen J, Carolissen R. ‘Community the study was given to the entire workforce, and
health needs, community participation and evaluation.’ the go-ahead was confirmed at the hostels and at
Evaluation and Programme Planning 1995; 18(14): 365-9. the start of the early morning shift.
The protocol, designed by the HIC according to
the union’s requirements, was discussed with
EXAMPLE 3.3 management, and shift changes were allowed to
accommodate the study. The study consisted of a
Community participation: Ithusheng
respiratory questionnaire and lung function tests
VHW evaluation at the beginning and end of the working week.
During 1990, the Ithusheng Health Project in the Bronchodilator and skin-prick testing was done
Northern Transvaal did a community survey to to determine the atopic status of the working
evaluate the work of their Village Health Workers population.
(VHWs). The external researchers first involved the The data analysis was done by HIC staff and the
VHWs in determining which aspects of their work results presented to the union and management
they saw as most important and how they would in writing. In addition, a workers’ report-back was
evaluate those aspects of their work. The VHWs organized. The entire workforce came to a 6 am
then drew up aset of evaluation questions to use in meeting, where, through translation, the results
a community survey. In order to accommodate of the study were explained.
VHWs with relatively low literacy, the questions Together with the union, the researchers
were put into a pictionnaire (see Figure 3.6). This produced education booklets on the hazards of
made it possible for less literate VHWs to use in the industry. Workers used the results of this
their own villages. After collecting the data, the study to put forward demands for safer methods
VHWSs analysed the data by means of the hand- of handling the raw material, with less dust. After
tally method. The responses on each completed two years, better working conditions were intro-
pictionnaire were called out by the VHWs and duced in every factory organized by this union.
recorded on a big sheet (see Figure 3.7). After Action was thus taken as a result of the research.
adding up the totals, VHWs got immediate feed- In this example, research was conducted
back of the results. They then participated in the through the initiative of the workers. Limited par-
analysis of the results and in discussing the impli- ticipation was achieved in the research process it-
cations of the results on their work. In this case, the self in that few, if any, research skills were learned
health workers participated in the design as well as by the workers. However, the researchers re-
the implementation and analysis of the evaluation. mained accountable to the workforce throughout
The main objective of the researchers was to teach and did involve the union in the production of the
the VHWs how to evaluate their own work and how booklet, one component of the implementation
to build evaluation into their project planning. of the results. The target population also received
benefit from the study.
Reference:
Van der Walt H, Hoogendoorn L. ‘Training for self-evalu- Reference:
ation. Experiences at Ithusheng Health Centre, South ‘Health Information Centre. Worker participation in a
Africa.’ Learning for Health 1995; 7:8-12. respiratory health survey.’ Critical Health 1989; 20:30-34.
44
SP eM MCC MUR UCm hmeCLeCd)
Husheng VHW survey October 1990
eme kae?
1. Ngawana wa gogo wa mafelelo ona le mengwago
(How old is your child ?)
fi. Pale rsEa Ee

2. Ofepile ngwana wogogo wa matelelo bjang®
€ Did you L hioe your child?)
%,O mamisite ngwana wa gago mako e kae ?

CIF yes, for how long!P)
Sone TB eee tO Fil le

4.© ubora!
naa: >
eb lela lagago le hlwekile pit ?)
a 's es Pues ey yard? |s there melee
[= ®|2 “ef p1__—

5. eae int ees rte
eb dir etse a Motswako:
ge Fos opt ei ane make thesu dai
ee Taet¥o Cif yes,tell me how) [Eee]
6. LO
meetse a motswako neng ?
O mota
CHew much and when do you give it )
8, Le nyaka gore ba tsa Maphelo bale direle eng©

(What would you like your health worker tp do°
45
ar ee eM Na ORCC hme LE
s 135 (897%)
46
Literature review _
_ G Joubert,J Katzenellenbc
Sampling oe
G Joubert, J Katzenellenbogen
Datacollection and measurement

. oe
iv
Planning a
research project
Introduction track records. It is also good to have one or two

people with research experience on your research
The research process includes planning, execu-
team.
tion, analytic and reporting phases (Figure 4.1).
The roles of the different team members must
This section covers the planning phase, which
be spelt out to avoid duplication or omission of
involves systematic preparation for a research
tasks. Researchers should be given clear areas of
study.
responsibility, for example data manager, field
co-ordinator, questionnaire designer-in-chief
Figure 4.1 The research process and sampling, so that continuity is provided for
each task.
Regular meetings should be held to keep every-
Planning phase body in touch with the study, to make joint deci-
sions and to provide a forum for discussing
concepts and understandings that are different
among the different members.
Execution phase At the protocol development stage it should be
discussed which team members will take
responsibility for the writing up of publications
resulting from the study. In this way, ambiguities
Analytic phase
and conflict about authorship can be avoided at a
later stage.
The following three common mistakes should
Reporting phase be avoided when assembling the research team:
¢ giving over-committed and high-profile people
token co-investigator status on the project.
Before any individual researcher can be recog-
Assembling the research team nized as a co-investigator, that person’s role
Community health research is often done by and responsibilities must be made clear;
research teams rather than by individuals. The including incompetent people who fail to do
research team should not have too many mem- their tasks well and in time. It can be very frus-
bers, as this may hinder effective communication trating for a group when one member’s incom-
and make meetings cumbersome. However, it is petence has to be continually considered and
good to have people who contribute different compensated for;
skills and input into the design and implementa- @ creating the impression that everyone men-
tion of the study. Co-workers should preferably tioned on the protocol will be a co-author of
be people with whom you know you will work every publication resulting from the study. It
well, are reliable, facilitate action and have good should be made clear that there is no ‘right’ to
49
b epidemiological research methods and protocol development
authorship — it must be earned through intel- comparability of results. Protocols are also
lectual contribution and work. submitted to funding agencies (for example the
Medical Research Council) when funds are
sought, and to ethical committees for approval.
The study protocol Universities require protocols before confirming
The study protocol, sometimes called a research the registration of a student for postgraduate
proposal, is a formal written document which is study. The protocol eventually helps in the
prepared prior to the study. It documents the writing up of the study.
reasons for the study and the details of the The protocol is a dynamic document which
methodology and, in doing so, gives a clear plan keeps changing as the planning of the project
of exactly what needs to be done. progresses. A very crude protocol might be pre-
The study protocol has many purposes. It helps sented in a pre-protocol session where colleagues
the researcher focus on critical issues, clarify review the initial ideas and brainstorm their feas-
ideas and keep on track. It also enables outsiders ibility with the researcher. Based on input at these
to understand what is intended so they can sessions, a more thorough and systematic proto-
comment critically either on the written docu- col can be drawn up and critiqued at a later proto-
ment itself, or at peer group protocol meetings col meeting. Changes and adaptations based on
where protocols are reviewed. A clearly written recommendations made by peers and consult-
protocol also enables other researchers to repeat ants should be made so that the benefit of the
the same design in another setting and allows for peer review process is not lost. Key stakeholders
in the research should be invoived in the protocol
development to ensure that there is agreement on
Table 4.1 Major headings ina the importance of the issues being studied, that
chteateted there is acommitment to act on the results, and to
enhance participation. Once the protocol has
Tite,piraeet investigator, co-investi been accepted by an ethics committee it should,
institutional affiliations, and qualifications however, not be changed further. If more changes
are required, these changes have to be approved
Summary/Abstract by the ethics committee.
1 Introduction The process of protocol development can be
1.1 Literature revi W slow if the researcher is inexperienced, if there are
1.2 Motivation for the study (problem) few resources and consultants available, and if
1.3 Purpose the area of research is a new one.
1.4 Specific objectives
41.5 Implementation ee Contents of the protocol
Methods o A protocol should start with a title and authors
2.1 Definition of terms page. An informative (but not cumbersome) title
2.2 Study design which clearly describes the purpose of the study is
2.3 Study population and sampling important. This information is placed on many
databases. Cryptic titles are not desirable. The
principal investigator’s name, qualifications and
institutional affiliation should then be stated.
3.1 Responsibilities of investigators Some funding or reviewing institutions (for exam-
3.2 Responsibilities of staff ple, the National Institutes for Health in the USA)
3.3 Time schedule : do not accept more than one principal invest-
Data management and ae igator, while other institutions are more lenient. A
_ Resources _ list of co-investigators (either in order of import-
5.1 Available resources ance to the study or in alphabetical order) with
5.2 Budget and budget motivation. qualifications and institutional affiliations must
Ethical and legal considerations — also be included.
Reporting of results _ The title page is followed by a brief summary or
References _ abstract which gives the reader a general over-
Appendices view of the proposed study. A short statement of
the purpose of the study, the major research
50
4 planning a research project
questions to be asked and the methods to be 4 Data management and.analysis: Details

used, should be included. should be given of the means of data manage-
The major headings to be used in a protocol are ment and analysis (computer or not), the
included in Table 4.1. A brief indication of what methods of analysis, and any statistical support
each heading covers is given below. The main or advice received to date, or planned for the
points are discussed in detail in the other chap- future. Useful additions to a protocol are
ters of this section. ‘dummy’ or mock tables which indicate how
the final tables of results will look.
1 The introduction describes the problem to be 5 Resources: Detailed estimates of personnel,
studied and reviews relevant available informa- equipment and financial resources (for salaries,
tion. It provides the motivation for the re- travel costs, equipment, computing, telephone
search, and the specific research questions that calls, stationery) need to be recorded. An esti-
are being asked. The overall purpose or aim, mated budget is useful and often essential.
and specific objectives of the study should be 6 Ethical and legal considerations: In the proto-
clearly stated. col, plans for safeguarding the rights and wel-
2 The methods section gives a comprehensive fare of participants should be explained. Steps
description of the methodology to be used. For to ensure ethical and legal access to commu-
the sake of exactness and clarity, definition of nities, groups or records should be specified.
terms must be included. All key terms referred \] Reporting of the results: The protocol should
to in the aim or topic of the study, and the state the specific format and manner in which
specific objectives, must be defined. This results will be reported to both the public in
ensures that all people reading the protocol general and particular individuals and groups.
have the same understanding. Definitions also Target groups which can facilitate the imple-
help the researcher clarify concepts for him- or mentation of recommendations based on the
herself. research findings should be identified.
The investigator must state which type of 8 References should be included, using an
epidemiological study (study design) will be accepted format.
used to best meet the objectives. ; 9 Appendices should include a copy of the ques-
The study population must be identified tionnaire/data capture sheet and any other
precisely in terms of sex, age, place, condition, items of importance.
and any other relevant factor. If sampling is to
be used, the sampling strategy must be A detailed checklist of points which should be
described in detail. The source of the sampling covered in a protocol is given in Table 4.2. This
frame, and the intended sample size (and checklist was drawn up to ensure that all impor-
reasons for this) should be stated. tant scientific issues would be covered in the
The method of data collection (measurement) protocol and considered by the protocol reviewer.
should be specified, as well as the variables or If a protocol is submitted to a funding agency, a
characteristics to be measured. Measuring detailed budget would also have to be provided.
instruments should be described, and an indi-
cation of their validity and reliability provided. Bias in research
The protocol should report on or state the Bias can enter the research process at any stage,
intention to conduct any pilot studies. These and special attention should be given in the pro-
are mini-studies which test part(s) of the study tocol to outlining how bias will be avoided at the
before the main study. Pilot studies usually various stages.
check the methods (for example, instruments The Chambers Dictionary defines bias as a
or logistics), obtain data to assist in sample size ‘one-sided inclination of the mind’. In research,
estimation and/or test the adequacy of field bias takes on a more technical meaning, but
training. _remains true to the above definition in some
3 The practical logistics of the study must be ways. It is any process which produces results or
carefully planned. These include clear task allo- conclusions that differ from the truth in a sys-
cation, organization of venues, shifts or duties, tematic (one-sided) way.
and transport. A realistic approximate time Bias can enter the research process at any stage:
schedule for carrying out various aspects of the in the stage of reviewing the literature (for
study should be attached. example, the researcher restricts articles to
ol
Table 4.2 Protocol checklist
Protocol title
i LARA
Date received Date to be presented

Components to be considered when reviewing the protocol — please attach detailed comments, if any
Acceptable Minor Major Should N/A

changes changes be
required required included
Literature review
Definition of the problem/
hypothesis/research question
Aims and objectives
Implementation objective
Method: Study design
Study population
Sample size
Variables
Measurement
Validity
Reliability
Questionnaire
Coding
Data management
Analysis
Write-up
Pilot study
Project management
Timetabie
Ethics
Additional aspects (specify)
10 Do you recommend that this protocol be

¢@ Accepted (i.e. the suggested changes are minor)
Revised and resubmitted for acceptance
@ Rejected (i.e. the hypothesis has a major flaw)
Comments:
(Source: Centre for Epidemiological Research, Medical Research Council)
52
4 planning aresearch project
those that support a certain view or finding) In this manual, attention is given to bias which
insample selection (for example, selection of can occur in the selection process, data collection
non-comparable groups as cases and controls process, and due to confounding.
may give spurious results) Once present, bias is difficult to deal with and it
in executing an intervention (for example, is well worth the effort to be cautious in the proto-
patients from a control group may unintention- col design phase, so as to avoid bias wherever
ally receive or be influenced by intervention, possible.
thus ‘contaminating’ the results)
in measuring exposures and outcomes (for
example, instruments can over- or under-score
Conclusion
" measurements taken, hiding real differences or However well a study is planned, difficulties will
showing differences where there are none) arise. Public health research does not take place
@ in data analysis (for example deciding on the in ‘an isolated, structured environment like a
significance level after rather than before the laboratory, and thus the chances are quite high
analysis can lead to biased results) that something unforeseen can happen.
in interpreting the data (for example, there may The development and completion of the proto-
be over-interpretation of statistically but not col is merely the beginning of an often rather
clinically significant results) difficult emotional experience for the researcher.
@ in publication of results (for example, publica- Figure 4.2 describes some of the ups and downs
tion of positive results only may give a false that researchers can anticipate during the differ-
impression of the findings). ent phases of research.
The other chapters in this section will describe
Sackett (1979) published a catalogue of some selected elements of the research process in
50 biases which can occur in epidemiological detail. These include literature review, setting
studies. It is less important to know all the differ- objectives for a study, study design, study popula-
ent ‘jokers’ which can invalidate research find- tion and sampling, measurement, data manage-
ings than to know the principal situations in ment and analysis, resources, ethical and legal
which they can arise in the design of the research. considerations, and reporting of data.
TSPattes CM sel meet itm
©) Submit for ©
Q publication In print
Ecstatic Completion of fieldwork
CS Ready to andparty)
Y collect data
Click The idea Try to implement
“ results/ ,
Rejections recommendations
Initial protocol
development Corrections
Delays
Emotional ©
reaction Adjustments
Implement
Practical action at last
difficulties Peer review
feedback
Critique
cS
No-one interested
Coding a What now? Start writing up
catastrophe 6 @&) mental block
Miserable Planning Data collection Analytic Reporting Publication Implementation

phase phase phase phase phase phase
Experience has shown that researchers undergo similar patterns of emotions while involved with any research project.
Each stage of the research process tends to start on a positive note, but inevitably obstacles arise, with each stage having
its own set of difficulties. Problem-solving abilities and persistence are tested to the utmost. Beginner researchers should
note that once the fieldwork is completed, there is still a lot of work to be done. :
The above graphic describes emotional reactions (Y axis) to typical events during the different phases of a study (X
axis). It aims to help the researcher anticipate some of the emotional swings so that when (not if!) they occur, difficulties
can be seen to fit into a common pattern.
58
Literature review
Introduction for the researcher to spend at a university library.

Rural researchers can form their own support
Before starting a research project, the researcher
system by organizing a journal club which meets
should review the relevant literature to find out
regularly to discuss recent publications.
what work has been done in that field (content
The MRC and several universities provide a
area) and what methods have been used. At
computer literature search service. This facility
times, methods used in other content areas are
enables researchers to gain access to abstracts of
reviewed in order to adapt them to the research
all articles published in indexed journals (that is,
question. Such a review will enable the researcher
reputable journals that have been accepted onto
to clearly define what contribution the planned
a computer database) all over the world. By speci-
study will make: how it would contribute to exist-
fying keywords (words that describe the main
ing knowledge or how it differs from what has
topics of the article), users can extract a listing of
already been done. The researcher will become
all articles on a specified topic published over a
aware of the problems that other researchers
certain period. This listing usually consists of
have had to face. The literature review can be
authors’ names, journal name, article title and the
used as motivation for the study or for the specific
abstract. By specifying the years of interest, the
way in which the study will be carried out.
researcher can decide whether to look only at
recent publications or at what has been published
Finding the literature in the last 20 years. There are various journal
Doing a thorough review of the relevant literature databases, for example, those dealing with
may seem a daunting task to the researcher. One medical journals and those dealing with psycho-
way to start is to obtain one relevant article and logical journals. Most of the medical literature
follow up the articles referenced within that can be found on the Medline data base. De-
article, carrying on in this way. A review article of pending on one’s area of interest, various data-
the area or methodology (that is, an article which bases may be useful. But the aim is to obtain rele-
gives a broad overview) will provide excellent vant articles and therefore keywords should be
references to follow up. Researchers attached to clearly specified. Unfortunately, not all local jour-
universities can request such articles at the nals are indexed, for example, the Southern
university library or obtain them from other African Journal of Epidemiology and Infection.
libraries through the interlibrary loan system. One may thus have to look through these journals
Relevant articles can be identified using the Index individually to trace relevant articles. In addition,
Medicus at medical school libraries. Other theses available at university libraries may
researchers, such as health personnel working at provide valuable information.
rural clinics, may have problems gaining access to Such a literature review covers published
the literature. The ideal would be to have a con- results only. If you know of other researchers
tact at a university through whom requests could working on the same topic, you should contact
be channelled, but some time should be allocated them for copies of draft articles or internal
54
5 literature review
reports. There may, in addition, be many reports Systematizing and summarizing the
which are unpublished because they show incon- literature
clusive or negative results. They may be very diffi-
cult to track down, but their absence may bias the A large number of articles can be identified as
review. Relevant official reports, such as reports relevant to a specific topic, and one should there-
by the Medical Officer of Health, or reports pub- fore devise a system of organizing and summariz-
lished by the Medical Research Council, Human ing them before they start piling up. Various ap-
Sciences Research Council, or other bodies proaches can be used. For example, the research
should also be reviewed. topic can be divided into various themes, and ar-
ticles can be grouped into these themes. One
article may, however, fit into more than one
Reading the literature theme, and you could make copies of the article,
While going through the literature, whether or use a system of cross-referencing. Whichever
through a computer-based search or a reading of system you use, it is useful to have a card index
various journals, the researcher will of course system as well. Each article should be numbered
come across articles which are not directly related and the details of authors and publication noted
to the topic of interest. These should not be on the card, as well as keywords and quality of the
discussed in the literature review, since the litera- article. The cards should be ordered alphabetic-
ture review should show one’s knowledge and ally so that you can quickly check whether you
summarize relevant literature, not provide an have an article by a certain author.
exhaustive (and boring!) list of everything that has
ever been written on the subject. The aim is to Conclusion
synthesize relevant literature in a critical way.
From the title of an article, you should be able When writing up the literature review for a proto-
col or paper, you should draw together your
to determine whether the article is at all relevant
to the research topic. If it does seem relevant, you reading, critically summarizing only the really key
issues. In a thesis, the write-up would however be
should carefully read the summary. If it indicates
that the paper is relevant, read the entire article more detailed in order to show afull understand-
ing of the issues. In both instances, the literature
carefully, paying special attention to the Methods
review should describe scientific discoveries
section, to be sure that the Results and Conclu-
which build on past knowledge, thus drawing a
sion have a sound basis. Attention should be
picture of the current state of knowledge and
given to measurement, sampling and issues con-
highlighting gaps in the current state of know-
cerning study design, and articles should be
ledge. This will ensure that the contribution your
classified according to the quality of methods
study might make is clearly contextualized.
used. (Details given in the rest of Section B will
enable you to judge the quality of the methods
used.) Ideally, articles should not be included in
the write-up of the literature review if you did not
read more than just the abstract or summary.
55
Setting
objectives
for research
Introduction before deciding on the focus and scope of a spec-

ific project motivated by the problem. Exploring
Every research protocol must state what the
the problem thoroughly can help you to identify a
intended study wants to achieve. The process of
suitable topic or research question.
deciding on the research question might be
Any problem, whether it is a health system
lengthy, but is an extremely important one as it
problem (for example, the clinic is crowded; why?
lays the foundation of the protocol. This chapter
What can be done about it?) or a problem relating
deals with the process of formulating the research
to the relationship between exposure and out-
question(s) for a particular study and describes
come (for example, does smoking cause lung
steps taken before deciding on the methodolo-
cancer?), should be broken down systematically
gical detail of the study.
into its major components by the prospective
Each step will be described. It is useful to note
researcher so that the problem may be viewed
that while most researchers use a similar step-
wise approach, they do not always call the steps critically.
It is thus important to develop skills in critical
by the same name. So for example, Step 2 might
thinking. Such skills enable one to look at prob-
be called setting up the purpose or aim or topic or
lems in such a way that preconceptions are put
general objective of the study. These are words
that all mean more or less the same thing. It is aside, and issues are not considered at face value,
but from all angles, acknowledging complexities.
thus useful to remember the content of the steps
rather than the words used to decribe them. This should lead the researcher to a better under-
standing and a clearer idea of which aspects of the
problem would be most appropriate to tackle
Step 1: Identification and analysis of initially. (Note that critical thinking is not a skill
the research problem needed for research only — it is essential for any
Just as necessity leads to invention, so a public task requiring a good grasp of problems and plan-
health problem leads to public health research. ning for effective action.)
However, prospective researchers often decide David Werner, community development theo-
on a topic without thoroughly considering the rist and health activist, emphasizes the need to
original problem motivating the investigation develop a critical understanding of health issues.
and thus skip an essential step in the process of A much-used tool to facilitate critical thinking is
identifying objectives. the ‘why game’ (Werner and Bower 1991). We use
Since a public health problem can be broad and a South African example to illustrate the game
complex, it is important for the researcher to and how it can help develop a critical under-
come to grips with all aspects of the problem standing of health problems.
56
6 setting objectives for research
Consider the following scenario in an urban with arrows pointing to the circles they ‘caused’.
squatter setting: Repeat asking ‘why?’ to each response (brain-
Raa storm) until you run out ofideas.
/ 3-year old This process is repeated for each of the events.
Sipho plays You may return a few times to earlier ‘whys’ as
in stagnant you gain momentum and get new ideas. Some of
water / the events have similar causes, so these common
causes should be linked with arrows.
a " Sipho / mother tries It would be useful for the reader to do this exer-
/ dies on to care for him cise before going on to the next paragraph.
thewayto | ; athome giving | Once the ‘why?’ process has been exhausted, it
hospital _/ anenema _/ is useful to present the analysis in a clear, more
systematic way. This can be done by organizing
Starting with the first event circled (that is, Sipho factors and causes into larger categories such as
playing in stagnant water) ask the question ‘why?’ political factors, socio-economic factors, health
and put your response in a circle with an arrow service factors, disease-related factors and so on.
pointing towards the event. There may be a The choice of categories is up to you, but they
number of reasons why Sipho is playing in stag- should flow from the ‘reasons’ generated in the
nant water, so add these to the diagram. It is problem analysis. You should now be able to give
useful to generate as many reasons as possible. a critical analysis of why Sipho died. Table 6.1
Now for each reason generated, ask ‘why?’ and shows the factors which possibly contributed to
put your second-generation reasons in circles his death.
Preis)
( Rome Mle eile mie mM Met LC tee)
ROR yee ey 141]
Environmental factors Socio-economic factors

@ poor housing @ high cost of living coupled with low salary or
¢ poor (or absent) sewage system unemployment
poor drainage extreme poverty impacting on Sipho' s life
poor roads and layout of township through malnutrition
@ inadequate water supply ¢ low maternal education
no recreational facilities (safe and hygienic
Cultural factors
play areas)
« high fertility considered desirable, thus ee
Service-related factors family size
@ no 24-hour health service in the community some traditional practices harmful when ne
¢@ hospital inaccessible has diarrhoea (enema)
poor transport system
Political factors
¢ poor health education of mothers concerning
@ maldistriobution of land and wealth created and
causes and treatment of diarrhoea
reinforced by past political regimes
¢ poor planning, budgeting, and thus imple-
despite the political will to improve the quality
mentation of primary health care policy
of life for all South Africans, change on the
Disease-related factors ground is slow
water-borne disease @ economic restraints exist which disallow the
virulent virus government to rectify all social and economic
virus /bacteria induce diarrhoea and vomiting, imbalances in the short term
resulting in dehydration of the child
if child malnourished, the illness is more
severe and compromising to the child
if untreated, child can die from dehydration
Sf
a eee I ee aaa
In the early planning stage of a research project, Vague statements of purpose lead to vague
we often do not have aseries of events to analyse, studies. The purpose is a statement of the exact
but a central issue or problem. The same process ‘destination’ that you want to reach with the
of asking ‘why’ can be undertaken in exploring study: the methods are the ‘road’ you will take to
the central problem, for example, the high inci- get there. Exactness in the purpose requires deep
dence of unplanned teenage pregnancy. consideration of the problem which motivated
It is useful to do part of the problem exploration the study and a purposeful decision to restrict the
with colleagues and to consult the literature — study to one key aspect of the problem or its solu-
outside input always helps you to find further tion.
factors and question your own preconceived ideas. The purpose may be expressed in the form of a
Illustrated are diagrams of analyses of two statement that is as specific as possible, using
health problems which are commonly of concern action verbs (for example, ‘To determine the inci-
in South Africa (Figures 6.1 and 6.2). Note that the dence of diarrhoea among children under five in
same process was undertaken, but that the the summer months in Mamelodi’) or in the form
diagrams represent the end-product after all the of a question (for example, ‘What proportion of
messy and challenging work of asking ‘why?’ has Mamelodi children get diarrhoea during the
been completed. summer months?’).
The problem-exploration process does not Here are some examples of vague versus expli-
automatically suggest a research topic or ques- cit purpose statements.
tion. Instead, it improves understanding of the
problem and the multiple issues which impact on Vague purpose statements:
it. Ithelps to isolate the component of the problem 1 to study the problem of measles in South Africa.
that the researcher wants to study. The exact re- dS to investigate alcohol consumption as a con-
search question depends on existing knowledge tributor to adult mortality.

(including descriptive information about the size oo to determine the health profile of the Mamelodi
of the problem, who it affects, where and when it community.

occurs), what other studies have been done, what
is feasible to do in terms of logistics, money and Carefully considered purpose statements:
resources, and the possible usefulness of results ltto study trends in measles notifications in
yielded by the proposed study. South Africa from 1985 to 1994.
2 to determine the proportion of adult deaths
that are due to alcohol-related conditions.
Step 2: Explicit statement of the
3 to determine the nutritional status of children
purpose or aim of the study under tive years in Mamelodi.
Once the central problem has been fully explored,
the focus or topic of study needs to be decided In practice, the process of narrowing the focus of
upon and explicitly stated. Step 2 thus involves study from all the components and aspects of the
detailing the purpose, aim or intent of the study. problem to a clear, definable and achievable pur-
This states in general terms what information the pose is not easy. It is at this stage of the research
study hopes to obtain. process that initial contact can be made with
The statement of purpose of the study should experienced researchers, supervisors or tutors
preferably be no longer than one sentence, but who could assist in this process.
this sentence must state clearly and precisely Usually, the general problem as defined and
what the most important contribution of this explored can give rise to many possible research
study will be. The purpose serves as a reference questions which each need study in themselves.
point by which the relevance of each specific Thus, in order to do full justice to researching the
objective (see Step 3) will be measured. problem, a number of studies may need to be
The purpose is the part of the protocol that done over a period of time, each contributing to
many people will read before deciding whether to the overall understanding of the problem and
read the whole protocol. The purpose is not a thus to the solution (see Figure 6.3).
general statement of the problem; rather it is a Generally, in a single study one should only
specific (not woolly or vague) statement which attempt to answer one or at the most two general
articulates the main issue being described or the research questions. It is more useful to be able to
central hypothesis being tested. answer one or two questions well than to provide
58
setting objectives for research
“L66T ‘Vv aa|umoug ‘| UeUJEUeLUUJed ‘IND JASSIAAWIeA :39INOS

JaAo|dwea Woy
uoddns pure Hulpue}s Sinoy Hulusdo
uonedn209 > —— > \_-2pun ayenbapeu| Ua|UaAUOIU|
saul}
(AguOW
duipem
Hu07
JO
9WWI]
1SO9
Ul)
Ajiuuey yo SOAI}elad WO} WOdd
ns
uolyisodwiog jel00s ayenbapeu| S90I/UAS JO janet UBIH

Ayyigissaooe M07
6
S]uawiasinbal Huljjasunoo auwihe

UaW}eaJ}
JO ayenbepeu| uaw}eel}
Hulpueyssapun 9/qeynsuf
yualjed 100d ©
sjuaned gi jo S@d1UaS JO
a}ed 18)/NejJap Yop
Saduanbasuod pue ‘sesned
s6nip
se)n6e4|
‘suBis Jo sbpajmouy
Ajddns
Hulse}
jo
AYIUNWIW0d 1004
qualonsuy
uolsiedns
qualoiynsu}
Sal}i[IG!SSOO
JUaW}eaJ} aAIVeUJa}e swojduwAs Jo uononpal (swoydwis ‘subis
souljapin6
yainb Jo asuodsed Ou)
ayelidouddeu|
A\se8) UdIIpUod
jo Ayiqeyeny
UaW}e8/} 0} asuodsaYy jo ssausnolas
891ag
$10}92}
$10}92) 91W/0U099 puke jeJN}jNd-0190$ $10}9} paje|ai-aseasiq
sjuoned gi Suowe 93724 10};NeJop YRIY 0} BUIZNGIA}UOD S10}9By JO Meta ee) CEE Meaes E
St ee CELE eC re Ce aMTE
CYT Eertom rtm cL
Class factors Socio-political factors Service factors
pueeeal=7
Unemployment Apartheid in No sexuality education
Emphasis on unacceptable
Poor schooling methods: limited choice
Poor economic op-

| Distrust of services
Limited options portunity
No recreational Poor No contracep- Teenagers poorly
Delayed marriage facilities compliance tive use catered for
Psyche: Sexual abuse, Judgemental attitude

logical incest, rape of health workers
Ee HIGH RATE OF No abortion Abortion

[Boredom
TEENAGE PREGNANCY services illegal
Peer pressure Early sexual

activity
Adolescent rebellion
Limited planning to
Poor locus of control avoid pregnancy
Defeatism/fatalism
Limited conscious
future planning
Limited Limited decision-making Set hehavigural Gender

super- power
as : re customs for teenage relations
vision Denial of vulnerability g
pregnancy
A to pregnancy
Gender ster: :
Poor teenage/parent No sex pplicetaaidat
communication education Emphasis on prove fertility
Pace fertility and men to prove virility
model Breakdown of
cultural norms \dealized expectation
Pressure not to cohesion of women
fall pregnant
Teenage pregnancy in community
increasingly acceptable
Familial factors Cultural factors Gender-related factors
60
rst Mee MFCM] cere researching a problem
specific problem
ed
specific problem Se sseiahie General
General smn A Cr ag Ce
ae result or
problem >[speaiteproblem| 5 study| rest solution
specific problem ERTS | een
inconclusive information on a broad range of resources available and the usefulness of the
questions. The researcher needs to be patient and results. If no descriptive information is available,
systematic, and humble in recognizing that the that is, if there is a need for numbers and frequen-
project will not solve the entire health problem. cies of characteristics related to the problem, one
We will use the Sipho scenario described earlier of the first five studies should be undertaken. If
to illustrate how aparticular public health prob- there is enough basic information, but a need to
lem can give rise to a number of separate studies, understand more about what may be contribut-
each with their own purpose, aim or topic, but ing to the problem, studies asking similar ques-
still relating to the overall problem described. tions to numbers 6, 7 or 8 should be undertaken.
If there is good understanding of the causes of the
Studies arising from the Sipho scenario: illness, then studies should be done to evaluate
1 Aprofile of environmental conditions in the interventions designed to reduce the problem
squatter camp. (numbers 9 to 11).
2 Asituational analysis of health facilities in the
area, including access to primary, secondary Step 3: Setting specific objectives
and tertiary care.
3 Aprofile of childhood mortality in the area, The purpose or aim of a study should be broken
including cause-specific mortality rates. down into clear, mutually exclusive objectives
4 The prevalence of malnutrition among child- which are logically connected. To a large extent,
ren under five in the community. these determine the subsequent planning of the
a A study describing knowledge, attitudes and study. Specific objectives indicate the specific in-
practices of caregivers concerning diarrhoea formation the study must yield and the detailed
and its treatment. research questions that must be answered. Spec-
6 Acomparison of the childhood mortality in ific objectives are often stated in operational
communities with and without piped water. terms, which are easy to apply in practice. They
Socio-economic risk factors for diarrhoea. are also commonly phrased in measurable terms
onMaternal education as arisk factor for death
(in such a way that the answer is a number or rate)
among children who get diarrhoea. and need to be clear, specific and unambiguous.
9 Acomparison of the diarrhoea mortality in a Specific objectives indicate the individual com-
community before and after the introduction ponents of what will be achieved (but not how it is
of piped water into that community. to be done: that is left for the methods section of
10 An evaluation of the effect of acommunity the protocol).
health worker programme focusing on appro- Descriptive studies usually set out to describe
priate treatment by mothers of children with characteristics of the group being investigated,
diarrhoea. and the statistical goal is usually simple data
11 An evaluation of different methods of rehydra- description or estimation of a characteristic in the
tion to determine which is most effective in a study population (see Chapter 7: Study design).
hospital setting. In analytic studies, the objectives are to
examine associations between characteristics
The above represent some of the studies emerg- and possibly to seek new knowledge on the
ing from the problem exploration done in this cause(s) and the natural history of diseases or
case. The exact choice of study depends on the conditions. Intervention studies attempt to deter-
kind of information already available, the interest mine if an intervention is associated with an
and skills of the researcher, the money and improvement (or deterioration) in health status.
61
a
a a a
Thus analytic and intervention studies always project. It helps to keep the researcher on the
involve hypotheses or predictions of a relation- path originally set out, and enables outsiders to
ship between a factor (or factors) and the evaluate the protocol and/or study.
outcome under study. The hypothesis or predic-
tion is statistically tested after relevant informa-
tion has been collected and analysed, and the EXAMPLE 6.1
hypothesis is either rejected or not rejected. Setting research objectives: Sipho’s
The research hypothesis or question is the pre-
diction or supposition which motivates the study.
death
Once the question or hypothesis statement has Step 1: The problem (what is motivating the
been framed carefully, the hypothesis has to be study?)
stated in such a way that the prediction may be
evaluated by appropriate statistics. This new Sipho’s death, which was explored at the begin-
statement of the hypothesis is called the statis- ning of Chapter 6 (see Table 6.1).
tical hypothesis. Thus the research hypothesis or
question leads to the statistical hypothesis, a Step 2: The purpose or aim of the study (overall
statement made in mathematical terms. For topic of a particular study, chosen from a number
example, the research question ‘Is there an asso- of possible topics)
ciation between education level and food hygiene
practice?’ needs to be framed in the appropriate To identify risk factors for diarrhoea among pre-
statistical terminology to enable the use of the school (under the age of six) children in Khaye-
t-test or the calculation of confidence intervals for litsha, Cape Town.
mean differences. Thus the statistical hypothesis
will state that there is no difference in the mean Step 3: Specific objectives (specific information
education level of people who practice ‘good’ the study will yield, linked to the purpose or aim)
versus ‘bad’ hygiene.
The intention to test a hypothesis is stated as a To ascertain which of the following factors are
specific objective, along with other specific objec- associated with diarrhoea.
tives where appropriate. ¢ Factors related to water: distance to nearest
tap, quality of water used
Step 4: Implementation objectives Personal factors: mother’s hygiene practices,
mother’s level of education, mother’s knowl-
It is important to state what will be done with the
edge of causes and prevention of diarrhoea and
research results and what will be done before and
dehydration
during the study to ensure that the results are
@ Environmental factors: method of water
used. This links into ‘why’ the study is being done.
storage, location of toilet, site of refuse disposal
There should be clarity about the implementa-
Socio-economic factors: availability of sugar
tion objectives of the study so that decision-
and salt for oral rehydration solution, ability to
makers, service providers, and possible benefac-
afford transport to health services
tors can join the research team early and ensure
@ Health service factors: distance from home,
that results are useful and used.
opening times, cost of services, consultation of
traditional healer
Conclusion
Exploration of the problem motivating the Step 4: Implementation objectives
research helps the researcher to get a broad, crit-
ical view of the issues related to the central To make recommendations for interventions tar-
problem identified. geted at reducing the incidence and prevalence of
Setting objectives for the study is another diarrhoea based on the results of the study.
important (and often difficult) part of the re-
search process, as it helps to focus the study by Reference:
describing exactly what it hopes to achieve. It Adapted from Vundule C. ‘A study on the risk factors for
helps the researcher break the study into different diarrhoea in Khayelitsha, a peri-urban settlement area in the
parts, and acts as a guide during the design of the Western Cape.’ Epidemiology research report, Community
methods, as well as during later parts of the Health Department, UCT, 1994.
62
EXAMPLE 6.2
Setting research objectives: the case
of AIDS research
Step 1: The problem. (what is motivating the Step 3: Specific objectives (specific information
study?) the study will yield, linked to the aim)
There is an AIDS epidemic worldwide and in Assume that the aim chosen to be most appro-
Africa. Currently no cure or vaccine for AIDS priate at this stage is:
exists. Behaviour change remains the only imple- to investigate psychological factors associated
mentable intervention area. with behaviour change among high school
students in Soweto.
Step 2: The purpose or aim (overall topic of a par- The following objectives may be stated.
ticular study, chosen from a number of possible ¢ To determine knowledge of HIV transmission
topics) and prevention among individuals in the group
targeted for research
Given the above problem, one of the following To determine attitudes towards people infected
purposes/aims could be appropriate for the with HIV among individuals in the group tar-
study. geted for research
To determine the practice of behaviours related ¢ To investigate the extent to which individuals
to AIDS (in different target groups) in the group targeted for research believe them-
To investigate psychological factors associated selves to be at risk
with behaviour change (in different target To determine attitudes towards particular pre-
groups) ventive behaviours (for example, condom use)
To determine access to condoms (in different @ To determine self-confidence in asserting pre-
target groups) ventive behaviours
To evaluate AIDS education techniques
To determine difficulties in implementing new Step 4: Implementation objectives
AIDS education programmes
@ To determine factors related to the receptive- To make recommendations to education author-
ness (of different groups) to AIDS health educa- ities concerning health education in schools
tion programmes regarding sexuality and AIDS.
63
Study design
Introduction and a health outcome, the study aims to explain

or analyse the problem. This is called an analytic
Study design or study type refers to the overall
study, and study B would be an analytic study.
research approach or strategy taken. Studies
If the study aims to see if an intervention has an
which share the same approach are said to have
effect, it aims to evaluate the intervention. This is
the same study design despite having different
an intervention study, also called an experi-
subject matter (content).
mental study. Study C would be an intervention
Studies can be categorized according to various
study.
criteria.
Study design and researcher control
Study design and the research question In studies investigating whether an exposure is
The most fundamental categorization is accord- associated with a particular outcome, study
ing to the nature of the research question. When design is also determined by the amount of con-
you have clearly formulated the objectives of your trol the researcher has over who is exposed.
study, you decide which study design is most Broadly speaking, studies can be classified as
appropriate to reach these objectives. In practice, experimental or observational (Figure 7.1). In
you also have to consider time, money and per- experimental studies, the researcher conducts an
sonnel resources. experiment, allocating participants to different
exposures to determine the influence these expo-
Here are three typical research questions: sures have on characteristics of the participants.
A Whatis the prevalence of HIV infection in the The researcher determines which participants
Free State? will be in which group (exposed or unexposed).
B Does passive smoking cause lung cancer? Since humans are the research participants,
C Is drug A or B the most effective treatment ethical considerations mean that the exposures
against typhoid? can only take the form of interventions aimed at
decreasing disease or improving health. Epide-
How would you decide to approach these three miological studies to analyse the effect of a risk
studies? factor cannot be done through an experiment (for
If the study aims to determine the size (how example to investigate smoking as a risk factor for
much?) of a problem, or the demographic (who?), disease, the researcher cannot order certain
seasonal (when?) or geographical (where?) distri- participants to smoke and others not to smoke).
bution of a problem, then the study aims to Most risk factor studies are thus observational;
describe the problem. Such a study is called a this means that the researcher observes charac-
descriptive study. StudyA would thus be descrip- teristics of respondents who have selected to be
tive. in exposure or non-exposure categories (for
If the study aims to investigate whether there is example, people who decide to be smokers) or are
a relationship or a causal link between a factor exposed or not exposed due to circumstances (for
64
7 ~~ study design
example, people who live in highly polluted exposure to treatment

areas). Of the three studies described above, A Sick aee Ee eahy state
and B are observational, compared to C, which is
experimental. Observational studies can be either For example:
descriptive or analytic. appendectomy
appendicitis _____.o.o4.4§ , healthy state
Study design and the presence ofa
comparison group TB medication
Analytic studies can explain or analyse a situation @ tuberculosis ____________» healthy state
by virtue of the formal (statistical) comparison of
groups of subjects. On the other hand, descriptive As can be seen, the beginning state comes first,
studies primarily describe a group as a whole. then the exposure, and then the outcome. This
Often subgroups are examined and compared in may seem like common sense. However, these
a descriptive study, but not to investigate a causal three elements are not always studied in the
relationship. Informal or exploratory compar- temporal sequence in which they occur.
isons in descriptive studies are useful for generat- Ideally, a study to determine a causal relation-
ing ideas for future analytic studies. ship should follow the ‘natural’ temporal se-
quence. Such a study would thus collect informa-
Study design and the temporal sequence of tion as events occur, that is, forward in time.
exposure and outcome Information on the exposure is collected before
When we try to determine if an exposure is caus- the outcome is known. Studies which collect
ing an illness, we need to determine when the exposure data after the outcome is known, go
outcome (illness) and exposure (factor) occur ‘backwards in time’ to collect exposure data.
relative to each other. Some studies collect exposure and outcome data
Consider the following chain of events: at the same time from a group whose outcome
status is unknown before the study.
exposure to cause
beginning state___....__-_—-—_—i: end state Subjects can be selected for a study:
when they are all well (or all sick) prior to expo-
An individual starts off in the beginning state. sure
@ She is then exposed to some factor, possibly a according to outcome, that is, by virtue of being
causal factor. ill (cases) or not ill (controls)
@ The end state occurs after the exposure. @ by virtue of membership of a large group where
neither exposure or outcome is known before
In health, this chain of events may occur as fol- selection.
lows:
Study design and study base
exposure to cause While the criteria above are used to differentiate
healthy state____-_______»_—_—___ ill state between study designs, we need to recognize that
different designs are merely different approaches
For example, to representing what happens in a given popula-
tion in a given time period.
maternal smoking The study population (that is, those at risk of
¢ healthy foetus ______ low birth weight the outcome) is also referred to as the source
population from which cases and controls or
hepatitis B exposed and unexposed subjects are selected.
¢ healthy person ____ liver cancer The term ‘study base’ describes the population
being studied and the time period over which
high fat diet outcomes are measured in the study population.
@ healthy person _____ high cholesterol All outcomes measured in a study come from the
study base: this means that they occur in the
The starting point need not always be a healthy defined source population during the time period
state. The effect of an exposure (the treatment) on stipulated in the study base. The study base can
illness can also be studied: thus be seen as the denominator measured in a
65
Figure 7.1 Study types
Descriptive
(describes)
Observational studies
Cross- sectional
Analytical ————————————__>
(explains) > Case-control
se,dana diigisoetigmenies si Cohort
> Community trial

Experimental studies ——_—_—___————-© Field trial
(evaluates intervention) —_———+—8 Randomized controlled clinical trial
longitudinal study. The study base constitutes the State residents over 25 years of age). As discussed
number of people at risk and the time they are at earlier, the situation regarding a single group is
risk. The study base is typically measured in units described in terms of the size of the problem, who
of ‘person-time’ (for example, person-years is affected, where it is found, or when it occurs.
exposed to a risk factor or treated with a drug. Why do we do such studies? The main use of
Also see Chapter 2: Measures of the frequency of descriptive studies is to give service providers and
health events). planners information that will help them design
In a cohort or case-control study, the study services and allocate resources efficiently. Often
base is all individuals who could potentially have such studies generate questions for further
had the study outcome over the time period of the studies. So, for example, a case series which
study. In a cross-sectional study, the measure- describes the socio-demographic and medical
ments are all taken at the same time, so time does characteristics of a few patients with a similar
not have to be measured. disease may provide the impetus for a more rigor-
The concept of a study base is particularly ous investigation of risk factors in an analytic
useful when designing an analytic or intervention study.
study, or when critically evaluating the methods A descriptive study which sets out to compre-
of a study. If the researcher does not ensure that hensively describe the current state of affairs of a
all outcomes (cases) from the study base are certain aspect of health is also known as a situa-
accounted for, or if controls are not a represent- tional analysis. A community diagnosis which
ative sample of the study base producing the consists of detailed information about health
cases, the results of the study may be biased. problems and perceptions in a community is a
(We suggest that the above section be reread descriptive study. If the community diagnosis
after analytic studies have been covered later in also includes information on the associations
this chapter.) between exposures and disease, it is analytic.
Figure 7.1 summarizes the classification of
study types according to the criterion of the Analytic studies
research question. The rest of the chapter will
give more details on the most common study de- Analytic studies attempt to explain or analyse the
signs used in epidemiology. The broad categories situation by comparing groups, in this way identi-
of study design have already been mentioned fying risk factors for disease. For example, the
above. question ‘Is hypertension in Free State residents
related to obesity/smoking/alcohol use?’ is a
research question which requires an analytic
Descriptive studies study. A number of different analytic study de-
A descriptive study, also referred to as a survey, signs are available to answer this type of question.
sets out to quantify the extent of a problem (for Analytic studies can be done in the form of
example, the prevalence of hypertension in Free cross-sectional, case-control or cohort studies.
66
7 study design
Table 7.1 Contrasting eT eae EE ater
Exposed
Not exposed
Cross-sectional A+B+C+D individuals Cross-classified for

exposure/disease status
Case-control (A + C) cases and Determine exposure status

(B + D) controls
Follow-up (A + B) exposed and Follow-up to determine

(C + D) unexposed disease outcome
These three designs differ with respect to how income. But be careful of the joker in the pack! For
dividuals are sampled and the sequence in which example, a superficial look at the results may
exposure and outcome information is collected show that people who have hypertension (expo-
(see Table 7.1). sure) are more likely to have heart disease (dis-
ease) than normotensives and one may deduce
Cross-sectional analytic studies that this is because of hypertension. But this
In across-sectional analytic study, a sample of the finding may be explained by the fact that hyper-
study population is investigated and information tensives tend to be older than normotensives,
is collected on risk factors (exposures) and and older people are more likely to have heart
disease (outcome) at a point in time. The disease. This example illustrates how age can
information sought can be current (how many confound the relationship between blood pres-
cigarettes do you smoke a day at present?) or past sure and heart disease.
(how many cigarettes did you smoke on average In this case, age has distorted the relationship
each day during the past year?). between exposure and disease and is therefore
The study has a descriptive component which called a confounder. Thus when comparing the
enables researchers to calculate the prevalence exposed and unexposed, it is important to con-
of risk factors as well as prevalence of disease. sider, on the basis of knowledge of the subject
The analytic part of the study consists of com- area, whether there are factors which are or could
paring exposure groups with respect to disease be related to exposure as well as disease. Such
presence. The total sample is thus divided into confounders can distort the relationship between
four subgroups (shown in Table 7.1), namely: (A) exposure and disease, masking a real association
those who are exposed and diseased; (B) those or showing an apparent, but spurious associa-
who are exposed and not diseased; (C) those tion. Confounding occurs in all types of studies,
who are unexposed and diseased; and (D) those not just cross-sectional studies. (See Chapter 11
who are unexposed and not diseased. Examples for further discussion on confounding.)
of such cross-classifications are given in Sec- Because cross-sectional studies take a ‘slice’ in
tion C. time, current behaviour (for example, current
Such cross-classification allows statistical smoking habits) is usually obtained. But diseased
comparisons between subgroups, and may people may have stopped smoking because of ill-
show arelationship between exposure and out- ness and so current smoking information will not
67
a
help the researcher to illuminate smoking as a sible, exposure information should thus be valid-
risk factor. It may be more useful to take a ated by objective sources.
smoking history, rather than asking only about Case-control studies are increasingly done as
current smoking habits. In general, when decid- part of larger cohort studies. Such case-control
ing which questions to ask about risk factors, one studies are termed nested case-control studies.
has to consider whether past exposure may be Exposure information on all participants is
more relevant than recent exposure. Neverthe- collected at the start of the study period. During
less, it is usually difficult to assess the temporal follow-up, cases are identified and a sample of the
relationship between exposure and disease in a remainder of the group is selected as controls.
cross-sectional study. The researcher can seldom Exposure status is thus measured before disease
be sure which occurred first — the disease or the status is determined, and recall bias is avoided.
exposure. However, the temporal sequence of Case-control studies are very vulnerable to
exposure and outcome can occasionally be estab- selection bias. The essential requirement is that
lished (for example, exposure during pregnancy, the control group accurately represents the
followed by outcome in the child). source population, that is, those contributing to
If the disease of interest is rare, a very large the study base. The cases (selected according to a
sample has to be studied to be sure of selecting clear case definition) need not be representative
enough diseased people to allow for adequate of all cases in the population, but could be cases
analysis. The cross-sectional analytic study may at one clinic, or cases among factory workers.
turn out to be too costly, due to the large sample Similarly, the controls need not represent the
size required. In such circumstances, one would entire non-diseased population but should come
rather consider doing a case-control study. from the same source population as the cases (for
example from the referral area of the clinic, or
Case-control studies from the same factory).
Essentially in a case-control or case-referent In a case-control study, careful definition of the
study, cases (those with disease) are compared to study base is important to ensure that the cases
appropriately selected controls (without disease) and controls selected are from the same study
to determine whether they differ with respect to base. When the study base (or source population)
exposures. So, for example, the proportion of pas- is defined at the outset, the challenge is to find all
sive smokers among cases of lung cancer is com- the cases that arise from this (primary) study
pared to the proportion of passive smokers base. If the study base comprises all individuals
among controls (without lung cancer). In tradi- living in a particular area over a set period of time,
tional case-control studies, after cases and con- it may be difficult to track down all the cases. On
trols have been selected, exposure information is the other hand, if cases are defined at the outset,
gathered. the challenge is to find a suitable control group
The case-control design can be considered an which represents the source population that gave
efficient sampling technique to measure expo- rise to the cases. A tertiary hospital-based case-
sure-disease associations in a study base (the set control study is an example of a study base being
of persons or person-time from which diseased defined on the basis of the cases recruited into the
subjects become cases). Exposure is only mea- study.
sured in a sample of all subjects in the study base If hospital or clinic cases are selected, one or
(those who were selected as cases and controls two control groups may be used: one hospital-
from the source population). Typically, all the based group and one community group. Hos-
outcomes from the study base (that is, cases) are pital-based controls should not have a condition
included in a case-control study. The controls are related to the disease or exposure being invest-
used to represent the probability of exposure in igated. Neighbourhood controls are examples of
the population at risk of becoming cases. community controls where there is an attempt to
A disadvantage of such case-control studies is control for socio-economic factors. When two
that the retrospectively gathered exposure in- separate control groups are used (this rarely hap-
formation may be inaccurate or biased (measure- pens), the cases and each control group are com-
ment bias). People with disease (cases) may pared separately and results are checked for con-
remember past events more clearly, or, on the sistency.
other hand, may deny certain past behaviours Matching is often used in case-control studies.
which their doctor had warned against. If pos- This means that for each case, one (or more)
68
7 study design
control with similar characteristics is selected. If (normal birthweight) are followed up. The disease
cases and controls are matched individually, a or outcome rates (for example, failure at school)
matched analysis has to be done, otherwise asso- within these two groups are then compared at the
ciations will be underestimated. An alternative to end of a predetermined follow-up period. The
individual matching is group matching, where cohort’s time spent under observation is the
controls are chosen to ensure that the control study base out of which cases arise.
group as a whole is similar to the group of cases The major disadvantages of cohort studies are
with respect to the matching variables. Thus the time and cost considerations. If the disease(s) of
proportion of males amongst the controls will, for interest take a long time to manifest, individuals
example, be the same as among the cases. The have to be followed up for a long period. The
variables that need to be matched are the ones longer the follow-up needed, the more likely you
that are possible confounders. Matching does not are to lose touch with participants, thus produc-
remove confounding, but it does ensure that ing bias in the study. At the start of the study,
numbers are equal in stratified analyses (see steps should be taken to minimize this loss to
Chapter 11). follow-up. One approach is to ask participants to
A case-control study can only investigate one provide the names and addresses of some close
disease, but many exposures can be considered. If friends and relatives through whom they could be
the exposures of interest are rare in the popula- traced. Health personnel can also be co-opted to
tion, however, a case-control study is inefficient. do frequent home visits. It is important to trace a
As stated before, it is an excellent approach if the sample of drop-outs to establish whether they
disease is rare. In recent years, however, the use represent a subgroup which differs from those
of case-control studies has been extended to the who do not drop out. Statistical methods, namely
study of common diseases. New case-control de- survival techniques, exist which enable one to use
signs (new in the way in which the controls are incomplete follow-up information in the calcula-
sampled) have also been described. The selection tion of risk of disease.
of cases for a case-control study can be made If adequate historical records (for example,
from prevalent or incident cases. If incident cases hospital, medical or university alumni databases)
are being recruited, then the controls can be se- exist, one can assemble exposed and non-
lected either after all cases have been recruited, or exposed groups from such records and follow
as each case is recruited. In the latter situation, a them up to the present to determine disease
person originally selected as a control can there- status. Historical records can also be used to
fore become a case at a later stage. Further discus- determine disease outcome. These are examples
sion of these designs, as well as the appropriate of historical cohort studies, in which data collec-
methods of analysis, are given by Rodrigues and tion is non-concurrent.
Kirkwood (1990). A synthesis of issues involved in
the selection of controls in case-control studies is
Experimental studies
given by Wacholder, McLaughlin, Silverman,
Mandel (1992). As outlined before, a study is experimental if the
researcher assigns the exposure to the subjects.
Cohort studies Such studies are also called intervention studies
These studies are also called follow-up studies since they evaluate some intervention aimed at
since healthy individuals with known exposure decreasing or preventing disease.
are followed up for a period of time during which Different intervention studies have different
disease status is determined. These studies can levels of scientific rigour. The main criteria by
investigate the cause of disease as well as the which experimental studies are judged are: 1)
natural history of disease. whether the intervention is compared to some
A cohort is a group which shares a certain char- other group (another intervention or a control
acteristic. For example, a birth cohort is a group group); and 2) the way in which people are alloc-
born during a certain period. Some of these will ated to experimental and control groups. Valid
have had a low birthweight (exposed) and others evaluation of an intervention is generally only
a normal birthweight (unexposed). In a cohort possible if the intervention is compared to some
study, a group of disease-free individuals who are other group. In addition, the researcher should
exposed to a certain risk factor (for example, low follow very strict rules in allocating participants,
birthweight) and a group which is not exposed to ensure that all individuals have the same
69
a ee
chance of being allocated to a given group. patients are randomized to different treatment
Randomized controlled clinical trials (RCTs), (parallel design).
in which patients are assigned to different treat- In RCTs, patients are followed for a specified
ments, are regarded as the gold standard for period to determine outcome (for example, alive/
intervention studies. In a randomized controlled dead, recover/do not recover). The person obser-
clinical trial, a group of patients is selected and ving the outcome after the intervention should be
patients are then randomly allocated (random- unaware which treatment a patient received (so
ized) to different treatments, or treatment and that the observer is ‘blind’ to the exposure). The
placebo groups. (Please refer to Chapter 8 for fur- patient should be similarly unaware, so that nei-
ther discussion of the random _ principle.) ther the patient nor the observer can be biased by
Random allocation in two groups of 15 each is, for knowledge of the treatment received (double
example, done by drawing 15 random numbers blind), especially if the outcome is subjective,
between 1 and 30. These 15 patients are allocated such as the extent of pain experienced on a scale
to one group and the remainder to the other from 0 to 10.
group. Special rules can be followed (for example, The different treatments should be equally
block randomization) to ensure that after a given acceptable according to present knowledge, that
number of patients have been enrolled in the is, patients should not be subjected to an inter-
study, there will be equal numbers in each treat- vention that is known to be inferior or unsafe.
ment group. This is why it is important for an ethics committee
Randomization aims to ensure that patients on to scrutinize the study protocol. Other ethical
different treatments are comparable with respect considerations are that patients should give in-
to baseline characteristics, as well as known and formed consent, all adverse events (for example,
unknown risk factors. The possibility of con- headaches or nausea) should be noted, and it
founding, which can occur in analytical studies, is should be clearly specified when patients will be
thus reduced. For example, the most severely withdrawn from the study in the event of harmful
affected patients should not get one treatment effects being found, or a clear indication of the
and the milder cases another. To ensure that the superiority (or inferiority) of one of the inter-
groups are similar with respect to disease severity, ventions.
prognostic stratification can be used: randomiza- Randomized controlled trials are trials of effi-
tion is done in strata (subgroups) of disease sever- cacy (see Chapter 15: Health systems research). In
ity. It may be expected that people with differing cases where randomized controlled trials are not
characteristics (for example, males and females) possible because of ethical or logistic reasons,
will differ in their response to treatment. In such case-control and follow-up studies can be used
cases, it is important that patients should be to assess interventions. So, for example, case-
stratified by sex at the start of the study, and ran- control studies have been used to assess the effi-
domization then done within each sex. Stratifica- cacy of aspirin in reducing the risk of myocardial
tion can range from stratification in two large infarction.
strata to randomization of matched pairs. The Besides RCTs, experimental epidemiological
main purpose of stratification is to ensure that studies include field trials, in which healthy indi-
important confounders are distributed similarly viduals are assigned to different interventions
in the treatment and control groups. It is particu- aimed at prevention (for example, a new typhoid
larly important in trials with small samples where vaccine), and community intervention trials, in
randomization alone may not ensure this. which groups from the general population are
If the condition being studied is such that after assigned to interventions. Since the risk of devel-
treatment has been withdrawn, the patient re- oping disease may be very low amongst healthy
verts to the disease status he or she had before the individuals, more subjects may be needed for a
study, as in chronic conditions such as asthma, a field trial than for a clinical trial. Furthermore,
cross-over design can be used. Each participant individuals have to be visited in the community.
receives all treatments, in a randomly chosen se- Intervention studies on people in the community
quence, with wash-out periods in between, to are therefore more generally performed through
remove the effect of the previous treatment. community trials. So, for example, in the Coro-
Fewer patients are needed, but since all parti- nary Risk Factor Study (CORIS) conducted by the
cipants undergo all treatments, the study takes MRC, HSRC and Department of Health, three
longer to complete than in the design where southwestern Cape towns each received a differ-
70
7 ~~ study design
ent level of intervention aimed at reducing heart Ecologic studies

disease (Rossouw, Jooste, Charlton, Jordaan,
Langenhoven, Jordaan, Steyn, Swanepoel, Ros- Let us consider the following example of an
souw 1993). Some interventions (for example, ecologic study: From published data, it is clear
water fluoridation), cannot be randomized by that in South Africa the infant mortality rate (IMR)
individual, only by group, and are thus ecologic among coloured babies is much lower than that
studies. The randomization of groups rather than among African babies. It is also known that
individuals leads to fewer administrative prob- maternal smoking rates are high among pregnant
lems and decreases the chances of contamination coloured women and low among pregnant
(where a person randomized to one intervention African women. From this information, it seems
actually receives the other interventicn). How- as if there is a relationship between maternal
ever, the analysis has to take into account that smoking (exposure) and IMR (outcome), namely
groups and not individuals have been random- that maternal smoking protects against infant
ized. Standard statistical procedures thus cannot deaths. This does not make much sense since we
be used. An approach which can be used if the know that IMR is linked to low birthweight, and
number of groups per intervention is large (say 15 low birthweight is linked to maternal smoking.
or more) is to calculate a summary outcome for What aspect of this study design is different from
each group. Standard methods can then be used the other designs we have mentioned, and has led
to analyse these group summaries. If only one us to this fallacy?
group is randomized for each intervention, it is In all study types described above (apart from
impossible to disentangle the effect of the inter- the community trial), observations are made on
vention from the natural variation between individuals. In an ecologic study, such as the one
groups. described above, however, the unit of observation
In a before-after study, measurements are is a group of individuals, for example, in acity,
made before and after some intervention on a region or country. Such a study requires informa-
group of individuals. Such studies are quasi- tion on exposure and disease for each of the
experimental, since there is no random alloca- groups being investigated. Such information can
tion of intervention: all participants receive the usually be obtained from routinely available data
intervention. For example, the AIDS knowledge sources. So, for example, one could examine the
of school children may be measured before and relationship between the Gross National Product
after an AIDS education programme. The before per capita of selected countries (exposure) and
and after results are compared to determine the IMR of the countries (disease).
whether the intervention has improved the Often proxy (substitute) measures for exposure
children’s knowledge. This type of study is the and disease have to be used, for example, mortal-
weakest form of intervention study. If the educa- ity as a proxy for disease. The degree of associa-
tion programme is conducted over, say, a period tion between exposure and disease is made
of six weeks, there might have been an AIDS tenuous by the use of such proxies and by the fact
poster campaign in the community which in- that measurements are averages over groups.
creased the children’s knowledge, rather than Furthermore, there may be other factors
the education programme. Only if there is some (confounders) which are related to both exposure
comparable control group who did not receive and disease and which may distort the associa-
the intervention and in whom the researcher can tion between exposure and disease. Information
determine whether any changes in knowledge on such factors may not be available and appro-
occurred, can the effect of the intervention be priate analysis cannot be done. In the example
evaluated. above, level of urbanization could be a possible
In some before/after studies, the before confounder: coloured women are more urban-
measurement is made in a community and the ized than African women, therefore coloured
after measurement some time later in the same women tend to smoke more but also tend to have
community. Clearly, some members of the initial ’ better access to health care, which leads to a
group might have left and newcomers arrived. lower IMR. Smoking does not prevent infant
Thus, before and after measurements are made deaths!
on groups including different individuals. If the Despite their limitations, ecologic studies are
before/after groups are not comparable, evalua- useful for describing differences between groups,
tion is very difficult. and pointing out possible avenues for further
ral
a
investigation. The value of ecologic studies is prescribed, health promotion activities, and
often underrated because of the possibility of referral to secondary and tertiary centres
ecologic fallacy. Within ecologic study designs, it @ Areview of non-clinical activities that impact
is possible to assess the extent of the fallacy and to on care
some extent adjust for this. Furthermore, certain @ Adescriptive survey of staff and patient atti-
studies can only be conducted as ecologic studies tudes towards the service and perceived needs
because the exposure and/or outcome can only
be assessed ecologically, for example, the rela- 3 Aprofile of childhood mortality in the area,
tionship between fluoride levels in water and including cause-specific mortality rates
dental caries. In other instances, ecologic level Descriptive study:
variables are more appropriate and accurate a) Where mortality statistics are well kept:
measures than individual level variables. For Use of routinely collected mortality and
example, the variable ‘poverty’ is an ecologic vari- demographic data to determine mortality
able demarcating a depressed socio-economic rates
neighbourhood, whereas ‘individual income’ b) Where no coherent system of mortality data
provides no information on the general socio- collection exists:
economic conditions of the environment that the @ Ahousehold survey where caregivers report
subject is exposed to. Studies of community- on children in their care and any childhood
based health promotion activities using mass deaths over the previous year
media often require ecologic studies.
4 The prevalence of malnutrition among child-
ren under five in the community
Conclusion Descriptive study:
The research topics given below represent some @ Ahousehold survey in which every child under
of the studies emerging from the problem explo- the age of five has a nutritional assessment
ration done in the previous chapter. Having read
the chapter on study design, you can now decide 5 Astudy describing knowledge, attitudes and
which design would best suit a given research practices of caregivers concerning diarrhoea
topic or question. and its treatment
We suggest that you go through the list on page Descriptive study:
61 and try to allocate a suitable design before @ Asample of caregivers is interviewed using a
reading our comments below. Please note that for questionnaire which determines their know-
some topics, there may be more than one ‘cor- ledge of diarrhoea and its treatment, their atti-
rect’ option and some alternative approaches are tudes, and also the treatments they administer
not described.
6 Acomparison of the childhood mortality in
Studies arising from the Sipho scenario communities with and without piped water
1 Asituational analysis of environmental condi- Ecologic study:
tions in the squatter camp (situational analyses @ This can be done by comparing mortality
often require multiple approaches) statistics from communities with and without
Descriptive study: piped water
@ A full assessment of environmental infrastruc-
ture done by environmental officers 7 Socio-economic risk factors for diarrhoea
@ Adescriptive sample survey of houses to Cohort study:
determine the environmental profile at house- @ Agroup of children are followed up fora
hold level period of time to see if they get diarrhoea. At
the end of the study, a comparison is made of
2 Asituational analysis of health facilities in the diarrhoea rates among children from homes
area with ‘low’ and ‘high’ socio-economic status
Descriptive study: Case-control study:
@ Areview of services offered, staff employed @ Socio-economic variables are compared be-
and expenditure tween cases of diarrhoea attending a PHC
¢ Acollection of descriptive information on clinic and a group of neighbourhood controls
clinic attendance, illness profile, medicines who do not have diarrhoea
72
7 ~~ study design
8 Maternal education as a risk factor for death treatment by mothers of children with diar-
among children who get diarrhoea rhoea. The CHW programme is introduced
Case-control study (hospital or clinic-based): and after a set period of time, various outcome
@ The education level of mothers of children measures (for example, maternal knowledge
who die from diarrhoea is compared to the of ORS, under-five mortality rate, incidence of
education level of mothers of children who get severe dehydration following diarrhoea) are
diarrhoea but do not die compared between the experimental and con-
trol villages
9 Acomparison of the diarrhoea mortality ina
community before and after the introduction of 11 To evaluate which method of rehydration is
piped water into that community most effective in a hospital setting
Before-after study (evaluating an intervention) Randomized controlled clinical trial:
¢@ ‘Children with severe dehydration admitted to
10 An evaluation of the effect ofa community the hospital are randomly allocated to two
health worker (CHW) programme focusing on treatment groups (there is no group that does
appropriate treatment by mothers of children not receive treatment). They are followed up
with diarrhoea and various outcome measures are taken
Community trial:
@ Villages are randomly selected to be recipients
of a CHW programme focusing on appropriate
73
Sampling
Introduction The sample

While the study design chosen determines the If we want to investigate the use of maternal ser-
general approach of the study, it does not de- vices in Kimberley, for example, do we need to
scribe who the study will investigate or who it will interview everyone in the study population? It
involve. This chapter looks at the concepts of would be a very cumbersome, expensive, and
study population and sampling and gives details almost impossible task.
of sampling techniques used mainly in descrip- It is neither practical nor necessary to study all
tive studies. individuals in the study population. Rather, a
sample (subset or subgroup) of individuals can be
studied closely, ensuring that good quality in-
Study (target) population formation is obtained. By summarizing the find-
When doing a study, it is important to clearly ings of the sample, the researcher intends to
define the group you want to gather information make deductions about the study population.
and make conclusions about. This group, called Therefore, the sample should closely reflect or
the study (target) population, should be clearly represent the study population. If the sample is
defined in terms of place and time, as well as representative, one can generalize sample results
other factors relevant to the study. For example, if to the population.
one wants to study the use of maternal health ser- Scientific sampling methods have been devel-
vices in Kimberley, the study population is oped to ensure that samples are representative.
defined as ‘all women of child-bearing age (15—49 The way in which a sample will be selected must
years)’ (who), ‘resident in Kimberley’ (place), ‘at be described clearly in the protocol so that-
the time of the study’ (time). readers can judge how representative the sample
It is essential to be sure who is included in your will be.
study population and who is not. For example, As outlined in Chapter 7: Study design, in case-
researchers who want to determine some charac- control, follow-up, and experimental studies, it is
teristic of members of a community often decide important that the groups being compared are
to conduct the study on clinic attenders (for comparable: it is no use comparing apples and
example, HIV status of antenatal clinic attenders), pears. In descriptive and cross-sectional studies,
since they are community members who can where the aim of the study is to decribe some
be reached with little effort. Clinic attenders are, characteristic(s) of a population, the samples
however, not representative of the general should be representative and thus reflect what
population, since those who attend the clinic exists in the study population as a whole. This
usually differ from community members who do chapter is thus of specific relevance for the latter
not. two study designs.
74
8 sampling
Random sampling
Table 8.1 Medical doctors listed
Random sampling (not to be confused with hap-
hazard or arbitrary sampling!) is a selection tool in the Cape Peninsula telephone
which can ensure that one’s sample is representa- ‘ directory
tive of the population. This is also known as prob-
ability sampling, since each individual in the Abbas MR
study population has a known chance (probabil- Abdurahman N
ity) of being included in the sample. While the Abdurahman Z
researcher controls the sampling process, she or Abel S,‘Opthalmic -—
he has no control over exactly which individuals Abels CS
are selected. In the end, whether an individual is Abrahams A
selected or not is determined by chance. Abrahams L
In the discussion that follows, it will be Abrahamse C GA, Ophthalmologist
assumed that information is required on indi- Ackovic K L, Gynaecologist
viduals. If, however, information on households Adam A
(water, sanitation, income, density) is needed, AdamOFAS, ree
random samples are similarly drawn from all Adam SA
households. Adam U
To be able to do random sampling, one needs a Adams AA
sampling frame. This is a list or some representa- Adams G, Gastro-enterologist
tion (for example, a map) of the study population, Adams MC -
either in terms of individuals or in terms of groups Adams S
of individuals (for example, households or Adendorff T W J
villages), depending on the specific type of etc.
random sampling used. The items on the sam-
pling frame are called sampling units. It is import-
ant that the sampling frame contains all the indi- in order to investigate their knowledge of emer-
viduals in the study population: a sample can only gency services, you would make a list of all
be representative if the original frame is com- doctors and give each doctor on the list a number
plete. Table 8.1 is an example of a section of a (Dr Abbas is number 1, Dr N Abdurahman
sampling frame consisting of medical doctors number 2, and so on). In the case of a simple
listed in the Cape Peninsula telephone directory. random sample of households having to be se-
It is often fairly difficult to get a good sampling lected in the mapped area, each house is alloc-
frame from which to sample: where would one get ated a number, as shown in Figure 8.1.
a sampling frame for selecting households in a To select those who are to be included in the
large squatter camp? In townships or peri-urban sample, one has to draw random numbers. This
areas, there are often no reliable maps to use as can be done by drawing numbers from a hat,
sampling frames. It is then necessary to explore using a table of random numbers (see Appendix
other possibilities, such as aerial photographs or 4), or by generating random numbers on the
hand-drawn maps to use as a sampling frame. computer. If one wishes to select a sample of 20
Figure 8.1 shows a hand-drawn map of part of from a population of 80, the individuals whose
Khayelitsha (Pick, Cooper, Klopper, Myers, Hoff- numbers correspond to the first 20 random num-
man, Kuhn 1990). bers drawn (between 1 and 80) will be included in
the sample (see Table 8.2).
Simple random sampling
In simple random sampling of individuals, the Stratified random sampling
sampling unit refers to an individual and each If you have some knowledge that strata (sub-
individual in the population has an equal chance groups) of the population differ with regard to the
of being selected for the sample. The researcher measurements being made, you would want
has to draw up a complete list of all individuals in these strata to be represented adequately in the
the population and each individual has to be sample. For example, new peri-urban squatters
given a number. So, in order to choose a simple may be at highest risk for disease. Although they
random sample of medical doctors in Cape Town form only a small proportion of the study popula-
75
Figure 8.1 Hand-drawn map of houses in Khayelitsha
SITE B W AREA QND ocT '89

ROAD 16
| J
Ke oy 8 ey=p sy S52 $53
xB
%
x
BayBreStone]
X% Y 5b o
<2
xn
uy
= XG
a DAM WITH
BAD SMELLIN
xy s
> aM
ed
ess
Dp
xD *KO
X~
PK
on VELISWAS
xIxd x PUB
NOMANB LA) &° bb/ woh es ee
STORE Ge de er ge %569
Source: Pick WM et al, 1990.
tion, they can be chosen as a distinct stratum. In proportional stratified sampling. So, in the exam-
the case of the medical doctors, some are general ple of the medical practitioners in Cape Town, if
practitioners and some are specialists, and they 20% of the doctors are specialists, then 20% of the
may have different levels of knowledge about sample must be drawn from the list of specialists.
emergency services. The population should The results of the specialists and the general prac-
therefore be divided into these mutually exclusive titioners are then combined to get the result of the
(that is, no individual should fit into more than group overall.
one stratum) and exhaustive (that is, each indi- However, equal numbers can be chosen from
vidual should be able to find a stratum to fit into) each stratum to facilitate comparisons between
strata. A simple random sample of individuals is strata. This is often needed when there are very
then selected from each stratum. This is called small strata, which, if chosen proportionately, will
stratified random sampling. not have enough sampled individuals to allow
It is thus necessary to have a full list of the indi- statistical comparison. If strata are not chosen
viduals in each stratum. Typical strata are age proportionately to their size in the study popula-
groups, sexes, geographical areas or social class tion, the analysis involved in obtaining overall
categories. Strata should be selected so that the results is more complex. The researcher has to
variation of the characteristic of interest between weigh stratum-specific results proportionately to
strata is maximized and variation within strata obtain overall results (see Appendix 1: Standard-
minimized. Thus there should be little difference ization).
with respect to the characteristic of interest Simple random sampling and stratified ran-
within a subgroup, and large differences between dom sampling are easy to apply in, for example, a
subgroups. school where full class lists can be obtained and
Often the number of individuals selected from grouped in strata according to sex, age or stand-
each stratum is proportional to the size of the ard. If, however, one wishes to study households
stratum in the study population. This is called in a metropolitan area, it may be cumbersome to
76
8 sampling
Table 8.2 How to draw a simple random sample
RANDOM NUMBER TABLE

The following example describes the drawing of
a sample of 20individuals from a population of
80 using random number tables.
Give every individual a number (80 individuals).
Select a table of two-digit random numbers

(these may be found in most
statistics textbooks).
Close your eyes and put your pen on one of the

numbers in the random number table, for exam-
ple 45. This will be your starting point.
Start with the selected number and choose a di-
rection (up, down, left or right).
We have chosen downwards. Record the num-
Os
@@OG+
bers that appear in the table, moving in the
chosen direction until you have selected 20 num-
bers which lie between 1 and 80.
Any number above 80, and numbers which roe)BB
have already been selected, are ignored.
@)
give each household a unique number to enable thus made in terms of time, personnel, and trans-
simple random sampling, and then to travel port.
through the area visiting the selected households This method is especially convenient if there is
which are dispersed all over the area. In such a list of clusters, but not of individuals in each
situations, cluster sampling may be considered. cluster. Many community-based studies are done
using cluster sampling. On a map, households in
Cluster sampling a metropolitan area can easily be grouped into
In cluster sampling, the study population is first clusters of, say, five households each, and clusters
divided into groups or bunches (clusters) of a selected randomly. In Chapter 22: Immunization
certain size. So, for example, the map could be coverage, examples of specific types of cluster
divided into city blocks and the doctors could be samples are given.
divided into clusters according to columns on a One of the most often-used forms of sampling
page in the telephone directory. Clusters and not is stratified cluster sampling. In a study (Yach,
individual units are then selected at random. Typ- Katzenellenbogen, Conradie 1987) which aimed
ical clusters are schools, city blocks or villages. to estimate the infant mortality rate in a rural area
After a random sample of the clusters has been of Ciskei, strata were defined in terms of village
selected, all individuals, or a random sample of type: urban, traditional, resettled and re-reset-
the individuals in the selected clusters, are tled. All villages were classified into one of the
included in the sample. strata and villages (the clusters) were then chosen
This method is convenient in studies involving randomly from each stratum. In small villages, all
large geographical areas. In rural research, households were included in the sample. Large
villages are often treated as clusters. A full list of villages were divided into clusters, some of which
villages is drawn up, a sample of villages is drawn were then randomly selected. Within a house-
at random and individuals within these villages hold, all women of the specified age group were
included in the sample. Considerable savings are studied. This is an example of multistage sam-
Hig
pling: sampling takes place at various stages or depends on the number in the population and the
levels, for example, villages are first selected size of the sample. If one wishes to draw a system-
according to type of village, and thereafter clus- atic sample of 20 individuals from a population of
ters of households within a selected village are 80, the sampling interval is 80/20 = 4, that is every
chosen. fourth person will be selected. To determine the
Cluster sampling does have disadvantages. If random starting point one has to select a random
the characteristics being measured are random number within the first sampling interval, in this
(heterogeneous) within each cluster (for example, case between 1 and 4.
some children are immunized, some not) then It is important that the list or queue is not
the clusters are representative of the population. ordered according to some system or cyclical pat-
The results obtained from the cluster sample tern. If street blocks consist of five houses each, of
would be similar (precise and unbiased) to that which the two corner houses are bigger than the
from a simple random sample, and the cluster others, a systematic sample of every fifth house
sample would have been easier to study than a would not give a representative sample.
random sample. But if people in the same cluster Systematic sampling is particularly practical ifa
tend to be alike (for example, all are immunized) sample of patient records has to be selected.
and different from those in other clusters, then
the cluster sample is much less efficient than a
simple random sample, and results could be
Non-random sampling
biased. It is therefore suggested that the sample Various types of non-random (haphazard) sam-
size of a cluster sample should be at least double pling exist, for example, judgement or quota sam-
that of the corresponding sample size in a study pling. In judgement sampling, the researcher se-
using a simple random sample. Additionally, lects individuals based on his or her own judge-
many clusters with few individuals per cluster ment. Clearly, no matter how objective one tries
should be chosen, rather than few clusters con- to be in choosing a balanced sample, it is impos-
taining many individuals. In general, no fewer sible to say how representative such a sample is.
than 30 clusters should be selected. Furthermore, In quota sampling, the interviewer is given a
it is very important that the analysis takes the description of the types of people who should be
cluster design into account. (See Chapter 22: interviewed and the number needed of each type.
Immunization coverage.) Statistical techniques The selection of individuals is left to the inter-
for doing this adequately are still being devel- viewer and there is no attempt at randomness.
oped. Often consecutive persons are included in a
sample until a certain number has been studied,
Systematic sampling or patients seen over a certain period constitute
Systematic sampling is often done intuitively. A the sample. For example, to study the prevalence
teacher may choose the child in every third desk of major depression in a rural community, all
to go to see the school nurse, or a nursing man- patients attending the clinic in the community
ager may go through every fifth patient’s file to during one calendar month could be included in
see what kind of patients come toa clinic. the sample. Clearly those visiting the clinic may
Thus in systematic sampling, individuals are differ from those who do not visit the clinic. In
selected systematically from some list or order- addition, the prevalence of the disease during
ing, for example, a queue. In the example of the December may differ from that during July. If, to
medical practitioners, every tenth doctor may be address the problems of seasonal variation, it is
selected from the list. Similarly, every third house decided therefore to rather study a sample of
may be selected from the map. patients attending the clinic throughout the year,
The complete population need not be known including the first ten patients each morning, one
before one starts to select the sample. So, for has to consider whether the patients arriving first
example, you could select a random sample from in the morning may differ from those who come
the admissions to a hospital, as people are admit- later (for example, those who are employed prob-
ted. The procedure is random, since the starting ably visit the clinic before work, and may be less
point is selected at random. Each nth individual inclined to have depression than the unemployed
after the starting point is then selected systemat- who come later in the day). These are examples of
ically. bias (see below) to which non-random samples
If the population size is known, the strategy are particularly prone.
78
8 sampling
Non-random sampling is generally not recom- the spectrum one is interested in. Issues which
mended in quantitative studies, since the level of emerge in the interviews or discussion could be
representativeness is questionable and _ infer- quantified or explored further in (quantitative)
ences (generalizations) can thus not be made studies using representative samples.
about the study population. Most statistical pro-
cedures also assume that samples have been ran-
Sampling bias
domly drawn.
There are, however, situations where non- Bias occurs if the sample is not representative of
random sampling is useful and the only way open the study population, that is, if the individuals in
to the researcher. In studying rare diseases, it is the sample’ differ systematically from the study
often very difficult to locate sufficient cases in a population. For example, if you use volunteers as
community-based random sample. Thousands of respondents, you must realize that those who
people may have to be screened at great cost choose to volunteer may be people who are par-
before a handful of cases are found who can be ticularly interested in the research topic, perhaps
interviewed. Snowballing (or networking) is a more so than the study population from which
sampling technique whereby each case is asked they come. Similarly, hospital cases may be more
to name all other people he or she knows with the severely ill than cases in the community. Sackett
disease. By following these people up and obtain- (1979) has outlined a number of biases which may
ing further names from them, a sample is col- occur and it is useful for researchers to familiarize
lected. Similarly, members of a community could themselves with these.
be asked to name all the people they know with Bias can occur even if random sampling is
the disease. Clearly, this sample could be biased, used. If the sampling frame used is not a full list of
since the most isolated cases may not be men- the study population, one may obtain biased
tioned by anyone and thus not included. The results. For example, using telephone directory
cases could also form separate non-overlapping entries as the sampling frame for a city’s inhabit-
networks which would mean that networking ants would lead to a biased sample consisting
would lead only to cases belonging to one net- only of those people who have telephones and
work, which may be very different from the listed numbers. They would probably differ
others. demographically from those who do not have
A similar approach has also been used to reach telephones or prefer to have unlisted telephone
so-called ‘hidden’ populations: hidden because numbers.
their activities are clandestine, for example, intra- Even if the researcher has a complete list of the
venous drug users. Many studies on drug usage study population, it may be difficult to locate cer-
have been done by studying random samples of tain people for interviewing. If you conduct a
drug users who present at some institution (drug community survey by visiting homes during the
treatment programmes, criminal courts), but day, you are likely to find children, the elderly,
those not presenting could be vastly different and unemployed people at home. Special
from those who do. A sampling technique called attempts have to be made to reach the employed
targeted sampling has been developed to reach people who were selected to be in the sample, by
members of ‘hidden’ populations in the commu- going back in the evening or over a weekend. An
nity through a combination of techniques such as individual cannot be excluded from the sample
networking and key informants (Watters and just because he or she is not at home at the time of
Biernacki 1989). A study of sexual behaviour and the interviewer’s visit. Furthermore, such an indi-
condom use among prostitutes serving the mine vidual should not be replaced by a ‘similar’
hostels used snowball sampling to identify the person who happens to be available. The person
subjects. Since prostitution is illegal and a hidden who is at home in the afternoon may be very dif-
activity, this was the only feasible sampling ferent from the one who is not.
method (Jochelson, Mothibeli, Leger 1991). People do refuse to participate, however. Espe-
If the researcher wishes to form a focus group cially in postal surveys where respondents have to
for discussion or do in-depth interviews on a return completed questionnaires by mail, the
topic (see Chapter 19: Qualitative methodology), it initial response rate can be very low. In postal sur-
may be perfectly acceptable (and indeed desir- veys attempts (second and third mailings, incen-
able) not to select a random sample. Purposive
tives for participation) should be made to
sampling allows for selection of individuals from increase the response rate. In any study where the
79
response is less than 100% (the ideal, but a value, a sample of 323 is needed. To be within
response rate of above 80% would also be accept- 10%, one would only need 81 subjects.
able) you should try to obtain demographic in- If the study sets out to determine the signi-
formation on the non-respondents or a random ficance of a difference between groups or the
sample of the non-respondents. In this way, you association between factors, the researcher has to
could investigate whether the respondents and specify how large a clinical difference or associa-
non-respondents differ in any substantial way, tion needs to be detected as statistically signi-
thus introducing bias in the results. ficant. For example, if a difference in blood pres-
sure of 10 mm Hg (clinical difference) has to be
detected as statistically significant, a smaller
Sample size sample will be required than if a difference of 5
When the researcher has decided to select a mm Hg has to be detected as statistically signi-
sample from the study population, the question is ficant. On the basis of knowledge of the subject
how big a sample should be selected. An unneces- matter, the size of a clinically important differ-
sarily large sample entails wastage. On the other ence has to be decided on. Ina large sample, a dif-
hand, if a study is done on a sample which is too ference of 2 mm Hg may be detected as statis-
small, the results produced will be neither useful tically significant, but this may not be a clinically
or conclusive, and a lot of time and money will significant difference. For further discussion on
have been wasted. For example, if a study is done clinical significance, statistical significance and
to determine the prevalence of asthma in a sample size, refer to Chapter 11.
community, a sample size of 10 might yield two Often time, staff and cost constraints are the
asthmatics, giving a result of 2/10 = 20%. Had one most important factors influencing the sample
additional asthmatic been identified, the percent- size chosen. The researcher can only complete a
age would have been 30%. Likewise, if one less sample if the size is practically feasible. The size of
had been identified, the percentage would have the study population and the duration of each
been 10%. We can see that these results are very interview or examination must be taken into
unstable — one extra case swings the percentage account to determine what is practically feasible.
tremendously. If we had a sample of size 200, 40 It is usually necessary to get the help ofa statis-
cases would have yielded the 20% prevalence, but tician or epidemiologist in order to calculate
one or two extra cases (41/200 or 42/200) would the formula-based sample size, and to discuss
hardly have affected the percentage. Thus the whether the study is worth doing if the practically
larger sample size has yielded a result that is feasible size differs from the theoretically calcu-
much more stable and reliable. lated one. Most formulae for calculating sample
Formulae exist for calculating the ‘correct’ size assume that simple random sampling will be
sample size needed, but the researcher has to be used. You should therefore discuss with the statis-
able to provide some information before these tician or epidemiologist which sampling strategy
formulae can be applied. If the study sets out to will be used (for example, cluster sampling), so
estimate the prevalence of a certain characteris- that the sample size can be adjusted.
tic, the researcher has to have an idea of what the Many studies set out to calculate the preva-
prevalence is likely to be (2%, 20% or 50%). If the lence or level not of one variable but of many. For
aim is to estimate the level of a certain variable, example, in a community survey to investigate
for example, blood pressure, you have to consider various possible risk factors for heart disease
the expected level and variability of the charac- (smoking, alcohol usage, obesity, cholesterol
teristic. Similarly, if the difference in risk in two levels) none of the risk factors can be said to be
groups is to be estimated, some idea of the ex- the one of prime interest. So in this case, the for-
pected difference is needed. Such information mulae based on the level and variability ofa single
can be obtained from a pilot study or from the variable are not useful. A statistician or epidemio-
published results of previous studies. In addition, logist should be consulted to help decide on a
you have to consider how precise or stable you sample size which would give some meaningful
want the sample estimates to be. The more stable results.
the estimates must be, the larger the sample size To obtain good estimates for prevalences in
needed. For example, if you expect the prevalence subgroups (for example, of obesity in the age
to be 30%, and would like to be 95% sure that the categories 15-24 and 25-34 years), adequate sam-
sample estimate is within 5% of the population ple sizes are needed in each subgroup. A sample
80
8 sampling
whose size was calculated to give an overall crude To determine which percentage of patients were
estimate will not yield precise subgroup-specific referred from other levels of the health service
estimates. and which percentage could have been treated at
a primary care facility, patients’ records had to be
Conclusion studied. Since the study population is so large, a
sample of records had to be drawn.
Sampling is done in order to study a subgroup ofa The sampling strategy used was as follows. The
population, while still being able to make general- period from March 1988 to February 1989 was
izations that apply to the population from which divided into quarters. One month was randomly
the group was drawn. To minimize the chances of selected from each quarter. From each selected
unrepresentativeness, probability sampling is month, a random sample of days was selected.
used. This requires a sampling frame and a proce- Thereafter, a 20% systematic sample of records
dure which provides randomness in the selection. (every fifth record) of patients attending for the
Often a simple procedure will suffice, but in other first time were chosen. In this way, bias due to
cases, more complex sampling strategies are seasonal, day of the week, or time of the day vari-
required, and a statistician should be consulted. ation was minimized, using a multistaged sam-
pling strategy with stratification, simple random
and systematic sampling.
EXAMPLE 8.1
Sampling strategy to eliminate bias Reference:
Rutkove S B, Abdool Karim S S, Loening WE. ‘Patterns of
Each year, approximately 35 000 patients attend care in an overburdened tertiary hospital outpatients
the Paediatric Outpatient Department at King department.’ South African Medical Journal 1990; 77:476-8.
Edward VIII Hospital in Durban for the first time.
81
Data collection
and measure-
ment
Introduction Measurement instruments

The collection of information for a study is called The type of measurement instrument used de-
measurement. It is the process by which values pends on:
are obtained for the characteristics (variables) of the population under investigation
individuals being studied. This is obviously a @ the type of information needed
central aspect of research. Whether you want to @ the environment of data collection
measure children’s weights, compliance with a @ the type of observer/interviewer
treatment, illness rates in a community, or time/money/human resources available.
women’s attitudes towards contraception, you
need to measure some characteristics. Questionnaires are commonly used in epidemio-
logical surveys and will receive closer attention in
Measurement can be done bya variety of means: this manual than other methods of measurement.
@ by measurement with instruments (baumano- Even when questionnaires are not used in a study,
meter, scale, thermometer, histological the information gathered has to be recorded ona
measurement); data capture sheet. Many of the issues in ques-
by questioning (by interviewer or self-adminis- tionnaire design also apply to the data capture
tered questionnaire); sheet. On some occasions, data can be entered
@ by the use of documentary sources (such as directly onto computers in the field.
hospital folders, clinic reports);
@ by direct observation. Questionnaires
Loosely defined, a questionnaire is a list of ques-
There is usually more than one method of collect-
tions which are answered by the respondent, and
ing the desired data. For example, information
which give indirect measures of the variables
about the prevalence of hypertension may be
under investigation.
obtained by measuring blood pressure with a
sphygmomanometer (instrument measurement),
Questions can be asked in different ways — by
self-administration or by interview. A self-admin-
by asking subjects whether a doctor has ever told
istered questionnaire requires the respondents
them that they have high blood pressure (ques-
to fill in the questionnaire by themselves. In most
tioning in an interview), or by referring to medical
instances, the respondent reads the questionnaire
records (documents). In practice, information
independently, but at times the questions may be
gathered often consists of replies to questions
asked of respondents as well as measures made
read out one at a time and answers filled in in a
structured manner. An interviewer may stand by
by the interviewer, for example, heights and
to assist with any problems that may arise. Self-
weights.
administered questionnaires are commonly com-
pleted in groups (for example, in a class) or posted
to respondents. The latter is called a postal ques-
82
9 datacollection and measurement
tionnaire. In all these circumstances, the ques- chooses, for example, ‘What is your opinion on
tionnaires have to be extremely clear and well laid the use of oral contraception?’ The alternative is
out, since untrained people will be completing to have closed questions where the answer is
them. either ‘yes’ or ‘no’, anumber, or a choice of one of
In structured interviews, the interviewers a group of predetermined answer categories.
follow a well-defined structure to prevent them Closed questions encourage quicker, more
from placing their own interpretation on the standardized data collection, but may limit
question. Questions are asked in the same way, responses or inhibit the respondent. At times, a
with the same probes and clarifications, while question may be asked in an open-ended way,
recording is also uniform. but the answer may be placed into a predeter-
The reliability of the information obtained mined category when recording the response to
(stability on repeated measurement) increases the question.
with objectivity and standardization. For example:
In an unstructured interview, an interviewer
has a set of guide questions or themes that must Question: ‘What is your opinion about the use of
be covered during the interview. The sequence, oral contraceptives?’
wording and approach used depends on the Actual response: ‘They disturb the hormones in
interview situation. The respondent is thus given one’s body and don’t work.’
much more freedom to express thoughts and Recorded response:
opinions. The interviewer tries to achieve the (i) approved (ii) disapproved; (iii) indifferent.
study aims without imposing a structure on the
respondent. Reliability decreases with the sub- Questionnaires often have a mixture of open- and
jectivity of the unstructured interview, and thus closed-ended questions. Respondents often have
interviewers need to be more skilled and expe- to indicate on a scale of 3, 4, 5 or 6 (called a Likert
rienced for these types of interviews. (See Chapter Scale) what their attitudes towards some issues
19: Qualitative methodology.) are.
Each method of questioning has advantages For example:
and disadvantages, and the choice of the method
for the project must be practical and appropriate Please indicate your opinion on the following
for that project. Table 9.1 outlines some advan- statement by circling the appropriate number
tages ‘and disadvantages of some of the methods along the line.
discussed above.
A cure will be found for AIDS within the next five
Steps in questionnaire development years.
It is necessary to allow adequate time to develop a Strongly Strongly
questionnaire methodically and thoroughly. The Agree Disagree
following steps provide a guide for questionnaire 1 2 3 4 5 6
development.
In general, it is advisable to have an even-
Step 1: List the variables to be measured numbered scale to avoid excessive selection of
One should aim for a well-conceived, concise the middle value. The greater the number of
questionnaire. Variables to be collected should be values provided, the greater the tendency among
within the scope of the study. Make a short list of respondents to avoid extreme values. Thus it is
variables based on the study objectives stated in not advisable to offer too wide a scale.
the protocol. Many researchers do not analyse all
their information. Ensure that only essential (b) Develop questions which will best elicit the
issues are covered. Unnecessary or unrelated variables to be measured.
questions which create the impression ofa ‘fish- At times, one variable may need a series of two
ing trip’ should be avoided. or three questions to capture the true characteris-
tic. This is often true of a variable that has. a
Step 2: Formulate the questions number of components or facets (for example,
(a) Decide on the questions. socio-economic status) or of a complex variable
Questions can be open-ended where the comprising a few variables (for example, house-
respondent replies in whatever way she or he hold density is the number of individuals in a
83
Table 9.1 Main advantages and disadvantages of different methods of

data collection
Advantages Disadvantages
Survey of records:
@ little expense collecting the data format cumbersome as data cellected for
non-research reasons
allows historical comparison e data incomplete
can be quick (although time often ¢ variables inconsistently defined and recorded
underestimated) by different people at different times
set variables collected, so limits scope of study
Individual interviews
personal contact can facilitate response time-consuming and expensive

and quality information
can be done when respondents have low ¢ interpersonal dynamics may interfere with
literacy and cannot fill in self-administered data collection (e.g. suspicion)
forms
can collect data from people who are interviewer variation affects reliability
otherwise not reachable (e.g. people
from rural or informal areas)
Telephone surveys
@ relatively cheap (no transport and travel @ sampling bias: excludes people with unlisted
time costs) numbers or no phone at all
@ can cover wide area: no need to move @ nonverbal cues absent: affects
between locations communication
@ suspicions aroused when call received

at home
¢ questionnaire usually needs to be short, thus

limiting information
Mailed questionnaires
relatively cheap and can cover wide @ generally response rates very low (<30%)
geographical area
no interviewer variation @ people may have problems filling in the form,

leading to poor quality data
@ can be anonymous @ difficult to assess who filled in the form
84
household divided by the number of rooms in the 6 Afrikaans

house). 7 Other {specify..:..d.5.4.0: )
For example: @ Find out from potential respondents what
questions are meaningful to them and how to
Variable = number of live births phrase the questions to ensure that they are
understandable and acceptable. Local slang
Questions: may enhance the suitability of the question-
How many of your liveborn children are living naire. Avoid technical language wherever
with you? possible.
How many of your liveborn children are living
away from you? Step 3: Decide on the detailed practical logistics
How many of your liveborn children have died? of each question
¢ Does this question or section need introduc-
(c) Phrase the questions. tion or explanation?
@ Questions should be simple, concise, and very ¢ Certain questions may only apply to some of
specific. Make sure that there are no ambigui- the respondents, and thus should be
ties. The respondent should know exactly what ‘funnelled’. This means that the questions for
is being asked. the ‘select’ group should be put into a block
@ Decide where open- or closed-ended questions labelled clearly so that these questions are only
are most appropriate. asked to suitable respondents, or are only
Ask one question at a time. At times, questions answered by suitable respondents. The follow-
can be broken up into component parts to pro- ing example funnels the questions twice, once
vide a fuller, more meaningful answer. selecting only females, and then selecting only
Avoid questions which suggest to the respon- those who are not menstruating.
dent the answer that is expected or wanted
(leading question). For example, Females only:
‘Do you think smoking is bad for your health?’ Do youstill get your periods?
versus ‘What is your attitude towards smoking: ayes
do you think it is good for one’s health, bad for 2 No
one’s health or that it has no effect on one’s If no:
health?’ Did your periods stop:
@ Avoid loaded questions, that is, questions that 1 Naturally
lead to specific associations or emotional reac- 2 Due to surgery
tions (sensitive or personal information). 3 Due to radiation/chemotherapy
People differ as to the types of questions that 4 Due to injectable contraception
they perceive as sensitive or threatening, and 5' Other (specify) 1G22.08
KOM R ens
no assumptions should be made about this. 6 Unknown
Examples of sensitive characteristics may
include: income, age, race, educational level, Design the recording procedure keeping in
alcohol use, sexual practice, and psychological mind who the interviewers or recorders of in-
stress. formation are.
Closed question categories should be mutually @ Should prompts (a question or statement
exclusive (that is, no overlap of categories) and which triggers a response, usually without
exhaustive (that is, they cover all possibilities). actually suggesting what the answer should be)
Thus, always allow for an ‘other’ category be used at all? If yes, how and when?
where the interviewer specifies the answer
when recording. For example, if the question ‘Have you ever
For example: used contraception?’ is asked but not under-
What language do you speak at home? stood by the respondent, rather than use exam-
1 Zulu ples like pills and injections (which gives them
2 Xhosa clues as to what they think you want to hear),
3 Sotho interviewers are trained to paraphrase the
4 Tswana question, saying ‘Have you ever done anything
5 English to prevent yourself from getting pregnant?’
85
sry it gene Al ine 9 me A aa A EA AS AO AI A Re
There is still the danger that someone will data collection process. Whether the question-
give a negative answer when they should not naire is done by interview or by self-administra-
have. In that case, the interviewer can ask from tion, try to make it as clear as possible so that
alist prompting the respondent. The answers errors can be minimized. Here are some guide-
are divided into categories which make it clear lines:
which responses were prompted and which type the questionnaire whenever possible
were not. So, using our example of contra- @ ensure good visual spacing (double spacing
ceptive use: may make the questionnaire too cumbersome,
but may be possible if the print is reduced)
Have you ever used any type of birth control or @ use highlights if needed
family planning? @ indent intelligently
@ print without smudges
Have you ever used any of the following: @ make neat blocks (on computer or using a sten-
unprompted |prompted cil) for ticking answers or filling in numbers
_ Birth control pills yes/no yes/no @ different colour paper may be used for the dif-
(oral contraceptives/ ferent sections of the interview, for example,
‘the pill’) household information on white, demographic
Injections information on green, and environmental in-
Intra-uterine device
won formation on blue paper
(IUD, loop, coil) @ visual markers such as arrows and blocks can
Diaphragm make administering the questionnaire easier.
Foam/jelly/cream
Condom Step 6: Consider the scale of measurement of the
NOSterilization variables
(tubes tied) In considering the process and means of
o Partner’s vasectomy measurement, an awareness of the scale of
9 Other (for example, measurement of the variables (see Chapter 11)
withdrawal) helps the researcher to plan adequately for the
Specify. analysis stage. For example, when collecting in-
formation on heights, measures are done and
It is important to train interviewers to use recorded. Here the observations have numeric
prompts in a uniform way. Record on the ques- meaning, and an average can be calculated. How-
tionnaire whether responses were prompted or ever, when collecting information on blood type,
not. each observation is categorized as O, A, B, or AB,
@ At times, a question may require that the inter- and the number of observations in each category
viewee be shown an example or picture to is counted. Here we cannot find an ‘average’
choose from. Decide when this is necessary and blood type: we have to calculate the percentage of
howit will be done. observations in each category. An understanding
of the scales of measurement is very important, as
Step 4: Determine the sequence of the questions different analyses are suitable for different types
The sequence of the questions must be carefully of variables.
planned to ensure logical flow. Basic demo-
graphic information is usually unthreatening and Step 7: Consider what coding needs to be done
is often asked first. Sensitive questions are usually once the data has been collected
put later in the questionnaire, so that some trust Before the raw information can be analysed
has been built up by the time they are asked. In quantitatively it must be prepared for analysis.
addition, if subjects refuse to answer these sensi- The process by which the information is con-
tive questions, the interviewer has at least col- verted into numbers or categories is called
lected some information. coding. So, for example, if male = 1 and female = 2,
all the respondents who are male are given a
Step 5: Plan the layout and design of the ques- number 1 next to the variable ‘gender’. Note that
tionnaire for the variable ‘gender’ the code ‘1’ is merely a
The visual presentation of the questionnaire is an category, but for the variable ‘age’ the code ‘1’
important aspect which can influence the whole would have numeric meaning.
86
Every variable that is being collected must have

set codes with which to code responses. This is Figure 9.1 Data capture sheet
easy for variables which measure characteristics,
such as weight and height, but gets more cumber- Observa- | Age | Sex | BP | Height |Weight| Rx | Hos-
some when open-ended questions have to be tion pital
given codes. These open-ended questions should, 1 ELSE
oe © wl eS
where possible, be given codes before the study 2 Ae bi 5Vesbashes ds ea
so that responses can be coded rapidly. In prac- 3 3) 4 a
tice, many researchers only allocate codes to 4 Mecobb) SE BR sil:eg ll
these more qualitative questions once they see 5 AES ll,tlh edeSe
the types of responses that have emerged. They B fedJ Necline
then allocate codes to the broad themes that
8
come up in the data.
Coding is an important task during the data col-
lection phase. It can be time-consuming, espe-
cially when large surveys are done and the ques-
tionnaire is long. Adequate time should be alloc-
ated to this process, and money budgeted to pay
coders. The task can also serve as a form of super- a on
vision and checking to assess the quality of the
data. If mistakes are found while still in the field,
errors can be checked and corrected. Sometimes
coding can be done directly by the interviewer,
but in this case the recording skills of the inter- nh oO
viewer must be of a high standard, and some

checking procedures should be built into the
process.
Step 8: Consider the means of data analysis 25 a!

ee lo
Information collected on the questionnaire or
data capture sheet needs to be summarized and
analysed. If a quantitative analysis is planned, the
design must be discussed with the person(s) who (Figure 9.2). The hole corresponding to the
will do the analysis, since the design must take appropriate category for each variable for that
into account how the data will be analysed. There individual is torn or punched out. Analysis is done
are several possibilities. by threading a knitting needle through the hole
corresponding to the variable option that is being
(a) Analysis by hand analysed. After giving the pack of cards a
A data capture sheet (Figure 9.1) can be made, thorough shake, all those cards conforming to a
with each variable going vertically and each certain option on the variable will fall to the
observation running horizontally. Information is ground. Simply counting the fallen cards will give
filled into each space. the frequency of that option.
From suchalisting, the researcher can easily do
counts of certain variables, for example, the (b) Analysis by computer
number of males in the group. Cross-classifica- Computer analysis of data is by far the quickest
tion (for example, how many of the males com- and most accurate way of obtaining results. How-
pared to the females have high blood pressure?) ever, many people do not have access to com-
may be rather cumbersome. puters or do not know how to use them.
Alternatively, a special data card can be de- Researchers can choose between personal
signed to capture data for each individual. The computers or mainframe computers. Mainframe
card can be used for the research only, or can computers are extremely big computers which
form part of clinical records. Variables are spaced analyse large data sets. They are used by big insti-
along the edge of the card with punch holes corre- tutions such as universities, businesses, and
sponding to each possible option on all variables government departments. As computer technol-
87
epidemiological research methods and protocol development
Oa
Figure 9.2 Tuberculosis clinic card designed for hand analysis
SAISOd QAIISOd
e
aAleHaN e 9AINSOd
OAeBoN e QAIISOd
e
aaneban ¢ BAIISOd
¢
aaneban
ISSLNINS
©
JILSONSVIC ADOIOLSIH
«
AWY-X
LSSHO
¢
|EINI|d
UOISNJ9
© JeIIH
Auyyedoupe
SNOILVDILSSANI ynpy
ad aAlssalHOld
Wid
e
aseasip LOAYIG
WALAdS
AYNLINIWNALAdS
e
DIAGNOSTIC CRITERIA
PLACE OF SUPERVISION AT
END OF TREATMENT MOVEMENT OF PATIENT THROUGH TREATMENT SERVICES
Referred by to date
SPUTUM CULTURE
AT 60 DOSE
School
Positive
Domiciliary (sisters) Ne
ia
itce Negative
So
cc
e
cr
rts COMPLIANCE
REASON FOR EXIT cst
oOWw
ec ON EXIT
Cured TB ke
a LNIWLVIYL
STIWLIG
Less than 75%
i =
2
oF 75% or more
Absconded, etc
Transferred out of health region END COMPLIANCE (except fortransfer within health region)
OUTCOME
transferred
stopped
Rx
Date
or health
region
of
out
no. of doses 100
possible no. of Rx days
Source: Dick J, Youngleson S M, 1994.
ogy develops, mainframe computers are being re- If the data analysis of a project is being done
placed by more efficient, smaller machines. within a big institution, special computer
Personal computers (PCs) are (generally) small punchers will punch the information onto the
computers which are suitable for personal use or mainframe. Preparation of the data for computer-
departmental use. While very large data sets ization is important and specific procedures have
cannot be accommodated by PCs, many software to be adhered to. The variables for each individual
programmes are now available which enable need to be coded in such a way that the computer
researchers to do sophisticated analyses on these can receive the information. Usually the answers
machines. A suitable data capture system (for are given numeric codes (for example, male = 1
example, Lotus, dBASE, Epi Info) needs to be and female = 2). There must be blocks for coding
decided upon. The questionnaire must be de- along the right-hand margin of the questionnaire.
signed to ensure easy transfer of information by Only coded information may be placed in this
the researcher from the questionnaire onto the space (usually marked ‘For official use only’) and
PC. this information is then typed into the computer
88
by the punchers. Each coding block has a unique ones. Open-ended questions will generate a
number. The coding blocks start from 1 and are variety of responses, which can be used to deter-
numbered consecutively. The coding block num- mine the options for the main study or the codes
bers allow the punchers to find their way around to be used. Common responses will be chosen as
the questionnaire and allow the computer to read the categories that will appear on the final ques-
in specific blocks as specific variables. For exam- tionnaire or coding sheet. Logistical issues, such
ple, the data in block 1 might be allocated for area as the time taken and the suitability of categories
of residence, blocks 2 and 3 for the age of the should also be recorded.
respondent, and so on. At the end of the pilot interview, the responder
Micro hand-held computers are compact port- might be asked how he or she felt about being
able computers which allow for entry cf data at asked questions in this way. Comments on the
the point of data collection. Values are punched timing and the situation should be welcomed.
in by the interviewer, circumventing the need for Suth feedback could be crucial for the main
filling in questionnaires and coding. This method study.
also allows for the rapid analysis of data so that
interim results are available immediately, and the Step 10: Make necessary changes
monitoring of fieldwork is possible. Possible tech- After the pilot study, the interviewers should be
nical problems such as power or battery failure fully debriefed and comments noted. As many of
must be kept in mind and planned for, otherwise the criticisms as possible should be addressed, so
the data collection can become a nightmare. that the final questionnaire benefits fully from the
feedback.
Step 9: Pilot (‘pretest’) the questionnaire The training of the interviewers for the main
A pilot study is a test run of aspects of the main study should also help as a final pilot. Inter-
study. In developing a questionnaire, a series of viewers often pick up problems and should feel
small pilot studies are needed to refine the instru- part of the questionnaire design. In order to bene-
ment. fit from their input, make sure that there is
Pilot studies require an in-depth look at the enough time between the interviewer training
questionnaire with the aim of improving its qua!- and the start of the study to make the final neces-
ity. Usually only a few subjects are chosen per sary changes.
pilot study, for example, 5 to 20 individuals. Remember to check and recheck coding block
In the early stages of developing the question- numbers. A miscount can cause havoc when the
naire, rudimentary questions can be asked of col- data is being punched into the computer.
leagues and friends in order to investigate the
wording and the clarity of the questionnaire.
Glaring problems will be picked up at this stage.
Quality of the data collected
Later pilot studies should be addressed to If the results of an epidemiological study are to be
groups which are increasingly similar to the target sound, the quality of the information collected
population. For example, if the study hopes to needs to be good. How cana researcher evaluate
determine the immunization coverage of chil- the quality of the information?
dren under five years in a particular community, The first, rather intuitive, way of checking data
the pilot studies could take place at the paediatric quality is to look at indicators of the context in
outpatient department of a teaching hospital, which the measurement process took place. The
then at a day hospital covering a similar popula- second is a more formal evaluation of measure-
tion, and lastly using a small community sample. ment error.
At first the researcher must be active in the pilot,
but later the questionnaire should be adminis- Indicators of the context in which measure-
tered by interviewers similar to those to be used ment takes place
in the main study. Before full analysis of the data, it is useful to ex-
During the pilot study, the interviewer should tract information which can give a sense of how
record words and sentences that are not under- and from whom data were obtained. Three types
stood and questions that require prompting or of data are useful for this purpose.
explanation. There should be space on the ques-
tionnaire to write down the respondents’ reac- 1 Data indicating how much data is missing
tions to certain questions, especially sensitive Data may be missing because of failure to locate
89
ee
he 8 Eee
an individual, non-co-operation of an individual same individual (or group) repeatedly higher or

(either by refusal to be included in the survey or repeatedly lower than the real value. This
refusal to answer specific questions), illegible systematic error is called bias. See Chapter 12 for
entries,or lost records. The extent to which data is more information on precision and bias.
missing should be investigated and reported in These two concepts are illustrated in the bull’s-
writing up the results of the study. eye diagrams in Figures 9.3 to 9.6. The centre of the
bull’s-eye represents the ‘truth’ which we would
2 Data on the characteristics of respondents like our data to reflect (thus, deviation from the
Different types of respondents may give different centre will represent poor validity). We would also
information concerning the same issues. The like our data to reflect this ‘truth’ when the mea-
respondents’ age and sex, status in the home, sures are repeated. So, repeated studies should give
occupational status, and other relevant variables similar results represented by the close proximity of
should be recorded. These may give an indication the ‘shots’ to each other (repeatability).
of the reliability of response.
When information on a particular person is (Please note: the concepts of reliability and valid-
being collected, that person is called the index ity extend to other research processes as well. So
person. Self-reporting by the index person is desir- we can refer to bias in sampling analysis, and so
able in most cases, although in unusual situations, on. Please refer to Chapter 4 and Chapter 12.)
for example, in the case of trauma victims or
peopie with intellectual disability, proxy responses The possible sources of error in measurement are:
may be preferred. A proxy response means that 1 the instrument
someone else responds on behalf of the index 2 the observer
person. In such cases, it is important to record the 3 the subject
relationship of the proxy to the index person. 4 the environment.
3 Data on the setting in which the information Improving and evaluating reliability
was collected Variation between measures (that is, poor reli-
The environment in which data collection takes ability) can be decreased by addressing the
place can influence the data. It may be important source of the variation.
to note the time, place, and even the temperature, (i) The instrument variation (giving different
as such details may influence the quality of the results for the same individuals) can be
data. It may also be possible for the interviewer to reduced by standardization and calibration of
record a subjective assessment of the interview the instrument. The quality of the instrument
and the quality of the information provided. or questionnaire must be improved. (See
Appendix 2: Standardization of observers and
Formal evaluation of measurement error: instruments.)
reliability and validity (ii) Observer variation can stem from differences
Data is only as good as the measurement instru- among observers or interviewers (inter-
ments used to measure the characteristics. observer variation), as well as differences in
Ideally, we would like to measure ‘the truth’ every the same observer or interviewer on separate
time we measure the characteristics. Any devia- occasions (intra-observer variation).
tion from this constitutes measurement error. These variations between measures can be
Measures are usually considered in terms of reduced by:
their reliability and validity. @ setting exact ways of measuring (standardiza-
Reliability refers to the degree of similarity of tion of measurement or interview)
the information obtained when the measurement @ intensive training periods for all observers and
is repeated on the same subject or the same interviewers
group. Is the same value arrived at every time the supervision and periodic checks on their work
measurement is taken, or do the values vary a lot selection of observers and interviewers along
on repeated administration? similar criteria with reference to age, sex,
Validity refers to the extent to which a measure educational level, and other relevant character-
actually measures what it is meant to measure. istics.
The measure lacks validity if an observer or (iii) Subject variations may be due to biological
instrument measures the characteristic in the inconsistencies (for example, blood pressure
90
FT ge eeMy Yi i TELT nor Figure 9.4 Reliable but not ae

outs bites itis -@ measurement
Sr aM NAT Figure es ELIC RTM lire

lel(=)
CMe ire Cemiiterei cites measurement
Source: Botha JL, Yach D, 1986.
does not stay the same all day) or inconsisten- can be evaluated by repeating measures of a sub-
cies of memory. Repeated measures make it sample of the study sample, or (in the case of ques-
possible to assess and adjust for biological tionnaires) by asking related questions which, if in
variation. disagreement, will show inconsistencies.
Instrument, observer, and subject variations Statistics used to assess the agreement on
91
i a
E I e
repeated application of questions depend on the class.

type of variable, and a biostatistician should be Criterion-related validity involves evaluating the
consulted about how to evaluate repeat measures results of the index study against the most valid
meaningfully. measure available (the gold standard). The gold
standard is used as a criterion to see how many
Improving and evaluating validity values were correctly identified. The sensitivity
Different levels of validity exist: and specificity of a measurement instrument are
Face validity refers to the validity ‘on the face of statistics which assess the criterion-related valid-
it’ and refers to the extent to which the measure or ity of variables (see Figure 9.7).
question makes sense. Predicted validity requires that the measure, if
Content validity requires that the measure in- used, confirms a known or theoretically hypo-
cludes or accounts for all the elements of a vari- thesized association. For example, social class is
able or issue being investigated. For example, known to be associated with school performance.
educational level and occupational class do not If the validity of using ‘family ownership of a
account for all the elements of the variable social motor car’ as a measure of social class is being
Figure 9.7 Evaluation of CCRC Rm

ECLA e
sensitivity and specificity
The true situation as yielded by criterion measure
Positive Negative
Positive
Results
yielded by
instrument
being assessed
Negative
A
Sensitivity: (true positive rate)= (A+C)
Percentage of positive test results out of all true positives
BBP
one
Specificity: (true negative rate) = (B+D)
Percentage of negative test results out of all true negatives
A
Positive predictive value = (A+B)
The percentage of positive test results that are truly positive
eet
Negative predictive value = (C+D)
The percentage of negative test results which are truly negative
92
assessed, the investigator will determine to what approach is that the researcher understands what
extent ‘family ownership of a motor car’ is associ- is on the questionnaire and can monitor the
ated with school performance. answers. However, it is very important that the
Inconsistent validity may refer to a measure translation is standardized between interviewers
which is valid for one population or group, but during the training sessions. This method is not
might be different in different groups, popula- suitable for in-depth and semi-structured inter-
tions, and environments. Thus questionnaires or views, where exact replies must be recorded and
tests designed for a particular population in the interpreted by the researcher.
northern hemisphere might be unsuitable and in- The second option is for fully translated ques-
valid for populations in developing communities. tionnaires to be used in the interviews. The initial
Questionnaires are particularly vulnerable to translation should be done by someone fluent in
measurement bias, and numerous types have the language, who can judge whether certain
been described in the literature (Sackett 1979). plirases are understandable and acceptable. This
Please refer to Chapter 18: Knowledge, attitude, translation should be proofread by a second
beliefand practice studies. person. The next step is for someone else to re-
translate the questionnaire back into the original
Other issues related to data language. If the re-translation corresponds well to
collection the original questionnaire, the translation is
accepted as adequate. Please note that linguists
Ethics are not necessarily the best people to translate
It is important to get permission from partici- questionnaires. Often health workers who work
pants to be interviewed, after you have explained with people similar to the respondents will have a
the study and method of interview to them. This is better feel for what can be understood by the
called informed consent. person on the street.
Ethically, researchers are required to protect
the identity of study participants. Ideally inform- Interviewers
ation should remain anonymous, that is, names Interviewers should be carefully selected (on the
should not be recorded at all, not even on the basis of, for example, educational level, sex, expe-
questionnaire. However, often some record of the rience), well trained, well supervised, and ade-
respondents’ names needs to be kept so that quately paid. It is important that similar types of
follow-up or even referral is possible. If it is people collect similar information, so that inter-
planned that all participants will be informed of views are as similar as possible. This improves the
their personal results (for example, blood results) reliability of information.
or the overall study results, names and addresses One study might employ different types of
are clearly needed. In such cases, it is ethically people to do different tasks in the data collection
acceptable that information remains confiden- process. A registered nurse might measure
tial, which means that information that can be heights, weights, and blood pressures while a
traced to a particular individual is available only middle-aged woman with high school education
to the researchers. might do the interviews. Yet another person, well
known in the community and with good interper-
Translation sonal skills, might inform people ahead of time of
Clearly, the best way to interview people is to ad- the study and motivate them to participate. (See
dress them in their own language. If the question- Chapter 17: Example 17.2.)
naire is to be used on people who speak a lang-
uage other than that of the researcher, translation Training
will be an issue. Training of staff involved in the data collection is
Interviewers should be fluent in the language of fundamental to good quality information. It is
interview. One option is for the questionnaire well worth spending a few days training inter-
itself to be in a language that the researcher viewers, observers, and motivators so that every-
understands (for example, English), but for the one knows exactly what is expected of them.
interview to be conducted in another language. In During the training period, the nature of the
such a case, the interviewer translates the ques- research, the reason why the research is being
tionnaire, and thus has to be fluent in the lang- done, and the objectives of the study should be
uage of the questionnaire. The advantage of this carefully explained before the tasks are discussed
93
in detail. Cohesion among the team should be forms, and checking that the data was not fudged
facilitated and rules, conditions of employment, all have to be considered. From time to time, the
and other specific instructions explicitly dis- fieldworkers may have to meet with the super-
cussed. visor or co-ordinator to discuss important issues.
There should be clear lines of accountability
and the nature of the supervision should be ex-
Conclusion
plained. For example, the interviewers should
know that they will receive instructions from the Whether data is collected in the community (see
field co-ordinator or supervisor, who will also Chapter 17), the workplace (see Chapter 24), the
check their work for errors. health care setting (see Chapter 15), or by post or
telephone, the measurement tools need to be
Field supervision carefully considered to ensure good quality data.
There should be a clear plan of how supervision The validity and reliability of the data should be
will be done in the field. Site visits, checking of evaluated when describing the results.
94
10 Other issues to
consider in the
protocol
Data management and analysis analysis is planned and available (see Chapter 22:
Immunization coverage).
The proposed analysis of the data should form an The decisions what to measure and how to
integral part of the planning stage of the research measure it are closely linked to the analysis. In
project. On a practical and technical level, the drawing up a questionnaire, you have to consider:
researcher has to decide beforehand who will be how will I analyse this question? At this stage, you
analysing the data, and this person should be should draw up dummy tables (or table shells).
involved from the start. Where and how will the These are tables which indicate how you intend
data will be analysed? Will it be necessary to buy a to represent your data. There should be tables
PC or a special analysis programme (and learn describing sample characteristics (for example,
how to use these)? Will you have to approach pro- age/sex), as well as associations between vari-
fessional biostatisticians, for example, at a uni- ables. These tables should clarify whether you
versity or research institution? Funds may be have included questions on all the factors needed
needed for the analysis (for buying special equip- for the proposed analysis, or whether many
ment or paying for the services of a biostatisti- redundant questions are asked.
cian), and these have to be budgeted for. In set- Once the data has been collected, you can start
ting up a time schedule for the project, you by completing these dummy tables; this gives a
should include a reasonable period for analysis, good starting point for analysis. Researchers who
and not think that the time-consuming part of the have not thought about their analysis throughout
project is completed once the data has been their project planning often find that once the
collected. data has been collected, they do not know what to
More fundamentally, however, the analysis do with it. A project which proceeded well
forms part of each step of the study. It is not just through its field-work stage can grind to a halt at
the final step before write-up. When defining the the analysis stage if limited thought has been
study objectives, you have to decide how to given to the analysis. Dumping the collected data
analyse the data in order to reach the objectives. on a Statistician who has not been involved in the
Is such an analysis at all possible, or would you rest of the project does not solve this problem,
have to reformulate your objectives? It is thus since the researcher is the one who has to tell the
necessary to express objectives in measurable statistician what questions he or she would like to
terms. have answered.
When doing the literature review, you should Sometimes during the analysis phase, the
note the methods of analysis used by other _researcher discovers that some of the intended
researchers. When deciding on study type and analyses cannot be performed (for example,
sampling scheme, you have to consider whether because there is too much missing data or too few
methods are available for the analysis needed. For people with a certain characteristic). This can be
example, if a complex sampling scheme is used, minimized by carefully considering the analysis
you must be sure that the appropriate means of beforehand. Similarly, you may find new associa-
95
‘Table 10.1 Budget for AIDS study — 1 March 1994 to 28 February 1995
A Recurrent expenses
Personnel 270 000

Current employees
Dr A (10% effort) 20 000
Ms G (25% effort) 35 000
Ms K (10% effort) 15 000
To be employed
Research officer 110 000
Fieldworker (x 2) 80 000
Consultants
Dr Y (R 100 per hour) 10 000
Supplies 15 500
Questionnaires (x 1 000)
Blood collection kits (x 1 000)
Stationery and photocopying
Emergency first aid kit
Travel 37 800
Lease of vehicle (x 12 months)
Vehicle maintenance (40c per km x 18 000 kms)
Use of private vehicle (R1,10 per km x 6 000 kms)
Patient care costs 7 500

Hospital visits and in-patient stays
Laboratory tests
Drug treatments
General 3 000
Cost of reprints 1000
Attending local conference 2 000
B Capital expenses
Purchase of vehicle (4x4 van) 120 000

Equipment for laboratory tests 8 000
Total capital expenses
Total direct costs R461 800
Cc Administrative overhead (20%) R92 560
TOTAL BUDGET R554 360
96
10 other issues to consider in the protocol
tions to investigate in the analysis, but you should

concentrate on the intended analysis and not Table 10.2 Actions to facilitate an
spend too much time ‘fishing’ for findings. ethical approach
Resources Obtain permission from a recognized ethics

committee
The execution of a research study depends on the
availability of funds and resources needed for the Obtain permission for access to information/
investigation. Some studies can be completed on groups from:
a shoestring budget, but research planners @ Authorities (for example, local government,
should be cautious of under-budgeting, as this institution
can compromise the validity of the study. Organizations
The additional resources needed for the study *e Community
(overt costs) and those that are available (hidden
costs) need to be considered. Personnel needs Regarding people:
may include technical staff (for example, field- @ Informed consent
workers, coders, field co-ordinators), and acade- @ Confidentiality/Anonymity
mics such as statisticians and other consultants. @ Protection from risk
Other costs include equipment, printing, station- — physical
ery, transport, venues, and special tests or invest- — emotional
igations. — social
You should use whatever resources are avail- — other
able in the setting in which the research will take
place. Service providers may second some health Act on findings
personnel to do the field work, and may provide @ Keep in mind implementation goals when
some of the equipment needed. planning
You can apply for research funds from the Med- @ Write up
ical Research Council, the universities, private ¢@ Publish
corporations (for example, Anglo American), and @ Motivate for action
non-governmental organizations (for example,
the Health Systems Trust and the Heart Founda- Publish
tion). Acknowledge the work of others
A formal budget should be prepared, whether Obtain permission to publish or use
applying for funds or not. Table 10.1 illustrates a material
budget. Depending on the type of project, differ- Report negative and positive findings
ent items may be listed, and some of those
mentioned here eliminated.
Ethical and legal considerations

All researchers have a responsibility to provide time. Confidentiality or anonymity should be
benefits to people in general and to stay within ensured.
the law. To ensure that methods and approaches In a protocol, plans for safeguarding the rights
used in a study are legally and ethically accept- and welfare of the participating subjects and the
able, it is essential that protocols be submitted to method of obtaining informed consent should be
an ethics committee well before the start of the explained.
study. All universities, as well as some other insti- Other issues to be considered include permis-
tutions, have such committees. sion for access, the need for action on findings,
Benefits must outweigh the personal risk — and procedures to adhere to when publishing re-
physical, psychological or social — of participa- sults. Table 10.2 provides brief guidelines to facili-
tion in the study. Study techniques should thus tate an ethical approach.
not be harmful, and participants should be A full discussion of the ethical aspects of
informed of the nature of the study, its proce- epidemiological research can be found in Chapter
dures, and their right to refuse or withdraw at any 3: General philosophical issues in epidemiology.
97
Report writing and scope of the study are stated. The study is
located in terms of reviewed literature.
Many research projects are never written up as 3 The Study population and Methods section
reports or journal articles. The knowledge gained gives details of the study methods and should
from both the results and the methods is thus not
include:
passed on to others. This is obviously a waste of
study design
resources. All researchers should be encouraged
study population
to make their project available for others (other how the sample was selected
researchers, service providers, and decision-
response rate
makers) to read. This should be done even if the
@ characteristics measured
project was a ‘failure’, for example, if sampling @ methods of data collection
biases were encountered or a poor response was
@ quality of the data
achieved. A description of problems encountered
@ how the data was analysed.
will help other researchers to avoid such prob- 4 The Results section includes written descrip-
lems. tions of the findings, as well as tables and charts
No project can be considered finished before it
to clarify the text. The results of statistical tests
has been written up. The write-up could be in the
should be given.
form of a scientific article, a comprehensive All tables and graphs should be clearly titled,
working paper containing all details, or a special
and totals and figures should make sense.
report to decision-makers/service providers. Everything in the tables and figures should be
Results should also be reported back to inter-
relevant to the topic.
viewers and research participants in an accessible
ao The Discussion provides the chance for the
form. writer to interpret the findings, air opinions,
compare the results with other findings, and
The qualities of a good report point out the shortcomings of the study.
@ The title should clearly explain what the report
6 The Conclusion rounds off the report and gives
is about. It should attract potential readers.
some indication of the scientific and practical
The summary or abstract should inform the
implications of the study.
potential reader in brief what the aims,
7 References should always be included.
methods, population, and results were.
@ The style of the report should be easily under-
standable. Language should be clear and The report has a similar structure to the protocol.
simple, while the structure should have a If the protocol has been prepared carefully, the
logical flow (outlined below). Graphics and writing up flows easily. The results and the dis-
tables should be clear and informative. cussion are the sections which need new effort.
@ The report should be concise with no unneces-
sary information. Digression and waffling have
Conclusion
no place in a good report.
From the above, it is clear that many administra-
The logical structure of areport tive issues need to be considered in the protocol,
1 The Abstract is asummary written after the rest in addition to the methodological issues. These
of the report has been written. administrative issues should receive close atten-
2 The Introduction puts the reader in the picture. tion, since they could sabotage an otherwise well
The motivation for the study and the objectives planned project.
98
Data presentation, analysis,
and interpretation
11 An introduction to data presentation,

analysis, and interpretation
G Joubert
12 Integrating epidemiological concepts |

S S Abdool Karim .
11 An introduction
to data
presentation,
analysis, and
interpretation
Introduction Many researchers may feel at a disadvantage

since they do not have a good mathematical back-
This chapter aims to provide an (almost) formula- ground, and may even fear formulae. Such a
free introduction to the presentation, analysis background is not necessary to enable you to
and interpretation of epidemiological data. master the basic statistical concepts. However,
Statistics have a contribution to make in all researchers should consider upgrading their
phases of a research project from the planning, mathematical skills (by attending courses or
through the data analysis, the interpretation and working through textbooks) to fully come to grips
discussion of the results, to their publication. In with the statistical methodology. We hope that
this section, we focus on the processing and ana- this section will instil an interest even in those
lysis of data, as well as the presentation and inter- readers who have a fear of numbers, and encour-
pretation of results, with the aim of familiarizing age them to develop the necessary skills.
researchers with important statistical concepts. A
researcher should consult a statistician to discuss Variables
the statistical aspects of the research project in
the planning phase, as well as the analysis and The characteristics which one measures, and
write-up phase. A basic understanding of statis- about which data are collected, are referred to as
tical concepts will ensure that the researcher gets variables (Figure 11.1).
the most benefit from interaction with a statis- Categorical variables specify which category an
tician. Such an understanding will also enable him observation falls into. Such variables are called
or her to evaluate published research findings. nominal if categories are in no order, but are only
Statistical concepts play a dominant role in identified by name, for example, the categories
epidemiology, as you can see when reading any male or female for the variable sex. If there is
epidemiological journal article. However, the some order amongst the categories, the variable
statistical techniques used are not always appro- is called ordinal, for example, the variable social
priate. This might be because many statistical class which may be recorded in categories 1
textbooks place the emphasis on ‘how’ a statis-
tical method is applied (the cookbook approach),
Figure 11.1 Types of variables
rather than ‘when’ and ‘why’. In addition, the
widespread availability of personal computers
Categorical
(PCs) and statistical software enables a researcher
to do complex analyses without having to under- ae te
Nominal Ordinal
stand the techniques used, or even looking at the
data closely. The PC will provide an answer, Numerical
whether the question asked is appropriate or not.
Data analysis cannot be done adequately by eyeaten
Discrete
iach a
Continuous
simply following a few ‘how to’ rules.
101
c datapresentation, analysis, and interpretation
(professional) to 5 (manual), or the severity of a therefore important to check questionnaires or

disease (mild, moderate, severe). A categorical data recording sheets regularly during the data
variable which has only two categories is called a collection phase so that any queries can be
dichotomous or binary variable. Categorical vari- followed up, ideally while-still in the field. Errors
ables are also often referred to as qualitative vari- can also occur when the data is entered onto the
ables. . computer. Double entry of data (which means
Quantitative variables are called numerical that the data is typed independently by two data
since the allocated numbers have intrinsic quan- typists and the two sets compared for discrepan-
titative meaning. Numerical variables are dis- cies) should be done to minimize typing errors.
crete if the variable can only take on certain There may also have been errors in the original
values (for example, the number of children in a data source, such as a hospital file. The data
family can be 2 but not 2,75 or 2,751). Discrete checking outlined below is done after computer-
variables are often counts of events, for example ization, but before analysis.
the number of patients seen at a clinic per day. A Table 11.1 lists suggested data checking pro-
numerical variable is continuous if the variable cedures. For these procedures, you need a com-
can assume any numerical value (for example, puter listing of the values of all variables, as well
height can be measured very precisely on a conti- as the interrelationship of variables. If any strange
nuous scale). values are found, you should make alist of the
The choice of appropriate methods of analysis subject observation numbers or questionnaire
depends on the types of variables under invest- numbers of those cases. Any queries resulting
igation. from the data checking procedure should be
investigated by going back to the raw data (for
Data checking before analysis example, questionnaires), not by guessing. It is
Before any analysis is done, the data set must be therefore important that questionnaires are kept
carefully checked to identify strange values (out- and ordered so that it is easy to pull out the forms
liers) and errors. Such errors can strongly influ- with queries. Some queries will be easy to solve,
ence and bias the results, and should therefore be for example, a 1 might be punched as a 7. Others
detected and corrected before the data is ana- may be difficult to solve, and if the value in the
lysed (as the saying goes, garbage in, garbage out). raw data is completely impossible (for example,
Errors could occur when data is coded on ques- sex = Z instead of M or F and the sex cannot be
tionnaires or transcribed from patient files. It is deduced from the name), the value should be
Table 11.1 Suggested data checking procedures
One variable at a time non-smoker and the missing value should be

made 0.
Categorical variables @ Check missing values
@ Check that all the categories on the computer
listing are plausible codes e.g. for the variable Cross-checking of variables _
sex M = Male, F = Female, X= ?Z=? ¢ If demographic information
is asked in more
¢ Check missing values than one question, do the answers agree:
sexi = M and sex2 = F?
Numerical variables @ Ina longitudinal study, are the changes in
@ Check that the values fall in plausible range. values between assessments plausible? age
Are the extremes possible? e.g. height = 250 at week 1 = 25 years, age at week 52 = 35?
height = 5 - @ Do related questions give plausible results? _
@ Are values of 0 really zero or do they indicate 1. Sex = M and oral contraceptive usage = Yes
missing values? (if the latter, then O must be 2. Age = 27 and age at which started smoking
made missing). = 33.
¢ Are missing values really missing or do they in- _ 3. Do you smoke? = No
dicate 0, e.g. a missing value for the number How many cigarettes do you smoke? = 10
of cigarettes smoked per day may indicate a
102
11 anintroduction to data presentation, analysis, and interpretation
Table 11.2 Listing of 10 observations
| Subject Age Sex Current smoker Ever smoked Age

started
smoking
ae 415
2
3
4
5
6
4 %
8
9 TS
eee
StS
10 ss
22
4
<2
2k <
x2
ZeaxxZ2eix
classified as missing. The aim is to eliminate the data (for example, about the distribution of
errors, not fudge the data. Respondents may give the data). Through graphical display you can
conflicting responses when asked their opinions determine whether your data satisfies these
or attitudes. These are not data errors and cannot assumptions and thus whether the intended
be ‘cleaned’. (If extreme values which are not im- statistical analyses are appropriate. Graphical
plausible occur and cannot be checked for some display makes the data accessible to the re-
reason, statistical techniques which are robust searcher, as well as to the readers when results are
against extreme values can be used in the ana- published. Patterns or relationships detected
lysis. See Summarizing the sample data, below.) through, or questions raised by, exploratory data
Graphical display of the data is described below analysis can be evaluated by formal statistical
and is also useful in the data checking phase. methods (also called confirmatory data analysis).
If data is entered directly onto a PC in the field,
the ‘data entering programme (for example, Epi
Info) should be set up in such a way that implaus- Graphical display of numerical variables
ible values can be discovered immediately. So, for The aim of displaying a numerical variable is to
example, one should specify a plausible range for investigate certain characteristics of the data,
a continuous variable such as height. If a value namely where the central value lies, and what the
which falls outside this range is typed in, the pro- spread and shape (distribution) of the data are.
gramme alerts the data typist that a strange value The distribution shows how many observations
has been entered, and must be corrected. have a given value or lie within a certain range of
Table 11.2 gives a listing of the information values out of all the possible values. As will be dis-
regarding age, sex and certain aspects of smoking cussed later, the shape of the distribution is
history of 10 subjects included in a study of important in deciding on appropriate summary
community members aged 20 years and older. statistics and statistical analyses. A distribution
How many problems can you identify? has a symmetric shape if approximately the same
number of observations lie above and below the
centre value of the distribution. This will be illus-
Exploratory data analysis trated later, using various types of graphical dis-
Before any formal statistical analysis is done, the plays (Figure 11.2).
data (especially numerical variables) should be The following small data sets will be used to
explored through graphical display (also known illustrate basic concepts.
as exploratory data analysis). Such an exploration i) Inahypertension study, diastolic blood pres-
helps the researcher get to know the data. Errors sure was measured on a sample of 16 respond-
or strange values, and patterns and relationships ents. Their values were:
in the data can be detected. Furthermore, many 75, 84, 80, 97, 105, 188, 64, 78, 68, 86, 79, 105, 89,
statistical procedures make assumptions about 88, 93, and 92.
103
ce data presentation, analysis, and interpretation
Figure 11.2 Types of data distribution
Shape of stem and Shape of

leaf plot box plot
Skewed to high
values
low
Skewed to low
values
Symmetric =
; could be difficult to
Bimodal C detect in box plot
ii) Two groups of children were scored on apsy- From: Joubert G, Schall R. ‘Some statistical con-
chological test. The scores were (in order): cepts.’ In Kibel M A and Wagstaff L A (eds). Child
Group 1 Group 2 health for all: A manual for Southern Africa, Cape
(n= 24) (n=25) Town: Oxford University Press, 1995.
38 46 46 50 52 41 41 414146
52 55 57 59 62 46 46 48 52 52 Stem and leaf plot
63 63 64 65 69 54 54 58 60 60 In a stem and leaf plot, each observation of a
6971717475 6171747680 numerical variable is plotted. All possible leading
79 81 87 94 84 85 93 104 148 values form the stem, and each data value is
104
11 = anintroduction to data presentation, analysis, and interpretation
represented by writing its trailing digit in the detecting distributional differences between
appropriate row next to the stem, in this way groups. The back-to-back stem and leaf plot of
forming the leaves. Before the plot can be drawn, the psychological data is depicted in Figure 11.3.
the data are ordered from small to large. The value 148 in Group 2 seems to be an outlier
In the blood pressure example, the ordered and should be checked. From the stem and leaf
data set is: plot, it is clear that the values in Group 1 followa
64, 68, 75, 78, 79, 80, 84, 86, 88, 89, 92, 93, 97, 105, symmetric distribution, but the values in Group 2
105, 188 are not symmetric (also called skew), as most of
the values are low.
Since the values range from 64 to 188, the leading
digits which are to form the stem are 6, 7, 8, 9, 10, Box plot
11, 12, 13, 14, 15, 16, 17, 18. The stem andleaf plot Asecond graphical display appropriate for nume-
is then as follows: ri¢al variables is the box plot. A box plot displays
the date in summarized form. Not all points are
18 8 plotted in a box plot, only selected summary
Lt values — namely the median, 25th and 75th
16 percentiles, and the maximum and minimum
15 values.
14 The median (or 50th percentile) is that value
13 which divides the sample values, ordered from
12 small to large, in half, that is, half the values lie
1l above the median, the other half below. If the
10 55 sample size is odd, the middle value in the
9 237 ordered series is the median. If the sample size is
8 04689 even, the median is the average of the two middle
é 589 values. Therefore the median of the 16 blood
6 48 pressure values is the average of the 8th and 9th
values, namely the average of 86 and 88, thus 87.
Any value which lies far away from the other Quartiles divide the sample values in quarters.
values (an outlier) can be detected easily. The The lower quartile is also called the 25th percen-
valué 188 is suspect and should be checked. It tile, and the upper quartile the 75th percentile.
may be a typing error: 88 typed as 188. Apart from The distance between the lower and upper quar-
the extreme value, the values seem to follow a tile is called the interquartile range. For the 16
symmetric distribution. Figure 11.2 depicts what blood pressure values, the lower quartile must lie
different data distributions would look like when between the 4th and 5th values in the ordered
illustrated by a stem and leaf plot. series (values 78 and 79). The 25th percentile is
If groups are to be compared, stem and leaf defined as being a quarter of the way between
plots drawn next to one another are useful for these two values, namely at 78,25. The upper
Figure 11.3 Stem and leaf plot of children’s psychological test scores
GROUP1 GROUP2
(n= 24) (n= 25)
105
c data presentation, analysis, and interpretation
a a a eR
quartile lies between the 12th and 13th values

(values 93 and 97). The-75th percentile is defined are weee eee mee
as being three-quarters of the way between the psychological test scores
two values, namely at 96.
In the box plot, the upper and lower quartiles
are displayed by the top and bottom of the rect-
angle which contains the central 50% of the data.
The median is indicated by a horizontal line
within the rectangle. The quartiles are connected
by lines to the largest and smallest value in the
sample. (Some computer packages also indicate
extreme values other than the maximum and
minimum in the box plot.) If the distance from 100
the lower quartile to the median is approximately
equal to the distance from the upper quartile to 90
the median, the distribution is symmetric. Box 80
plots can thus be used to detect outlying values,
70
as well as to determine the shape of the distribu-
tion of the values. The box plot of the 16 blood 60
pressure values is as follows: 50
¥
180 40
160 30
140 GROUP 1 GROUP2
120 n=24 n=25
100 Source: Kibel M A, Wagstaff L A, 1995
80
60 x
The box plots of the psychological test data are Bivariate plot
shown in Figure 11.4. The symmetry of Group 1is Bivariate or scatter plots are used to plot two
clear, since the median lies approximately half- numerical variables against one another, one
way between the two quartiles. Group 2 is asym- variable on the X axis (horizontal), the other on
metric, as can be seen from the fact that the the Y axis (vertical).
median is much closer to the lower quartile than These plots are especially helpful when one
to the upper quartile. Figure 11.2 depicts how dif- wishes to investigate relationships between vari-
ferent data distributions look when illustrated by ables, and should be drawn before astatistical
a box plot. evaluation of a relationship is done by means of,
for example, correlation or regression. They can
Histogram help to identify errors (Figure 11.5) and clarify
This graphical display is generally used not to relationships between variables (Figure 11.6).
evaluate the distribution of a numerical variable
at the start of the analysis, but rather to summa- Graphical display of categorical variables
rize a number of observations in compact form. A Whereas the graphical display of numerical vari-
histogram is drawn for a numerical variable ables described above is essential before formal
which has been grouped into categories. For statistical analysis can be done, the methods
example, blood pressure can be grouped into 10 described here for categorical variables are pri-
mm Hg groups. Each rectangle in the histogram marily used to present the results. The choice of
can represent either the number of observations method is a matter of personal preference. Bar
or the percentage of observations that fall in that graphs and pie charts can be used to represent
interval. The population pyramid described in categorical variables. The height of the bar in a
the section on demography in Chapter 2 is a spe- bar graph (or the size of the slice in a pie chart)
cial type of histogram used to depict the age and can represent either the number of observations
sex composition of a population. in a given category of the variable, or the percent-
106
11. = anintroduction to data presentation, analysis, and interpretation
ori wwMeN CMe mre rs
44 54 64 74 84 94 104 Weight
in the bivariate plot of height and weight, the case with height = 189 cm and weight = 44 kg stands out as being question-
able. In an investigation of only height or weight, the values for this case would not have seemed strange.
Figure 11.6 Bivariate plot of the relationship between TT Femited)

and age
AGE
Year
20 40 60
The relationship between lung function and age is depicted in the following plot. The plot clearly indicates that lung
function values only start to decrease with increasing age after the age of 40 years.
107
Advanced graphical techniques can be used in

Figure 11.7 Bar graph showing exploratory data analysis to look at complex rela-
MCR My My qrlescmt Cyto a tionships between variables.
Graphical display of data is done to let the data
bution of a sample of factory reveal the truth about itself. The way the
workers (n = 100) researcher chooses to depict the data graphically
should reflect this truth. Figure 11.9 discusses
honesty in graphical display.
Summarizing the sample data

By examining the graphical display of the data,
the researcher can decide which measures should
be used to summarize the data.
Summary statistics, also known as descriptive
statistics, are used to summarize and describe the
data in a concise form. The basic summary of the
data is the first step of the statistical analysis,
whether you have conducted a descriptive study
where the aim is to estimate the level or distribution
low medium high extremely
of a characteristic, or an analytic study where the
high
aim is to investigate associations between charac-
Dust exposure category
teristics. By summarizing the information asafirst
step, you can immediately see whether all intended
analyses will be meaningful. If, for example, only
two individuals out of asample of 100 have a certain
age of all observations that fall in a given category. characteristic, there is little point in exploring the
So, for example, the dust exposure category distri- association of that characteristic with other factors,
bution of factory workers can be represented by a due to the problem of small sample size.
bar graph, as shown in Figure 11.7. It is important
that the graph is clearly labelled, indicating the Categorical variables
categories depicted, whether percentages or fre- Categorical variables can be summarized by the
quencies are indicated, and the sample size. A bar number and percentage of study subjects which
graph is similar to a histogram, but the bars are are classified into a given category. For example, a
drawn with spaces in between. sample of 95 contains 16 males (17%) and 79 fe-
Figure 11.8 is an example of a so-called block males (83%). It is important to state the sample
chart or three-dimensional bar graph. For four size. The percentage can be rounded to the near-
different age/sex groups, the percentages of est whole number (especially in small samples),
respondents who mentioned different categories or can be given to one decimal place.
of AIDS prevention strategies are displayed (a
respondent could choose more than one option). Numerical variables
So, for example, 18,2% of males under the age of For numerical variables, on the other hand, one
18 mentioned condoms, whereas 8,1% of females needs to indicate where the central location of the
under 18 years mentioned condoms. From the data lies, as well as what the variability of the data
bar graph it is clear that males tended to mention is (that is, what the spread of the data is).
condoms more often than females did, whereas
females mentioned one partner more often than Measures of central location
males did. The most commonly used measure of the central
location is the arithmetic mean or average
Conclusion (denoted by xX). This is the sum of all individual
Exploratory data analysis has been presented values (xj) divided by the number of individuals
here as a tool for getting to know the data. How- in the group (n):x; (where > is the sum of all the
ever, it is an approach to data analysis which goes n
much further than what has been outlined above. observations from 1 to n).
108
So, for the 16 blood pressure values the mean is has a large influence on the mean, pulling it away
(64+ 68+ 75+ 78+ 79 +80 + 84+ 86+ 88+89+92+ from the other four values. If the distribution is
93 + 97+ 105+ 105+ 188) /16=91,9 asymmetric or if there are outliers, you should
rather use the median as measure of central loca-
(The mean has at most one decimal place more tion. The median is called a robust alternative to
than the original values. The above mean should the mean, since it is not sensitive to outliers or
not be given as 91,9375 since it implies spurious ac- asymmetry in the distribution of the data. For the
curacy in the original measurements, namely that values 1, 2, 3, 4, 5, the median is 3. For the values
the original measurements were taken to the fourth 1, 2, 3, 4, 50, the median is also 3, indicating that
decimal place.) the median is not influenced by the extreme value
The mean is sensitive to extreme values, espe- of 50. If the distribution of the data is symmetric,
cially in small samples. For example, if the sample the median and mean will be close together.
consists of five values, namely 1, 2, 3, 4 and 5, the ‘For some variables (especially laboratory deter-
mean is 15/5 = 3. However, if the values are 1, 2, 3, minations), you may encounter censored obser-
4 and 50, the mean is 60/5 = 12 . The high value 50 vations. These are observations for which the
Figure 11.8 Block chart of the percentage of respondents (in four age/
ese RRR Cue roe me it
strategies (a respondent could mention more than one option)
‘| FEMALE > 18
=53
11,3 % als £A18,9%
FEMALE
< 18
n=/74
foal|8,1 % 18,9 % 149%
MALE
= 18
nase
34,4 % 5,7 % 11,4%
MALE < 18
4
31,8 %
Condoms One Careful Visit Other

partner about Clinic
partner
choice
Mathews C, Kuhn L, Metcalf C A, Joubert G. Cameron N A. ‘Knowledge, attitudes and beliefs about
AIDS in township school students in Cape Town.’ South African Medical Journal 1990; 78:511-16.
109
i LCE CL A AAA AA ICC EAT EA A A
ee cat csc ate
Figure 11.9 Honesty in graphical display
To ensure that the graphical display reveals the truth about the data, one should keep the following in mind.
Categorical variables
If bar graphs or pie charts are drawn to reflect percentages in different categories, one should state (in the caption or title
of the graphic) the number out of which percentages were calculated. To represent 1 out of 3 as 33,3%, without stating
that the sample only consisted of 3 individuals, is misleading.
Numerical variables
When drawing scatter plots of two variables, it is important that reasonable scales are used. Reasonable scales are scales
which do not put too much ortoo little emphasis on the actual results. Scales should also start at the nought value if
possible. The following plot seems to indicate that there was abig increase in the percentage of mortality due to cause X
between 1985 and 1986. However, close inspection of the scales reveals that the percentage increased from 10,5% in
1985 to 11,0% in 1986, hardly as dramatic an increase as the plot suggests. It may therefore be a good idea to include the
0 value on the scales.
Percentage
11,0
10,9
10,8
10,7
10,6
10,5
10,4
10,3
10,2
10,1
10,0
1981 1982 1983 1984 1985 1986 1987
Year
actual value is not known, but it is known that the tain value (for example, a scale which weighs
value is greater than a certain maximum (for most respondents as 60 kg), but is of little use as a
example, age greater than 65 years) or smaller measure of central location.
than a certain minimum (for example, a labora- In the case of a skew distribution, a mathe-
tory instrument can detect very low levels of matical transformation of each value (for exam-
cholesterol, lower than a certain minimum, but it ple, the log transformation where the log of each
cannot determine the exact values). In these cases, value is calculated) can be used to make the dis-
since the exact extreme values are not known, the tribution more symmetric. The mean of the trans-
median is the appropriate summary measure. formed values is calculated and then transformed
The mode is the value which occurs most often. back. In the case of the log transformation, the
This may be useful in determining whether a antilog of the mean of the transformed values is
measuring instrument has a preference for a cer- taken. This is known as the geometric mean.
110
For the psychological test scores in Group 1, the graphical display) in order to decide on appro-
mean (64,3) is the appropriate measure of central lo- priate summary statistics. The following type of
cation. For the other group, however, the median has data is often encountered: grams of alcohol con-
to be used. If you want to compare the two groups, sumed weekly were noted in a heart disease risk
you must, of course, use the same summary statistics factor survey. 76% of respondents did not drink
for both groups. If the values of one group are asym- (so grams of alcohol consumed equal zero) and
metric or if there is an extreme value, the locations of the remainder drank between 20 and 1 000 grams.
both groups should be summarized by medians. Is it sensible to summarize the group’s alcohol
consumption with the overall mean of 78,2 g? Per-
Measures of variability (spread) centiles would be more appropriate (median = 0
The variability of a data set is the degree to which and 75th % = 0), or alternatively the researcher
observations in the data set vary from each other could state that 76% did not drink, and then state
with respect to a particular characteristic. For the mean or the median consumption of the
example, consider two samples of 10 children drinkers.
each with ages as follows: For a given numerical variable, the central
value (summarized by the mean or median) may
Group 1: 2, 4, 6, 7, 8, 10, 11, 12, 14, 16 be of little interest. For example, it may be more
Group 2:6, 6, 8, 8, 8, 10, 10, 10, 12, 14 relevant to the researcher to know what percent-
age of the sample has high blood pressure than
The mean and median ages in both groups are 9 what the mean blood pressure is. A numerical
years, but the observations in Group 1 vary much variable can be categorized and then summarized
more than those in Group 2. The variability of as a categorical variable. The choice of categories
Group1is thus greater than that of Group 2. for a numerical variable should make sense clini-
The range (the largest value minus the smallest cally (for example, hypertension is defined as sys-
value, or denoted by indicating the largest and tolic blood pressure of 160 and above or diastolic
smallest values separately) is a measure of vari- blood pressure of 95 and above). The choice can
ability which we might consider using intuitively: depend on the distribution of the data (for exam-
in Group 1, the range is 2 to 16, in Group 2, it is 6 ple, grouping individuals over 65 all together in
to 14. However, the range is not very useful as a one age group if there are only a few who are be-
measure of variability, since it is greatly influ- tween 65 and 74, 75 and 84, and so on), but should
enced by one extremely large or small value. not be made in such a way as to influence the
A commonly used measure of variation is the results in a given direction.
standard deviation. The standard deviation is
defined as: Estimating the population
parameters
¥ (x;-X)?
n-1 A researcher studies a sample in order to make
statements (inference) about the population from
and thus gives an indication of the average dis- which the sample was selected. The summary
tance from the mean. The variance is the square statistics outlined above describe the sample
of the standard deviation. data, but what can be said about the population?
A robust alternative to the standard deviation is A parameter is a characteristic (for example, the
the interquartile range (75th percentile minus 25th mean) of a population. A statistic is a character-
percentile). The standard deviation is used with the istic of a sample of the population, and is an es-
mean, the interquartile range with the median. timate of the corresponding parameter in the
Thus, if the data is asymmetric or if there are out- population (for example, the sample mean es-
liers, the median and interquartile range (or 25th timates the population mean). However, as the
and 75th percentiles) should be used as summary simple example in Table 11.3 indicates, different
measures. samples of the same size randomly selected from
a given population will not necessarily provide
Conclusion the same estimates, nor will they provide es-
Summary statistics should give a true representa- timates which are identical to the population
tion of the data. The distribution of the data parameters. This is due to random variation in
should therefore be considered (through some any sample and is known as sampling error. If the
111
eee RRC Tela Cer from

the same population
The population consists of 50 individuals. Two characteristics are of interest: age and sex
Individual Age Sex Individual Age Sex

q 55 EF 26 81 M
2 38 M 2/1 63 F
3 63 M 28 87 M
4 1s F . 29 94 M
5 62 M 30 68 M
6 46 F BL #1 F
i 59 F 32 69 M
8 a2 r 33 46 M
9 65 M 34 tt F
10 64 M 20 87 M
a1 50 F 36 62 Fr
42 | 55 M 37 94 M
13 67 M 38 52 M
14 69 M 39 81 M
15 38 F 40 19 F
16 59 M 41 74 M-
17 69 F 42 fae M
18 52 _M 43 63 F
19 46 F 44 19 M
20 64 M 45 £21 F
21 74 M 46 63 F
22 50 e 47 57 M
23 | 67 M 48 65 M
24 15 F 49 69 M
25 46 E 50 62 M
Two simple random samples of size 10 are selected
Sample 1: individuals 17,14,46,11,39,30,02,08,15,22

Sample 2: individuals 32,08,16,13,24,48,23,42,14,21
Sample 1 estimates : 4 males out of 10 subjects (40%); mean age 57,8 years
Sample 2 estimates : 8 males out of 10 subjects (80%); mean age 66,2 years
Population parameters:
30 males out of 50 subjects (60%); mean age 64,6 years
entire population were studied, the population on imprecise estimates.

parameters (for example, population mean) The precision of an estimate depends on.both
could be calculated exactly. By studying only a the variability of the data (if all the data values are
sample, the researcher obtains imprecise esti- within a small range, there is small underlying
mates. It is important, therefore, to calculate the variability in the measurement, and the estimate
precision of a sample estimate (for example, the will consequently be more precise) and the
precision of a mean, mean difference, proportion, sample size (the smaller the sample, the less
or odds ratio). Statistical techniques are needed precise the estimate). The standard error is a
to enable researchers to make statements based measure of the (im)precision of a sample statistic
112
as an estimate of the population parameter. The includes 68% of the observations, and the area
standard error is calculated using the sample size within 1,96 standard deviations on either side of
and the variability of the data. For example, the the mean includes 95% of the observations.
standard error of the sample mean is ja The Many biological variables, for example, birth-
sample estimate, however, remains the best weight, are in fact approximately normally distri-
single estimate of the population parameter. buted. In small data sets, it may be difficult to
Before we discuss how the standard error can validate the assumption of normality adequately,
be used to make a statement about the popula- and in small samples with skew distributions,
tion parameter, we need to know something normality cannot be assumed. In such cases,
about probability distributions. nonparametric methods of analysis need to be
considered (see Parametric versus nonparametric
Probability distributions methods below).
A probability distribution is a theoretical distribu- Other important probability distributions are
tion which provides information on the probabil- the Binomial distribution for binary variables, the
ity of observing different ranges of values for a Poisson distribution for count data, and the expo-
given variable. A probability distribution can be nential distribution for continuous variables.
thought of as a data distribution based on an in-
finitely large sample. Such probability distribu- Confidence intervals
tions exist for categorical as well as numerical As discussed before, if various samples of the
variables. Probability distributions are commonly same size are drawn from a given target popula-
characterized by their mean and variance. tion, a range of different estimates will be ob-
One of the probability distributions used most tained. These estimates will follow some theo-
often is the Normal (Gaussian) distribution. Most retical distribution. We use this information to
of the commonly used methods of statistical calculate a confidence interval for the population
analysis (for example t-tests, correlation, and parameter, based on our sample estimate.
regression) are based on the assumption that the A confidence interval gives a range of para-
continuous variables which are analysed follow a meter values considered plausible for the popula-
Normal distribution. The Normal distribution. is tion, based on the sample data, and is a useful
bell-shaped and is symmetric around its mean way of describing the (im)precision of the es-
(Figure 11.10). Furthermore, the area within one timate. A confidence interval is calculated by
standard deviation on either side of the mean using the sample estimate, its standard error, the
Figure 11.10 ur Normal distribution
68% of area
<< 95% of area —_—_—__—_—__>

sd = standard deviattion
113
ei
a a a a
degree of certainty (confidence) we want to asso- about a parameter of one or more populations
ciate with the confidence interval (say 95% or (see Chapter 6: Setting objectives for research). To
99%), and the cutoff value of the appropriate determine whether, for example, two factors are
probability distribution (for example 1,96 in the associated, we need to formulate a null hypo-
case of 4 95% confidence interval based on the thesis (Hg) which can then be tested. The null
Normal distribution). If we calculate a 95% confi- hypothesis is the hypothesis of no association or
dence interval, we can say that, in a series of iden- no difference.
tical studies based on different samples from the A test statistic is then calculated byastatistical
same population, 95% of the 95% confidence formula, using the sample data. When the null hy-
intervals calculated from these studies will pothesis is true (that is, there is no difference or
include the true population parameter. An impre- association), this test statistic follows a certain
cise estimate (with large standard error due to distribution (for example t, x7). By comparing the
small sample size and/or large variability in the test statistic with this distribution, we can deter-
sample data) will lead to a wide confidence inter- mine how likely it is to obtain the observed
val (that is, a wide range of values which are sample data if the null hypothesis were true (that
considered plausible for the population para- is, if there were no association or no difference).
meter). For example, if in a sample of 10 clinic This probability is known as the p-value associ-
attenders, four are found to be HIV positive, the ated with the test statistic. The p-value is the
percentage positive is 40% with a 95% confidence probability of observing the test statistic or a
interval ranging from 12% to 74% (indicated as more extreme result if the null hypothesis is true.
[12% ; 74%] or [12%-74%]). The confidence inter- The p-value is thus the probability of finding an
val is wide since the sample is small. We are thus association or a difference, if there is in reality no
uncertain what the HIV prevalence is. If, however, association or no difference.
the sample consists of 100 and 40 are HIV posi- If the p-value is large (for example, p = 0,80), the
tive, the percentage positive is 40% with a 95% data is consistent with the null hypothesis. This
confidence interval ranging from 30% to 50%, a implies that the null hypothesis cannot be re-
narrower confidence interval than in the case of jected, using the observed data. However, a large
the smaller sample. (See Table 11.4 for a further p-value does not imply that the null hypothesis
example.) can be accepted. For example, the sample may be
Methods for calculating confidence intervals too small to detect the difference that really exists.
for most epidemiological applications (for exam- This is known as the Type II error.
ple, means, differences between means, medians, If the p-value is small (for example p = 0,01) the
proportions, odds ratios, correlation coefficients, data is unlikely to have been as observed if the
and survival time) are described, with examples, null hypothesis is true. In other words, there is
in Statistics with Confidence by Gardner and evidence that there is indeed a significant asso-
Altman (1989). ciation or difference, and the null hypothesis is
rejected.
Hypothesis (significance) testing An arbitrary cut-off point of the p-value,
If the researcher is interested in determining the namely 0,05, is often chosen. This cut-off value is
level of a characteristic (for example, the preva- called the significance level of the test, and refers
lence of HIV infection in the Free State), the size to the probability of rejecting the null hypothesis
of any difference between groups (for example, ifit is in fact true (the Type I error). Ifwe are trying
the mean age difference of mothers with low to prove that an association or difference exists,
birthweight babies and mothers with normal we would like the probability of finding a relation-
birthweight babies), or the strength of an associa- ship or difference when there is in fact none, to be
tion (for example, the risk of heart disease in small. P-values are often classified as ‘significant’
hypertensive patients compared to normotensive if p<0,05 and ‘non-significant’ (often written in
patients), confidence intervals should be calcu- the literature as NS) if p>0,05. This over-simplistic
lated as outlined above. On the other hand, the use of p-values is now strongly discouraged in the
researcher may wish to test some hypothesis: are medical literature. If the reader of an article sees
two factors significantly associated or are two that a difference between two groups was found
groups significantly different? In this case, the re- to be ‘non-significant’, there is no information
searcher is interested in hypothesis testing. whether the p-value might have been 0,06 or 0,60.
A statistical hypothesis is an assumption made Clearly p-values of 0,06 and 0,60 should be inter-
114
11 an introduction to data presentation, analysis, and interpretation
preted differently and both should not be simply the confidence interval is interpreted to decide
classified ‘non-significant’. Exact p-values should whether the differenceis clinically important.
be given in papers so that readers can decide for Whereas a significance test cannot indicate that
themselves whether they agree with the re- there is no difference, a confidence interval can. If
searcher on the ‘significance’ of the results. in the above example, a change of three units or
Significance testing can indicate whether there more is taken to be clinically meaningful, the con-
is a statistically significant difference between fidence interval indicates that there was no clin-
two groups, ora Statistically significant associa- ically meaningful change. If the confidence inter-
tion between two variables. As pointed out above, val had ranged from 2 to 4, we would not be able
a significance test cannot indicate that there is no to draw a conclusion, since the interval includes
difference. values which indicate a clinically meaningful
change, as well as values which indicate that there
Statistical versus clinical/public health was no clinically meaningful change.
significance The decision on how large a difference or
Hypothesis testing determines the statistical change is required for it to be clinically meaning-
significance of an observed effect. If two large ful, is based on knowledge of the subject matter.
samples are compared, small differences between In the case of measurements such as blood pres-
groups may be found to be statistically signific- sure, it may be clear what is clinically meaningful,
ant, although such small differences may not be but in other circumstances it may be less clear-
of clinical or public health importance, or prac- cut. For example, if the prevalence of HIV in one
tical value. Similarly, in small samples, large dif- group is 3%, but 5% in the other, is that a meaning-
ferences may fail to reach statistical significance, ful difference?
although they seem clinically important. (See Confidence intervals and p-values are closely
Table 11.4 for an example of sample size and related: if the difference between the means of
statistical significance.) A researcher thus cannot two groups is significant at the 5% level (that is,
base decision-making purely on statistical signi- p<0,05), then the 95% confidence interval for the
ficance, but must consider the clinical or public difference will exclude the value zero (that is, the
health significance of the results. This can be confidence interval will also indicate that there is
done by calculating confidence intervals and a difference). Confidence intervals are thus some-
interpreting them clinically, that is, interpreting times also used for purposes of statistical signific-
the size of the difference or association in terms of ance (the result is significant, since the null value
its meaning for the patient, disease, or health ser- is not inside the confidence interval). This is not
vice. the purpose of a confidence interval. If the focus
In a follow-up study to determine associations is on estimation (which is most often the case),
between high density lipoprotein (HDL) choles- confidence intervals should be calculated and the
terol and women’s employment (Haertel, Heiss, range of values interpreted clinically to determine
Filipiak, Doering 1992), it was found that the 761 whether the findings are clinically important.
women who had been homemakers during the
first survey and had remained homemakers Parametric versus non-parametric methods
during the second survey had a mean decrease in If we can assume that our data comes from an
HDL cholesterol of 0,99 mg/dl. The standard error underlying distribution such as the Normal distri-
was 0,41 andastatistical test to assess whether bution, statistical methods (hypothesis testing or
this decrease was greater than zero revealed that confidence intervals) based on the assumptions
the decrease was statistically significant (p = of the underlying distribution can be used. These
0,015). The 95% confidence interval for the mean methods are called parametric, since the underly-
decrease was 0,19 to 1,79. There is clearly a de- ing distribution is assumed to come from a para-
crease, but on the basis of clinical knowledge, it metric family of distributions. Nonparametric
has to be decided whether a decrease of at most methods are used if such assumptions cannot be
two units over a period of more than two years is made. For example, the Mann-Whitney U test is
clinically meaningful on a variable which had a the nonparametric equivalent of the two-sample
mean value of 65 at the first visit. The fact that t-test. These distribution-free methods which
such a small difference is statistically significant is make no assumptions about the data distribution
due to the large sample size. A significance test are often ranking techniques, which focus on the
thus indicates whether there is a difference, but order or ranking of observations, and not on their
115
numerical values. lung cancer). Often the information gathered in

such studies can be summarized in a 2 x 2 table as
What to consider when choosing an follows:
appropriate analysis
There will be no attempt to provide a detailed Diseased Not diseased
outline of which statistical technique to use under Exposed A B A+B
which conditions; we rather give some idea of the Notexposed __C D C+D
factors which need to be considered when choos- A+C B+D A+B+C+D=N
ing a statistical analysis.
Out of a total of N respondents, there are A who
The type of variable being analysed are exposed and diseased, B exposed and not
Categorical and numeric variables require differ- diseased, C not exposed but diseased, and D not
ent statistical techniques. Clearly we would not exposed and not diseased.
summarize a categorical variable by a mean (what To investigate the association, we could per-
is the meaning of a mean sex of 1,2?) Similarly, if form a statistical test, namely a chi-square or
we wish to compare two groups on a categorical Fisher’s exact test. Such a test would determine
variable, we would not use at-test or a confidence whether there is a statistically significant differ-
interval for mean differences, but a x? test, or ence between the proportion of exposed who are
Fisher’s exact test, or a confidence interval for the diseased, and the proportion of unexposed who
differences in percentages. are diseased. The test result will depend on the
sample size. For example, it may be significant in
Assumptions necessary for the procedures a large sample, but in not a small sample (see
As mentioned before, there are many proce- Table 11.4). The test result will merely say
dures which should only be used if the variables whether there is a statistically significant differ-
are normally distributed. All distributional as- ence, and will tell us nothing about how strong
sumptions of the intended analysis should be the relationship between the exposure and out-
investigated to determine whether they are sat- come is. Researchers often want to have an estim-
isfied by your data. A further assumption which ate of the degree or strength of the association (for
has to be considered is whether groups are inde- example, the exposed are twice as likely to devel-
pendent. If controls were matched to cases, or op disease as the unexposed). Such measures of
repeat measurements were made on the same the strength of association, which have the added
individuals, the groups (controls and cases, advantage of not being dependent on sample
before and after measurements) are not inde- size, are described below.
pendent. Let us first assume that the data has been col-
lected by means of a cross-sectional study, and
The characteristic being tested that N individuals have been sampled and cross-
Different techniques are appropriate for differ- classified as to exposure and disease status. Con-
ent characteristics. For example, if normality sidering the hypothetical example with real num-
and independence can be assumed, a two- bers below may make the concepts clearer.
sample t-test would be appropriate to compare
the means of two groups, while an F-test would Diseased Not diseased
be appropriate to compare the variability in the Exposed 20 80 100
two groups. Notexposed 30 270 300
50 350 400
Analyses commonly used in Clearly, among the exposed (A + B) we can say
epidemiology that the risk of disease is A/(A + B) (in the example
20/100 = 0,2 or 20%). Similarly, among the non-
Measures of strength of association exposed (C + D) the risk of disease is C/(C + D) (in
ina2x2 table the example 30/300 = 0,1 or 10%). One measure of
Definitions and interpretation the strength of association between exposure and
Many epidemiological studies set out to invest- disease is the ratio of the risk among the exposed
igate the association between an exposure (for to the risk among the non-exposed. This is called
example, smoking) and a disease (for example, the relative risk (or risk ratio) and is given by
116
A/(A+B) = A(C+D) posed, and thus the relative risk and risk differ-
C/(C+D) C(A+B). ence cannot be calculated. It is also not possible
to calculate the odds of disease among the
In this example, it is thus 0,2/0,1 = 2. exposed and unexposed. A case-control study
does however provide information on the odds of
Since risks are probabilities or proportions which exposure among the cases (A/C) and controls
lie between 0 and 1 (or percentages which lie be- (B/D) which can be used to calculate the odds
tween 0% and 100%), the relative risk is always a ratio:
positive number. A relative risk of 1 indicates that
the risk of disease in the exposed group is the A/G cy VAD
same as in the unexposed group. The exposure B/D BC.
and disease are thus not associated. A relative risk
larger than 1 (as in our example) indicates that the From the above, it is clear that it is very important
exposure is a risk factor, whereas a relative risk to identify the type of study in which the exposure
less than 1 indicates that the exposure is protec- and disease information was gathered in order to
tive. know which measure of strength of association
A risk difference, that is, the difference be- can be estimated. In cross-sectional and follow-
tween the risk among exposed and non-exposed up studies, the relative risk or risk difference
can also be calculated: A/(A + B) — C/(C + D). In should be the measure of preference.
our example, the risk difference is 10% or 0,10. The odds ratio and relative risk are dimension-
The risk difference lies between -1 and 1 (or less and have been criticized because of this fea-
-100% and 100% if the risks are expressed as per- ture. Say, for example, the rate among the ex-
centages) with a difference of 0 indicating no posed is 5 per million and 1 per million among the
association. unexposed, then the relative risk is 5 and the risk
A third measure of the strength of the associa- difference 4 per million. If the rate among the ex-
tion is the odds ratio. The chance of disease can posed is 5 per thousand and among the unex-
be measured by the odds, where the odds of posed 1 per thousand, the relative risk is also 5 but
disease among the exposed is A/B (20/80 = 0,25.in the risk difference 4 per thousand, indicating a
our example) and among the non-exposed C/D more serious increase than in the first case, al-
(30/270 = 0,11 in our example). The odds ratio is though the relative risks are the same. It has
then’ therefore been said that the risk difference is the
appropriate measure of the practical magnitude,
(A/B) = AD in terms of public health importance, of an asso-
(C/D) BC. ciation. However, if one risk factor has a higher
relative risk than another, the first risk factor is
In our example the odds ratio is thus 0,25/0,11 = thought to play a more important role in the
Zeek causation of the disease.
The interpretation of the odds ratio is similar to The attributable risk indicates the proportion
that of the relative risk. If the disease is rare (that by which the rate of disease will be reduced if the
is, if A is very small relative to A + B, and C is very exposure were eliminated. It takes into account
small relative to C + D), the odds ratio approx- the relative risk, as well as how common the risk
imates the relative risk. factor is in the population being studied. So,
If we are analysing data from a follow-up (pro- factor A may have a lower relative risk than risk
spective) study (in which groups of exposed and factor B, but because more people are exposed to
non-exposed are sampled and followed up), the risk factor A, the attributable risk will indicate that
chances of disease and therefore the relative risk, factor A is more important to the health of the
risk difference, and odds ratio can be calculated community. The proportion by which the rate of
as for the cross-sectional study. In a case-control disease will be reduced if the exposure were
study, however, a predetermined number of cases ‘ eliminated is higher for factor A. The attributable
(diseased) and controls (not diseased) are risk is therefore of great importance in health
selected and their numbers may not reflect the services planning, namely in deciding which
proportions of diseased and non-diseased in the risk factors to target for prevention and inter-
population. It is therefore not possible to estimate vention. The attributable risk is calculated as
the risk of disease among the exposed and unex- follows:
Ve
P(RR-1) Controls
1+P(RR-1) Exposed Not exposed
C
where Pis the proportion of the population who a Exposed A B A+B
are affected by the factor. s Not exposed _C D C+D
The risk ratio, risk difference, odds ratio, and e A+C B+D A+B+C+D
attributable risk are estimates and should there- s pairs
fore be reported with standard errors or 95% con-
fidence intervals to indicate the precision of the The odds ratio is then estimated by B/C.
estimates.
More than two levels of exposure
Paired observations If an exposure has more than two levels, an r x 2
The 2 x 2 tables outlined above cannot be used if table can be set up (where r is the number of
observations are paired, for example, in a levels of exposure). For example, daily cigarette
matched case-control study where a control was smoking can be classified as 0, 1-2, 3-4, 5-9, 10
selected for a specific case. The 2 x 2 table is then and more. If the association between the level of
set up as follows, consisting not of the number of smoking and the presence of lung disease is to be
individuals included in the study but the number studied, a 5 x 2 table can be formed. To calculate
of pairs. relative risks or odds ratios, one level of exposure is
chosen as the reference level (generally the level
Table 11.4 Statistical significance, strength of association .

FLOMre Lil (oe Phe
A study is done on 50 subjects to investigate the association between exposure A and disease B.
The following results are obtained:
present absent
A: present 15 (75%) 5 (25%) 20 (100%)
absent 20 (66%) 10 (33%) 30 (100%)
35 50
Null hypothesis : the two factors are not associated.
Atest statistic which follows the X? distribution is calculated to evaluate the association between the
two categorical variables A and B, and the p-value equals 0,53. One would thus conclude that the two
factors are not associated. If, however, the study was done on 500 subjects and exactly the same con-
figuration of results obtained, namely:
B
present absent
A: present 150 (75%) 50 (25%) 200 (100%)
absent 200 (66%) 100 (33%) 300 (100%)
350 150 500
the test statistic results in a p-value of 0,05, thus there seems to be evidence of an association.
The statistical test measures the significance of the association (which depends on the sample size),
but not the degree of association. The odds ratio, on the other hand, measures the degree or strength
of the association, independent of sample size.
The odds ratio in the 50 subject case is 1,50 with a 95% confidence interval of 0,36 to 6,42. For the
500 subjects, the odds ratio is also 1,50, but the narrower confidence interval of O, “ to 2,28 reflects
the better precision of the odds ratio estimate from the bigger sample.
118
with the least risk of disease is taken as the refer- Males Females
ence, thus in this case, the level of 0 cigarettes) Diseased Not Diseased Not
and each of the other levels is compared to the diseased diseased
reference level in turn (thus forming four 2 x 2 Exposed 20 (45%) 24} 44 | 131 (61%) 84) 215
tables). Not
exposed 36 (33%) 73|109} 48 (45%)
Stratification 56 971153 | 179 143 |322
Extraneous factors, such as sex or age, may compli- RR=1,38 RR=1,36
cate the estimation of the measure of association
between exposure and disease (see Chapter 7: Omitting the confounder sex thus introduced a
Study design). By using a stratified analysis, the re- slightly stronger relationship than actually exists.
searcher attempts to adjust for, or remove the effect Confounders can also mask a real association. For
of, an extraneous factor (confounder) which is re- example, sex can be a confounder in the association
lated to both the exposure and the disease. Rather between smoking and hypertension. In this case,
than one (crude) measure of association, one women have a low prevalence of smoking but a high
would calculate measures of association for each of risk of disease, whereas men have a lower risk of dis-
the strata of the extraneous factor and possibly ease, but a higher prevalence of smoking. It may
compute an adjusted (common) measure. therefore seem as if smokers (because they are pre-
Age is often an extraneous factor which has to be dominantly males) have a lower risk of hypertension
accommodated in the analysis. For example, in a than non-smokers (because they are predominantly
cross-sectional study investigating the association females). S
between hypertension and heart disease, it might After stratification, an adjusted (common) rel-
be found that the hypertensives are significantly ative risk using the Mantel-Haenszel method can
older than the non-hypertensives, and those with be calculated. This technique calculates a com-
heart disease older than those without heart dis- mon relative risk by using the stratum-specific
ease. Is the significant association between hyper- relative risks and the stratum size.
tension and heart disease perhaps only due to the The extraneous factor can also act as an effect
fact that the hypertensives are older people and modifier, in which case the relative risks or odds
therefore more likely to have heart disease? We thus ratios in the different strata are markedly differ-
want to remove the effect of age to get a clear pic- ent. This is also often called interaction, as the
ture of the hypertension and heart disease associa- exposure and disease association interacts with
tion. In this case, we would calculate relative risks the extraneous variable, and depends on which
or odds ratios for separate age strata, for example, stratum of the extraneous variable we are invest-
20-29, 30-39, 40-49, 50-59, 60-69, 70+. igating. In the following example, sex is an effect
Confounders can have various effects on the as- modifier for the relationship between insulin and
sociation between exposure and disease. The triglycerides:
confounder can lead to over- or under-estimation
of the association between exposure and disease. Males Females
In the following example, the association be-
high normal high _ normal
tween obesity (exposure) and hypertension (out-
insulinhigh 16(12%) 113
come) is investigated in a sample of QwaQwa resi-
insulinnormal 14(8%) 160 {174 | 5(5%) 96
dents 45 years and older (Mollentze, Joubert 30 273 303 | 14 129 | 143
1994). The crude relative risk is 1,50. RR= 1,54 RR=4,33
Diseased Not diseased

Exposed 151 (58,3%) 108 259 In such a case, you should report the stratum-
Notexposed 84 (38,9%) 132 216 specific relative risks and not attempt to calculate
235 240 475 acommon relative risk.
Having decided to stratify, you might find that
However, sex is a possible confounder, since the stratum-specific relative risks and the crude
relative risk are very similar. In this case, the vari-
women are more likely to be obese, and women
able used to stratify cannot be considered to be a
are more likely to be hypertensive. Sex is thus
confounder, and the stratification can be ignored
associated with the exposure and the disease.
Stratifying by sex, the results are as follows: and the crude relative risk reported.
119
a ec
Correlation clear that there is in fact a very distinct relation-

Measures of the strength of association between ship between the two variables. This relationship
two dichotomous variables have been outlined is not linear, however: where x is less than 100, the
above. If, however, the researcher wants to two variables are negatively correlated, whereas
describe the relationship between two numerical for x above 100, the correlation is positive.
variables, she or he can calculate a correlation co- Correlation does not measure agreement. If, for
efficient. (The term correlation is often used in- example, two examiners have to do blood pres-
correctly, when association between categorical sure readings, and they both evaluate the same 20
variables is in fact what is meant.) patients to determine how well their measure-
Two variables are positively correlated if an ments agree, it is not appropriate to calculate the
increase in one variable is associated with an correlation coefficient. As Figure 11.12 indicates,
increase in the other (for example, an increase in there can be a strong linear relationship (that is, a
height is usually associated with an increase in strong correlation) even if one examiner consis-
weight). If a decrease in one variable is associated tently notes a higher reading than the other.
with an increase in the other, the two variables are Correlation measures whether the readings ‘go in
negatively correlated (for example, lung function the same direction’ (that is, whether if one exam-
decreases as age increases). If a decrease or iner notes a high reading, the other does as well),
increase in one variable is not associated with any not whether they actually get exactly the same
change in the other variable, the variables are not readings. Bland and Altman (1986) describe the
correlated (for example, birthweight and hour of appropriate methods of analysis to measure
birth). The correlation coefficient provides a agreement between numerical variables.
quantitative measure of the relationship between Correlation is closely related to simple linear
two numerical variables. The correlation coeffi- regression. In simple linear regression, the values
cient can range from —1 (perfect negative correla- of one numerical variable (the dependent vari-
tion) to 1 (perfect positive agreement), with 0 able) are predicted by the values of the other vari-
indicating no correlation. able (the independent variable), for example,
The Pearson correlation coefficient (r) is given lung function can be predicted using age. If you
by specifically want to predict one variable from
another, regression should be done. However, if
(2 @&j-X) Yj-Y) / WL@ji-H? Xj-N) you do not want to make predictions, but rather
where X is the mean of one variable, y the mean of wish to describe the relationship between two
the other. The individual values of the two vari- variables, the correlation coefficient should be
ables are indicated by x; and yj. calculated.
If either or both of the variables have outlying
values, a nonparametric correlation coefficient Multivariate analysis
based on the ranks of the observations can be cal- Multivariate analysis encompasses a range of
culated. This is called the Spearman rank correla- statistical techniques which, on the basis of math-
tion coefficient (r,). ematical models, can evaluate the inter-relation-
A high correlation does not imply cause and ship of more than two variables. A mathematical
effect. Furthermore, the correlation between two model is one which describes the relationship be-
variables may be spurious due to the influence of tween variables in mathematical form. For exam-
a third variable. The effect of a third variable can ple, whereas simple linear regression is used to
be removed by calculating a partial correlation predict a dependent (y variable or outcome vari-
coefficient. able) from one independent variable (x variable),
Correlation measures the linear (straight line) multiple linear regression is a multivariate tech-
relationship between two variables. Thus a strong nique which predicts a dependent variable from
correlation coefficient implies a strong linear two or more independent variables, assuming a
relationship. It is therefore important to draw a linear relationship. So, for example, a person’s
scatter plot (see Exploratory data analysis earlier lung function can be predicted from their age,
in this chapter) to see whether the relationship is height and weight.
indeed linear, before calculating a correlation co- Multiple linear regression can only be used if
efficient. Figure 11.11 shows the relationship be- the dependent variable is numerical, for example,
tween two variables. The correlation between the a person’s lung function. However, often the
two equals zero, but from the scatter plot it is dependent variable is dichotomous, for example,
120
11 an introduction to data presentation, analysis, and interpretation
arte wemeOe Teramr acct ee als tig

display before calculating a correlation coefficient
pas
x
2
>
—
o
a
2
oO
7 21 Se OR BT at ae 4G
Observer 2
Two observers were asked to score 16 individuals. A strong positive correlation was found when calculating a correlation
coefficient. The graphical display is more informative, however, indicating that observer 2 consistently gave higher scores
than observer 1.
121
c datapresentation, analysis, and interpretation
died/survived or recovered/not recovered. In

such cases, multiple logistic regression is the Table 11.5 Guidelines on the
appropriate method of analysis. The proportional presentation of results
hazards model, which is used to investigate the
role of independent variables on survival time, is a @ In asurvey, the response rate should be cal-
further example of a multivariate technique. culated and responders compared to non-re-
Multivariate techniques can use data very effi- sponders on key characteristics.
ciently and enable the researcher to identify ¢ If reliability or validity st
which variables from a large set of independent have been included in theda
variables are related to the dependent variable, phase, the quality of the information
after adjusting for other important independent - gathered should be described by outlining
variables (confounders). However, these tech- the validity and reliability.
niques are complicated and should not be per- Adequate description ofthe basicdata
formed without a thorough understanding of the should precede and complement the stat
mathematical models and assumptions under- istical analysis. There is no point in stating
lying them. This understanding is also required ‘the difference was significant’ without sum-
for an accurate interpretation of the computer marizing what the difference was.
output of such analyses. A statistician should be The basic description will also indicate
consulted for such analyses. how representative the sample |is of the
As Rothman (1989) points out, these tech- population. /
niques can place a barrier between the researcher ¢ Distributional sccuripuene a statistical
and the data. Other methods of analysis (for methods used must be justified.
example, stratification) can bring about a closer ¢ Confidence intervals should be presented if
understanding of the data. It is therefore import- the aim of the investigation is estimation.
ant to do the simpler analyses first, in order to get
to know the data and its irregularities, before an
appropriate multivariate analysis is considered.
many analyses themselves.
Table 11.5 gives some guidelines on the presen-
Conclusion tation of results, emphasizing that the basic data
This chapter has provided an introduction to should be displayed and described clearly, and
statistical concepts, and we encourage readers to that the data should be explored. Table 11.6 is a
refer to the recommended reading for further copy of the checklist for statistical review of
details. Not every researcher needs to become a general papers submitted to the British Medical
statistician to be a good epidemiologist. Rather, Journal (a more detailed checklist is used for
researchers should make a point of working papers dealing with clinical trials). This checklist
closely with statisticians on research projects. outlines the major statistical questions re-
However, an understanding of the terminology searchers should consider. Clearly, a statistical
and principles used by statisticians will make the analysis can only be considered good if the study
collaboration with statisticians more fruitful and design and sampling of the project were appro-
enjoyable, and will enable researchers to handle priate.
122
11. = anintroduction to data presentation, analysis, and interpretation
pel Ce er amie UC merece): we

British Medical Journal
EVE CT INO ee a Ge ee Hate GT EVICW! ace

ee es
Design features .
1 Was the objective of the study sufficiently described? Yes Unclear No
2 Was an appropriate study design used to achieve Yes Unclear No

the objective?
3 Was there a satisfactory statement given of source Yes Unclear No

of subjects?
4 Was apre-study calculation of required sample size reported? Yes No
Conduct of study
5 Was a Satisfactory response rate achieved? Yes No Unclear
Analysis and presentation :

6 Was there a statement adequately describing or referencing Yes No
all statistical procedures used?
7 Were the statistical analyses used appropriate? Yes Unclear No
8 Was the presentation of statistical material satisfactory? Yes No
9 Was the conclusion drawn from the statistical Yes Unclear No

analysis justified?
Recommendation on paper
10 _ Is the paper of acceptable statistical standard for publication? Yes No
11. If ‘No’ te question 10, could it become acceptable with Yes No

suitable revision?
FREVIC WER eo irer ies seacatecs ee ere eee eile tas

(Taken from Statistics with Confidence, Gardner and Altman 1989)
123
12 Integrating
epidemiological
concepts
The relationship between sample ferent samples from the same population differed
size, study design and precision of from the population parameters. Using the same
population data, Table 12.1 shows how the confi-
estimates dence intervals become narrower as the sample
Introduction size increases.
Epidemiological studies are fundamentally about However, the magnitude of the benefit in terms
measurement. Variability is intrinsic to biological of greater precision diminishes as the sample size
systems and the process of sampling from a increases. A critical sample size is reached when
population also introduces variability. In this sec- the standard error does not decrease substan-
tion of the chapter, we deal with the issue of pre- tially, even though the sample size increases. This
cision, sometimes referred to as ‘random error’ or situation, which exemplifies the ‘law of diminish-
‘reliability’. ing returns’, highlights the importance of select-
Precision and validity are two separate con- ing the most appropriate sample size. If the
cepts, since it is possible to be precise yet not sample is too small, precision is compromised,
valid. If, for instance, a scale for measuring weight while if the sample is too big, resources are being
in an anthropometric study was not calibrated, wasted.
and all measurements started from 1 kg instead of At one extreme, when the entire population is
0 kg, the mean weight would not be valid since it sampled (that is, a census is taken), the sample
would be 1 kg more than the true mean weight. mean is the population mean. Generally, increas-
However, when the same child is weighed repeating the sample size also has the effect of moving
edly on the same scale, the scale produces iden- the sample mean closer to the population mean
tical weights, demonstrating that the scale is pre- (Table 12.1).
cise even though the weights are not valid. Preci- More importantly, sample size plays a substan-
sion refers to the degree of variability around an tial role in determining the standard error, since
estimate of a parameter, and is reflected by the the formula to calculate the standard error is the
width of the confidence interval. standard deviation divided by the square root of
The precision of an estimate is affected by sam- the sample size. For example, in the case of the
ple size. The more subjects studied, the greater the mean, the standard error is given by:
degree of precision of the estimate. This is reflected
in narrower confidence intervals as sample size in- ze & (xj-X)?
creases. Precision is also influenced by study \a n=1
design. The degree of precision of an estimate can
be increased if the study is designed efficiently. The large influence of the sample size in the
denominator of the formula above, makes sample
Sample size size an important determinant of the width of the
In Table 11.3 in Chapter 11, an example was pro- confidence interval, and therefore of the level of
vided which showed how the estimates from dif- precision. When preparing a study protocol,
124
12 integrating epidemiological concepts
Table 12.1 Estimates yielded by increasing eT Cy TP het)
The population of 50 individuals is provided in Table 11.3 in Chapter 11.
Sample 1: Individuals: 17, 14, 46, 11, 39, 30, 02, 08, 15, 22
Sample 2: Individuals: (Sample 1) and 37, 10, 45, 33, 36, 07, 26, 09, 42, 25
Sample 3: Individuals: (Sample 2) and 24, 44, 23, 21, 12, 16, 01, 03, 34, 13
Sample 1 Sample 2 Sample 3

Sample size
Mean age 57,8 62,1 63,2
Standard deviation 14,4 14,9 13,1
Standard error 4,5 3.3 2,4
95% confidence interval (47,5;68,1) (55,1;69,0) (58,3;68,1)
% Males 40% 50% 56,/%
95% confidence interval (9,6%; 70%) (28,1%;71,9%) (39%; 74,4%)
There are 60% males in the population and the mean age is 64,6 years.
careful consideration should be given to the a 10% margin of error, requires a sample size of 97
calculation of the sample size (see Chapter 8: if simple random sampling is used, but a sample
Sampling). size of 210 if cluster sampling of 7 per cluster is
used (assumed design effect of 2). However, in
Design considerations certain instances, such as rural areas where the
For a given sample size, it is possible to improve total number of potential subjects is not known,
the tevel of precision in the study by designing the the larger sample size for a cluster survey may be
study differently, namely more efficiently. This cheaper and logistically simpler than a simple
section will present some ways in which precision random sample survey with a smaller sample
may be improved without increasing the sample size.
size. Many of the issues raised in this section are Increasing the proportion of subjects with the
to be considered as trade-offs between the need exposure being studied can improve precision. In
for precision and the cost or feasibility of con- a cross-sectional survey to assess the association
ducting the study. This approach of trying to con- between having pierced ears (for earrings) and
sider the costs of the study on the one hand, and hepatitis B virus infection, the following was
the level of precision on the other, is sometimes found:
referred to as the ‘efficiency’ of the study design.
Greater efficiency is achieved if more precision is Hepatitis B virus infection
obtained with the same number of (or fewer) Yes No
subjects. Pierced ears Yes 6 14 20
The sampling strategy selected when designing No 38 342 380
a study influences the degree of precision. Cluster 44 356 400
samples are prone to distortions due to the simil-
arity between subjects in a cluster (intra-cluster Prevalence of pierced ears = 5%
homogeneity). Therefore, in order to obtain the Relative Risk (RR) = 3 (95% confidence interval:
same degree of precision as a simple random 1,44; 6,25)
sample, larger samples are required. For example, The same study, conducted in a community
a community-based study to assess vaccination where ear piercing was encouraged, produced the
coverage, which is expected to be about 50% with following results:
c¢ data presentation, analysis, and interpretation
ng ng ee a
Hepatitis B virus infection recruiting a case and a control subject is the same,
Yes No altering the ratio of cases to controls is efficient
Pierced ears Yes 30 70 100 and improves precision, as long as the smallest
No 30 270 300 cell in the 2 x 2 table increases. In most instances,
60 340 400 however, recruiting a control is cheaper and
quicker, particularly where the disease is rare. In
Prevalence of pierced ears = 25% this instance, increasing the number of controls
RR=3 (95% confidence interval: 1,91; 4,72) for a given number of cases leads to higher preci-
Both studies had a sample size of 400, an RR of 3 sion. However, the law of diminishing returns
and a 10% prevalence of hepatitis B virus infec- applies and there is a critical point beyond which
tion among subjects without pierced ears. How- a substantial increase in the number of controls
ever, the latter community has a higher preva- has little benefit in terms of improved precision.
lence of pierced ears, thereby leading to a higher
precision in the estimate of the RR as reflected by Conclusion
the narrower confidence intervals. However, it is Precision should not simply be equated with
not simply increasing the prevalence of the expo- sample size. The precision of the estimates ob-
sure (in this example, pierced ears) that improves tained from a sample depends on sample size and
precision; rather, it is the effect that increasing the way the study was designed. For a fixed sam-
the prevalence of the exposure has on increasing ple size, improved efficiency is possible with care-
the value of the smallest cell in the table. In the ful design of the study. While sample size plays an
first table, the smallest cell is 6, while in the important part in determining the level of preci-
second table, the smallest cell is 30. To demon- sion, study design becomes particularly import-
strate this point further, if the prevalence of ant when resources preclude an increase in the
pierced ears was 50%, the following results would sample size.
have been obtained:
Validity and bias: in search of truth
Hepatitis B virus infection
Yes No Introduction
Pierced ears Yes 60 140 200 Bias was described in Chapter 4. This section aims
No 20 180 200 to synthesize information and provide some
80 320 400 additional insights. Bias is a deviation from that
which is correct and true. Bias is distinct from
Prevalence of pierced ears = 50% precision; precision refers to repeatability, while
RR=3 (95% confidence interval :1,88; 4,78) bias refers to systematic deviation from the truth.
In this table, the prevalence of pierced ears Bias is also referred to as ‘systematic error’ as
increases to 50% from 25% in the previous table. opposed to ‘random error’, which is another term
Yet the precision is lower, demonstrating that infor precision. Validity is the absence of significant
creasing the level of exposure does not uniformly bias.
lead to improved precision. An increase in pre- A small degree of bias may be inevitable in re-
valence has led to a decrease in the size of the search, for example, in almost all studies some of
smallest cell. the selected subjects may choose not to partici-
A study in which a higher proportion of sub- pate (participation bias), with the result that the
jects has the exposure leads to a higher level of sample does not fully represent the study popula-
precision, as long as the smallest value in the cells tion. While the presence or absence of bias should
of the 2 x 2 table increases as well. On the other be evaluated in a study, it is more important to
hand, the disadvantage of purposefully selecting assess the magnitude of the bias and the direction
a community with a higher prevalence of expo- in which it is likely to alter the study findings.
sure in order to improve the precision of the esti- The first step in assessing the validity of a study
mates is that the findings may no longer be estimate is to identify potential biases. The next
generalizable, since the study subjects may no step is to assess whether each bias is large or
longer represent the wider population. small. Then each bias needs to be assessed in
A similar benefit is achieved by altering the terms of whether the bias could reduce the esti-
ratio of cases to controls in order to improve pre- mate, exaggerate the estimate, or do either. The
cision in a case-control study. Where the cost of final step is to identify methods by which each
126
12 = integrating epidemiological concepts
bias could have been reduced (for example, cali- of coffee a day. It also transpired that those who
brating the scale after every 20 subjects have been drank more than four cups of coffee per day also
weighed) and determine if adequate steps were smoked more cigarettes. Since it is known that
taken in the study to reduce bias. smoking is associated with ischaemic heart
The presence of a large bias in a study (for disease, it is possible that heavy coffee drinkers
example, the bias introduced by a low response had more myocardial infarction emergencies be-
rate of only 30%) seriously undermines the valid- cause of their smoking, and not because of their
ity and value of the findings. Every effort should coffee consumption. This is an example of bias
be made to reduce and avoid bias in a study, for due to confounding.
example, by conducting repeat follow-ups with There are many kinds of bias, but all biases can
incentives to increase the response rate. be classified into one of three categories: selec-
tion bias, information bias, and confounding.
Types of bias This section deals with the first two types.
In a case-control study to assess whether passive
smoking during the antenatal period was asso- Selection bias
ciated with stillbirth, mothers who had had still- Representative samples are drawn from a popula-
births were interviewed soon after the birth. A tion in order to make the findings of the study
control group of mothers who had had normal applicable to the population. In some instances,
births were also interviewed soon after birth to the subjects being studied do not represent the
get a history of passive smoking during the ante- population from which they were selected. The
natal period. The verbal histories of exposure two main ways in which selection bias occurs is
were compared to urinary cotinine levels (a bio- through a flaw in the sampling procedure or
logical measure of cigarette smoke exposure used through non-participation of certain subjects,
here as a ‘gold standard’). It was found that many either at the start of the study (non-responders), or
of the women with normal births underestimated during the course of the study (loss to follow-up).
their exposure to passive smoke, while the Sampling bias, which is one form of selection
women with stillbirths overestimated their expo- bias, is covered in Chapter 8. In summary, incom-
sure to passive smoke. This is an example of recall pleteness of the sampling frame and the use of
bias. The women who had just been through the non-random sampling procedures are the main
traumatic experience of having astillbirth were sources of sampling bias. In an opinion poll for
searching for antenatal factors to blame for their the 1994 South African elections, interviews were
loss, with the result that they overestimated conducted at shopping malls and showed similar
smoke exposure. On the other hand, the women levels of support for the two major political
with normal healthy babies were too busy enjoy- parties. The sample was however biased, since
ing their newborns to think back and consider shopping mall samples under-represent the poor
whether they were exposed to smoke during the sections of the community, as well as rural popu-
antenatal period. lations, while urban, middle-class people are
In a cohort study to assess whether obesity is overrepresented. This sample did not represent
associated with hypertension, the blood pressure the voters of South Africa, since the sampling
of obese and non-obese subjects was measured frame did not include all voters.
using the same sphygmomanometer and cuff. In a descriptive survey to assess vaccination
Since this cuff was not big enough for obese arms, coverage in Durban, the register of electricity
there was a systematic exaggeration of the blood consumers was used as the sampling frame and a
pressure levels among those obese subjects with random sample of houses from the register were
large arms. This bias led to an artificially raised visited to obtain information on vaccination
relative risk of the association between obesity coverage of children in the household. This study
and hypertension. This is an example of measure- found a vaccination coverage level which was
ment bias. higher than all previous estimates. This was
In a cross-sectional study of coffee consump- ascribed to selection bias as a result of the exclu-
tion and history of emergency medical care for sion of houses without electricity (for example,
myocardial infarction, it was found that subjects shacks in squatter settlements), where vaccina-
who drank more than four cups of coffee a day tion coverage is known to be low.
had twice as many myocardial infarction emer- In studies which include a representative
gencies as subjects who drank less than one cup sample of homes, non-participation becomes an
127
ee
issue when several homes are too difficult to antibodies. The test is 99% sensitive and 98% spe-
reach, or no one is found at home, even with cific. This means that for every 100 HIV infected
repeat visits. The exclusion or replacement of individuals who are tested, one will be misclassi-
these homes can lead to a biased sample, since fied as HIV negative. Similarly, for every 100 truly
these very hard-to-reach homes often occur HIV negative subjects, two will be misclassified as
where health status is poor and access to health HIV positive (see below).
services is difficult. From a practical point of view,
the study protocol should include a procedure for True status
dealing with hard-to-reach and locked homes. HIV+ HIV-
For example, after two visits, a non-responding ELISA result HIV+ 99 2 101
home may be replaced with the next closest HIvV- 1 98 99
house. The extent of replacement must be docu- 100 100 200
mented and included in the discussion of the
main study findings. Sensitivity = 99%
In a community-based cohort study, 1 000 Specificity = 98%
adults were followed up annually to assess the False positive rate = 2%
incidence rate of ischaemic heart disease. It was False negative rate = 1%
found that over 10 years, 30% of the subjects
moved out of the study area — many moving to In an analytic study, the exposure variable, the
wealthier suburbs. This 30% of subjects who were outcome variable, and/or the related variables
lost to follow-up left behind a sample which was could be misclassified. (Misclassification of
biased. The bias ocurred due to the selection related variables such as confounders is beyond
process of who left; in this case, most of the the scope of this book; more information on this
upwardly mobile business-orientated people, can be obtained from Ahlbom and Norell (1990):
mostly smokers with type A personalities, left the Introduction to Modern Epidemiology.) When the
area. Ischaemic heart disease is more common in degree of misclassification of the exposure vari-
this group, with the result that the cohort study able is influenced by the outcome variable, differ-
would underestimate the incidence rate for ential misclassification of exposure is said to
ischaemic heart disease if only the remaining 70% have occurred. When the degree of misclassifica-
of subjects were studied. tion of the outcome variable is influenced by the
Selection biases are often subtle and it is exposure variable, differential misclassification
important to pay particular attention to the sam- of outcome has occurred. Non-differential mis-
pling design in order to reduce selection bias to a classification occurs when the degree of mis-
minimum. classification of either the exposure or outcome
variable is not dependent on the other.
Information bias
Information bias refers to systematic inaccuracies Non-differential misclassification
in measurement, recording, management, and Differential and non-differential misclassification
analysis of data. Such inaccuracies lead to mis- can have markedly different effects on the esti-
classification, for example, subjects who have mates obtained in a study. Non-differential mis-
been exposed to the risk factor being studied may classification of either exposure or outcome
be misclassified as unexposed. Misclassification usually biases the study estimate towards the
occurs when the measurement ascribed to a sub- null, that is no effect or RR=1.
ject is incorrect and the margin of error is suffi- Extending the previous example to condoms
ciently large to lead to the subject being placed in and HIV infection, in a study of 10 000 subjects
a different category during the data analysis. the following results were obtained:
Misclassification is most simply understood in
terms of the sensitivity and specificity of the ELISA
measurement method used. (See Figure 9.7 in HIV+ HIV-—
Chapter 9 for more details on sensitivity and Condom use Yes 100 1900 2 000
specificity.) For example, in a study to assess No 800 7200 8000
whether condoms protect against HIV infection, 900 9100 10000
the presence or absence of HIV infection is deter-
mined by an immunological test (ELISA) for HIV RR=0,50 (95% confidence interval: 0,41; 0,61)
128
12 integrating epidemiological concepts
Based on the HIV ELISA results, there appears misclassification should be minimized by using
to bea substantial protective effect of condoms in the most accurate measurement methods.
this study, since 5% of condom users and 10% of
non-users were HIV positive. Differential misclassification
Since the level of inaccuracy of the ELISA result Differential misclassification of exposure occurs
is not dependent upon whether condoms were when the degree of misclassification of the expo-
used, the HIV status has been non-differentially sure variable varies across strata of the outcome
misclassified. The misclassification of HIV status variable. Similarly, differential misclassification
can be crudely taken into account by adjusting for of outcome occurs when the degree of misclassi-
99% sensitivity and 98% specificity of the ELISA fication of the outcome variable varies across
test. strata of the exposure variable. Differential mis-
classification can bias the study estimate towards
ELISA or away from the null, unlike non-differential
HIV+~ HIV-— misclassification, which usually biases the study
Condom use Yes 63 1937 2000 estimate towards the null.
No 664 7336 8000 Earlier in this chapter, an example of a case-
727 9273 10000 control study to investigate the association be-
tween stillbirths and passive smoking was used to
RR =0,38 (95% confidence interval: 0,29; 0,49) demonstrate information bias. This investigation
The value of 63 for the first cell in the above included 100 women who had experienced still-
table is derived by adjusting as follows: births and 200 women with healthy newborns.
@ the sensitivity of 99% means that 1 of every 100 The results, based on a verbal history to assess
truly HIV infected has been incorrectly labelled passive smoking, were as follows:
HIV negative; adjusting for this among condom
users results in 101 HIV positive and 1 899 HIV Stillbirth
negative. Yes ___No
@ the specificity of 98% means that 38 (2% of Verbalhistoryof Yes 30 40 70
1 899) HIV negative condom users have been passive smoking No (0. et60230
incorrectly classified as HIV positive; adjusting 100 200 #300
for this results in 63 (101 minus 38) HIV positive
and 1 937 HIV negative condom users. OR = 1,71 (95% confidence interval: 0,95; 3,08)
repeat the above steps for condom non-users. An association was found, although not statist-
ically significant, between passive smoking and
The above adjustment to remove the non-differ- stillbirths when verbal history was used to
ential misclassification emanating from the inac- measure passive smoking. In addition, urinary
curacy of the Elisa test, results in an even larger cotinine was measured to corroborate the history;
protective effect. That is, the non-differential mis- and produced the following results:
classification of HIV status biased the RR towards
the null. Even though the HIV Elisa test has a high Stillbirth
degree of accuracy, namely 99% sensitive and Yes _No
98% specific, the effect on the RR is substantial, Urinarycotinine Yes 25 50 75
moving it from 0,38 to 0,5. No Co 150 225
Most measurements undertaken in a study are 100 200 #300
not 100% accurate, so there is some degree of in-
accuracy. As long as this measurement inac- OR = 1 (95% confidence interval: 0,55; 1,8)
curacy leads to non-differential misclassification, The biological measurement of passive smok-
any non-null finding in a study could not have ing revealed that five women with stillbirths had
been the result of this measurement bias, since overestimated their exposure to passive smoke,
non-differential misclassification usually biases while 10 women with healthy newborns had
towards the null. In statistical terms, non-differ- underestimated their exposure. This is an exam-
ential misclassification generally leads to a Type ple of differential misclassification; the misclassi-
I] error and not a Type |error. In order to increase fication of exposure differs across the outcome
the probability of finding an association where categories. It is noteworthy that only 5% of the
one truly exists, the amount of non-differential subjects had been misclassified, yet the effect on
29
a
the odds ratio is substantial — the odds ratio of 1 ence of asthma among both those who had been
is biased to 1,71. exposed and those who had not been exposed to
In a cross-sectional study investigating the rela- passive smoke. However, the overestimation was
tionship between passive smoking and asthma substantially higher among subjects who had
(measured by peak expiratory flow rate), 1 000 been exposed to passive smoke, resulting in dif-
children were investigated and the following ferential misclassification. The effect of the bias in
results were found: this study was to exaggerate the association be-
tween passive smoking and-asthma.
Asthma Misclassification is often difficult to avoid be-
Yes No cause of inherent inaccuracies in many measure-
Passivesmoking Yes 160 240 400 ment tools. When misclassification cannot be
No 60 540 600 avoided, the design of the study should ensure
220 780 #1000 that the misclassification arising from measure-
ment inaccuracy is non-differential.
RR =4 (95% confidence interval: 3,06; 5,23)
It was later learned that the fieldworker who Conclusion
performed the peak expiratory flow rate measure- Selection bias produces results that may not
ment knew the passive smoking history of the be generalizable to the study population. In
subjects. There was concern that he may have analytic studies, a bias in the selection of one
biased the peak flow measurement by making group, for example, the control group in a case-
more effort when performing the peak flow control study, can produce results which are not
measurement on subjects who had a history of valid.
passive smoking. To assess this potential bias, the Information bias manifests in misclassification,
measurements were performed again by another which can be interpreted by the sensitivity and
fieldworker, who was blinded to the subject’s specificity of the measurement. When the mis-
smoking history status. The repeat study pro- classification is non-differential, the study esti-
duced the following results: mate is usually biased towards the null. Should a
study demonstrate an association in the presence
Asthma of non-differential misclassification, then the
Yes- No association is probably much stronger. Differen-
Passivesmoking Yes 100 300 400 tial misclassification can bias the study estimate
No 50 550 600 either towards or away from the null.
150 850 1000 Selection and information bias call into ques-
tion the validity of the study findings. Every
RR=3 (95% confidence interval: 2,19; 4,11) attempt should be made to reduce both these
The first fieldworker overestimated the preval- biasés to a minimum.
130
1 418 Knowledge, attitude, belief, and
practice poe er
4 19 Quaativemethodology a"
introduction - /
_ D Skinner, H van der Walt
ey
og
a
RES
13 The use of
routinely
available data in
epidemiological
studies
Introduction migration, population sizes can change dramat-

Routinely available data is data collected on an ically over relatively short periods (particularly in
ongoing basis or at regular intervals through sur- informal settlements), rendering census figures
veillance systems or registration procedures. This out of date in such areas. Data can be inaccurate,
information is used by health professionals and as in the case of cause of registered deaths, where
policy-makers at various levels to monitor ser- high percentages of deaths are classified as ‘signs,
vices and formulate policy. symptoms, and ill-defined conditions’ (see Chap-
Routinely available data is collated regularly ter 20). When using published data, one must be
and can provide a rich source of health-related aware of the possibility of changes in definitions
information. In this chapter, we will focus on or coding practices over time (see Example 20.3).
information which has already been collated. Keeping these problems in mind, the researcher
Generally, large numbers are collected, with wide can however gain important insights by using
coverage (for example, nationally registered routinely available data sensibly. Table 13.1 lists
deaths). Since the researcher using such data routinely available South African data, their
often spends little or no time on data collection, sources, and asummary of their limitations.
most effort can be channelled into a thorough Various types of epidemiological studies can be
analysis. It is, however, a disadvantage that the re- performed using routinely available data.
searcher plays no part in the data collection, since Descriptive studies of the prevalence or incidence
one normally gets a feel for the quality of one’s of diseases and health resources availability are
data during the collection phase. Researchers most commonly conducted. However, analytical
who make use of routinely available data should studies can be conducted; for example, to deter-
ensure that (in addition to acknowledging the mine factors related to TB treatment defaulting at
source), they know exactly how the data was col- a clinic (see Example 13.1). The impact of inter-
lected and what they represent: all small print in vention programmes can also be evaluated using
the published reports should be scrutinized! The routinely available data.
researcher should also, as part of the analysis, Because of legislation, routinely available data
carry out quality checks if possible (see Example was often collected according to population
14.1). group (race), but not according to other import-
Routinely available data can be of poor quality ant socio-economic indicators. Researchers who
because of incompleteness; for example, not all analysed data on the basis of population group
notifiable cases are in fact notified, and censuses were often criticized for promoting a racial cate-
of populations can be undercounts. Routinely - gorization of the population. Careful attention
available data can also be out of date. To be able needs to be paid to the appropriate use of race as
to calculate disease rates in an area, an estimate a variable for analysis. Without being racist, race
of the population size is needed. Official censuses can be used when genetic factors play a role, or
are generally only conducted at certain time where issues of racial or ethnic equity are being
intervals (usually every five or 10 years). Due to analysed. As the government strives to undo the
133
d common approaches in epidemiology
ey eee MM eM CMCC ACC cL

ATT WNa(er:|
TYPE SOURCE LIMITATION
MORTALITY
— national Death reports by Central Statistical Summary data
Services (yearly) Excluding TBVC* until 1994
Under-registration and misclassification
of cause for Africans in particular
Historical death tapes MRC, UCT Excluding TBVC until 1994

(1968 onwards) Under-registration and misclassification
of cause for Africans in particular
— local Annual reports by Medical Officers of Summary data

Health (yearly) Selected urban areas only
Misclassification of cause of death
MORBIDITY
— national Notifiable (mainly infectious) conditions Summary data
reported in Epidemiological Comments Under-reporting
of Dept of Health (monthly) Regional variation in diagnosis
reporting practices
Cancer register kept by SAIMR Histology based, i.e. only cases
(yearly report) confirmed by histology
Virology surveillance from all RSA Under-reported
laboratories reported by the National Only public laboratories
Institute for Virology (monthly)
SASPREN: surveillance conducted Not a random sample of practitioners
by volunteer primary care practitioners
— local Notifiable conditions reported by local Summary data
authorities (Annual reports) Selected urban areas only
Under-reporting
Hospital discharge records No knowledge of the exact population data
Diagnoses not standardized
POPULATION
— national Census reports and data by Central Excluding TBVC until 1994
Statistical Services (5—10 yearly) Under-enumeration
Intercensual estimates by Central Excluding TBVC until 1994
Statistical Services (periodically) Under-enumeration
HSRC estimates and projections
(irregularly)
Population register (ongoing) Untested
Development Bank reports on Summary data
‘homeland’ areas (irregularly) Under-enumeration
134
13 = the use of routinely available data in epidemiological studies
Table 13. 7 (continued) Type, sources, and limitations of chet

available data in South Africa
TYPE SOURCE LIMITATION
BIRTHS
— national Central Statistical Service reports Excluding TBVC until 1994
(yearly) Under-registration
Out of date
Population register (ongoing) Untested
— local Annual reports by Medical Officers Selected urban areas only
of Health (yearly) Under-reporting
HEALTH CARE
South African Medical and Dental Postal address and not address of
Council's register of doctors and practice
dentists (ongoing) (1975, 1983, 1986, Inconsistent inclusion of practitioners
1989 at MRC) outside RSA
Census of doctors by Central Statistical
Services (1979, 1983)
South African Nursing Council’s Inconsistent inclusion of
register of enrolled and registered non-practising nurses
nurses (angoing) (1983, Postal address and not address of
1986, 1988, 1989 at MRC) practice
South African Hospital and Nursing Summary data
Yearbook including hospital and
clinical resources (yearly)
RAMS — register of medical aid Limited information
doctors (ongoing)
RAMS — medical aid claims report
and data on tape (ongoing)
ReHMIS — survey of resources in Untested
the public sector
* TBVC = Transkei, Bophuthatswana, Venda, Ciskei
damage of previous apartheid policies, it will be through a disclaimer justifying the inclusion of
important to monitor progress towards equity in population group in their analysis. Debates have
health services and health status. It will therefore arisen over the naming of certain categories. In
be necessary to continue to record population the past, the use of ‘coloured’ rather than colour-
group for certain data. However, at the same time, ed, and African rather than black usually reflected
it is important that ethnicity, education, occupa- dissent from official apartheid terminology.
tion, and income variables are used to reflect the
cultural and socio-economic stratification of
South African society. These variables need to be
Mortality data
included in the routine information systems. In the past, all registered deaths in the RSA ex-
Researchers who do not wish to promote a cluding Transkei, Bophuthatswana, Venda, and
racial categorization often express their views Ciskei were collected by the Central Statistical
135
d= common approaches in epidemiology
a a a re
Services and published annually by age, sex, deaths in a specific age/sex group are due to the
population group, and cause of death, as well as cause of death of interest. So, for example, it was
by magisterial district and occupational group. found that 7,8% of all deaths in children under 15
Since the- repeal of the Population Registration years during 1981-5 were due to injuries (non-
Act during 1991, this data has not been collected natural causes) (Kibel, Bradshaw, Joubert 1990).
according to population group. During 1994, the When calculating mortality rates, in addition to
independent homelands were re-incorporated the mortality information, we need information
into South Africa and their data is now being on the size and composition of the population.
processed by the Central Statistical Services. The Using such information, it was determined that
data available before and after 1994 is thus not annually 55,3 per 100 000 coloured children aged
strictly comparable. five to nine died due to injuries during 1981-5
The underlying cause of death is coded accord- (Kibel et al 1990). Mortality obviously looks at the
ing to the International Classification of Diseases most extreme outcome on the health/disease
(ICD) into one of 999 causes. These causes are spectrum, but in the absence of national morbid-
grouped into 17 main chapters (for example, neo- ity data, it provides useful information. (Also see
plasms, circulatory). The ICD is revised and Cha 20:pte r
Mortality studies.)
expanded occasionally. During the 1980s and the
first half of the 1990s, the ninth revision of the ICD Population data (denominator)
was the standard one. It has subsequently been
replaced by the 10th revision. The ICD is used National censuses are usually conducted every
internationally and provides a basis for inter- five years. Any census has the problem of possible
national comparison. The quality of the classifi- under-enumeration. It has been estimated that
cation and of the reporting are crucial in deter- the 1991 census under-enumerated the African
mining the quality of the data. population by as much as 17% (Central Statistical
Although death registration is compulsory and Services 1992). In the other population groups,
permission for burial can only be obtained if the these figures are somewhat lower. Of course, the
death is registered, it has been estimated that as extent of under-enumeration differs geographic-
many as 50% of African deaths are not registered ally, but this has not been estimated. It is thus
(Botha and Bradshaw 1985), and thus not re- problematic to calculate rates for Africans, espe-
flected in the national mortality figures. Since we cially in the case of mortality where the numer-
cannot assume that underregistration is uniform ator data (mortality figures) are also inaccurate.
with regard to cause, age, sex, or area, this data For the African population, it is best to use locally
must be used cautiously. In addition, it has been available metropolitan or regionally collected
found that up to 20% of registered African deaths data (such as the African population estimates
are coded as being due to ‘symptoms, signs, and which the HSRC provided for the greater Cape
ill-defined conditions’. The majority of such Town area in 1988).
death certificates are completed by police officers As health services move to district manage-
(Van der Merwe, Yach, Metcalf 1991) and head- ment, there is an increasing need for an author-
men in the rural areas. This hampers the inter- itative source of district level population projec-
pretation that can be made regarding the other tions for the years after a census.
causes of death in this group.
The Births and Deaths Act was revised in 1992
Births
and the medical certificate changed so that
details of the cause of a non-natural (external This information is crucial for determining the
cause) death need not be submitted. This has infant mortality rate. National data are only avail-
been disastrous for surveillance of injury-related able some years after collection, and in the case of
deaths. Authorities such as the Western Cape Africans, are particularly incomplete. Most large
Regional Services Council have therefore set up cities include the number of births, based on noti-
their own surveillance system by collecting this fications, in their annual reports.
data at the mortuaries.
To gain insight from the published mortality
Morbidity data
data, we can calculate proportional mortality or
mortality rates. In the case of proportional Certain diseases, mostly of an infectious nature,
mortality, one calculates what proportion of all are notifiable to the Department of Health. The
136
13 = the use of routinely available data in epidemiological studies
major diseases include tuberculosis, hepatitis B, (for example, doctors to population ratio). This
measles, malaria and typhoid. Unfortunately, can be used to describe the distribution of avail-
health professionals do not always notify the able resources (see Example 13.2).
authorities of these conditions, and deaths are Various budgets can be used to gather inform-
particularly poorly reported. When considering ation about health care expenditures in the public
deaths due to notifiable conditions, it is advisable sector, and data from RAMS medical aid claims
to use the registered mortality data. can be used for information about expenditure on
A national cancer registry based on histological services. It is, however, extremely difficult to
diagnoses has been created by the South African obtain information regarding private consump-
Institute for Medical Research. Annual reports are tion of pharmaceuticals. Health care expenditure
published giving incidence rates by age, sex, can be expressed either as a total amount, or per
population group and site of the cancer. Disease- capita.
specific registers are very important for the sur- * ReHMIS (Regional Health Management In-
veillance of the disease, as they can provide more formation System) is a national survey of the
accurate measures of incidence than mortality resources in the public health service (REHMIS
data. System Documentation 1994). This was initiated
Many records are kept by hospital administra- to help in the restructuring of the health service,
tions. Not much epidemiological use has been and is expected to be repeated on a regular basis.
made of South African hospital records. Analyses
of records from Groote Schuur and Red Cross Conclusion
Children’s hospitals have shown the potential
value of such sources, as long as the quality of dis- The epidemiologist has an important role to play
charge diagnoses can be standardized (Henley, at all levels of the health service by assessing the
Smit, Roux, Zwarenstein 1991). quality of routinely available data and meaning-
Clinic records also provide a rich source of fully interpreting such data. By letting the data
information. Much of this information is not providers know how the data is used and what its
generally collated, and anyone who wishes to limitations are, it is possible to change the data
analyse such data may have to spend a lot of time collection procedure so that it yields better qual-
extracting the needed information from patient ity data. The data providers are generally willing
records. Example 13.1 describes how clinic to discuss ways of improving the routinely col-
records were used to investigate methodological lected data once they realize that someone is
issues regarding the measurement of treatment making use of it. Feedback should also be given to
compliance. the staff who collect routine data, particularly at
The South African Sentinel Practitioner Re- local level. These staff are often unaware of what
search Network (SASPREN) is a network of pri- happens to the information, and they may feel
mary care practitioners who have developed a they are wasting their time filling in endless forms
surveillance system (Volmink, Furman 1991). A that no one ever looks at. Feedback may motivate
pilot project has demonstrated the feasibility and them and thus result in better quality inform-
value of the routine collection and processing of ation. In addition, they can describe the practical
selected data, such as trends in hypertension and problems they experience in collecting the in-
depression. formation, and solutions can be found jointly.
Health services data EXAMPLE 13.1

Information about the available human health Measuring patient compliance with
resources can be obtained from registers of the
tuberculosis treatment
South African Medical and Dental Council, the
South African Nursing Council, and the register of At a community-based TB clinic in Cape Town, a
medical aid doctors. Information regarding other cohort of all new patients entered on the official
resources can be obtained from the South African notification list for the two major suburbs served
Hospital and Nursing Yearbook, which publishes by the clinic during a period of 120 treatment days
statistics on the number of beds available in each was studied. Demographic information about
hospital or clinic. Such information is often these patients was collected from clinic or noti-
expressed in terms of resources per population fication records. Treatment, outcome, and com-
137
i ee EE EE
pliance information was collected from the pa- The Hospital and Nursing Yearbook, statistical
tients’ treatment cards. reports from the provinces and the Central Stat-
On the last day of acceptance into the group (30 istical Services, as well as reports from the Na-
June 1987), a cross-sectional survey was con- tional Association of Private Hospitals were used
ducted identifying those patients in the group as sources of hospital data. For the clinic data, the
who were still on the clinic treatment list. Compli- Yearbook, reports from the regional directors of
ance and associations between demographic the then Department of National Health and
variables and compliance were measured at this Population Development, timetables from clinics
point (cross-sectional study, compliance to date), under various local authorities, and the statistical
as well as at the end of treatment (longitudinal abstracts from the Development Bank of South-
study, end compliance). A patient was defined as ern Africa were used.
compliant if he or she had attended on 75% or To verify hospital information, the 1988 Hos-
more of possible treatment days. In the longitudi- pital and Nursing Yearbook was regarded as the
nal survey, a time lag of some months was anti- most comprehensive source. All other sources, as
cipated until all patients had completed their well as other editions of the Yearbook, were com-
treatment. To contain the study, a cut-off point pared to it. For a random sample of hospitals, in-
was defined after which compliance was impos- formation was confirmed telephonically with
sible (160 days after entry into the cohort, since hospital administrators. The hospital data of the
treatment consists of 120 treatment days). Yearbook was found to be comprehensive, but
The cross-sectional study underestimated non- errors were found in totals of figures. Hospital bed
compliance: 19,5% compared to 40,3% obtained data of the various sources did not differ mark-
in the longitudinal study. The cross-sectional edly. The hospital data was updated using the
result was biased, as this study only included 1994 Yearbook.
treatment survivors, and measured compliance No data source was found to have a compre-
of all patients on a particular day, irrespective of hensive list of clinics within the provinces. There-
the time since notification. fore, exhaustive telephone contact with local
In both analyses it was found that children were authorities was made to improve the information.
less compliant than adults. The provincial distribution of hospital beds per
1000 population is shown in Table 13.2. The
Reference: report points out that the total number of private
Youngleson S M, Joubert G. ‘A comparison of cross-sectional and public sector beds does not accurately reflect
and longitudinal survey methods in measuring patient com- the number of beds accessible to the general
pliance with tuberculosis treatment.’ South African Medical population. Therefore, the number of public
Journal 1991; 80(7):331-5. sector beds (referral, general, and special) as well
as acute public beds (only referral and general)
per 1 000 population was also calculated by pro-
vince. The provinces were fairly comparable with
EXAMPLE 13.2 respect to acute public beds, apart from Mpuma-
langa which had 1,5 acute beds per 1 000 popula-
Health facilities in the nine provinces
tion (compared to the other provinces, which
of South Africa ranged from 2,0 to 2,5 per 1 000 population).
Since the provision and distribution of health ser- The distribution of clinics is shown in Table
vices is an important aspect of the development 13.3. Some provinces do worse than the WHO’s
of a national health plan, this study was done to recommended 10000 people per clinic. This
collate, verify, and analyse data on the number information has not been routinely analysed
and types of hospital beds (Table 13.2) and the before, and the quality is thus not known.
number of fixed clinics (Table 13.3), and relate The report recommends that comprehensive
these to the population sizes in each of the nine analyses should be carried out at regional and
provinces of South Africa. A comprehensive data- subregional level, taking into account, for exam-
base of this information was compiled for the ple, population density and growth, location and
period 1988-1993. accessibility of services (Chetty 1995).
138
13 the use of routinely available data in epidemiological studies
Table 13.2 Distribution of total public and private hospital beds per 1 itil)
population, South Africa 1993
Province Number of hospital beds Beds per 1 000 population
Eastern Cape 23 207 3,5

Mpumalanga 6 058 2,1
Free State 11 493 4,1
Gauteng 41 297 6,0
KwaZulu-Natal 32 826 3,8
Northern Cape 3 090 4,0
Northern Province 12846 -* 2,5
North-West 14518 3,3
Western Cape 19 664 5,4
Total 161 949 4,0
Table 13.3 Total number of clinics and population per clinic,

South Africa 1993
Province Number of clinics Population per clinic
Eastern Cape 530 12576

Mpumalanga 162 Tf 522
Free State ' 168 16 694
Gauteng 213 25 080
KwaZulu-Natal ais 22919
Northern Cape 422 6 261
Northern Province 287 17 842
North-West 268 13 085
Western Cape 365 9918
Total 2 548 15979
Reference:
Medical Journal 1995;
Chetty K S. ‘An integrated analysis of health facilities in the nine provinces of South Africa.’ South African
85:245-50.
139
14 Disease
surveillance
Introduction nation of data on specific diseases to those who

needed to know.
Disease surveillance in public health is the
The 1918 influenza epidemic in South Africa,
systematic collection, collation and analysis of
which caused 150000 deaths, occurred largely
outcome-specific data, and timely dissemination
because there was no surveillance system in place
to those who need to know, especially those who
to recognize it at an early stage, or to target effect-
can use this information to improve public
ive interventions. Thus, when the Department of
health. Hence, surveillance has also been referred
Health was constituted in 1919, certain diseases
to as the collection of data for action. It is a type of
were made notifiable.
ongoing observational study that involves con-
tinuous monitoring of disease occurrence within
a population. Uses of surveillance
The dictionary defines surveillance in terms of
The primary use of surveillance systems is to
police surveillance, as meaning to ‘watch or guard
establish the long-term trends and patterns in
over a person, especially a suspected person, a
disease occurrence at local, regional, and national
prisoner, or the like’. In terms of public health
levels. This serves as a background which allows
surveillance, the suspect is the disease in a popu-
for the detection of unusual disease patterns. For
lation.
example, long-term trends in tuberculosis (TB)
Since the surveillance of infectious diseases
show that since 1987, the overall incidence of TB
constitutes the bulk of public health surveillance
has been rising, after remaining fairly constant for
in South Africa, infectious diseases examples will
about 10 years (Figure 14.1). A breakdown of the
mainly be used to illustrate the general principles
data into racial groups shows that this increase is
of surveillance in public health.
largely attributable to an increase in the rate in
the coloured population. This example shows
Historical background how surveillance enables high-risk groups to be
The history of public health surveillance can be identified.
traced back to efforts to control bubonic plague in The immediate use of surveillance data is to
Europe in the 14th century. The principles of sur- trigger disease control efforts. In the latter part of
veillance were first exemplified by William Farr, 1988, paediatricians and general practitioners in
Superintendent of the Statistical Department of Cape Town reported an increase in the occur-
the General Registry in London, in a series of rence of pertussis (whooping cough) (Strebel,
letters on the causes of death in England appear- Metcalf, Hussey, Smith, Hanslo, Cameron 1989).
ing from 1839 to 1870, and a collection of papers Once it became clear that an epidemic of Borde-
on ‘Vital Statistics’ published in 1885. The tella pertussis was being experienced, the follow-
modern era of disease surveillance was initiated ing control measures were recommended: clin-
in the 1950s by Langmuir and his colleagues, who ical guidelines to consider a case as pertussis,
focused on the collection, analysis, and dissemi- antimicrobial (erythromycin) therapy, affected
140
14 disease surveillance
Se Se
a a
Sram meMC em Re mE tl
Incidence rate
4 Overall x Black -A- Coloured -~- White —< Asian

Source: ‘The trend of tuberculosis through the eyes of cohorts.’ Epidemiological Comments Nov 1992.
schoolchildren to remain at home for seven days veillance often stimulate health-related research.
after treatment or three weeks if appropriate
treatment was not given, treatment for contacts Main purposes of surveillance:
(erythromycin for upper respiratory tract infec- @ Morbidity and mortality reporting
tion) and strict adherence to immunization ¢@ Documenting distribution and spread of
schédules regarding diphtheria, pertussis, and diseases
tetanus (DPT). Whooping cough had been made @ Establishing long-term trends in disease occur-
notifiable in the city of Cape Town in 1974, unlike rence
the rest of the country, where it was not notifiable. Detection of epidemics
In 1994, pertussis was made notifiable nationally. @ Identifying high-risk groups or areas
Surveillance data is useful in evaluating inter- @ Estimation of magnitude of the health problem
vention measures. A mass measles immunization Facilitating planning of control and prevention
campaign in 1990 was evaluated using surveil- measures
lance data on measles admissions to a tertiary Evaluation of intervention measures
hospital. An initial short-term evaluation (six @ Resource allocation in public health planning
months post-campaign) found the campaign to @ Setting research priorities
be effective in reducing measles incidence (Figure @ Archival information for describing the natural
14.2) (Abdool Karim S$ S, Abdool Karim Q, Cha- history of diseases.
mane 1991).
However, when a second evaluation was done Surveillance data can also be used to direct
using the same surveillance data over a longer resources for swift and effective control, as well as
period (21 months post-campaign), it was found to estimate the magnitude of the health problem
that the effect of the campaign had been lost (in the long term).
(Abdool Karim S S, Abdool Karim Q, Dilraj, Cha-
mane 1993). The prime reason for this was that Methods of surveillance
the high level of vaccination coverage achieved
during the campaign had not been sustained. Surveillance data is typically obtained through
The main purposes of surveillance are sum- health provider-initiated reports, such as noti-
marized below. Unusual events detected by sur- fication forms filled in by nurses (passive surveil-
141
Figure 14.2 Monthly measles admissions to a KwaZulu-Natal tertiary

hospital
Measles admissions
200
Pre-campaign Post-campaign
N J M M J $ M M
1989 1990 Month 1991
Source: Abdool Karim S S, Abdool Karim Q, Dilraj R, Chamane M, 1993.
lance) or health department-solicited reports, veillance systems may be used to support or com-
such as the recruitment of throat swab specimens plement each other.
from health centres by the National Institute for
Virology for the isolation and monitoring of Routine surveillance
influenza viruses (McAnerney, Johnson, Schoub Routine surveillance means that every case of a
1994) (active surveillance). Passive surveillance is particular condition seen must be reported and
good for conditions that have clear symptomato- counted. Routinely reported data is typically
logy (and few asymptomatic infections), such as obtained via passive surveillance. A typical exam-
measles. Active surveillance is useful for diseases ple of routine surveillance is notifiable disease
which can be easily missed — it reminds health reporting. The main reason for making diseases
care providers to be on the lookout for these con- notifiable is to control those diseases which con-
ditions. A good example of a disease that needs stitute a danger to public health. Thirty-six condi-
active surveillance is malaria, where it is import- tions are notifiable in South Africa, and these are
ant in controlling malaria to detect asymptomatic mainly infectious diseases. A current list of notifi-
malaria carriers, in order to reduce the prevalence able conditions is readily available in Epidemio-
of parasite carriers. logical Comments, a publication of the Depart-
Passive surveillance is much less costly than ment of Health. The 1994/95 list of notifiable con-
active surveillance and is therefore used most ditions is shown in Table 14.1. Notifiable disease
often. reporting is the main routine system for morbid-
Methods of disease surveillance include rou- ity data collection in South Africa.
tine reporting, sentinel reporting, special surveys If a condition is notifiable, the notification to
or programmes, and outbreak investigations. health authorities is a statutory obligation, the
These are discussed in turn below. Several sur- regulations of which are contained in the Health
142
14 = disease surveillance
Act (63 of 1977). A notification may be submitted

Table 14.1 Notifiable diseases in
by any person who is legally competent to diag-
nose the condition, not necessarily a medical South Africa, 1994/5
practitioner. The Act stipulates that if the condi-
tion is a communicable disease, it must be re- Acute flaccid paralysis*
ported without delay orally and confirmed in Acute rheumatic fever
writing within 24 hours. The notification is made Anthrax
to the health section of the local authority, or to Brucellosis
the provincial department of health, in areas Cholera
where no local authority exists. If a death occurs, Congenital syphilis |
the authorities must be notified separately to Diphtheria
allow estimation of case fatality rates. Food poisoning —
Each local authority is required to submit, on a «Haemorrhagic fevers of Africa (Congo fever,
weekly basis, a summary of all notifications Dengue fever, Ebola fever, Lassa fever,
(including nil returns, which are blank forms indi- Marburg fever, Rift Valley fever) _
cating that no cases were seen) and deaths to the Haemophilus influenza type b*
provincial department of health for processing Lead poisoning
and analysis. Regular feedback is done via Legionellosis
Epidemiological Comments. Responsibility for Leprosy
control rests, in the first instance, with the local Malaria
authority, while resource provision, surveillance, Measles
and record-keeping are functions provided by the Meningococcal infection
provincial and national departments of health. Paratyphoid fever
Laboratory-based surveillance is another Plague
method of routine surveillance. This system uses Poisoning from agricultural or stock remedy
biological specimens sent to state laboratories by Poliomyelitis
health providers. Laboratory-based surveillance Rabies
is particularly important in infectious disease sur- Smallpox
veillance and data from this system is often tied Tetanus
into the notifiable disease reporting system. Its Tetanus neonatorum
potential in this country has been largely unex- Trachoma
plored and its role therefore needs to be ex- Tuberculosis: Pulmonary and other forms.
panded. The laboratory is useful in finding the Typhoid fever
etiological agent. Some diseases, such as typhoid, Typhus fever (louse and rat flea borne)
salmonellosis, and shigellosis can only be notified Viral hepatitis A & B, non-A non-B, unspecified
after laboratory confirmation, because of the Yellow fever
non-specificity of the clinical syndrome. In South Whooping cough (Bordetella pertussis)
Africa, laboratory data on viruses are compiled by
the National Institute for Virology (NIV) from re- *Notifiable as from 1994
ports submitted by the seven diagnostic labor-
atories in the country, and published monthly in
the South African Virus Laboratories: Surveillance free of polio until most cases of acute flaccid
Bulletin. Special clinics, such as those for ante- paralysis are fully evaluated in terms of stool cul-
natal care, sexually transmitted diseases, and tures and/or 60-day follow-up to confirm that
tuberculosis, are important ongoing sources of suspected or possible cases are not polio.
biological specimens for specific disease occur-
rence. Sentinel surveillance
Laboratory-based surveillance becomes essen- Sentinel surveillance uses data from a few se-
tial when it comes to the eradication of diseases lected sites rather than data from all sites. Most
that can be clinically confused with others. For sentinel surveillance systems are passive surveil-
example, polio can be confused clinically with lance systems. Since it is much easier to monitor
other causes of acute flaccid paralysis (such as and improve information collection at a few sites,
Guillain-Barre syndrome or transverse myelitis). it is possible to get accurate and more complete
Thus, it will not be possible to certify countries data on the sentinel population using sentinel
143
surveillance rather than with total population While data from sentinel sites, such as ante-
passive surveillance. Data from well-selected sites natal and sexually transmitted diseases clinics,
such as specific hospitals or health centres is have traditionally been used to estimate the pre-
available more quickly, is more reliable, and costs valence of HIV, special population-based HIV
much less than data for a whole region. This in- sero-surveys conducted on several occasions in
formation can act as an early warning system. For conjunction with the malaria programme have
diseases with a relatively high incidence, such as also been useful in estimating the HIV prevalence
tuberculosis, health centres (TB clinics) could be in rural KwaZulu-Natal (see Kustner, Swane-
selected as sentinel reporting sites. For less fre- velder, Van Middelkoop 1994, and Abdool Karim
quently occurring diseases requiring special care, Q, Abdool KarimSS, Singh, Short, Ngxongo 1992).
for example, neonatal tetanus, tertiary hospitals
could be used. An example of a sentinel hospital is Outbreak surveillance
Clairwood Hospital in Durban, as it is a referral Without surveillance it would be difficult to
hospital for several infectious diseases, in particu- detect outbreaks of disease — if the baseline pre-
lar measles. Admission data from this hospital has valence of disease is not known, then epidemics
been used with reasonable accuracy to depict the cannot be recognized. Many epidemics can go
trend in measles incidence that occurs in the undetected if no surveillance is in place. When a
whole province of KwaZulu-Natal. disease outbreak occurs, investigation becomes a
Other sentinel surveillance systems include the high priority. Outbreak investigations are under-
influenza surveillance system run by the National taken to establish the source and/or cause of the
Institute of Virology, the Medical Research Coun- outbreak. The main aim of an outbreak investig-
cil’s trauma surveillance system in the Cape and ation is to quickly control the outbreak with tar-
KwaZulu-Natal, the South African General Practi- geted interventions. It also provides important in-
tioner Network, and the National Cancer Registry. formation on vaccine efficacy, age-specific attack
rates and age-specific case fatality rates. The
Special surveys/special surveillance increase in measles incidence in South Africa in
programmes the 1992-4 period has been largely due to out-
Periodically, new or unusual disease problems breaks in older children (over five years) who had
require special surveillance programmes to be set been previously vaccinated. An investigation of a
up. With the emergence of multi-drug resistance measles outbreak in a Cape Town school revealed
and the greater risk of tuberculosis among those an attack rate of 7,6% (25/329 children), immuni-
with HIV/AIDS, a national tuberculosis pro- zation coverage (at least 1 dose) of 91% and vac-
gramme was initiated to document cases and cine efficacy of 79% (Coetzee, Hussey, Visser,
treat tuberculosis more accurately. A National TB Barron, Keen 1994). Outbreak investigations can
Register was initiated in 1994 to document in- also identify certain social, ethnic, or religious
formation on every case of TB in South Africa. clusters of susceptible individuals.
Since malaria is endemic to northern KwaZulu- The steps in conducting outbreak investiga-
Natal and sections of Mpumalanga, a special pro- tions are covered in Chapter 21 of this book. The
gramme, the Malaria Control Programme, was importance of surveillance in the post-outbreak
developed to address this problem in this region. phase, as a tool for evaluating preventive inter-
Surveillance agents of this programme visit vention during the outbreak, should not be over-
almost all houses approximately once every six looked.
weeks and collect clinical data as well as blood for
parasite microscopy. In this way, effective surveil-
Problems in surveillance
lance of malaria in this region is maintained. The
malaria programme is one of the few examples of Health care providers, health officials and policy-
active surveillance undertaken by the Depart- makers need rapid access to accurate, timely data
ment of Health in South Africa. in a useful form. Lack of such data results in poor
Another example is sewer pads for active sur- planning, unnecessary illness, mortality, and
veillance for cholera in water sources. To find out economic costs.
whether cholera is present in the community, it is There have been many criticisms of passive
more cost-effective to place a pad which retains surveillance. Under-reporting has often been
microorganisms from sewage than it is to screen cited as the major problem with passive surveil-
members of the community. lance. Consequently, a study to determine the
144
14 = disease surveillance
tte cn pee sen sc A
ei
reasons for under-reporting of notifiable condi- ably because unqualified staff fill in the forms
tions was conducted amongst doctors at King without adequate supervision.
Edward VIII Hospital in Durban (Abdool Karim, Since one of the main uses of surveillance data
Dilraj 1996). Some of the reasons for under- is to institute swift disease control efforts, time-
reporting highlighted in that study are discussed liness of reporting is important. However, admin-
here. istrative delays often hamper timely and effective
@ There is poor knowledge of conditions that are disease control efforts.
notifiable. Although 35 conditions were report- Surveillance programmes are very seldom sub-
able in the year of the study, doctors correctly ject to regular evaluation or validation. Despite
mentioned an average of only six of them. the problem of under-reporting in passive sur-
@ Health workers are not aware that any person veillance, it is still possible to obtain meaningful
who is legally competent to diagnose the condi- incidence rates and trends from notification data,
tion is required by law to report it. provided that both the level of under-reporting
@ The notification form is too complicated, as too and the consistency of these levels are estimated.
many details are requested. This discourages Such an approach was used to validate hepatitis B
reporting, or results in poor-quality data being reporting (Abdool Karim SS, Abdool Karim Q
sent in from service level. This in turn compli- 1991; see Example 14.1).
cates analysis of data at higher levels of the Active surveillance is an effective form of data
health services. collection because completeness and timeliness
@ There is lack of feedback on reported cases. The can be ensured, but it is not implemented widely
analysis of data received at every level of report- because of the high cost involved. Where speci-
ing is not fed back timeously to those who do mens are essential for diagnosis in asymptomatic
the reporting. If and when feedback occurs, it is cases, laboratory cost is an added expense.
often not done ina useful and accessible form. Depending on the disease condition under active
Without feedback, health workers adopt the surveillance, telephone and transport expenses
attitude that it makes no difference whether or could add to the cost. A further limitation of active
not they report or what they report. National surveillance is that the size of the area may be too
publications of the Department of Health such small for proper estimation of incidence rates.
as Epidemiological Comments, and provincial For example, for cancer, which has an incidence
newsletters such as Epidemiology Focus in of 150/100 000, a large area is needed to get stable
KwaZulu-Natal, address this problem to some rates.
extent. Data on isolations and antibiotic resist- In sentinel surveillance, the selection of sites is
ance of several bacterial organisms are pub- crucial. If sites are not well chosen, the introduc-
lished in the South African Medical Journal. tion of bias will make the results unrepresentative
This feedback is provided by the Antibiotic of the total population. People who choose to
Study Group of South Africa, which collates the make use of these services and who are included
data from seven academic hospitals through- in the surveillance may be different from those
out the country. who use other services. Sentinel surveillance is
@ There is lack of training in surveillance at also inappropriate for monitoring rare diseases,
undergraduate level and continuing medical such as polio. Since a single case of polio is re-
education. The medical schools and nursing garded as an epidemic, the consequence of miss-
colleges need to include the notification ing one case is enormous. Thus every health ser-
process and the importance of notification for vice has to be involved in the monitoring in order
disease control and health planning in their to detect every single case of suspected polio.
undergraduate training.
@ At understaffed institutions, the doctors are too Conclusion
busy to spendalot of time filling in the notifica-
tion form. In such instances, other hospital per- While the major criticism of passive surveillance
sonnel should be used to support the doctors in is under-reporting, it is not always critical to
their reporting function. obtain complete counts of most diseases in order
to plan and undertake disease control interven-
Completeness of reporting is another problem tions. When the incidence of a disease is high, the
with routine surveillance, in that returns are sent trends instead of the complete counts of the cases
incomplete to the regional offices. This is prob- are adequate to plan and institute disease control
145
efforts. This is provided that the under-reporting utilization of the virology laboratory service was
rates are reasonably consistent and the under- used as a basis of comparison to validate the
lying reporting mechanism does not vary unpre- hepatitis B notification data.
dictably. The level of under-notification was estimated
On the other end of the spectrum, diseases tar- by comparing the number of notifications for
geted for eradication, such as polio, demand sur- hepatitis B received by the Department of Health
veillance systems that are sensitive enough to with the number of positive hepatitis B laboratory
rapidly detect every case of the target disease. For results submitted to the NIV for the period
polio eradication, use of the case definition based 1985-8. Virologists estimated that approximately
on acute flaccid paralysis (AFP) enables the sur- 10% of positive laboratory results were repeat
veillance system to detect virtually all cases of tests, and the number of positive results was
paralytic polio. The choice of a case definition can therefore adjusted by this figure. It was found that
thus determine the sensitivity of the surveillance hepatitis B was grossly under-reported. Nation-
system for the disease in question. Effective ally, on average only one case of hepatitis B was
surveillance systems have detected AFP rates of notified for every seven positive hepatitis B labor-
1/100 000 and 0,45/100 000 children aged under atory results. There was a marked difference in
five years in the Americas and Tunisia respect- the regional notification : laboratory result ratios,
ively. In such instances, zero reporting is very ranging from 1:3 in the Cape to 1:17 in the Trans-
important when there are no cases. vaal.
It should be borne in mind that for vaccine The consistency of under-notification over
preventable diseases, surveillance should include time was fairly good nationally, ranging from 1:7
not only disease data, but also vaccination cover- to 1:9 over the four-year study period. The con-
age and vaccine efficacy. In vector-borne diseases sistency of under-notification varied greatly
such as malaria, surveillance of the vector popu- according to region, however. The Cape region
lations (distribution and species) is also part of had the highest and most constant level (1:3) in
the surveillance system. each year of the study, while the former Transvaal
While the technology for good surveillance sys- at the other extreme had very low and erratic
tems is available, the necessary commitment to levels ranging between 1:14 and 1:25 over this
apply these for the benefit of public health is not period.
always present. Surveillance systems should be The marked differences in the level of notifica-
evaluated to assess the accuracy and complete- tion between regions may spuriously create
ness of reporting, as well as the timeliness of considerable inter-region differences in incid-
results and the usefulness of the results to policy- ence rates. It is essential therefore that differences
makers. in the reporting level be taken into consideration
when making regional comparisons of hepatitis B
incidence rates calculated from notification data.
EXAMPLE 14.1 On the other hand, the small differences between
Validation of the hepatitis B report- the national reporting ratios for each of the study
years indicate that the analysis of hepatitis B noti-
ing system in South Africa fications could provide useful information on
Since the different types of viral hepatitis infec- trends in the incidence of this disease at a na-
tions are clinically indistinguishable, serology is tional level.
essential for diagnostic purposes. Consequently,
clinicians with access to laboratory facilities tend Reference:
to use them, and the laboratory data includes a Abdool Karim, S S, Abdool Karim, Q. ‘Under-reporting in
large proportion of the clinical cases of hepatitis B hepatitis B notifications.’ South African Medical Journal
being seen in the health care service. The high 1991; 79:242-4.
146
15 Health systems
research (HSR)
Introduction of learning from experience. We should use the

lessons learned to improve existing activities and
Health Systems Research (HSR) is research done those we are planning. Evaluation should not
on the health system and all its component parts stop at a list of problems and their possible
and activities. causes, but should include recommendations to
HSR is applied research, which aims to support increase, decrease, or change activities.
the decision-making process at all levels of the Evaluation requires a standard for comparison.
health system by providing relevant information. This would usually be drawn from the stated goals
The purpose of this research is to improve the of the service under evaluation. But the health
operation of the health system, leading to an system has grown and developed its traditions over
improvement in the impact of health care, which time, and so explicit goals are seldom stated or writ-
in turn leads to an improvement in thehealth of ten down. This is a weakness, because if we do not
the population. specify where we are trying to get to (in other words,
HSR encourages collaboration between re- if we do not have an explicit goal), we may never get
searchers, health care users, communities, health there. It is often required of HSR that it starts by
care providers, managers, and politicians in the clarifying and stating the goals of the health system
design and conduct of research. Collaborative or health care under evaluation.
research can help different stakeholders to reach These goals can be drawn from an understand-
a common understanding of problems, their ing of the concept of quality. What is quality in
causes, and solutions. health care? Quality health care is safe, contri-
butes to good health, meets user expectations,
Health system goals, evaluation and is the best use of finite health care resources.
Thus we see that HSR is often evaluation re-
research, and quality improvement search, aimed at measuring, and helping to im-
Until recently, we (the public, politicians, health prove, the elements which together make up
care managers, and professionals delivering care) ‘quality health care’. Figure 15.1 outlines the
have simply assumed that the care we provide or components of health care that can be evaluated.
receive is of high quality. This assumes that we
know what is meant by quality, we know how to Understanding health care and the
assess whether it is present, and we have com- health system
pleted such assessments. These assumptions are
seldom true. HSR can help us replace assump- The health system is a set of components that
tions with evaluation. function together to support and improve the
Evaluation means looking back at our work to health of the population. The health system con-
see if we have reached our planned goals, and, if sists of a set of cultural beliefs about health, ill-
not, examining where the problems lie (Mathews, ness, and health care, as well as the societal and
Yach, Buch 1989). Evaluation is a systematic way institutional framework in which these beliefs
147
Figure 15.1 Tc of health care irate LCs
Inputs ————————_———————_»> Processes ————_—__————————_> Outcomes
Staff Affordability User satisfaction

Equipment Accessibility Complete health
Money Acceptability Residual disease
Availability Residual disability
Utilization Death
Coverage Efficacy
Equity Effectiveness
Efficiency Cost effectiveness
and values are turned into health and health care action to achieve these goals. Once the goals of
behaviours. The cultural beliefs about health and the entire health system or a part of it are defined,
health care form the basis for health seeking and policies for achieving these goals must be se-
health care delivery behaviour. The societal lected. Policy evaluations assess whether the poli-
framework includes the family and the commu- cies of stakeholders are consistent with each
nity, while the institutional framework includes other and with health goals, and whether or not
other sectors, such as the education system, they are being implemented.
which impact on health and the health care A large NGO is funding a national project which
system. aims to improve the partnership between organi-
The health care system is the collection of zations training health professionals, the com-
people (clinical and support staff, communities, munities to be served, and the health services
families, and patients), things (buildings and authorities, by providing support to conduct
vehicles, drugs and money, timetables, rules and more student training in clinics within the com-
plans) and events (consultations, procedures, munities concerned (Zwarenstein, Lewin 1995).
meetings, and evaluations) that come together (as An evaluation is planned which will assess the
health care) to provide assistance for illness, ill- perceptions of the project aim held by each of
ness prevention, and health promotion. Although these partners. Descriptions of the number of stu-
HSR offers useful insights for research on the dents spending time in community clinics will
entire health system, in this chapter we empha- help to showif the policy is being implemented.
size research on the health care system. The infrastructure of the health service consists
of the physical and human inputs, and the proc-
The stages of health care delivery edural framework which forms the underlying
Health care can be thought of as if it were an foundation of the service. Evaluations of inputs
industrial process, which starts with production may assess the distribution and amount of long-
plans (policy), builds a factory and develops rules lasting inputs, such as buildings. Another type of
and procedures for running it, adds raw materials structural evaluation may assess the procedural
(inputs), combines them in processes to produce framework — such as the quality of the training,
health care of different kinds (outputs), which facilities, administrative and technical proce-
may lead to improved health (outcome). dures needed to provide care — against existing
This production line is permanent, but not un- norms, previous measurements, or other com-
changing. A well-managed health care system is parable areas, services, or facilities.
repeatedly re-examined and improved. HSR can A study was done to evaluate the quality of
contribute to this repeated cycle of planning, pro- health care provision in a sample of public and
duction, and evaluation, followed by improved private sector primary care sites (Beattie, Rispel,
planning and production. Broomberg, Price, Cabral 1995). Several indica-
Policy development is the first stage of health tors of quality of care were measured. These were
care delivery. Policy is the vision of a government assessed using indicators for the condition of the
or organization, stated as goals and plans of buildings and grounds, privacy for examination
148
15 health systems research
and changing, adequacy of space, and presence tended regularly, and whether the care that they
of patient toilets. These assessments were con- received (for example, regular eye testing to look
ducted by an external team of evaluators. for early, treatable complications of the disease)
Inputs are the people, material goods, and met internationally accepted guidelines.
financial resources which go into making up the Outputs are the items or units of care which are
health care system. They include both the long- produced in the interaction between health care
lasting capital inputs (fixed facilities such as clinic processes and inputs. Output evaluations fre-
or hospital buildings, durable capital equipment quently tell us how much care is being produced.
such as ambulances and X-ray machines) and the We may want to know if these outputs are being
shorter duration inputs paid for with running produced with a minimum of waste, which
costs (including equipment and consumables, requires a comparison of inputs and outputs.
such as scalpels, intravenous catheters, ban- Routine output evaluation of immunization
dages, drugs, and stationery, and personnel, such programmes can be based on the number of
as nurses, doctors, administrators, and cleaners). immunizations conducted ata site by bringing in
Input evaluations compare the inputs provided a denominator, the number of children of immu-
against existing norms or standards, previous nizable age. This will measure coverage as a pro-
measurements, or comparable areas and ser- portion of eligible children, and is reported
vices. They may also be analysed to see whether annually in most cities’ Medical Officer of Health
resources are being distributed fairly (equity). reports.
Where comparison between areas or over time is Outcomes are health states of people, groups,
the main purpose, an indicator might be selected or communities that result from interaction with
from among all the inputs to illustrate the com- the health system. Outcome evaluations check
parison. One example of such an indicator is ‘hos- that the intervention was effective, that the
pital beds’. Inputs are measured to get an idea of desired health goal was achieved, and that the
the cost of an element of health care. The various patient, client, or community obtained preven-
inputs are often summarized by their total money tion, amelioration, cure, or rehabilitation of a
value. particular problem.
A major problem in national health planning in Continuing the immunization example above,
South Africa is the absence of expenditure data. the outcome of improved immunization coverage
The National Health Expenditure review (Health is decreased incidence of the diseases against
Systems Trust 1995) has added up how much which immunization was conducted. A study was
money is being spent on what kinds of health done of the trends in admissions to a major infec-
care. This understanding of total financial input tious diseases hospital, to assess the impact of a
into the health sector is a great help in national measles immunization programme on the
planning. Data on physical resources, such as disease (Abdool Karim S S, Abdool Karim Q,
staff and beds, is also described by province and Chamane 1991). It showed an initial fall in admis-
by magisterial district. Thus, for the first time, sions, followed by a gradual rise to previous levels
resources can be allocated fairly according to geo- (see Chapter 14: Disease surveillance).
graphical area. This helps in planning the redistri-
bution of resources to ensure increasing equity. Health care goals: the elements of quality
Processes are the complex interactions which The idea of quality is more complicated in health
occur in the delivery of health care, and which care than in most industries, because so many
take place when infrastructure and other inputs elements are included in the notion of quality
are applied to communities or individual patients (Table 15.1). Each element of quality can offer a
with health problems. Process evaluations assess starting point from which to develop a standard
the way in which care is provided, and whether it against which a health care activity can be evalu-
meets any institutional guidelines which may ated.
exist. Although quality is sometimes used as a
A study of the quality of diabetes care in public - synonym
ynonym for single
g elements, it is also an overall
sector primary care clinics was done in the term for all of these elements. Quality care offers
Western Cape (Levitt, Bradshaw, Zwarenstein, all of the elements described.
Bawa, Katzenellenbogen, Dopfmer [1996]). This The table excludes a number of elements, such
study assessed a representative sample of patient as cost. Cost, the amount of resources used to
records to see what proportion of patients at- offer a service measured in money terms, is dealt
149
gee) (eweRoe Rem Um CU ears
ELEMENT DESCRIPTION
Accessibility Geographical, financial, social, physical access to health care services
Effectiveness How well does a process or intervention work?
Comprehensiveness The ability of the service to meet the wide range of needs of individuals and
communities
Appropriateness The level and range of care given the needs of the patient and community and
the resources available
Acceptability The match between patient and community expectations and the care provided
Coverage The proportion of the target population successfully in contact with the health
service
Adherence How well treatment regimens are followed — determined by clinical, patient,
and community factors
Continuity The potential for continued interaction between the health service and pa-
tients or communities, such as through defaulter tracing programmes, out-
reach, and a coherent record-keeping system
Relations The quality of the social interaction between health workers and patients
Efficiency _ The relationship between input and output: maximizing the outputs for a given
Input
Equity The availability of equal care for people with equal needs
with in Chapter 30: Health economics. about the distribution of input, process, or output
Availability, affordability, accessibility, and per capita, in different areas, or by different social
acceptability are terms for different aspects of the groups. We may compare demand with need, or
‘fit’ between the health care supplied and the utilization with need as a way to identify commu-
potential users’ needs and wants. Availability nities with excess or under-utilization. Commu-
describes the degree to which supply of services nities should have equal use for equal need. See
and resources meets the theoretical need. Chapter 30 for further discussion of equity.
Affordability is an economic concept: to what Efficacy, effectiveness, and efficiency refer to
extent do the prices of services match the clients’ different aspects of the concept ‘how well does
income and willingness to pay? Accessibility this care work?’
takes into account transport, travel time, and Efficacy studies ask the question: Does the
distance. Acceptability describes users’ percep- health care intervention (usually a drug) work
tions of the appropriateness of services, and when given under ideal circumstances (a care-
emphasizes communication and mutual respect fully carried out randomized double blind con-
between staff and patients, continuity of care, trolled trial with complying patients)?
and convenience. Effectiveness describes the ability of the inter-
Equity studies try to find out if a health service vention to work in the messy real world, with
is fairly shared out in proportion to need. These ordinary patients under normal health service
studies are usually based on routine information conditions.
150
Efficiency is an economic term with two broad study is to be done). This will be illustrated with
meanings. an example of a health care activity, and the HSR
Technical efficiency involves minimization of which could be carried out to evaluate and plan
resources (such as staff and time) used in under- improvements to it.
taking a particular task. This is often evaluated One of the most cost effective interventions in
using operations research methods. Allocative all of health care is immunization against child-
efficiency asks whether a particular health service hood illnesses, especially in impoverished com-
approach is the best way to use resources to max- munities, where immunity is poor and severe
imum effect. Here the term is used for studies complications of otherwise mild diseases are
which ask either ‘Is this the cheapest way to common. Several different HSR studies could be
achieve these results?’ or, ‘Is this health pro- useful in improving immunization programmes.
gramme worth implementing in comparison with We will illustrate a few of the most important
other uses of these resources?’ These studies questions that managers may wish to answer
usually require help from an economist (see about immunization, categorizing them accord-
Chapter 30: Health economics). ing to the structure outlined in Table 15.2.
Often, immunization programmes have been
An integrated view of HSR: linking operating for some time, but their success is un-
known. A study looking at the output (stage) of
research design with sectors, levels,
the immunization programme would be useful.
units, stages and elements of care In this study, coverage (element) would be appro-
HSR can address many different questions. In priate to assess success. One approach would be
order to understand the possibilities, it may be to do a descriptive, random community sample
useful to see health care as a complicated system survey (study design) in one district (level) of chil-
in which different sectors operate (public and pri- dren (unit) to see if they were immunized appro-
vate), each with several levels (first contact, refer- priately for age. This study thus determines out-
ral). Health care delivery consists of several stages put per population.
(policy, process, outcome), is organized into dif- We immunize children because we want to
ferent units (clinic, carer, consultation, client) protect them against a particular disease, say
and can be evaluated according to performance measles. The desired outcome (stage), then, is a
in any of several elements (equity, effectiveness, fall in incidence of the disease, measles, and a
accessibility). reduction in deaths caused by it. Another
Any HSR study can be categorized and placed approach would be to look at time trends (study
by selecting one description from each column in design similar to a before and after study, but over
Table 15.2. It shows some examples of descrip- a longer period) in reported measles cases, with
tions, but others might be added. Any study can the assumption that a fall is due to the immuniza-
be fully described by stating the sector and level tion programme. This is a study of effectiveness
of the health system to which it refers, the unit, (element); the unit of study (children) remains
stage and particular element of health care deliv- the same, while the level (district) could stay the
ery to be studied, and the study design (how the same or change. If the level was raised (provincial
Table 15.2 Ways i looking at a care systems
eae
Input Effectiveness
study deste
Rapid appraisal
Public Ward Carer
Private (for profit) District Clinic Process Equity Descriptive
Private (not for profit) Region Hospital Output Cost Analytic
Province Contact | Outcome Efficiency Before-after
National Script Coverage Cross-sectional
International Patient Satisfaction Case-control
Individual Need Cohort
Community Trial
151
or national), the main purpose of the study might problem identified by a health service purchaser
be to assess equity (element) between areas. (in some countries, health ministries have
What would a manager do if coverage was created authorities which ‘buy’ health services
found to be good (the output stage of the pro- from providers on behalf of the government),
gramme is successful) but measles cases or provider, or user. The problem is often tackled
deaths were still being recorded (the programme in such a way that more issues are taken into
is failing to achieve the desired outcome)? This account than appeared in the original problem
implies that either the inputs or the processes of statement. In other words, a comprehensive
immunization (stages) are deficient in some way, understanding of health and health care in a
and studies of these should be done urgently. social, political, and economic context is applied
Since protection may fail if the vaccine is not kept to research.
cold (a subsidiary goal of the measles immuniza- Because it is problem-based, it is also problem-
tion programme), evaluation of the process of solving — often providing cure, but sometimes
vaccine storage needs to be conducted. prevention. If patients are dissatisfied with care,
Other studies could also be useful: immuniza- service providers need to know quickly what the
tion has to be given in certain ways, which could main problems are, and how these can be rem-
be evaluated, as could the success of the training edied. If the research is only made ayailable in
received by nurses in administering the immu- two or three years time (the usual gap between
nization. (Both of these are process evaluations of beginning and publishing research), patients will
quality, the first of the unit contact, the second of have moved on or died. Often such studies must
the carer.) Do patients believe in and want be quick and their questions must be very
measles immunization (the element of accept- focused. Much of the skill in HSR lies in making
ability)? trade-offs between speed of answer on the one
While the child is being immunized against hand, and depth of information on the other.
measles, are its other health problems attended to Solutions also have to be relevant to local needs
(the element of comprehensiveness)? Do staff and circumstances. This requires more local
relate well to patients (the element of relations)? knowledge than any outsider can have, and so,
given that HSR skills are scarce, projects are often
Characteristics of HSR conducted as collaborative research, where a
HSR is different from most epidemiological re- skilled consultant collaborates with an insider
search in its approach and the way it uses differ- who needs the results of the research to improve
ent methods. The characteristics of HSR are listed his or her health care provision. The research
below: users may then conduct the study with help from
¢@ HSR focuses on finding implementable solu- a consultant. The ability to share research skills is
tions to priority problems. an absolute requirement, and itself an important
@ HSR findings help to improve services which goal of HSR.
lead to better health.
HSR provides usable timely information to Methodology
decision-makers at all levels. HSR problems are usually complex, and may
HSR is participatory, involving concerned need a combination of several methods from
parties in all stages of research. multiple disciplines. An HSR project will often
@ HSR demystifies research and decentralizes it begin with a situation analysis. This may help to
from high-level institutions. formulate the problem to be solved. From this, a
@ HSR can be done by people in services as part of number of questions might be generated. These
their daily activities. questions could be addressed by qualitative or
¢@ HSR emphasizes simple, fast, low-cost study quantitative methods.
designs. Qualitative approaches drawn from the social
@ HSR is multi-disciplinary, and uses multiple sciences include depth and focus group inter-
methods. views and text analysis. Thus an HSR study might
@ HSR is iterative: it follows the chain of action/ include depth interviews to understand patient or
evaluate/plan/new action/ re-evaluate/re-plan. staff perceptions of a service or a care problem, or
focus group interviews to allow different percep-
Approach tions of a problem to be discussed.
HSR is problem-based, usually responding to a HSR studies create a number of specific diffi-
Wa2
culties when using qualitative methods. HSR is

oriented towards problem definition and solu- Table 15.3 Examples of an
tion, and towards participatory research aaa
approaches. All options and viewpoints should
be heard. Sometimes vested interests may bias ~ @ What proportion of the population X has con- —
the inquiry, especially where one collaborating dition Y?
partner attempts to drive the study according to ¢ Given condition Y, what proportion of a
those vested interests. This is less difficult if the group seeks care? _
problem is being handled in a non-adversarial ¢ What type of care is sought? :
way, in other words, if all parties have the same @ What proportion of those who seek care re- .
interest in solving the problem. This is rarely the ceive it?
case, and HSR researchers need to be aware of ,@ At what cost to Hoceler, the health care
potential conflicts and the ethical implications system, and the society?
of conducting research whose recommenda- ¢ Is the proportion ofpeople who obtain are :
tions may favour one or another party in a for this condition the same in all areas
conflict. ¢ What proportion of those who obtain careis
Another issue to take into account is the lack of satisfied with the care obtained?
trust that staff and patients might feel towards @ is treatment or care of type A better|
each other and management. If a study is per- (more effective) than that of type B?
ceived to be launched by management, without Is treatment of type A cheaper than.
participation from staff, some informants might given similar effectivenéss? =
not be open. -@ What proportion of those with condition | i
In both of these situations, an external consult- correctly identified using test Z?
ant with health systems experience can play a ¢ How well do staff and patients -
valuable role in resolving conflict and bridging communicate? _ es
barriers. ¢ Why is staff morale low?
Quantitative approaches derived from epide-
miology and biostatistics might include descrip-
tive, analytic, and intervention studies, as well as Tips for conducting health systems
analyses of routinely collected data from health or research
management information systems. The methods
used are dictated by the problem and the context. Carl Taylor, an experienced health care evaluator,
HSR raises specific difficulties here too. The has said that the most important single issue in
main difficulty derives from the complex nature HSR is to learn to ask the right research question
of the data collected in some HSR studies. While (Taylor 1984). Some examples of typical HSR
an immunization prevalence survey produces a questions are shown in Table 15.3.
simple data set, a study whose goal is to improve The following section might help to guide you
the quality of care in an ambulatory clinic may in developing a successful HSR project.
consist of many more variables, each of which is The aim of HSR is to understand and help solve
itself complex to collect and analyse. particular priority problems in specific situations,
The underlying phenomena which we are and so these questions should be applied in rela-
trying to measure (such as equity, access, and the tion to areal problem in a specific situation.
other elements in Table 15.1) are complex, and af-
fected by many factors. These phenomena are not 1 Are you patient enough to start an HSR
the same everywhere, and so data collection tools process?
and indicators are not standardized, and may
need to be designed according to the local need Promoting improvements in the health system is
for, and availability of, information. seldom a single one-off event — it is an ongoing,
These difficulties can be alleviated by keeping long-term process which should be flexible
the main problem in mind, generating clear enough to respond to changing needs in the
hypotheses before designing the data collection population. HSR is like music: timing is import-
methods, and collecting only data whose use in ant.
analysis is clear from the start. Good pilot studies Sometimes we may raise questions before
help. others are ready for them; if no response is ob-
153
d= = common approaches in epidemiology
ee a a i ST
tained, that issue may need to be put on hold for as to influence decisions, or not implementable.
some time, until it rises higher on the agenda. The
exact evaluation question might also change over 4 Is your chosen research problem important?
time, to match the changing needs for inform-
ation. Health care programmes may also change, Research skills and resources are scarce, and
both in personnel and methods of operation. should be focused on important problems. A
Evaluations should thus also be repeated and problem is important if many people are affected,
ideally integrated into the process of health care if the diseases involved are serious, with major
delivery, to be conducted regularly by those who health consequences, or if many people use the
deliver care. health care services that are identified for evalu-
ation. If the costs of care, or lack of it, are high and
2 Do you understand the context? if policies are about to change, the issue becomes
more important. Sometimes a question is of aca-
Before an evaluation is possible, the environment demic interest only. If, for example, no alternative
should also be understood: the background, the policy option exists, there is little point evaluating
surroundings, or the context in which the prob- policy options. Also, are you sure that research is
lem is defined. The objective and subjective necessary? Are you sure that adequate answers
health needs of the community served by the are not already available?
health system are an important part of this con-
text. The community and the society, its politics, 5 Is your research politically feasible?
its culture, its economy, the available resources,
its social development, and the degree of partici- The questions for an HSR study should be
pation in decisions are all important aspects of discussed with and reflect the priorities of man-
the environment which should be understood agers and providers of care, as well as the con-
before embarking on an HSR study. The health cerns of the community in which the service
system itself is part of this environment. The exists. If any stakeholder groups are hostile to the
researcher needs to understand where the prob- research, or feel it is irrelevant, success is unlikely.
lem and service under study fits into the picture Either the research itself will fail or the recom-
of the structure and functioning of the whole mendations will be out of touch with key stake-
health system: its size, its place in the organiza- holders’ needs, and therefore not implemented.
tion, the organizational culture, the cost of care, But, on the other hand, there must be enough un-
the skill and experience of staff, their relations certainty among all stakeholders to allow space
with each other, with managers, politicians, for research to influence policy. If minds are
users, the community, and other services. The made up already, this may not happen.
‘target’ population and the planned activities and
expected benefits of the health care programme 6 Can your research be done in the available
under study should be clear. These details are time?
usually found by studying the goals and object-
ives of the programme closely. If these are not Policy-makers have shorter deadlines than
understood, you should probably undertake a researchers; often a decision has to be taken fast
situation analysis before starting a specific HSR for budgetary or public pressure reasons. In this
study. situation, an HSR researcher needs to assess if the
research can be completed in time to influence
3 Is your research question specific? the decision. If not, a simpler, quicker, less
acceptable research design may be needed, or a
Evaluation questions need to be specific. Is your longer-term question needs to be formulated.
question understood by all participants in the
same way? 7 Will your answer be heard?
Does the question specify a sector, level, stage
of care, unit of study, and issue which are to be Sometimes the perceived quality of an answer is
investigated? Vague questions result in vague affected by whose answer it is. Will the research
projects which may never finish. Even if the pro- team have the credibility to produce an accept-
ject does reach an end point, the answers may be able answer? If not, you should consider collab-
vague, open to re-interpretation, not phrased so orating with people whose credibility is greater.
154
Conclusion increased VHW activity in comparison with that

in the control villages, and that more mothers in
HSR is an important development in the history
‘experimental’ villages had appropriate ORS and
of research. It offers an approach to applied re-
diarrhoea knowledge and practices. This com-
search in health care which is more engaged with
munity trial gave firm evidence of benefit
and linked to health service needs. It offers a help-
assessed by process measures, suggesting that
ing hand to decision-makers in the health system,
dehydration from diarrhoea would be reduced if
caught between rising costs and higher expecta-
the programme was implemented widely. The
tions, between community wants and health ser-
ORS training programme was subsequently
vice plans, between increasing health needs, and
widely introduced and was extended to deal with
shortages of personnel and resources.
other interventions for childhood diseases.
To those who do HSR and those who use it (in-
A further aspect of the programme was the sys-
creasingly the same people, as skills spread more
tématic training of VHWs in using a new record
widely), HSR offers tools to make better use of
card, with monthly follow-up visits and recording
resources to achieve better health (Zwarenstein
of information for the whole group of interven-
1990).
tions, rather than the irregular visits that had
occurred previously without documentation. A
second survey undertook to evaluate this
EXAMPLE 15.1 strengthened programme. One particular village,
Rural child survival programme Thornhill, was selected as the site for the imple-
mentation of the VHW cards and follow-up sys-
evaluations: Hewu, Eastern Cape tem. The researchers selected an indicator by
A series of studies formed part of a four-year-long which each of the interventions would be meas-
collaborative research project between Medical ured. Some of these indicators measured afinal
Research Council researchers and the staff of the health outcome (for example, weight for age),
community-based health service in the Hewu dis- while some measured intermediary outputs such
trict of the Eastern Cape. Hewu is a poor rural as coverage of children in the community by the
area, with substantial resettlement. Its health VHWs (what proportion of households had been
problems are mostly childhood disease and visited within the last month) and ORS knowledge
mortality. The health services focus on these, of mothers, and others measured.
relying mainly on rural clinics staffed by nurses, A survey of the village children was done to es-
with a village health worker scheme to improve timate the coverage and effectiveness of the long-
outreach to communities. standing approach on each of the indicators. The
The first collaboration evaluated was a pro- new approach began immediately after the
posed new education programme for village survey, and was evaluated by two follow-up sur-
health workers (VHWs). The research question veys, a year and two years later. In other words, it
was: does a training programme for VHWs in oral was a before/after design. The new programme,
rehydration methods result in an improved oral in comparison with the previous one, resulted in
rehydration solution (ORS) programme in the better immunization rates, longer duration of
community served by the VHWs in comparison to breast-feeding, better coverage with village
the community whose VHWs did not receive the health workers visiting more regularly, but basic-
programme? This study was a community trial of ally the same poor nutritional status (outcome
the effectiveness of a training programme in a real measure). This shows that health care has com-
community setting. The evaluation looked at pro- plex relationships to health, especially in the
cess measures (mothers’ knowledge) rather than presence of poverty. Even though many elements
output (successfully treated diarrhoea cases) or of the programme were working well, the pro-
outcome (reduced diarrhoeal complications and gramme did not make a major impact on one of
death). the key indicators of child health, namely growth.
A survey of ‘experimental’ villages (where
VHWs received the training) and control villages References:
(no training) was conducted six weeks after the Kuhn L, Zwarenstein M, Thomas GC, Yach D, Conradie H H,
training programme. Results showed that more Hoogendoorn L, Katzenellenbogen J. ‘Village health-
mothers in the experimental villages had been workers and GOBI-FFF: An evaluation ofa rural pro-
visited, suggesting that the training programme gramme.’ South African Medical Journal 1990; 77:471-5.
155
ee
ee
Kuhn L, Zwarenstein M. ‘Evaluation ofa Village Health and establish broad priorities for improvements.
Worker Programme: The Use of Village Health Worker It relied heavily on qualitative methodology to
Retained records.’ International Journal of Epidemiology understand service providers’ perceptions, and
1990; 19(3):11-18. simple routine data sources to estimate the load
Yach D, Hoogendoorn L, Von Schirnding YE R. ‘Village and available capacity of the staff to take on more
health workers are able to teach mothers how to safely pre- work.
pare sugar/salt solutions.’ Paediatric and Perinatal Epidemi- The researcher interviewed experienced staff,
ology 1987; 1:153-61. held group discussions with patients, staff, NGO
Yach D, Katzenellenbogen J, Conradie H. ‘Infant Mortality staff, and representatives of political organiza-
Rate Study: Hewu District.’ South African Journal of Science tions to gain insights into the running of the CHC.
1987; 83:417-21. He examined and re-analysed some routine data,
reviewed previous research on similar health ser-
vices in Khayelitsha and elsewhere, and observed
EXAMPLE 15.2 the functioning of CHC services by walking
Evaluating health care to help imple- around at different times of the day and week.
The CHC was clearly busy, but it had under-
ment change: the Khayelitsha studies utilized resources available, especially as staff
Sometimes an HSR collaboration must first find members were not always busy during after-
the questions that need to be answered through noons. There were gaps in services; for example,
an overview or situation analysis, and then look sick children were not always treated if they
for appropriate methods of answering them. This arrived for immunization, and children who
example describes how research contributed to arrived ill and also needed immunization often
the process of trying to improve the quality of care received care, but not immunization. There were
in a community health centre (CHC) in a large also overlaps; for example, patients with STDs
peri-urban settlement in Cape Town. were being treated by all three authorities, in
The CHC treated about 1 000 patients oe day, three different places, at different times, by differ-
and had more than 150 staff, employed by three ent staff, using different methods. The study also
different health authorities. The local authority collected many ideas for improving the services
was responsible for running specialized clinics for using existing resources.
sexually transmitted diseases (STDs), family plan- The situation analysis was presented, circu-
ning (FP), tuberculosis, and preventive and pro- lated, and widely discussed. It recommended that
motive care for well children under six. They felt the managers of separate authorities working in
that fragmentation of health authorities had the CHC should initially focus on combining cur-
caused gaps and overlaps in services provided ative and preventive services for children under
within one CHC, and wanted to co-ordinate their six years old, to eliminate overlaps and gaps.
services with those provided by the other author- While the integration of preventive and
ities, especially the provision of curative care for curative services was being planned, managers
ill children. Community organizations had also focused on the problem of long periods of waiting
criticized the quality of care and demanded im- for care, which had been identified in the situa-
provements. tion analysis, having attracted frequent patient
However, nobody had studied what the various complaints.
problems were, how severe they were, what was
causing them, or how they could be solved. The Waiting times
local authority thus asked a researcher to conduct The researcher designed a study in collaboration
a situation analysis, or rapid appraisal of the CHC, with management to measure how long patients
working part-time over two months. The situ- spent at the CHC curative and preventive clinics.
ation analysis tried to identify overlaps and gaps The study also aimed to find out when (times of
in the services provided by different authorities, the day, and days of the week) waiting times were
and to collect problems, their main causes and longest, and whether the flow of patients and
solutions as perceived by staff, management, work processes could be improved. One specific
patients, and local organizations. Qualitative and suggestion had been that patients receive their
quantitative methods were used. drugs during the consultation, without having to
This study was a multi-method overview of a wait at the pharmacy. The study needed to assess
given health setting, to identify priority problems, whether this would save time. The idea was that
156
after a first study, improvements would be made hours of the week. Most patients arrived early,
to reduce waiting times, and then the study could causing crowding in the mornings, and an empty
be repeated. CHC in the afternoons. Patients waited only 15
The study hired a fieldworker who stood at the minutes at the pharmacy, which did not justify
only gate to the CHC for a week, noting patients’ changing the way drugs were issued. The patient-
entry and exit times. Each patient was given a slip flow mapping showed a number of stops where
of paper with the time of entry, which was taken patients were waiting unnecessarily (for example,
back at time of exit. Information on which cur- they could be weighed while waiting for a health
ative or preventive clinics were visited were also education session) or that could be cut out. The
entered by staff. Also, on three days, at different follow-up survey, after the preventive clinic had
times each day, a fieldworker recorded the times been streamlined, but the curative clinic had not
at which patients arrived at the pharmacy, and changed, showed that average waiting times in
the times at which they left with their drugs. The the curative clinic increased to 5,1 hours, but
researcher developed a flow diagram of how dropped to 1,5 hours in the preventive clinic.
patients moved though the clinic, noting every The study thus confirmed that waiting times
point at which patients were served, or waited to were a problem, and that reorganizing the pre-
be served. This survey was repeated in exactly the ventive clinic reduced the problem, but that
same way (excluding the pharmacy study) a year changing the way of issuing drugs would not solve
later. the problem.
The study showed that waiting times were very
long (an average of 4,1 hours in the curative clinic References: _
and 2,6 hours in the preventive clinic). Waiting Bachmann M. Site B Health Centre Khayelitsha: Situation
times varied a lot in the preventive clinic because Analysis and Recommendations. Community Health
immunization was only provided during a few Department, UCT, February 1993.
157
16 Rapid
epidemiological
assessment
Introduction during emergencies or in times of crisis. After

natural disasters such as droughts, earthquakes,
In 1981, the term ‘rapid epidemiological assess-
or floods, REA can be used to find the highest risk
ment’ was coined by the United States National
populations to which food supplementation pro-
Academy of Science’s Advisory Committee on
grammes will be targeted.
Health for Medical Research and Development.
At present, REA largely focuses on two aspects
Rapid epidemiological assessment (REA) was
of epidemiology. The first is sampling methods
described as a collection of methods which pro-
which reduce the time and resources required to
vide health information more rapidly and simply
collect and analyse data from individuals. The
and at a lower cost than standard methods of data
second aspect includes methods for the collec-
collection, yet also yield reliable results for use
tion, organization, analysis, and presentation of
primarily at the local level (that is, quick but not
data at the community level.
dirty!) For example, in many developing coun-
tries, the use of methodology such as random
sample surveys is not feasible and too costly, and Broad types of REA
other innovative REA methods have thus been
Table 16.1 outlines the broad types of REA
developed.
methods used. These are discussed in more detail
below.
The uses of REA
REA methods are strongly goal-orientated to Small area surveys and sampling methods
health service and community needs, as opposed Epidemiological sampling techniques generally
to the use of more complex methods that aim to aim to obtain representative samples of fairly
answer more theoretical questions. The users of large areas. This usually involves many resources,
REA usually have in mind the implementation of and results are often not fed back to the local level
their findings. in time to influence the actions or decisions taken.
REA can be used under routine conditions to REA samples should be relevant to a particular
evaluate health service functioning where time service. This could be the area served by an indi-
constraints on staff and financial constraints on vidual community health worker, the area for
resources are a Critical factor. For example, the which a clinic is responsible, or an entire local
evaluation of vaccination coverage in the com- authority. Information is required at the local
munity by means of the EPI methodology or lot level to evaluate whether the services are meeting
quality assurance sampling (see Chapter 22: preset targets.
Immunization coverage), and the evaluation of
missed opportunity for vaccination in the clinical Expanded Programme on Immunization (EPI)
setting, provide rapid and reliable results and The WHO Expanded Programme on Immuniza-
require minimal technology and resources. tion (EPI) developed a rapid method for assessing
REA can also be effectively and efficiently used vaccination coverage using cluster sampling that
158
159
rapid epidemiological assessment
sulpunojuoo
SW3|qO1d ajqeljas ‘Wueloie ‘pidey
YUM uonenjene pue SuLoUO/\ Spoujelw jO1]U09 8seg
uonenjers pue ZuLOUO|\
JOYUUOW 0}
yoinb pue ‘deayd ‘ajdwis aie JUusWSSsesse
YdIUM pasn sJozeoipul AxOld YSU pue snjeys YjjeoH suojeoipul AUNWWOD
pea}}09
SISOUBBIP JO9IOOU! Ul si eJep OU aJ8Y4M UJeaP JO
sunjnsai suajqoid |je0ay SaSned JO SJOJEDIPU! S@AID saipnys ALjeOW saisdoyne jeqia\
%OL Ueyy
1912313 9q pjnoyus saiyioes senbiuyoe}
uyyesy ye aje1 AaAijoq deayo pue ajdwis saipnys AUeUO Yyuig SuIpeda1d4
16
goue||ionuns
aniejuesaidas aq 9uljNOJ 0} pasedwoo Ayeyow pue
jou Aew says Jo ajdwes pauinbai seoinosei ssa] Aypiquoww Jo saipnis aouejsjlaAins jaunUaS spoujew soue]|I@AINS
JOPYUOW OF
yoinb pue ‘deayo ‘ajduis ae juaLussesse YSU quawussasse yieay
YdIUM pasn SJoyedIpUl AXOJd pue snjejs jejuawWUOIIAUA jequdwuOJAUS pidey
pesinbas aq jou Aew suljdwes saipnys
aanejuesaidal anneyjuenb suljonpuoo juaussesse
aq jou Aew sijnsey | uonewuojul yjdep-ul SepIAdld aJ0joq Juawssessy o1ydeisouyje pidey
pejeoljduo0o
anbiuyoe} suljdues pouinbe azis ajdwes |jews UONeN|eAS pue SULIOWUO|\ aoueinsse Aqijenb 107
(aouajeAaid)
suljdwes
spoyjew
aAnduoseq
salpnys
Asea Ajeolsiso7
Shavins
uones!unwiwu!
|jBWS
junoode OU! Uaye} eq
eae
pue
ysnw ZuLesnjo Jo s}oeyA pauinbes you swesy Buljdues uo swiweigoid papuedxy
SA9DVLNVAGVSIG ASAUNS 40 AdAL
TTS BUC woe LEE SER Tea ht: ieat
is well suited to settings in which a sampling criteria. Both adaptations required very small
frame is not readily available (Henderson, Sunda- sample sizes and have been useful in identifying
resan 1982). In the 30 x 7 cluster sampling tech- areas urgently needing attention.
nique, 30 clusters and a starting point in each Although LQAS requires relatively small sample
cluster are chosen randomly, while subsequent sizes, the sampling technique is more compli-
houses are chosen according to fixed easy-to- cated than the EPI method. A shortcoming of the
follow rules until seven children of the appro- method compared to the EPI approach is that it
priate age are found. requires a sampling frame from which lots are
The use of this simple approach can be general- randomly chosen. In addition, the method does
ized to many other health service-related prob- not provide an estimate of the variable of interest
lems. However the effects of clustering (and the for the lot sampled. For example, it will not pro-
associated standard error) must be taken into vide an immunization rate or contraceptive utili-
account during the analysis (Bennett, Woods, zation rate. This is because the samples are too
Liyanage, Smith 1991; Coetzee, Ferrinho, Reinach small to provide acceptable confidence limits,
1993). and can only classify a lot as acceptable or not
acceptable.
Lot Quality Assurance Sampling (LQAS) Thus LQAS has limited use in estimating overall
Lot Quality Assurance Sampling (LQAS) has its coverage rates. EPI methods are preferable if the
origin in industrial quality control where a certain major interest is a community-wide coverage
number of items (for example, tins) in a batch or figure.
lot is tested. On the basis of the number of defect-
ive items, the whole batch is judged as being Rapid ethnographic assessment
defective or acceptable. Since testing is costly, Rapid ethnographic assessment can be used to
one would like to test as few items as possible. obtain important in-depth anthropological data
The researcher has to take into account two pos- about cultural and other factors that influence
sible risks: the risk of classifying a lot with less health-seeking behaviour, without requiring ex-
than a specified threshold of defective items as pensive large-scale studies.
being defective (‘provider risk’); and the risk of By the use of focus groups, health care provider
classifying a lot with more defective items than a interviews and natural group interviews (groups
certain threshold as acceptable (‘consumer risk’). assembled not for the purpose of the study but
The technique is increasingly being used in the who are formed ‘naturally’, such as roadside food
health care setting for health programme evalu- sellers, mothers waiting at a clinic), important
ation and continual monitoring (Lemeshow, information is obtained. They can provide in-
Taber 1991). When applied to health services sights into motivation, attitudes, feelings, and
management, an area under a particular manager behaviour related to the use of health facilities.
could be divided into subunits which are called Simple self-administered questionnaires can also
lots. An ideal lot is the smallest unit that could be distributed through an existing administrative
provide meaningful information when evaluating system. Rapid ethnographic assessment has been
a programme. Each ‘lot’ (which could be the geo- used in this way to identify maternal factors that
graphical terrain of a community health nurse) is influence the use of preventive health services
sampled and some characteristic of interest is and treatment compliance.
measured, for example, the use of contraception Often such qualitative assessments are the best
by women 12 to 49 years of age. If the lot is way of forming the basis for more substantial
rejected, some management initiative is insti- quantitative studies. Such assessments are crit-
tuted in that sub-area. This means that resources ical prior to designing interventions.
do not have to be expended in sub-areas that are
actually doing well and: can rather be applied Rapid environmental health assessment
where most needed. Small area surveys are also useful in rapid en-
Recently the technique has been adapted to vironmental health assessment. They were ori-
assess immunization coverage, and the quality of ginally developed for assessing environmental
care and adequacy of health services provided by health factor needs, for example, in the peri-
local primary health care units (Lanata, Black urban areas of Zaire (Bertrand, Mock, Franklin
1991). Coverage is judged adequate or inadequate 1988). The aim was to determine environmental
by comparing it with a series of predetermined indicators which could be simply, cheaply, and
160
16 rapid epidemiological assessment
quickly monitored as measures of community The use of sentinel surveillance could be ex-
health. Three classes of indicators were consid- panded to include a list of tracer conditions (that
ered: is, selected diseases of importance) reflecting a
@ those associated with the individual range of changes that need to be monitored and
@ those associated with the dwelling demand an immediate public health response.
@ those related to the broader social and physical Sentinel health event surveillance has been used
environment. for infectious diseases (for example, poliomyelitis
and acute flaccid paralysis) as well as in occupa-
Socio-economic status is defined according to tional health (for example, a report of malignancy
local circumstances. For example, in Zaire, a of the lung in a chemical worker should trigger an
quality of life scale was developed using items in- evaluation for possible work site carcinogens).
cluding the presence of electricity, type of water
supply, and type of food preparation. The assess- Preceding birth technique
ments are useful for defining the relationship Mortality data are not strictly classified as surveil-
between child health status and physical/social lance, but do provide important information for
environmental factors, and have their main use in health surveillance. Although infant and child-
peri-urban areas where there are informal settle- hood mortality are sensitive indicators of social
ments. welfare and the relative health of a population,
mortality surveillance is a problem in most devel-
Surveillance methods oping countries. This is because neither the
Surveillance refers to the need for ‘continual numerators (the deaths by-age) nor the denom-
watchfulness’ of the distribution and trends of inator (the population at risk) for conventional
selected health outcomes with a view to acting death rates are accurate or even available.
when certain boundaries are passed. Surveillance Besides the use of sentinel surveillance to
methods are needed to continually monitor decrease the problem of inaccurate reporting, an
change. This is particularly important during a alternative method of assessing childhood mor-
period of rapid socio-economic growth, urban- tality rates has been developed by the demo-
ization, and political change, when the impact.of graphers Brass and McCrae. It is called the ‘pre-
several variables, from a change in the bread price ceding birth technique’ and has been tried in
to an increase in the level of political violence, many developing countries (Hill, David 1988). All
could have major impacts on health. Most sys- it requires is a question to mothers at delivery
tems collect too much data, however, (often with about whether or not the child preceding the cur-
loss of quality and accuracy) and analysis or rent delivery is still alive.
action is not timely, or is based on too small a pro- This provides a useful simple method, requir-
portion of data collected. In addition, the best ing limited resources, of monitoring the child-
surveillance is usually found where the risk is hood mortality of an area at delivery facilities in
least! (See also Chapter 14.) that area. If the delivery rate at health facilities is
above 70%, the method has been shown to accu-
Sentinel surveillance rately reflect the true childhood mortality rate of
Because of the lack of resources and the burden of the community.
routine surveillance at all health facilities, a sam-
ple of facilities or sentinel surveillance sites can Verbal autopsies
be identified in order to measure the health status To overcome the problem of the high proportion
of the population without studying the entire of ill-defined deaths in developing countries,
population. retrospective maternal interviews (verbal autop-
An example of this is the influenza early warn- sies) have been developed in a standardized way
ing system used by the National Institute of Virel- to assist in determining cause of death. Algo-
ogy in South Africa. Absenteeism surveillance is rithms (a’series of questions which allow a dia-
carried out in a number of schools prior to deter- - gnosis to be made easily) for commonly occurring
mining the activity of the influenza virus on an childhood conditions have been tested in a
annual basis. This surveillance system enables number of settings (Kalter 1990). In general,
the authorities to determine the current virus verbal autopsies were shown to be able to diag-
type so that an appropriate vaccine can be devel- nose major illnesses contributing to death in chil-
oped and deployed. dren with acceptable sensitivity and specificity.
161
eee ee ——————E———— ee
This approach is particularly important in reduc- of exposure to improved water supply and sanita-
ing uncertainty about the causes of death that are tion on diarrhoeal disease (De Zoysa, Feachem
lurking inside the ‘ill-defined’ categories, as well 1991).
as validating the diagnoses given by police offi- Similar methods can be used to assess the
Cers. effectiveness (rather than efficacy) of vaccination.
Cases (with disease) are compared to controls
Community indicators of risk or health status with respect to their vaccination status. However,
Several approaches are used to identify ‘at risk’ one shortcoming of such a study can be that the
populations towards which public health efforts vaccinated group may differ from the non-vac-
or health services need to be directed. Groups as cinated group in ways which are related both to
opposed to individuals are the target of the study. the risk of disease and vaccine status. For exam-
One approach is to use proxy measures of health ple, poorer people are more likely to get tuber-
status to determine which communities are at culosis and also not to be vaccinated (Comstock
risk. 1990). The impact of this bias can be assessed and
For example, the nutritional status of primary can be minimized by selecting appropriate con-
schoolchildren is a useful indicator of community trols (for example, by using household controls to
malnutrition (if school attendance is high). Sim- limit socio-economic bias).
ilarly, night blindness is a useful proxy for vitamin The case-control design can also be used in the
A deficiency. Tracer conditions, for example diar- evaluation of the Pap test in preventing cervical
rhoea, can be used to evaluate the quality of pri- cancer through early case detection (cases are
mary health care in a community. Other indic- those who have cervical cancer) and the health
ators of health service functioning have been impact of family planning practice (cases are
developed. For example, in missed opportunity those classified by some standard means as
studies, each contact a client makes with the having ‘poor health’).
health service is seen as an opportunity for health Case-control studies nested inside large popu-
workers to intervene more comprehensively on lation-based surveys can also be very effective.
issues other than the original reason for the visit.
Missed opportunity studies are thus able to Conclusion
rapidly assess the extent to which, for example,
curative services perform preventive functions REA is not a new discipline, but it represents a
(see Example 16.2). new approach to epidemiological research. It
draws on many well-known methods and also
Case-control methods for evaluation of stresses speed and simplicity, adaptation to local
health services conditions, and the need to obtain information
Traditionally, the case-control design has been timeously at a level of precision demanded by ser-
restricted to risk-factor studies of chronic vice providers. The strengths and weaknesses of
diseases. Most health service utilization studies each method need to be identified so that it is
have been based on large population-based stud- always used appropriately. As with any epidemio-
ies which are expensive and may take a long time logical study, rapid assessments should be con-
to complete. Case-control studies are more rapid ducted according to high standards.
and efficient. They can provide reliable results
and are an effective means of evaluating health
programmes, products, and procedures, pro- EXAMPLE 16.1
vided attention is paid to confounding. Accessibility of condoms to teenagers
In a study in Columbia, users were compared to
non-users (as outcomes) in a child health service
visiting family planning services
study (Selwyn 1978). The non-user group is By distributing condoms and promoting their
usually not included in studies of health services. use, Clinics that provide family planning services
It is, however, important to study the character- can play an important role in preventing the
istics of non-users. spread of AIDS. To assess the accessibility of con-
In a Cebu study, children who were brought to doms to teenagers at family planning services in
the clinic with diarrhoea (cases) were compared Durban, a simple random sample of 12 clinics run
to children who were brought in with respiratory by the Natal Provincial Administration and the
complaints (controls) in order to assess the effect Durban City Health Department was selected.
162
16 rapid epidemiological assessment
The fieldwork was conducted during April and the percentage of children leaving curative health
May 1991 by four trained and supervised teenage facilities in the Western Cape who do not have
fieldworkers, who visited clinics on different days evidence of measles immunization, a study was
and at different times. They kept detailed notes of carried out over a two-week period from the end
their experiences, and these notes were subse- of January 1990, immediately before the start of
quently analysed for content. the national measles immunization campaign.
Clinics were found to be accessible to people The sample consisted of two major teaching hos-
using public transport, but some were difficult to pitals in the greater Cape Town area, two second-
locate due to lack of signposts. Fieldworkers (par- ary referral hospitals (one in central Cape Town,
ticularly the two female fieldworkers) were em- the other in a small country town), and four day-
barrassed and intimidated by security guards at hospitals (the latter serving communities of low
the clinics. Clinic staff were generally efficient socio-economic status).
and welcoming, but did not give a high priority to All parents or carers leaving the hospitals with
condom distribution, and gave them ina setting child patients aged 6 to 59 months were inter-
which lacked privacy. Information on how to use viewed at the exit from the hospital. Approx-
condoms, if given at all, was supplied by means of imately 1000 such exit interviews were con-
pamphlets. Information on AIDS was rarely ducted. Interviews were conducted by trained
offered. fieldworkers who were not health service em-
The recommendations of the study include the ployees. Only a few senior staff members at the
following: selected hospitals were informed of the purpose
@ signs providing directions to clinics should be of the study. ,
put up; In 47% of cases, the health personnel at the
adequate stocks of condoms should be kept; hospital had not asked to see the child’s health
@ privacy should be ensured; card. If the fact that the carer could provide no
@ clinic staff should use the opportunity to dis- documentation of measles immunization is taken
cuss AIDS prevention with teenagers attending as indicating that the child had not been immun-
the clinic. ized, more than 60% of children were missed
opportunities (that is, they were in a setting in
Reference: which they could have been vaccinated but were
Abdool Karim Q, Preston-Whyte E, Abdool Karim SS. not). If no documentation was available and no
‘Teenagers seeking condoms at family planning services. history of measles immunization could be pro-
Part I. Auser’s perspective.’ South African Medical Journal vided by the carer, between 2% and 39% of chil-
1992 82;356-9. dren (depending on the type of hospital and its
location) were missed opportunities.
Similar studies could be applied to many other
aspects of prevention that could be practised
EXAMPLE 16.2 cheaply and easily in the curative sector. These
Missed opportunities for measles studies can be carried out over a 24- to 48-hour
period and have a large implementation
immunization potential.
Measles is still a major cause of childhood mortal-
ity and morbidity in South Africa. The WHO Reference:
recommends that all contacts with health ser- Yach D, Metcalf C, Lachman P, Hussey G, Subotsky E, Blig-
vices, for curative as well as preventive health naut R, Flisher AJ, Schaaf H S, Cameron N. ‘Missed oppor-
care, should be used to immunize children who tunities for measles immunization in selected Western Cape
do not have proof of immunization. To determine hospitals.’ South African Medical Journal 1991: 79;437-9.
163
17 Community
studies
Introduction health in the Khayelitsha township of Cape Town.

defined the household as ‘anyone who shared the
In the absence of adequate routinely available
same pot or who had economic links with the
health data, sample surveys may be the only way household’ (Cooper, Pick, Myers, Hoffman,
to investigate health issues in specific commu-
Sayed, Klopper 1991). This definition was prob-
nities or the population at large. To investigate a lematic, as absentee members (for example, chil-
community, it may be logistically easier to con-
dren living away from home and elderly parents
duct surveys at certain institutions in the commu-
living in the homelands but dependent on the
nity, for example, health centres, the workplace or
income generated by household members) also
school, rather than in the general population.
had to be included. Ina study of diabetes in a sim-
However, if the sample is not drawn from the
ilar population, the household was defined as
general population, sections of the population
those who lived together (for at least four nights a
may not be represented, and results will not be
week) and shared the same pot (Levitt, Katzenel-
generalizable to the population. For example,
lenbogen, Bradshaw, Hoffman, Bonnici 1993). So
clinic attenders are not necessarily representative
boarders who rented space in a house but did not
of the community in which the clinic is found:
share food with the rest of the household formed
they may be poorer, sicker, or older than the
a separate household, but someone who lived in a
community. In work-based studies research is
shack alongside the main house and shared food
done on workers, who by virtue of being in the
with the main household formed part of that
workforce are healthier than the general popula-
household.
tion (‘healthy worker effect’). For these reasons, a
The sampling frame (that is, some listing of the
large proportion of epidemiological studies have
community from which sampling units are
to be conducted in the community.
chosen) should be as complete as possible. The
Community surveys are often descriptive stud-
sampling frame and the way it is compiled
ies which try to estimate health-related events in
depend on which unit is taken as the sampling
the community. However, other study designs
unit. If the sampling unit is a household or plot,
(for example, case-control and intervention stud-
street maps and aerial photographs often provide
ies) can also be community-based.
the most complete sampling frame possible.
Scouting or drawing up one’s own maps (for
Sampling _example, in new peri-urban communities) may
The researcher must decide whether the sam- be necessary in certain communities. In addition,
pling unit will be the plot, the household, the indi- estimates of the population size are sometimes
vidual, the street, or some other unit. The house- needed for stratification. For example, a propor-
hold is a useful and commonly used unit of sam- tional stratified sample of a city, with stratifica-
pling, but it is not a simple concept. Careful tion by suburb, requires population estimates of
consideration needs to be given to how to define the different suburbs or the relative proportions
‘the household’. For example, a study of women’s of the different suburbs’ population sizes. Even
164
17 community studies
when the number of households selected per sub- tions. Others might require a physical examina-
urb reflects the correct proportions per suburb, tion. All this needs to be taken into account before
differential household sizes between suburbs the exact location is determined. Equipment
may produce disproportional sampling at the must be light and convenient to transport. If data
level of the individual. That is, if one suburb has is to be collected at central venues, these must be
larger household sizes on average than another, acceptable to community members. If for any
these suburbs may be over- or under-represented reason there is a stigma attached to a particular
in the sample. venue, the response rate will drop and the validity
In general, sampling in a community setting is of the study will be compromised.
often complex and difficult, and may involve
many stages (see Examples 17.1 and 17.2).
Fieldworkers
Selection plans need to be predetermined so
that no selection bias occurs in the field. For Fieldworkers are the key to the success of a
example, if on a selected plot only one household community study. They may be interviewers or
(out of, say, four) must be studied, the convention recruiters, that is, individuals who motivate
may be to spin a bottle from the middle of the plot people to come to the central venue where others
and choose the house to which the spun bottle do the interviewing. Yet another group may do
points. Likewise, if sampling is done in stages, the physical measurements or observe features in the
household might be selected first and then one participants. Most commonly, fieldworkers in a
individual within that household may be ran- community study enter people’s homes and
domly selected for interviewing or measurement interview them there. s
by drawing numbers out of a bag. The researcher It is often helpful to have recruiters and/or
must ensure that sampling methods are not cul- interviewers from the community itself. Such
turally offensive. Some community members fieldworkers understand the community, speak
may become suspicious that ‘witchcraft’ is being the same language, understand what is con-
performed if the research involves throwing dice, sidered a stigma by the community, and could be
spinning a bottle, or picking names from a bag. useful in motivating people to participate. The
In some situations, a survey may include all the use of local interviewers, however, may put
people in an area, with no sampling taking place. people off. If the data being collected is sensitive
This occurs when the community is relatively in nature, community members might not feel
smal or when one of the purposes of the study is comfortable giving this information to people
to provide a full census of the population. they would occasionally meet in the street. There
might be competition for jobs and other
resources in that particular community, causing
Data collection resentment towards an interviewer who has
Data collection may occur in people’s homes, at a gained employment from the study.
central venue in the community, or even outside Researchers should ensure that the study is not
the community. Participants often have to be manipulated to benefit an individual or a particu-
motivated and encouraged to participate in the lar group. In a community where there are reli-
study. Many individuals may refuse to parti- gious, political, or other divisions, it is not advis-
cipate, and this must be taken into account able to choose people from only one camp as
during the planning stage of the study. This does interviewers. Data collection is a neutral process
not mean increasing the sample size, but implies and should not alienate any potential particip-
planning strategies to increase the willingness of ants. Interviewers need to be extremely sensitive
individuals to participate. In the community, to avoid further polarization of divided commu-
people are not a captive audience and therefore nities.
need to be met on their own ground and on their Careful attention needs to be given to issues
own terms. After-hour visits, repeat visits, and such as how the interviewers introduce them-
appointments with absent household members selves, how they dress (for example, do nurses
can help improve the response rate. wear uniforms or not?), and what security precau-
The location where data is collected also deter- tions they take.
mines exactly what information can be collected. Another fieldwork issue is that data collection
Some types of surveys require that the parti- in the community is often extremely hard work. If
cipants have privacy when they answer ques- people are not at home, repeat visits (often after
165
hours and on weekends) need to be made, and in

the end the selected person might still refuse to Table 17.1 Possible stakeholders
participate. Often researchers do not acknow- in a research project
ledge the work of interviewers, who may have
spent three days getting only a few interviews, the community at large
despite working very hard. The issue of produc- ¢ local leaders
tivity of interviewers can be very sensitive and at @ project health workers
times interviewers may try to fudge the data so as _ @ outside professionals
to appear hard-working. After all, this is their job, - @ local authorities
and they wish to produce what is expected! @ government bodies (e.g. to
Fieldwork co-ordinators need to ensure that ‘@ commercial companies —
the quality of the data remains good by closely ¢ universities and research councils
supervising workers and checking forms regu- the ‘researched’ — the subjects of study
larly. The co-ordinator(s) must also provide sup- target groups
port to fieldworkers, who are faced with an ex- @ churches ©
tremely difficult task. funders
@ welfare agencies
Dealing with the community researchers
| project staff
Permission to enter and work in the community
may need to be requested from any formal civic
body that exists in the area, as well as any other
social, political, or religious grouping that repre- community often have different interests, and
sents significant sections of the community. In a there are often different power blocks. The con-
rural area, this might mean getting permission cept of stakeholders is a useful one for under-
from the headman and in an urban area, from the standing the nature and extent of participation.
local authority, civic association, and/or local Stakeholders are individuals or groups who have
community organization. Different authorities particular interests in the project in that they are
may have to be approached for permission for affected by it or stand to gain from it in different
different areas and different groupings. ways (see Table 17.1).
The process of consulting with community Many people may be affected by the project,
representatives in this way is often cumbersome, but may not have power over it. A participatory
requiring many meetings, many appointments, approach may aim to include all or only some of
and many explanations. Inevitably, researchers the stakeholders. Certain stakeholders can be
with the best intentions are likely to leave out prioritized from the outset; this is a value choice
some groupings or leaders, especially where of the researcher. The involvement of targeted
groupings are very mobile and informal. How- people should be stated explicitly as a goal from
ever, it can be very important to go through these the beginning. Stakeholders may have a conflict
steps, especially in areas where there is suspicion of interests over time and resolution of the con-
and instability. flict will be important for the future of the project.
In a sense, organizing access to a community is Barriers to community participation do exist.
also the first step towards providing feedback of Professional health workers and researchers are
your results to the target population, in that usually distinguished from most community
permission is often given on condition that the members by their class position and their access
results are fed back after the study. to resources. They have often been trained in spe-
This enables the community to understand cialized urban-based institutions, and may be
what the problems are and to work from a more regarded by community members as outsiders.
informed position. This is especially important They often lack the necessary skills or insight
from an implementation point of view, if any fur- needed to cope with community dynamics and
ther interventions are planned for the future. problems. The legacy of oppression in South
In many community studies, community Africa can exacerbate the miscommunication and
participation is a stated goal of the project. How- suspicion.
ever, the ‘community’ cannot be considered as a Facilitators for communication are listed in
homogeneous group: different people in the Table 17.2.
166
17 community studies
needs constant problem solving and adaptation

Table 17.2 Facilitators for to cope with community dynamics and pressures.
Creel Details of the study must be well-planned and
organized, but also flexible.
@ Active listening rather than top-down com-
munication and giving of advice.
@ Consultation should not be limited to EXAMPLE 17.1
individual leaders: broad consultation The prevalence of late-onset diabetes
should be encouraged. An activity initiated in the Cape Town African townships
by a well organized group or structure
(preferably with a grassroots base) has the The prevalence of late-onset diabetes in the
best chance of survival.
urban African community in Cape Town was the
A framework for dialogue needs to be set
focus of a study done in 1990. A multi-staged pro-
up, uSually in the form of a contract in which
portional cluster sample (1 000 in all) of the com-
mutual responsibilities and conditions for munity in the townships was drawn from an array
compensation and benefit are ciearly set
of official maps, scouted maps, and aerial photo-
out. This facilitates a sense of ownership
graphs. Fieldworkers who acted as motivators
had to visit clusters of homes, hostels, shacks and
and accountability.
Respect the means (process), not only the
tents, take a census of everybody in the house-
end (outcome) of the project. This means
hold, and randomly (by means of picking a num-
that participation takes time and cannot be
ber out of a bag) choose a person over the age of
30 as the candidate for the study. The selected
rushed.
@ Non-directive approach.
individual then had to be motivated to attend a
clinic where a blood test would be performed, a
questionnaire administered, and anthropometric
The researcher(s) may also set unrealistic goals measurements taken. An additional requirement
for community participation and empowerment, was that participants had to fast from 10 pm on
which may create unnecessary tension and stress. the night preceding the clinic visit.
An understanding of the structure and composi- Transport was provided to and from their
tion of the community will assist in setting realis- homes. Participants were provided with a meal
tic goals. The level of internal organization of the and a food parcel on completion of their inter-
target group and the material conditions of their view and measurements. If a selected person
lives will determine to a large extent the level of refused to participate, he or she was not replaced
participation possible. Professional workers and by another person.
community members often have different ex- Permission to conduct the study was negoti-
pectations of the project, creating conflicting ated with a variety of governmental and non-
agendas. Material benefits may be a priority for governmental groups — health authorities, town
some groups, and some may doubt their own councils, community and civic organizations, and
capacity for participation and underestimate non-governmental health organizations. The
their potential contribution. Often there may be same interviewers were used throughout the
no representative or credible bodies to work study, but motivators covered only the area in
through, and true consultation may be difficult. which they themselves lived, because much of
the work required after-hours (early morning and
evening) involvement.
Conclusion The logistics of the study caused it to run into
Studies conducted in the community are very dif- many practical, financial, unrest, labour-related,
ferent from those based in a clinic, hospital, and respondent-related problems. After six
institution, or centralized location. In the com- months, the final response rate of 76% was
munity, many factors impinge on the study. Com- beyond expectation and was a result of an ex-
munity, political, social, and economic events treme effort by all involved.
may interfere or facilitate the success of a study. Newly diagnosed diabetics and hypertensives
The time of sunset and climatic conditions are were traced personally and referred for treat-
also factors to be considered when planning a ment. Non-diabetics were informed of their
community study. Day-to-day management results by post. Unfortunately, a number of
167
i = EE
notices were not delivered due to problems with The study was initiated by the District Medical
postal services in squatter areas. Officer of Health, who approached headmen in
This example is one of a typical complex urban- the area for permission. Villages, stratified by
based community study. The logistical problems village type (traditional, resettlement, and urban),
of covering multiple areas, bringing people to a were randomly chosen, with all households in
central venue, using relatively invasive tech- selected villages being included in the sample.
niques (taking blood), and following up people Village health workers (VHWs) who were part of
identified as positive are exemplified in this the district health service were trained to admin-
study. The fact that the study was conducted in a ister questionnaires to all women of childbearing
politically turbulent period exacerbated many age in their villages. Focusing on children born in
problems. The researchers identified a need for the previous five years, the infant mortality rate
extra training of researchers in the future in pro- was found to be 42 per 1 000; infants who were at
ject management, including conflict resolution greatest risk were those who were one of a twin,
and financial management. born at home, or living in a resettlement village.
Gastroenteritis was the most common cause of
Reference: death.
Levitt N S, Katzenellenbogen J M, Bradshaw D, Hoffman M Feedback in the form of pie charts, histograms,
N, Bonnici B. ‘The prevalence and identification of risk and other illustrative material was given to the
factors for NIDDM in urban Africans in Cape Town, South VHWs at a meeting. Discussion of the results led
Africa.’ Diabetes Care 1993;16(4):601-7. to improved VHW understanding of the issues
and enabled VHWSs to integrate their new know-
ledge and experience into their work.
This study illustrates a typical rural commu-
EXAMPLE 17.2 nity-based epidemiological study where existing
infrastructure was used to the maximum, and ser-
The profile of infant mortality in vices strengthened by the long-term involvement
Hewu, Ciskei (1986) of outside researchers.
In 1986, a community survey was conducted by
researchers of the MRC in the Hewu district of the Reference:
Ciskei to determine the infant mortality rate in Yach D, Katzenellenbogen J, Conradie H. ‘Infant mortality
the region and to look at risk factors for infant rate study: Hewu District.’ South African Journal of Science
deaths. 1987; 83:417-21.
168
18 Knowledge,
attitude, belief,
and practice
(KABP) surveys
Introduction effects of the condition or illness

¢ their perception of the efficacy, costs, and bene-
Many of the major health problems facing society fits of any proposed actions
today, including the HIV/AIDS pandemic, heart
disease, cancer, and injury, amongst others, In addition, it is suggested that events or situ-
require public health interventions to change ations may trigger the necessity, or urgency of
personal behaviour. Personal health-related health-related decisions or actions.
behaviour may be influenced by many factors. Several other theories have been combined
Knowledge, Attitude, Belief and Practice (KABP) with the Health Belief Model in an effort to under-
surveys are based on the theory that individuals’ stand the role of psychological factors in health-
knowledge (facts), combined with their attitudes related behaviour. Attribution theory, as applied
and beliefs (positive or negative feelings and to the health field (for example, Hewstone 1983)
opinions), may predict their health-related beha- emphasizes two key concepts — locus of control
viour. KABP surveys aim to measure these psy- and locus of causality — both of which concern
chological and personal variables in order to the individual’s perception of events as being
better understand why people act the way they do under personal control in contrast to being deter-
(often in ways which are detrimental to their own mined by external influences. So, for example,
health), so that more effective programmes may people with limited locus of control may feel
be developed to reduce the toll of these diseases. powerless to change their lifestyle or behaviour
because they feel influences outside themselves
Social psychological theories as control their lives.
Bandura’s Social Learning Theory (Bandura
applied to health-related behaviour 1992) stresses the need for personal, social, and
Several social psychological theories address the behavioural skill development to enable indi-
question of how behaviour is determined by atti- viduals to put into practice their convictions re-
tudes and beliefs. The most prominent of these is garding health-maintaining or health-promoting
the Health Belief Model (Janz, Becker 1984), behaviour. So, those who want to change their
which attempts to identify beliefs and the way behaviour may need ‘training’ (for example, time
they may interact to influence individuals’ con- management skills or assertiveness training) to en-
scious decisions to undertake certain health- able them to behave in a health-promoting manner.
related actions.
Ashort history of KABP surveys
The most important health beliefs that influence
actions are thought to be: KABP surveys were originally developed as KAP
¢ the person’s perceived vulnerability to a par- surveys to assist the development and imple-
ticular condition or illness mentation of family planning programmes
the person’s perceptions of the severity or (Bulmer, Warwick 1983). The first surveys were
169
concerned with knowledge of contraceptive

methods and reproductive functioning, attitudes Table 18.1 Important steps in
towards family size, child spacing, contraception, rite Cad ire Bele
sterilization, and abortion, and practice or use of
birth control methods (Population Council 1972). 41 Consult educationalists, programme man-
The extensive application of KAP surveys in the agers, programme participants, social sci-
field of population research led to the concept of entists, and health workers who will use the
the KAP gap (Bongaarts 1991; Westoff 1988). This information or who will develop the health
concept referred to the common finding of a gap promotion programme.
or discrepancy between women’s expressed 2 Review literature to identify which variables
desire to limit or space their children (usually - may be associated with the behaviour of in-
high), and their reported use of contraceptive terest. 2
methods (usually low). Within this context, the 3 Explore, using qualitative, open-ended or in-
KAP gap was believed to be a measure of the un- formal methods, the characteristics of the
met need for contraception, particularly among target audience, and identify what issues
women in developing countries and in countries are important to their understanding of the
with strong religious or ideological opposition to topic, and how they express their concerns
family planning. in their own language.
Subsequently, KABP surveys have been applied 4 Design the questionnaire, making sure that
to a wide range of problems, including tobacco terms are unambiguous and non-leading,
and alcohol consumption, adherence to medica- and attempt to build in ways of evaluating
tion schedules, use of preventive health services, reliability and validity.
traffic safety, and many others. KABP surveys re- 5 Administer the questionnaire after identify-
emerged strongly in the 1980s in response to the ing the sample. Ensure a high response
need to curb and prevent the spread of HIV infec- rate, and maintain confidentiality.
tion. In 1987, the Global Programme on AIDS of 6 Analyse the data. Interpret the results crit-
the WHO produced a KABP survey instrument ically and consider what effect each aspect
which has subsequently been applied extensively. of the methods may have had on the re-
sults.
What are KABP surveys? 7 Report back to all involved in the research
and synthesize into pilot programme recom-
KABP surveys are based on the theory that an mendations.
individual’s health-related behaviour is influ-
enced by their knowledge of the disease and
necessary health-promoting actions to prevent or @ Programme design: to guide programme devel-
ameliorate the condition, as well as their attitudes opment and implementation by identifying
and beliefs (positive or negative feelings and psychological, social, and behavioural factors
opinions) towards the disease or health-promot- associated with health problems.
ing actions. KABP questionnaires aim to measure Evaluation: to evaluate the success and cover-
these psychological variables of knowledge, atti- age of programmes by measuring changes in
tudes, and beliefs, and to measure the health- KABP factors. Repeated KABP surveys may
related behaviour these variables are believed to identify the shortcomings or strengths of an
predict. KABP surveys usually take the form of existing programme. This knowledge may help
interviewer-administered or self-administered in the modification of a programme to make it
standardized questionnaires. The information is more effective.
collected to assist with the design, implementa- @ Advocacy: to generate support for health pro-
tion and evaluation of health programmes. grammes by demonstrating existing attitudes
The uses of KABP surveys may be categorized and needs.
as follows. @ Advancement of social and health sciences: to
¢ Description: to describe personal health beha- increase understanding of associations be-
viour and the demographic and psychological tween social or psychological factors, and
characteristics associated with this behaviour health problems.
among aspecified population at a particular See Table 18.1 for important steps in conducting a
point in time. KABP survey.
170
18 knowledge, attitude, belief, and practice (kabp) surveys
Measurement of knowledge, atti- ‘Condoms are good at preventing pregnancy if

tudes, beliefs, and behaviour used properly.’
STRONGLY AGREE, MILDLY AGREE, MILDLY
The characteristics measured by KABP surveys DISAGREE, STRONGLY DISAGREE, DO NOT
are difficult to define and quantify. For a long KNOW. :
time, people have been trying to define these con-
cepts, debating whether or not they are the most And from the section dealing with beliefs:
important variables to predict behaviour, and ‘What are the chances that you yourself might
developing appropriate psychometric instru- catch AIDS? Would you say that it is ...?”
ments to measure these variables. Unfortunately, NOT LIKELY AT ALL, VERY SMALL CHANCE,
some KABP surveys may treat the measurement MODERATE CHANCE, GOOD CHANCE, DO NOT
of the psychological variables too simplistically. KNOW, NOT SURE.
Knowledge The response possibilities in these questions are

Knowledge poses the fewest problems of mea- in the form of Likert scales.
surement. The most important items of know-
ledge need to be defined. These may be based on Behaviours/practice
the content of health education programmes, or In terms of the measurement of behaviours or
on common misconceptions. Questions need to practice, it has to be kept in mind that survey
be devised which are unambiguous and which do questionnaires assess reports of behaviour and
not pre-empt, lead, or steer the respondent into not behaviour itself. Reports of behaviour are
certain answers. Kelsey, Thompson, and Evans subject to a number of influences which can
(1986) provide helpful guidelines on how to potentially distort them.
develop questions for surveys in general.
A typical question dealing with knowledge is @ Social desirability: People may be reluctant to
taken here from the WHO Global Programme on divulge information about themselves which
AIDS Research Package (1990): portrays them in a bad light, and may say that
they engage in behaviours which are socially
‘Do you think that a person can be infected and desirable even when this is not the case. This
have the virus that causes AIDS but not have tendency is likely to be most pronounced with
any symptoms?’ behaviours which carry severe social sanction,
such as illegal activities, but may extend to
The response possibilities are: many health-related behaviours, especially in-
YES, NO, DO NOT KNOW, NOT SURE. timate, sexual, and associated social beha-
viours. This tendency may also be observed in
KABP surveys may also raise novel questions the reporting of attitudes and beliefs. It may be
about the health topic in respondents’ minds and stronger if the interviewer fails to convince the
some feedback or information (debriefing) may respondent that strict confidentiality will be
be helpful after the survey interview. maintained, or if there are hints that the inter-
viewer judges certain attitudes or behaviours in
Attitudes and beliefs a negative way.
The most common approach to measuring atti- Recall: People may forget details of their beha-
tudes and beliefs is to ask respondents to indicate viours, particularly those of minimal psycholo-
how far they agree with selected statements. The gical significance to them. Some questions may
intensity of the attitude is quantified using require attention to detail which respondents
measurement scales such as Likert scales, forced- have neither the ability nor the motivation to
choice descriptions or Semantic Differentials. For recall. A different phenomenon (known as tele-
more detailed information on how to develop scoping) occurs when certain events are so
measurement scales and items, see DeVellis significant that they are compressed in
(1991) and Maloney and Ward (1976). memory into the reference time period. In
For example, a question from the WHO AIDS other words, if you ask someone whether an
questionnaire in the section dealing with atti- event which may have been particularly
tudes towards condoms is phrased as follows: important to them (for example, whether they
have been hospitalized, lost a child, been
Pal
arrested, or had an unplanned pregnancy) ity of a questionnaire usually depends on com-

occurred within a specified time frame, say, paring the answers on the questionnaire to an
within the last two years, they may report its external criterion. However, KABP surveys mea-
occurrence even if it happened earlier than two sure hypothetical variables which are not directly
years ago. observable. Validity therefore has to be inferred
from the extent to which different methods of
In general, questions which are simple and which measuring the same variables or constructs agree
refer to recent and short time periods are likely to with each other, and the extent to which expected
obtain the most reliable information in response. relationships among variables, and in predicting
behaviour, are observed. For more discussion of
how to investigate reliability and validity, see De-
KABP surveys as one type of survey
Vellis (1991). (See also Chapter 9.)
procedure
KABP surveys form part of a wider class of sur- Data analysis
veys. Therefore, issues of sampling, questionnaire KABP survey data are normally analysed descript-
design, interviewing and data analysis, which ively, with response preferences being associated
apply to surveys in general, apply also to KABP with demographic and sociological character-
surveys. Issues specific to KABP studies will be istics.
outlined here.
Specific criticisms of KABP studies
Sampling
KABP surveys rely on probability sampling to KABP studies have come under considerable crit-
ensure that the results are representative of the icism. Some critics have argued that the meth-
population of interest in the study. Bias (system- odology is so seriously flawed that KABP surveys
atic or non-random error) may be introduced if a may lead to a systematically distorted under-
large percentage of people who should have been standing of health phenomena, particularly in
interviewed are not, either because they refused, cross-cultural settings. Some specific criticisms of
or because they could not be located by the inter- KABP studies include the common finding that
viewers. For example, if those who are strongly knowledge, attitudes and beliefs are poor predic-
opposed to family planning refuse to be inter- tors of behaviour, that KABP surveys are too rigid,
viewed, the results of a survey will portray more that they lack reliability and validity, and that they
favourable attitudes towards family planning provide simplistic answers to complex questions
than truly exist among the population being about the determinants of behaviour.
studied.
KAB do not predict P (practice)
Questionnaire design In general, research indicates at best a weak con-
KABP surveys employ standard questionnaires, nection between knowledge, attitudes, beliefs,
usually with fixed or closed-form response altern- and behaviour (Tanur 1992). It is important to
atives. To maximize reliability and standard- realize that not all attitudes and beliefs are alike;
ization, these questionnaires are administered some are strong in the sense that they have pro-
according to a fixed question order. Interviewers found effects on individuals’ thinking and beha-
are not encouraged to allow the respondent to viours, and may resist very strong pressures
expand on certain points, because the same ques- toward change. Other attitudes and beliefs are
tions must be asked in the same way. weak, vulnerable to situational pressures, and
Questionnaires should be designed to have a have little, if any, impact on an individual’s think-
high reliability and validity. Most of the time, reli- ing or action. Attitudes which are very strongly
ability refers to the degree of consistency found held may be better predictors of behaviours than
when the same questionnaire is administered to weaker ones. Attitudes and beliefs which are
the same respondent at a later stage (test-retest closely linked and specific to the behaviour of
reliability), or the same questionnaire is adminis- interest may be more important than more
tered by a different interviewer (inter-rater reli- general attitudes and beliefs.
ability). In addition, different questions asking An important criticism of KABP surveys is that
about the same topic should elicit consistent they may ignore the social and material context in
replies. On the other hand, determining the valid- which behaviour or practice occurs. For example,
172
18 knowledge, attitude, belief, and practice (kabp) surveys
the competing influence of the opinion of a The reliability and validity of KABP surveys
woman’s male partner may override the woman’s Where they have been specifically examined, the
own opinions about unprotected sexual inter- reliability and validity of some existing KABP sur-
course. Under certain circumstances, situational veys have not been as good as frequently assumed
factors, such as drug or alcohol use, economic (Okediji 1973; Stone, Campbell 1984). For exam-
imperatives, environmental factors, peer pres- ple, a study from Bangladesh showed that one-
sure, and social norms may play a greater role in fifth of men and one-third of women who used
predicting behaviour than the individual psycho- contraceptives in a sample denied this in a subse-
logical variables usually included in a KABP quent KAP survey (Bulmer, Warwick 1983).
survey. The availability of existing KABP survey ques-
tionnaires may encourage their uncritical use in
The rigidity of KABP questionnaires new settings. The re-use of questionnaires used in
KABP surveys rely on pre-coded questions. They other studies in new settings is useful to facilitate
may, therefore, provide answers many respond- comparisons across studies, but should not stop
ents might not otherwise be capable of formulat- extensive piloting to iron out the problems of
ing for themselves or find suitable for themselves. using the questionnaire in the novel setting.
It has been pointed out that KABP surveys assume Furthermore, the reliability and validity of the
the respondent already has a well-formed evalu- questionnaires used should be tested through
ative opinion on the subject in mind (that is, they methods such as repeat questions within the
have thought about the subject and have views questionnaire, intensive probing, combining
about it) and merely needs to access this response open and closed questions, repeat interviews
and report it to the interviewer. From this point of with single respondents, interviews with altern-
view, the attitude is assumed to be accessible and ative sources (for example, husbands, if wives
spontaneous (Tanur 1992). However, when the were the initial primary target), and so on.
respondent has no prior attitude (when they have
not thought about the topic or their relation to it), Simplistic approaches to KABP surveys
they may have to construct a response on the No one would deny that the relationships be-
spot. In this case, the attitude may be transient, tween health-related behaviour and the charac-
unstable, and somewhat meaningless for predict- teristics measured by KABP surveys are complex
ing behaviour assumed to be related to the atti- and difficult to quantify. However, KABP surveys
tude or belief. Attitude and belief questions in are sometimes substantially over-interpreted. For
KABP studies are seldom preceded by enquiries example, many KABP studies of family planning
about how much respondents have thought and HIV prevention are based on interviews with
about the matter beforehand, if at all. Some individual women of reproductive age. This im-
responses to KABP questionnaires may be efforts plies that sexual and reproductive decisions are
at politeness to apparently irrelevant queries, an individual matter, which is largely controlled
especially in developing countries where subjects by women. Simply interpreting responses indi-
are unaccustomed to being interviewed, and may cating a willingness to prevent HIV transmission
be overwhelmed by the presence of a stranger in or prevent pregnancy to mean that condoms or
their home and the pressure to be hospitable. birth control methods will be used by these
KABP surveys may neglect to collect inform- women, and that they are probably the best
ation on attitudes and beliefs which are most method for dealing with the problem, may over-
important to the respondents, if not originally look the many social and other impediments that
conceived as important in the survey. As the con- bar these women from actually carrying out their
tent of KABP surveys is defined by researchers, stated intent. Further interviews with women’s
they may overlook issues important to the partners may also be informative.
respondents. This is most likely to occur when the Another example which shows that the know-
researchers are unfamiliar with the social and cul- ledge, attitudes, beliefs, and practices of others
tural conditions of the communities being stud- may be more important than that of the people
ied (Ratcliffe 1976; Stone and Campbell 1984), whose health is directly affected, comes from a
and particularly when the cultural and social gap South African study. In this study, knowledge,
between the researcher and the researched com- attitudes, and practices with regard to hyperten-
munity is wide. sion and its management were investigated
among family practitioners (Pick, Steyn 1992).
173
Family practitioners are likely to be primarily survey. Extensive exploratory research and pre-
responsible for ensuring appropriate detection testing, some of which needs to make use of
and management of hypertension among their qualitative methods, is essential for the com-
patients. Therefore, hypertension management petent design, execution, and interpretation of
aids to assist family practitioners may be more KABP surveys.
effective in controlling high blood pressure in the
community than simply directing education at Conclusion
the community itself.
KABP surveys may provide useful baseline in-
formation when designing health programmes
Problems of applying KABP surveys that aim to influence personal behaviour, and re-
in developing countries and cross- peated KABP surveys may provide useful inform-
cultural settings ation about the effectiveness of programmes.
How the information from a KABP survey can be
Many problems of surveys become exaggerated
used in the design, implementation, or modifica-
when surveys are applied in developing coun-
tion of health programmes needs to be planned
tries, and especially among largely poor and une-
ducated populations. It is not so much that there
from the outset. Too often KABP studies are done
are problems which are unique to these circum-
without bridging the gap to implementation and
building this information into intervention. It is
stances, but that they are more frequent and
essential that educationalists, programme man-
severe (Bulmer, Warwick 1983).
agers, programme participants, social scientists
Translation and health workers are involved in the research
It is difficult to ensure that translated question- process.
naires truly represent the original. This is espe- Several possible problems in the design and
cially important if a study uses questionnaires in execution of these surveys may threaten the
more than one language. Translation poses fur- validity of the research findings. It is important to
ther problems when the target language(s) lacks bear this in mind, and to interpret data critically
the vocabulary suited to the goals of the ques- and cautiously. To clarify the role of the many
tionnaire. In many instances, these complex lin- social and psychological factors in public health,
guistic issues are left to interviewers to navigate and to better assist the development of health
on the spot. This may place unrealistic demands promoting programmes, data from KABP surveys
on interviewers. To evaluate comparability, it may need to be combined with further inform-
may be necessary to translate the original into the ation, including that arising from in-depth and
target language, and then to get someone else to qualitative research.
back-translate the new questionnaire into the
original language. Discussion of the translated
and the original version of the questionnaire by EXAMPLE 18.1
bilingual people may be of further benefit. The use of KABP surveys in a school
Socio-cultural context
setting
A proper understanding of the socio-cultural con- A KABP survey of 377 high school students in
text is essential to being able to conduct research Cape Town found that students had a poor and
(see also Chapter 26: Psychiatric epidemiology). superficial knowledge of how HIV was trans-
Pilot and pre-testing of basic ideas, question- mitted, and had fragmentary and inaccurate in-
naires, and field strategy is essential for achieving formation about how to protect themselves. Few
accessibility to the population and discovering recognized AIDS as a threat to themselves or their
relevant cultural practice and expression. It is community, and people with AIDS were viewed
essential to involve individuals from local com- with hostility. Three-quarters of the students
munities in all stages of the research. were sexually active, and almost all expressed a
Qualitative research procedures, such as strong desire to learn more about the disease
anthropological methods, in-depth or unstruc- (Mathews, Kuhn, Metcalf, Joubert, Cameron
tured interviewing, participant observation, and 1990). This descriptive information was used to
focus groups, may be a useful starting point in lobby support for school-based AIDS education
formulating the questions asked in a KABP programmes, which at that time had received
174
18 _ knowledge, attitude, belief, and practice (kabp) surveys
little attention.
The information also formed the basis of the
development of a pilot education programme.
KABP surveys were undertaken before and after
this pilot programme was implemented, and at
similar intervals at a comparison school. These
surveys showed that there were significant
increases in students’ knowledge of HIV transmis-
sion and prevention, and significant increases in
students’ reported acceptance of people with
AIDS following the programme. However, there
were no significant changes in behavioural inten-
tions. Again, this information was used to garner
support for further school-based AIDS education
programmes, and to modify and improve the pro-
gramme.
Reference:
Kuhn L, Steinberg M, Mathews C. ‘Participation of the
school community in AIDS education: An evaluation ofa
high school programme in South Africa.’ Aids Care 1994;
6:161-71.
EXAMPLE 18.2 Knowledge and education about dangerous health

Survey into attitudes, beliefs and practices don’t necessarily change behaviour
practices regarding smoking
Using a population-based sampling frame, a
representative sample of adult South Africans
weré selected and interviewed by professional
survey workers. The subjects were interviewed
about their knowledge of the harmful effects of
smoking, their attitudes and beliefs towards to-
bacco control measures, and their own smoking
practices.
The study found that the majority of smokers
(68,5%) knew that smoking was harmful to health,
but that a larger proportion of non-smokers
(79,7%) knew this to be the case. Most people
interviewed supported public policies to control
tobacco consumption, such as stopping the sale
of cigarettes to children, increasing taxes on ciga-
rettes, and preventing advertisement of tobacco
products in the media. The study suggests that
there is widespread support among the public for
legislative measures to create a social environ-
ment which is conducive to reduce smoking.
Reference:
Martin G, Steyn K, Yach D. ‘Beliefs about smoking and
health and attitudes towards tobacco control measures.’
South African Medical Journal 1992; 82:241-5.
175
19 Qualitative
methodology:
an introduction
Introduction contexts are seen as being shaped by the norms

and conventions of the society in which they are
Many of the most pressing questions about
found. All these norms, conventions, and social
human health care are linked to the attitudes and institutions find their final form in cultural and
perceptions of both caregivers and users of health belief systems. The personal and social meaning
services. Sometimes people behave in ways or that people use to structure their lives cannot just
hold views which are difficult to understand. For be treated asa statistical variable, but needs to be
instance, some people stubbornly refuse to take
explored if the researcher hopes to answer many
their medication, no matter what. If you try to
basic research questions.
understand their motivation, they may tell you a
The qualitative approach is subjective in that
number of things that you did not think were
the focus is on how the respondents experience
important, but are important to them.
and understand the particular situation.
Qualitative methods allow researchers to
understand how the subjects of research perceive
their situation and their role within this context. It Uses of qualitative research
is out of these perceptions that behaviour, includ-
Four instances in which qualitative research can
ing health-related behaviour, is born. Qualitative
be used are outlined below.
research can help us find out why these beha-
viours occur or why people hold these views.
1 Qualitative research may be used in the early
Qualitative methods are a collection of formal
phase ofa study when the researcher wants to
research methodologies that allow researchers
get the feel of an area before starting with a
to get in-depth information on their subjects,
survey, or to clarify the basic conditions that are
generally by talking to them or observing them.
present there. Qualitative research can be used
Qualitative data consists mainly of descriptions of
onan exploratory basis to establish hypotheses
people or places, or of conversations. Such infor-
for future research.
mation cannot easily be handled by statistical
bh Qualitative research can be used along with
procedures, and these procedures are generally
other types of research in order to get an addi-
inappropriate.
tional perspective on the problem. It can also
provide the meaning behind a superficial
Philosophical approach connection made bya statistical survey.
Underlying the qualitative approach is a philo- wo It is often the only method used when the aim is
sophy concerning how behaviours and social pro- to get an in-depth sense of what people think of
cesses are determined. This philosophy argues a particular object or event.
that attempts to explain the social world in terms 4 The in-depth interaction makes this approach
of overt behaviour miss the most vital part of it, particularly appropriate when used as part ofa
namely that each behaviour or action carries process to establish some form of action or
meaning which needs to be explored. People and campaign in a community.
176
19 qualitative methodology: an introduction
Methods of data collection ing other points if they seem important during the
course of the interview. No sequence is imposed,
Research objectives are formulated broadly and and each point can be the beginning of a series of
not framed around variables which are defined in probing questions. The greater the level of struc-
rigorous measurable dimensions, or in terms of ture provided, the less spontaneity the inter-
specific hypotheses. The emphasis is on invest- viewee is allowed.
igating the variables in their complexity and in An appointment should be organized well in
context. Overviews of the three most commonly advance at a mutually acceptable place, so that
used methods of data collection will be provided time is allocated and the respondent can prepare.
here to give a flavour of the research style. It is When organizing the meeting, the nature of the
important to note that the style of the research research should be explained. A person’s unwill-
method used has to be adapted to the situation, ingness to be interviewed must be respected, but
and the demands of the research question. reservations should be discussed if possible.
The initial phase of the interview is very impor-
Depth interviews tant, as this is when trust is established or lost.
This is probably the most common form of qualit- Openness and frankness about the nature of the
ative data collection. It generally takes the form of research will help to gain trust at this point. Con-
a discussion between the interviewer and inter- firm that the person is agreeable to having the
viewee about the area of research. The interview interview taped, reaffirm the nature of the re-
is commonly recorded on a tape recorder (with search, and re-establish consent. Privacy and
permission from the interviewee). The inter- quiet should be ensured where possible.
viewer has to direct the discussion to some extent The interviewer asks an open question, and
so that the required information can be obtained, then by the use of summarizing techniques and
but respondents are allowed to talk and cover the questions of clarification, draws out the meaning
area in their own terms and from their own and sense of reality that the interviewee places on
perspective. This method is generally used when the issues under question. Summarizing involves
detailed information is needed from individuals. reflecting back to the interviewee what was said in
Examples of its use would be to determine how such a way as to provide an immediate check on
TB patients feel about a new treatment plan, to the interviewer’s understanding, and to allow the
assess the illness experience of a person with interviewee to further develop his or her thoughts
cancer, or to find out how STD patients feel about on the question. Clarification questions serve to
the idea of using condoms. get explanations on issues raised by the inter-
The major advantage is that it gives the re- viewee about which the researcher is unclear.
spondent the opportunity for personal explana- The atmosphere should be kept cordial and
tion and detailed responses. In addition, the indi- open. An interview should preferably not last
vidual focus allows the interviewer to draw out more than an hour in most cases. It can be carried
more detail while the respondent is talking and on later if more time is required.
thinking about the subject. The major disadvan- The interview needs to be concluded with great
tage is that, compared to the other qualitative discretion and care. Any emotions and hopes that
methods, the respondent is more removed from the interview raises should be respected and
his or her own context and may feel threatened. talked through, as appropriate. At the end of the
This could result in bias. interview, the respondent must be told where the
Considerable preparation must take place research will be going from this point, should be
before going into the field. The researcher needs reassured about confidentiality, and, where pos-
to be clear about the purpose of the research, sible, the offer of later feedback should be made.
have a good knowledge of the setting in which the
research takes place, and have read widely in the Focus group interviews
area to be researched. Interviewers may also have The focus group method involves a number of
to prepare themselves emotionally, as they often people meeting in a group in which the parti-
have to deal with difficult and emotive issues. cipants talk to one another under the guidance of
Most commonly, the interviewer will enter the a facilitator. The purpose is to generate relevant
field with alist of points to be covered in the inter- ideas and information around a pre-arranged
view. These points give direction, but ideally topic. The aim of focus groups is to provide
should not inhibit the interviewer from discuss- insights into the attitudes, perceptions, and
177
d= = common approaches in epidemiology
ee — ee
opinions of participants. corder. There are a number of special issues that

Focus groups can be set up ina relatively short the facilitator should be aware of while running
time. They can produce results that may directly the group. These include the following.
represent how people are feeling as they attempt ¢ The group must remain focused and discus-
to re-create the social situation. People are often sions on Side issues should be limited.
stimulated by the discussion, and reveal facts and @ Difficult personalities such as dominant
opinions that they might not otherwise have people, excessively shy people, those who
chosen to reveal. It may also give group members ramble, and argumentative members need to
the chance to clarify their attitudes and beliefs. be monitored and controlled so that they do
Focus groups can be used flexibly, for example, not disrupt the group. It is important to geta
to evaluate health projects and educational at- balance of participation between the different
tempts, and to obtain perceptions of what the members.
major needs in a community are. Focus groups ¢ Potential group members need to be reminded
can help to build community involvement in the of the meeting by the facilitator (or some other
research or the intervention being evaluated or intermediary person) in advance, to minimize
planned. The major disadvantage of this method the problem of cancellations and no-shows.
is possible peer pressure within the group, which @ The facilitator needs to guard against his or her
may prevent the members from saying what they powerful role biasing the group.
believe. @ The facilitator will at times have to use probes,
Ideally, a group should number between six such as direct questions, to elicit additional
and ten members, plus a facilitator and observer information of interest. Probes should not be
or recorder. Membership of the group should be overused, as this will inhibit discussion.
fairly homogeneous. If the subject is sensitive,
care needs to be taken in mixing groups. For At the end of the discussion, group members
example, if the subject is sexuality, one should should be thanked for their participation and told
consider interviewing men and women separ- what will happen to the information that was
ately. gathered in the group. For every group discus-
The selection of group members should aim to sion, there should be an audio tape record. In
represent all relevant subgroups. On the other addition, the recorder should keep detailed notes
hand, natural groupings, such as women who on the body language and social processes in the
meet every day around a central water tap, may group, as well as brief notes on what was said.
also be used, as they represent a specific target
group. Participant observation
The site selected for the discussion should be This particular style of data collection requires re-
private and comfortable, free from disturbance, searchers to involve themselves directly in the
and at a convenient spot for the members of the lives and worlds of those being studied and expe-
group. The optimal time span for a group discus- rience their reality. In this process, the researcher
sion is between 60 and 90 minutes, but the gains a greater understanding of the context. The
demands of the study need to be taken into method is based on the assumption that under-
account. It may also be possible to organize more standing of the inner perspectives of subjects can
than one meeting of the group. only be achieved by actively participating in the
As with the depth interviews, the facilitator subjects’ world and gaining insight by means of
should work from a pre-established list of points observation. This makes it potentially the most
or schedule, which can be used to ensure that all powerful tool for developing an understanding of
important points are covered. As with the depth the experience and meanings attached to certain
interviews, the schedule should not be allowed to behaviours and phenomena. Its major disadvan-
limit discussion. tage is its time-consuming nature.
Prior to starting the group interview, it is im- The method is not commonly used in health
portant for the facilitator to open the discussion research, but it does have potentially important
by welcoming the group and getting everybody applications. Some such applications include
introduced, providing an overview of the topic, observations of how the flow of patients through a
outlining the ground rules of the discussion, get- clinic is working (health services setting), an
ting permission for the use of the tape recorder, assessment of how well safety regulations are
and explaining the role of the observer or re- adhered to, and how the workers feel about these
178
19 = qualitative methodology: an introduction
(occupational health setting), and in investig- details of the future research plans given. The
ations of infectious diseases (such as AIDS), it researcher needs to be clear about when to leave
may determine indicators of risky patterns of the field by either setting a date or a point in terms
sexual behaviour. The data collection covers a of the quantity of data obtained, as it is tempting
range of activities, including observing, listening, to stay in the field indefinitely.
asking questions, and recording, which can be
done either covertly or overtly. Other methods
The first stage in a participant observation is to There are a number of other qualitative methods
select and get access to the site of the research. which are available to the researcher, but are less
The researcher needs to be free to move around commonly used.
easily and participate in the activities of the Researchers often keep research diaries to
community or group, while preferably remaining supplement the data collected. Research diaries
relatively inconspicuous. The process of gaining utifize valuable information picked up while the
access is vitally important, as it determines in researcher is interacting in the research setting.
many ways how responsive the community or The use of documentary evidence includes the
group is likely to be to the researcher. The nature analysis of photographs, tapes, life histories, and
of the research should be explained to the re- old documents. Other more experiential methods
search subjects as far as possible, especially how would include activities such as role play, project-
the information will be used. Where possible, ive techniques and visualization exercises. While
access should not only be negotiated with those all the methods of data collection discussed in
in authority, but also with individuals at different this chapter are often used individually, they may
levels, so that a broader understanding can be also be used in combination. The different meth-
achieved. A number of techniques to facilitate the ods are used to assess different types of inform-
process of gaining access are covered in the refer- ation and to back up findings found in one of the
enced texts. other research methods.
Once in the field, the researcher has to decide
to what extent to participate in the activities of the
community. Some researchers merely act as Other methodological issues
observers and try to maintain a distance from the
activities of the community, while others seek to Sampling
get involved as much as possible. This is both a The usual epidemiological practice of formal
practical and an ethical decision. random sampling with ‘concern for adequate
Collecting field notes can be difficult. While sample size is inappropriate in qualitative stud-
it will be impossible to record everything that ies. The depth of the data gathered is the primary
happens, it is vital that very detailed data record- goal, rather than statistical inference. Within
ing takes place. This can be done in a range of qualitative methods, greater emphasis is placed
ways, most commonly field notes, audio taping, on purposive sampling. This is a sampling
and video taping. Field notes should consist of method in which the researcher deliberately
relatively concrete, complete, accurate, and chooses respondents or settings in order to en-
detailed descriptions of social processes and their sure that the sample covers the full range of pos-
contexts. sible characteristics. In this approach to sam-
One particular issue concerning field notes is pling, the researcher purposively tries to obtain a
that the subjects being observed may be threat- sample that represents all important subgroups
ened by the note-taking, and this could in turn of the population, by targeting specific sectors.
compromise the research. In such instances, only This includes sampling deviant or extreme cases,
brief notes should be made as unobtrusively as typical cases, critical cases, politically important
possible and the fuller version should be done ata or sensitive cases, and sampling to maximize the
later stage from memory, when the researcher is variation. It is hoped that the diversity of the
alone. target group, as well as what is typical can be
Finally, when leaving the field, the researcher covered in this way. Haphazardly selecting
must be aware of the powerful social connections people for convenience is discouraged, as the
made in this type of research, and the effect that results can be heavily biased (Patton 1980).
the research process has had on the community. Sample size is often determined by logistical
Full and proper goodbyes need to be said and issues of available time and funding. Two poten-
179
Date pile, Te IEE A SR Re re eS
tial ways of defining sample size in the field are sions could be that they are unerotic, that they are
firstly to continue in the field until the researcher embarrassing to use, or that they are difficult to
has enough information to answer the initial re- obtain. :
search questions, or until no more or very little As the data collection continues and the codes
new information is being gained from successive become more complex, there is a need to start
interviews. The second method is to use the initial summarizing and to begin making some sense of
interviews to establish hypotheses and use suc- the data. The use of theoretical memos is import-
cessive interviews to validate or adjust these ant and should begin early on. These are brief
hypotheses. Interviews will continue until the notes of potential hypotheses or observations
hypotheses require no further adaptations in that can be made while coding and examining the
terms of the new information being gathered, and data. All notes should be filed away to be used as
the researcher feels that they adequately explain a resource later on when doing the final inter-
what is being researched. pretation of the data. During the project, the
notes will become more complex and closer to
Analysis of qualitative data the final interpretations that will need to be
The development of methods for analysis has made. From all of the above, the key issues and
lagged behind the development of data collec- important factors need to be drawn out and the
tion, but some important texts have appeared final interpretations stated. It is important to
(Miles and Huberman 1984; Strauss 1987). It is relate the final interpretations to existing theories
important, however, that the researcher does not about the subject to obtain greater depth and
blindly follow others’ practices, but adapts the understanding.
analytic procedures to the demands of the situ- A number of techniques may be used to verify
ation and the data. While there are no fixed rules and interpret results. These include the triangu-
for the analysis of qualitative data, there are some lation method, in which the results using differ-
key aspects that need to be noted. ent methods are compared to see if they comple-
The general term often used for the process is ment each other. Another method is to set up ini-
content analysis, which basically means that the tial hypotheses and then to search the data for
data is explored in detail for common themes and disconfirmatory results. If any such results are |
these are then established into units of meaning found, then the hypotheses may need .to be
or codes. At a later stage, these codes become the adapted to take them into account. These help to
basis of further analysis in terms of their content make qualitative results generalizable. Validity
and meaning. Although this is costly and time- and reliability can be further enhanced by com-
consuming, taped material should be typed or paring results between interviewers, by getting a
written out (transcribed), as the written word is number of people to analyse the same sections of
the basic medium for analysis. The task of tran- the same material, or by relating the results to the
scribing should not be underestimated. Depend- theory on which the research was based.
ing on the quality of the tape recording, it can take
four to eight hours to transcribe a one-hour tape. Training
Every single word that is said by both the inter- This chapter is a brief overview of qualitative
viewer and the interviewee has to be written methods and cannot be considered a training
down. From early on in the data collection pro- guide. The references for this chapter will give
cess, coding should begin so that the key areas some additional guidelines on theory and tech-
can be drawn out to re-inform the data collection niques, but do need to be backed up with experi-
process. ential learning and support. Initial training can
Coding consists of searching the data for often be passed on via a short set of workshops by
common themes which can be established as . someone who is skilled in this area. It is only in
categories into which later information can be the process of working with others that the tech-
inserted. Over time, this coding should move niques can really be known and mastered. Key to
from the original crude codes to becoming more this is the fact that the researcher as a person is
complex. At this point, codes should start to over- the research instrument for qualitative methods.
lap, so more complex codes may need to be devel- Just as the questionnaire is the instrument in a
oped and subdimensions of these codes created. survey, the researcher is the means by which in-
For example, if the general code concerns atti- formation is collected in qualitative techniques.
tudes towards the use of condoms, the subdimen- For this reason, a lot of hard work needs to go into
180
getting yourself ready for the task, even though ion of skills. Epidemiology has traditionally
many skills are simply extensions of existing focused on the use of quantitative data, which
social skills. Short courses on qualitative methods allows for focused projects and the testing of pre-
are offered by most research training institutions. set hypotheses. The qualitative method provides
Additional suggestions. for training include the research opportunities which extend the amount
establishment of study groups within institutions of information that may be gathered, particularly
to learn about the methods. Another would be to as it provides a greater level of understanding of
work alongside more senior and experienced the processes that affect the results we seek in the
researchers. health field. Interest in the area is increasing, but
it remains an undeveloped field in epidemiology.
Advantages and disadvantages of
qualitative methods
Advantages of qualitative methods
@ Since qualitative research is done within EXAMPLE 19.1
people’s contexts, it can produce results that Depth interviews to evaluate a
directly represent how people feel. During this community health worker project
process, the researcher will also get a closer
feeling of the general social functioning of the Depth interviews were used as one of the research
person or the community. methods for evaluating a community health
@ The results obtained are often more accessible, worker (CHW) project in Khayelitsha near Cape
as they are descriptions of real situations rather Town. A sample of 10 community members who
than statistical measures or diagrams, which had been visited by a CHW during the previous two
are often beyond people’s experience. weeks were interviewed in their own homes. Inter-
@ Innewareas of research, qualitative methods viewing techniques such as clarifications and
can be useful for the production of new ideas. reflective summaries were used to allow respond-
They can also provide information for other ents to explore how they felt about the visit by the
studies. CHW. The interview schedule was built around a
@ Ifnew information appears during the research single main question: ‘Please tell me about the last
precess, there is space within the research visit of the CHW.’ Prompts were prepared to elicit
structure to explore it. information that was not volunteered, for instance,
The ‘Hawthorn effect’ is used to describe the the purpose of the CHW visit, how the respondent
effect of being watched or observed: people felt about the visit, the health problems experi-
often change their behaviour when they think enced by the respondent, whether the CHW asked
they are being observed. In qualitative re- about these problems, and the information or ser-
search, a skilled researcher will be able to use vice rendered by the CHW. With the permission of
the greater flexibility of the qualitative method the respondents, the interviews were tape-
to get around this problem by using probes and recorded.
picking up on non-verbal cues. On analysis it was found that community mem-
bers had volunteered examples of social and per-
Disadvantages of qualitative methods sonal issues that they had discussed with CHWs.
@ There are often problems with the analysis of Comments like ‘I feel very happy to talk to Nom-
the information, particularly due to researcher philo about my problems and my secrets that are
subjectivity and bias. worrying me. To talk to her gives me great relief’
@ The process of transcribing and analysing data and ‘Nomphilo is like a sister to me’ enabled re-
can be time-consuming and costly. searchers to understand the texture of the rela-
@ The material acquired may also not be easily gen- tionship between the community members and
eralizable, due to the small sample size, and the the CHWs.
non-random sampling technique often used.
Reference: :
Matthews C, Van der Walt H, Barron P. ‘A shotgun marriage:
Conclusion Qualitative evaluation of a community health worker project
Qualitative research methods are a useful and in Khayelitsha.’ South African Medical Journal 1994;
valuable addition to an epidemiologist’s collect- 84(10):659-63.
181
EXAMPLE 19.2 eight patients with tuberculosis were asked what

TB photonovel it was like for them to live with the disease.
There is a rising incidence of tuberculosis in the The results showed that patients felt stigmat-
Western Cape and one of the most serious prob- ized and they were concerned about infecting
lems is that only about 60% of patients complete their relatives and friends. They also spoke about
their treatment. During 1991, a project was started the difficulty of continuing with directly observed
with the aim of developing effective materials for therapy once their symptoms disappeared. This
patient education (see Table 19.1). During the information was used as the basis for a photo
first stage of the project, nurses who work in the novel targeted at TB patients and their relatives
TB service used focus groups to discover how pa- (Figure 19.1). The photo novel was pre-tested in
tients experience their illness. Groups of six to focus groups with TB patients. They were asked to
Table 19.1 TB case holding qualitative research process
PHASE WHO ACTIVITIES LEARNING

PARTICIPATED OPPORTUNITIES
1 Problem Committee of health Discuss problem and

Identification of service managers and __ possible strategies
high incidence of TB researchers
2 Problem exploration Core team (4) — Literature survey

Non-adherence to TB — Analyse patient
treatment : education
3. Data gathering Clinic nurses (7) Focus group discussion Skills in focus group |
‘How do patients Data analysis discussions. Listening
experience TB?’ gives new insight
Groups of patients Tell nurse about Affirmative of own

experience experience
4 Data gathering Coreteam(1) . Depth interview Deeper understanding

Life story of TB
patient TB patient (1) Tells life story Her story can help others
5 Useofdata Core team Script writing Media skills

Development of
photo novel Clinic nurses and Advice on script More insight into life of
domestic workers - Act as models TB patients
Neighbours at ‘shoot’ Contribute ‘props’ Awareness of TB
6 Pre-testing Core team Focus group Confirmation.

‘What do you think : discussion It works well
of the photo novel?’
TB patients Feedback on photo Others share similar
novel experiences. Their
experiences count.
Were recorded for
others to see
182
re wR Ut mC it oD
COUGH A LOT ? YOU SWEs
Suster, ek is altyd so moeg )

en ek skel heeldag my 4& pan
J
man en kinders. :
bYOU] MAYEHAVEST. Bo) TU GLY Cia wae
VT YOUR CLINE G8 DOCTOR aut youd CL 08 DOCTOR
AS SOON AS POSE ad SOOM AS MORALE
.
Jy is verniet bang vir aansteek, as

Sarah! Ek is op volle TB behandeling
en nou kan ek geen kieme versprei
HII
ALIVITINILIILIIN
a F
es a a eS SS Se SSS eel
ve oe ot
first read the booklet and then say what they walked past him all the time, and a student nurse
thought and felt about the booklet. came to his bed, read his bed letter, paged
Their response was enthusiastic and they through his file, and apparently checked the in-
strongly identified with the booklet. Some of the formation against the computer printout in her
patients said ‘This could have been my life’. This hand.
feedback convinced the management of the TB 08:42 Two student nurses were standing next to
Control Programme that the booklet fulfilled a the patient’s bed, talking to each other. A staff
patient need and they decided to use it as one of nurse and student nurse came to tidy up the
the means to support and encourage patients to patient’s bed. As they moved him around, he
adhere to their treatment. groaned at times. Although they spoke to each
other, they never addressed him.
Reference: 09:11 A student nurse came to take the patient’s
Van der Walt H. ‘The role of participatory research to blood pressure. The patient was lying down with
enhance patient adherence to tuberculosis treatment’ in his blankets pulled over his head. The student
Proceedings of the International Symposium on Participatory nurse pulled the blankets away, took his arm and
Research in Health Promotion. Compiled by Korrie de started to take his blood pressure. She was speak-
Koning, Liverpool School of Tropical Medicine, August 1994. ing in Xhosa to the staff nurse who was at the
opposite bed, taking patients’ temperatures.
When she had finished, she made a note of the
EXAMPLE 19.3 blood pressure in the patient’s file, washed her
hands, and walked on.
An intervention study to determine 09:15 The staff nurse reached the patient’s bed-
levels of care in a hospital side and said: ‘I must take your temperature.’ She
During an intervention study which focused on shook the thermometer and put it in the patient’s
the levels of care in a large tertiary urban hospital, mouth, saying ‘Okay, close your mouth.’ She
a medical anthropologist used participatory stood staring out of the window as she stood
observation to examine the communication be- holding his wrist but said nothing further. She
tween nurses and doctors, and between health took the thermometer out and the patient asked
care workers and patients. The researcher and her ‘How much is it?’ ‘Thirty seven,’ she answered.
research assistant alternated on the daily 07:00 to ‘Thirty seven?’ the patient repeated. The staff
16:00 shifts between the intervention and control nurse smiled, wrote the temperature down on the
wards. Their observations included routine activ- chart and walked away.
ities of nursing and medical staff, ward rounds, 09:20 A second student nurse came to the
and meetings related to the intervention. In addi- patient, apparently wanting to take his blood
tion, the researchers used individual and group pressure. ‘Hulle het dit al gevat’ (‘They have taken
interviews. The data gave a description of the it already’) he said. The nurse ignored him, took
organizational culture on the wards; for instance, his arm, and took his blood pressure again. After-
the differences in autonomy and authority be- wards she stood staring at his file for a long time,
tween nurses and doctors, gender and class dif- and then wrote something in it and walked away.
ferences between doctors and nurses, verbal and During those two hours the patient received
nonverbal communication between health care two orders, asked one question, and gave a nurse
workers, and between health care workers and information that an observation had already been
patients. done. She chose to ignore him.
The following example is a description of an
observation which illustrates a typical interaction Reference:
between nurse and patient. Gibson, D. ‘Communication between health care workers
One patient was observed for about two con- per se and between health care workers and patients: Level
secutive hours from 07:15 when the patient was of Care Intervention Study, Qualitative findings.’ Unpub-
lying in bed, groaning from time to time. Nurses lished report, MRC, 1994.
184
SECTION
Epidemiology applied tto

content areas
Mortality Studies:
_ D Bradshaw
Outbreak investigations
SMetcalf, N Coetzee —
2 Estimating amunieation esverage
N Coetzee, D Berry, D Yach
Environmenta epidemiology:
Y von Schirnding .
Occupational epidemiology
JMyers .
Disability studies
J Katzenellenbogen
Psychiatric epidemiology
C Parry, L Swartz
Anthropometric studies
G Joubert
Assessment of dietary intake
M Langenhoven, P Wolmarans
Measurements in dental epidemiology
M Moola
Health economics
D McMurchy, E Thomas
20 Mortality studies
Introduction high infant and child mortality, low adolescent

and young adult mortality, and increasing rates
When placed in the context of public health, in-
with increasing age. This pattern can be seen in
formation about the number of deaths and their
Figure 20.1 which shows, as.an example, the male
underlying causes can be used to develop strat-
mortality rates due to the ill-defined causes for
egies for improving lifespan and quality of life. Al-
though death provides a negative image of health, each race group. Male rates tend to be higher than
female rates.
the difficulties of actually measuring health or
When comparing the mortality rates experienced
even disease make mortality rates very useful
by different populations, it is therefore essential
indicators. Routinely collected death information
or data collected from surveys and follow-up to take account of any differences in the age and
sex structure of the populations. A comparison of
studies can be used for both descriptive and ana-
the crude mortality rates might reflect differences
lytic purposes. In the most basic type of study, the
in the age and sex distribution of the populations,
level. of mortality is assessed for a particular
rather than differences in the actual levels of
population or study group. In more sophisticated
mortality. The hypothetical example in Appendix
analyses, geographic variations, trends over time,
1 (see p. 273) is an illustration of how the older
seasonal or occupational variations, or other fac-
group has a higher crude mortality rate, despite
tors related to mortality are investigated to gain
having consistently lower mortality rates for each
an understanding of the risk factors associated
age group. The crude death rate in Group B is 180
with mortality. The terms death rates and mortal-
per 100 000 compared to the death rate of 155 per
ity rates are used interchangeably in this text.
100 000 in Group A. However, when the death
It is strongly recommended that readers study
rates for each age group are compared, it is
Chapter 2: Key concepts in epidemiology before
noticed that the rates in Group B are actually
reading this chapter.
lower than those in Group A. The age distribu-
tions of the two populations differ, with popula-
Issues in mortality studies tion B having a higher proportion in the older age
Determining the level of mortality categories. The crude rates consequently reflect
the difference in the age structures. When com-
Summary rates paring mortality in two populations, mortality
In order to measure the level of mortality, one rates for each of the different age and sex groups
needs to calculate a rate by taking the number of of the populations should be compared whenever
deaths relative to the population over a defined possible. These are known as age- and sex-
period of time. Mortality rates can only be calcu- specific rates.
lated if the death data as well as the population When there are many age and sex categories,
figures are reliable. the comparison will become very complex and
In general, mortality levels vary with age, and difficult to assimilate. In this case, it will be neces-
the rates typically follow a ‘bath-tub’ shape, with sary to use a summary index which removes the
187
e epidemiology applied to content areas
rte we Meester CR me Cm Ce eT dy)

African males for the period 1984-86
- - White
— Coloured
-- Asian
— Urban Black
Oo
Oo
Oo
w
~
@
—
SO
fee
35 - 44 45 - 54 55 - 64 65 + Age
effect of the age and sex differences. The results population. The SMR for Group A compared to
are known as standardized rates. A ‘directly’ stan- Group B as a standard would be 1,35. The
dardized rate is the average rate of the study observed number of deaths was 620, and we would
population that would occur if the study popula- expect 458 deaths if the age-specific rates from
tion were to have the age and sex distribution of Group B occurred in the population of Group A.
the selected standard population. In the hypothet- It can be seen from Appendix 1 that the value of
ical example in Appendix 1, the age-standardized a relative index depends on the method used to
rate for Group B is 150 per 100 000, compared to calculate it. While the nature of the relationship
190 per 100 000 in Group A. Calculating the age- will remain the same, the magnitude also de-
standardized rates allows one to compare the two pends on what population was selected as the
summaty rates after having removed the effect of standard. It is crucial to report details of the stan-
the age differences in the two populations. dard population and the method of standardiza-
tion when reporting standardized rates.
Relative rates Directly standardized rates have the advantage
A relative index that compares the mortality in two that when the same standard population has
populations can be calculated. The comparative been used for the standardization, the rates may
mortality factor (CMF) is the ratio of the ‘directly’ be compared to one another. This is because the
standardized rate of the one population divided by age and sex distribution for each CMF has been
that of the other population. In Appendix 1, the adjusted in the identical way. Unfortunately,
CMF when comparing group A to Group B is 127%. when the same standard population is used for
The age-adjusted mortality in GroupAis 1,27 times indirect standardization, the SMRs are not com-
higher than the age-adjusted mortality in Group B. parable to one another, as the age and sex distri-
On the other hand, the standardized mortality butions have been adjusted differently for each
ratio (SMR) is based on ‘indirect’ standardization SMR. However, the sampling error for the SMR is
and is used when the category-specific rates (for generally smaller than the sampling error in the
example, age- and sex-specific rates) cannot be case of the CMF, because only the total number of
calculated. In Appendix 1, the SMR is the number deaths need to be estimated, compared to all the
of observed deaths divided by the number that specific rates in the case of the CMF. The variabil-
would be expected if the category-specific rates of ity of an SMR will therefore be smaller than that of
the standard population occurred in the study a CMF, which makes it a useful index when the
188
20 = mortality studies
numbers of deaths in categories are relatively which occurred a long time ago). Since popula-
small. Since accurate data on the age distribution tions are often very mobile, it is also unclear what
of the population is often not available in South groups the respondents in a survey represent.
Africa, the SMR is generally used to analyse data. In another indirect method, the McCrae and
Brass method, women are asked at the time of
Key mortality rates used as indicators of health delivery about the outcome of their previous
The infant mortality rate (IMR) is a particularly delivery. This clinic-based approach is much
valuable age-specific mortality indicator. It is de- more practicable than a field survey. However,
fined as the number of deaths of children under the potential biases in this method are unknown
the age of one year divided by the number of in South Africa.
births in that year. It is used not only as an indic- Although the infant mortality rate is widely
ator of the health status of children under one used, many demographers are now proposing
year, but is also used to reflect the health status of that a childhood mortality rate, which reflects the
the whole population. Clearly, it is easier to moni- chance of dying over a few years, would be a better
tor this rate than the mortality rates over all ages. index to estimate, as it is less subject to error than
However, in South Africa there is insufficient in- the infant mortality rate. The childhood mortality
formation to calculate the IMR for all groups. rate usually considers children up to the age of
Many births and deaths are not registered. Rou- five years and is known as the Under 5 Mortality
tinely available data can thus not be used to cal- Rate (U5MR). It is also important to focus on the
culate nationwide IMRs. U5MR rather than the IMR as the burden of high
For this reason, surveys are conducted to col- risk of death shifts from infants to toddlers.
lect this information. Information is collected In South Africa, the importance of adult mortal-
from women included in the survey on the ity is increasingly becoming apparent. The 45Q15
number of children they have had in the past five is a very useful summary index of adult mortality.
(or two or 10) years, and of those, how many have Based on a life table calculation, it is the probabil-
died. This data is grouped into a cohort of births ity of a 15-year-old individual dying before reach-
and the probability of dying in each time-period is ing the age of 60. In 1985, it was estimated that
calculated. This is known asalife table analysis, 40% of 15-year-old South African males and 23%
and is used to estimate the IMR or the childhood of 15-year-old South African females would die
mortality rate (the probability of a child dying before the age of 60. These rates of adult mortality
before the age of five). Surveys can also be done to are extremely high, and are comparable with
collect information which can be used to estimate those in the rest of sub-Saharan Africa.
the IMR through indirect demographic tech-
niques. These do not collect the number of births Determining cause of death
and infant deaths that occur during the study In the health context, it is important to investigate
period, as these are subject to various biases when the causes of death. Unfortunately, the death
collected directly, and require very large samples. certificate used in South Africa in the 1980s and
Instead, these surveys depend on the relationship 1990s does not facilitate the systematic collection
between some more easily obtained data and the of information needed to determine the under-
desired mortality indicator. Indirect methods lying cause of death. No distinction is made on
provide estimates of IMR which relate to a prior the certificate between the direct cause of death
period. The most commonly used example of an and the underlying cause, which means that the
indirect method, the Brass method, is to survey a certifiers have to use their discretion as to which
sample of women and ask them how many chil- cause they report. Causes are coded according to
dren they have ever had, and how many of those the International Classification of Diseases. The
children have died. The proportion of those dead, underlying cause of death is of public health im-
according to the age of mother, can be converted portance, since it is the cause at which prevention
into an infant mortality rate on the basis of model should be aimed.
life tables. The advantage of the Brass method is If the size and age distribution of the popula-
that it does not require the dates of birth or death tion is known, an analysis of the cause of death
of the children, but relies rather on the mother’s can be based on age- and cause-specific rates, or
age. Both these survey methods have problems age-adjusted rates by cause. In the case of the
with recall bias and proxy reporting (some house- denominator (that is, population figures) not
hold members may not be aware of infant deaths being known or being unreliable, it is possible to
189
assess the relative importance of different causes The disability adjusted life year (DALY) is a
of death by considering the proportional mortal- newly developed index that combines the years
ity. lived with a disability (weighted according to the
As in the case of mortality rates, the cause of severity) with the years of life lost due to premature
death pattern is influenced by age and sex. For death. This index uses data from morbidity and dis-
comparative purposes, it might be necessary to ability surveys in conjunction with mortality data.
calculate the proportion of deaths due to a part-
icular cause within each age and sex group. When Determining geographic variations
there are numerous age/sex categories, a method The geographic variation in mortality is often
of summarizing the information is necessary. One investigated to identify factors related to the
index is the Proportional Mortality Ratio (PMR). disease. The magisterial district of residence of
This is calculated by taking the total number of each person who dies is available from routinely
observed deaths due to a particular cause, and collected mortality data. This allows rates over the
dividing it by the total number of deaths expected whole country to be calculated at the level of
due to that cause if the proportion for each magisterial district. An example of such a study
age/sex group is that of the standard population. indicated that the rates of certain cancers were
Sometimes it is difficult to interpret the PMR. For unusually high in the areas around Prieska in the
example, if the PMR is high for a particular cause Northern Cape, where asbestos is found, thus
of death, it is not possible to distinguish whether implicating asbestos as a cause of the cancer
the death rate due to this cause of death is high or (Botha, Irwig, Strebel 1986).
the rates for the other causes are low, making the Indirect standardization is often incorrectly
proportion appear high. applied in such analyses. If the population data is
The potential years of life lost (PYLL) is also a available, it would be better to use direct standard-
summary index. This index does not attempt to ization, as the SMR for one magisterial district is
standardize for age, but actually utilizes the differ- not comparable to the SMR for another. Further-
ences in the age structure of the deaths. The total more, it must be borne in mind that the coding of
number of years of life lost by a population can be magisterial district information might not be con-
calculated by totalling the years that each death is sistent; for example, some deaths of Wynberg resi-
‘premature’ (for example, the number of years dents have been coded as occurring in Cape Town.
that each death occurs before the age of 65, or any
other selected age cut-off point). In order to calcu- Determining trends over time
late the total PYLL, the number of deaths at a par- It is important to assess whether mortality levels
ticular age is multiplied by the difference between change over time. The first step in such an ana-
that age and the selected age cut-off. In South lysis is to examine graphs showing the mortality
Africa, it is reasonable to consider 65 as a suitable rates over time. If possible, age-specific rates or
cut-off age, as this is close to the life expectancy at standardized rates should be investigated to
birth. Mathematically, the PYLL is expressed as check that apparent changes in mortality rates are
64 not due to changes in the age structure over time.
PYLL => d,(65-x), where d, is the number of Cervical cancer mortality trends in South Africa
from 1949 to 1990 are shown in Figure 20.2 (Bailie,
deaths occurring at age x. Selvey, Bourne, Bradshaw 1996). The world age
standard was used as standard population. It can
The total potential years of life lost can be divided be seen that the age-standardized rate for white
according to the underlying cause of death, and women declined after the mid-1960s, while that
the relative importance of different causes of of coloured women rose. The rate for Asian
death can be measured by taking the potential women fluctuated from year to year because of
years of life lost for a particular cause relative to the relatively small number of deaths each year.
the total potential years of life lost. In this index, it However, it also shows a downward trend from
is clear that the deaths that occur in younger age 1970. The data for African women was not avail-
groups are given greater weighting than the able for this period.
deaths that occur in older age groups. This gives Many factors are related to changes in cervical
the PYLL particular importance in public health, cancer mortality, but the introduction of cytolo-
where there might be more incentive to reduce gical screening (PAP smears) during the 1960s
premature mortality. and 1970s has had an impact in many developing
190
srw erm A eee er mac Lae Cr
000
/1Mortality
rate
1949 1954 1959 1964 1969 1974 1979 1984
—+ Coloured —A— Asian
Sra we me LB birth cohorts — whites
Birth cohorts
-¥— 1951= 1960
1941 - 1950
1931 - 1940
1921 - 1930
1911 - 1920
000
/1Mortality
rate 1901 - 1910
1891 - 1900
1881 - 1890
F
OF
koe
+ 1871 - 1880
Age of death
191
Figure 20.4 Cancer of the cervix: mortality rates for birth cohorts — coloureds
Birth cohorts
“7 1951
-1960
°So “?~ 1941 - 1950
S
a
2
wo
~ 4931
-1940
= O 1921 - 1930
os
4c
oO
=| AX 1911 - 1920
-@ 1901
-1910
1891 - 1900
KZ
ee: a if | I L 1 L
20 25 30 35 49 45 50 59 60 65 70
Age of death
countries. A cohort analysis of the South African One method of summarizing cohort mortality
data is shown in Figures 20.3 and 20.4. These information is to use the life table technique. For
show that the pattern of mortality in successive each age, the probability of dying is calculated by
birth cohorts for white women has dropped and is taking the number of deaths in that age group
consistent with a reduction in mortality following relative to the average number of individuals in
the advent of cytological screening in the 1960s. that age group. From this cohort life table, the
The same trend is not evident in the birth cohort expectation of life at any age can easily be calcu-
analysis of the mortality for coloured women, lated. The expectation of life at birth is commonly
where there is no evidence ofa drop. used as a summary index to describe the overall
When analysing the trends in mortality rates, it is mortality pattern.
important to distinguish between a cohort effect It is important to be aware of possible changes
— the effect of mortality on an entire population in coding practices which might account for
born at the same time — andaperiod effect, changes in the mortality level (Example 20.3). The
which is a change in mortality patterns over time. type of analysis will depend on the variability in
Statistical models can be used to extract the three the data and the patterns seen. For example, the
components from trend data: age, period, and analysis of the trend in diarrhoea will require the
cohort effects. seasonal variation (high summer rates, low winter
Another way of analysing mortality data is to rates) to be taken into account through a special
select a cohort at birth and follow their mortality statistical procedure called time series analysis.
pattern through life. This is often used in the form Comparisons in proportions over time are very
of a historical cohort analysis, where historical difficult to interpret, as apparent changes in one
records are investigated to retrieve mortality cause might be due to changes in a different
information. Records might be those routinely cause. For example, the proportion of deaths due
available through the registration system, or might to circulatory diseases in white males dropped
be obtained from less conventional sources such between 1970 and 1984, but this could be due to
as church records, immigration records, or special the increase in injury-related deaths rather than
interest collections. Data collected in this way to a decrease in circulatory diseases.
needs to be assessed carefully for completeness.
192
Ecologic studies EXAMPLE 20.1

Mortality statistics from different areas are often The estimated cause of death profiles
combined in an effort to understand the relation- for 1990: proportional mortality and
ship between a particular factor and the mortality
levels in these areas. These are known as ecologic potential years of life lost (PYLL)
studies. A scattergram or bivariate plot, compar- An analysis of the 1990 RSA death data obtained
ing the mortality rates against some other factor, from Central Statistical Services indicated exten-
usually indicates whether there is an underlying sive under-reporting of deaths. Comparing the
relationship or not. It is unlikely that a relation- number of deaths in each age group against the
ship will be demonstrated unless the mortality census, it was estimated that only 55% of African
rates as well as the factor being investigated dis- deaths were registered. Adjusting the African
play a wide variation. Genuine relationships do, deaths by this factor, it was estimated that there
however, tend to be masked by various biases in wére 199 537 deaths in the whole of South Africa
the data. For example, the quality of mortality in- that year, which accounted for a total loss of 4,4
formation varies from region to region. Similarly, million years of life. The death of a person younger
the ability to measure the exposure of interest than 65 years was considered to be premature.
may vary. Ecological studies are becoming in- The estimated proportional mortality for the
creasingly important with the growing awareness top 20 causes is shown in Figure 20.5 (Bradshaw,
of the effects of environmental agents on health. Laubscher, Schneider 1995). The ill-defined cat-
Although they can contribute towards an under- egory accounts for the highest proportion of 23%.
standing of the etiology of a disease, it is import- Other respiratory conditions, which include
ant to realise the limitations of these studies. bronchitis and chronic obstructive diseases,
There is an ‘ecological fallacy’ in assuming that a account for 9,6%, and cerebrovascular disease
population relationship between a certain expo- (stroke) accounts for 7,4% of the deaths. The over-
sure and some mortality implies that individuals all death profile in South Africa reflects a com-
who are exposed to the factor are at higher risk of bination of poverty-related diseases, chronic
dying. In order to draw such a conclusion, it is disease related to a Western lifestyle, and trauma.
necessary to undertake individual-based studies. The relative impact of the causes of death based
on the PYLL gives more weight to the deaths
Mortality information in which occur at younger ages than those at older
ages. For example, it can be seen from Figure 20.6
South Africa that conditions originating in the peri-natal
Aside from the problem created by under-report- period contribute the largest proportion of the
ing and misclassification of mortality inform- total PYLL (20,3%). Implicit in any index is the
ation, South African mortality data does not con- ‘value of life’ at different ages. The PYLLs are
tain useful socio-economic variables. It is there- important from a public health point of view as
fore not possible to analyse mortality data they can be used to identify the causes that would
according to socio-economic class. Information have the greatest impact on the premature loss of
on race was previously used as a proxy measure life. The profile suggests that public health prior-
for socio-economic class, thus providing valuable ities in South Africa include infant mortality,
insights into mortality differences. However, as infectious diseases, violence, TB, nutritional defi-
from 1991, demographic information on regis- ciencies and chronic respiratory conditions.
tered deaths no longer includes race. Adequate
measures of socio-economic class (for example,
usual occupation) should be included in mortal- EXAMPLE 20.2.
ity and census information, as well as race group. Using the indirect method of estimat-
Despite these limitations, careful analyses and ing the infant mortality rate in Hewu,
interpretation of mortality data can provide
extremely valuable health care and epidemi- Eastern Cape
ological information. Routine collection and A survey was conducted in the Hewu district of
analysis of mortality data should become a cor- Eastern Cape during May 1986 to obtain an esti-
nerstone of monitoring health status in the newly mate of the infant mortality rate (IMR), as it was
developing Health Information System. known that the death and birth registers were in-
complete. Some 5 102 households were included in
193
Sra tee CUM Cyemt LLL South Africa (1990)
(Total — 199 537 deaths)

percent
Sign/symptom ill-defined 23,33
Other respiratory 9,58
Cerebrovascular 7,35
Certain perinatal conditions 6,22
Ischaemic heart 4,86
Pulmonary circulatory 4,76
TB 4,51
Digestive cancer 3,99
Intestinal infection 3,85
Other violence 3,82
Other digestive 2,91
Endocrine/metabolism 2,90
Respiratory cancer 2,02
Genitourinary cancer 1,84
Nervous system 1,82
Other bacterial tts
Urinary system 1,52
Hypertensive 1,50
Homicide 1,49
Nutritional deficiency 1,19
10 20 30 percent
Sra em CC Ae mC Africa (1990)
(Total — 4 423 907 PYLL)

percent
Certain perinatal conditions
Sign/symptom ill-defined 14,32
Intestinal infection 10,75
Other respiratory 8,74
Other violence 5,94
TB 4,32
Nutritional deficiency 3,37
Nervous system 2,83
Pulmonary circulatory 2,61
Cerebrovascular 2.31
Homicide 2,28
Other digestive 2ohe
Other bacterial Zale
Congenital anomaly 1,79
Transport accidents 1,58
Digestive cancer 1,58
Viral 1,49
Other accidents 1,37
Endocrine/metabolism 1,16
Ischaemic heart 1,14
30 percent
194
20 mortality studies
Figure 20.7 Number of deaths per year due to ICD codes 412 and 414 ex-
pressed as a percentage of the average number for the period 1968-1984
t
eo
Asymptomatic IHD
m
ob
©
o>
=
hw
®
>
°o
®
D
f©
ad
®
=>
wo
-
o seem Coloureds
~
Cc —~-— Asians
®
2
®D
a
Chronic IHD
am 412
21S td, (OpRTO> TiALOas oe 00 aol 82 83 84

Year
the survey, and data was collected from 7 792 classification during the period 1968 to 1984, the
women of child-bearing age. From the replies to numbers of deaths for a given cause in each year
questions about their children born in the preced- for whites, coloureds, and Asians were separately
ing five years, it was estimated that 41 per 1 000 expressed as percentages of the average number
liveborn children died in the period 1981-1986. of deaths due to that cause in that population
Using information on all children ever born, group over the whole period.
Brass’s indirect demographic technique indic- The graph (Figure 20.7) of the trend in the mor-
ated that for the same period, the IMR ranged tality due to two codes related to ischaemic heart
from 80-100 per 1 000 live births. disease (IHD) shows that between 1980 and 1983
The discrepancy in the estimates could be there was a marked change in the coding practice.
attributed either to recall bias, or bias due to The sharp drop in deaths categorized as due to
proxy reporting (in the case of the survey), or to chronic IHD (ICD 412) was matched by an in-
the model life table being inappropriate (in the crease in deaths due to asymptomatic IHD (ICD
case of the indirect estimate). 414). Clearly, researchers who focus on trends
observed in mortality in one specific selected
Reference: cause, should also look at trends in all closely
Yach D, Katzenellenbogen J, Conradie H. ‘Ciskei infant related causes to ensure that observed changes
mortality study: Hewu district.’ South African Journal of are real and not due to coding changes.
Science 83:416-21.
EXAMPLE 20.3
Achange in mortality coding practice
Reference:
Bradshaw D, Botha H, Joubert G, Pretorius J P, Van WykR,
in RSA Yach D. Review of South African Mortality (1984). Medical
To evaluate the consistency of the cause of death Research Council, Technical Report No 1, Parow, 1987.
195
21 Outbreak
investigations
Introduction investigated in South Africa. Between 1960 and

« 1993, only 43 outbreak investigations were re-
The history of epidemiology is intimately linked ported in the South African Medical Journal
with the investigation of disease outbreaks. John (Coetzee, Malusi, Moeti 1993), although the
Snow, an English physician, developed the prin- number of reports of South African outbreak
ciples of outbreak investigations in his classic investigations is greater than this if other publica-
research of cholera outbreaks in 19th-century tions, such as Epidemiological Comments and
London (see Example 21.1). The terms ‘epidemic’ foreign journals, are considered.
and ‘outbreak’ are often used interchangeably. An The number of cases needed before an invest-
epidemic is the occurrence of cases of an illness in igation is warranted depends on the nature of the
a community or region at a level that is clearly in disease. In outbreaks of communicable diseases,
excess of the background incidence of disease for or of diseases which are serious in terms of
this defined group during a particular season and morbidity and/or mortality (for example plague,
time period. (Season is included as many infec- cholera, viral haemorrhagic fever), only two or
tious diseases show seasonal variation in inci- three cases can signal the need for a full outbreak
dence rates.) The term ‘outbreak’ generally refers investigation.
to an epidemic which affects fewer people, is of Although people commonly associate outbreaks
shorter duration, and is localized within a specific or epidemics with infectious disease, it is impor-
geographical area. The Centers for Disease Con- tant to consider the possibility that an outbreak
trol and Prevention of the USA define an outbreak may be due to shared exposure to non-
as ‘the occurrence of two or more cases which are infectious toxic agents (for example, outbreaks of
epidemiologically related’. organophosphate poisoning in farm workers
This was the case in the 1975 Marburg epidemic exposed to pesticides). There are several modes
that occurred in Johannesburg, in which three whereby a causative agent, whether infectious or
cases and one death were recorded. The purpose non-infectious, can be transmitted. In outbreaks
of investigating outbreaks is to identify factors where the cause is unknown, the type of disease
that are associated with the source and/or cause symptoms will usually give some clue about the
of the outbreak, and are crucial to disease control source. For example, in outbreaks where the
and prevention interventions. Properly con- symptoms are mainly gastrointestinal (such as
ducted outbreak investigations are invaluable in nausea, vomiting, or diarrhoea), the most likely
evaluating reasons for failure of control pro- mode of transmission is through ingestion of
grammes (such as immunization programmes), contaminated fluid (for example, water or milk) or
informing public health policy, and uncovering foodstuffs. In trying to determine the source of a
new risk factors (and even causal associations) in disease, it is important to bear in mind that
poorly understood diseases. animals can play a role by acting as reservoirs of
Because of limited resources andalack of field infection (for example, rats can carry leptospirosis
epidemiologists, relatively few outbreaks are and plague; several domestic and wild animals can
196
21 ~~ outbreak investigations
carry rabies), and that insects (for example, mos- These possible explanations must be consid-
quitos carrying malaria) and arthropods (for ered and excluded each time a potential outbreak
example, ticks carrying types of fever) can transmit has been detected. For example, a spurious out-
infection. The following section outlines a basic break of meningococcal meningitis occurred in a
stepwise approach to outbreak investigations. rural town when a new hospital staff member
based diagnoses on microscopy of scrapings from
Stepwise approach to outbreak skin haemorrhages in suspected cases. This diag-
nostic method is particularly inaccurate, and 70%
investigation of cases notified from this hospital turned out to
The efficient use of time is important in outbreak be conditions other than meningococcal menin-
investigations. Effective control measures should gitis (Turk 1993).
be instituted as quickly as possible to minimize Once an outbreak has been identified and
the severity and public health impact of the proved not to be merely an artifact of reporting or
outbreak. This is best achieved by teamwork and case ascertainment, a definite diagnosis of the ill-
by following a systematic problem-orientated ness must be made. This will involve taking good
approach, being logical and focused, and con- clinical histories, examining patients, and obtain-
tinually re-evaluating hypotheses as new informing laboratory test results (where appropriate and
ation comes to light. The emphasis given to the feasible) at the outset of the investigation (see
different steps below will vary depending on the Example 21.2).
extent to which the causative agent, the source of To ensure uniformity in case detection, a case
exposure, and other key determinants are known definition should be established. The case defini-
or suspected in the initial stages of the investigation should be as objective and specific as
tion. In practice, there is overlap between the possible while being easy to apply. It may com-
steps listed, and several steps are done concur- prise symptoms and signs (clinical definition)
rently, partially due to time constraints. and/or laboratory investigations (laboratory
confirmed case), depending on the disease under
Detecting outbreaks investigation and access to laboratory tests.
The most important surveillance system | is When symptoms such as diarrhoea or fever
disease notification, which is a statutory obliga- make up part of the definition, they need to be
tion on health workers to notify the health clearly defined for the purposes of the invest-
authorities of patients with certain conditions. igation (for example, ‘fever’ could be defined
However, it is likely that many outbreaks are not as a sublingual temperature of at least 38,5 °C;
detected, while others are not detected in time to ‘diarrhoea’ could be defined as the occurrence of
carry out a timely investigation because of delays three or more stools of liquid consistency per
in reporting and failure of clinicians to notify day for at least three successive days). If the out-
cases. Some diseases which have been respons- break is due to a recognized condition, reference
ible for outbreaks locally were not notifiable at should be made to published case definitions
the time (for example, whooping cough and (for example, the Morbidity and Mortality
shigellosis). Laboratory-based surveillance of Weekly Report 1990 supplement), which serve
microbial isolates and serology results are also as a basis for standardization in comparisons
important for the detection of outbreaks, particu- between different outbreaks.
larly hospital (nosocomial) outbreaks, and out- It is useful to have a broad or lenient working
breaks due to non-notifiable conditions. case definition (with high sensitivity) to identify
potential cases who need further evaluation (to
Confirming the diagnosis minimize the chance of missing cases), and to
Initial reports of disease outbreaks are sometimes have a more rigorous case definition (with high
unreliable and may provide misleading inform- specificity) for confirmed cases. It is often useful
ation about the causative agent. Apparent in- to have an intermediate type of definition of
creases in disease incidence may also result from ‘probable cases’ for suspected cases who do not
improved case detection, improved reporting, meet all the criteria for confirmed cases. For fur-
changes in the diagnostic criteria, changes in ther details on these different levels of case defini-
population size or composition, or false positive tion, see the Morbidity and Mortality Weekly
laboratory tests. Report 1990 supplement.
197
and a sense of teamwork at the start of the invest-

Table 21.1 Preliminary assess- igation. A team leader should be appointed to
CaM Re re oversee the investigation, preferably someone
with good epidemiological skills. The people
investigation involved in the field work should be relieved of all
other duties (if feasible) in order to devote their
41 Determine the extent of the outbreak: con- time and attention to the task at hand.
tact hospitals, clinics, general Precuones Local health personnel who are familiar with
and/or schools. ) the area and local population, and who will be
2 Establish the causative agent if possible, responsible for carrying out control measures,
using laboratory tests if appropriate. should also be involved. A local health official
3 Inform all relevant authorities and get per- should be assigned the task of public relations to
mission to investigate. _ handle queries from members of the community
4 Assemblea teamof investigators. and to provide information to the media.
5 Try to identify all people at risk. Published material should be consulted once a
_ 6 Obtain samples from the environment that presumptive diagnosis has been made, including
may be related to the source or transmis- reviews of the basic epidemiology of the condi-
_ sion of the agent (e.g. water or food) and ~ tion(s) being considered, and accounts of similar
arrange for laboratory analysis. outbreak investigations. The Control of Com-
_ 7 Obtain any relevant lists of persons at risk municable Diseases Manual (Benenson 1995)
of exposure, lists of food ae yeed poi- is a useful resource. It lists communicable
_soning outbreak). diseases alphabetically and summarizes key in-
8 Arrange for public relations. formation on the epidemiology and methods of
9 Read up on the epidemiology of the condi- control for each condition. (It is recommended
_tion(s) being investigated once a presumpt- that anyone likely to be involved in outbreak
_ ive diagnosis has been made. — investigations have a copy of this book in their
personal or work library.) A thorough under-
standing of the epidemiology of the disease(s)
Preparing to investigate helps investigators to ask relevant questions and
Once the presence of an epidemic or outbreak do appropriate and relevant environmental and
has been confirmed, a decision should be made laboratory investigations, based on the most
whether to do an outbreak investigation. This will likely sources and modes of spread.
depend on the seriousness and public health
importance of the condition, the likelihood that Describing the outbreak
further investigation will help in effecting appro- The cases which are first noticed in an outbreak
priate control measures, and the resources may not bea representative sample of the total
(human and other) available to devote to the population affected. It is important to look for
investigation. further cases (which meet the broad working case
Once it has been decided to launch an outbreak definition), in addition to those which alerted the
investigation, this decision should be acted on as authorities to the presence of an outbreak. This is
rapidly as possible. The steps outlined in Table because known cases may be onlya fraction of
21.1 should be carried out as soon as possible the total, and may differ from other cases in
after receiving the outbreak notification. A multi- important attributes, which could misdirect the
disciplinary team should be assembled. The team course of the investigation. This is especially
of investigators should include people with a important if initial reports are of hospitalized
range of skills such as epidemiologist(s), micro- cases, as only the most severe cases may be hos-
biologist(s), public health physicians, and envir- pitalized, and hospital cases may differ from
onmental health officer(s). A field manager, field- community cases. (For example, in measles and
workers, health inspectors, and data clerks (to whooping cough epidemics, hospitalized cases
enter data onto computers) also need to be re- tend to be younger and are less likely to be im-
cruited. As the people participating in outbreak munized than community cases.) All sources of
investigations usually come from a range of information which can be used to identify cases
health agencies and institutions, it is important to should be considered, including records of hos-
establish good communication and co-operation pital admissions, clinic attendance, general prac-
198
21 ~~ outbreak investigations
titioners, school absenteeism records, disease pital, or limited to a small geographic area, it is
notifications, and death records. A decision useful to plot the spatial location of cases on a
should be taken as to which sources to use. spot map (Figure 21.2). This helps to demonstrate
A detailed description of cases is given accord- clustering of cases by place and may provide clues
ing to ‘person, time, and place’. This summary of about the source of exposure.
cases is useful for defining who is at risk of con- The scale of the spot map will depend on the in-
tracting disease, as well as formulating hypo- vestigation and the geographical extent of the epi-
theses as to the likely sources of exposure, and demic. If the cases are associated with an institu-
modes of transmission of the disease. tion such as a school or hospital, they should be
plotted by location within the building (that is,
Person classroom or ward), and if they are confined to a
Cases should be summarized in terms of key fac- geographic area, they should be plotted on a
tors such as age, sex, occupation, ethnicity, socio- large-scale map showing features such as roads
economic status, and immunization status (as and rivers. The location of potential sources of
well as other disease-specific relevant exposures). exposure to infection, such as sources of drinking
water and ablution facilities, should be shown on
Time the spot map.
The time course of the epidemic should be stud-
ied by drawing up an epidemic curve, that is, a Ancillary environmental and microbiological
histogram with the number of cases (on the Y investigations
axis) plotted against the date of onset (on the X Depending on the nature’ of the outbreak and
axis). The shape of the epidemic curve is useful in whether or not the microbial agent responsible is
determining the nature of the source of infection. normally endemic in the population affected by
A common exposure at a single point in time the outbreak, it is often useful to do environ-
(termed a ‘point source epidemic’) results in an mental and microbiological investigations to
epidemic curve with a steep upward curve with a complement epidemiologic investigations.
well-defined peak and a relatively short time Evidence from environmental investigations
course (Figure 21.1). In contrast to this, exposures complements epidemiologic evidence, and, if one
occurring over a period of time and conditions is lucky, may confirm the source of infection. This
which are readily spread from person to person, aspect of the outbreak investigation should be
are characterized by a broader, shallower epi- overseen by someone with expertise in this kind
demic curve, consisting of a less steep upward of investigation, such as a _ microbiologist,
curve, with one or more broad peaks occurring hospital infection control officer, environmental
over a protracted period of time. The date of any scientist, or environmental health officer.
community events which could be related to the Environmental specimens of possible sources
exposure should also be shown on the X axis of of infection (for example, food, drinking water,
the epidemic curve, as this may yield important swimming water) should be taken for micro-
clues to the point at which infection was intro- biological investigation to provide evidence of the
duced into the population. This may in turn pro- source of exposure.
vide clues about the source of exposure. If you have access to microbiologists working in
The epidemic curve will also show whether the specialized research laboratories such as the
number of new cases is still on the rise or whether South African Institute for Medical Research, the
it is declining, and so will help to indicate the National Institute for Virology, or universities
urgency of instituting control measures. (It is not with medical schools, it may be useful (depending
uncommon to find that the outbreak is already on on the organism under investigation) to request
the decline, in which case the investigation may that a microbiologist do specialized tests (for
be less urgent, as additional control measures example, ‘DNA fingerprinting’), using mole-
may not be needed.) cular biology techniques, to characterize the
organism in specimens from patients and
Place environmental specimens. The use of ‘molecular
Data collected should include home address, ad- epidemiology’ may be useful in confirming or
dress at work or school, location of any function refuting epidemiologic links between cases,
attended, and recent travel history. If the out- and in confirming environmental sources of the
break is in an institution such as a school or hos- outbreak strain.
199
e . epidemiology applied to content areas
Figure 21.1 Snow’s epidemic curve illustrating a point source epidemic -
a
o
a
oS
°
—
So
—
o
2
Ee
S
=
Pump handle removed
EP Labdataet aban
HORSAtie ecm COAT eZ eS 2 4 6 8 18 20 22 24 26 28 30
August Date of onset September
Source: Snow J. On the mode of communication of cholera. London, 1855. Reprinted in: Frost WH (ed). Snow on Cholera.
New York: Commonwealth Fund (1936).
Generating a hypothesis Testing hypotheses

The list of cases should be scrutinized for Discovering that an exposure is common among
common characteristics and exposures among cases does not implicate it with certainty. This
cases, as well as unusual characteristics or occur- exposure may be common among non-cases as
rences, as both can ‘provide important clues well. It is thus necessary to do an analytical study
about the source of exposure. John Snow sus- comparing risk factors and exposures of interest
pected that water from the Broad Street pump in cases and non-cases, to help establish the most
was contaminated with cholera when he dis- likely source of infection. Secondary cases due to
covered that the only case who lived a long way person-to-person transmission from contact with
away from the pump had water from it delivered a known primary case should be excluded from
to her daily, because she liked the taste (see analytical studies designed to ascertain the
Example 21.1). source of infection, as their inclusion will weaken
Six of seven hospital-acquired Legionnaire’s the magnitude of association with potential
disease cases had been ventilated in intensive sources of exposure. It is useful to consult an
care units, prompting the hypothesis that patient epidemiologist when planning the design and
ventilators were the source of infection. In execution of hypothesis-testing studies. One of
this case, the common exposures led to this two basic analytical study designs may be used. A
hypothesis (see Example 21.2). The descriptive cohort design is used when the outbreak is con-
information, combined with a knowledge of fined to a clearly defined group of people and it is
the relevant epidemiological literature of the feasible to study all people at risk of becoming
disease or conditions being considered, is used cases (see Example 21.2). Point-source food
to formulate hypotheses as to the most likely poisoning outbreaks that follow a meal or a
source(s) of exposure and mode(s) of trans- gathering of people are a good example of the
mission. type of situation where it is useful to do a cohort
200
21 outbreak investigations
A) =) oa®
am N el “ ” Bsi £ A] =m ° ed fe]eat & a] ” weeoecH Q ea} 6 c=)ct)& odel@ te)+ a
ie
fatal attacks ofcholera.
A study of the geographical distribution of

cholera deaths near Golden Square in London
1854 ’ showed a direct association with use of the
water pump in Broad Street.
Source: Galbraith N S, 1984.
201
study of the entire population at risk. All people linked to disease (case) status if the magnitude of
(or as many people as possible) who ate at the the exposure-disease association (either the rela-
suspected place or function during the suspected tive risk or the.odds ratio) is large in relation to
exposure period should be interviewed so that the other exposures considered.
specific attack rate and the relative risk (rate ratio) If the outbreak is due to one of the Expanded
associated with eating each food item can be cal- Programme on Immunization (EPI) target dis-
culated. A case-control design is used when it is eases against which infants are routinely immun-
not possible to define or to study all people at risk ized (for example, poliomyelitis, measles, whoop-
of becoming a case. The magnitude of the asso- ing cough), you should investigate the possibility
ciations between exposures of interest and that the outbreak is due to inadequate vaccine
disease (case) status is the most important type of coverage and/or to vaccine failure (that is, poor
evidence provided by analytical studies, and can vaccine efficacy). Guidelines on carrying out vac-
be used to assess which exposure is most likely to cine efficacy studies in an outbreak setting are
be the source of infection. In a cohort study, this provided in the reference by Orenstein et al
is expressed as relative risks, while in a case- (1985), and some of these principles are illus-
control study, it is expressed as odds ratios. trated in the reference by Coetzee et al (1994) de-
In outbreaks in large populations or where scribing an investigation of a measles outbreak in
there is not a clearly defined group of people Cape Town.
among whom the majority of cases originated, it
is necessary to do a case-control study. A number Data collection
of sources may be used to select appropriate con- If feasible, each case and control (or family
trols. Suggested sources include: people attend- member of each case, if the case is a child or has
ing the same health facility or family practice, died) should be interviewed using a standardized
pupils from the same school(s), or people from questionnaire. It is imperative that the method of
the same workplace or neighbourhood as the data collection is uniform, both among cases and
cases. As only a sample of people with each expo- between cases and controls. For example, per-
sure of interest is studied, the rate of disease asso- sonal interviews of cases and telephonic inter-
ciated with each exposure cannot be calculated, views of controls could influence the replies to
and odds ratios are used instead of relative risks questions. For the same reason, a structured stan-
as the measure of effect. Controls must be free of dardized questionnaire is essential, as the actual
the disease in question, and must have a similar questions asked and the way they are asked may
potential for exposure to the risk factors under influence answers. Items to consider in drawing
investigation as the cases. up the questionnaire are listed in Table 21.2.
If the disease is strongly associated with certain Items should be selected from this list by con-
characteristics such as age and sex, it may be use- sidering what is known of the epidemiology of the
ful to match cases and controls to limit confound- condition being investigated, omitting items
ing by these factors. Matching should only be which are not potentially relevant to the invest-
considered if the matching factor is strongly asso- igation.
ciated with disease status, but the matching Medical records are generally not a suitable
factor is not itself a factor of interest. It is advis- substitute for case interviews, as they tend to be
able to consult an epidemiologist or a biostatis- clinically orientated, and epidemiologically relev-
tician if matching is used, as it is necessary to take ant information on exposures of interest is often
matching factors into account in the analysis. omitted from the record. However, they are useful
In outbreaks affecting a small number of cases, as supplementary sources of information, espe-
it may be useful to have more controls than cases cially for providing data on the results of dia-
to improve the precision of the odds ratio estim- gnostic tests and procedures.
ates. Up to four controls (matched or unmatched) Descriptive information should be entered
may be selected for each case. In outbreak invest- onto a computer as soon as possible in order to
igations using eithe, a case-control or cohort create a basic list summarizing key attributes (for
design, the number of cases is often too small to example, age, sex, date of onset, area of resi-
detect statistically significant associations, even dence) of cases. A laptop computer is useful if
for causal exposures. However, the lack of statisti- available, especially if the investigation is being
cal significance (p-values >0,05) should not deter done in arural area. This means that information
you from inferring that an exposure is causally can be entered and analysed on the spot while the
202
21 _~=soutbreak investigations
Recommendations should be clear and prac-

Table 21.2 Items to consider in tically feasible in terms of local circumstances
outbreak investigation question- and resources available. Both short- and long-
term control measures should be implemented.
naires
Short-term measures
@ Symptoms (have a checklist and get a Measures to prevent further new cases develop-
yes/no response for each item). ing include preventing exposure to the source
Precise date of onset (rather than date of and/or breaking the chain of transmission. It is
visit to doctor or admission to hospital). often necessary to take interim control measures
« Demographic characteristics (e.g. date of based on preliminary information, before all
birth, sex, home and work/school address, likely hypotheses have been tested. Any such con-
ethnicity). trol measures should be reconsidered during the
Places which could be related to source of course of the investigation and revised in the light
exposure and transmission to others of further evidence.
(household, classroom/ward, recent travel).
Time and place of contact with other cases. Long-term measures
Whether there is a previous history of the Conclusions of the investigation should explain
disease being investigated (in conditions why the outbreak occurred at the specific time
where disease confers immunity against fur- and place and identify weaknesses in existing sur-
ther episodes, for example measles, whoop- veillance and preventive measures. These find-
ing cough). ings should be used to draw up measures to pre-
Immunization history (documented evidence vent similar outbreaks in the future, in the same
of immunization should be obtained). setting, and in other similar settings. The disease
Source(s) of drinking and bathing water. surveillance system should be reviewed and, if
Occupational details (especially where there suboptimal, measures should be taken to im-
has been occupational exposure to chem- prove surveillance.
icals or animals). The investigation should be written up clearly
Leisure activities (for example fishing, in the form of a report as soon as possible. The
hiking, camping, watersports). report should cover the background to the invest-
Animal contact. igation; a description of the epidemic in terms of
Tick or mosquito bites (in outbreaks of person, time, and place; hypotheses considered,
fevers). with epidemiological and laboratory evidence
Other disease and medication history (look- either supporting or refuting these hypotheses;
ing especially for conditions which may lead conclusions as to the cause, source, and mode of
to increased susceptibility to the condition transmission; and finally, recommendations for
under investigation). both short- and long-term control. Copies of the
report should be submitted to the authority
requesting the investigation, local health author-
investigation is in progress. The computer pro- ities, and other collaborators, as well as to the
gramme Epi Info is specially designed for use in appropriate public health authorities at provin-
outbreak investigations and is relatively easily cial and national level.
mastered by people with limited computer expe- The community in which the outbreak occur-
rience. red should be kept informed while the invest-
igation is under way (without giving away hypo-
Implementing control measures theses under investigation prematurely, as this
Once hypotheses have been tested, a decision may bias the results of analytical studies), and in-
must be made as to the most likely causative formed of the results of the investigation to allay
agent, source(s), and mode(s) of transmission, as rumours and unnecessary fears, as well as to gain
well as host factors (for example, lack of immu- their co-operation with control measures. The
nization) which may have led to increased results can be fed back to the community by
susceptibility to the condition under investiga- means of a presentation at a community meeting
tion. This will enable recommendations to be and/or through a press release.
made on specific control strategies. In concluding the investigation, consideration
203
should be given to writing the investigation up for woman had used water from the Broad Street
publication. Publication of the findings is useful if pump, and that the brewery had its own well
there is the potential for similar outbreaks in which supplied the workers with drinking water.
other localities, or as a reference if innovative This outbreak had the characteristics of a point
epidemiological detective work and analytic source epidemic (Figure 21.1). The well in Broad
study designs were required to ‘solve’ the out- Street was no longer contaminated by the time
break. the pump handle was removed, as the epidemic
The investigation of disease outbreaks is one of was almost over by this time.
the most practical and useful applications of
epidemiology. A thorough investigation that is Reference:
clearly communicated and well documented SnowJ. ‘On the mode of communication of cholera.’
contributes significantly to progress in public London, 1855. Reprinted in: Buck C, Llopis A, Najera E,
health delivery and clinical practice. Terris M (eds). The Challenge of Epidemiology: Issues and
Selected Readings. Washington DC: Pan American Health
Organization, 1988.
EXAMPLE 21.1
John Snow’s investigation of cholera EXAMPLE 21.2
in the Broad Street area of London, An investigation of Legionnaire’s
1854 disease in a Johannesburg hospital
In 1854, Dr John Snow investigated an outbreak of Between November 1985 and February 1986, 12
cholera in the area around Broad Street, London. cases of Legionnaire’s disease were identified at a
Prior experience of cholera epidemics in 1831-2 Johannesburg hospital. Based on the knowledge
and 1848-9 had led him to believe that cholera that the incubation period is two to 10 days, it was
was transmitted by contaminated drinking water, determined that two cases definitely acquired the
but his theory did not have much support at the disease in hospital, while five other cases might
time. An outbreak of cholera in his parish in 1854 have acquired the disease in hospital. Investiga-
gave him the opportunity to test this hypothesis. tions were carried out to locate a hospital source.
In his initial investigation, he obtainedalist of all A spot map was drawn of all hospital cases by
reported cholera deaths in the three affected sub- ward, and as water is known to bea reservoir for
districts the week ending 2 September from the L. pneumophila, the causative agent, water was
General Register Office. He plotted them on a cultured from all hot water outlets in the hospital.
street map and found remarkable clustering in Although L. pneumophila was cultured in water
the area around the Broad Street pump (Figure from three outlets, the distribution of these out-
21.2). He took specimens of water from the Broad lets did not correspond to the distribution of
Street pump daily and noted that the water devel- cases, so they were disregarded as the source. It
oped a surface film and an offensive smell if left to was noted that six of the seven potentially hos-
stand for a few days. He persuaded the parish pital-acquired cases had been ventilated.
Board of Guardians to remove the pump handle Attack rates in intensive care unit (ICU) pa-
on 8 September to prevent further cases from this tients were calculated. according to whether or
source. A more extensive investigation in Novem- not they had been on a ventilator. The attack rate
ber that year, comprising a door-to-door survey was 35% in ventilated patients, compared to 2%
of all households and businesses in the area, in ICU patients who were not ventilated. This im-
further implicated the Broad Street pump: of the plicated ventilators as the most likely source of
Broad Street residents, the cholera attack rate was cases who acquired the disease in hospital, a find-
80/137 (58%) in people who were reported to ing in keeping with known modes of transmission
have drunk water from the Broad Street pump, of this condition.
compared to 20/297 (7%) of those who did not
drink water from this source. Notable exceptions Reference:
also incriminated the pump as the source: one Strebel P M, Ramos J M, Eidelman IJ etal. ‘Legionnaires’
case occurred in a woman who lived outside the disease in a Johannesburg teaching hospital: Investigation
area, while 70 workers in a Broad Street brewery and control of an outbreak.’ South African Medical Journal
were unaffected. Investigation revealed that the 1988; 73:329-33.
204
22 Estimating
immunization
coverage
Introduction method for staff to evaluate and monitor their

progress (Department of Health 1994).
Childhood diseases which can be prevented by
Clinic staff generally rely on costly and infre-
immunization still account for considerable
quent population-based survey estimates that
morbidity and mortality in South Africa. Immun-
poorly reflect the outcome of their individual
ization programmes aim to decrease the number efforts. Routine statistics of vaccine doses admin-
of such deaths and diseases by increasing the istered are collected by all clinics, and should be
immunization coverage, that is, the percentage of
used more effectively at this level to monitor the
the at-risk population which is immunized. It is success of outreach activities, and the proportion
important that the success of such programmes is of children who do not receive all successive
evaluated. This can be done either by disease sur- doses of each particular vaccine (drop-off). If the
veillance or by determining the immunization quality of this data can be improved, routine re-
coverage. Disease surveillance (which entails ports will have the advantage of providing contin-
monitoring the incidence and distribution of uous and accessible information on coverage at
disease) serves as an indirect indicator of low district or local level (Borgdorff, Walker 1988).
immunization coverage or poor vaccine quality.
In contrast to disease surveillance, direct assess-
Administrative method
ment of immunization coverage is an indicator of The traditional mainstay of coverage assessment
programme success in reaching susceptible chil- (at district and regional level) has been the rou-
dren and preventing disease. Accurate immun- tine reporting of immunizations issued, divided
ization coverage estimates have an additional by the estimated population. However, both the
use, namely that of estimating vaccine efficacy numerator (doses of vaccine given) and the
(using the ‘screening’ method) by comparing the denominator (usually births in an area or the es-
proportion of disease cases among immunized timated population at risk) lack reliability, with
children to the estimated coverage in the popula- resultant overestimation of true coverage. Values
tion (Orenstein, Bernier, Dondero 1985). This in excess of 100% are commonly presented. Table
chapter will highlight methods used to assess 22.1 summarizes the sources of error.
immunization coverage in order to direct pre- Such coverage assessment data should be ap-
ventative services to areas of need, and monitor proached with healthy scepticism until proof of
progress to the attainment of programme object- validity is available. It is for this reason that other
ives. methods of assessment and population-based
surveys are given more weight (Table 22.2).
Methods using routine data
The 1994 immunization programme review of the Clinic-based coverage
Department of Health highlighted a poor under- An alternative approach to determining coverage
standing of the concept of immunization cover- makes use of birth registers found in most clinics.
age among staff at clinics, with no standardized Notified births are entered into the register
205
Table 22.1 etre of error in coverage data from vaccination returns
Numerator Denominator
@ Wasted doses (that is, doses recorded @ Census inaccuracy

may not equal doses given)
¢ Inaccuracy of reporting: ¢ Underestimation of births in the target

a) accidental population (the greatest error is in the most
b) deliberate needy areas; results in an unpredictable
_ bias) j
Repeat vaccination (a child given two ¢ Migration (particularly in rapidly

doses of the same vaccine cannot be urbanizing areas)
distinguished from two children receiving
one dose)
The true ‘at risk’ group (should include
___ only susceptibles) is usually unknown
chronologically and dates of vaccination entered interview and who are therefore excluded from
across the page. Any additional children not born the survey may represent a select group whose
in the clinic catchment area are entered when immunization differs from those who are at
they present to the clinic for services. Children home;
that move out of the area are deleted from the immunization records may not always be avail-
register. These registers are used to trace de- able;
faulters, but also provide a ready means of calcu- the complex study design may affect results.
lating clinic-specific coverage for any birth
cohort. The advantage over the administrative
method is that the numerator arises directly out EPI sampling
of the denominator, yielding real coverage per- The Expanded Programme on Immunization
centages which cannot exceed 100% (Table 22.2). (EPI of the WHO has distinguished itself in the
These estimates, however, only reflect coverage application of a practical sampling technique,
in patients who are known to the clinic, and extra- uniquely designed for coverage assessment. The
polation to the population is possible only in EPI methodology was developed with a very spe-
stable areas where the clinic is easily accessible to cific purpose in mind — that of enabling health
the whole community, and is used by the whole service staff (as opposed to researchers) to obtain
community. a reliable coverage figure (a simple proportion)
(Henderson, Sundaresan 1982; Lemeshow,
Robinson 1985).
Community-based surveys The derivation of the 30 x 7 sample (that is, 30
A survey which goes into the community to deter- clusters of 7 children) was arrived at by fairly
mine coverage has the inherent advantage of straightforward extrapolation from simple ran-
measuring the numerator (those immunized) and dom sampling (see Appendix 4).
denominator (both the immunized and non- The WHO EPI manual Evaluate Vaccination
immunized) at the same time. Although signi- Coverage has proven extremely useful as a prac-
ficantly more expensive, community-based sur- tical guide to the techniques involved. First, the
veys can provide coverage estimates that are population is divided into convenient groups or
more representative than those arising from rou- clusters (villages, street blocks). These divisions
tine sources (Table 22.2). are then weighted according to estimated popula-
However, biases and other obstacles to accu- tion size. Thirty clusters are then selected, using
rate assessment can still occur; for example: simple random sampling. Within each selected
children who are not at home at the time of the cluster, a starting point for the survey is then
206
22 = estimating immunization coverage
Brel): er Mere eelty
A comparison of three methods used in estimating immunization coverage
Method Advantages Disadvantages
Administrative Population-based Values exceed 100%

Inaccurate numerator and
denominator
Clinic registers Readily available Clinic population

Efficiency measure, mamage- Staff time
ment tool
EPI cluster survey Population-based Expensive

Reliable Sampling bias
Lack of precision
(Adapted from a presentation by Dr P Strebel at the National EPI review seminar in Pretoria on 21 September 1994.)
directly established by further sampling (second the scope of this chapter.

phase sampling). Once in the field, individual Statistically speaking, children within each
households are selected consecutively from these cluster should be sampled randomly, and some
starting points by the application of a rule such as criticism has been levelled at EPI for its policy of
‘next house to the right’. choosing households consecutively, thus intro-
However, if the population structure of the divi- ducing selection bias. Houses in a cluster are
sion is very heterogeneous (that is, some are fully spatially related, which could result in their occu-
immunized, others not), it may be advisable to pants having similar immunization status.
divide it into a greater number of smaller divi- Since EPI coverage surveys have such specific
sions or strata (see Example 22.1). objectives, they cannot readily be used for pur-
One of the methods described in Evaluate Vac- poses beyond estimating a simple proportion,
cination Coverage is that of going to the centre of and attempts at extending them without consid-
a village and spinning a bottle or using some erable increase in sample size will generally suffer
other random method for choosing a random from lack of statistical power.
starting direction, and then counting the number The general economy of design, however
of houses between the central point and the out- elegant, also implies that there is very little scope
skirts of the settlement. The standard approach for conclusions concerning subregions. For
for obtaining a starting point is to select one of the example, studies in the Transkei and former Cape
houses in the chosen direction by means of Province give reliable figures for the whole region,
simple random sampling. But this house does not but say little about which specific local areas need
have the same chance of being selected as any the greatest attention (Byarugaba 1989; Bach-
other house in the village. Any homestead nearer mann, Thomson, Coetzee, Klopper, Skibbe 1992).
the centre has a much higher chance of being se- One can increase the sample size until immuniza-
lected than one on the outskirts of the village. tion coverage can be determined with more pre-
This can be understood when one realizes that cision for each subregion (or stratum) (Chapman
the probability of being selected is not propor- 1990; Coetzee, Yach, Blignaut, Fisher 1990). This
tional to the line segment in which the house is particularly important in large cities and urban
would be found, but to the area which contains it. areas where high immunization coverage city-
There will be more cherries on the outer half of a wide may conceal pockets of low coverage that
slice of cake, presuming that they are evenly dis- act as foci for continuing transmission of target
tributed. A method has been developed to make diseases. For example, in peri-urban areas, the
this approach more representative, but is beyond lowest coverage areas are those with the highest
207
Sennen ee ee ee eee ee eee
proportion of recent migrants (see Example 22.2). items are children and the lots areas. This method
Lot quality assurance sampling (see below) also allows one to determine whether immunization
provides a useful method of determining sub- coverage in a small area is acceptable or not, but
region coverage status. does not estimate the actual coverage rate. It is
The EPI sampling technique was originally de- ideally suited for smaller urban areas where the
veloped for rural areas in developing countries base population of different strata is too small for
and is not always ideal. Where relatively few chil- the EPI cluster method (Rosero-Bixby, Grimaldo,
dren of the appropriate age group are present ina Raabe 1990; Lanata, Black 1991; Cutts 1991, 1994).
community, the standard technique of going
door-to-door may prove very tedious. In densely House-to-house sentinel population
populated urban and peri-urban areas, simple investigation
random or systematic sampling is feasible. Ad- In urban areas with limited resources, lack of
aptations to the EPI sampling technique have available expertise, and poor census data, health
been successfully implemented, and are de- services may focus on specific problem zones
scribed in the Examples. where most cases of disease occur (sentinel popu-
A useful adjustment to the EPI sampling lations). Door-to-door investigation of all chil-
methodology has been Kok’s modification used dren in these defined areas can be undertaken
in Kenya for dispersed populations (Kok 1986). It rapidly (as during an outbreak), or at convenient
uses primary school lists as a sampling frame to intervals by health service staff with minimal
target areas where children live, and then goes to training. As no sampling is required and analysis
neighbouring households, so avoiding the direct is done by hand, this approach to estimating im-
bias of selecting children who have siblings in munization coverage in problem areas has
school. Kok’s original assumptions were the fol- proven to be less costly than EPI cluster sample
lowing: surveys of equivalent size (Weeks, Barenzi,
@ the early primary school child is ‘the most Wayira 1992).
randomly and proportionally distributed regis-
tered sampling unit in the community’; Measurement considerations
¢ school attendance should be (‘arbitrarily’) not
less than 70% (this may be a problem in isolated Attention must be given to fieldworker training,
rural areas of South Africa); and questionnaire design, and definition (determina-
@ there should not be certain groups who are tion) of vaccination status. Questionnaires which
excluded from schooling. are unambiguous and generate useful variables
should be used. Variables of interest are receipt or
This method, which was tested in the Machakos not of vaccination, type, number, and age at
district of Kenya, has proven a useful tool for EPI which vaccines were received. Age should be
internationally, and some South African re- determined using date of birth. If date of birth is
searchers have put it into practice. unknown, the researcher should be aware that
Another modification involves the use of primi- there may be a preference for a given month (for
parous mothers for assessment of vaccination example, January) in the responses. Any age in-
coverage. Though the reason for choosing primi- accuracies could have an important effect on age-
paras is not apparent, the method does seem to specific immunization coverage calculations. The
have potential merit if extended to include all training manual for EPI coverage methodology
mothers giving birth at a particular health facility. recommends surveying children between 12 and
The conditions under which this method would 24 months of age as a measure of immunization
give reliable results would be if the home delivery efforts over the preceding year. Information on
rate was low, and follow-up excellent. where vaccination took place may be useful from
amanagement point of view.
Lot Quality Assurance Sampling (LQAS) As in all studies, fieldworkers should be well
Details of this sampling method are given in trained and motivated before data collection
Chapter 16. The basic principle is that a certain begins, and periodically checked while in the
number of items from a lot (batch) is tested and field. Careful documentation of information on
the whole batch ‘rejected’ or ‘accepted’ on the vaccination cards is vital, and workers should
basis of the number of defectives in the chosen know exactly what each entry means. The import-
items. In the case of immunization coverage, the ance of accurate recording should be empha-
208
22 _~=—sestimating immunization coverage
sized, as should the need to encourage parents to recommended that researchers make the effort to
find cards which they have ‘temporarily mis- familiarize themselves with the problem (Hen-
placed’. The response rate should be improved by derson, Sundaresan 1982). A minor adjustment
revisiting locked dwellings at a later stage. For this can be made to the chi-square test that consider-
purpose, and also for quality control inspection, ably improves the accuracy of the test.
the name and address of each family should be Work done at the MRC has concentrated on a
carefully recorded. method called the ‘bootstrap’ technique (Rei-
Vaccination status can be determined by vac- nach, Groeneveld, Neethling 1990). This method
cination information on the Road to Health card, requires much computing for each data set, as it
clinic records, in some cases school records, involves numerous resamplings (with replace-
physical signs such as BCG scar, or a verbal report ment) from the data. Unbiased estimates of the
by the respondent (usually the mother). Ignoring population variance (or any other parameter) can
the subjective verbal report clearly leads to an be ‘obtained. Methods are now being sought to
underestimation of coverage in absolute terms; streamline this technique.
yet the advantage of having a reproducible mea- In short, if the researcher is interested in ana-
surement cannot be overemphasized. Depending lysing anything beyond simple coverage propor-
on how immunization status is defined, different tions from a cluster sample, detailed statistical
coverage estimates may be derived. In most sur- assistance will be required during both design
veys, immunization coverage is estimated using and analysis. For more detailed information
all children with vaccination information accord- about the methods described above, refer to
ing to a health card as the numerator and all chil- Berry and Yach (1991), and for their application in
dren as the denominator. This can be considered South Africa, see Berry, Yach, Hennink 1991,
a worst-case estimate (since all children for Coetzee et al 1990, and Coetzee, Ferrinho,
whom a card cannot be shown are categorized as Reinach 1993.
not immunized). A best-case estimate would be
using the same numerator as above, and all chil- Conclusion
dren with cards as the denominator. eee
The regular estimation of immunization coverage
both these figures should be reported.
by health services forms the basis of immuniza-
Immunization data should be collected so that
tion programme evaluation. Only if these es-
coverage can be calculated for individual anti-
timates provide accurate data on specific sub-
gens, and overall coverage for immunizations
regions can they enable programme managers to
given on time (within 28 days of due date accord-
direct preventive efforts to areas with the greatest
ing to schedule). Coverage should also be calcu-
lated according to immunization status at 12 potential for disease transmission.
months of age, and at the time of the survey. The
standardization of coverage calculations accord-
ing to these criteria will facilitate comparisons be- EXAMPLE 22.1
tween areas and over time (Ferrinho, Buch 1991). Adaptations to the EPI sampling
strategy in the Mosvold health ward
Analysis and the Free State
When complex sampling methods (such as clus- A study to determine vaccination coverage in
ter sampling) are used, there is a need to correct Northern KwaZulu-Natal involved the use of a
for the design effects at the analysis stage. Simple 1:50 000 grid map. Numbers were allocated to
quantities and proportions are minimally biased each 5 x 5,5 kilometre square in proportion to
by most complex designs. However, the calcula- homestead density. These figures were then used
tion of confidence intervals depends on the to ensure that probability of inclusion in the study
design. would be proportional to population size. Sam-
Similarly, using tests and measures based on "pling began at a point on the road nearest to the
simple random sampling — such as the chi- centre of the square. This particular version of the
square test for association and the odds ratio as a EPI-type study is very useful in rural settings with
measure of the strength of association — may widespread populations.
give biased results because of the complex design. Chapman made use of a very similar method in
The differences can be dramatic, and it is strongly his survey of Free State farms to determine pri-
209
ceria ripen | ae atin een A RR a peA Ot OR
mary health care coverage and the health status fied using standard EPI methodology. The larger
of Africans living on farms. The first stage of the sample of 450 (45 clusters of 10 each) allowed for
study used census data to allocate a number of post-sampling- stratification into formal and
clusters to broad areas. A 1:250 000 map was used informal dwellings.
for the second stage, but here millimetres along Immunization coverage was lowest in children
vertical and horizontal axes were used for select- from informal dwellings. Because the average
ing a random starting point. The farm located number of people is higher in informal dwellings,
where this point fell would be visited, and dwell- the use of dwellings as uniform sampling units
ings would be sampled consecutively, starting at probably resulted in under-representation of
the one nearest the farmhouse. An improvement these households.
may have been to use a more random method of In Khayelitsha, coverage was estimated in 12 to
selecting the first house. 23-month-old children living in three geo-
graphically distinct strata: permanent housing,
References: temporary housing, and a transit camp.
Buchmann EJ, Ngesi N, Tembe R, GearJ SS, lJsselmuiden C A sampling frame was developed by mapping
B. ‘Vaccination status of children aged 12-23 months in the the houses in each stratum during community
Mosvold health ward of KwaZulu.’ South African Medical surveys preceding the coverage survey. Stratified
Journal 1987: 72:337-8. multistage sampling selected 46 cluster starting
Chapman RD. ‘The Orange Free State experience. Deter- points, with allocation proportional to the estim-
mining both primary health care coverage and the health ated number of houses per stratum. From each
status of blacks living on farms.’ CHASA Journal 1990 starting point, 10 children were selected using
1(1):30-36. standard EPI methodology. The larger sample
size (46 clusters of 10 children each) allowed for
the estimation of stratum-specific coverage. Chil-
EXAMPLE 22.2 dren living in temporary housing had the lowest
Immunization coverage estimation immunization coverage, and were most likely to
be recent immigrants from the rural Eastern Cape.
adapted for use in peri-urban
settings
A study to evaluate the success of an outreach References:
immunization programme in Alexandra town- Coetzee DJ, Ferrinho P, Reinach S G. ‘A vaccination survey
ship used aerial photographs to define a sampling using EPI methodology to evaluate the impact ofa child
frame. The whole township was divided into 108 health outreach programme in an urban area of South
blocks (120-150 houses each), of which 45 were Africa.’ WHO Bulletin 1993; 71(1):33-9.
randomly selected. Within each of these blocks, a Coetzee N, Yach D, Blignaut R, Fisher S A. ‘Measles vaccina-
random starting point was selected, from which tion coverage and its determinants in a rapidly growing peri-
point 10 children (12-23 months old) were identi- urban area.’ South African Medical Journal 1990; 78:733-7.
210
23 Environmental
epidemiology
Introduction affect physical or mental health in some way,

either positively or negatively. An approach to the
Canaries were used in bygone days to detect toxic
discipline of environmental health which focuses
concentrations of carbon monoxide in mines.
on environmental hazards of human origin is
Today, environmental epidemiologists are the
useful in that it focuses on potentially remediable
‘canaries’ capable of giving warning of impending
problems (as opposed to, for example, ‘natural’
environmental disaster.
pollutants such as background radiation, pollens,
‘Fortunately, our fate is not to die as the unfor-
or climatic factors).
tunate canaries of the coal miners did, but to sing
Environmental epidemiology is an important
— to call out in clear tones the nature and type of
tool for investigating the distribution and deter-
impending health danger that threatens.’ (Gold-
minants of environmental diseases in human
smith, 1988.)
populations. Epidemiologists have thus been get-
The discipline of environmental health
ting more and more involved in evaluating the
emerged in the 1980s, with the increasing recog-
impact of environmental pollution on human
nition that efforts at environmental improvement
health. Environmental health professionals are
and protection could have a positive effect on
playing an increasingly important role in environ-
disease prevention. Other than immunization
mental impact assessments, and, for example, in
programmes, there are few health interventions
ensuring that health considerations are taken into
with similar opportunities for primary pre-
account early in the planning procedure of pro-
vention. In order to implement preventive mea-
jects which impact on the environment.
sures, it is necessary to understand the way in
The extremely complex nature of environ-
which the environment influences health.
mental diseases places special methodological
demands on the design of studies investigating
Definition of environmental health the relationship between environmental expo-
In its broadest context, environmental health sure and disease. For example, one is often deal-
comprises the aspects of human health and ing with a range of non-specific ill-health effects
disease that are determined by factors in the (some of which may only manifest themselves
environment. The WHO includes in this the study after many years), and the causes may be due to
of both the direct pathological effects of chemical, many different factors, rather than to a single
physical, and some biological agents, as well as source. Considering the implications for govern-
the (often indirect) effects on health and well- ment and industry of instituting better control
being of the broad physical, psychological, measures and policies to regulate pollutants, it is
social, and aesthetic environment, which includes essential for studies to be conducted in a rigorous
housing, urban development, land use, and and unbiased way. Specific issues relating to
transport. exposure and outcome (effect) estimation are dis-
Virtually every aspect of the environment may cussed here.
211
The nature of environmental The actual harm that can result from an expos-
diseases ure is related both to its toxicity and the cumu-
lative dose to the target organ. The actual effective
Despite well-documented cases of environmental dose is the amount of toxic material which
disasters, accidents, and spills (for example, reaches the critical target organ in the body. It
Bhopal, where tens of thousands of people were depends on the concentration in the environ-
poisoned in 1984 by methyl isocyanates, or Spain, ment, as well as on factors related to uptake, such
where 13 000 people were poisoned in 1981 by as the route of entry into the body and the phys-
chemically contaminated cooking oil), the contri- ical and chemical forms of the toxic substance.
bution of environmental exposures to the causa- Almost any chemical could cause harm if taken
tion of many diseases often goes unrecognized. into the body in sufficiently large amounts. How-
ever, in environmental health, one is often con-
Environmental health outcomes cerned with those chemicals which may have
adverse health effects in small doses.
Types of outcomes In conducting risk assessments necessary for
Outcomes (effects) in environmental health stud- setting safety standards, it is particularly import-
ies may include death, specific defined diseases ant to establish the nature of dose-response rela-
(for example, lung cancer), disease categories tionships (or exposure-response relationships).
(respiratory illnesses such as pneumonia, Whilst toxicology has an important role to play,
asthma, and bronchitis), symptom complexes problems involved in extrapolating from animals
(coughing and wheezing) and biochemical/ to humans, and from high-dose to low-dose situ-
physiological changes which may not necessarily ations, make it necessary to rely on well-
result in symptoms. In fact, a whole hierarchy of conducted epidemiological studies.
health effects may occur, ranging from minor In general, the toxic effects of pollutants can be
temporary ailments through to acute illnesses divided into broad classes such as acute toxicity,
and chronic disease, with relatively resistant and chronic toxicity, cancer effects (carcinogenicity),
susceptible people at either extreme of the distri- birth defects caused by substances (terato-
bution. Thus there may be different exposure- genicity), and genetic effects (mutagenicity). One
effect relationships for different subsets in the distinguishing characteristic between these cate-
population. gories is the fact or assumption that dose thresh-
Ill-health effects of environmental exposures olds exist for acute and toxic effects, but these
may occur over a short or long period, they may thresholds do not exist, or have not been demon-
be reversible or irreversible, they may increase or strated for carcinogenic effects.
decrease over time, and they may be continuous
or temporary. Effects may be acute — for exam- Detection of environmental health outcomes
ple, they may follow quite soon after exposure Case detection in environmental studies can be
(often a single major dose) to an irritant. More subject to selection bias if members of a group are
often, however, chronic disease occurs as a result aware of their exposure status and its association
of cumulative exposure to complex mixes over with the outcome of interest. Self-reports of
long periods. Examples include asbestos expos- symptoms are often used as a health outcome in
ure and mesothelioma, and radiation exposure studies with uncertain exposures and few or no
and leukemia. The dispersal of the population at objectively measurable health effects. Potentially
risk over time, and the long incubation period, exposed groups may report outcomes at an in-
make it difficult to reconstruct exposures. For creased rate compared to a non-exposed group,
example, cancer from exposure to asbestos may which may result in over-reporting of cases
only develop some 20 years or more after the ini- among the exposed group. Examples of such situ-
tial exposure, during which time the original ex- ations include communities worried about toxic
posed population may have moved and become waste-site exposures, and office workers con-
exposed to other risk factors. Tracing the original cerned about indoor air quality. In such in-
population in such cases can be extremely diffi- stances, it is necessary to distinguish between
cult and sometimes impossible. Acute health demographic and other factors known to be re-
effects are thus usually easier to detect than lated to symptom reporting, over-reporting of
chronic health effects, which may be difficult to symptoms due to environmental worry, in-
relate to specific hazards or sources. creased symptoms due to stress caused by
Ze,
23 environmental epidemiology
environmental worry, and increased symptoms measurement of the pollution levels currently
due to environmental exposures. emanating fromatoxic waste site will reflect the
Under-reporting of cases (particularly among current exposure situation, but exposure 10 years
rural Africans in South Africa) could also produce earlier may have been quite different (because
spurious results in environmental epidemiologic different materials were being dumped, or be-
studies, depending on which exposures and cause of subsequent environmental modification
which geographic areas are studied. and migration of the original pollutants). Prob-
Frequently, when we talk about low-level lems may also arise because the measurement
environmental exposure, we are concerned with technology may have changed over time, with
effects which may occur in a very small percent- current technology increasingly able to measure
age of the population only. This may necessitate lower levels.
very large-scale epidemiological studies. In such
cases, it may be advantageous to carry out multi- Multiple agents
centre or even multi-country studies. To increase A large number of causes may contribute to most
the probability of detecting a real effect in a small environmental diseases. The various contributing
population, researchers could follow the popula- causes (confounding factors) may obscure the
tion up for many years to accrue sufficient contribution of any one particular factor. For
person-years of risk. A further alternative is to example, the contribution of sources in the
investigate a number of disease symptoms re- domestic (home) environment, the local or com-
lated to a particular exposure. munity environment, and the regional environ-
In other cases, small area studies are needed, ment (climatic zone, geographic latitude and
particularly where clusters of rare diseases are longitude) may need to be assessed. It is impor-
investigated with respect to their possible links tant to assess an individual’s total exposure in the
with environmental conditions. Routine data on various environments in which exposure may
health outcomes may be very limited, which take place, as well as to identify other substances
means that other data sources must be used. Also, that may modify its effects.
available data may only cover large geographical
areas, which are usually not subject to undue ex- Interactive effects
posure to environmental contamination from a Low-level chemical exposures in particular are
localized source. often factors which play a contributory rather
than a primary role in the causation of disease.
Exposures may be interactive, resulting in a
Exposure estimation reductive effect, an additive effect, or a synergistic
effect (where the combined effect is greater than
General exposure issues the sum of the individual effects). For example,
The validity of studies in the field of environ- the incidence of lung cancer is very high in ura-
mental epidemiology depends to a large extent on nium miners and asbestos workers who are
the assessment of exposure. This is one of the smokers, but may be less significantly elevated
most difficult areas in this type of research, due to among workers with similar occupational expos-
the problems involved in the definition and ures who are not smokers. A number of host and
measurement of exposure, as there will usually be environmental factors can modify the effects of
a degree of misclassification of exposure. If the toxic substances.
probability of misclassification is the same for
those who develop a specific disease and those Multiple sources and pathways
who do not, the effect will usually be to dilute the In environmental research, multiple sources and
differences between the groups. pathways of exposure are frequently encountered
There are two dimensions to estimating expos- (Figure 23.1 illustrates the multiple sources and
ure, namely the concentration level and the dura- pathways of lead exposure). One difficulty is the
tion of exposure. In investigating acute effects, ‘general lack of knowledge about many aspects of
the current exposure concentration level may be the behaviour of environmental pollutants. .In
adequate, but in studying chronic effects (after a order to estimate exposure accurately, it is
long exposure period, or long latency period), important to know how toxic agents enter into the
past exposure concentration levels are important, environment, whether they become dispersed or
as well as the duration of exposure. For example, reconcentrated, and what the pathways of human
213
Figure 23.1 Multiple sources and pathways of exposure:

the example of lead
Direct inhalation
!
Route (2) Food
od
Source: Rutter M, Russell Jones R, 1983.
exposure are. Exposure to chemical substances at the outlet of a sewerage works. This, however,
are normally mediated by complex environ- may have little significance in terms of people
mental pathways, and more than one route may who might actually be affected by such emissions,
contribute to human uptake. Often one pathway since they are unlikely to live in the immediately
contributes the major proportion of the pollutant. exposed area. On the other hand, ambient air
Should this critical pathway not be identified, the quality monitors may be placed at the top of
multiple pathways contributing to the total ex- multistorey buildings where air quality may be
posure must be carefully assessed. Adequate con- considerably better than at the actual breathing
trol measures can only be applied when the rela- level of people on the ground. Normally only a
tive importance of the various routes of exposure few areas and a few substances are monitored.
has been established. What is generally required for epidemiological
studies, however, is detailed knowledge of tem-
poral and spatial variability of concentrations of
Environmental monitoring exposures. In the case of many pollutants, there is
Research data is often obtained from local, a sharp decrease in the concentration level as one
provincial, or national monitoring programmes, moves away from the source. There may also be
which are normally carried out for regulatory pur- vertical variations in the source, and, as illus-
poses and not for exposure-related studies. Regu- trated in the example above, air sampling points
latory environmental data is collected with the placed considerably above breathing level may be
aim of detecting peak levels to assess compliance safe from vandalism, but are inappropriate for
with standards or guidelines. Data collection is use in population exposure studies.
often done at places of little relevance in terms of When variations in pollutant levels are consid-
human exposure. For example, an air monitor erable (for example, in most air pollution studies),
may be placed in the immediate vicinity of a par- it may be more appropriate to use personal air
ticular factory, or water quality may be monitored samplers or filter badges rather than stationary
214
23 = environmental epidemiology
artigw ea Ce Dice)
Hourly mean concentrations of smoke (1g/m3)
2000
1600
1200
800
12
Hour of day
Source: Kemeny E, Ellerbeck R H, Briggs A B, 1988.
air samplers. These have the advantage of being the average exposure level may be important,
mobile and have the potential to measure an indi- while in another, the use of peak values may be
vidual’s total exposure. If large populations are more relevant (for example, in assessing noise ex-
being monitored, however, such samplers may posure). Cumulative exposures may be signific-
not be practical on a large scale, and exposures ant in the assessment of, for example, radiation
can be characterized at group level using station- exposure, where multiple exposures may be
ary samplers. largely cumulative.
As spatial variations in pollution levels pose
specific sampling problems in terms of where to Biological monitoring
sample, so variations in the level of pollution over There is a growing interest and increasing re-
time (for example, seasonal and daily variations) search into biological and biochemical markers of
pose problems of when and how often to sample. exposure (for example, DNA adducts), which
Pollution levels in Soweto, for example (see should improve the effectiveness of exposure
Figure 23.2), are highest in the early morning and assessments in the future. Biological monitoring
evening, when coal-burning activities peak. Thus aims to estimate the level of specific chemicals at
temporal variations in the level of the pollutant the critical target sites where their toxic effects are
will have implications for the frequency and type initiated. This level is the biologically effective
of monitoring. Should measurements be made, dose or the internal exposure level. Concentra-
for example, on a continuous basis, several times tions of chemicals in body fluids or excreta may
a day, once a week, monthly, or at some other be measured, as can effects such as mutations,
time interval? The biological variation of the chromosomal aberrations, or sister chromatid ex-
pollutant in the human system, for example, changes induced by chemicals in cells and tissues
whether or not rapid absorption and excretion of exposed individuals. Blood and urine are com-
takes place, complicates the matter further. monly used for biological monitoring, as well
Ultimately, the exposure data for the study must as hair (for example, to detect mercury or lead
be summarized: the choice of an appropriate concentrations), nails (arsenic), teeth, bone
summary measure may be critical to the ultimate (lead), faeces and breast milk (organochlorine
understanding of the exposure. In one instance, pesticides).
215
Target population confounders (for example, smoking, occupation)

and exposure and outcome data for exactly
In contrast to occupational epidemiology, which
corresponding areas, severely limit this design,
focuses on young to middle-aged adult popula-
however. Studies which have relied on this design
tions (predominantly male) who are relatively
include cancer epidemiology studies, in which,
healthy when they start working, environmental
for example, lung cancer rates in populations
epidemiology focuses on the unborn foetus, in-
were assessed in relation to air pollution levels or
fants and young children, and the elderly. Infants
smoking rates, and oesophageal cancer rates
and young children are at particular risk, as they
assessed in relation to salt consumption in the
take in more of a contaminant relative to their
community.
size than do adults, they have immature and vul-
’ Case-control studies can only be used when
nerable physiologies, and, because of their youth,
enough is known about the outcome to properly
they have a greater probability of manifesting
identify and classify cases and controls. In press-
long-term effects. An advantage in studying chil-
ing environmental exposure problems, where
dren in environmental health studies is that they
timeliness of findings may be important, the
are unlikely to be affected by common con-
case-control study, being relatively quick to con-
founders such as smoking and occupational ex-
duct in most circumstances, may be appropriate.
posure.
Examples of case-control studies in environ-
Certain individuals (as opposed to groups) may
mental epidemiology are the study of birth
be particularly sensitive or vulnerable to agents
defects in newborns in relation to pesticides or
that may be relatively harmless or trivial to most
radiation exposures, or pneumonia rates in chil-
other people. Control measures should therefore
dren in relation to indoor air pollution (such as
be geared not only to people of average health,
wood-burning fires).
but to the most susceptible subgroups.
Cohort studies may be used when enough is
known about the exposure to characterize it
Study design effectively, or when evaluating unknown or sus-
In studying the relationship between exposures pected outcomes. As it is possible to characterize
and outcomes, the environmental epidemiologist most exposures at least partially, cohort studies
may employ a number of strategies or study de- are often the design of choice in environmental
signs. If the exposure can be characterized, and epidemiology. They also allow a more complete
the outcome defined, any of the usual study de- investigation of complex exposures or multiple
signs can be used. outcomes. Large environmental cohort studies
Cross-sectional studies are usually the first ap- are currently being conducted in South Africa. An
proach used in assessing exposure-outcome rela- example is the study of respiratory signs and
tionships and, being exploratory in nature, have symptoms in a cohort of children born in Johan-
the advantage of being able to simultaneously nesburg and Soweto (Yach, Cameron, Padaya-
investigate the relationship between several ex- chee, Wagstaff, Richter, Fonn 1991). These chil-
posures and many outcomes. An example would dren were regularly followed up over a period of
be a study in which a questionnaire is distributed time, and the development of various signs and
to a cross-section of a community with questions symptoms studied in relation to various expos-
on the following: exposure to various environ- ures to factors in the home environment, such as
mental factors (such as outdoor air pollution and cooking and heating practices, overcrowding,
tobacco smoke, pollution in the workplace), and and smoking. Information on exposures was ob-
various outcomes such as chronic obstructive tained initially from mothers prior to giving birth,
lung disease, asthma, and various respiratory and subsequently at various intervals of the chil-
signs and symptoms. The disadvantage of such a dren’s development. Levels of pollution in chil-
study is that it is difficult to ascertain the direction dren’s homes were measured using personal
of association of the exposures and outcomes. samplers (Von Schirnding, Mokoetle, Mathee
Ecologic studies are also commonly used, as 1994).
they require information only at a group (rather Another example is a study conducted in the
than individual) level with respect to disease fre- Western Cape of ill-health effects associated with
quency and exposure. The invalid assumption of swimming in polluted marine waters. People
homogeneity of exposure of individuals in an were interviewed at the beach about their swim-
area, the frequent absence of information about ming status and other factors and subsequently
216
23 = environmental epidemiology
by telephone, to ascertain whether any gastro- discipline which has undergone rapid develop-
intestinal, respiratory, or related symptoms had ment in recent years because of increasing con-
developed in the two or three days after the beach cern about deteriorating environmental quality
outing. Illness rates were examined in relation to and its effect on human health. Environmental
swimmers and non-swimmers exposed to varying epidemiological studies have become increas-
degrees of water quality, assessed by various ingly more sophisticated and refined, with more
microbiological indicators. The study will be used emphasis and attention given to exposure assess-
to set recreational water quality standards for ment, health outcome definition, and study
South Africa (Von Schirnding, Strauss, Kfir, Ro- design, as discussed above. While some studies
bertson, Fattal, Mathee, Franck, Cabelli 1993). and approaches are complex and costly to con-
Experimental studies are rarely conducted in duct, demanding expensive environmental and/
environmental epidemiology nowadays, except or biological monitoring programmes, this is not
in situations where it is possible to try out some always the case. Much progress has been made
corrective action in reducing exposure. Such an with relatively simple ‘descriptive’ studies, which
intervention might involve the treatment of a have led to important observations and theories
water supply, or the addition of scrubbers to a about the nature of environmental exposures and
plant stack, or the removal of soil and control of associated outcomes. It is hoped that this field of
dust in mitigating environmental lead exposure. epidemiology will continue to receive the neces-
sary support in the future, as it is critical that the
results of such research be used to form the foun-
Conclusion
dation of future regulatory policies.
Environmental epidemiology is an emerging
ONE
24 Occupational
epidemiology
Introduction completing medical examinations, for exam-

ple, a chest X-ray and lung function tests when
The study of occupational health dates back to the outcome is respiratory disease.
the early 17th century, with the work of Bernadino @ Ifan association is suspected or shown, this can
Ramazzini. One of its distinguishing features is lead directly to preventive activity, which is re-
the attribution of disease causation to workplace latively easily instituted collectively in a work-
exposures. The early stages of the discipline were place setting as opposed to an individual-based
typically concerned with clinical description, and curative intervention, which is typical of
only secondarily, if at all, with pathogenesis or the clinical medicine. The chronic nature and irre-
mechanism of disease. Occupational epidemi- versibility of most occupational diseases
ology is simply epidemiology in the context of strongly supports a preventive orientation.
occupational health, and describes the distribu- For example, in the case of respiratory diseases,
tion and determinants of disease in occupational chronic bronchitis and pneumoconiosis (scar-
groupings. It is thus a basic tool of occupational ring of the lungs) are irreversible and usually
health practice. Two useful references relating to progressive with time. Being able to demon-
research methods and case study material are strate a causal association with dust exposure
Checkoway, Pearce, Crawford-Brown (1989) and underlines the importance of reducing dust
Steenland (1993). levels for the group as a whole as the most
effective means of preventing ill-health (see
What makes occupational Example 24.1).
epidemiology different? Industrial activity is very closely linked with
vested economic, social, and political interests.
Several factors peculiar to occupational health Because this is a field of social conflict in which
make occupational epidemiology a topic worthy participants frequently have widely differing
of special mention, as distinct from general perspectives, occupational health is a difficult
epidemiology (Myers, Von Schirnding 1992). The area for scientists and health professionals.
most notable of these are the following. Ethical considerations related to occupational
@ Workers form well-defined groups which lend epidemiologic findings assume particular
themselves to quantitative observation. importance, and special codes of ethics have
There is a strong focus on demonstrating asso- existed for many years to cater for these special
ciations between exposure and effect. Expos- needs.
ures are relatively well defined, and exposure Cost considerations are paramount in occupa-
information is relatively easily obtained from tional health because those who pay the costs,
records or directly by measurement. Health usually management, are constrained by the
outcome data is likewise fairly easily obtained need to realize profits. Workers are also
from the worker group, usually by administer- constrained by limited funds, particularly in
ing a detailed questionnaire interview and South Africa. There are thus strong pressures to
218
24 occupational epidemiology
conduct occupational epidemiology thesis generation and testing, as well as follow-up

economically and efficiently. Negative atti- and evaluation of interventions, are enhanced by
tudes toward research are frequently encoun- this. Membership of trade unions and pension
tered among industrial stakeholders. funds is useful for tracing people.
Sampling
Methodological issues in occupa- Sampling is mostly unnecessary due to the small
tional epidemiology size of surveyed populations, which usually
number several hundreds, and in some cases, a
Study design few thousand. Should it be necessary, a sampling
All survey designs are used, but some are particu- frame is always readily available, subject to diffi-
larly common and appropriate. The commonest culties in tracing workers who have left work for
type of study is the cross-sectional or prevalence various reasons.
study. This is particularly so in developing coun-
tries such as South Africa, where it is difficult to Measurement
retain contact with a cohort because of the in- The ability to characterize exposure accurately is
stability of working (often migrant) populations the great strength of occupational epidemiology.
with low levels of skill. Compared with much general epidemiology, ex-
Case-control studies are also common, but posures are very evident. It is also usually possible
these and cohort studies are not frequently en- to get some measurable handle on these expos-
countered in South Africa. One variant of the ures. Exposure categorization into qualitative
latter is the retrospective cohort study which is (yes/no), semi-quantitative (high/medium/low/
often found in occupational epidemiology, as it no) and quantitative variables is almost always
meets the criterion that exposure information (in possible.
this case, workplace records) must exist prior to Exposure indices can thus be generated. This
defining the historical cohort. Intervention stud- allows quantitative risk assessments to be made
ies are also easy to do and are becoming popular by generating measures of association between
due to increasing emphasis on evaluation .of exposure and outcome and also gradations of
occupational health measures and services. association, in particular exposure-effect (con-
Experimental studies in most areas of occupa- tinuous outcome variable) and exposure-
tional health are prohibited by ethical considera- response (categorical outcome variable) relation-
tions, although some laboratory experimentation ships.
is done on healthy volunteers to gauge acute Some caution is needed with respect to expos-
effects of common occupational exposures. Some ure. Internationally, serious industrial hygiene
people, however, feel that ‘experiments’ go on all practices only date back a few decades, and South
the time in the workplace in the sense that Africa is particularly far behind in this regard.
workers are knowingly, and with legal sanction, Where exposure measurements are available in
repeatedly exposed to known toxic and carcino- industry, they often focus exclusively on high ex-
genic substances. Two examples are the carcino- posure areas. This results from the typical indus-
gen asbestos and the asthma-causing agent trial hygiene approach, which is concerned prin-
toluene disocyanate, to which spray painters are cipally with whether environmental exposures
exposed. are below legislated occupational exposure limits.
One factor that particularly strengthens survey Such compliance-oriented methods are not ideal
design in occupational epidemiology is the ready for accurate estimation of exposure levels.
availability of control groups. These are mostly Despite these problems, health effects abound
internal (from the same factory population) con- because exposures are usually so gross in devel-
trols and are generated on the basis of objective oping countries, and may be detected relatively
exposure measurements. Internal controls are easily, even in rather small groupings. Exposure
generally superior to external controls, as control data, however imperfect, may exist in company
and index group subjects are relatively homo- records, or may be reconstructed with relative
geneous with respect to most factors other than ease by performing current measurements during
the exposure of interest. unchanged production processes, or by simula-
The identifiable nature of the group is another tion of exposure conditions in different historical
strength of occupational epidemiology. Hypo- phases of production.
219
e@ epidemiology applied to content areas
Precision is generally not a problem due to the estimation by the long induction periods charac-
possibility of easy and repeated access to persons, teristic of most occupational diseases. This takes
records and processes, without serious logistic on particular importance because so few longit-
implications in taking multiple measurements. udinal studies are done and there is a heavy reli-
ance on historical exposure data. Recall bias is a
Qualitative methods form of the more general information bias.
Qualitative research is becoming increasingly Strong confounding biases frequently operate
important in the workplace. It may be used to in industry. There is a common tendency for
determine perceptions of risk and to help prior- those most heavily exposed to hazards in the
itize problems and solutions. It may also be used workplace to have unhealthier lifestyles, which
to identify sources of potential resistance to frequently cause the same adverse health out-
strategies for improving occupational health comes as the workplace exposure. Workers ex-
practice, as well as auditing occupational health posed to dusty workplace environments tend in
services. Such methods are particularly appro- general to be heavy smokers as well. Both these
priate in the context of different workplace cul- exposures cause lung diseases. Another con-
tures, notably those of management and workers. founding factor which may be associated with
Groups within the workplace may have very dif- dust exposure and respiratory ill-health is pre-
ferent ideas about risk and priorities. In these and vious or current tuberculosis infection.
other ways, qualitative methods can help shape Internal validity of occupational epidemiology
what is measured and how it is measured in the studies is usually very good, while external valid-
study of work-related phenomena. ity or generalizability is more complicated due to
the obscure object of inference. For example, ina
Biases community-based survey, ifa sample from a large
Occupational studies are subject to all the same general population is selected and a determina-
generic biases as other epidemiologic studies. tion made of dust and fume exposure at work,
Certain biases appear to be more specific to occu- together with respiratory symptoms and signs
pational epidemiology, such as the ‘healthy and pulmonary function decreases, estimates of
worker effect’, particularly for cross-sectional the ‘true’ association in the general population
studies. In South Africa, because unemployment from which the sample was drawn can be deter-
is such a problem, workers tend to stay with a job mined. In occupational health, on the other hand,
even if it is affecting their health. Health effects in an entire workforce is studied. But what do the
most industries tend to be gradual and chronic measures of association obtained in such a study
and do not lead easily to workers departing. This estimate? What is being estimated is the true
is, however, just a form of the more general se- parameter of association in a hypothetical popu-
lection bias; subjects who are part of the group se- lation of all similar workers working in similar
lected for study tend not to include those who circumstances!
have left because of ill-health due to the exposure
of interest, or some other associated exposure at Analysis
work, and who are thereby lost to follow-up. The Various problems arise as a result of the relatively
group that is followed appears ‘healthier’ because small number of subjects in a typical study.
it consists of a disproportionate number of those Simple stratified analysis is not practical in these
who have survived ill-health caused by the study groups if it is necessary to look beyond the
exposure. This bias is very difficult to adjust for exposure (for example, dust), the outcome of
and bedevils most studies, whatever the design, interest (for example, chronic obstructive pul-
where follow-up is incomplete. Another form of monary disease) and one other covariate (for
this relates to the use of controls from the general example, smoking), and the number of subjects in
population who are by definition ‘sicker’, in that each stratum can thus become very small. Other
they may not be employed or employable. Use of relevant factors may be age, previous tuberculosis
such controls will dilute the effect of the pre- infection, and environmental exposure to sul-
sumed exposure. phur dioxide or domestic fuel combustion pro-
Another specific problem relates to recall bias ducts.
for exposure, which may be driven in the direc- However, there has been substantial devel-
tion of overestimation in the hope of obtaining opment in readily available statistical modelling
compensation, or in the direction of under- methods, which allow more precise parameter
220
estimation from relatively sparse data than strati- mesothelioma, which is almost exclusively
fication analysis offers. caused by asbestos). The adverse health out-
Multivariate methods are well represented in comes typically studied may be produced by
occupational epidemiology literature. One limita- many different causative agents, both non-
tion of these multivariate methods is that in- occupational and occupational, in many differ-
creased precision is achieved at the expense of ent industries. Lack of specificity has had some
possible increased bias of parameter estimates. difficult consequences for those disabled by
The choice ofa particular model always involves occupational diseases. This is partly due to the
assumptions about the shape of the exposure- persistence of the old-fashioned notion of indus-
effect relationship, which may be incorrect, trial disease having specific causation. Modern
giving rise to a biased estimate. concepts of work-related diseases that have mul-
Indirect standardization (see Appendix 1: Stan- tiple and complex etiologies have been slow to
dardization), which yields the Standardized take on in the minds of health professionals and
Mortality (or morbidity) ratio (SMR), is the most administrators in state departments responsible
frequently used comparative measure in longit- for compensation. Attribution of risk for occupa-
udinal occupational mortality and cancer incid- tional disease to particular exposures or condi-
ence studies, because it provides a more stable tions is therefore an important part of occupa-
estimate when the study groups are relatively tional epidemiology. Simple stratified analysis
small. This has the disadvantage that it may not can yield estimates of attributable proportion or
be used for comparisons between different ex- etiologic fraction due to the effect of the exposure
posure categories within the study group, but in question. This has also taken a step forward
only between the index exposure category and since the advent of modern computerized statis-
the standard population from which the standard tical methods for investigating the partial effects
rates are drawn. This unfortunately means loss of of some occupational risk factors, while adjust-
ability to accurately and meaningfully estimate ing for several other occupational and non-
the exposure-outcome relationship in a situation occupational characteristics and exposures.
where categorization of workers by exposure is
possible. . Ethics
Association Ethical issues are generally more complex in
Applying Bradford Hill’s criteria for establishing occupational epidemiology than in general epi-
causality of an association (Bradford Hill 1965), demiology, and questions of confidentiality and
causal interpretation is enhanced in occupa- use of information generated are best addressed
tional epidemiology by strong associations and before conducting research.
exposure-response relationships (biological gra- These considerations favour participatory re-
dient). Additionally, because occupational search designs and clearly understood agree-
health research is efficient in terms of numbers ments, preferably in the form of written contracts,
of subjects.and resources expended, establishing between relevant parties at the workplace and re-
the consistency of findings by being able to searchers. For instance, there is considerable
repeat the study in other circumstances is en- potential for victimization of workers, or more
hanced. Time ordering of exposure-disease subtle forms of discrimination (including job
events is usually no problem in occupational loss) for those who participate in surveys. This
epidemiology. It is unlikely that a lung disease could be due to the discovery of occupational or
would cause someone to expose themselves to coincidental non-occupational illness. The re-
dust — the contrary would be more likely to searcher should, however, retain a clear right to
apply. Biological plausibility is usually provided publication of results.
by a well-developed international toxicology The application of cost-benefit analysis raises
database and existing theories of the mechanism important questions related to the recipient of
of disease causation. benefits and the payer of costs. These have fre-
On the other hand, specificity (where one quently been seen as different groups — workers
cause is uniquely responsible for one adverse pay costs and industry reaps the benefit. It is
health outcome), is not a strong suit in occupa- important that cost and benefit be derived for the
tional epidemiology (one notable exception is same subject.
221
Resources and logistical issues sion, participative labour policies from factory
floor to parliament, and strong state involvement
Resources and funding possibilities present cer-
in occupational epidemiology and occupational
tain difficulties. They are more plentiful within
health generally. The Scandinavian countries are
the private sector, but with this relative abun-
the best example, and have occupational health
dance comes increased control over the epidemi-
systems that are light years ahead of anywhere
ologist. In particular, there is a strong ‘negative
else. By contrast, some developed countries such
findings publication bias’ as opposed to the ‘posi-
as the USA and Britain are sometimes hardly any
tive findings publication bias’ typical of general
better than South Africa when it comes to imple-
epidemiology. Many occupational health profes-
menting occupational epidemiology knowledge.
sionals working in industry are formally pro-
scribed by contract from publishing findings in
the scientific literature without express permis- Conclusion
sion from their employers. This once again raises The workplace has much to offer as an applica-
ethical problems. Occupational health does not tion area for epidemiology. The relatively well-
usually enjoy a high priority with public funding controlled occupational setting, well-defined
agencies, which tend to see occupational health study groups and exposures, industrial dyna-
as a private sector responsibility, thus posing mism and continual change provide many stimu-
additional difficulties for independent occupa- lating challenges. Because of these factors, de-
tional epidemiology research. spite the difficult ethical and social issues that
Access to problem areas is difficult and usually confront the epidemiologist, the workplace is
dependent on management goodwill. Where likely to continue to be a powerhouse of applica-
access is not blocked by those with vested inter- tion and development of epidemiologic method
ests, a participatory research methodology can and knowledge.
considerably enhance the quality of data and the
interpretation of results, by tapping the implicit
knowledge of workers about exposure, the inter-
pretation of analyses, and most importantly, the EXAMPLE 24.1
implementation of recommendations. Further- Brickfields study
more, participatory research methods are ideally
Brickmaking is a dusty process with potential
suited to an environment with an existing organ-
adverse respiratory effects. Brick dust is addition-
izational infrastructure (such as trade unions or
company health and safety structures) involved ally known to contain silica which causes lung
scarring or pneumoconiosis. Until this study, no
in the promotion of health and the prevention of
disease. Data may be cross-validated from differ- data existed for South African brickfields, which is
ent sources, and enthusiasm can be engendered fairly typical of much of South African industry. In
for the research process. This can help to lighten 1985, a cross-sectional analytic study was con-
the respondent burden, minimizing the provision ducted on all active workers (some 560 of them) in
of false information, removing fear of victimiza- four brickfields in the Western Cape over a two-
tion (in the case of TB, for example), and height- week period in order to determine the relation-
ening the possibility of efficient implementation ship between various dust components and res-
of recommendations. piratory disease in this setting.
Access for this study was gained as a result of
industrial conflict which had led to a strike. The
Implementation of findings health survey was one of the management con-
Implementation possibilities are potentially cessions to workers who had failed to obtain wage
strong, but depend critically upon socio-eco- increases.
nomic factors. Experience in occupational epi- Workers’ perceptions were essential in helping
demiology provides much insight into the limits to design a measurement strategy for dust at
of implementation of epidemiologic findings different points in the production process. Man-
generally, and highlights some of the social con- agement, medical staff, and workers all had
straints upon the development of occupational different subjective perceptions of dustiness at
epidemiology as a science. Implementation is various points. Additionally, the content of offi-
institutionalized in only relatively few countries. cial job descriptions varied widely within the
These countries have high degrees of social cohe- same job category when detailed information was
222
obtained from workers. Details on exposures tive causative variables (exposures), while allow-
were collected qualitatively in pilot studies so as ing for simultaneous adjustment by known or
to ensure good quantitative measurement for the suspected confounding factors (age, smoking).
main study. Categorical outcomes (presence or absence of
Various components of dust in the atmosphere symptoms or pneumoconiosis) were analysed by
of the workplace could be identified and meas- multiple logistic regression and provided odds
ured, including total dust, respirable dust, and the ratios for dust exposure adjusted for confounding
respirable free silica fraction. This made it pos- factors.
sible to construct various dust exposure indices It was possible to show that neither smoking
for both current and cumulative exposures in nor age could explain excess respiratory symp-
workplaces for which dust profiles had been toms, X-ray pneumoconiosis, or decreased lung
gathered. A detailed interview schedule was ad- function. After adjusting for these known causes
ministered to all workers to obtain occupational, of ill health, there was a persistent residual effect
social, and health history data. Medical examina- of dust, the magnitude of which could be deter-
tions, chest X-rays, and lung functions tests were mined.
performed. Causal interpretation of the associations found
Occupational exposure histories were valid- in this study between silica dust exposure in the
ated against company employment records. As workplace and lung scarring was supported by
workers had little sense of which exposures the application of Bradford Hill’s criteria — most
caused which adverse health effects, they were of which were met.
unlikely to have provided biased data leading to There was no immediate implementation of
false associations. findings from this research. Due to industrial ten-
Outcome measures included continuous vari- sions during and after the study period, the
ables relating to lung function as well as categor- results were received with some hostility by man-
ical variables related to X-ray findings of lung agement. Conflict continued and some of the
scarring and respiratory symptoms. plants closed.
Ethical problems could have arisen because Nine years later, the principal author met the
many study subjects had very poor lung function, group medical officer for the brick company con-
and if their individual results had been made cerned at an occupational health conference and
available to management in the course of the discovered that many occupational health im-
study, they could have been discriminated provements had been introduced and that the
against. Therefore, before the study began, the results of the study had percolated slowly and in-
main stakeholders agreed that no one would be directly into improved practice over time.
retrenched on medical grounds, and all those
with conditions requiring medical attention References:
would be appropriately referred. Workers with Myers J E, Lewis P, Hofmeyr W. ‘Respiratory health of brick-
health problems were transferred to less dusty workers in Cape Town, South Africa: background, aims and
work, and workers’ compensation was initiated dust exposure determinations.’ Scandinavian Journal of
for deserving cases. Work Environment and Health 1989; 15(3):180-87.
In this case, no one was victimized, and the Myers J E, CornellJ E. ‘Respiratory health of brickworkers in
existence of a trade union representing workers Cape Town, South Africa: symptoms, signs and pulmonary
provided a safer environment for the researchers function abnormalities.’ Scandinavian Journal of Work
from an ethical point of view. Environment and Health 1989; 15(3):188-94.
During the analysis of the results, multiple Myers J E, Garisch D, Louw SJ. ‘Respiratory health of brick-
linear regression for continuous variable out- workers in Cape Town, South Africa: radiographic abnor-
comes provided partial correlation coefficients as malities.’ Scandinavian Journal of Work Environment and
measures of the explanatory power of presump- Health 1989; 15(3):195-7.
223
25 Disability studies
Introduction and will not be dealt with here.

Internationally, crude prevalence rates of dis-
Mortality (death) statistics formed the basis upon
ablement vary between 0,2% and 20%. In general,
which population health was assessed until the
higher rates are found in people with lower edu-
end of the 19th century, when indicators of
cational levels and lower occupational status, older
morbidity (illness) were introduced. These pre-
people, rural dwellers, and poorer people. Inter-
valence and incidence rates of disease provided
national variations are not necessarily due to differ-
no insight into the severity of the conditions or
their effect on people’s lives. Thus, more recently, ences in actual rates, but may be due to differing
age distributions of the populations and inconsist-
the concept of morbidity has been extended to in-
corporate the personal and social consequences ent definitions and measurements of disablement.
The lack of a hard and fast definition of ‘dis-
of disease, and quality of life measures. So, for
example, population health can be measured ablement’ has hampered research in this field.
One difficulty is that consequences can vary in
according to expected years of good quality of life
by using data from disablement studies. severity. For example, one person with a walking
Studies of disablement investigate people with disability may not be able to walk at all, while
another may have trouble walking between
long-term functional loss, and thus focus on the
long-term consequences of the disease or injury. rooms. Another problem is the involvement of
The purpose of these studies is to obtain data to many disciplines in the disability field, each with
assist health planners, people with disabilities its own conceptual framework, tradition, and ser-
themselves, and society at large. This data pro- vice. Important domains of disablement are not
vides the basis for primary prevention pro- always considered and reported; for example,
grammes (such as the promotion of car seat-belt mobility problems may be investigated in detail
use) as well as services which enable people with without reference to self care and work-related
disabilities to develop their potential and im- problems.
prove their quality of life. Despite these good Population censuses and household sample
intentions, health planners, researchers, and ser- surveys are the most common data collection
vice providers worldwide have experienced enor- techniques used in epidemiological studies of
mous problems in obtaining data which mea- disablement. However, no standardized survey
sures the consequences of disease. design has been developed, so sampling meth-
The most basic research question to be asked in ods, screening questions, and disability defini-
epidemiological studies of disablement is, ‘What tions and assessments vary from study to study.
proportion of the population is disabled?’ This
will be the main issue discussed in this chapter. International classification of
There are many clinical research questions
related to disablement and rehabilitation, but
impairment, disability and handicap
these are not necessarily epidemiological studies In an attempt to standardize the conceptual
224
25 = disability studies
framework and definitions of disablement, the seriously compromised by its cumbersome

International Classification of Impairment, Dis- nature. It is unnecessarily complex and fails to
ability and Handicap (ICIDH) was published by recommend practical and appropriate applica-
the WHO in 1980 as the first step towards improv- tions for clinical practice or for research. When
ing the quality of information about disablement. first published, the manual did not highlight
It includes a fuller range of consequences of ill- these shortcomings and effectively launched the
ness, injury, and other conditions that lead classification into a methodological vacuum.
people to make contact with the health care Additionally, international disability rights groups
system. While the much-used Jnternational Clas- object to the ICIDH on the grounds that the
sification of Diseases (ICD) provides classification terminology ‘abnormality’ and ‘loss’ label dis-
of diagnosis/etiology, the ICIDH focuses on disabled people as ‘less than normal’.
ablement and the consequences of injury, illness, Some of these problems are being addressed,
and other conditions. and a revised version is being developed in
The ICIDH classifies disablement into three consultation with people with disabilities, dis-
broad categories: impairment, disability, and ablement workers, and researchers around the
handicap. Impairment is any loss or abnormality world. A WHO Collaborating Centre for the ICIDH
of psychological, physiological, or anatomical has been established in the Netherlands to co-
structure or function. It is the consequence of ordinate research and other developments in this
an illness, injury, or other condition at organ field.
level. Disability is any restriction or lack (result-
ing from an impairment) of ability to perform an Study design :
activity in the manner or within the range consid-
Most epidemiological studies of disablement are
ered normal for a human being. It is concerned
descriptive studies answering research questions
with compound or integrated activities expected
such as ‘who is disabled?’, ‘what proportion of the
of the person or of the body as a whole, as repre-
population is disabled?’, and ‘what impairments
sented by tasks, skills, and behaviour. Handicap
caused the disabilities?’ However, analytic de-
is a disadvantage for a given individual, resulting
signs including case-control and cohort studies
from an impairment or a disability, that limits or
can be used to answer questions related to risk
prevents the fulfilment of a role that is normal
factors for disablement. An example is a study
(depending on age, sex, social, and cultural fac-
which follows up a Cape Town birth cohort and
tors) for that individual. Handicap results from an
looks at the effect of prematurity (exposure) on
interaction between disabled people and their
perceptual-motor performance at the age of five
social, economic, physical, and political environ-
years (outcome) (Coetzer 1995). Intervention
ment. In this chapter, the term ‘disablement’ is
studies can assess the effectiveness of treatment
used to cover all three domains discussed above.
or other interventions.
The ICIDH provides parallel classifications
along these three axes in great detail. The se-
quence of events in the development of disable- Sampling
ment is usually: The sampling issues in disablement prevalence
studies are similar to those of all chronic diseases
Disease — Impairment — Disability — Handicap of low prevalence. Because the prevalence of dis-
ablement is relatively low, sample sizes often
FOR EXAMPLE: need to be very large so that the prevalence can be
Diabetes — leg amputated — walking disability — estimated with good precision. For example, in
occupational handicap order to accrue 10 disabled cases, 120 people may
have to be interviewed.
Note that a person can experience multiple Generally, larger sample sizes are needed if a
impairments, disabilities, and handicaps. study is impairment-specific, or if impairment-
specific rates and age-specific rates are being
Since the first publication of the ICIDH manual in investigated. Younger populations will also need
1980, it has gradually been used in a variety of set- larger sample sizes, because of relatively lower
tings. However, many valid criticisms have been prevalences.
levelled at the classification, indicating the need Because of the large sample sizes needed,
for a revision. As it stands now, its usefulness is random cluster sampling is a common approach
225
_ to obtain a representative sample. Clusters of conducted in South Africa over the last five to 10
houses are selected and individuals within these years have used this framework.
households are interviewed. Strictly speaking, Usually the researcher needs to decide which
only one person per household should be inter- domain of disablement will be the primary focus
viewed after selection from among the others in of the proposed study. Will a person be defined as
the household by random techniques. However, ‘disabled’ if she or he has an impairment (such as
it is common for all the individuals within a inflamed joints) or a disability (for example, a
household to be studied, creating a second level locomotor disability) of some kind?
cluster (that is, the family), in that family mem- If, for example, a study sets out to measure dis-
bers are not selected independently. Given that ability, the research can investigate all types of
illnesses and disabilities may run in families, this disability or focus only on one (for example, loco-
approach may introduce a bias. Nevertheless, the motor disability). In general, surveys with a lim-
logistical gain of boosting sample size by count- ited focus are more common because specific
ing all the individuals in the households often professions (such as physiotherapists), medical
necessitates this approach. areas of specialization (such as orthopaedics), or
In Canada and New Zealand, community- agencies (such as the Organization for the Phys-
based prevalence was estimated using a national ically Disabled) may initiate the investigation.
census as the sampling frame. Everyone in the Clearly a limited focus is much easier and the
country was asked a screening question, and a screening questions (questions asked of all people
sample of positive and negative responders to the sampled in order to determine who needs to be
screening question were followed up to confirm investigated further) need to cover limited pos-
disability status. In the follow-up phase, a sub- sibilities only. Each particular disability may have
stantial proportion of positive responses con- its own series of screening questions, for example,
verted to negative, and vice versa. visual disabilities (‘Is there anyone in this house-
In studies where only a rough estimate of the hold who has difficulty seeing?’ plus others which
prevalence is needed, but more details are flow from this question) and locomotor disabil-
needed concerning needs and diagnostic/sever- ities (‘Is there anyone in this household who has
ity breakdown of the disabled in the community, difficulty moving?’). However, a drawback of sur-
service providers can attempt to identify all the veys with a limited focus is that sample sizes have
disabled cases by compiling a disability register. to be much larger to obtain precise estimates.
This is done by asking all key informants, This is because disablement in general is an un-
community bodies (for example, churches and common condition, and specific functional prob-
schools), and health services for the names of all lems such as communication disorders are even
the people with disability they know of. People more uncommon.
with disability and their families, in turn, provide Crude disablement rates alone are not a par-
the names of other people with disability. This ticularly useful measure. A crude rate of 12% gives
method is called networking and is excellent for us a figure but tells us very little that could be used
case finding, although not necessarily representa- in policy and service development. Impairment-
tive of all the people with disability in the com- specific rates (for example, the proportion of
munity. people in the population with amputation-
related disability) provide insight into the causes
of disablement, while age-specific rates provide
Measurement of disease
insight into the ages affected and allow for mean-
consequences ingful comparisons between populations.
Classification Screening questions

The major problems with the measurement of the Disablement surveys often involve two phases: a
consequences of disease lie in the definitions of screening phase and a follow-up phase to confirm
disablement. As the ICIDH develops and is the disablement status and/or ask for more
routinely used in research, definitions of disable- detailed information. The nature of the screening
ment should converge with time. The conceptual questions and the additional questions asked
framework dividing disablement into the three when screened positive, determine to a large
domains — impairment, disability and handicap extent the prevalence found in the study.
— is a useful one. Most studies of disablement The use of screening techniques that are
226
impairment-specific and limited in scope (for

example, blind, deaf, amputee) results in the Table 25.1 Variables commonly
identification of the most severely affected cases, Tite CsTeM mel ersl tae aN
because these cases are obviously more disabled
than those with milder functional loss. Broad- Demographic
ranging disability oriented questions concerning @ Age and sex
functional and activity limitations such as hear- @ Place of residence (urban, rural)
ing, moving, and personal care activities, screen ¢ Highest level of education and/or training
larger proportions of the population as having a achieved
form of disability. Following a positive response ¢@ Occupational status
to a screening question, most surveys request an ¢ Details of economic activity
additional description of either the disabilities Marital status
associated with the reported impairment or the
impairments which underlie the reported disabil- Household information
ity. These respondents may be further evaluated @ Household environment (availability and
by a rehabilitation specialist, such as a speech or access to toilet, water, domestic power)
occupational therapist. However, many surveys @ Number of people in household/family com-
do not ask such additional questions. position
The best estimate of disablement is probably ¢ Dwelling type and number of rooms
found when disability questions are used as a sur-
vey screen (for example, ‘Does anyone in this Describing the disability experience
household have difficulty walking or moving @ Additional impairments
around?’), and then followed up by impairment- @ Age at onset
specific questions (for example, ‘What is the ¢@ Cause of impairment
reason for this difficulty?’) Handicap has not been ¢ Disability status
found to be useful as a screening device for iden- Severity of impairment or disability
tifying people with disability. ¢@ Special aids used
Services or treatment received
Other variables commonly collected @ Special topics
Besides asking questions related to disability
itself, surveys often ask additional wide-ranging
questions. Table 25.1 contains alist of variables
that are commonly covered. investigation of a sub-sample of the study sample.
The sub-sample could include either only those
Validation subjects reporting impairments (to pick up false
Questionnaires are indirect measuring instru- positives), or subjects from both (reported) im-
ments that record mostly subjectively reported paired and non-impaired groups. The latter
information. Reported disability and impairment option enables you to calculate the sensitivity and
might not reflect the true level and nature of specificity of the screening process, and is more
disablement. Proxy (substitute) reporting often useful in assessing the validity of the findings.
worsens this problem, although in some cases In areas where access to and utilization of re-
proxy reporting by a family member may be more habilitation services is good (this is rare in South
reliable than self-reporting by the disabled Africa), a review of institutional and professional
person him or herself (for example, in the case of records may be a cheaper option. However,
a person with an intellectual disability). records are often poorly designed and completed,
The validity of the information can be im- pointing to the need for a well-designed and co-
proved by testing the subjects either in the field or ordinated information system, which can be a
at a foliow-up visit, or when they attend a hospital useful resource both for case management and
or clinic. Validating the reported impairment may for the collection of disablement statistics.
involve a full medical examination and/or special
investigations such as X-rays and IQ testing. The
Analyses
cost of such investigations may be prohibitive,
using most of the financial resources available for The use of a summarized form of the ICD is
the study. Alternatives to this include in-depth recommended to give a classification of the
22h
etiologies of the impairment(s) that is medically EXAMPLE 25.1

meaningful and internationally comparable The prevalence of motor disability
(some codes may be needed for non-medical rea-
and impairments in Manguzi,
sons given, such as witchcraft). A simplified ver-
sion of the ICIDH can provide a standardized KwaZulu-Natal
approach to the coding and presentation of im- A study was carried out to measure the preval-
pairments and disabilities. ence of motor disability and impairments in the
The calculation of confidence intervals for es- Manguzi health ward, a rural area in KwaZulu-
timates of the prevalence of disability is recom- Natal. An interview survey was done to determine
mended. motor disability and a follow-up medical exami-
nation carried out for motor impairment.
Conclusion A two-stage random cluster sample technique
was used, with the homestead (numbered by
The major contribution of the ICIDH to popula- malaria control officers) used as the sampling
tion-based disablement surveys is a conceptual unit. Two hundred and fifty homesteads (10% of
one, in that a common framework now exists as a the total) were selected. Information collected by
reference point. However, there are technical dif- trained interviewers included demographic de-
ficulties in applying concepts and categories. A tails and the presence or absence of perceived
concerted effort needs to be made internationally disabilities of each homestead member. Per-
to develop more standardized approaches to ceived difficulties in moving and walking, and
population-based studies. their causes, were enquired about. A practical
In South Africa, there is limited population- field diagnosis, including a simple walking test,
based disablement information to assist in plan- was made by a trained occupational therapist.
ning for rehabilitation and disability support ser- Motor impairment was assessed by full follow-up
vices. Rehabilitation services are also poor in medical assessments, including X-rays, at a dis-
most areas outside the major urban centres. trict hospital.
These two facts appear to point to a need for more Information on 52% of the sampled people was
disablement surveys to provide information for proxy reported. A crude motor disability rate of
service planning. However, it is difficult, expen- 86/1 000 and a crude motor impairment rate of
sive, and time-consuming to conduct a scientifi- 52/1 000 was found. The commonest impairment
cally rigorous disablement survey. Issues of mea- was osteo-arthritis of the hip, which may be the
surement, large sample sizes, and validity dis- same disease that has been reported in neigh-
courage widespread attempts to undertake such bouring areas, and which is referred to as Mseleni
studies. In many instances, large and expensive joint disease.
surveys do not lead to service development or im-
provement. In such cases, the use of existing clini- Reference:
cal information is a cheaper and reasonable alter-
McClaren P A, GearJ S, Irwig LM, Smit A E. ‘Prevalence of
native. This emphasizes the need for developing motor impairment and disability in a rural community in
good information systems, including disability Kwazulu.’ International Journal of Rehabilitation Medicine
registers, which can be consulted when needed. 1987; 8:98-104.
Researchers and health planners need to devel-
op approaches which can be implemented
rapidly, at relatively low cost, to yield useful
results at district level. Full-scale epidemiological EXAMPLE 25.2
surveys should only be undertaken after all the The crude disability prevalence rate
methodological and financial issues have been
carefully considered. Such studies could be done in Mitchell’s Plain, Cape Town (1989)
on a regional rather than local level, where results A study was undertaken in 1989 to investigate the
can have an impact on service planning for a large crude disability prevalence rate in Mitchell’s
area, and can contribute towards measuring Plain, a dormitory town near Cape Town. The re-
population health according to expected years of search was initiated by the Community Occu-
good quality of life. pational Therapy project serving the area, in
228
order to broaden its service and identify disabil- with mild disability who had no occupational
ity-related problems. A proportional stratified handicap.
cluster sampling design was used with a sample All individuals (12,9%) who fell into the dis-
size of 500 households. The 82% household re- abled category from the first screening question
sponse rate yielded 2 424 individuals in 472 were followed up by experienced physiothera-
households. pists or occupational therapists, with 85% being
During Phase 1 of the data collection, rehabili- traced. The screening questions were repeated,
tation assistants interviewed a responsible adult and an in-depth interview concerning disability-
from each household. A census was taken of each related information was done on those who re-
household and two sets of screening questions ported to be unable or limited in the age-specific
were asked about household inhabitants. The key activities. A crude disability rate of 3,9% (95%
first question screened for all the categories of confidence limit 2,8% to 4,9%) was found. The
disability (such as movement, strange behaviour, authors considered the figure of 12,9% to approx-
or hearing) and also enquired about regular imate the impairment rate in the community, and
treatment and appointments for health prob- the figure of 3,9% to be the rate of moderate to
lems. If a positive response was given to any of severe disability.
these questions, a second screening question
was asked about the person’s ability to perform Reference:
key activities specific for the person’s age (for Katzenellenbogen J, Joubert G, Rendall, K, Coetzee T.
example, in the case of adults — ability to work; ‘Methodological issues in a disablement prevalence study:
in the case of children — ability to play). The Mitchells Plain, South Africa.’ Disability and Rehabilitation
second question tended to screen out people 1995; 17:350-57.
229
26 Psychiatric
epidemiology
Introduction identified physiologically or by a predictable pat-

tern of signs and symptoms. Evidence of pathol-
The political changes that have swept South ogy is often not concrete.
Africa and the economic realities underlying Many of the signs and symptoms associated
them have stimulated policy changes in a number with psychiatric illness depend upon social
of areas. For example, in the early 1990s, there judgements. For example, what one person may
was a shift towards primary health care and an view as problematic social withdrawal, another
attempt to integrate mental health services into may see as appropriate reserve. Beliefs about
primary health care. Efforts to achieve such an what is acceptable behaviour will differ in differ-
integration, however, have been hampered by ent cultural contexts. This means that behaviour
numerous factors, including a lack of sound seen to be pathological in one cultural context
epidemiological research in the mental health may not be viewed as such in another.
field. A further issue that arises is the lack of agree-
Psychiatric epidemiology could play a key role ment between study subjects and researchers on
in planning mental health services, setting up the one hand, and between psychiatric re-
effective training programmes for health workers, searchers and clinicians on the other, about term-
and developing strategies to promote mental inology and diagnostic labels. For example, when
health and prevent mental illness. It can do this some people become very agitated, unable to
by identifying cases and risk factors for mental concentrate, and therefore unhappy, they may
health problems (particularly those amenable to label this depression, whereas in the diagnostic
intervention), and by assessing the efficacy of system of the psychiatrist it may be more properly
interventions to improve mental health (Yach, labelled anxiety. The classic example of profes-
Kuhn 1989). sional inconsistency of labelling was the observa-
tion in the 1960s that schizophrenia was more
What is different about psychiatric commonly diagnosed in the USA than in Britain.
epidemiology? This was not necessarily because of different
symptom patterns, but because conditions which
Part of the problem faced by any psychiatric re- would not be called schizophrenia by British psy-
search is that psychiatric illness is often stigma- chiatrists would be classified as such by their
tized and is therefore likely to be hidden from re- North American colleagues.
searchers more often than other conditions. In Later in this chapter, some of the ways in which
many countries, psychiatric illness is not given a psychiatric epidemiologists have handled these
high priority, in spite of the fact that it is relatively issues are discussed, but it is important to note
common, uses substantial resources, and is that, in psychiatry, decisions have to be culturally
socially disruptive (Dick, Spencer, Watermeyer, and socially informed. This applies to other
Bourne, Wolff, Moyle 1978). branches of epidemiology as well, but it is espe-
In psychiatry, disorders typically cannot be cially necessary in psychiatric epidemiology.
230
26 psychiatric epidemiology
Study design of mental health services and their prior usage

of such services, and their attitude to mental health
Many of the psychiatric epidemiological studies
treatment (Cooper 1984; Links 1983; Robins 1990).
in Africa have been descriptive rather than analy-
The information gained from cross-sectional
tic, and have mainly sought to provide inform-
analytic studies will also facilitate the ‘graduation’
ation on the burden of illness by focusing on pre-
to other designs, for example, case-control stud-
valence alone. Beyond looking at demographic
ies, follow-up studies, and even experimental
variables such as age, race, and gender, re-
studies. Such studies are needed if psychiatric
searchers have rarely investigated factors asso-
epidemiologists are to play as useful a role in the
ciated with the distribution of mental disorders in
mental health arena as epidemiologists have
the population studied. Most of the research has
played in the general health arena.
also been cross-sectional, attempting to provide
‘snapshots’ of the disease status of a community
and current exposure to risk at-a single point in Measurement issues
time.
There have been calls for a move away from Measurement of outcomes
cross-sectional descriptive to cross-sectional Investigators can determine psychiatric morbid-
analytic studies (Cooper 1984; Gillis 1987). Reeler ity by adopting either a single or a two-stage
(1987), in particular, has stressed the need for re- approach. Ina single-stage study, all the subjects
search to shift from what he calls ‘classificatory’ in the sample are typically interviewed by clin-
to ‘management’ studies, that is, studies which ically trained interviewers using a structured
can supply practical information to those who interview, on the basis of which cases are identi-
need to make decisions. In particular, researchers fied. The high cost of using skilled interviewers is
have been encouraged to obtain information on a major drawback of this approach. In a two-stage
the clinica! severity of the dysfunction, the degree approach, a population is briefly surveyed during
of functional impairment, current support sys- the first stage by means of a rapid, simply admin-
tems, interviewees’ knowledge of the availability istered, and cost-effective screening test that
srl) (lM eee ara mR mes
Options
Diagnostic system Diagnostic and Statistical Manual of Mental

Disorders, 4th edition or International
Classification of Diseases, 10th edition
Study population Community members, attenders at primary

health care clinics, elderly, etc.
Focus of inquiry What types of disorders are of interest and at

what levels of severity?
Time factors and constraints Time frame for inquiry (e.g. lifetime, past
month); time needed for test administration
Type of instrument Screening or second stage
Level of clinical skill Clinicians or trained lay interviewers
Prior application in study areas Has instrument been translated into language
of population of interest? Has it been
calibrated on that population?
Ease of analysis Availability of computer scoring algorithms
231
identifies possible cases. In the second stage, (such as functional impairments, disabilities, and
‘high-risk’ individuals are re-interviewed so that handicaps).
they may be confirmed as actual cases. Ifa sample Initiatives to- make psychiatric epidemiology
of those found to be negative on the initial screen- more valid have gone beyond calls for expanding
ing is also re-interviewed, the second stage also the focus of inquiry to investigating the way in
permits assessment of the validity of the screen- which people are classified. The traditional, cate-
ing instrument in the study population. gorical approach to classifying psychopathology
A number of instruments are available for is coming under increasing scrutiny. An altern-
epidemiologists to ascertain the psychiatric ative (though less commonly used strategy), is to
status of child, adolescent, and adult populations. adopt a dimensional approach where the focus is
One of the major factors influencing the choice of more on symptoms and symptom clusters than
outcome measures, and hence instruments, is the individual diagnoses. Other critics (for example,
research question. Is the researcher studying a Miller and Swartz [1992]), have argued that com-
particular age group (such as the elderly)? Is he or bining conventional epidemiological methods
she interested in studying a single type of dys- with qualitative methods will help to overcome
function (such as depression) or a range of prob- the limitations of using conventional epidemi-
lems? Is she or he interested in whether the ological methods to investigate the depth and
person or group under study has ever experi- complexity of psychological phenomena.
enced certain kinds of problems or symptoms
(lifetime prevalence) or whether they are cur- Measurement of exposures
rently experiencing them (current prevalence)? A In the mental health field, there are a number of
number of factors influencing the choice of which exposures which are associated with psychiatric
instrument to use are indicated in Table 26.1. illness.
Various screening instruments and structured
diagnostic interviews currently used to determine The effects of social and family arrangements
the psychiatric status of child and adult popula- Social support and network structures may help a
tions in developing countries are listed in tables person function longer and more productively
26.2 and 26.3. The tables also indicate the primary than if that person had been socially isolated.
application for which the instrument was de- Conversely, very stressful social or family circum-
signed, the general focus and type of instrument stances may aggravate psychiatric disorders.
(that is, type of population to be studied and There is a large literature on the specific ways in
whether the instrument is to be used for screen- which social networks and support structures
ing or a second-stage assessment), and other operate, and how they affect the course of dis-
issues which determine choice of instrument. For orders. The literature on what is known as ‘ex-
a more detailed discussion of commonly used pressed emotion’ (EE) has shown that people
psychiatric epidemiological instruments, please with psychiatric problems are more likely to re-
refer to Bech et al (1993), Tansella, de Girolamo, lapse in situations where members of their family
Sartorius (1992), and the WHO Division of Mental are Over-involved with them and over-critical of
Health (1993). them.
Various researchers have stressed that psy- Part of the challenge psychiatric epidemi-
chiatric epidemiology needs to go beyond psy- ologists face when trying to take account of the
chiatric symptomatology in case identification. role social and family arrangements play in affect-
Marsella (1992), for example, recommends that ing the course of illness, lies in separating the ill-
case definition should also include psychosocial ness itself from the factors which may affect it.
problems such as family violence, divorce, and When an association is demonstrated between
racism. He argues that it may be more helpful to social isolation and relapse, for example, it can be
examine various clinical parameters of mental because people isolate themselves as a result of
disorders and social pathologies than to focus on relapse, or because the social isolation has trig-
diagnoses. In particular, he stresses that greater gered the relapse. We can begin to speak of isola-
attention should be paid to studying symptom tion as a risk factor for relapse only once we are
parameters (for example, frequency, severity, and clear that the isolation is not in fact part of the ill-
duration of symptoms and problem behaviours), ness. One way this can be done methodologically
clinical parameters (for example, onset, course, is to demonstrate that the isolation predates any
and outcome), and psychosocial parameters changes in the person’s behaviour that suggest
232
233
psychiatric epidemiology
Aayeiyoksdoinen ulJUBWSSassy Jed!UI|D 10} ajnNDeUDS = NVOS ‘MaIAId}U| O1SOUZeIG jeuoneUsaju| ayisOdwoy = |qIO
Jes = OUS
‘ginPeyoS MeIALe}Uj ONSOUSEIG = SIC ‘a|NPEYoS MaIAE}U jed!UI|D = S| ‘UOeUIWeXA aJe}S JUeSeld = 4Sd ‘QNBUUONSENO UINCSH [219USH = OHH ‘asleuuonsend suiuoday
‘\sISojoyoAsd Jo IseIYOASd e Aq apelu BUleg SeSOUBEIp YIM ‘SIOMOIMEIU! Ae| pauies posn saey seipnisx
aes
Bisqpjop
OL6T
T86T
SUIgOY
90uU9190jO1
wiyywosyje
O86T
O66T 6867
Jajyndwog
18 vl6T e189 B8UIM| Z?LET B19qgPI0D ‘je 12 Bulpsey eu
pais}siulwpe
SuIgoYy
‘/2
‘/2
‘2
38
SUIM
12
je
12
SOA
SOA
SOX
SOA
‘SI@MAINOIU!
ON
SIBMIIN9IU!
SIOMIIN9\U!
SIOMIIMOIUI
~SUBIOIUI|D
(UOISIOApoures|Ayjensn
suelojulo
—jjes
posiape
suelojuljo
JEOIUI]D
poules|
{4s
peules|
pouies|
pouresp
poulel
poures
26
}aAIT
suuejqoid
Ae}
Ae}
Ae}
Ae}
Jo
|
onoyodsd
|
Ja}SIUIWIpe
|
spuadap)
suosJad
|-OE/"SUIW
yyM
é0Z
“xoIddy
IST
OZ—-OT
10}
O6-09
OZ-OT
09
06-09
uo
“SUILU
“SUI
OUI]
GT—G
“SUI
“SUIW
“SUIW
“SUI
G7
0}
OF
asgejs |
agejs puz Ajurew agels puz Ajurew aes puoses
pug
Suluaesos SUIUdBINS ageis puz | Ajulewi SUIUddIOS SUIUDBIOS _ /sulusead9g
ee
SUUI}OHI|
Aynbul
‘yUOU 1SP|
UsNO
SyoeM
yjuoW
owed
‘gposida SUIOsI| sy}UOWW Yeem Bulped
OUT
seg]
sed
Ma}
40}
Juasalg | —Syeem Z UIUUM | XIS sed/JUaUND -aid/USIND
|
suonsenb (09'0€'8Z‘eT Sule}
aAISUBPY wo dwhs 838z O@AISUS}XJ eAisualxXy OvT “3°9) SOURA GZ‘vz% jo s9qUINN
S$souj|l siapJosip SJapsJOSIp
onoyoksd | Ayyeuosied ysow | Ayjeuosiad ysow SJOPJOSIP SJapsosip SJOpsOSIp
jeuoljoun "joxe SlapsJosip | *|oxe ‘SJapJOSIP ouyelyoAsd JEJUBW BJOAVS SJapJOSsip onoyoAsd-uou
Jo onOINEN | OUJeIYOASC JOfe- | dUye!YyOASd JoleW\ jo asuey 0} a}eu9poW onoyodsd-uon pue onoyoksd snoo4
suonejndod suonejndod suonejndod saujunos
jeojuljo-uoU jeoluljo suonejndod | sojoejd jeuoues suorjeindod suonejndod suidojaAop uoljeojdde
pue jeolul|o pue jeJouas Aywunuiwog | pue Aylunwuiwog jeoiuijg | sonoeid jesouey} ul ssulnes OHd Auewiid
STC CRMC EPC PPCPt de: yay LE
e
| (| dee) om:
M PELe MSP
234
PCC COEUR UES) an
Aiewiid uojjeojdde
OHd SSUINSS
Ul OHd SSUIIOS
Ul jeiaue5
pue Jedjuljo jeiauey suoneindod jeisuey suoneindod
sUIdO}esAepSOLJUNOD SuIdo|aAepS9JUNOD suonejindod
paunjons paunjonas painjons painjonis paunjonis-lwes
OU]
OF 4d9}S/UJWIPe
|qd OT—-G “SUIW “Sul O9-Gr “SUI Ja8u0]) -O OQ-GY “SUI *O QOG—-O9 “SUIW
4! PIIYO
SI} (4OIS -d Q6-09 “SUI :d Q6-O9 “SUI
epidemiology applied to content areas
oul) ewe
10) Auinbul
| queuing ‘sed Juadino |ea one Sed 9/JedA sujuoW ysed/juaung
JaART
Jo JEdIUI[D pautes
Ae| SUBIDIUND paurex/sueiiu
Ae] | pouies,
Ae| SUBIDIUND
jIMs paaaiyoe SIOMAIAO]UI SIOMAIAJOU! SJOMAIAO1UI
JOMOIAIOU|
suo}}eAlasgo
Aewid aouaiajas |a!5329 ‘2 T86T |aI59 ‘1e S86T s1uelia}

7% ‘Weqdued Oj||a}S0D
29 12 V86ET yonUy-SINg
39 ‘Je
wWy}OMe
483ndwWog
LL6L O86T
qualed = dq ‘pilyo = Ox
‘UBIPIIUD JO} BINPAYOS MaIM9}U] ONSOUBEIG = OSIG ‘UaP|IYO 10) Ma|AJa}U ONSOUBeI = VOI ‘USIPIIUD 10} MalMeyU] dn-mo}jo4 = O[4 {UUpIUD osleUUONSENd YoRasey = DY
UOISI9A |BOIBO|OlWapidy-elussydoziyoS pue Ssispsosiq eANIeYy J0J o[NpeUuds-alpply = J-SavS—y
the possibility of relapse (that is, temporal rela- pressed. Recall bias can, however, be fully dealt
tionship between outcome and exposure). The with in psychiatric epidemiology, as in other
insidious nature of onset and relapse in mental branches of epidemiology, by the design of
illness, and hence the difficulty of pinpointing prospective studies.
onset of disease or of a particular phase in a The fact that different people can experience
disease, requires very careful recording of what are apparently the same external events in
changes in behaviour, as well as an appreciation very different ways leads the enquiry in two diver-
of the ways in which person and context are inter- gent directions. The first of these concerns indi-
twined. vidual vulnerability; the second concerns the
influence of broader cultural and social arrange-
Stressful life events ments on the ways in which communities under-
Intuitively, one might assume that unpleasant stand, manage, and respond to circumstances.
experiences or events would contribute to the The following two sections will deal with these
onset or exacerbation of psychiatric disorder, and questions.
research bears out this view.
Early work by Holmes and Rahe (1967) in the Individual vulnerability
1960s, however, demonstrated that it may not be A person may be vulnerable to disorder on the
the quality (positive or negative) of events which basis of a range of factors intrinsic to that person,
contributes to the course of disorder, but rather for example, the way they were brought up, phys-
the amount of change or readjustment which ical health, stressful events, and genetic factors
events cause in people’s lives. Thus, in the many (Kaplan, Sadock 1988). At first it may appear con-
scales that have been developed to measure life ceptually difficult to separate these factors from
events and life change, there are predictable the disorder itself, and there may also be some
items, such as the death of a spouse and deten- circularity in reasoning.
tion in jail, as well as items such as marriage, birth For example, if it can be shown that people who
of a child, and so on. There is ongoing debate are habitually suspicious are more prone than
about which sorts of events can predict the course others to a psychiatric disorder typified by exag-
of illness. In the case of depression, for example, gerated suspicion, have we demonstrated any-
events involving loss may be more important thing at all? In spite of these difficulties, it can be
than in other disorders. demonstrated empirically that there is no inevit-
The same event may have different meanings able link between any given personality style and
and implications for different people (Brown, psychiatric disorder.
Harris 1978). The view that individuals’ personal The two dominant contemporary approaches
experience or appraisal of events is more crucial to classification in psychiatry (the Diagnostic and
than the events themselves has considerable Statistical Manual of Mental Disorders, fourth
methodological implications. If a researcher is inte- edition, of the American Psychiatric Association,
rested only in events, these can be assessed fairly and the International Classification of Diseases,
quickly using a checklist, but if appraisal is of inter- tenth edition, of the WHO) have attempted to
est, then a detailed and relatively individualized in- deal conceptually with separating clinical dis-
terview format has to be developed. order and other factors intrinsic to the affected
One important methodological advantage of a person by classifying the clinical disorder on one
reasonably in-depth interview is that, apart from axis and personality disorders (and mental han-
providing access to the respondent's perceptions dicap) on other axes.
of events, it also helps clarify whether stressful The separate focus on personality disorder and
events occurred before or after the onset or on personality style (although not uncontrover-
exacerbation of symptoms. As we have seen sial) is useful both in psychiatric epidemiology
before, this is a key question for developing and more especially in studies of chronic diseases
models of cause of disorder. A further advantage of lifestyle, where issues of behaviourial change
of careful questioning in this area is that it facilit- and adherence to treatment are vital.
ates an exploration of recall bias in the reporting Often the concern in such studies is not with
of events. A depressed person, for example, may psychopathology as such, but with factors
recall many unpleasant experiences which would which may contribute to disease and affect re-
not be remembered by a person who is not de- covery.
235
Broader social, economic, and cultural whether South Africans as a whole suffered from
influences higher rates of anxiety than people elsewhere,
From an intuitive point of view, it seems logical to and an attempt was made to explain differences
suppose that rates of psychiatric disorder would in rates with reference to apartheid (Swartz 1987).
increase in proportion to actual physical hardship The problem with this type of analysis is that it
experienced by people. Psychiatric epidemi- assumes a single causal pathway, and ignores a
ologists have explored the relationships between whole host of other possible interpretations of the
resource allocation to groups and populations, data — including the possibility that an inter-
and psychopathology. Though the evidence is not nationally used scale may work differently in
uniform, it does appear that there are relation- South Africa for cultural reasons.
ships between patterns of pathology and broad _In a rapidly changing world where diversity is
socio-economic factors (Blackburn 1991; Dawes, experienced in every country, there has been con-
Donald 1994). A major challenge in such research siderable interest in the effect of cultural factors
is to develop models capable of explaining the on psychopathology. Some early work in this field
mechanisms whereby broad social factors impact was clearly racist — for example, it was once
on psychopathology. thought that African people were incapable of
The psychiatric epidemiology of apartheid — feeling depressed, as they lacked the necessary in-
specifically the relationship between apartheid sight for reflection and concern about their lives
and mental health — can be viewed as one exam- (Littlewood, Lipsedge 1989). This flies in the face
ple of concern about broad social factors. Surpris- of practically all contemporary epidemiological
ingly little epidemiological work has systematic- evidence, which shows high rates of depression in
ally addressed the psychopathological effects of Africa (Littlewood, Lipsedge 1989).
apartheid (Swartz 1987). The challenges of re- The extent to which cultural arrangements
search in this area at a time of national recon- affect mental disorder, and how, is an area receiv-
struction in South Africa are greater than ever ing increasing attention. There is considerable
before. debate, for example, on whether schizophrenia
Other broad social factors that may influence has a better outcome in the Third World than
the mental health of populations can be seen as elsewhere, as early epidemiological evidence
intrinsically undesirable (such as war and civil seemed to suggest. These debates are part of a far
strife). Others can be viewed as broad and com- larger challenge to psychiatric epidemiology as a
plex social movements whose effects need to be whole, and this challenge is dealt with in the next
explored. Urbanization, for example, is a global section.
phenomenon leading to massive changes in
social networks and human relationships, and the Issues of validity in psychiatric
effects of urbanization on mental health are epidemiological research
beginning to be explored. Threats to the validity (accuracy) of epidemi-
Part of the problem in assessing the effects of ological investigations arise from failure to assess
apartheid and urbanization on mental health is the validity of the instruments, to calibrate them,
that it is very difficult to tease out particular ex- and to assess their reliability (repeatability). Psy-
posures for particular persons and to link these to chiatric epidemiologists have also faced diffi-
specific outcomes. There is also such a wide range culties in finding an appropriate way to define
of experience of broad social phenomena such as and measure. disorder, particularly in non-
apartheid, that it is often difficult to define pre- Western settings. These difficulties demonstrate
cisely what is meant by the term. It may be more how it is possible for a gap to exist between what
worthwhile from a methodological point of view is the ‘truth’, and what is measured.
to focus more narrowly on a phenomenon which In general, the validity of,instruments may be
can be measured precisely, and which can either enhanced by conducting medical examinations
serve as a proxy for or be an example of a broader to check whether physical disorders are causing
social phenomenon. It is of course possible to the mental disturbance. The use of multiple
compare the mental health of people living under sources of data, to double check information or to
certain social conditions with those living under fillin gaps, also improves the quality of the data.
other social conditions through ecological de- The validity of screening instruments can also
signs, but such designs are weak. For example, be enhanced by following up a certain percentage
some decades ago there was some interest in of screen negatives, and modifying morbidity es-
236
timates on the basis of the number of screen nega- (Gillis, Elk, Ben-Arie, Teggin 1982).
tives who turn out to be cases. Reliability is as A number of strategies have been proposed for
important as validity, and it is crucial that inter- handling these difficulties. Before undertaking an
observer reliability be established (Swanson- epidemiological study, it is essential that re-
Fisher, Martin 1981). searchers learn about abnormal behaviour pat-
The validity of instruments developed in terns and the socio-cultural context within which
Europe and North America may be compromised such behaviour occurs (Draguns 1984). This
by application in a new context. Researchers fre- could be achieved, for example, by conducting
quently rely on cut-offs which have not been anthropological studies that try to understand the
adjusted for the population under study. This can cultural context of and the reasons for specific
result in inaccurate estimates of psychiatric activities. Based upon such an anthropological
morbidity. investigation, it might be useful to supplement
Besides the general issues of validity, there are ingtruments by adding questions about somatic
at least two major threats to the validity of psychi- expressions of mental illness or specific cultural
atric epidemiological instruments when applied phenomena. These questions can be analysed
in developing countries. The first arises from cul- separately. Other strategies include using person-
tural differences, and the second from the lang- nel familiar with subtle cultural nuances as inter-
uage translation process. viewers or interpreters, asking subjects to explain
Epidemiology is in itself a cultural enterprise, some of their responses, rating a symptom as
embodying Western notions of disease and basic negative whenever there is any doubt about it
Western assumptions about ways that people being positive, conducting physical examinations
function — the split between body and mind, and to make allowances for the influence of physical
between emotions and intellect, are just two exam- health on mental status, and modifying the con-
ples. Psychiatric epidemiology is thus a Western tent of certain questions to make them more rele-
product which tends to take as a given Western vant to interviewees’ everyday experience.
ways of seeing the world, Western classification Translation of psychological tests into African
systems, and Western healing systems. Most languages can also affect the validity of epidemi-
people in the world, however, do not conceptual- ological studies. This occurs primarily because
ize or experience their illness in this way. languages differ in the way they express inner dis-
This has important consequences, the most tress such as depression and anxiety (Swartz,
serious being how to judge whether behavi- Ben-Arie, Teggin 1985). The first step in reducing
our is indeed pathological or not. When measur- the threat to validity cdused by the translation
ing psychiatric morbidity, problems arise from process is to translate the instrument in such a
the confusion between culturally distinctive way that the concept behind each item is ade-
behaviour and psychopathological manifesta- quately conveyed, incorporating local idioms of
tions. For example, in some African settings appa- distress and, if necessary, asking a series of sep-
rent ‘hallucinations’ and ‘delusions’ may occur arate questions rather than a single question. The
even among psychiatrically healthy people and second step is to get someone to do an independ-
may result from encounters with ancestors whose ent translation back into the original language.
influence is believed to be tangible (Gillis, Ben- The third step is to straighten out the differences
Arie, Elk, Teggin 1982). between the original and the back-translation.
A second and related problem stems from the Another set of strategies can be applied during
way that the specific content of a psychiatric test the interview. Where interpreters are used, they
may culturally influence responses. For example, should preferably be from a similar cultural back-
questions about experiences such as watching TV ground to the respondents. They should be fluent
or reading newspapers may result in biased rein the interviewee’s language, have mental health
sponses, as respondents may not have access to expertise, and be skilled in the use of the instru-
these objects. ment in question. The interview process should be
Thirdly, the format of psychiatric tests may constantly monitored by evaluating the relevance
influence responses. For example, the interrog- of answers and observing interviewee’s non-verbal
ative style of most standardized tests may be behaviour. It may even be necessary to ask inter-
foreign to the expectations and practice of many viewees to paraphrase some questions to deter-
Africans, and verbal and non-verbal responses mine how they understand them, or to rephrase
may not reflect the true state of the interviewee ‘difficult’ questions using standard alternatives.
Basil
It will therefore be helpful to conduct pilot only on major recognized psychiatric illnesses
studies comparing the results with assessments (such as schizophrenia) and major affective dis-
made by clinicians, comparing the distribution of orders (disorders of mood). These illnesses are
scores between normal and patient samples, and important in the public health field, as they are
to pre-test the translation. The validity of an distressing, and may be chronic and place a
instrument in a cross-cultural setting may also be burden of care on family and community mem-
assessed by conducting formal statistical assess- bers and the health services. However, psychiatric
ment (for example, factor analysis) comparing epidemiology should and does go beyond the
scores obtained in a cross-cultural setting and study of the major psychiatric conditions. It is
studies elsewhere. often stated that psychological factors are
responsible for much of the work that has to be
done at primary health care level. The implication
Psychiatric epidemiology in the (spoken or unspoken) is that these factors waste
primary health care context primary health care resources. The first task of an
approach to psychiatric epidemiology at this level
General approach is to sort out a series of questions about psycho-
Following the Alma Ata declaration, there has logical factors in primary health care, and to
been increasing interest and emphasis on inte- decide which of these questions is relevant to the
grating mental health services into primary setting being studied.
health care (PHC). This has important implica- A central question which has been asked in psy-
tions for psychiatric epidemiology. What does it chiatric epidemiology concerns the paths people
mean to integrate mental health services into pri- follow on their journey to becoming known psy-
mary health care? At one important level, it chiatric patients. Table 26.4, reproduced from
means treating obvious or known cases of mental Goldberg and Huxley (1992), shows that in Britain
disorder as far as possible at the primary level and not all people who suffer from mental disorder
in an integrated way. At another, though, it means (defined as having an episode lasting at least two
rethinking the ways in which we understand weeks in a calendar year) consult primary care
health and illness behaviour (that is, the ways in services, and that there is a progressive filtering
which people interpret, respond to, and act on by which people become users of mental illness
their experiences of ill health) and reorientating services and, even more rarely, psychiatric in-
PHC programmes to accommodate these factors. patients.
Anyone with clinical experience in any area of
health care will be familiar with patients whose Of central importance in Goldberg and Huxley’s
behaviour does not easily fit with the health care work are two observations.
system. These may include people who come for Not everyone suffering from mental disorder
help when no physical problem can be found, comes to the attention of mental illness ser-
people who experience pain and discomfort out vices, or even to the attention of the health ser-
of proportion to their pathophysiology, people vices in general.
who do not take medication as prescribed, people The process whereby one becomes a patient in
who continue with lifestyles which are bad for mental illness services depends only partly on
their health, people who abuse their bodies with the symptoms from which one suffers.
cigarettes, alcohol, and addictive drugs, parents
who seem to have their children’s best interests at As Goldberg and Huxley note, the filters between
heart, but who fail to do what is necessary to keep different levels in their model depend partly on
the children healthy, and so on. The list is endless. how people who experience symptoms under-
The AIDS epidemic has convinced many sceptical stand and act on them (illness behaviour), clini-
health care professionals that serious attention cian competence (ability to detect disorder), and
must be given to psychosocial factors as part of patterns of referral and resources available (fefer-
any primary health care programme. ral to mental illness services and admission to
Psychiatric epidemiology has much to offer any psychiatric hospitals).
health worker or researcher in the quest to under- - In order to tease out the different contributors
stand and influence patterns of health and illness. of paths to psychiatric care and status as a psychi-
Psychiatric epidemiology is, however, narrowly atric patient, we need to be able to answer ques-
understood and is sometimes thought to focus tions both about what constitutes a psychiatric
238
A series of studies which demonstrate many of

Table 26.4 Five levels and four the issues involved in thinking about psychiatric
Te LCL AY epidemiology in relation to primary health care,
and which also address some other questions
raised in this chapter, were undertaken as part of
Level 1: The community 1st filter
the Mamre Community Health Project (Miller,
(illness behaviour)
Swartz 1990; Reynolds, Swartz 1993; Rumble,
Swartz, Parry, Zwarenstein [in press]; Swartz,
Level 2: Total mental morbidity — attenders
Miller 1991). A brief discussion of those studies is
in primary care 2nd filter
given below, for illustrative purposes.
(ability to detect disorder)
Psychiatric epidemiology in practice: the

Level 3: Mental disorders identified bydoc-
Mamre Community Health Project
tors (‘Conspicuous Psychiatric Mor-
The Mamre Community Health Project, a collab-
orative enterprise between the people of Mamre,
3rd filter
the University of Cape Town, and the Medical Re-
(referral to mental illness services)
search Council, was established in 1986 as an
opportunity for epidemiological research and to
Level 4: Total morbidity — mental illness ser-
Ath filter improve the health of Mamre (see also Example
3.2). The project is unusual, as it has had alarge
(admission to psychiatric beds)
psychological component from early on in the
project’ history.
Level 5: Psychiatric in-patients
In 1986, a community-wide survey was under-
(Source: Goldberg D, Huxley P 1980, 1992) taken of health status in Mamre. The researchers
were interested in getting an idea of the preval-
ence of psychiatric disorders in Mamre. Wishing
case and about behaviour of patients and practi- to use a term for psychiatric illness that was
tioners in the context of the particular resources acceptable to the community, they asked re-
available to them. Equally important is the observa- spondents in a pilot study whether they suffered
tion that a very substantial proportion of people from their ‘nerves’. A very large number of
who suffer from episodes of mental illness will not women answered yes — far more than could ever
come into contact with health services at all, or, if conceivably have been psychiatric patients
they do, they will not be perceived to be suffering according to what is known of the expected
from mental illness. In some cases, the person who number of patients in any given community. As a
presents at a PHC service may bearelative experi- result, the question was modified to ask whether
encing the burden of living with a person who is far people had been treated for their nerves in the
more symptomatic, but who will not use services at past year. Roughly 2,9% of respondents answered
all. For example, a person could be very withdrawn in the affirmative, with a male:female ratio of
and suspicious, and therefore unlikely to use any approximately 1:3. At this point, psychologists
services at all. This withdrawal and suspicion could were asked to join the project team.
place a great burden ona relative or someone else Using the Goldberg and Huxley (1992) model,
in the household, who would then go for help be- psychologists were able to explore not only
cause of their own distress. mental disorder itself, but also important factors
A further implication of the Goldberg and concerning its identification and management.
Huxley model is that many people suffering from The questions asked at each level and filter, and
distressing symptoms need never become psy- the ways in which these were addressed, are
chiatric patients as such. This does not neces- shown in Table 26.5. Some details of the research
sarily make their experience any less distressing strategy have been omitted and altered slightly in
or important as a focus for concern on the part of order to maintain conceptual clarity.
health service planners and epidemiologists. There is no psychiatric inpatient facility ip
Looking at the psychiatric epidemiology of Mamre, so only the filter to this level of care — and
primary health care can provide ways of under- not details about the actual care received — was
standing not only disorders in themselves, but studied. It would have been impossible to survey
issues in service provision as well. all residents of Mamre who were in hospital else-
239
where (especially in Cape Town) at any given time range of ways, both quantitatively and qualit-
to assess directly, for example, level of care atively. The issues of validity so central to assess-
received. ment in psychiatric epidemiology were especially
The various studies in Mamre allowed for a clear in the cross-sectional, community-based
spectrum of questions to be addressed in a wide survey, which is described briefly.
Pre) (wes Cr Cr Creamy Ly) the Mamre eT LAG Prey

Project as an application of the Goldberg and Huxley method
QUESTION ‘RESEARCH STRATEGY
Level 1: Mental illness in the community

How common is psychiatric morbidity in the Community-based random survey (cross-
Mamre community? sectional) Note: some methodological issues
raised by this study are discussed below
First filter: illness behaviour

What factors lead people in Mamre with i) Questions about illness behaviour in the
psychiatric morbidity to approach primary random survey;
health care facilities for help? ii) Qualitative study of ‘nerves’ amongst known
psychiatric patients
Level 2: Total psychiatric morbidity in primary care

What proportion of people from Mamre using Cross-sectional survey of psychiatric morbidity
primary health care services suffer from amongst all people from Mamre using
psychiatric symptoms? primary health care services both in the village
and in the large neighbouring town
Second filter: Detection of disorder

To what extent do primary health care personnel i) Rating sheets filled in by primary health care
detect psychiatric symptoms, and what are personnel on all people assessed during
some of the factors affecting this detection the cross-sectional survey, and comparison
pattern? of their ratings with scores obtained on a
standard instrument
ii) Semi-structured interviews with primary
health care personnel about their approach
to identifying psychiatric morbidity
Level 3: Conspicuous psychiatric morbidity

What proportion of people making use of PHC Rating scales mentioned above
services are recognized by practitioners as
having psychiatric morbidity?
Third filter: Referral to psychiatrists

What factors (apart from detection of disorder) i) Interviews with PHC practitioners
contribute to PHC practitioners’ decisions to ii) Review of records of known psychiatric
refer to psychiatric services? patients in Mamre =
Level 4: Total psychiatric patients

Who are the known psychiatric patients in i) Review of the records and register kept by
Mamre, and what are the characteristics and the psychiatric sister who deals with Mamre
needs of themselves and their families? ii) Survey of all known psychiatric patients and
their families
240
Table 26.5 (continued) Research strategy used tnthe N eree Community

Health Project as an application of the Goldberg and Huxley method:
Fourth filter: Admission to psychiatric beds

What factors play a role in determining whether i) Detailed admission histories of all known
Mamre residents are admitted to psychiatric psychiatric patients, coupled with interviews
hospitals in Cape Town? regarding access to psychiatric resources
ii) Intervention study of the effect of a support
group on rates of re-admission to psychiatric _
hospitals
«
Level 5: Psychiatric in-patients |
There is no psychiatric in-patient facility in Mamre,
so only the filter to this level of care was studied.
Randomly selected adults were assessed methods in South Africa (Rumble, Swartz, Parry,
using the Self-Reporting Questionnaire (SRQ) as Zwarenstein, [in press]).
a first-stage screen, and the Present State Exam-
ination (PSE-9) as the second-stage criterion.
Conclusion '
Demographic, health care utilization, and sub-
stance abuse data were also collected. Using the Psychiatric epidemiology in South Africa (and in
PSE-9 CATEGO Index of Definition of 5, the the developing world generally) is at an exciting
weighted prevalence of psychiatric morbidity stage. Methodologically, designs that have been
was 27,1%, the majority of cases being given a used up to this point have been rather unsophist-
tentative diagnosis of depressive or anxiety dis- icated. Part of the challenge now is to move to
order. Even though the ‘CATEGO’ algorithm was more sophisticated designs, including those
initially chosen as a standard for assessing which assess the efficacy of interventions. Metho-
whether a person met the criteria for classifica- dologically, much remains to be done in South
tion as a case of psychiatric morbidity, it Africa in terms of developing psychiatric epi-
became clear that this was not appropriate, as a demiology in the direction of analytical and inter-
substantial proportion of cases were identified vention studies. Questions of validity are also cru-
by CATEGO as having paranoid disorders. It cial to consider.
emerged on careful re-analysis of the data that Concerned as it is with the difficult-to-measure
what may be culturally appropriate ways of ex- but crucial realities of people’s daily lives, psychi-
plaining disorder in Mamre (such as believing atric epidemiology has been forced to grapple
that it was caused by the bad deeds of others) with how one makes as accurate an assessment of
was being rated as part of a paranoid syndrome those lives as possible. Many of the issues which
on the PSE. It was therefore necessary to rein- face psychiatric epidemiology are therefore rel-
terpret this type of phenomenon in the light of evant to epidemiology in general. One of the most
knowledge of local cultural beliefs. The SRQ’s important of these is the need for epidemiology to
weighted sensitivity and specificity were 49% be informed by local practices.
and 82% respectively. Sensitivity was therefore Modern psychiatric epidemiology is taking in-
very low, and further research on criterion va- creasing account of the reality that there are
lidity of the SRQ in South Africa is needed. Over- many perspectives on disorder and its treatment.
all, the SRQ correctly identified 67% of cases Its methods are being combined with qualitative
and non-cases. No demographic variables pre- and anthropological approaches, as we have
dicted psychiatric morbidity, but there was an seen. It is also important, however, to recognize
indirect link between morbidity and primary the extent to which psychiatric epidemiology’s
health care utilization. In summary, the study engagement with a diverse world has helped
provides useful information about prevalence, focus our attention on the nature of the epidemi-
and important lines of enquiry regarding the ological enterprise itself.
refinement of psychiatric epidemiological
241
@ epidemiology applied to content areas
EXAMPLE 26.1 the Luganda back into English and modifica-

Psychiatric disorders in two African tions were made to the original translation. The
translated version was then used in a psychi-
villages atric department in Kampala to test it against
Probably the most cited psychiatric epidemi- clinical judgement. Questions that did not elicit
ological research study in Africa is one conducted answers relevant to the symptoms they were
by John Orley in Uganda and reported by him and designed to illuminate were modified. Later, a
John Wing in an article published in 1979. Dr pilot study involving 25 respondents was car-
Orley, a British-trained psychiatrist, had carried ried out in a rural area.
out extensive anthropological and ethnographic Procedures: All interviews were carried out by
field work in East Africa for five years prior to the Dr Orley, assisted by a research assistant (a
study. man from the area). A physical examination
was also undertaken to rule out malaria and
Study objective: To determine the nature and other problems which might underlie psychi-
extent of psychiatric problems in two African atric symptomatology.
villages using a standard system of defining, @ Results: 27% of the men and 24% of the women
eliciting, and recording symptoms. were found to have psychiatric disorders at the
@ Design: A cross-sectional study. threshold level and above. Five percent had
@ Sample: 206 residents of two rural villages in definite psychiatric disorders. Most of these
Uganda. Subjects included adult residents in all conditions were depressive, but hypomanic
homesteads along selected village paths. and anxiety states were also represented. A
Instrument: A brief (48-item) form of the Pre- large number of respondents had both depres-
sent Status Examination (PSE-9) which was sive disorders and anxiety states.
translated into Luganda by a Ugandan medical
student. Another Ugandan then retranslated
Reference:
Orley J, Wing JK. ‘Psychiatric disorders in two African vil-
lages.’ Archives of General Psychiatry 1979; 36:513-20.
242
2i Anthropometric
studies
Introduction In determining height and weight it is essential

that the measurements are made in a standard-
Anthropometric studies are carried out to deter-
ized way, meaning that the instruments (for
mine the nutritional status and growth patterns of
example, scales) as well as observers must be
a group or groups. This can be done to identify
standardized (see Appendix 2). Age should be cal-
individuals who are under- or overnourished, or culated using date of birth (obtained from, for
to investigate the relationship between nutri- example, a Road to Health card) and date of
tional status and some disease outcome (for examination. In cases where an accurate birth-
example, obesity as a risk factor for heart disease, date is not available, it is appropriate to use the
or undernutrition as a risk factor for infectious weight for height indicator rather than the two
disease or poor school performance). Anthropo- age-related indicators. Because children vary
metric studies can also be used to monitor or considerably with regard to the onset of puberty,
evaluate the effect of nutrition intervention pro- these nutritional indicators should not be used
grammes, for example, supplementary feeding.
for children over the age of 10.
Repeated assessments can establish whether the
overall nutrition situation is improving or deteri-
Reference population and cutpoints
orating, and whether existing nutritional pro-
Each child’s age, height, and weight are deter-
grammes should be modified. The standard of mined and compared with values obtained from a
nutrition reflects quality of life and, less directly, reference population. The most often used refer-
socio-economic development. In children, one ence standard is the data collected on a broad
normally identifies the poorly nourished and cross-section of normal American children by the
growth retarded as a risk group, whereas in adults United States National Center for Health Stat-
obesity is considered a risk factor. istics (NCHS), which has also been tabulated by
the WHO. Some researchers object to the use of
Assessing the nutritional status of the American data as a reference, because chil-
children dren elsewhere may differ anthropometrically
from American children, not because of nutri-
Nutritional indicators tional status, but due to other factors, such as
genetics. For example, even well-nourished
Definition Japanese children in the USA and Chinese chil-
Age, height, and weight can be used to determine dren in Hong Kong are shorter than American
nutritional status in children, since these three children on average. However, the American data
measurements form the following indicators: is generally accepted as the important inter-
@ height for age; national reference standard.
weight for age; For the reference population, percentiles have
@ weight for height. been calculated by sex and age (in years and
months). For example, for girls aged five years,
243
the NCHS-tabulated 3rd percentile of weight for above the expected on the NCHS tables. Gross
age is 14 kg; that is, in the reference population, childhood obesity is associated with orthopaedic
only 3% of girls aged five weighed 14 kg or less. If problems and overweight with psychological prob-
we examine a girl of five and find that she weighs lems. Most fat babies lose their fat in childhood, but
13,5 kg, we would say that she falls ‘below the 3rd obese children tend to become obese adults.
percentile weight for age’. By referring to the
reference population, we can determine which Interpretation
children or what proportion of children are The three indicators each reflect a different
undernourished. Different definitions of under- aspect of nutrition. A child who is low on height
nutrition exist; often the 3rd percentile or the 5th for age is stunted and has a history of undernutri-
percentile are used as the cut-off level below tion (that is, chronic undernutrition). If this child
which undernutrition is diagnosed. More re- has a normal weight for height, it is currently
cently, the Nutrition Unit of the WHO has recom- being normally fed. Weight for height reflects cur-
mended the use of the median (the 50th per- rent nutritional status. A child could have a low
centile) minus 2 x standard deviation as the cut- weight for age if it is stunted (has low height for
off. (The values given by this cutpoint are similar age) or thin (has low weight for height) or both.
to those obtained when using the 3rd percentile.)
In the WHO publication Measuring Change in Other measures of nutritional status
Nutritional Status (1983), various percentiles, as Mid-upper arm circumference can be used as an
well as the median minus selected multiples of indicator of wasting. Mild to moderate cases of
the standard deviation, are tabulated for the malnutrition will not be detected by this
NCHS reference population. A researcher can measurement, but under famine conditions it is a
thus manually look up a cut-off point and classify good measure to use. In some research studies,
an individual child on that basis. The data has skinfold thicknesses are also measured. (Head
also been computerized, and a computer pro- circumference is generally used to measure brain
gramme, Epi Info, can be used to categorize chil- growth, not nutritional status.)
dren or summarize anthropometric data. A
child’s growth can thus be monitored by plotting Assessing the nutritional status of
his or her weight at consecutive ages, for example.
This is commonly done at well-baby clinics, adults
where weights are plotted on the child’s Road to To be able to define obesity accurately, one needs
Health chart (Figure 27.1 shows the chart for the to be able to measure body fat. Body fat can be
Birth-to-ten project). In this way, any faltering in measured by the seldom-used technique of
weight can be detected and prompt action taken. underwater weighing, which is costly and cum-
Not only children who fall below the 3rd _per- bersome. More easily measurable characteristics
centile, but any child who shows a decline in therefore have to be used as indicators of body
weight for age, is seen as possibly being at risk. fat. Many studies have been conducted to assess
Another measure which is sometimes used is the value of skinfold thicknesses and various
that of the percentage of expected value for a weight-height relationships as predictors of body
given indicator. For a given age or sex, the median fat. Skinfold measurements (biceps, triceps, sub-
of the indicator in the reference population is scapular, and suprailiac) have generally been
taken as the expected value, and the observed found to correlate well with body fat measured by
value can thus be expressed as a percentage of underwater weighing, and formulae exist for con-
expected value. The five-year-old girl has an verting skinfold thicknesses to an estimate for
expected weight for age of 17,7 kg (that is, the body fat. For accurate determinations of skinfold
median weight for age for a five-year-old girl is measurements, researchers need to use standard-
17,7 kg), so her weight of 13,5 kg is 76,3% of ized callipers, and to be well-trained in the
expected. In this context, 80% of expected is often methodology for these measurements, as meas-
used as the cut-off level below which undernutri- urement variation is often large. Age- and sex-
tion is diagnosed. specific reference data for skinfold thicknesses is
Obesity in children receives less attention than available from the Health and Nutritional Exam-
undernutrition because of uncertainties about the ination Survey (HANES) of the United States.
long-term dangers of childhood obesity. Obesity The indicator most often used to assess the
can be defined as a weight for height which is 20% nutritional status of adults, however, is the Body
244
245
anthropometric studies
winyepayoogay,
Yul
ayep
apueew syjuow
BGujpaoauem
I ueA ajseyiy!SSe|y (Spuom p - ¢) Sedua}UeS
UONLINWJEPUN Jo UONeOISSeID (aps00M p - ) aUUIS
yates FETC TERET
= HP = ee
oe 0 hsRe =
ea ee
27
POO
Peete De te ete | erate?
4
pe pe A ee
Pee a bk |)see aT. Gee Peter otot
ee = ee wi | Le Boe eR 2 Si ale!
eit |eee ba dole
rt [\:
aH
OL
beadBe
ary:
Lda
AG Se al SE Ee EE el
ro
oh
Zz
ry
|
.
1a
|\|
a
\
=
\
ETT TrineTl
‘SIEKTES EN VOORVALLE
‘SLN3A3 ONY SS3NTH

L— Ly
(1/9/8)Sujpacnwew Jepue jo Suipeorsiog
;‘ Elslslay | melee
LT |
Pro
Ye
\\
hI
4
a
|
Nt
eta eee Pete Vel
¢ eile
ela& a!
seer
eee Pee El
eee EH TCO
UU
sgt
BOUIBJWNDIII
oO iim VSEDPEES
soe
SCE MT
(wo)
PRAY
sehiesdes
qi co a ie se Cre tT tte is te Fe2 sere
SUIWEYA
uo4|
pue
oe se | ieult
leipat eis peiblilieet a oS
aJ0q96 eqeq @MNN GN aU xNIqGa6 aIN X HINGED A ONINNYIdIASNISID WO Aqeq MAN GN Pssn JON x Pas) A ONINNWId ATIWVS
MI
nan
TANT
POEM Le mee UL iewe ee
Mass Index (BMI), also known as the Quetelet age): 44,3% of the boys and 44,1% of the girls.
index. To be able to calculate this index, we need However, less than 4% of the boys and girls
to determine a subject’s height and weight (once showed signs of acute malnutrition (fell below
again ensuring that this is done in a standardized third percentile of weight for height). It was found
way). that significantly more children from informal
BML is defined as: weight (in kg) dwellings had chronic malnutrition than children
height (in m) x height (in m) from formal dwellings: 53,2% compared to 39,5%.
BMI values can be categorized into undernour- To determine the reliability of the measure-
ished (women with BMI <19, men with BMI <20), ments, a random sample of 45 children were
normal (women with BMI 19-<24, men with BMI measured and weighed again. Poor reliability was
20-<25), overweight (women with BMI 24-<30, found, possibly due to the difficulty of measuring
men with BMI 25-<30) and obese (BMI 30 or more children aged 12-23 months in informal settle-
for both sexes). ments.
The distribution of fat over the body is also Weight for age results were compared with in-
important when assessing the nutritional status formation routinely documented on master cards
of adults. Individuals can have a gynoid shape (fat at the Alexandra Health Care Centre well-baby
is localized in the lower part of the body; pear- clinic from June 1990 to December 1990. This
shaped) or an android shape (fat is localized in source indicated that 5,5% of children were below
the upper part of the body; apple-shaped). These the third percentile of weight for age, compared
different shapes have been found to be associated to 17,7% of the boys and 9,3% of the girls in the
with different diseases. So, for example, people community survey. The researchers concluded
with an android shape are more at risk for that this may indicate that the health centre is not
diabetes, ischaemic heart disease, and hyperten- reaching those children in greatest need.
sion. Measurements of waist, hip, buttocks, and Recommendations made for nutrition pro-
thigh circumference can be used to determine gramme development include that an evaluation
which shape an individual has. should be made of who attends the health ser-
vices, and how to attract those in greatest need.
Conclusion Furthermore, health workers should be trained to
measure anthropometry accurately.
Nutritional status is both an important risk factor
and health outcome to measure and monitor. Reference:
Strict guidelines on how measurements are made Coetzee DJ, Ferrinho P. ‘Nutritional status of Alexandra
and interpreted have te be followed to ensure that township. Clinic-based data and a community survey.’
valid results and conclusions are drawn. South African Medical Journal 1994; 84(7):413-15.
EXAMPLE 27.1 EXAMPLE 27.2

Nutritional status of children aged A profile of obesity in arural and
12-23 months in Alexandra township urban black population in the Free
To evaluate the nutritional status of children aged State
12-23 months in Alexandra township, a commu- In a random sample of households in a rural
nity survey was conducted in April 1990. Cluster region of the Free State and an urban area, vari-
sampling was used to select 450 children. Weight ous risk factors for coronary heart disease were
was measured to the nearest 0,1 kg in under- investigated in participants aged 25 years and
clothes only. Scales were calibrated to zero before older. Obesity was one of the risk factors. Two
a child was weighed. Height was measured to the doctors performed all anthropometric measure-
nearest 0,1 cm, using standard WHO methods. To ments which included height, weight, waist and
calculate age, date of birth was taken from the hip circumference. All measurements were
Road to Health card or obtained from the mother. done following strict guidelines, and techniques
Measurements were made by experienced were periodically standardized. In the age
observers. strata, the following percentages of female
Nearly half the children had chronic malnutri- respondents were found to be obese (BMI 30 or
tion (fell below the third percentile height for more):
246
27 anthropometric studies
FEMALES 25-34 years, to 20,4% in the urban males aged

Rural Urban 55-64 years. The mean waist:hip ratio of women
Age stratum % obese stratum % obese in both groups and for all age groups exceeded
group size size 0,8, which is seen as the cutoff value for an
increased risk of, for example, ischaemic heart
25-34 142 27,5% 105 31,1%
disease. Whether this cutoff value holds for this
35-44 110 41,8% 126 42,9%
population must still be established.
45-54 99 42,4% 105 54,3%
55-64 102 49,0% 68 47,1%
Reference:
65+ 121 31,4% 64 42,2%
Mollentze W F, Moore AJ, Steyn AF, Joubert G, Steyn K,
Oosthuizen G M, Weich DJ V. ‘Coronary heart disease risk
The percentage of male respondents who were
factors in a rural and urban Orange Free State black popula-
obese ranged from 3,9% in the urban males aged tion.’ South African Medical Journal 1995; 85(2):90-96.
247
28 Assessment of
dietary intake
Introduction Methods of data collection in dietary

In the case of health outcomes for which there studies
is no ‘agent’ responsible, epidemiologists com-
monly investigate people’s lifestyle, habits, or There are several methods available for the
environment. Researchers investigate these assessment of food intake at the individual level
factors firstly to describe them in the popula- (for example, weighed record method, 24-hour
tion, and secondly to determine if there is a re- recall method). Various factors, such as the pur-
lationship between a factor and a health out- pose of the study (to obtain qualitative food in-
come. take data or nutrient intake quantified in grams
Dietary intake, which is intricately bound and milligrams), qualities of the study population
with environment, socio-economic status and (for example, age, literacy level), and availability
culture, is one such ‘lifestyle’ factor which is of resources (interviewers, scales for weighing
often investigated. In epidemiology, the saying food portions), determine the choice of method.
‘You are what you eat’ is translated into the The advantages and disadvantages of each
questions ‘What do you eat (exposure)?’ and method should be weighed against the priorities
‘What is your health status (outcome)?’ of the study when making a final choice (Table
Thus, in epidemiology, dietary studies (also 28.1). The methods for assessing food intake can
known as food consumption studies) are car- be divided into two basic categories:
ried out in order to collect diet information at a ¢ recording of data at the time of eating
national, institutional, household, or individual @ recall of food eaten in the past (immediate,
level. It is important to have national informa- recent or distant past).
tion on the per capita food consumption, the
availability of food, and the trends for these Recording of data at the time of eating
over the years, so that national food and nutri-
tional planning and policy formulation can take Weighed records
place. At the individual level, food consump- All food and drinks are to be accurately weighed
tion data is needed to: assess the adequacy of or measured, and recorded on record sheets for
dietary intake of free-living or institutionalized the duration of the dietary study (for example,
respondents; expose differences in the diet or seven days or 30 days). Each respondent needs to
eating pattern of experimental versus control be equipped with a scale, as ordinary kitchen
participants; determine relationships between scales are not accurate enough. Research-quality
dietary components and disease states; moni- scales, however, are very expensive. Weighing
tor diet as a variable in a research project; and and recording absolutely everything taken at
to intervene in a study population. Methods for mealtimes and between meals place a heavy
the collection of per capita food availability at a burden on the respondents, and the longer the
national level, or dietary studies at household period of record-keeping, the more respondents
or institutional level, will not be discussed here. tend to simplify their diets. Only literate, very
248
28 assessment of dietary intake
Ces eMC eC rCR cere icra t ty

dietary data collection at individual level
Method Main Main Researcher nutri- Data as

advantages disadvantages tional background Foods Nutrients
RECORDS
Weighed Very accurate Expensive. NEEDED — for
record Heavy burden detailed instruc-
on respondent. tions on weighing v v
Limited sample size. and recording and
Respondent literacy to check records.
essential.
Simplification of diet
by respondent.
Estimated No special equip- Heavy burden NEEDED — to train
record ment needed on respondent. and instruct on
Simplification of recording and v v
diet by respondent. how to quantify.
Respondent literacy
essential and thus 4
leads to under-
representation of
people with low
intelligence/education.
RECORDS Very precise Combination of the NEEDED — as
COMBINED above. above, and to
WITH DIRECT : check records and v v
CHEMICAL samples.
ASSESSMENT
RECALLS
24-Hour Low burden on Relies on memory. NEEDED — to
recall respondent. One recall not interview and/or
Low refusal rate. representative of train interviewers, ¥ Y
Takes little time. individual's intake to quantify intake.
Suitable for large — only for groups
numbers. of >50.
Does not repre-
sent usual intake.
Underestimation
of intake.
Quantified Low burden on Experienced inter- NEEDED — for the
food respondent. viewer required. interview, designing
Usual pattern Long interview. the questionnaire, ¥ Y
frequency
determined. Heavy demands to quantify intake.
Large numbers on interviewer.
possible. Overestimation
of intake.
Food Low burden on Not suitable NEEDED — for
frequency respondent. for nutrient designing the
calculations. questionnaire. v —
(qualitative) Classify respon-
dents in gross
frequency categories.
249
dedicated people qualify as suitable respondents. search methodology indicates that data from at
Because onlya select group of respondents are least 50 respondents is needed to describe the
suitable, random sampling cannot be done and ‘usual’ intake-for a group. Thus subgroups (for
the representativeness of samples is usually example, age or sex groups), have to have at least
limited. 50 participants. More than one 24-hour recall by
The researcher needs to train the respondent the same respondent, for example, for two week-
and give detailed instructions on the weighing days and one weekend day, cancel some of the
and recording of foods and drinks taken, as well disadvantages of the method.
as training on how to deal with combined dishes
such as ‘potjiekos’, samp and beans, pizza, Quantified food frequency
samoosas, and how to manage recording of meals The interviewer has a structured questionnaire,
not eaten at home. Records need to be checked and the respondent recalls how often and in what
often (daily in some studies). quantities the foods listed were eaten over a
specified period of time; for example, the last
Estimated records month, three or six months, or the last year. This
All foods and drinks are to be recorded on record is a long interview which needs an experienced
sheets in terms of household measures (such as interviewer. Visual aids are used to assist in quan-
spoon- or cupfuls), or in dimensions (for exam- tifying intake. In a Western-type mixed diet, this
ple, one meat patty, 8 cm diameter, 1 cm thick). method tends to overestimate intake because the
The researcher needs to train the respondent on wide variety of different foods means that a large
how to quantify food intake. In addition, written number of foods is recorded. Respondents are
instructions are usually part of the record sheets. usually unable to compensate in their reporting
Records are usually kept for seven days, but the for substitution (where some foods replace others
period is determined by the purpose of the study. during a specified time period). The quantified
For example, for representative information on food frequency gives a better indication of usual
nutrients for which intake can vary greatly food intake than the 24-hour recall. The re-
depending on food choice (such as vitamin A or searcher needs insight into the eating habits of
cholesterol), nine days or 28 days of record-keep- the research population, to ensure that often-
ing might be necessary. The estimated record used foods are included in the questionnaire.
method puts a lesser burden on the respondent
than the weighed record method. Food frequency method (qualitative food intake
In some studies, duplicate portions of all foods data)
eaten are to be assembled by the respondent for The interviewer uses a structured questionnaire
chemical assessment of the nutrient content of to fill in the number of times specific food items
the total food intake. Respondents find it difficult were eaten over a specified period of time. The
to dispose of valued food items such as alcoholic quantity of food eaten is not asked. Such inform-
drinks, cake, steak, and slabs of chocolate in such ation can be used to monitor respondents’ dietary
a way! intake in a study (for example, data to prove that
diet changed or remained unchanged during an
Recall of food eaten in past experiment), or to expose differences in the fre-
quency of intake between groups, for example
24-hour recall age, sex, Or experimental groups. Data from this
Instead of writing down dietary intake on record method cannot be used to calculate intake of nu-
sheets, the respondent is interviewed on all food trients.
intake for the previous day, that is, a 24-hour
period. The interviewer needs to be experienced Dietary histories
and must use visual aids to assist the respondent The term ‘dietary history method’ is often used,
in quantifying food intake. These can include sometimes incorrectly. According to interna-
food models, real food, and spoons of varying tional definition, a dietary history consists of a
sizes. combination of a 24-hour recall with a meal pat-
This method relies on the memory of the tern (that is, how many food intake periods per
respondent and intake is usually underestimated. day, and what foods are usually consumed at each
‘Yesterday’s’ intake is not representative of an of these periods) over an extended period of time
individual’s usual intake, and dietary study re- with food quantities specified in household
250
measures, as well as a checklist of foods usually of portions (portion size specified) of foods eaten
consumed. from the five basic food groups (milk, meat and
alternatives, vegetables and fruit, cereals, and
Important aspects of data collection fats) can be calculated. For example, these could
In dietary studies it is important to take note of calculate that children under five are taking only
the following aspects. 215 ml milk daily, compared to the recommenda-
@ Respondents must be trained to keep records. tion of 500 ml milk daily, or that adults are having
Correct reading of the scale and the discipline two portions of vegetables and fruit daily instead
not to estimate a weight are of crucial import- of the recommended minimum of four portions.
ance in the weighed record method. For the es-
timated record method, respondents need to Calculation in terms of nutrients
be taught in how much detail to record and For this, one needs information on food composi-
how quantities should be estimated and tion. This can be obtained by chemical analysis
recorded. (depending on the aim of the study, but not often
@ The training of the interviewers for the recall done, as it is very expensive) or from food com-
methods must be standardized. It is of great position tables (commonly used). Food composi-
importance that quantifying of intake is ‘stan- tion tables usually give the amount of nutrients in
dardized’ if more than one interviewer is used. a specified amount of food, for example, per 100 g
This implies training the fieldworkers together, of food. There are 3 g protein and 120 mg calcium
with food models and real food available to in 100 g milk, for example. The Medical Research
visualize quantities, and to reach mutual Council has compiled Food Composition Tables
definitions, even if these definitions apply only from American, British, and (where available)
to one specific study (for example, the size ofa South African food composition data. In the 1991
‘medium’ apple, or a ‘heaped’ spoonful). version of the Food Composition Tables, only
18% were South African values.
Validity and reproducibility Dietary data can be analysed by hand, but this is
very cumbersome. The quality of the results is at
Several articles have been published reviewing
risk because so many calculations need to be made.
the limitations of the various methods for assess- Dietary data can also be analysed by computer; and
ing food consumption. There is no ‘gold standard’ programmes such as SAS can be used. For comput-
for the validation of dietary study methods. One
erization, dietary intake data needs to be coded.
method may be compared with another method The MRC Food Quantities Manual is very useful for
(relative validity), or a new method may be converting food reported in household measures
validated against an external criterion such as a into grams of food consumed. Two software pack-
biological marker. Repeatability (measuring the ages for personal computers, containing the MRC
same subject under the same conditions) is influ- food composition database, are available. Food-
enced by the day-to-day variation in the subject’s finder is a nutrient analysis programme which can
food intake, as well as by seasonal variation and be used for small scale research, while Foodfundi
other factors. Professional is a nutrient analysis and counselling
programme for the therapeutic consultant. For
Calculation of intake more information on these products, contact the
Dietary data is usually presented either in terms authors.
of food intake or nutrient intake. The exception is
the qualitative food frequency method, where in- Standards for interpretation of
take is not quantified and nutrient intake thus
dietary data
cannot be calculated.
Food intake, expressed in number of portions
Calculation in terms of foods consumed from different food groups, can be
This, in its simplest form, involves counting (by - compared with recommendations for healthy
hand or computer) of the frequency with which eating, to describe adequacy of the diet. Nutrient
specific foods are eaten. Computer programmes data can be compared with recommendations for
have been developed at the National Research energy, protein, vitamin, and mineral intake.
Programme for Nutritional Intervention of the South Africa does not have its own recommenda-
Medical Research Council by which the number tions for these, but uses those of the United States
251
of America (the US Recommended Dietary Allow- Intensive training of the interviewers was
ances [RDA], revised 1989), or the WHO (1985), or spread over five days and conducted by dietitians.
Britain (1991). Cut-off points such as <75%, <67%, Included in the training were general aspects of
<50%, >=100% and >=120% of the standard are interviewing techniques; awareness of the subtle
used to describe the adequacy of nutrient intake. differences in foods (for example, full cream, low
South Africa has its own dietary guidelines for fat, skim milk; butter, brick margarine, tub mar-
the prevention of coronary heart disease: less garine); awareness of food preparation methods
than 30% of energy should come from fat, and and ways of quantifying food intake, including
carbohydrate should contribute at least 55% of the use of food models to describe volume or size
energy. of portions.
The information was coded bya trained coder
Conclusion with knowledge of the scope of the food composi-
tion tables, and punched into a mainframe com-
Several methods are available to assess dietary puter for analysis. The results provided a profile
intake. Data can be described in terms of foods or of the dietary pattern in an urban African com-
nutrients, and standards for evaluation are avail- munity.
able. Dietary studies are not easy to execute or to
analyse; therefore, where evaluation of the diet
References
seems to be an essential part of a study protocol,
it is recommended that: Bourne LT, Langenhoven M L, Steyn K, Jooste P L,
LaubscherJ A, Van der Vyver E. ‘Nutrient intake in the
adietitian, knowledgeable in dietary research urban African population of the Cape Peninsula, South
methodology, is included as part of the re- Africa. The Brisk Study.’ Central African Journal of
search team; Medicine 1993; 39:238-47.
abaseline period (usually at least two weeks), Bourne LT, Langenhoven M L, Steyn K, Jooste PL,
NesamvuniA E, Laubscher]J A. ‘The food and meal pattern
during which respondents follow their usual in the urban African population of the Cape Peninsula,
diet, is included in dietary intervention studies South Africa; the Brisk Study.’ Central African Journal of
in order to provide for the placebo effect; Medicine 1994; 40:140-48.
@ depending on the design of the study, a refer- Bourne LT, Langenhoven M L, Steyn K, Jooste P L,
Laubscher J A, Bourne D E. ‘Nutritional status of 3-6 year-
ence group is included in dietary intervention
old African children in the Cape Peninsula.’ East African
studies to provide for possible influences such Medical Journal 1994; 71:695-702.
as the placebo effect and seasonal variation on
the outcome of the study.
EXAMPLE 28.2
EXAMPLE 28.1 24-hour dietary recall: coding food

Use of the 24-hour dietary recall intake for analysis
method in the BRISK study To code food intake data, food codes from the
MRC Food Composition Tables database are
In the Coronary Risk Factor study on the African used. The MRC Food Quantities Manual is used to
population in the Cape Peninsula (BRISK), diet- translate food intake in household measures into
ary information was collected on children and grams of food.
adults in order to describe dietary patterns in
these groups, and to investigate their eating pat- Time | Whatfood Howwas How much
tern as a potential coronary risk factor. The 24- ofday| anddrink it prepared was eaten
hour recall method was used, and the interviewers
were senior professional nursing sisters, fluent in 2 slices
the vernacular. (10 cm thick)
The deciding factors in the choice of method hard (brick) | 2 level teaspoons
were the large sample size (n = 1 500); the short lcup
interview time available; the low respondent 1 medium
literacy rate anticipated; and the site of data col-
lection (home interviews meant that plates, Food codes
spoons, and other containers were likely to be brown bread = 4002
available to assist in quantifying food intake). brick margarine = 6508
252
full cream milk = 0006 b Day of the week: Monday = 1; Tuesday = 2; etc.
apple = 7001 c Meal of the day e.g. breakfast = 1; in-between =
2Petc;
Computer coding sheets containing 80 blocks d The last block is used for the card number. If
per line can be used, or a special coding sheet (see more than 80 spaces are required for coding
example below) can be used. one meal, the identification line is repeated on
the next coding sheet and the card number will
CODING SHEET then be 2.
a a a e e b c d e Additional information can be coded, for exam-
4 0 0 2 0 0 6 0 ple, gender, experimental period, type of diet,
6 5 0 8 0 0 1 0 etc.
0 0 0 6 0 2 5 0
7 0 0 1 0 1 5 0 Second to 10th line:
In each line, the first four blocks are used for the
Explanation: food code and the last four blocks for the amount
The first line of the coding sheet can be used for of food eaten.
identification, e.g.: This information can then be computerized for
a First three or more blocks: code number of analysis of the data.
study respondent;
2S
29 Measurements
in dental
epidemiology
Introduction — periodontitis (advanced chronic gum disease)

Oral mucosal diseases including oral cancer
Until recently, community dentistry in South Africa
— Oral-facial complex cancer
was viewed as a ‘do-good’ discipline that lacked the
— Oral disease of the soft tissues of the mouth,
seriousness of more urgent primary health care
including aphthous ulcers, candida infections
projects. Due to the ‘mildness’ of oral health condi-
(thrush), lichen planus and leucoplakia
tions, oral health is often neglected, even though Dentofacial anomalies
dental caries constitute one of the most common
— malocclusion of teeth and jaws (crooked or
preventable diseases in South Africa.
misaligned teeth)
However, dental epidemiology has instilled in
— enamel opacities and fluorosis (teeth
community dentistry the tools with which to
stained due to high fluoride intake)
measure oral or dental disease, as well as a more
¢ Social dental indicators
rigorously scientific approach. The key issue in
dental epidemiology is that of measurement, and The international system of tooth identification is
this chapter will focus on this. used in the measurement of oral diseases. The
first digit specifies the quadrant of the mouth and
Measurement of oral disease the second the actual tooth. The permanent den-
tition is allocated first digits of 1 to 4, while the
The measurement of oral (dental) diseases and primary dentition is given first digits of 5 to 8. The
health status is undertaken by clinicians and dental teeth are thus numbered as follows:
epidemiologists in both qualitative and quantita-
tive terms. The dental clinician examines and maxilla
measures dental diseases at the individual level, primary dentition: 55 54535251 6162636465
focusing on a patient’s oral condition and then permanent: 18 171615141312 21 22 23 24 25 26
choosing a treatment plan in the best interests of 27 28
that patient. The dental epidemiologist, however, mandible
measures dental status and dental conditions at a permanent: 48 47 46 45 44 43 42 41 3132333435
particular point in time in the general population. 36 37 38
This point judgement requires standardized crite- primary: 85 84 83 82 81 7172737475
ria which differentiate between clinical and
epidemiological examination, as clinical investiga-
tions rely necessarily on subjective judgement. The Use of indices
principal focus of oral epidemiology has been the In order to standardize and have common criteria
measurement of oral disease in the following areas. for recording dental diseases, dental clinicians
have created indices which measure traits and
Dental caries (tooth decay) conditions in the mouth so that these can be
Periodontal disease numerically described.
— gingivitis (gum disease) Russell (1956) has defined an index as ‘numeri-
254
29 measurements in dental epidemiology
cal value describing the relative status of a popu- standard deviation and standard error.
lation on a graduated scale with definite upper The DMFS index is increasingly being used, as it
and lower limits, which is designed to permit and allows for the monitoring and evaluation of trends
facilitate comparison with other population clas- in the distribution and progression of dental caries,
sified by the same criteria and methods.’ and can therefore be used as a tool to evaluate the
quality of care. The index indicates the distribution
of-either caries incidence or treatment on each
Measurement of dental caries tooth and surface in an individual’s mouth. Dental
Dental caries indices planners are currently using the measures of this
The criteria and methodology for the diagnosis of index to plan future needs for oral health care, and
dental caries and the definition of related treat- as a quality assurance measure.
ment needs of individual teeth are well defined
(WHO, 1987). Four indices are commonly used to dinft
measure prevalence, incidence, and severity of
This is an index for primary teeth and is recorded in
a similar way to the DMFT except that it is only used
dental caries. They are:
for children of five years and under. The index can
@ DMFT for permanent teeth
range from zero to amaximum of 20.
@ DMEFS for permanent teeth surfaces
¢@ dmft for primary teeth
¢@ dmfs for primary teeth surfaces.
dmfs
The dmfs measures the status of each tooth surface in
the primary dentition. This score can range from zero
DMFT to 88. Recording of dmfs and dmft from the age of six
The DMFT index measures the status of perma- years and upwards is problematic, as it is difficult to
nent teeth in young people and adults. Each tooth ascertain whether a primary tooth has been exfoliat-
in the mouth is examined using set criteria, and ed or extracted when looking at missing teeth.
teeth are classified as decayed (D), missing (M),
filled (F), or sound. The sum of decayed, missing, Comments on dental caries indices
and filled teeth gives the DMFT score for the indi- These indices record past caries history and treat-
vidual. DMFT scores can thus range from zero to ment received by individuals. Certain difficulties are
32 units. Teeth which are unerupted are not experienced when allocating scores to extracted teeth
scored as missing, but described as unerupted (teeth that are missing) due to caries, as the tooth loss
teeth, and thus not reflected in the score. may be due to periodontal disease. The DMFT index
In epidemiological surveys on a population, each reaches a saturation in older age groups, thus pre-
individual’s DMFT score is obtained and a mean venting registration of further attack. When using the
value with standard deviation or standard error is DMEFS index, the allocation of a score to extracted
calculated for the population. Distribution of teeth is a problem as the tooth may have been at-
scores often deviates from the normal, and the tacked on one surface only, even though its extraction
median, 25th, and 75th percentiles should then be score is four or five surfaces (depending on the tooth
used as summary statistics. type). The other disadvantage of the indices is that
each tooth is scored the same regardless of the severi-
DMFS ty of the lesion. The index merely notes whether a
The DMEFS index measures the status of each tooth tooth or surface of a tooth is decayed, filled, or miss-
surface of permanent dentition. It is computed in ing. The indices do not differentiate between arrested
the same way as the DMFT, except that the unit of caries and active caries, or between small lesions
measurement is the surface of the tooth rather than detected clinically or by radiographs. Furthermore,
the tooth itself. In the adult permanent dentition, a occlusal caries have become difficult to diagnose
total of 144 surfaces is recorded for this index: five clinically because of the impact of fluorides on teeth.
surfaces per tooth, except for the canines and inci- There is recent evidence that the diagnosis of secon-
sors which have only four. Thus, it is possible to have dary caries along the wall lesion in restored teeth is
a DMES score ranging from zero to a maximum of difficult to make. It is only certain when there is sub-
144. Any new lesion or filling can be identified and sequent tooth cavitation of the filling or tooth.
progression recorded. This index is therefore more Caries indices give an indication of past and treat-
sensitive than the DMFT, and is able to reveal small- ment experience. In longitudinal studies, they can be
er changes in caries progression. The increased re- used to measure the rate of caries attack and, in clini-
cording units of this index allow for more variability cal studies and preventive programmes, the efficacy
in the individual and mean scores, thus affecting the of preventive agents (toothpaste, for example).
255
Periodontal health Other plaque and calculus indices

There are numerous other indices which measure
Periodontal health is the general term describing
plaque (debris).and calculus (calcified plaque) in
the status of the periodontal tissues, which
the mouth. Most of these indices are used by clin-
include the gingiva, gingival and connective
icians for measuring specific objectives. The
tissue attachments. The measurement of peri-
epidemiologist, however, requires an index which
odontal health is based on two broad categories:
is rapid and easy to use, and may therefore opt for
1 oral cleanliness
the WHO recommendation of recording soft
2 disease related to the gingiva and periodontium.
deposits and calculus visible to the naked eye as
To measure these categories, clinicians and dental
present or absent.
epidemiologists have formulated indices to
The clinician in private practice may want to
measure the extent and severity of the dirt (plaque)
take a more detailed record, and will therefore use
in the mouth, and the inflammatory conditions of
more sensitive indices suited to individual use,
gingivitis and periodontitis. These indices are either
namely the Hygiene Index and the Simplified Oral
reversible or irreversible. Recent developments in
Hygiene Index.
measurement of periodontal disease include a
composite index which provides a measure for
severity and treatment need. The Simplified Oral Hygiene Index (OHI-S)
The Simplified Oral Hygiene Index (OHI-S)
(Greene, Vermillion 1960) is an index which
Measurement of plaque and debris
measures the extent of debris (plaque) and calcu-
The measurement of cleanliness has been record-
lus (tartar) on the surfaces of individual teeth.
ed by several indices, but the two most common
One component measures the extent of plaque
are the Simplified Oral Hygiene Index (OHI-S)
on the surfaces of teeth. Each designated tooth
and the Plaque Index (PI).
surface is assigned a score of 0 to 3, expressing the
extent of debris on the tooth.
Plaque Index (PI) by Silness and Loé, 1964 A second component measures the extent of
The Plaque Index is a relatively crude index suit- visible calculus above the gum margin that is
able for epidemiological studies to determine oral present on the same teeth surfaces measured for
cleanliness. It is frequently used to demonstrate plaque. These scores also range from 0to 3.
the effectiveness of oral hygiene measures in The mean debris and calculus scores for each
tooth-brushing programmes for children and for patient are determined by adding the individual
reinforcing oral hygiene promotion among adults score and dividing by the number of surfaces exam-
at individual or group levels.
ined. The patient’s individual OHI-S is obtained by
The most important consideration in the index is adding the mean debris and calculus scores.
the measurement of the thickness of plaque (sticky The examination is limited to six permanent
dirt or debris on teeth) along the gingival margin area tooth surfaces: the labial surface of the upper right
on all four surfaces of each tooth. The plaque index is central incisor (tooth 21), the labial surface of the
computed for all surfaces — mesial, distal, facial, and lower left central incisor (tooth 41), the buccal
palatal — on all or selected teeth, or specific areas of
surfaces of the upper first permanent molars (teeth
teeth. To reveal plaque, a light source, gentle drying 16 and 26), and the lingual surfaces of the lower first
of the teeth and gingiva, a dental mirror, and probe permanent molars (teeth 36 and 46).
are required. Plaque is not stained. The PI is aggre-
gated by scoring each of the four surfaces of the
selected teeth from zero to three, and then adding Measurement of gingivitis and
the scores and dividing by the number of teeth. The periodontitis
score from zero to three is in an ordinal scale, and the
categories describe the differences in presence of Gingivitis
plaque on the surfaces. Gingivitis is the inflammation of the gingiva char-
0 = tooth surface is clean with no visible plaque acterised by redness, swelling, and pain. The
1 = tooth surface looks clean but plaque can be condition is reversible with good oral hygiene and
removed from its gingival third with a sharp effective plaque control. Gingivitis is measured by
explorer the use of different gingivitis indices. The most
2 = visible plaque commonly used is the Gingival Index of Loé
Q ll tooth surface covered with abundant (1967). Dental epidemiologists use the Gingival
plaque Index (GI) in conjunction with the Plaque Index.
256
Gingival Index (Loé 1967) The PDI thus contains a gingivitis index in scores
This index involves numerical scoring of four 1, 2, and 3, and a measure of attachment loss
areas — mesial, distal, facial/buccal, and lingual — independent of gingivitis in scores 4, 5, and 6.
of selected teeth. It is a partial recording system;
six teeth are selected for examination, namely 16, Community Periodontal Index of Treatment
12, 24, 34, 42, and 46. The scores from the four Needs (CPITN)
areas are added together and averaged to give the The WHO and the International Dental Feder-
Gl for the tooth; these are in turn added and ation (FDI) have recommended the use in epide-
divided by the number of teeth measured to miological surveys of a periodontal index which
express a mean for the individual. The score is measures both the level of periodontal disease
attributed according to the following criteria: and the need for treatment. This index is known
0 = normal gingiva; no bleeding on probing as the Community Periodontal Index of Treat-
1 = mild inflammation, slight change in colour, ment Needs (CPITN).
slight alteration of gingival surface, no Three indicators of periodontal status are mea-
bleeding sured in this assessment:
2 = moderate inflammation, erythema, swelling 1 presence or absence of gingival bleeding
3 = severe inflammation, severe erythema and 2 supra- or subgingival calculus
swelling, tendencies towards spontaneous 3 periodontal pockets, subdivided into shallow —
haemorrhage, some ulceration. (4-5 mm) and deep (6+ mm).
A specially designed periodontal probe with a 0,5

Periodontal disease
mm ball tip is used, bearing a black band which is
Periodontitis is the inflammation of the perio-
between 3,5 mm and 5,5 mm from the ball tip.
dontium resulting in deep pocketing, loss of
The mouth is divided into sextants defined by
attachment and tooth mobility, and loss of func-
teeth numbers 18-14, 13-23, 24-28, 38-34, 33-43,
tion of the tooth. The use of indices to determine
and 44-48. A sextant should be examined only if
the severity of periodontitis, the degree of inflam-
there are two or more teeth present which are not
mation, and the destruction of supporting struc-
indicated for extraction. When only one tooth
ture (attachment loss) must be evaluated. There
remains in a sextant, it should be included in the
are a variety of periodontal indices, the most
adjacent sextant.
useful of which are described here.
For adults aged 20 years and over, the teeth to be
The Periodontal Disease Index of examined are: 17, 16, 11, 26, 27, 47, 46, 31, 36, and
Ramfjord (1959) 37. The two molars in each posterior sextant are
This is the most common and useful index for paired for recording and if one is missing, there is
epidemiological studies. The system is based on no replacement. If no index teeth or tooth is pres-
partial recording of six: typical teeth commonly ent in a sextant qualifying for examination, all the
known as the Ramfjord teeth: 16, 21, 24, 36, 41, remaining teeth in that sextant are examined.
and 44. If one of these teeth is missing, the tooth For young people up to the age of 19 years, only six
distal to the missing tooth may be substituted. teeth are used: 16, 11, 26, 36, 31, and 46. This modifi-
The measurement of periodontal pocketing is cation is made to avoid classifying the deepened
carried out using a special probe (University of crevices associated with eruption as periodontal
Michigan O) calibrated from zero to > 9 mm. The pockets (false pockets). For the same reason, when
criteria for the scoring of the PDI are: assessing children younger than 15 years, recording
0 = noinflammation, no alteration of gingiva of pockets should not be attempted; only calculus
1 = mild to moderate gingivitis at some location and bleeding should be considered.
on the gingival margin The WHO recommends that a special probe
2 = mild to moderate gingivitis of the entire should be used for ‘sensing’ to determine pocket
gingival margin surrounding the tooth _ depth and to detect subgingival calculus and
3 = advanced gingivitis with severe erythema, bleeding responses. The results are recorded with
ulceration, and spontaneous bleeding the highest scores in the appropriate box in
4 = upto3mm ofattachment loss, measured descending order of severity as follows:
from the cemento-enamel junction 4 = pocket6+mm
5 = 3-6mm of attachment loss 3 = pocket4or5mm
6 = more than 6 mm attachment loss 2 = calculus present
200
1 = bleeding present 8. Candididias (112.00 —09)

0 = healthy 9. Recurrent oral aphthae (528.20)
The CPIIN assesses the status of the gingiva, oral
debris, attachment loss, and periodontal pockets. Most epidemiological data of the oral-facial
This gives a composite assessment of the periodon- complex is further classified according to the
tal status of the individual. From this, it can be de- origin, site, and type of lesions. Recent trends in
termined whether an individual requires complex the diagnosis of these conditions have empha-
treatment for the management of pockets; oral sised the need to look at the risk factors when
hygiene instruction and scaling to remove calculus measuring the prevalence of the diseases
in case of moderate pockets; oral hygiene instruction (Johnson, 1994).
alone when the only signs are bleeding on probing, ‘ Currently, more than 80% of oral cancers are of
orno treatment when none of the above is found. the squamous cell type, which is the third most
The index is quick and easy to use, especially in common type of cancer in the developing world.
large epidemiological surveys. The data may be The standardized age incidence rate between 1988-
expressed as frequencies and mean values for the 1992 in South Africa for oral pharyngeal cancer was
various variables. However, some researchers hold 25,8 per 100 000 (Hille, Shear, Sitas [in press]).
the view that the use of the CPITN index overesti-
mates the treatment needs within a population, as
the treatment strategies are based on one-to-one
Assessment of malocclusion and
operator to patient strategy. Furthermore, if these dentofacial anomalies
treatment need results are used for planning The WHO (1977, 1987) has noted that there is no
purposes, the resources apparently required would universally accepted method for the recording of
be in excess of what most service providers would dentofacial anomalies that are of significance to
be able to afford. the public health planner, although there are
numerous methods to measure anomalies for the
Measurement of oral mucosal individual based on treatment needs. Attempts
have been made by the WHO (1987) and the FDI
diseases (1982) to develop measures which would be able
The WHO has developed several guides for the to measure occlusal traits of populations.
measurement of oral diseases. Two guides are of
special interest for the collection and reporting of Diagnostic classification
data for oral mucosal diseases in a standardized Angle’s classification (1899) is the best known of this
form: the International Classification of Diseases: type of diagnosis, and has been refined to include
Application to Dentistry and Stomatology (ICD- incisor classification for grading the severity of the
DA) (WHO, 1978) and the Guide to Epidemiology anomaly between the two arches (mandibular and
and Diagnosis of Oral Mucosal Disease and Con- maxilla) in the mouth. This classification has limita-
ditions (WHO, 1980). The prevalence and incidence tions, as it essentially records a diagnosis of a trait
of oral conditions are expressed as simple counts that deviates from the norm, without indicating
that may be turned into rates and proportions using treatment need.
an appropriate and standardized denominator. The WHO (1987), in its pathfinding survey
manual, recommends a simple categorization of
Classification of common oral mucosal malocclusion. Two levels of anomaly are distin-
diseases guished: very slight (a twisted or tilted tooth or slight
1. Carcinoma (140-149 of ICD-DA) crowding); or spacing and other anomalies that are
2. Leucoplakia (528.6X, 523.84) generally regarded as having an unacceptable effect
2.1 Homogeneous: lesions that are uniformly on facial appearance, or which cause significant
white reduction in masticatory or speech function.
2.2 Non-homogeneous: lesions in which part of The FDI (1982) has recommended the assess-
the lesion is white and part appears ment of occlusal traits to determine the prevalence
reddened. of malocclusion and dental irregularities and to
. Erythroplakia (528.70) estimate the treatment needs of a population, to
Leukokeratosis nicotina palati (528.72) enable basic planning of orthodontic services. The
. Lichen planus (698.00 —- 697.09) examination is to be made on children with perma-
. Oral submucous fibrosis (528.80) nent teeth, as some of the minor problems of occlu-
. Acute herpetic gingivastomatitis (054.20) sion in the stage of development are self-correcting.
258
The occlusal traits recorded refer to spacing and/or @ fluorosis

crowding of anterior teeth, the measurement of enamel hypoplasia
maxillary/mandibular overjet and overbite, and enamel opacities
anterior/posterior relationship of the molars. The ¢ intrinsic staining.
traits give a composite picture which determines
treatment need, and which can be recorded. Dental fluorosis is a hypoplasia of the dental
enamel caused by excessive ingestion of fluoride
Treatment need indices during the period of tooth calcification. The
Two occlusal indices have been developed to assess degreee of hypoplasia depends on the quantity of
treatment need (Index of Orthodontic Treatment fluoride ingested during the period of calcifica-
Need or IOTN) and the standard of treatment tion. The degree of hypoplasia can range from a
needed (Peer Assessment Rating or PAR Index). barely noticeable condition to severe and ugly
The IOTN assesses and ranks malocclusion in brown staining, with pitted and even flaking
terms of the significance of various occlusal traits enamel. For purposes of quantitative recording of
for individual’s dental health and _ perceived the various degrees of severity of defects of
aesthetic impairment. The purpose of the index is enamel due to fluoride ingestion, several indices
to identify those individuals who would most likely have been developed. These are:
benefit from orthodontic treatment. Dean’s dental Fluorosis Index
The PAR Index consists of a scoring system and @ Al-Alous Modified Fluorosis Index
a ruler that allows analysis of a study cast in @ Modified Fluorosis Index of Fejekov-Thylstrup
approximately two minutes. The PAR Index pro- ¢@ Community Fluorosis Index.
vides a single summary score for overall align-
ment and occlusion. Depending on the objectives of the study, differ-
Both indices are in use in industrialized coun- ent indices may be used. More details of each
tries where the demand for orthodontic treat- index can be found in the dental literature.
ment in the public and private oral health services
is high. However, there has been limited use of
Social dental indicators
these indices in developing countries, because of
fiscal and human resource constraints. The majority of dental indices measure the
morbidity and mortality of dental diseases, and
Comments on the assessment of malocclusion do not include the social and psychological
Approaches to the diagnosis and treatment of mal- impact of dental disease in the assessment of
occlusion have two distinct pathways. The ortho- dental status. As we know, the WHO (1978) has
dontist dealing with the individual is primarily provided a definition of health which extends
concerned with the minute functional capacity and beyond the absence of disease and incorporates
aesthetic appearance of teeth. The public dental both social and psychological aspects of health.
health planner is concerned with an assessment of Jago (1977) and Cushing (1985) have advocated
the prevalence of malocclusion in the general the development of social dental indicators which
population, and concerned with major dentofacial incorporate the clinical and social characteristics
anomalies which affect gross appearance and func- of dental experiences and self-assessment of oral
tion. The additional problem in an acceptable health. Socio-dental indicators would be able to
approach to treatment need indices is the whole meet the needs of patients as well as clinicians.
question of perception of need and occlusal dishar-
mony by both the individual and the dental profes- Trends in oral health
sional. In developing countries where human,
financial, and physical resources are limited, ortho- Dental caries have declined significantly in a
dontic treatment has a very low priority. number of developed countries (Downer 1993).
Some of the reasons for the decline have been
given as increased use of fluorides, improved oral
Assessment of other conditions hygiene measures, and reduction in sugar
Measurement of fluorosis consumption. The prevalence and severity of
The universal use of fluorides at the individual dental caries in developing countries continue to
and community level has indicated the need for increase; for example, the prevalence rate among
assessment of enamel opacities. The most city dwellers in some African cities was between
common conditions measured are: 51% to 93% (Manji, 1991). In South Africa, the
259
mean DMFT for 12-year-olds is 1,7, with a preva- Conclusion

lence of caries of 26% (du Plessis, 1989). This is
considered low according to WHO indicators for Oral diseases can be measured as simple counts
this age group. and expressed as rates and ratios for prevalence
Recent findings from longitudinal studies and incidence measures, if only trends in oral
support the concept that periodontal destruction diseases need to be obtained. More detailed and
proceeds in random bursts at specific sites, and advanced indices have been devised by dental
demonstrate that some individuals have a higher epidemiologists to measure the severity and extent
risk of attachment loss. There is further evidence of oral diseases. Recent trends have included
that cigarette smoking is a significant risk factor in composite indices that measure treatment needs
causation of dental disease, and there is evidence (CPITN), types of treatment, and the effects of
of HIV-associated periodontal disease. Periodontal preventive measures given to individuals and
disease has become more prevalent and severe in communities.
populations in developing countries, especially in
Asia and Africa. CPITN data from the NOHS in
South Africa (1989) shows that gingivitis is ubiqui-
tous in lower socio-economic groups.
260
30 Health
economics
Introduction ductive capacity are intrinsically linked to the

health of its population. Poor health can impose
This chapter aims to introduce the reader to economic hardship on individuals, households,
health economics. In doing so, it will show how and nations. The economic costs of ill health in-
the supply and demand for health care differs clude loss of days worked, loss of income, de-
from that for most other goods, and it will high- creased productivity, the opportunity cost of
light the economic objectives of health care. It will medical services, and a fall in savings. Studies
also describe methodologies for economic evalu- have shown that in some African countries, pro-
ation. This includes the identification of the ductivity of workers could be increased by as
inputs and outputs of health interventions, an much as 20% if morbidity rates were reduced
introduction to the costing of these interventions, (World Bank 1993). This holds tremendous impli-
and an explanation of how the results from epi- cations for a country’s Gross Domestic Product.
demiological studies can be used in economic Effective antenatal and well-baby interventions,
evaluation. and adequate childhood nutrition have a signi-
Economics involves the analysis of trade-offs ficant impact on an individual’s health and well-
between scarce resources. Decisions on how best being over the entire lifespan. In economic terms,
to use available resources are made daily at the this translates into a population that is more pro-
national, sectoral, and household level. In par- ductive and less of a liability on health care re-
ticular, evaluating the potential costs and benefits sources.
of choosing one alternative over another — the Conversely, economic hardship at the indi-
opportunity cost — is important to economic vidual and national level can result in poor health.
thought. Ahousehold’s health status is affected by the food
The relatively new discipline of health eco- consumed, the source of drinking water, self-care
nomics applies economic principles to health practices, access to health services and medica-
care settings. Because good health is an integral tion, and education levels. Many of these are
part of individual and national wellbeing, eco- related to level of income. Changes in the nature
nomic principles are applied to establish how of a nation’s economy (such as a decline in eco-
best to allocate scarce resources to achieve im- nomic growth or a shift in production patterns)
proved health status. Studies in the economics of can also affect health status adversely. Other eco-
health have investigated questions such as what nomic phenomena that affect a nation’s health
proportion of a nation’s resources should be de- status include the extent to which it offers social
voted to health, how a nation’s health expendi- welfare programmes and safety nets for the poor,
ture should be allocated, and how the mix be- its income distribution, the percentage of the
tween private and public health care provision population’s income spent on basic necessities
can be optimized. such as food, and the unemployment rate. Hence,
Many countries see health as a valuable invest- the overall economic development of the country
ment. A nation’s economic development and pro- has a direct effect on the health of its population.
261
This explains why health can be considered an contributes to market failure.

investment commodity or good. In order to minimize the unpredictability of
demand, insurance companies are likely to select
the youngest, healthiest members of society as
The health care market
clients. Those who are chronically ill, unable to
In order to undertake economic evaluations of afford premiums, or considered to be high risk,
health interventions, we must understand the tend to be excluded. This phenomenon of ad-
nature of the health care market. Health, like verse selection further skews the workings of the
other products or goods, has buyers and sellers, health care market.
or consumers and providers. However, health is
fundamentally different from other commodities Economic objectives of health care
bought and sold in the marketplace. The way this
difference is perceived by the various stake- Efficiency
holders affects the functioning of a nation’s The principle of efficiency in health economics
health care market, as well as debates concerning addresses whether limited resources are being
access and equity. used in the best possible way. It also determines
In the ideal marketplace, consumer and pro- whether resources are used in a way that achieves
ducer decisions are triggered by price mech- the best value for money. Three types of efficiency
anisms. Price is the measure of how much income can be assessed in health economics.
a consumer is willing to sacrifice for a good, and The first is technical efficiency, which looks for
determines a producer’s decision regarding how combinations of the fewest possible resources or
much ofa product he or she is willing and able to inputs needed to meet the objectives of a service
supply. In order for the combination of consumer or programme. For example, it would determine
and producer decisions to work well, or for a various combinations of the least number of staff,
market to be perfectly competitive, the market equipment, supplies, etc, needed to treat 500 out-
must fulfill certain criteria outlined in Table 30.1. patients a day at a clinic.
The health care market, for the most part, does The second is economic efficiency, which
not operate according to the theory of perfect establishes the least cost combination of inputs
competition. This ‘market failure’ in the health needed to achieve a desired output — treatment
sector, where certain conditions are not met, im- of 500 clinic outpatients in this example. Cost-
plies that there is an inefficient allocation of effectiveness analyses and cost-utility analyses
health care resources. The special nature of the assess technical and economic efficiency.
health care market is contrasted with perfect Third is allocative efficiency, which looks at the
competition in Table 30.1. This illustrates the health system as a whole and determines whether
extent to which the government needs to inter- pursuing a certain policy or programme is worth-
vene in the market to rectify instances where the while in the first place. Next it assesses whether
market fails to achieve efficiency and societal services are being provided at least cost and
objectives. The nature and extent of such inter- whether economies of scale (the most appro-
vention can be determined by economic evalu- priate level of service at the lowest cost) have been
ations. achieved. (Cost-benefit analysis, described
The health care market’s failure to conform to below, does this.) For all of these models, detailed
the criteria of perfect competition has many information on the inputs and outputs of a pro-
implications. For example, risk and uncertainty gramme is needed.
results in the development of insurance markets.
As the need and cost of health care generally Effectiveness
cannot be predicted (criterion 3), individuals may Health planners seek measures of effectiveness to
try to protect themselves against the unknown by establish whether health care interventions
purchasing health insurance. Health insurance achieve their stated objectives, while health eco-
gives rise to two potential problems in the health nomists use measures of effectiveness in cost
care market: moral hazard and adverse selection. effectiveness analyses to assess efficiency. Health
Many argue that insured patients present little economists depend on epidemiologists for in-
incentive to their doctors (or themselves) to eco- formation about the effect of health interven-
nomize on health care. This potential to over- tions. Firstly, they need to confirm that the inter-
supply and/or overuse services (moral hazard) ventions they are evaluating are effective. Health
262
30 health economics
Table 30.1 The health care market contrasted Te Paes

Re OEE ted
CRITERIA FOR PERFECT
COMPETITION — THE HEALTH CARE MARKET
a Many buyers There is often a sole purchaser of health care (a monopsonist);

i.e. the state, large enterprise, or insurance companies.
Self-interest (producers Consumers may not always seek to maximize utility; those who
seek to maximize profits and require health care may be unwilling or unable to seek it. Thus,
consumers seek to maximize health is sometimes considered a merit good; one which warrants
satisfaction gained from government intervention to ensure that it is not under-consumed.
consuming a good or utility)
Predictability of need and ll-health is often unforseeable, and hence the demand for health
cost care and the related costs are difficult to predict.
Full information regarding The supplier of health care usually holds a monopoly on
the product information, which limits competition and may encourage
monopolistic behaviour.
The ability of the consumer Health workers have greater understanding and access to
to assess fully the benefits information than the patient, and thus are directly involved in the
and opportunity cost of a patient’s decision to consume health care. This confers monopoly
product — consumer powers on the provider and hinders the consumer's ability to make
sovereignty an informed choice of provider. This hinders consumer sovereignty.
- Ahomogeneous product Neither health care nor its suppliers (public, private, NGOs)
are homogeneous.
There are many sellers and The long training period for health professionals and professional
they can easily enter and licensing bodies create barriers to easy entry into the health
leave the health care market care market.
(no barriers to entry)
No substantial fall in unit Several components of health care provision lead to economies
costs with larger-scale proof scale and natural monopolies. Cases in which only a few
duction — or no significant suppliers exist (such as hospitals and pharmaceutical
economies of scale — that companies) negate perfect competition.
would encourage monopolies
No spillover effects When health interventions affect society at large, rather than just
(externalities) resulting the individual receiving care, they are considered public goods.
from production or Externalities (spillover effects) are benefits or costs that have
consumption important implications for economic analysis. Being treated for
gonorrhoea, for example, benefits not only the persons involved,
but also decreases their sexual partners’ chances of getting
the disease. If society is better off because it knows that the sick
are being well cared for, this is a spillover effect of health care
(a caring externality).
263
@ epidemiology applied to content areas
interventions are deemed effective when they In general, if resources are not being allocated
result in improved health status. to those with the greatest needs, the efficiency of
In epidemiology, the evaluation of effectiveness the service is reduced. This is an important issue
can involve the measurement of health outputs or to consider in a decision-making or policy con-
outcomes resulting from an environmental text. Thus the situation in South Africa has not
change or a specific health care intervention. only been inequitable, but inefficient. In this con-
Whereas epidemiology compares the outcomes text, an intervention that is technically efficient is
of various interventions, an economic evaluation also equitable. Any attempt to extend services to
of effectiveness compares the cost of these out- those in greatest need will improve the efficiency
comes. The results from various epidemiological of the health care system.
studies (such as randomized control trials and If the goal is equity of access and utilization of
case-control studies) that measure effect are used health services, efficient use of scarce resources
for this type of economic evaluation. For exam- may have to be sacrificed for equity gains. While a
ple, the value of resources put into health care can 100% immunization rate is an equitable goal, a
be compared with indicators of effectiveness such slightly lower rate of 90% may be more efficient. A
as a change in blood pressure or years of life lot of resources would have to be used to achieve
gained. Thus, indicators of effectiveness link the the last 10 percent. Both national governments
results of epidemiological studies to economic and local health policy-makers are faced with
analyses of efficiency. such trade-offs between equity and efficiency.
Note that effectiveness (that is, it works) does not These decisions at the margins are often the focus
automatically mean that an intervention is efficient of health care debate.
(it works and is the best value for money). However,
technical, economic, and allocative efficiency are
contingent on effectiveness.
Economic evaluation
Economic evaluation considers two components
Equity of the production process for any given activity,
When assessing the economic objectives of health namely inputs and outputs. Inputs are resources
care, we must ask to what extent the allocation of such as labour and equipment that are put into
health care is equitable and reflects society’s will. producing goods and services, in this case health
While equity is usually a specified policy for health in care. Outputs are the results of production, while
most countries, it is not a straightforward notion. the process describes the transforming of inputs
Most simply put, it questions whether some mem- into outputs — the final result of the production
bers of society are willing to sacrifice some health process. In economics, the relationship between
gain in exchange fora ‘fairer’ distribution of health these components is analysed within the frame-
gain for all. work of the production function, which repre-
There are two dimensions of equity to consider. sents the relationship between inputs and out-
Horizontal equity means equal treatment of puts. The production function would indicate
equals; this would require that all those with similar that the output of one cured TB patient is de-
health care needs be given equal opportunity to pendent on (or a function of) the interaction be-
access and treatment. For example, in South Africa, tween inputs such as staff, medicines, and equip-
there has been much debate on how to ensure that ment.
in each province everyone has equal access to PHC. The health sector presents special problems for
Vertical equity means unequal treatment for un- economic analysis because it does not produce
equals; it implies equality in service utilization and simple units of output, but several different types
positive discrimination towards those who are less of outputs simultaneously. Health cannot be
willing or able to use health services. Thus, those expressed as a simple variable; thus various
with greater needs (more severe illnesses, inability output measures are expressed as a proxy for
to pay, and so on) would obtain concessional ‘health’. Moreover, a number of different inputs
access. For example, more resources for TB preven- are used in the production of health; these can
tion and control might be directed toward an area often be difficult to identify and quantify. The fol-
with a very high prevalence of TB, or lower-income lowing sections on inputs and outputs will pro-
groups may have their treatment and transport to vide researchers with some useful tools to use in
the services subsidized. economic evaluation.
264
30 health economics
Inputs costs, on the other hand, will vary depending on

Economics assesses the different combinations of the number of patients and procedures. Exam-
inputs in terms of their effect on output. This may ples of variable costs are syringes, sutures, or
include determining the impact on output of swabs. ,
changing the relative proportions of labour and A classification of costs often used in financing
equipment, assessing the impact on output of and budgeting exercises is capital and recurrent
changing the amount of one input while keeping costs. Capital costs (many of which are fixed
others constant, or establishing the cheapest and costs) refer to resources whose lifespan is greater
most efficient way to produce a service. than one year. They include the cost of buildings,
While some inputs (such as personnel num- vehicles and equipment, and staff training. The
bers, equipment, and drugs) are relatively easy to value of capital goods is calculated based on their
quantify, other inputs that go into producing current replacement cost. This cost can be
optimal health care (such as socialization, cul- divided over their projected lifespan and adjusted
ture, networks of friends, workmates and care- using a suitable discount rate. In this way, capital
givers, personal preferences, as well as clinic care) costs can be added to operating or recurrent costs
are less easy to quantify. While an ideal economic in annual budgets or financial statements to show
evaluation would incorporate all components of total costs. Recurrent costs refer to those things
the production process, the focus is usually on the purchased regularly or used within a fiscal year.
quantifiable ones below. Among others, these include staff salaries, sup-
In order to conduct an economic appraisal of plies, operation, and maintenance. An example of
goods and services that make up the delivery and a budget for a PHC clinic is Shown in Table 30.2.
utilization of health care, one must assess their The costs discussed above form part of the in-
cost. While the everyday notion of cost and the trinsic cost of manufacturing a good and/or pro-
understanding used in economics are not all that viding a service. Such costs incurred in provision
different, certain important distinctions must be of health care are termed direct costs. Direct costs
noted between lay terminology and strict eco- may also include government grants, donations,
nomic usage of the term. volunteer time, and costs to the consumer (pa-
tient) such as user fees, medication, and travel to
Identifying costs and from the clinic.
Economic cost is generally seen in terms of sacri-
ficing something. This may be money, time, or
another good. The value of sacrificing one good Pel Ceetee eeUe mile
for another is termed the opportunity cost. If one a PHC clinic: an example
considers the choice, for example, between buy-
ing a new dialysis machine or extending measles
immunization coverage to a further 500000
people for the same amount of money, the oppor-
tunity cost of the extended coverage is the dialysis
machine conceded.
The total cost of a good or service is the sum
total of all costs that contribute to its production.
The total cost of providing an immunization ser-
vice for a given area, for example, includes
salaries, equipment for and maintenance of the
refrigeration system, the immune sera, buildings,
and many other aspects of the service. Total cost ‘Personnel =
comprises fixed and variable costs. Supplies
It is important to distinguish between fixed _ Vehicle maintenan
costs and variable costs. Fixed costs (often refer- —Inservicetrainng =
red to as ‘overheads’) tend to remain constant _ Other operating costs _
regardless of the quantity of output. A hospital,
for example, will have a number of relatively fixed
costs (buildings, rental, equipment, etc), inde-
pendent of the number of patients seen. Variable
However, the patient often has further costs not reflect their presence, government intervention
directly related to the provision of the good or ser- may be warranted. For example, if fees are set at
vice itself. These indirect costs borne outside the market rates, individuals with gonorrhoea may
health sector and incurred not only by patients, well decide not to go to a STD clinic. Given the
but also by others, may include time lost from implications of this, a good case could be made
work, or the consumption of something else fore- for state subsidization of such services. Some
gone. For example, the process of having one’s commodities or services would otherwise not be
child immunized against measles includes wait- produeéed at all, whereas others would be pro-
ing time (an opportunity cost) and subjective duced on an inappropriate scale. It is important
costs, which are often intangible and difficult to to consider these if we seek to reflect the true cost
measure. The economic value of these subjective or benefit of an intervention.
costs (pain, changes in normal daily activity,
mobility, apprehension, anxiety, and decreased Measures of cost
social performance) are included in the category To ascertain the unit cost or average cost of an
of indirect costs. The section below on Quality immunization service, for example, one must
Adjusted Life Years (QALYs) looks at these issues divide the total cost by the number of immuniza-
in detail. tions done. For a given number of immuniza-
Table 30.3 gives an example of a breakdown of tions, this establishes the average or mean cost
direct and indirect costs. In this case, it shows the per immunization.
direct costs (hospital, TB, drugs, outpatients) of The marginal cost of a service or good is the
HIV/AIDS in Lesotho from 1994-8. Indirect costs cost of producing (or providing) one extra unit of
are expressed as the value of productive years of that good or service. One does net use fixed costs
life lost as a result of AIDS morbidity and mortal- (for example, buildings) in determining the mar-
ity. ginal cost, but looks exclusively at changes in vari-
. The social cost or benefit of any one interven- able costs. The marginal cost of extending im-
tion is the sum of the cost or benefit to the indi- munization coverage to one more person may be
vidual and any additional cost or benefit to small (if that person lives on a route that is already
society. Since these additional benefits may be covered by the service), or large (if that individual
quite extensive, and market mechanisms do not lives some distance away from the normal route
peel CmecLUMO ae mcrae mes odie

Creel aeRh a) ge
Hospital — TOTAL
patient | Direct COSTS
Discounted | *PYLL Direct +
Indirect
1994 |2545 843 [4186546 |87616 |294 365 7114369 |6240676 | 24441810 | 30682486
1098)4683798 |5800651 [161900 |445500 | 11091850 |9017793 | 45553460 | 54571 253
1996 |7 703489 |7 525332 |268717 |630673 | 16128211 | 12 311 611 | 75875013 | 88186624
1997 14 603 169 |9 255 230 |409606 |844902 | 22112907 | 15 794 932 | 115 718 702 : 131 513 634
1998 | 16 263 122 |10 891 143/580 967 |1078870 28 814 102 | 19 209 401 | 164 094 254 | 183 303 655
*Value of potential years of life lost in monetary terms

Source: McMurchy D, 1994
266
30 ~=health economics
taken). This cost is an important one to consider This is particularly important ifan economic
when assessing whether to expand or contract a evaluation that compares different treatment
service. options is to be carried out. Alternatives can be
It is often difficult to calculate the marginal cost identified from the clinical literature, in discus-
unless detailed data is available. The researcher sions with experts in the field, and from present
could calculate the additional cost of extending a providers of a given service. It may be necessary
service to a group of people (for example, an addi- to construct a number of possible alternative
tional 100 patients) rather than calculating the models for a given service. Note that this step
cost of immunizing one more person. This is may be unnecessary if one is simply costing an
called incremental cost. Calculating the cost of intervention and a formal economic evaluation
extending an immunization service to a rural is not to be carried out.
village not previously covered will give one the
Assemble data and calculate/estimate costs
incremental cost of providing this service. The
involved
incremental unit cost (incremental cost divided
by village population) can be used as an altern- Sources for cost data may include accounts,
ative to the marginal cost. budgets, patient folders, management inform-
ation, staff employment records. In some cases,
Acosting framework: tools and tips for economic only total cost data are available and it may be
evaluation necessary to estimate costs based on the best
In undertaking a formal costing study of a given information available (staffinterviews might be
service, all costs must be taken into considera- useful). A way of minimizing imprecision is to
tion. There are a number of accessible texts that discount for future costs, and to conduct sensi-
address the process of costing (for example, tivity analyses (see pp. 270, 272).
Creese, Parker [n.d.]), and only a brief overview
@ Exclude costs
will be given here.
Some costs (for example, externalities) may be
Identify all elements involved in the interven-
irrelevant to the analysis performed, or too
tion being evaluated
expensive or time-consuming to collectina
This is achieved most easily by charting all comprehensive way. The analyst needs to
activities that are part of a given health care decide what to include in the costing on the
delivery process (for example, a clinic or an basis of its feasibility and likely implications.
immunization service). Costs may need to be For example, he or she: might decide to exclude
broken down into detailed categories: what, for indirect costs when costing a primary health
example, are the instruments needed, as well as care clinic.
the staff, building, equipment and vehicle
Allocate money values to the costs
requirements for immunizing the inhabitants
of a district against measles? What are the direct While charges may be equivalent to costs ina
and indirect costs to the patient? A useful perfect market, this is rarely so in the health
checklist derived from Luce and Elixhauser care sector where cross-subsidization, shifting
(1990) is the following: of costs (by transfer payments), and imperfect
— direct personal costs: overheads, staff market forces skew charges significantly. Bear-
salaries, appliances, medication, transporta- ing this in mind, it is often feasible to use a
tion, rehabilitation, etc; given charge as a proxy for cost, unless it is
— direct non-personal costs: support services, evident that the error produced is substantial. A
research and development, health education, more accurate alternative consists of calcu-
surveillance and control, etc; lating total cost by adding together the costs of
— indirect costs (wages and time): lost all components in the production process. This
productivity, income and leisure time, home is particularly relevant in studies of the public
caregivers, childcare, etc; sector where fees are unequal to costs.
— indirect costs (intangible): psychosocial costs, @ Beware of double counting
pain, changes in daily living patterns, etc.
A common error made in costing exercises is
@ Identify the possible alternatives to the inter- counting the same cost twice under different
vention being evaluated headings. For example, the cost of medication
267
may have been entered both as a user charge employed elsewhere in the health sector. Further-
(to the patient) and in terms of its cost to the more, shadow pricing can also be used to reflect
clinic (procurement, storage, preparation, the social cost of an intervention.
dispensing, administration, monitoring, treat-
ment for adverse effects, etc). This will result in Outputs
an inflated total cost. A similar error can be Output indicators aim to measure changes in
made by allocating the same cost both to direct health status. No indicator is without short-
and indirect costs columns. comings, and all are value-laden and difficult to
measure. In economic evaluations of health care
Discounting interventions, these indicators are weighed
The implementation and impact of a health inter- against the cost of the programme or service.
vention is likely to occur over time. Thus, we are
likely to compare its costs and benefits over a Putting value on human life
number of years. If we must choose between two A common approach used to establish the value
or mote services or’projects, the net benefits of of an intervention places a monetary value on
each should be compared. It is necessary to use human life. Often this is measured in terms of
discounting in such an economic evaluation, in years of life gained. These monetary values can be
order to account for society’s desire for short- used to measure ‘benefits’ in cost-benefit ana-
term results. Discounting deflates the value of lysis. In general, there are three methods com-
money in the future to account for this positive monly used to value life and well-being (Mooney
time preference. As*the net benefits across all 1992):
years are compared in terms of their value at the The implied values approach takes into con-
time of evaluation, discounting then favours sideration the value that the government or soci-
those projects that provide them sooner. The pre- ety attaches to well-being or a life saved. These
sent value of net benefits is determined using a costs can be implied from the marginal cost of a
formula (Luce, Elixhauser 1990) which utilizes the preventative intervention or from the cost of
discount rate available from the Reserve Bank. introducing life-saving high technology.
Discounting should not be perceived as an Measurements are based on existing policies and
adjustment for inflation. value systems. For example, the marginal cost to
the state of expanding neonatal intensive care for
Shadow pricing @ premature infants could be used as a proxy for the
Price often does not reflect economic or oppor- value of an infant’s life.
tunity cost adequately. For example, the price of The human capital approach equates the value
goods purchased from abroad may not reflect of life to an individual’s lifetime productive capa-
their real value if the exchange rate is over-valued. city measured in terms of his or her income earn-
Economic distortions such as tariffs, subsidies, ings. Specifically, this is average discounted
import and export taxes also distort price. Like- labour costs adjusted according to the country’s
wise, the wage rate may not reflect the true value employment rate.
of labour: A limited number of doctors, repres- Finally, the consumer’s willingness to pay
ented by a professional body, may drive the price places a value on the amount she or he is pre-
of their labour above its market value. Conversely, pared to pay to reduce the risk of mortality or
if the state is the sole purchaser (monopsonist) of morbidity. The value of a health intervention is
nurses’ labour, it may keep wages below their true thus based on the amount that the consumer is
market value. Ultimately, the value of labour willing to pay for it.
should be assessed based on its availability All three of these approaches are proxies and
(elasticity). each presents its own set of problems. The
Shadow pricing allows prices to be adjusted to human capital approach equates life to livelihood
reflect their true economic value. In the case of and fails to encompass those who are not form-
labour, if there is high involuntary unemploy- ally employed. The implied values approach,
ment, the shadow price of unskilled labour may which uses the cost of mortality-reducing inter-
approach zero. If, on the other hand, there is a ventions introduced by the public sector as a
limited supply of highly skilled labour, the proxy for society’s value of life, is based on the
shadow price would represent the opportunity values of current decision-makers (McQuire,
cost of that individual’s labour if he or she were Henderson, Mooney 1992). The ‘willingness to
268
30 health economics
pay’ approach may be the most economically This model has been criticised because it is
sound because it assumes consumer sovereignty, based on trade-off decisions usually made under
but there are shortcomings in this approach. hypothetical situations; it looks exclusively at the
movement from good health to bad; its valuations
Health status indicators assume that an additional year of suffering is
Health status indicators are valuable when com- equivalent to the previous one; and QALYs do not
paring health status both across time and cross- measure externalities associated with health care.
Nonetheless, the introduction of QALYs into
sectionally. Mortality and morbidity rates are the
health economics represents an attempt to in-
most frequently used measures of health output.
These indicators are vague and those working in corporate intangible costs into the discipline.
the health field have sought to find more precise
Disability adjusted life years (DALYs)
ones. These include: disease-specific measure-
DALYs were developed by the World Bank to
ments which quantify clinical change (for exam-
measure the burden of disease (World Bank
ple, changes in cholesterol levels); measures of
1993). They measure the potential years of life lost
dysfunction (number of productive working days
due to disability or death ata given age. While
lost to illness or inability to perform certain tasks);
they are similar to QALYs, they explicitly include
health profiles which give individuals a rating in
disease type (using disease-specific severity
identified health ‘domains’ according to levels of
weightings), focus exclusively on disability, and
disability and distress (such as the Nottingham
are weighted for age based on the individual’s
Health Profile); and global health indicators
predicted productive value. Many of the criti-
which measure states of health (utilities) relative
cisms of QALYs are equally applicable to DALYs.
to each other on an interval scale of 0-1 (death —
DALYs have also been faulted for giving an age
perfect health).
weighting based solely on the individual’s
assumed productive capacity, without taking into
Quality adjusted life years (QALYs) account other socio-cultural variables.
The health status indicators covered above are
limited by their inability to measure the effect of
the duration of ill health. One methodology that
Economic evaluation methods
attempts to account for this is the QALY or the The economic evaluation of health-related inter-
Quality Adjusted Life Year. In this model, the ventions looks at how inputs relate to outputs, or
number of years of life gained from a medical the complete production function. Essentially, it
intervention are adjusted to account for quality of compares costs to consequences. Good cost and
life — disability and distress levels. This is espe- epidemiological data is crucial to economic
cially relevant to medical interventions that have evaluation. The results of this type of evaluation
the potential to incapacitate the patient. Like the are only as good as the information used. De-
global health indicators mentioned above, this cision-makers need to make decisions on how to
model depends on a single index to value levels of derive maximum benefit from a fixed budget.
health, and measures quality on a scale of 0-1 Economic evaluation is one tool that may provide
using health rating scales (Kind, Rosser, Williams a framework for these choices.
1983), time trade-offs between disability and A number of types of economic evaluation
years of life, or ‘standard gamble’ — the trade-off exist. Different ones are appropriate for different
between chronic illness and the chance of death contexts, and they differ from one another in the
due to a potentially hazardous treatment (Tor- extent of their analysis and .their evaluation of
rance 1987). QALYs extend beyond global health outcome. Four methods are briefly described
indicators to weight the index relative to years of below. Cost-minimization analysis is used to find
life gained. Ultimately, this approach can be used the least costly way of implementing a project-or
to measure the trade-off between being relatively programme. Cost-benefit analysis is used to
healthy over a short period of time against being determine whether a programme should be,intro-
in poor health over a long period of time. It can duced or not. And cost-effectiveness and cost-
also be used to assess the relative impact of differ- utility analyses compare programmes to deter-
ent health interventions when the costs of these mine which produces the desired outcome at the
interventions are expressed in terms of the least cost. In cost-utility analysis, the outcome is
number of QALYs they produce. expressed in terms of QALYs.
269
Cost-minimization analysis years of life gained. Results are expressed in terms

Cost-minimization analysis is based on the of a ratio, such as costs per unit of output (for
assumption that a programme or project is in- example, cost per fully immunized child) or the
trinsically worthwhile, and does not attempt to effect per monetary unit (for example, number of
evaluate alternative programmes. Rather, it is fully immunized children per R1 000). A given
used to establish which of two or more inter- project can be deemed efficient if it maximizes
ventions that have similar outputs costs the least benefit achieved under a given budget or min-
or is technically more efficient. The marginal and imizes the cost of reaching a given objective.
average costs associated with an intervention are
calculated to assess the least costly means of Cost-utility analysis
achieving programme goals and optimum service Cost-utility analysis (CUA) was adapted from
levels. cost-effectiveness analysis to measure the effects
of a programme or project in terms of utility or
Cost-benefit analysis satisfaction gained from receiving the interven-
Cost-benefit analysis (CBA) is an economic evalu- tion (a quality-adjusted outcome). Like CEA, it
ation tool which measures costs and benefits maintains a focus on cost minimization or effect
exclusively in monetary terms. It allows for com- maximization, but allows for comparison across
parison of various options for achieving a pro- different medical fields and through time. Results
gramme objective, or of different possible health are usually expressed in terms of cost per QALY.
interventions (allocative efficiency). In assessing As aresult, this method allows the effectiveness of
whether an intervention is worthwhile, the results one type of treatment to be assessed against
of a CBA will determine whether benefits out- another treatment for the same ailment, and
weigh the costs. Most CBAs concentrate on direct incorporates more than one measure of outcome.
net benefits. However, indirect benefits (such as CUA is also used to judge different health inter-
the value of increased production resulting from ventions relative to each other and compares
the programme), and intangible benefits which these by assessing marginal cost per QALY
reflect a change in health status (such as reduced gained. QALY league tables are used for this pur-
pain and suffering), can also be included. Since pose. For example, it may be found that the mar-
most health programmes are implemented over a ginal cost per QALY gained from providing renal
given time period and contain hidden and dis- dialysis is R20 000, while that of providing renal
torted costs, discounting and shadow pricing transplants is R10 000. This may carry policy
should be used where applicable. Because health implications.
outcomes are not easily measured in monetary Table 30.4 summarizes the four methodologies
terms, CBA has proven most useful in measuring for economic evaluation described above. It illus-
the direct net benefits of an intervention in terms trates the units of measurement used for inputs
of resource savings. The human capital approach (costs) and for consequences (outputs, effects, or
is most often used to establish indirect benefit. outcomes).
Willingness to pay and implied values are used
less often, although improved methods of deter- Sensitivity analysis
mining willingness to pay have led to a shift back There is uncertainty in all economic analyses.
to the use of cost-benefit analysis in assessing Given that precise data is often unavailable and a
outcomes. Such an analysis could compare the number of assumptions have to be made in the
cost of a renal dialysis with the value of the life process of evaluation, it is necessary to test the
saved. The value of the life could be determined sensitivity of the results to changes in these
by the amount the individual would pay for the assumptions. For example, one might test the
intervention. impact of increasing and/or decreasing certain
costs by a given percentage, or of quantifying out-
Cost-effectiveness analysis puts in a different manner. Sensitivity analysis
Cost-effectiveness analysis (CEA) is a tool used for gives a base estimate high and low parameters. It
comparing alternative uses of resources that can also indicate whether results depend on a
achieve similar outcomes. In CEA, only costs are particular assumption and are therefore incon-
expressed in monetary terms in the evaluation of Clusive, or whether changing certain assumptions
an intervention; benefits are usually expressed in has little effect on results. Thus, sensitivity analy-
natural units, such as mortality, morbidity, or sis will give some indication as to the level of
270
30 =health economics
detail necessary in a study. In some cases, results studies may reflect the most efficient path to
may be reliable using fairly rough data, whereas follow, they do not necessarily provide the most
in other cases more specific indicators are neces- equitable solutions. Their value stems mostly
sary. from the fact that they ‘make health care planning
comprehensive rather than partial, systematic
rather than piecemeal, and value judgements
Conclusion explicit rather than implicit’ (McQuire, Hender-
One must bear in mind that not all costs are easily son, Mooney 1992).
quantifiable, and may have effects which are not
purely financial. Economic evaluations are also
not free of the analyst’s value judgements. They EXAMPLE 30.1
require co-operation between clients, health Economic impact of the AIDS
workers, health managers, epidemiologists, and
economists. Furthermore, evaluations are costly
epidemic in SA
and time-consuming and are not presently used This study assesseds the direct and indirect costs
extensively in the health sector outside of narrow of AIDS to South Africa. Direct costs included
fields such as the evaluation of pharmaceuticals. direct personal costs (such as hospitalization and
Therefore, the results of economic evaluation drugs for HIV/AIDS patients) and non-personal
should be seen as no more than atool for de- costs (testing, research, and prevention pro-
cision-makers. In the provision of health care, grammes). Direct personal costs were estimated
there is a need for a balanced interplay between using a hypothetical model for utilization of
economic appraisal and considerations of equity, health care resources at each stage of the disease.
ethics, and political expediency. While evaluation The major cost component of this section was
pel (me tm Ce Tame ete Pent in economic

evaluations
Type of economic Measurement of Consequences Measurement of

evaluation costs consequences
Cost minimization Rand, $ etc. Identical None

Cost effectiveness Rand, $ etc. Single effect, common Natural units — e.g.
to the alternatives but years of life gained,
achieved to different units of blood pressure
degrees reduction
Cost-benefit Rand, $ etc. Single or multiple effects, Rand, $ etc.

not necessarily common
to alternatives
Common effects may be

achieved to different
degrees
Cost-utility Rand, $ etc. Single or multiple effects, Quality adjusted

not necessarily common years of life
to alternatives
Common effects may be

achieved to different
degrees
271
found to be hospitalization. The model generated benefits of a vaccine for Haemophilus influenzae
cost estimates for the public and private sectors type B infection (Hib) in the 1992 Cape Town
and for the country as a whole. Non-personal birth cohort (n = 46 537). The risk of disease over a
costs were calculated based on estimates of the five-year period was calculated to be 666 to
total number of HIV tests conducted, and the 100 000 births. Net benefits are the difference be-
total annual cost of research and education. It tween the costs of using or not using a HibTITER.
was estimated that the direct cost (both personal Costs were the sum of direct medical costs (per-
and non-personal) of HIV/AIDS to South Africa in sonal and non-personal) and indirect costs. The
1995 would be between R4713 million and bulk of direct benefits (avoidable disease costs)
R10 007 million. Indirect costs were based on the were the hospitalization of patients with menin-
total number of work years lost. The indirect cost gitis, pneumonia, and septicaemia. Indirect costs
of HIV/AIDS for 1995 was estimated at R1 887 and benefits were measured using the human
million (after the figures were adjusted for a 50% capital and willingness to pay approaches. Costs
replacement rate of disabled labour). The study and benefits were discounted through time at a
then went on to predict the economic impact of rate of 2%. Unvaccinated disease patterns and
HIV/AIDS on GNP; it found a 0,12% loss of earn- vaccine efficacy data were compiled from clinical
ings for 1995 and a 1,5% loss of earnings by 2005. data. Sensitivity analyses were conducted on a
Thereafter, various scenarios of service mix were number of assumptions; coverage was assessed at
simulated to establish their impact on costs. The 85%, 90% and 95%. The value of life estimates and
study concluded that AIDS could have a profound the discount rate chosen were found to have the
impact on health care resources, seriously hinder greatest impact on the model's results. Net bene-
economic growth in the medium term, and profit was measured as the difference between cost of
duce numerous other indirect costs. The results disease averted and the cost of the vaccination
of such a costing exercise can be used to show programme itself. It was estimated that the costs
policy-makers the importance of early preventa- of introducing the vaccine to the 1992 Cape Town
tive measures and can assist them with their long- birth cohort would have been R8,3 million, while
term financial plans. the economic cost of Hib disease was projected at
R10,7 million to R11,8 million. Had the vaccine
Reference: been introduced, the net benefit (costs saved)
Broomberg, J, Steinberg M, Masobe P, Behr G. ‘The Eco- would have ranged from R2,4 million to R3,5 mil-
nomic Impact of the AIDS Epidemic in South Africa’ in Bar- lion. This example shows how economic evalu-
nett T, Blaikie P (eds). AIDS in Africa; Its Present and Future ation can be used to motivate for a particular
Impact, London: Belhaven Press 1992. health intervention. It can be used to show that
even if the costs upfront appear to be substantial,
the cost of doing nothing may be greater.
EXAMPLE 30.2
Costs and benefits of vaccination Reference:
programme for Hib disease Hussey, GD, Lasser M L, Reekie W D. ‘The costs and benefits
ofa vaccination programme for Haemophilus influenza type
In this study, the researchers conducted a cost- B disease.’ Southern African Medical Journal 1995; 85(1):
benefit analysis to measure the hypothetical 20-25.
272
appendix
appendix 1: Standardization ‘observed’ number), the category distribution of

the two populations (for example, age distribu-
(adjustment) of rates tion) and the category-specific rates of a standard
population (which could be one of the popula-
If a variable (for example, age or sex) is related to tions being compared, if the information is avail-
the outcome of interest, and the two or more able). By multiplying each category-specific rate
populations which are to be compared regarding of the standard population with the number of
the outcome differ with respect to the distribu- people in the matching category of the popula-
tion of that variable, one needs to standardize or tion of interest and then adding these, one
adjust rates before they can be compared. This ap- obtains the ‘expected’ number of cases in the
pendix refers to standardization with respect to population. By dividing the observed by the
age distribution — a similar approach would be expected number of cases, one has calculated a
used to standardize for sex or age and sex. standardized morbidity (or mortality) ratio for
Standardization can take the form of direct or that population. (This is also described in Chapter
indirect standardization, depending on the type of 20: Mortality studies.)
information that is available for the populations. A word of caution about indirect standardiza-
To perform direct standardization, the research- tion: one uses the category distribution of each
er needs age-specific rates for each population, as population (which may be vastly different) to
well as a standard or reference age distribution. An calculate that population’s expected number of
example is provided in Table 1. For each popula- cases. This method has therefore also been called
tion, the standardized rate is calculated by multi- the ‘changing base method’. When one compares
plying each age-specific rate with the number of standardized morbidity ratios, one is thus actual-
persons in the standard distribution who are in that ly comparing measures based on differing stan-
age category, adding these values, and finally divid- dards. Direct standardization (or the ‘fixed base
ing by the number of people in the standard popu- method’) involves the comparison of measures
lation. One could also multiply each age-specific based on the same standard, and thus represents
rate with the proportion of the standard population the true essence of standardization.
which falls in that age category, and then add these Besides comparing rates in different popula-
‘weighted’ age-specific rates. Adjusted rates are tions, standardization can also be used if the
thus obtained by mathematical manipulation of researcher has done a study on a sample in which
category-specific rates (in this case age-specific the proportion of the sample in each stratum (for
rates) and provide hypothetical summary (overall) example, age group) does not correspond to the
rates for purposes of comparison. proportion of the target population in each
As a standard distribution, the researcher can stratum (that is, one does not have a proportional
use the age distribution of either of the popula- stratified sample). Often strata of equal size are
tions of interest, their joint distribution, or some included in a sample (for example, 100 subjects
other standard, such as the World Standard. The for each age group), and this distribution does not
standardized rates obtained do, of course, depend reflect the population composition. Crude preva-
on which standard is used. Thus, applying the age- lences or rates found in such samples could be
specific rates of two populations to various stan- very distorted because of the unrealistic age
dards can lead to varying conclusions. distribution. In these cases, one has to adjust the
There are many criticisms of standardization, stratum-specific rates to obtain the overall rate in
precisely because it summarises. Researchers the population. Table 2 provides an example.
often pay little attention to the informative cate-
gory-specific rates, focusing only on the stan-
dardized rate. No single summary rate can reflect appendix 2: Standardization
the richness of category-specific rates, but if there of observers and instruments
are many categories, it may be difficult to make
useful deductions based on these rates. To ensure that good quality measurements are
Indirect standardization is used when catego- taken, the observers (measurers) as well as the
ry-specific rates are not available for one or more measuring instruments need to be standardized
of the populations of interest. For this standard- (that is, they should be adjusted to agree with a
ization method, the researcher uses the crude measuring standard). The observers have to be
number of cases in each of the populations (the trained thoroughly in the proper methods of using
23
epidemiology: a manual for south africa
gel) (ewRT rea eer elpece Cm emae A Ml

Timaom eit d yity
Group A
Age group Population size Proportion of Number of deaths Mortality rate
population in in age group in age group
age group
Under 25 18 000 0,45 90 0,005

25 to 49 10 000 0,25 50 0,005
50 to 69 6 000 0,15 210 0,035
70+ 6 000 0)15 270 0,045
Total 40 000 1,00 620 0,0155
Group B
Age group Population size Proportion of Number of deaths Mortality rate
population in in age group in age group
age group
Under 25 5 000 0:25 5 0,004

25 to 49 5 000 0,25 10 0,002
50 to 6 5 000 0,25 150 0,030
70+ 5 000 0,25 200 0,040
Total 20 000 1,00 365 0,018
The crude death rate in Group A is 0,0155 (or 15,5 Younger people tend to have a lower mortality
per 1 OOO), and in Group B 0,018 (or 18 per rate than older people. We therefore decide to
1 000). However, from the age-specific mortality standardize to a reference population, say one
rates, itis clear that the rates in Group A are with an age structure as follows (where the age
consistently higher than in Group B. From the distribution is indicated by the proportion in each
population figures, it can be seen that Group A is age group):
clearly a younger population than Group B.
Under 25 0,30 of population

25 to 49 years 0,30
50 to 69 years 0,20
70+ 0,20
The standardized rate for Group A is then

0,005 x 0,30 + 0,005 x 0,30 + 0,035 x 0,20 + 0,045 x 0,20 = 0,019 (19 per 1 000)
and for Group B
0,001 x 0,30 + 0,002 x 0,30 + 0,030 x 0,20 + 0,040 x 0,20 = 0,015 (15 per 1 000)
the instruments. For example, the WHO (1983) stip- contact with the heels and the length read to the
ulates that crown to heel lengths of children under nearest 0,1 cm.’
two years of age should be determined using a After such training, and before field work starts,
wooden length-board in the following way: the consistency and accuracy of the observers’
‘The infant is laid on the board, which is itself readings should be investigated. Each observer
on a flat surface. The head is positioned firmly should make repeat measurements on some sub-
against the fixed headboard, with the eyes looking jects. These subjects should be similar to those who
vertically. The knees are extended, usually by firm will be measured in the study. In this way, one can
pressure applied by an assistant, and the feet are determine whether an observer measures consis-
flexed at right angles to the lower legs. The tently. However, a consistent observer may be
upright sliding footpiece is moved to obtain firm measuring consistently incorrectly (for example,
274
appendix
consistently reading weight too heavy).

Therefore, measurements made by each observer BE (ese Wetec
ea ieee Berl
iiyls
must be compared with those of an accepted to the population age structure
standard to investigate accuracy of measure-
ment. The standard is commonly the training In a study to determine the prevalence of hyper-
instructor or the fieldworker supervisor. Annex 1 tension in Mamre residents aged 15 years and
of the WHO publication Measuring change in older, it was decided to include all inhabitants
nutritional status (1983) outlines methods of older than 45 years, as a service to the commun-
analysis of such repeated measurements to deter- ity. Samples were drawn from the younger age
mine consistency and accuracy. Even from a groups. The resulting sample is thus clearly much
simple listing of the different measurements, one older than the Mamre population.
should get an indication whether there are prob-
lems with their quality, and whether these are due Sample Population
to carelessness, or whether measurements are Males Females Males Females
consistently wrong. Similar investigations of
measurement quality can be done during the 13;6% 10:4: SD 25532.5
data collection phase. 15.5 1474 24,5 23,0
All measuring instruments have to be standard- f9,3° 18,6 17,4 18,2
ized before each measuring session, especially if 22,f 22,4 LO2 Of
instruments are handled roughly in the field, and 16;6.. 19,0 G35 0°78
moved around. So, for example, the weight record- LO4 15,054 Bees
ed by the scales should be compared with a known 100% 100% 100% 100%
weight. All scales should give the same reading
The prevalence of hypertension in the females
when measuring the same standard weight. If no
in the age groups was
standard weights are available, one could, for
example, use a 10 litre plastic bottle filled with 15-24 1,4 45-54 35,3
water, which will weigh 10 kg. The weights used for 25-34 bid 55-64 49,2
standardization should fall within the range of 35-44 14,5 65+ 56,9
weights being measured in the study. All scales The crude prevalence of hypertension among
should be moved to the zero point before each females in the sample is 29,8% whereas the
measurement. When choosing a measuring instru- age-adjusted prevalence for females is 16,3%
ment, one should take into account the ease with
which it can be used, as well as its accuracy. Many (1,4x 0,325 + 5,1x0,23+ 14,5x0,182 +
researchers feel, for example, that ordinary bath- 35,3 x 0,107 + 49,2 x 0,078 + 56,9 x 0,078)
room scales should not be used since they become Source: Hoffman M, Joubert G. ‘Mamre hypertension
inaccurate over time. study.’ Medical Research Council: Unpublished data 1990.
appendix 3: Derivation of the stated value for p. It so happens that a proportion of

0,5 (50%) requires the greatest sample size for a given
30 x 7 EPI sample precision, and this proportion is thus used in the
calculations. The required sample size for simple
With some mathematical manipulation, a formula random sampling is 96.
for sample size can be calculated. The design effect in a complex sampling design
n= 24d is the ratio of the variance of the estimated para-
meter under the design to that under a simple
with n the sample size, z the normal deviate (1,96), p random sampling design of the same sample size.
the expected proportion, q = 1-p, and d the required The design effect of a cluster sample is usually
precision (half the width of the confidence interval). less than 2, meaning that the cluster sample size
One factor here is the ‘precision’ or the degree to need seldom be more than double that of a simple
which data centres (groups) around asingle point. If random sample. This gives a figure of 192. Thirty is
this is given as, say, 0,1, the implication is that at a a large enough number of clusters for most statisti-
95% level of confidence, any estimate will not fall cal tests to apply, and so it was thought that a
further than 10 percentage points away from the sample of 30 clusters with 7 children was adequate.
219
appendix 4: Random number table

08939 53632 41345 65379 20165 32576 13967 90616 17995 92422
92578 23668 08801 39792 59541 SO 58830 60923 36068 68101
83994 91054 90377 22776 23263 34593 98191 77811 83144 98563
43080 71414 40760 01831 44145 48387 93018 22618 98547 87716
39372 ~ 46789 26381 37186 85684 79426 05395 17538 56671 82181
83046 58644 04452 98912 53406 30224 00687 32099 86414 29590
99808 32539 96961 88917 60847 64826 41332 64557 15354 Las
28478 70870 68912 75644 33648 21097 23745 52593 01849 37760
09916 19651 28659 95093 12626 19919 05879 56003 83100 94572
19537 66067 20569 28808 87722 67059 12851 73573 25776 92500
23013 (05574 26320 07754 09642 88068 41626 57139 68199 94938
55838 80585 80967 60540 34528 62310 63106 17843 39104 74036
92279 87344 93556 75233 09394 79265 91047 32891 77925 71530
27850 23IS2 89336 26026 52130 78544 02090 05645 15060 39550
01760 54605 11794 79312 69728 04554 99775 57659 47981 68954
81889 70751 87501 88247 41966 57574 67745 88304 20118 25964
74722 14654 15425 60665 25162 04987 03467 75915 24282 62456
56196 75068 ' 44643 92240 51651 79743 13598 63901 61020 91003
96842 62021 00543 45073 65545 87612 35765 26079 34589 72821
25619 98328 59393 71401 93871 20611 78830 87477 15390 05044
91746 05084 04781 82933 54564 80986 94843 40178 87483 63288
92384 84706 76778 98313 98875 08427 60687 88272 83448 06237
86390 62208 95735 14535 25591 22730 06059 31786 36181 31016
60458 83606 57510 92609 38061 94881 26736 06489 98303 31419
03783 39922 05489 73630 92379 91602 18193 84741 44704 05558
31011 36035 37113 98362 56149 51634 04468 62096 32361 35301
20555 05621 48728 41776 12101 96615 70781 55151 93876 66892
56466 36766 12400 43510 49456 05140 85736 68155 37306 10438
26875 67304 61950 65962 38223 35676 70043 99178 64677 95457
90648 84770 OZ OL 93814 27760 22232 83545 01183 55188 20482
26197 72840 01264 52019 00739 36259 10905 39097 36437 66743
72522 34445 D3 Io 13840 97262 59007 78685 41044 38103 59216
12370 ‘41270 36290 46307 51230 90614 82613 80148 37371 02895
81028 60112 31415 47478 02131 85480 93699 92876 13958 47867
61573 38634 77650 18189 10283 97999 95442 90657 84963 93863
98511 46300 91199 30492 62159 98525 31710 03540 35844 83200
76606 10834 75548 30779 54744 26450 66001 57949 53685 00567
20237 16311 15733 47599 43998 35594 17577 85113 52487 48900
21022 86025 26951 87480 82317 06580 98627 32536 07573 52612
47512 11564 41777 46581 03492 01722 78900 57901 37307 02727
80598 59041 28861 41793 91007 69907 00376 73086 35132 53014
01892 34226 88327 21926 36607 22307 04376 25491 13563 51955
89657 70349 15176 57916 10911 44218 67108 04678 24097 02476
97983 65616 11841 80504 76452 34176 16986 94328 13091 29592
59727 92033 14654 59622 25844 18460 78162 02832 13528 55683
12340 72894 26303 01771 73895 27432 99536 50328 06141 83886
48049 33318 67463 04914 22316 89663 37132 15825 60759 22131
85953 16537 25639 05004 99269 50577 10036 05022 39800 93605
03426 78111 37828 23967 03350 04397 96227 37787 60680 23993
Source: Fleiss, 1981
276
bibliography
bibliography
Section A: Contextualizing epidemiology 2 Key concepts in epidemiology
1 Introduction References
Beaglehole R, Bonita R, Kjellstrom T. Basic epidemiology.
References Geneva: World Health Organization 1993.
‘Alexandra Health Centre.’ Special edition South African Bradford Hill A. ‘The environment and disease: Association
Medical Journal 1991; 80:4. or causation?’ Proceedings of the Royal Society ofMedicine
Bac M. ‘PHCin practice.’ CHASA-in-Action: Optimal Health 1965; 58:295-300.
for All. 1st International Conference, 18-20 June 1990. Goldberg M, Dab W. ‘Complex indexes for measuring a
Cassel J A. ‘A comprehensive health program among South complex phenomenon.’ Chapter 7.5 in Abelin T,
African Zulus.’ In Paul B D (ed.) Health, culture and Brzezinski ZJ, Carstairs V D L (eds). Measurementin
community. New York: Russell Sage Foundation health promotion and protection. WHO Regional
1955; 15-41. Publications, European Series: No 22 (1987). *
Commission on Health Research for Development. Jeffe A M. ‘Health status indicators: their utility in chronic
Health research: Essential link to equity in development. disease evaluation research.’ Journal of Chronic Diseases
1980; 33:567-79.
New York: Oxford University Press 1990.
KarkS L, Steuart GW, (eds)..A practice of social medicine. Jinabhai C C, Coovadia H M, Abdool Karim SS. ‘Socio-
Edinburgh: E & S Livingstone 1962. medical indicators of health in South Africa.’ International
Journal of Health Services 1986; 16:163-76.
KarkS L, Le Riche H. ‘A health study of South African Bantu
Leavell H R, Clark EG. Preventive medicine for the doctor in
school children. Report on a somatometric and clinical
his community. New York: McGraw Hill Book Company
study conducted during 1938-39.’ Manpower. Scientific
1965.
Journal Devoted to Manpower Research in South Africa
Lilienfeld A M, Lilienfeld D E. Foundations of epidemiology
1994; 3:2-141.
(2nd edn). Oxford: Oxford University Press 1980.
LastJ M. A dictionary of epidemiology (2nd edn). New York:
MausnerJ S, Kramer S. Epidemiology —An introductory text.
Oxford University Press 1988.
Philadelphia: W B Saunders Company 1985.
‘Mamre Community Health Project.’ Special edition South MacMohonB, Pugh T F. Epidemiological principles and
African Medical Journal 1988; 74:7. methods (Chapter 2). Boston: Little, Brown and Co. 1970.
‘Report of the National Health Services Commission on the Mazur RE. Population structure, fertility and childhood
Provision of an Organized National Health Service for All mortality in South Africa: Lessons to be learned from
Sections of the People of the Union of South Africa 1942- analysis of the Poverty survey. Tygerberg: Medical Research
1944 [The Gluckman Report].’ Cape Town, Republic of Council 1995.
South Africa: South African Government Printer 1945. Noack H, Abelin T. ‘Conceptual and methodologic aspects of
Rothman K. Modern epidemiology. Boston: Little, Brown and measurement in health and health promotion.’ Chapter 5
Co. 1986. in Measurement in health promotion and protection edited
Slome C. ‘Community health in rural Pholela.’ In Kark S L, by Abelin T, Brzezinski ZJ, Carstairs VD L. WHO Regional
Steuart G W (eds). A practice of social medicine. Edinburgh: Publications, European Series: No 22 (1987).
Eand S Livingstone 1962; 260-91. Omran A R. ‘The epidemiological transition: a theory of the
Tollman S. ‘The Pholela Health Centre — The origins of epidemiology of population change.’ Millbank Memorial
community-oriented Primary Health Care (COPC).’ Fund Quarterly 1971; 49: 509-39.
South African Medical Journal 1994; 84:653-8. Rothman K. Modern epidemiology. Boston: Little, Brown and
Tollman S. ‘Community-oriented primary care: origins, Co. 1986.
evolution, application.’ Social Science and Medicine 1991; Sackett D, Haynes R B, Tugwell P. Clinical epidemiology—
32:633-42. A basic science for clinical medicine. Boston: Little, Brown
Yach D, Tollman S. ‘Public Health Initiatives in South Africa and Co. 1985.
in the 1940s and 1950s: Lessons for a Post-Apartheid Era.’ Schach E. ‘Alternative ways of presenting statistics on
Annals of the Journal of Public Health 1993; sickness, disability and death.’ Chapter 11 in Measurement
83:1 043-50. in health promotion and protection edited by Abelin T,
Brzezinski ZJ, Carstairs VD L. WHO Regional ee
Further reading European Series: No 22 (1987): 213-31.
Soberon G, FrenkJ,Sepulveda J. ‘The health care retdimitin
Hanlon JJ, Pickett GE. Public health, administration and
Mexico: before and after the 1985 earthquake.’ American
practice. Missouri: The C V Mosby Company 1979.
Journal of Public Health 1986; 76:673-80.
Parry C DH, Tollman SM, Yach D. ‘Towards an essential
Steyn K, Buch E. Chronic diseases of lifestyle in South Africa:
national health research strategy for South Africa.’
A policy proposal. Medical Research Council 1992.
(Opinion) South African Medical Journal 1992; Susser M. ‘What is a cause and how do we know one?
82:299-300.
A grammar for pragmatic epidemiology.’ American
Tollman S M. ‘The Commission on Health Research for Journal of Epidemiology 1991; 133:635-48.
Development: Its work and relevance for South Africa.’ World Health Organization. Development ofIndicators for
Urbanisation and Health Newsletter CERSA, Medical Monitoring Progress towards Health for All by the Year
Research Council, April 1991. 2000. Geneva 1981.
Wall S. ‘University of Umea Department of Epidemiology Yach D, Mathews C, Buch E. ‘Urbanisation and health:
and Health Care Research Annual Report 1989.’ University methodological difficulties in undertaking epidemiologi-
of Umea, Sweden, April 1990. cal research in developing countries.’ Social Science and
Medicine 1990; 31:507-14.
214,
Further reading Symposium on Participatory Research in Health

Promotion. Compiled by Korrie de Koning, Liverpool
Fleiss J L. Statistical methods for rates and proportions.
School of Tropical Medicine, August 1994.
New York: John Wiley and Sons 1981.
Meulenberg-Buskens I. ‘The participatory researcher as
Jamison DT, Mosley
W H, Measham A R, Bobadilla J L.
educator in research: Some notes.’ Paper at the
Disease control priorities in developing countries. Oxford
International Symposium on Participatory Research in
University Press 1993.
Health Promotion held at Liverpool in September 1993.
Mostert WP, Lotter J M (eds). South Africa’s demographic
Published in the Proceedings of the International
future. Pretoria: Human Sciences Research Council 1990.
Symposium on Participatory Research in Health
Omran AR. ‘The epidemiological transition: a theory of the
Promotion. Compiled by Korrie de Koning, Liverpool
epidemiology of population change.’ Millbank Memorial
School of Tropical Medicine, August 1994.
Fund Quarterly 1971; 49:509-39.
Rifkin S B. ‘The role of the public in the planning, manage-
The International Bank. ‘World Development Report.
ment and evaluation of health activities and programmes,
Investing in health. World Development indicators’ 1993.
including self-care.’ Social Science and Medicine 1981;
15A:377-86.
3 General philosophical issues in epidemiology Rifkin S B, Muller F, Bichmann W. ‘Primary health care:
On measuring participation.’ Social Science and Medicine
References 1988; 26:931-40.
Beauchamp TL, Childress J F. Principles of biomedical ethics Rothman J. ‘Three models of community organisation
(3rd edn). New York: Oxford University Press 1989. practice, their mixing and phasing.’ In Cox F M, ErlichJ L,
Beecher HE. ‘Ethics and clinical research.’ New England Rothman J, Trotman J E (eds). Strategies of community
Journal ofMedicine 1966; 274:1354-60. organisation (4th edn). Itasca, Ill.: F E Peacock Publishers
Chambers R. ‘Making health policy management intersect- 1987.
oral: issues of information analysis and use in less devel- Tandon R. ‘The historical roots and contemporary urges in
oped countries.’ Social Science and Medicine 1989; participatory research; implications for health care.’
29(4):503-14. Keynote address at the International Symposium on
Chetty K, Owen P. ‘Criteria to evaluate community partici Participatory Research in Health Promotion held at
pation in Primary Health Care projects.’ Working paper Liverpool in September 1993. Published in the Proceedings
no. 9. Health Economics Unit, Department of Community of the International Symposium on Participatory Research
Health, University of Cape Town 1994. in Health Promotion. Compiled by Korrie de Koning,
CohenJ M, UphoffN T. Rural development participation — Liverpool School of Tropical Medicine, August 1994.
Concepts for measuring participation for project design,
implementation and evaluation. Amherst, USA: Cornell Further reading
University 1977. Beauchamp TL, Faden R R, Wallace RJ, Walters L (eds).
Council for International Organizations of Medical Sciences Ethical issues in social science research. Baltimore:
(CIOMS). International guidelines for ethical review of Johns Hopkins University Press 1982.
epidemiological studies. Geneva: CIOMS 1991. Benatar SR (ed). Ethical and moral issues in contemporary
Gouws DJ. ‘Stakeholders in innovation.’ RSA 2000 1988; 10(1): medical practice. Cape Town: University of Cape Town
Human Sciences Research Council. Printing Department 1986.
Guidelines on ethics for medical research (revised edition). De Beer C, Hammond M. ‘Community participation in
Tygerberg: South African Medical Research Council 1993. primary health care: participation on whose terms?’
Jones J H. Bad Blood. The Tuskegee syphilis experiment In: Hammond M, Gear] (eds). Workbook in Community
(2nd edn). New York: The Free Press 1993. Health. Cape Town: Oxford University Press 1986; 326-8.
Khanna R. ‘Participatory Action Research (PAR) in women’s Declaration of Alma Ata. Adopted by participants at the
health.’ Keynote address at the International Symposium International Conference on Primary Health Care, Alma
on Participatory Research in Health Promotion held at Ata, USSR, September 1978.
Liverpool in September 1993. Published in the Proceedings Faden RR, Beauchamp TL.A history and theory of informed
of the International Symposium on Participatory Research consent. New York: Oxford University Press 1986.
in Health Promotion. Compiled by Korrie de Koning, Flack HE, Pellegrino E D (eds). African-American
Liverpool School of Tropical Medicine, August 1994. perspectives on biomedical ethics. Washington:
Lammerink M P, Wolffers I (eds). Some selected examples of Georgetown University Press 1992.
participatory research. The Hague: Special Programme for IJsselmuiden CB, Faden R R. ‘Research and informed
Research of the Directorate General for International Co- consent in Africa— another look.’ New England Journal of
operation (DGIS) 1994. Medicine 1992; 326:830-33.
Le BoterfG. ‘Participatory research as a process of critical Katzenellenbogen J. ‘Community participation in the
education: methodological guidelines.’ In: Lammerink M Mamre Community Health Project.’ South African Medical
P and Wolffers I (eds). Some selected examples of participa- Journal 1988; 74:335-8.
tory research. The Hague: Special Programme for Research Levine RJ. Ethics and regulationofclinical research
of the Directorate General for International Co-operation (2nd edn). Baltimore: Urban and Schwarzenberg 1986.
(DGIS) 1994. Office for Protection from Research Risks, National
Madquire P. ‘Participatory research from one feminist Institutes of Health. Protecting human research subjects.
perspective. Moving from exposing androcenticism to Institutional Review Board Guidebook. Washington: US
embracing possible contributions of feminism to partici- Government Printer 1993.
patory research theory and practice.’ Keynote address at Palmer CT, Anderson MJ. ‘Assessing the development of
the International Symposium on Participatory Research in community involvement.’ World Health Statistics 1986;
Health Promotion held at Liverpool in September 1993. 39:345-52.
Published in the Proceedings of the International Ramphele M. ‘Participatory research — The complexities.’
278
bibliography
Working Paper, Department of Social Anthropology, Steineck G, Ahlbom A. ‘A definition of bias founded on the
University of Cape Town 1989. concept of the study base.’ Epidemiology 1992; 3(6):477-82.
Rifkin S B. ‘Health planning and community participation.’
World Health Forum 1986; 7:156-62. 5 Literature review
Rifkin S B. ‘Lessons from community participation in health
programmes.’ Health Policy and Planning 1987; 1(3): 240-49. Further reading
Rothman J. Using research in organisations:A Guide to
successful application. Sage Publications 1980. Sackett D L, Haynes RB, Guyatt G H, Tugwell P. Clinical
Titchen A, Binnie A. ‘Action research: a strategy for theory epidemiology:A basic science for clinical medicine (2nd
generation and testing.’ International Journal of Nursing edn). Chapters 11, 12 and 13. Boston: Little, Brown and
Studies 1994; 31:(1)1-12. Co. 1991.
Twiss B. ‘Exploitation of science through technology and
innovation.” RSA 2000 1988; 10(1). Human Sciences 6 Setting objectives for research
Research Council.
Van der Walt H, Hoogendoom L. ‘Training for self-evalua- References
tion: Experiences at Ithusheng Health Centre, South Werner D, Bower B. Helping health workers learn
Africa.’ Learning for Health, Liverpool School of Tropical pp. 26.2-26.7. Palo Alto: Hesperian Foundation 1991.
Medicine, Education Resource Group Mar-Aug 1995; 7:8-12. Varkevisser C M, Pathmanathan I, Brownlee A. Designing
Van der Walt H. ‘A rural community health project for the and conducting HSR Projects HSR Training Series.
aged in Genadendal, South Africa.’ M Ed. dissertation. Volume 2 Part 1. International Development Research
Department of Adult and Higher Education, University of Centre (Canada) and World Health Organization (Geneva)
Manchester 1983. 19Q1:
Van der Walt H. ‘The role of participatory research to
enhance patient adherence to tuberculosis treatment.’ 7 Study design
In: Proceedings of the International Symposium on
Participatory Research in Health Promotion. Compiled by References
Korrie de Koning, Liverpool School of Tropical Medicine,
August 1994. Rodrigues L, Kirkwood B R. ‘Case-control designs in the
Veatch R M (ed). Cross cultural perspectives in medical ethics: study of common diseases: Updates on the demise of the
readings. Boston: Jones and Bartlett Publishers 1989. rare disease assumption and the choice of sampling
Walters S. ‘Participatory research — Issues and questions scheme for controls.’ International Journal ofEpidemiology
(working paper).’ In: Research Methods for Higher Degrees. 1990; 19:205-13.
Cape Town: Faculty of Education, University of Cape Rossouw] E, Jooste P L, Chalton D O, Jordaan ER,
Town 1983; 102-14. 4 : Langenhoven M L, Jordaan P C, Steyn M, Swanepoel AS,
Weiss C H, Bucuvalas MJ. ‘The challenge of social research Rossouw LJ. Community-based intervention: the
to decision-making.’ In Weiss C H (ed) Using social science Coronary Risk Factor Study (CORIS) International Journal
research in public policy-making. Lexington, MA: DC of Epidemiology 1993; 22:428-38.
Health 1977.
Wacholder S, McLaughlin J K, Silverman DT, MandelJ S.
Werner D. ‘Empowerment and health.’ Contact April 1980; ‘Selection of controls in case-control studies.’ American
102:1-9. Journal ofEpidemiology 1992; 135(9):1019-50.
Werner D. ‘Health care and human dignity — a subjective
look at community-based rural health programmes in Further reading
Latin America.’ Contact Aug 1980; 57:2-16. LilienfeldA M, Lilienfeld D. Foundations of epidemiology
Yach D, Dick J. ‘Implementation of research: The key to (2nd edn). Oxford: Oxford University Press 1980.
closing the gap between public health knowledge and Rothman KJ. Modern epidemiology New York: Little, Brown
action.’ Journal of Comprehensive Health 1991; 2(1):15-20. and Co. 1986.
Sackett D L, Haynes RB, Guyatt GH, Tugwell P. Clinical
epidemiology:A basic science forclinical medicine
Section B: Epidemiological research methods (2nd edn). Boston: Little, Brown and Co. 1991.
and protocol development
8 Sampling
4 Planning a research project
Reference References
Sackett D L. ‘Bias in analytic research.’ Journal of Chronic Jochelson K, Mothibeli M, LegerJ P. ‘Human
Immunodeficiency Virus and migrant labour in South
Diseases 1979; 32:51-63.
Africa.’ International Journal of Health Services 1991;
21:157-73.
Further reading PickW M, Cooper D, KlopperJ ML, MyersJ E, Hoffman MN,
Abrahamson J H. Survey methods in community medicine Kuhn LA. ‘A study of the effects of urbanisation on the
(3rd edn). New York: Longman Publishers 1988. health of women in Khayelitsha, Cape Town. Working
BothaJ, Yach D. ‘Epidemiological research methods (VI). Paper No. 1: rationale and methods.’ Cape Town:
Planning a research project.’ South African Medical Department of Community Health, University of Cape
Journal 1987; 72:563-6. Town 1990.
Hulley S B, Cummings S R. Designed clinical research: an Rutkove S B, Abdool Karim S S, Loening
W E. ‘Patterns of
epidemiological approach Baltimore: Williams and care in an overburdened tertiary hospital outpatients
Wilkens 1988. department.’ South African Medical Journal 1990; 77:476-8.
2
Sackett D LS. ‘Bias in analytic studies.’ Journal of Chronic ment.’ The Lancet 1986; 307-10.
Diseases. 1979; 32:51-3. Gardner MJ, Altman DG (eds.). Statistics with confidence
Watters J K, Biernacki P. ‘Targeted sampling: Options for the — Confidence intervals and statistical guidelines.
study of hidden populations.’ Social Problems 1989; London: British Medical Journal 1989.
36(4):416-30. Haertel U, Heiss G, Filipiak B, Doering A. ‘Cross-sectional
Yach D, Katzenellenbogen J, Conradie H. ‘Ciskei infant and longitudinal associations between high density
mortality study: Hewu district.’ South African Journal of lipoprotein cholesterol and women’s employment.’
Science 1987; 83:416-21. American Journal ofEpidemiology 1992; 135(1):68-78.
Mollentze W F, Joubert G. ‘Coronary risk factor study of
Further reading black rural and urban populations in the Free State.’
Biernacki P, Waldorf D. ‘Snowball sampling: Problems and University of the Orange Free State 1994. Unpublished
techniques of chain referral sampling.’ Social Methods data.
and Research 1981; 10:141-63.
Cohen J. Statistical power analysis for the behavioural Further reading
sciences. New York: Academic Press 1969. Altman D G. Practical statistics for medical research. London:
Donner A, Brown KS, Brasher P. ‘A methodological review of Chapman and Hall 1991.
non-therapeutic intervention trials employing cluster Anderson A, Richter L M. ‘““Birth-to-Ten”: error detection
sampling 1979-1989.’ International Journal of and correction in a longitudinal address database.’
Epidemiology 1990; 19(4):795-800. Southern African Journal ofEpidemiology and Infection
Goodman LA. ‘Snowball sampling.’ Annals ofMathematical 1994, 9:16-19.
Statistics 1961; 32:148-70. Chambers J M, Cleveland WS, Kleiner B, Tukey P A. Graphical
Levy PS, Lemeshow S. Sampling for health professionals. methods for data analysis. Boston: Duxbury Press 1983.
Belmont (California): Lifetime Learning Publications 1981. Draper N, Smith H. Applied regression analysis. New York:
Stoker DJ. ‘Basic sampling methods.’ In SchnetlerJ (ed). John Wiley 1981.
Survey methods and practice. Pretoria: Human Sciences Fleiss J L. Statistical methods for rates and proportions.
Research Council 1989. New York: John Wiley 1981.
Greenland S. ‘Randomisation, statistics and causal inference. ,
9 Datacollection and measurement Epidemiology 1990; 1(6):421-29.
Hoaglin D C, Mosteller F, Tukey JW. Understanding robust
Further reading and exploratory data analysis. New York: John Wiley 1983.
AbrahamsonJ H. Survey methods in community medicine Hosmer D W, Lemeshow S. Applied logistic regression.
(4th edn). Edinburgh: Churchill Livingstone 1990. New York: John Wiley 1989.
Joubert G, Hoffman M. ‘Data quality in a community health Jooste P L, Jordaan E, Van Eck M. ‘Error detection and
survey — the experience with the Mamre Community correction for improving the quality of epidemiological
Health project.’ Southern African Journal ofEpidemiology research data: Experience gained from a cardiovascular
and Infection 1990; 5(4):79-82. community study.’ Southern African Journal of
RezlerA G. The assessment ofattitudes. WHO/EDUC/712/ Epidemiology and Infection 1988; 3:12-16.
152. Geneva: World Health Organization. Kahn HA. An introduction to epidemiologic methods.
Sackett D LS. ‘Bias in analytic studies.’ New York: Oxford University Press 1983.
Journal of Chronic Diseases 1979; 32:51-63. Lee ET. Statistical methods for survival data analysis.
Woodward CA, Chambers LW. Guide to questionnaire Belmont, California: Wadsworth 1980.
construction and question writing (1st edn). Mainland D. ‘Statistical ritual in clinical journals: Is there a
Canadian Public Health Association 1980. cure? —I. British Medical Journal 1984; 288:841-3.
Mainland D. ‘Statistical ritual in clinical journals : Is there a
cure? —II.’ British Medical Journal 1984; 288:920-22.
10 Other issues to consider in the protocol Rosner B. Fundamentals of biostatistics. Boston:
Duxbury Press 1982.
Further reading
Rothman KJ. Modern epidemiology. New York:
MRC News. Special issue: Compilation of articles of general Little, Brown and Co. 1986.
interest 1989; 20(5). (This issue contains articles on, for Siegel S. Non-parametric statistics for the behavioural
example, how to write scientific papers, how to present sciences. New York: McGraw-Hill 1956.
research results visually, the roles of authors and editors, Tufte E R. The visual display ofquantitative information.
and ethical issues in publishing research.) Connecticut: Graphics Press 1983.
Also refer to references and further readings given for Chapter
3 and those given in Section C. 12 Integrating epidemiological concepts
Section C:Data presentation, analysis, References

and interpretation Ahlbom A, Norell S. Introduction to modern epidemiology.
Chestnut Hill: Epidemiology Resources Inc. 1990.
11 An introduction to data presentation, analy-
sis, and interpretation Further reading
References Breslow NE, Day NE. Statistical methods in cancer research.
Volume II— The design and analysis of cohort studies.
Bland J M, Altman DG. ‘Statistical methods for assessing Oxford: Oxford University Press 1987.
agreement between two methods of clinical measure- HulleyS B, Cummings SR (eds). Designing clinical research.
280
bibliography
Baltimore: Williams and Wilkins 1988. McAnerneyJ M, Johnson S, Schoub B D. ‘Surveillance of

Kahn HA. An introduction to epidemiologic methods. respiratory viruses. A 10-year laboratory-based study.’
New York: Oxford University Press 1983. South African Medical Journal 1994; 84:473-7.
KelseyJ,Thompson WP, Evans A S. Methods in observation- Strebel P, Metcalf C, Hussey G, Smith D, Hanslo D, Cameron
al epidemiology. New York: Oxford University Press 1986. N. ‘An epidemic of pertussis syndrome in Cape Town.’
McMahon B, Pugh T F. Epidemiology principles and South African Medical Journal 1989; 75:399.
methods. Boston: Little, Brown and Co. 1970.
Rothman KJ. Modern epidemiology. Boston: Little, Brown Further reading
and Co. 1986.
Cutts F T, Markowitz LE. ‘Successes and failures in measles
Schlesselman J J. Case-control studies. Design, conduct,
control.’ Journal of Infectious Disease 1994; 170
analysis. New York: Oxford University Press 1982.
(Suppl 1):S32-41.
Cutts F T, Smith P G, Colombo S, Mann G, Ascherio A,
Section D: Common approaches in epidemiology Soares A C. ‘Field evaluation of measles vaccine efficacy in
Mozambique.’ American Journal of Epidemiology 1990;
13 The use of routinely available data in 131:349-55.
Department of Health. ‘Seventy five years of notifications.’
epidemiological studies
Epidemiological Comments 1994; 21 (3):43-62.
References Expanded Programme on Immunization. ‘Disease surveil-
lance: information for action.’ Expanded Programme on
Botha) L, Bradshaw D. ‘African vital statistics — A black Immunization Update. World Health Organization,
hole?’ South African Medical Journal 1985; 67:977-81. Geneva 1987.
Central Statistical Services. Population census 1991. Expanded programme on Immunization. ‘Surveillance.’ EPI
Economic characteristics of the population. CSS report no. Alert No. 7 April. World Health Organization, Geneva 1993.
03-01-23(1991). Pretoria: Central Statistical Services 1992. Expanded programme on Immunization. ‘Surveillance for
Henley L, Smit M, Roux P, Zwarenstein M. ‘Bed use in the polio — the Tunisia experience.” EPIAlert No. 8 July. World
medical wards of Red Cross War Memorial Children’s Health Organization, Geneva 1993.
Hospital, Cape Town.’ South African Medical Journal 1991; Evans A S. ‘Surveillance and sero-epidemiology.’ In Evans
80(10):487-90. AS (ed) Viral infections of humans: Epidemiology and
Kibel S M, Bradshaw D, Joubert G. Injury-related deaths in control (2nd edn). New York: Plenum Medical Book
South African children. Parow: Medical Research Council Company 1984.
1990. Langmuir A D. ‘The surveillance of communicable diseases
ReHMIS System Documentation. Pretoria: Medunsa 1994. of national importance.’ New England Journal of Medicine
Van der Merwe §S, Yach D, Metcalf C. ‘Peering into the black 1963; 268:182-92.
hole — The quality of black mortality data in the city of Marier R. ‘The reporting of communicable diseases.’
Port Elizabeth and South Africa.’ South African Medical American Journal of Epidemiology 1977; 105:587-90.
Journal 1991; 79(8):419-22. Metcalf C. ‘Outbreak investigations.’ In Katzenellenbogen J,
VolmtinkJ, Furman S. ‘The South African Sentinel Joubert G, Yach D (eds). Introductory manual for epidemi-
Practitioner Research Network: Organization, Objectives, ology in Southern Africa. Tygerberg: Medical Research
Policies and Methods.’ South African Family Practice 1991; Council 1991.
12:467-71. Sitas F, Zwarenstein M, Yach D, Bradshaw D. ‘A national
sentinel surveillance network for the measurement of ill-
Further reading health in South Africa.’ South African Medical Journal
1994; 84:91-4.
Alderson M R. Mortality, morbidity and health statistics.
New York: Stockton Press 1988. Strebel P M, Kustner H GV. ‘Disease outbreak investigations
— objectives, methods and importance.’ South African
Medical Journal 1988; 73:334-6.
14 Disease surveillance Teutsch S M, Thacker S B. ‘Planning a public health surveil-
lance system.’ Epidemiological Bulletin: Pan American
Abdool Karim S S, Abdool Karim Q, Chamane M. ‘Impact ofa Health Organization 1995; 16(1):1-6
measles campaign on measles admissions to a Natal Thacker S B, Choi K, Brachman PS. The surveillance of
hospital.’ South African Medical Journal 1991; 80:579-81. infectious diseases. Journal of the American Medical
Abdool Karim, Q, Abdool Karim, S S, Singh B, Short R, Association 1983; 249:1 181-5.
Ngxongo S. ‘Seroprevalence of HIV infection in rural South Thacker S B, Stroup D F. ‘Future directions for comprehen-
Africa.’ AIDS 1992; 6:1 535-9. sive public health surveillance and health information
Abdool Karim S S, Abdool Karim Q, Dilraj A, Chamane M. systems in the United States.’ American Journal of
‘Unsustainability of ameasles immunization campaign — Epidemiology 1994; 140:383-97.
rise in measles incidence within 2 years of the campaign.’ Wharton M, ChorbaT L, Vogt RL, Morse DL, Buehler
JW.
South African Medical Journal 1993; 83:322-3. ‘Case definitions for public health surveillance.’ Morbidity
Abdool Karim S§, Dilraj A. ‘Reasons for under-reporting of and Mortality Weekly Report 1990; 39(RR-13):1-43.
notifiable conditions.’ South African Medical Journal
1996; 86: 834-6.
Coetzee N, Hussey G, Visser G, Barron P, Keen A. ‘The 1992 15 Health systems research (HSR)
measles epidemic in Cape Town: a changing epidemiolog-
ical pattern.’ South African Medical Journal 1994; 84:145-9. References
Kustner H GV, Swanevelder J P, van Middelkoop A. Abdool Karim S S, Abdool Karim Q, Chamane M. ‘Impact of a
‘National HIV surveillance — South Africa, 1990-1992.’ measles campaign on measles admissions to a Natal
South African Medical Journal 1994; 84:195-200. hospital.’ South African Medical Journal 1991; 80:579-81.
281
Beattie A, Rispel L, Broomberg J, Price M, Cabral J. 17 Community studies

‘The quality of Primary Care in SA: The results of a facility
based assessment.’ Paper no. 4: Centre for Health Policy, References
Johannesburg 1995.
Levitt N S, KatzenellenbogenJ M, Bradshaw D, Hoffman MN,
Levitt N, Zwarenstein M, Doepfmer S, Bawa A,
BonniciB. ‘The prevalence and identification of risk factors
Katzenellenbogen J, Bradshaw D. ‘Public sector primary care
for NIDDM in urban Africans in Cape Town, South Africa.’
of diabetics: a record review of quality of care in Cape Town.’
Diabetes Care 1993; 16(4):601-7.
South African Medical Journal 1996; 86: 1013-17.
Cooper D, Pick WM, Myers J E, Hoffman MN, Sayed R,
National Health Expenditure Review, Health Systems Trust,
KlopperJ ML. ‘Urbanization and women’s health in
1995.
Khayelitsha: Part II. Health status and use of health
Zwarenstein M, Lewin S. ‘Developing an Evaluation Strategy
services.’ South African Medical Journal 1991; 79:28-432.
for the Western Cape Community Partnerships Project.’
Final report of the consultants. Medical Research Council,
August 1995. Further reading
Abrahamson J H. Survey methods in community medicine
Further reading (3rd edn). Longman Publishers 1988.
Hoffman M, Yach D, Katzenellenbogen J, Pick W, KlopperM
Crombie! K with Davids HT. Research in health care: design, L. ‘The Mamre Community Health Project — rationale and
conduet and interpretation of health services research. methods.’ South African Medical Journal 1988; 74:323-8.
Chichester: Wiley 1996.
MausnerJ S & Kramer S. Epidemiology: An introductory text
Programme on Health Systems Research and Development. (2nd edn). Saunders Publishers 1985.
From research to decision-making: case studies on the use of Van der Walt H. ‘A rural community health project for the
health systems research. Geneva: WHO 1991.
aged in Genadendal, South Africa.’ M.Ed. dissertation.
Department of Adult and Higher Education, University of
16 Rapid epidemiological assessment Manchester 1983.
References 18 Knowledge, attitude, belief, and practice

Bennet S, Woods T, Liyanage W M, Smith D L. ‘A simplified (KABP) surveys
general method for cluster sample surveys of health in References
developing countries.’ World Health Statistics Quarterly
1991; 44:98-106. Bandura A. ‘A social cognitive approach to the exercise of
Bertrand WE, MockN E, Franklin R R. ‘Differential control over AIDS infection.’ In Di Clemente R(ed).
Correlates of Nutritional Status in Kinshasa, Zaire.’ Adolescents and AIDS: A generation in Jeopardy, 89-116.
International Journal ofEpidemiology 1988; 17:556-67. Newbury Park, California: Sage Publications 1992.
Coetzee DJ, Ferrinho P, Reinach SG. ‘A Vaccination survey Bongaarts J. ‘The KAP Gap and the unmet need for contra-
using the EPI methodology to evaluate the impact ofa ception.’ Population and Development Review 1991;
child health outreach programme in an urban area in 17:293-313.
South Africa.’ WHO Bulletin 1993; 71:33-9. Bulmer M, Warwick D (eds) Social research in developing
Comstock G W. ‘Vaccine evaluation by case-control or countries: Surveys and censuses in the Third World.
prospective studies.’ American Journal ofEpidemiology Chichester: John Wiley 1983.
1990; 131:205-7. DeVellis, R F. Scale development: Theory and applications.
De ZoysaI, Feachem RG. ‘Interventions for the control of Newbury Park, California: Sage Publications 1991.
diarrhoeal diseases among young children: rotavirus and Global Programme on AIDS: Social and Behavioural Research
cholera immunization.’ WHO Bulletin 1985; 63:569-83. Unit. Research Package: Knowledge, Attitudes, beliefs and
Henderson R H, Sundaresan T. ‘Cluster sampling to assess practices on AIDS (KABP). Geneva: WHO 1990.
immunisation coverage: a review of experience witha Hewstone M. ‘Attribution theory and common-sense expla-
simplified sampling method.’ WHO Bulletin 1982; 60:253- nations: An introductory overview.’ In Hewstone M (ed).
60. Attribution theory: Social and functional extensions; 170-
Hill AG, David P H. ‘Monitoring changes in child mortality: 86. Oxford: Blackwell 1983.
new methods for use in developing countries.’ Janz N & Becker M. ‘The health belief model a decade later.’
Health Policy and Planning 1988; 3:214-26. Health Education Quarterly 1984; 11:1-47.
Kalter H D, Gray RH, BlackR E, Gultiano S A. ‘Validation of Kelsey] L, Thompson WD, Evans A S. Methods in observat-
post-mortem interviews to ascertain selected causes of ional epidemiology. New York: Oxford University Press
death in children.’ International Journal ofEpidemiology 1986.
1990; 19:380-86. Kuhn L, Steinberg M, Mathews, C. ‘Participation of the
Lanata CF, BlackR E. ‘Lot quality assurance sampling school community in AIDS education: An evaluation ofa
techniques in health surveys in developing countries: high school programme in South Africa.’ AIDS Care 1994;
advantages and current constraints.’ World Health 6:161-71.
Statistics Quarterly 1991; 44:133-9. Maloney M P, Ward M P. Psychological assessment:A con-
Lemeshow S, Taber S.‘Lot quality assurance sampling: ceptual approach. New York: Oxford University Press 1976.
single and double sampling plans.’ World Health Statistics Martin G, Steyn K, Yach D. ‘Beliefs about smoking and health
Quarterly 1991; 44:115-32. and attitudes toward tobacco control measures.’
Selwyn BJ. ‘An epidemiological approach to the study of South African Medical Journal 1992; 82:241-5.
users and non-users of child health services.’ Mathews C, Kuhn L, Metcalf C, Joubert G, Cameron N.
American Journal of Public Health 1978; 69:231-5. ‘Knowledge, attitudes, and beliefs about AIDS in township
school students in Cape Town.’ South African Medical
Journal 1990; 78:511-16.
282
bibliography
Okediji F. ‘Theoretical and methodological critique of Bradshaw D, Laubscher R, Schneider M. ‘Estimated Cause of
surveys of knowledge, attitudes and practice of family Death Profiles for the nine new provinces based on 1990
planning in Africa.’ International Review ofModern data.’ South African Medical Research Council, Technical
Sociology 1973; 3:1-11. Report, Parow, 1995.
Pick WM, Steyn K. ‘Hypertension in family practice.
Astudy of knowledge, attitudes and practices.’ Further reading
South African Medical Journal 1992; 82:257-9. Alderson M R. Mortality, morbidity and health statistics.
Population Council. A manual for surveys offertility and
USA: Stockton Press 1988.
family planning: Knowledge, attitude and practice.
Bradshaw D, Botha H, Joubert G, Pretorius J P, Van Wyk R,
New York: Population Council 1972.
Yach D. Review of South African Mortality (1984). Medical
Ratcliffe J. ‘Analyst biases in KAP surveys: A cross-cultural Research Council, Technical Report No 1, Parow, 1987.
comparison.’ Studies in Family Planning 1976; 7:322-30.
ManualX. Indirect techniques for demographic estimation.
Stone L, Campbell J. ‘The use and misuse of surveys in inter- Department of International Economic and Social Affairs.
national development: An experiment from Nepal.’ Population Studies, No. 81. United Nations, New York 1983.
Human Organization 1984; 43:27-37.
TanurJ M (ed). Questions about questions: Inquiries into the
cognitive bases of surveys. New York: Russell Sage 24 Outbreak investigations
Foundation 1992.
Westoff C. ‘Is the KAP-gap real?’ Population and References
Development Review 1988; 14:225-32. Coetzee N, Hussey GD, Visser G, Barron P, Keen A. ‘The 1992
measles epidemic in Cape Town — a changing epidemiol-
19 Qualitative methodology: an introduction ogical pattern.’ South African Medical Journal 1994;
84:145-9.
Reference Coetzee N, Malusi S, MoetiS. ‘Lack of reporting on disease
outbreaks and epidemics.’ South African Medical Journal
Patton M Q. Qualitative evaluation methods. Beverly Hills: 1993; 83:531-2. a
Sage Publications 1980. Gear, J S, Cassell GA, Gear AJ, Trappler B, Clausen L, Meyers
AM, KewMC, Bothwell T H, Sher R, Miller GB, Schneider
Further reading J, Koornhof HJ, Gomperts ED, Isaacson M, GearJ H.
Basch CE. ‘Focus Group Interview: An under-utilised ‘Outbreak of Marburg virus disease in Johannesburg.’
research technique for improving theory and practice in British Medical Journal Nov 29 1975; 4(5 995):489-93.
health education.’ Health Education Quarterly 1987; TurkI. ‘A meningococcal meningitis epidemic really?’
—
14(4):411-48. Epidemiological Comments 1993; 20:78-9.
Bell J. Doing your research project. Milton Keynes:
Open University Press 1987. Further reading
Bernard H R. Research methods in anthropology.
Qualitative and quantitative approaches. Newbury Park, Benenson AS (ed). Control of communicable diseases
manual (16th edn). Washington DC: American Public
California: Sage Publications 1994.
Health Association 1995.
Eng E, Glik D, Parker K. ‘Focus-group method: Effects on
Centers for Disease Control. ‘Case definitions for public
village-agency collaboration for child survival.’
health surveillance.’ Morbidity and Mortality Weekly
Health Policy and Planning 1990; 5(1):67-76.
Report October 19 1990; 39:RR-13.
Miles M B, Huberman A M. Qualitative data analysis:
Dwyer DM, Strickler H, Goodman RA, Armenian H K.
Asourcebook of new methods. Beverly Hills:
‘Use of Case-Control Studies in Outbreak Investigations.’
Sage Publications 1984. Epidemiology Reviews 1994; 16(1):109-23.
The Open University. Research methods in Education and Fonseca M GP, Armenian H K. ‘Use of Case-Control Method
the Social Sciences. Milton Keynes: Open University Press in Outbreak Investigations.’ American Journal of
1979. Epidemiology 1991; 133:748-52.
Patton M Q. How to use qualitative methods in evaluation Galbraith N S. ‘The application of epidemiological methods
(chapter 5). Newbury Park, California: Sage Publications in the investigation and control of an acute episode of
1987. infection.’ In: Holland WW, Detels R, Knox G (eds).
Strauss A L. Qualitative analysis for social scientists. Oxford textbook of public health vol. 4. New York:
Cambridge: Cambridge University Press 1987. Oxford University Press 1985; 3-21.
Gregg MB. ‘The principles of an epidemic field investiga-
Section E: Epidemiology applied to content areas tion.’ In Holland WW, Detels R, Knox G (eds). Oxford
textbook ofpublic health vol 3. New York:
Oxford University Press 1985; 284-99.
20 Mortality studies KelseyJ L, Thompson WD, Evans AS. ‘Epidemic investiga-
tion.’ In Methods in observational epidemiology.
References New York: Oxford University Press 1986; 212-53.
BothaJ L, Irwig LM, Strebel P M. ‘Excess mortality from Orenstein W A, Bernier R H, Dondero T J, Hinman A R, Marks
stomach cancer, lung cancer, and asbestosis and/or JS, Bart KJ, Sirotkin B. ‘Field evaluation of vaccine efficacy.’
mesothelioma in crocidolite mining districts in South WHO Bulletin 1985; 63(6):1 055-68.
Africa.’ American Journal of Epidemiology 1986; 123(1):30- Palmer S R. ‘Epidemiology in search of infectious diseases:
40. methods in outbreak investigation.’ Journal of
Bailie RS, Selvey CE, Bourne D, Bradshaw D.‘Trends in Epidemiology and Community Health 1989; 43:311-14.
cervical cancer mortality in South Africa.’ Submitted to the Palmer S R. ‘Outbreak Investigation: The Need for “Quick
International Journal ofEpidemiology 1996; 25(3): 488-93. and Clean” Epidemiology.’ International Journal of
283
Epidemiology 1995; 24(Suppl 1): S34-S38. Statistics Quarterly 1991; 44:133-9.

Strassburg M A. Guidelines for the investigation and control Lemeshow S, Robinson D. ‘Surveys to measure programme
of outbreaks of EPI target diseases. World Health coverage and impact: a review of the methodology used by
Organization document EPI/GEN/84/7/Rev 1 (R585-1085- the Expanded Programme on Immunisation.’
686-287-688). (This is a practical guide with separate World Health Statistics Quarterly 1985; 38:65-75.
sections on investigating diphtheria, measles, neonatal Orenstein W A, Bernier RH, Dondero TJ, Hinman AR,
tetanus, poliomyelitis, and whooping cough.) Marks J S, Bart KJ, Sirotkin B. ‘Field evaluation of vaccine
Strebel P M, Kustner H GV. ‘Disease outbreak investigations efficacy.’ WHO Bulletin 1985; 63(6):1 055-68.
— objectives, methods and importance.’ South African Reinach SG, Groeneveld HT, Neethling A. ‘Estimation of
Medical Journal 1988; 73:334-6. confidence intervals for parameters estimated from data
Tillett HE. ‘Statistical analysis of case-control studies of obtained in complex sampling procedures.’ Internal
communicable diseases.’ International Journal of communication, Institute for Biostatistics of the South
Epidemiology 1986; 15:126-33. African Medical Research Council 1990.
Rosero-Bixby L, Grimaldo C, Raabe C. ‘Monitoring a primary
22 Estimating immunization coverage health care programme with lot quality assurance
sampling: Costa Rica, 1987.’ Health Policy and Planning
References 1990; 5(1):30-39.
Weeks R M, Barenzi J F Z, WayiraJ R M. ‘Alow-cost,
Bachmann OM, Thomson R, Coetzee N, KlopperJ ML,
community-based measles outbreak investigation with
Skibbe A. ‘An evaluation of the 1990 immunisation
follow-up action.’ WHO Bulletin 1992; 70(3):317-21.
campaign in the Cape Province.’ Southern African Journal
ofEpidemiology and Infection 1992; 7(3):71-5.
Berry DJ, Yach D. Review of epidemiologically relevant 23 Environmental epidemiology
immunisation research in South Africa: 1985-1990.
Technical Report for Smith, Kline and French. The Centre References
for Epidemiological Research in South Africa, March 1991. GoldsmithJ R. ‘Improving the prospects for environmental
Berry DJ, Yach D, Hennink M HJ. ‘An evaluation of the epidemiology.’ Archives ofEnvironmental Health 1988;
national measles vaccination campaign in the new shanty 43(2).
areas of Khayelitsha.’ South African Medical Journal 1991; Von Schirnding YE R, Strauss N, Kfir R, Robertson P, Fattal B,
79:433-6. Mathee A, Franck M, Cabelli VJ. ‘Bather morbidity from
BorgdorffM W, Walker GJ A. ‘Estimating vaccination recreational exposure to sea water.’ Water, Science and
coverage: routine information or sample survey?’ Technology 1993; 27 (3-4): 183-6.
Journal of Tropical Medicine and Hygiene 1988; 91:35-42. Von Schirnding Y E R, Mokoetle K, Mathee A. ‘Air Pollution
Byarugaba J. ‘Transkei immunisation survey, 1988.’ and Health.’ Enviroteach 1994; 3(2):17-19.
Southern African Journal of Epidemiology and Infection Yach D, Cameron N, Padayachee N, Wagstaff L, Richter L,
1989; 4(1):3-6. Fonn S. ‘Birth to ten: child health in South Africa in the
Coetzee DJ, Ferrinho P, Reinach SG. ‘A vaccination survey 1990s. Rationale and methods ofa birth cohort study.’
using EPI methodology to evaluate the impact of a child Paediatric and Perinatal Epidemiology 1991 5(2): 211-33.
health outreach programme in an urban area of South
Africa.’ WHO Bulletin 1993; 71(1):33-9. Further reading
Coetzee N, Yach D, Blignaut R, Fisher S A. ‘Measles vaccina-
tion coverage and its determinants in a rapidly growing Blumenthal D S. ‘A perspective on environmental health.’
peri-urban area.’ South African Medical Journal 1990; In: Blumenthal DS (ed). Introduction to environmental
78:733-7. health. New York: Springer Publishing Company 1985.
Chapman RD. ‘The Orange Free State experience — De Koning HW (ed). Setting environmental standards.
Determining both primary health care coverage and the Geneva: World Health Organization 1987.
health status of blacks living on farms.’ CHASA journal Lave LB, UptonAC (eds). Toxic chemicals, health and the
1990; 1(1): 30-36. environment. Baltimore: Johns Hopkins University Press
Cutts F T. ‘Strategies to improve immunisation services in 1987.
urban Africa.’ WHO Bulletin 1991; 69(4):407-14. Lippmann M L, Schlesinger R B. Chemical contamination in
Cutts F T, Monteiro O, Tabard P, CliffJ. ‘Measles control in the human environment. Oxford: Oxford University Press
Maputo, Mozambique, using a single dose of Schwarz 1979.
vaccine at age 9 months.’ WHO Bulletin 1994; 72(2):227-31. Mendell M. ‘Data in setting with likely over-reporting due to
Department of Health. National Review of the Expanded environmental worry.’ Paper presented at 2nd Annual
Programme on Immunisation in South Africa: Meeting of the International Society for Environmental
Preliminary National Report. Pretoria, 1994. Epidemiology, University of California, Berkeley USA:
Ferrinho P, Buch E. ‘Methodological aspects of vaccination 12-15 August 1990.
coverage surveys in South Africa.’ Southern African Von Schirnding YE R. ‘Methodological issues in air pollution
Journal of Epidemiology and Infection 1991; 6(3&4):52-7. epidemiology.’ In Proceedings 1st IUAPPA Regional
Henderson R H, Sundaresan T. ‘Cluster sampling to assess Conference on Air Pollution, Pretoria, South Africa, 24-26
immunisation coverage: a review of experience with a October 1990.
simplified sampling method.’ WHO Bulletin 1982; Von Schirnding YE R. ‘The need for environmental epidemi-
60(2):253-60. ology in developing countries.’ In Proceedings
Kok PW. ‘Cluster sampling for immunisation coverage.’ International Conference on Epidemiology, Beijing,
Social Science and Medicine 1986; 22(7):781-3. China, 24-26 April 1989.
Lanata CF, Black RE. ‘Lot quality assurance sampling World Health Organization. ‘Guidelines on studies in
technique in health surveys in developing countries: environmental epidemiology.’ Environmental Health
advantages and current constraints.’ World Health Criteria 27. Geneva: World Health Organization 1983.
284
bibliography
World Health Organization. Toxic oil syndrome. tion disorders in a rural area: the role of the community
Copenhagen: World Health Organization 1984. speech and hearing therapy workers.’ South African
World Health Organization. Environmentand health: Journal of Communication Disorders 1992; 39:55-61.
the European Charter. Geneva: World Health Schneider M, Rispel L, Doherty J. The development ofguide-
Organization 1990. lines for rehabilitation services in the Greater Soweto Area.
Paper No. 34, Centre for Health Policy, Department of
24 Occupational epidemiology Community Health. University of the Witwatersrand,
Johannesburg 1994.
References Zaman §S S, Kahn N Z, Islam S, Bann S, Dixit S, Shrout P,
Bradford Hill A. ‘The environment and disease: Association Durkin M. ‘Validity of the “Ten Questions” for screening
or causation?’ Proceedings of the Royal Society ofMedicine serious childhood disability: Results from urban
1965; 58:295-300.
Bangladesh.’ International Journal of Epidemiology 1990;
Checkoway H, Pearce N V, Crawford-Brown DJ. 19(3):613-20.
Research methods in occupational epidemiology.
New York: Oxford University Press 1989. 26 Psychiatric epidemiology
Myers J E, Cornell J E. ‘Respiratory health of brickworkers in
Cape Town, South AFrica: symptoms, signs and References
pulmonary function abnormalities.’ Scandinavian Journal American Psychiatric Association (APA). Diagnostic and
of
Work Environment and Health 1989; 15(3):188-94. statistical manual of mental disorders (4th edn) (DSM-IV).
Myers J E, Garisch D, Louw SJ. ‘Respiratory health of Washington DC: American Psychiatric Association 1994.
brickworkers in Cape Town, South Africa: radiographic Bech P, Malt U F, Dencker S J, Ahlfors U G, Elgen K,
abnormalities.’ Scandinavian Journal ofWork Lewander T, Lundell A, Simpson GM, Lingjaerde O (eds).
Environment and Health 1989; 15(3): 195-7. ‘Scales for assessment of diagnosis and severity of mental
Myers J E, Lewis P, Hofmeyr W. ‘Respiratory health of brick- disorders.’ Acta Psychiatrica Scandinavica Supplementum
workers in Cape Town, South Africa: background, aims No. 372. Vol 87. Copenhagen: Munksgaard 1993.
and dust exposure determinations.’ Scandinavian Journal Blackburn C. Poverty and health: Working with families.
of
Work Environment and Health 1989; 15(3):180-87. London: Open University Press 1991.
Myers J E, von Schirnding YE R. ‘Occupational and environ- Brown G, Harris T. Social origins of depression. London:
mental epidemiology — Similarities and contrasts.’ Tavistock 1978.
South African Medical Journal 1992; 81(11):546-9. Cooper B. ‘Psychotherapy, psychiatric epidemiology and
Steenland K(ed). Case studies in occupational epidemiology. health services.’ Social Psychiatry 1984, 19:93-5.
New York: Oxford University Press 1993. Dawes A, Donald D (eds). Childhood and adversity:
Psychological perspectives from South African research.
25 Disability studies Cape Town: David Philip 1994.
Dick B, Spencer 1 WF, Watermeyer GS, Bourne DE,
References Wolff E P M, Moyle GD. ‘Chronic illness in non-institut-
Coetzer T. ‘General development, visual perception and ionalized persons. Part I. Prevalence and epidemiology.’
motor function of children with birthweight under 1 250 g South African Medical Journal 1978; 53:892-904.
and their fullterm, normal birthweight peers at 5-6 years of DragunsJ G. ‘Assessing mental health and disorder across
age.’ Master of Philosophy (Maternal and Child Health) cultures.’ In Pedersen P B, Sartorius N and Marsella AJ
thesis, Department of Paediatrics and Child Health, (eds) Mental health services: The cross cultural context
University of Cape Town (in progress). 31-58. Beverly Hills: Sage Publications 1984.
World Health Organization. International classification of Gillis LS. ‘Research into community-based essential mental
Impairments, Disabilities and Handicaps —A manual of health care services.’ Paper presented at the Seminar on
classification relating to the consequences of disease. Geneva: community-based essential health care services.
World Health Organization 1980. Johannesburg, June 1987.
Gillis LS, Elk R, Ben-Arie O, Teggin A. ‘The present State
Further reading Examination: Experiences with Xhosa-speaking psychiat-
Chamie M. ‘Survey design strategies for the study of disability.’ ric patients.’ British Journal of Psychiatry 1982; 141:143-7.
World Health Status Quarterly 1989; 42(3):122-46. Goldberg D, Huxley P. Mental illness in the community.
Coetzee LT, Katzenellenbogen J, Rendall K. ‘Coping in London: Tavistock 1980.
Mitchell’s Plain: Qualitative analysis of a group of handi- Goldberg D, Huxley P. Common mental disorders:
capped elderly in the community.’ South African Family A bio-social model. London: Routledge 1992.
Practice 1993; 14(8):341-6. Holmes TH, Rahe RH. ‘The Social Readjustment Rating
Badley E M. ‘The ICIDH: Format, application in different Scale.’ Journal of Psychosomatic Research 1967; 11:213-18.
settings, and distinction between disability and handicap.’ Links P S. ‘Community surveys of the prevalence of child-
International Disability Studies 1989; 9:122-5. hood psychiatric disorders: A review.’ Child Development
Concha M &Lorenzo T. ‘The prevalence of disability in a 1983; 54:531-48.
rural area of South Africa-Gazankulu.’ South African Littlewood R, Lipsedge M. Aliens and alienists:
Journal of Occupational Therapy 1993; 23(2):6-15. Ethnic minorities and psychiatry (2nd edn).
Cornielje H, Ferrinho P, Coetzee D, Reinach SG. London: Unwin Hyman 1989.
‘Development of a community based rehabilitation Marsella AJ. ‘Urbanization and mental disorders:
programme for a poor urban area in South Africa. An overview of theory and research, and recommendat-
A Disability prevalence study — Part I.’ ions for interventions and research’ (Unpublished mono-
Journal of Comprehensive Health 1993; 4(1):26-32. graph). Geneva: Mental Health Division, World Health
Schneider M. ‘The nature and management of communica- Organization 1992.
285
Miller T, Swartz L. ‘Access to psychiatric resources: The case method with extent of “overweight” and various weight-
of Mamre.’ Presented at the Association for Sociology in height relationships in the assessment of obesity.’
Southern Africa Conference, Stellenbosch, July 1990. British Journal of Nutrition 1977; 38:271-84.
Miller T, Swartz L. ‘Psychology and epidemiology: World Health Organization. Measuring change in nutritional
An uncomfortable alliance?’ South African Journal of status. Geneva: WHO 1983.
Psychology 1992; 22:52-8.
Newton J. Preventing mental illness in practice. London: 28 Assessment of dietary intake
Routledge 1992.
Orley J, Wing) K. ‘Psychiatric disorders in two African References
villages.’ Archives of General Psychiatry 1979; 36:513-20.
Parry CDH. ‘Areview of psychiatric epidemiology in Africa: Department of Health. Report on Health and Social Subjects
Strategies for improving validity when using instruments 41. Dietary Reference Values for Food Energy and Nutrients
transculturally.’ Transcultural Psychiatric Research Review for the United Kingdom. Report of the Panel on Dietary
1996; 33:173-8. Reference Values of the Committee on Medical Aspects of
Reeler A P. ‘Psychological disorders in Africa II: Clinical Food Policy. London: HMSO 1991.
issues.’ Central African Journal of Medicine 1987; 33:15-19. Food and Nutrition Board, National Academy of Sciences —
Reynolds J, Swartz L. ‘Professional constructions ofa “lay” National Research Council. ‘USA 1989 Revised
illness: “nerves” in a rural “coloured” community in South Recommended Dietary Allowances.’ Journal of the
Africa.’ Social Science and Medicine 1993; 36:657-63. American Dietetic Association 1989; 89:1 748-52.
Robins LN. ‘Psychiatric epidemiology— ahistoric review.’
Social Psychiatry & Psychiatric Epidemiology 1990; 25:16-26. Further reading
‘Rumble S, Swartz L, Parry C D H, Zwarenstein M. Bingham SA. ‘The dietary assessment of individuals;
‘Prevalence of psychiatric morbidity in the adult populat- methods, accuracy, new techniques and recommendat-
ion ofa rural South African village.’ Psychological Medicine ions.’ Nutrition Abstracts and Reviews 1987; 57:705-42.
(in press). Cameron M E, Van Staveren W A (eds). Manual on method-
Swanson-Fisher R W, Martin CJ. ‘Standardized interviews in ology for food consumption studies. Oxford:
psychiatry: Issues of reliability.’ British Journal of Oxford University Press 1988.
Psychiatry 1981; 139:138-43. Department of Health. Guide to Healthy Eating.
Swartz L, Miller T. “Minor psychiatric disability and primary Pretoria 1990.
health care utilisation in Mamre.’ Presented at the Langenhoven ML, Kruger M, Gouws E, Faber M. MRC Food
Epidemiological Society of South Africa Conference, Composition Tables (3rd edn). Parow: Medical Research
University of the Western Cape, July 1991. Council 1991.
Swartz L. ‘Transcultural psychiatry in South Africa. Part II.’ Langenhoven M L, Conradie PJ, Wolmarans P, Faber M.
Transcultural Psychiatric Research Review 1987; 24:5-30. MRC Food Quantities Manual (2nd edn). Parow:
Swartz L, Ben-Arie O, Teggin AF. ‘Subcultural delusions and Medical Research Council 1991.
hallucinations: Comments on the present State Langenhoven M L, Gouws E, Wolmarans P, Van Eck M.
Examination in a multi-cultural context.’ British Journal of The analysis of dietary data at RIND: Nutrients and food
Psychiatry 1985; 146:391-4. groups. Parow: Medical Research Council 1989.
Tafari S, Aboud F E, Larson C P. ‘Determinants of mental Passmore R, Nicol B M, Rao MN, Beaton GH, DeMayer EM.
illness in a rural Ethiopian adult population.’ Handbook on Human Nutritional Requirements. Rome:
Social Science and Medicine 1991; 32:197-201. Food and Agricultural Organization of the United Nations
Tansella M, de Girolamo G, Sartorius N. Annotated bibliog- 1974.
raphy of psychiatric epidemiology. London: Gaskell 1992.
World Health Organization. The ICD-10 classification of 29 Measurements in dental epidemiology
mental and behavioural disorders: Clinical descriptions
and diagnostic guidelines 1992. References
Al-Alous W, Jackson D, Crompton G, Jenkins O C. ‘Enamel
27 Anthropometric studies mottling in a fluoride and non-fluoride community.’
British Dental Journal 1975; 138:9.
Further reading Angle E H. ‘Classification of malocclusion.’ Dental Cosmos
Bray GA. ‘Definition, measurement and classification of the 1989; 41:248.
syndromes of obesity.’ International Journal of Obesity Bollmer B W, Sturzenberger O P, Lenoff R W, Bosma
1978; 2:99-112. ML, Lang NP, Mallet ME, MeckelA H. ‘A comparison of
Frisancho AR. ‘Newnorms of upper limb fat and muscle three clinical indices for measuring gingivitis.’ Journal of
areas for assessment of nutritional status.’ American Clinical Periodontology 1986; 13:392-395.
Journal of Clinical Nutrition 1981; 34:2 540-5. Chamberlain, J. ‘Evaluation of screening for cancer.’
Keys A. ‘Overweight, obesity, coronary heart disease and Community Dental Health 1993; 10. Supplement 1:5-11.
mortality.’ Nutrition Review 1980; 38:297-307.
Cohen, LK, Jago J D. ‘Socio-dental indicators.’ In Eloresian
J, Siegmann AE (eds.) Socio-Medical Health Indicators
Kibel MA, Wagstaff A (eds). Child health for all: Amanual
1983.
for Southern Africa. Cape Town: Oxford University Press
Cushing AN, Sheiham A, Maizels J. ‘Developing socio-
199e ‘i
dental indicators — the social impact of disease.’
Steyn K, Fourie J, RossouwJ E, Langenhoven M L, Joubert G,
Community Dental Health 1986; 3:3-17.
Chalton D O. ‘Anthropometric profile of the coloured
Dean, HT. ‘Classification of mottled enamel diagnosis.’
population of the Cape Peninsula.’ South African Medical Journal of the American Dental Association 1934;
Journal 1990; 78:68-72. : 21:1421-1426.
Womersley J, Durnin J VGA. ‘A comparison of the Skinfold Dean, HT, Elvove, E. ‘Some epidemiological aspects of
286
bibliography
chronic endemic fluorosis.’ American Journal of Public Further reading

Health 1936; 26:567-575.
Pelton, WA (ed). The Epidemiology of Oral Health. Harvard
Federation Dentaire International. Standard method for University Press: 1969.
assessment of occlusal traits. London: 1982. Pindborg, J J. Oral Cancer and Precancer. Wright & Sons Ltd:
Greene, J C, Vermillion, J R. ‘Oral hygiene index: a method 1980.
for classifying oral hygiene status’. Journal of the American
Dental Association 1960; 61:172.
Greene, J C, Vermillion, J R. ‘The simplified oral hygiene 30 Health economics
index.’ Journal of the American Dental Association 1964;
68:25. ; References
Johnson N W (ed). Dental Caries Vol I: Risk Markers for Oral Creese A, Parker D (eds). Cost analysis in primary health care:A
Diseases. Cambridge University Press: 1991. training manual for programme managers. Geneva: WHO,
Johnson NW (ed). Periodontal Diseases Vol IT: Markers for Unicef, Aga Khan Foundation.
disease Susceptibility and Activity. Cambridge University Department of Clinical Epidemiology and Biostatistics,
Press: 1991. McMaster University. ‘How to read clinical journals: VII.
Johnson NW (ed). Oral Cancer Vol III: Risk Markers of Oral To understand an economic evaluation.’ Canadian
Diseases. Cambridge University Press: 1991. Medical Association Journal, 1984; 130:1 428-33.
Knutson, J W. ‘An index of prevalence of dental caries in Drummond MF, Smith GT, Wells N. Economic evaluation in
school children.’ Public Health Report 1944; 59:253-263. the development of medicines. London: Office of Health
Loé H and Silness J. ‘The gingival index, plaque index and Economics 1988.
the retention index system.’ Journal of Periodontology Fox D. ‘Health policy and the politics of research in the
1964; 38:610 United States.’ Journal of Health Politics, Policy and Law
Lunn H, Richmond §S, Mitropoulos C. ‘The use of the 1990; 15:481-99.
orthodontic treatment need (IOTN) as a public health tool: Kind P, Rosser R M, WilliamsA H. ‘Valuation of quality of life:
a pilot study.’ Community Dental Health 1993; 10 some psychometric evidence’, in Jones-Lee M W (ed) The
(2):111-121. valueof life and safety. Amsterdam: North Holland 1983.
National Cancer Registry of South Africa. Cancer in South Luce B R, Elixhauser A. ‘Estimating Costs in the Economic
Africa, 1989. Johannesburg: 1990. Evaluation of Medical Technologies.’ International
National Oral Health Survey: South Africa 1988/89. Journal of Technology Assessment in Health Care 1990;
Department of Health, Pretoria: 1994. 6:57-75.
Ramfjord S P. ‘Indices for prevalence and incidence of McMurchy D. ‘The Economic Impact of HIV/AIDS in
periodontal disease.’ Journal of Periodontology 1959; Lesotho: Direct & Indirect Costs 1993-1998.’
30:31. : WHO Bulletin, December 1994.
Rateitschalk, K H, WolfH F, Hassell T M. Color Atlas of McGuire A, Henderson J, Mooney G. The economics of health
Periodontology 25-34. care. London: Routledge 1988.
Richmond S, Shaw WC, O’Brien KD, Buchanan IB, Jones Mills A, Gilson L. Health economics for developing countries:
R, Stephens, C D, Roberts CT, Andrews M. ‘The Asurvival kit. London: Evaluation and Planning Centre
Development of the PAR Index (Peer Assessment Rating): for Health Care 1992.
reliability and validity.’ European Journal of Orthodontics Mooney G. Economics, medicine and health care.
London: Harvester Wheatsheaf 1992. .
1992; 14 (2):125-139.
Russel, A L. ‘A system of classification and scoring for Torrance G W, Thomas WH, Sackett D L. ‘Utility
Maximization Model for Evaluation of Health Care
prevalence surveys of periodontal diseases’. Journal of
Programs.’ Health Service Research 1972; 7:118-33.
Dental Research 1956; 35:350.
World Bank. World Development Report 1993: Investing in
Sheiham A, Maizels J, Maizels A. New Composite Indicators
Health. New York: Oxford University Press.
for Health. Community Dental Health 1987; 4:407-414.
Slack L G (ed). Dental Public Health: An Introduction to
Appendix 2
Community Dentistry. Wright & Sons Limited: 1981.
Thylstrup A, Fejerskov O. ‘Clinical appearance of dental
Reference
fluorosis in permanent teeth in relation to histological
changes.’ Community Dental Oral Epidemiology 1978; World Health Organization. Measuring change in nutritional
6:315-328. status. Geneva: WHO 1983.
World Health Organization. Basic Methods: Oral Health
Surveys (3rd edn). Geneva: 1987. Appendix 4
World Health Organization. Guide to epidemiology and
diagnosis of oral mucosal diseases and conditions. Reference
Munksgaard, Copenhagen: 1980 Fleiss JL. Statistical methods for rates and proportions. New
World Health Organization, Oral Health Unit. A guide to York: John Wiley and Sons 1981.
Oral Health epidemiological investigations. Geneva: 1979.
World Health Organization. International classification of
diseases: Application to Dentistry and Stomatology
(ICD-DA). Geneva: 1978.
287
copyright acknowledgements No 1: rationale and methods.’ Cape Town: Department of

Community Health, UCT 1990. Printed by courtesy of the
authors.
The editors and publisher acknowledge with thanks permis- Rifkin’s framework for assessing participation in projects,
sion to reproduce copyright material in this book. Every effort taken from: Rifkin S B, Muller F, Bichmann W. ‘Primary health
has been made to trace copyright holders, but where this has care: on measuring participation’. Social Science and Medicine
proved impossible, the editors and publisher would be grateful 1988; 26:931—40. Printed by courtesy of the authors.
for any information which would enable them to amend inad- Revised problem analysis diagram of factors contributing to
vertent omissions in future editions. high defaulter rate among TB patients, taken from: Varkevisser
Tuberculosis clinic card designed for hand analysis, taken C M, Pathmanathan I, Brownlee A. Designing and Conducting
from: Dick K, Youngleson S M. ‘The design ofa case register for HSR Projects. HSR Training Series, Vol 2 Part 1. International
tuberculosis. A pilot study in the southwestern Cape’. South Development Research Center, Canada and World Halth
African Medical Journal 1994; 84:621-3. Printed by courtesy of Organization, Geneva: 1991. Printed by courtesy of the
the authors. International Development Research Center.
Checklist for statistical review, taken from: Gardner M J, Figures showing the effect of systematic and/or random
Altman DG (eds). ‘Statistics with confidence — confidence error on the precision and validity of measures of effect, taken
intervals and statistical guidelines’. London: British Medical from: BothaJ L, Yach D. ‘Epidemiological research methods.
Journal 1989. Printed by courtesy of BMJ Publishing Group. Part II. Descriptive studies’. South African Medical Journal
Adaptation of a hand[{drawn map of Khayelitsha, taken from: 1986; 70:769. Printed by courtesy of the Medical Association of
Pick, W M, etal. ‘A study of the effects of urbanisation on the South Africa.
health of women in Khayelitsha, Cape Town. Working paper
288
index
index causation of disease 10-12

guidelines for establishing 12
models of multiple causation 11-12
A necessary cause 10-11
sufficient cause 10-11
active surveillance 142 cause of death
problems in 145 determining 189-90
adult mortality 189 profiles (study of) 193
age-specific fertility rate 20 census data 19
age-specific mortality rates 187 Centre for Epidemiological Research in Southern Africa
AIDS research (CERSA) 8
assessment of economic impact of epidemic 271-2 CERSA see Centre for Epidemiological Research in Southern
KAPB surveys in schools 174-5 Africa
setting objectives for 63 childhood mortality rate 189
analysis cholera outbreak (London, 1854).204
in disability studies 227-8 classification (of disablement) 224-5, 226
in occupational epidemiology 220-1 closed questions 83
analytic studies 64, 66-7 cluster sampling 77-8
different types of 67-9 CME see comparative mortality factor
for testing hypothesis 200-2 coding 86-7, 180
objectives of 61 cohort studies 69, 202, 216-17
anthropometric studies 243-7 Commission on Health Research for Development (1990) 5
examples of 246-7 communal versus individual consent 31
for adults 244-6 communication 167
for children 243-4 community (definition of) 34
arithmetic mean 108-9 community health see public health
association 221 community health worker project (evaluating) 181
attitudes, measurement of 171 community indicators 162
attributable risk 117-18 community intervention trials 70
attribution theory 169 community oriented primary care (COPC) 7
community participation 34-40, 166-7
application of 34-7
criterion for ranking 39
bar graphs 106-8 definition of 34
before-after studies 71 evaluation of 37-40
behaviours, measurement of 171 example of study with 43-6
beliefs, measurement of 171 factors and components for evaluation of 38
in health projects 35, 37-40
bias .
in programmes 34-5
in case-control studies 68
in KAPB survey sampling 172 in research 35-7 :
Community Periodontal Index of Treatment Needs (CPITN)
in occupational epidemiology 220
257-8
in research process 51-3
community studies 164-8
in sampling 79-81
data collection for 165
information 128-30
dealing with community in 166-7
selection 127-8
examples of 167-8
types of 127
fieldworkers and 165-6
validity and 126-30
sampling for 164-5
biological monitoring 215
community-based surveys 206-8
birth rate 16
comparative mortality factor (CMF) 188-9
births data
confidence intervals 113-14, 115
sources and limitations of 135, 136
confidentiality 32
used for estimating immunization coverage 205-6
conflicts of interest 33
bivariate plot 106, 107, 121
confounders 119
block chart 109
consent 31, 32
BMI see Body Mass Index
content analysis 180
Body Mass Index (BMI) 244-6
content validity 92
box plot 104, 105-6 control measures 203-4
budget 96, 97 COPC see community oriented primary care
correlation 120
c . cost-benefit analysis 270
cost-effectiveness analysis 270
case definition 197 cost-minimization analysis 270
case-control studies 68-9, 162, 202, 216, 219 costs
case-referent studies see case-control studies framework for evaluating 267-8
categorical variables 101-2, 108 identifying 265-6
and honesty in graphical display 110 measuring 266-8
graphical display of 106-8 cost-utility analysis 270
289
epidemiology: a manual for South Africa
CPITN see Community Periodontal Index of Treatment methods of data collection in 248-51
Needs national level 248
criteria-related validity 92 validity and reproducibility of 251
cross-sectional analytic studies 67-8, 216, 219 disability, classification of 224-5
example of 222-3 Disability Adjusted Life Years (DALY) 23, 190, 269
crude rates 18 disability prevalence (studies to determine) 228-9
birth 20 disability studies 224-9
death (mortality) 20, 187 examples of 228-9
culture 237 variables included in 227
cumulative incidence 16-17 disclosure 31
discounting 268
D disease surveillance 140-6
definition of 140
for detecting outbreaks 197
DALY see Disability Adjusted Life Years
" history of 140
data analysis 87-9, 95-7, 101-23
main purpose of 141
by computer 87-9
methods of 141-4
by hand 87, 88
problems in 144-5
checking data before 102-3
to determine immunization coverage 205
commonly used 116-22
uses of 140-1
exploratory 103-8
DMFS/dmfs index 255
for estimating immunization coverage 209
DMFT/dmft index 255
in KAPB surveys 172
dummy tables 95
summarizing 108-111
what to consider when choosing appropriate 116
data capture sheet 87
data checking procedure 102
data collection 82-94 ecologic studies 71-2, 193, 216
for estimating immunization coverage 208-9 economic evaluation 264-9
for qualitative research 177-9 examples of studies 271-2
in community studies 165 measurement of costs and consequences 271
in dietary studies 48-51 methods of 269-71
in outbreak investigation 202-3 economic objectives 262-4
issues related to 93-4 effectiveness 262-4
methods of 84, 177-9 efficiency 262
quality of data 89-93 ENHR see essential national health research
using questionnaires for 82-9 environmental diseases
data distribution (types of) 104-6 causes of 213
data interpretation 101-23 nature of 212-13
data management 95-7 environmental epidemiology 211-17
data presentation 101-23 environmental health
guidelines for 122
assessment of exposure 213-15
data quality 89-93
definition of 211
data types 89-90
environmental health outcomes 212-13
death rates 16
detection of 212-13
demographic indicators (for South Africa) 22
types of 212
demographic processes 20-2
environmental investigations 199
demographic transition 22-3
environmental monitoring 214
demography 19-23
dental caries (measurement of) 255
EPI see Expanded Programme on Immunization
EPI30x7 sample 206, 275
dental epidemiology (measurements in) 254-60
deontological theories 27 epidemic (definition of) 196
dependency ratio 19, 22 epidemic curve 199, 200
depth interviews 177 epidemiological concepts, integrating 124-30
examples of use of 181 epidemiological triangle 11
descriptive statistics 108 epidemiologist, role of 6
descriptive studies 64, 66 epidemiology 5-6
objectives of 61 analytic 5
diabetes, late-onset (prevalence of) 167-8 definition of 5
diagnostic interviews 232, 233-4 descriptive 5
dietary data discipline of 5-6
calculation of 251 future of 8-9
collection of 248-51 in South Africa (history of) 6-8
interpretation of 251-2 ofa disease 5
dietary histories 250-1 Epidemiology Society of Southern Africa (ESSA) 8
dietary studies 248-53 equity 264
examples of 252-3 ESSA see Epidemiology Society of Southern Africa
individual level 248, 249 essential national health research (ENHR) 5
290
index
ethical principles (in research) 27-34 contrasted with perfect competition situation 263
ethical reasoning health care system 148
ethics and 26-7 economic evaluation of 264-9
examples of ethical dilemmas to stimulate 42 ways of looking at 151
process of 28 health economics 261-72
ethical review (of research) 33-4 health indicators 23-4
ethical theory 27, 28 key mortality rates used as 189
ethics 25-6, 96 proposed by WHO 24
and data collection 93 selection of for monitoring population health 24
and ethical reasoning 26-7 sources of data for 23
and occupational epidemiology 221 uses of 23-4
examples of ethically unacceptable research 26 health insurance 262
guidelines for action 96 health services data
in clinical trials 70 sources and limitations of 135, 137
issues of in epidemiological research 314 study of in provinces 138-9
ethics review committees 33 health status indicators 269
evaluation 147 health system 147-8
Expanded Programme on Immunization (EPI) 158-60 Health Systems Research (HSR) 147-57
sampling method used in 206-8, 209-10 approach of 152
experimental studies 64, 66, 69-71, 217, 219 characteristics of 152
exploratory data analysis 103-8 examples of questions for 153
exposure, more than two levels of 118-19 examples of studies in 155-7
exposure estimation 213-15 integrated view of 151-3
methodology of 152-3
F tips for conducting 153-5
health-related behaviour 169 ’
face validity 92 height for age 243-4
family planning services (REA of) 162-3 hepatitis B reporting system 146
feedback 33 histogram 106
fertility 20 hospital care (observations of levels of) 184
fertility rate 16 house-to-house sentinel population investigation 208
field supervision 94 HSR see Health Systems Research
field trials 70 hypothesis 62
fieldworkers (in community studies) 165-6 generating a 200
fluorosis (measurement of) 259 testing a 200-2
focus group interviews 177-8 hypothesis (significance) testing 114-15
follow-up studies see cohort studies
food consumption studies see dietary studies
G ICIDH see International Classification of Impairment,

Disability and Handicap
general fertility rate 20 IFCH see Institute of Family and Community Health
geographic variation, determining 190 immunization coverage, estimating 205-10
geometric mean 110 analysis 209
Gingival Index 257 comparison of three methods used in 207
gingivitis (measurement of) 256-7 examples of 209-10
Gluckman Commission see National Health Service in peri-urban settings 210
Commission (1944) measurement considerations 208-9
graphical display (of data) 103-8 using community-based surveys 206-8
categorical variables 106-8 using routine data 205-6
honesty in 110 impairment, classification of 224-5
numerical variables 103-6 implementation (of epidemiological findings) 40-2, 222
growth patterns see anthropometric studies facilitating 41-2
growth rate (population) 20 obstacles to 41
IMR see infant mortality rate
incidence 15-18
incidence density see incidence rate
handicap (classification of) 224-5 incidence rate 17
haphazard sampling see non-random (haphazard) sampling incident cases 15-16
Health Belief Model 169 counting 18
health care inconsistent validity 93
components of 148 Index of Orthodontic Treatment Need (IOTN) 259
economic objectives of 262-4 index person 90
elements which can be evaluated 150-1 indices
goals of 149-51 for dental caries 255
health care delivery (stages of) 148-9 for dental diseases 254-5
health care market 262 for fluorosis 259
291
for gingivitis and periodontitis 256-8 258-9

for malocclusion 259 Mamre Community Health Project
for periodontal health 256 community participation and 43-4
individual interviews 84 research strategy used in 239-41
infant mortality (profile of) 168 matching 68-9, 202
infant mortality rate IMR) 20 measles immunization (REA to assess) 163
indirect method of estimating (example of) 193-5 measurement 82
used as indicator of health 189 in dental epidemiology 254-60
information bias 128-30 in disability studies 226-7
informed consent 31, 32 in occupational epidemiology 219-20
inputs (health care system) 149, 265-8 in psychiatric epidemiology 231-8
Institute of Family and Community Health (IFCH) 7 of exposures 232-6
International Classification of Impairment, Disability and of outcomes 231-2
Handicap (ICIDH) 225 . sources of error in 90
intervention studies 219 validity of 236-8
examples of 184 measurement bias 127
see also experimental studies measurement instruments 82
interviewers 93 factors affecting choice of 222
IOTN see Index of Orthodontic Treatment Need measures of strength of association 116-18
Ithuseng VHW evaluation 44, 45, 46 measures of variability 111
median 105, 109
microbiological investigations 199
J
migration 22
misclassification 128-30
judgement sampling 78
differential 129-30
non-differential 128-9
K missed opportunities studies 162
mode 110
KAP gap 169 molecular epidemiology 199
KAP surveys 169-70 morbidity data, sources and limitations of 134, 136-7
KAPB surveys 169-75 mortality 20-2
characteristics measured by 171-2 mortality coding practice (study of) 195
criticism of 172-4 mortality data, sources and limitations of 134, 135-6
description of 170 mortality rates 16, 136
examples of surveys 174-5 determining 187-93
history of 169-70 mortality studies 187-95
issues specific to 172 examples of 193-5
predicting practice with 172-3 determining trends over time 190-2
problems of in developing countries 174 in South Africa 193
reliability and validity of 173 issues in 187-93
simplistic approaches to ] 73-4 mortality surveillance 161
steps in conducting 170 multiple linear regression 120-2
uses of 170 multiple pathways 213-14
knowledge, measurement of 171 multiple sources 213-14
multivariate analysis 120-2
L
N
laboratory-based surveillance 143
legal considerations 96 : National Health Service Commission (1944) 6-7
Legionnaire’s disease investigation 204 natural history of disease 13-14
life expectancy 20-2 networking see snowballing (networking)
life table analysis 189, 192 non-random (haphazard) sampling 78-9
literature Normal (Gaussian) distribution 113
finding the 54-5 notifiable disease reporting 142-3, 177
reading the 55 notifiable diseases 143
systematizing and summarizing 55 null hypothesis 114
literature review 54-5 numerical variables 102, 108-11
for outbreak investigation 198 and honesty in graphical display 110
writing up 55 graphical display of 103-6
logistics (of occupational epidemiology) 222 nutritional indicators (for children) 243-4
Lot Quality Assurance Sampling (LQAS) 160, 208 nutritional status see anthropometric studies; dietary studies
LQAS see Lot Quality Assurance Sampling
Oo
mailed questionnaires 84 observational studies 64-5, 66

malaria control programme 144 occupational epidemiology 218-23
malocclusion and dentofacial anomalies (measurement of) ethics and 221
292
index
etcee
examples of study in 222-3 psychiatric epidemiology in 238-41

implementation of findings in 222 roots of 6
methodological issues in 219-21 primary prevention 14
resources and logistical issues in 222 primordial prevention 14
unique aspects of 218-19 principle of autonomy 28-9
odds ratio 117 principle of beneficence 29-30
open-ended questions 83 principle of justice 30-1
oral health (trends in) 259-60 principle of non-malevolence 29-30
oral (mucosal) disease principle of respect for persons 28
classification of 258 probability distributions 113
measurement of 254, 258 probability sampling see random sampling
outbreak (definition of) 196 processes (delivery of health care) 149
outbreak investigations 196-204 proportional mortality 136
describing the outbreak 198-9 Proportional Mortality Ratio (PMR) 190
examples of 204 proportional stratified sampling 76
person, time and place 199 proxy consent 31
preliminary assessment 198 proxy response 90
preparing to investigate 198 psychiatric epidemiology 230-42
step-by-step approach to 197-204 example of research study 242
outbreak surveillance 144 in practice 239-41
outcomes (health system) 149 in primary health care context 238-41
outputs (health care system) 149, 268-9 issues of validity of research in 236-8
over-reporting 212-13 unique aspects of 230
psychiatric illness
P example of study of 242
exposures associated with 232-6
paired observations 118 levels and filters in 239, 240-1 *
PAR see Peer Assessment Rating Index public health 4-5
parameter 111
research 5
publish (obligation to) 33
parametric versus non-parametric methods 115-16
purposive sampling 77
partial disclosure 31
p-value 114, 115
participant observation 178-9
PYLL see potential years of life lost
participatory research 35-7
passive surveillance 141-2
problems in 144-5 Q
PDI see Periodontal Disease Index
Peer Assessment Rating Index (PAR) 259 QALYS see Quality Adjusted Life Years
periodontal disease (measurement of) 257 qualitative data 176
Periodontal Disease Index (PDI) 257 qualitative methodology 176-84
periodontal health 256 advantages and disadvantages of 181
PHC see primary health care examples of use of 181-4
Pholela 6 issues regarding 1277-81
pictionnaire 45 philosophy underlying 176
pie charts 106-8 uses of 176
pilot study 89 qualitative methods 176
Plaque Index (PI) 256 qualitative research 176
PMR see Proportional Mortality Ratio advantages and disadvantages of 181
examples of 181-4
point prevalence see prevalent cases
in the workplace 220
policy development 148-9
methodological issues in 179-81
cycle 41
methods of data collection for 177-9
population composition 19-20
outline of process (example) 182
population data, sources and limitations of 134, 136 uses of 176
population parameters, estimating 111-16 Quality Adjusted Life Years (QALYs) 269
population pyramid 19-20, 21 quality health care 147, 149-51
potential years of life lost (PYLL) 190 quality of research 33
practice, measurement of 171 quartiles 105-6
preceding birth technique 161 questionnaires 82-9
precision 124 analysing 86-7
predicted validity 92-3 development 83-9
prediction see hypothesis for KAPB surveys 172, 173
prevalence 15-18 for outbreak investigations 202-3
rate 17, 17-18 formulating questions for 83-6
prevalent cases 17 layout and design of 86, 172, 208
counting 18 pictionnaire 45
prevention of disease 12-15 piloting 89
at different stages 14, 15 used in psychiatric epidemiology 233-4
primary health care (PHC) Quetlet index see Body Mass Index
293
analysing 116-18
R measures of 15-19
road to health card 245
random number table 276
routine surveillance 142-3
random sampling 75-8
problems in 145
randomized controlled clinical trials 70
routinely available data
range 111
examples of use of 137-9
rapid environmental health assessment 160-1
for estimating immunization coverage 205-6
rapid epidemiological assessment (REA) 158-63
mortality data 135-6
broad types of 158-62
types, sources, and limitations of 134-5
family planning services study 162-3
using 133-5
measles immunization study 163
uses of 158
rapid ethnographic assessment 160 Ss
rates 15
calculating 15, 16 sample 74
comparison of 18-19 sample size 80-1
key 16 estimates yielded by increasing 125
REA see rapid epidemiological assessment formula for 275
recall bias 127 statistical significance, strength of association and 118
relative mortality rates 188-9 study design, precision of estimates and 124-5
relative risk 116-17 sampling 74-81
reliability (measurement) 90, 91 bias in 79-8, 127
improving and evaluating 92-3 EPI 206-8
of KAPB surveys 172, 173 example of strategy to eliminate bias in 81
report writing 98, 203 for disability studies 225-6
research 3-4 in community studies 164-5
applied 3 in KAPB surveys 172
basic (fundamental) 3 in occupational epidemiology 219
country-specific 5 in qualitative studies 180
global health 5 LQAS 208
public health 5 non-random (haphazard) 78-9
reasons for conducting 4 random 75-8
research diaries 179 techniques for REA 158-62
research hypothesis 62 sampling frame 75
research objectives, setting 56-63 for community study 164
examples of 62-3 sampling units 75
specific 61-2 for community study 164
research problem, identifying and analysing 56-8, 59, 60, 61 scales of measurement 86
research process 49 scatter plot 121
bias in 51-3 screening techniques 226-7
planning phase of 49-53 secondary prevention 14
researchers’ patterns of emotions during 53
selection bias 127-8
research project, probable stakeholders in 166
self-administered questionnaire 82-3
research protocol see study protocol
sensitivity analysis 270-1
research questions 64
sentinel surveillance 143-4, 161
developing 83-6
problems in 145
for health systems research 153-4
formulating 56-63 shadow pricing 268
in disability studies 226-7 significance level 114
in KAPB surveys 171 simple linear regression 120
methods of questioning 82-3 simple random sampling 75, 77
research report Simplified Oral Hygiene Index (OHI-S) 256
qualities of a good 98 smoking (KAPB survey regarding) 175
structure of 98 SMR see standardized mortality ratio
research team 49-50 snowballing (networking) 79
- for outbreak investigation 198 social dental indicators 259
researcher control 64-5 Social Learning Theory 169
researchers, emotional state of 53 social psychological theories 169
(see also fieldworkers; research team) socio-cultural context 174
resources 96 special surveillance programmes 144
for occupational epidemiology 222 problems in 145
respiratory disease specific rates 18
health survey 44 specificity 221
occupational study 222-3 specimens (use for other purposes) 32-3
Rifkin’s framework 38 spot map 199, 201
risk difference 117 standard deviation 111
risk factors 10-11 standard error 112-13, 124
294
index
standardization (adjustment) 19, 273-5 ' tally sheet 46

direct 273, 274 target population 74 ;
indirect 221, 273, 275 in environmental epidemiology 216
of observors and instruments 273-5 targeted sampling 79
of rates 188, 273 TB (tuberculosis)
standardized mortality rates 188 photo novel 182-4
standardized mortality ratio (SMR) 188-9 study of patient compliance in treatment 137-8
statistic (definition of) 111 telephone surveys 84
statistical concepts 101 tertiary prevention 14
statistical hypothesis 62, 114 theoretical memos 180
statistical review, checklist for 123 total fertility rate 20
statistical significance 115, 118 training 93-4, 180-1, 208-9
stem and leaf plot 104-5 translation 93, 174, 237
stratification 119-20 24-hour recall 250
stratified cluster sampling 77 Typeland Il error 114 \
stratified random sampling 75-7 a
strength of association 118
structured interviews 83
U
study
implementation objectives of 62 under-5 mortality rates 20, 189
statement of purpose or aim of 58-61 under-reporting 144-5
study base 65-6 in environmental studies 213
study design 64-73 in hepatitis B notifications 146
and comparison group 65 unstructured interview 83
and research control 64-5 urbanization 22
and research questions 64 utilitarian theory 27
and study base 65-6
and temporal sequence of exposure and outcome 65 Vv
commonly used 66-73
considerations 125-6 vaccination programme for hypothetical disease 272
for disability studies 225 vaccination status 209
in environmental epidemiology 216-17 vaccine efficacy, estimating 205, 206
in occupational epidemiology 219 validation 227
in psychiatric epidemiology 231 validity 124
matching research topics to 72-3 and bias 126-30
study protocol 50-3 value (of human life) 268-9
checklist for a 52 variables 101-2
contents of a 50-1 types of 101
major headings in a 50 verbal autopsies 161-2
study (target) population see target population
study types 66 WwW
matching research topics to 72-3
summary rates (mortality) 187-8 weight for age 243-4
summary statistics 108 weight for height 243-4
surveillance methods for REA 161-2 (see also disease ‘why game’ 56-7
surveillance) written versus oral consent 32
survey of records 84
systematic sampling 78
T
table shells see dummy tables
295
ec gorge
aw no Tee Aa)
oe (> WAN Bemsur lapis marly
4? o , We ta atrapt Msriad
J =- i" "a
i£ cout b te ai plana} 7 M~ p< ane er gy
Lavine
7
» SPO ‘
Somehan me r
=
= “Wy 2 Rest eye as - a
eT! Wagir bes ey a *e
an hee: i fe wPRCLin: Pe Wesrae ip B28 ofcata Yo

Se Gee 8ews-eseagk
ba
. 2
ees F Uf as ~eyaatle
(ine THVT: Sly : re «+ * SP ope, erreantsof ena, sam
ss hi gD
ages = ‘a ayigen: fea mLab A Bram govt So leap peidagitet Sarge
aiat “ derBx iy 7 Greeti ai
( ne ' ; 18 87to
; Inia wonay req iniaistebe
f >
tahonitneltig
Sptigi att 4
i tise fh) a ee ; ou lesen
+,
- FB yatveclie Retsil a me : :
Sémrthiesascller Al SO indy 9 eh “ograyeti «5
r + Papal 6 Pale: petals. aac.g obo knceu. dopenevehans
> ye ny ee
j : : ere b 4 7 wtp -. <4 - Mizo l esp cones bine
; ' a { nl DS vem vibettebray
oases
Wate ae Y en aon iso
ieee RR
ae (g}o apres
eee
SE
HMA OL $6 Wee Hei fe = / 2 7S! enotasrtenee
Moa gES ig Biles Eis” ane _ 5S wel
albayiitietealD
204. “yee chien > 7 » aR: ishige cmmseiee | |
+ ey | sg Bee der apbooisqunaant:
ta) yuu - a a a" Et qe temaiiqe sluelinyegal
fil: |ets4 Tuy . “alr 3 - GST aronjor damesintgecietam
e - ee eis? mi ~ nee widest bake levotonglate |
- ; ei dabtiny aan ae E SResisteliioora
= ae a See - 5-02 6 Bosteerer
Tee? et 2 PMT oh aetna whe
sed aoe nea dtp Seeing Aiea Si S eee ccm Crepes)ea
eo et ‘ ae «(eee oe meerer ae
ee eT ren scoreear gD
ou
a ¥ ee <¢ ova f nicest ‘hake 7

ek
hae Soa nbs Aten pet Tempore2-3 607<otinhrane
a ~t Pa ely t' th io a
hoe iumilye <= By 2-401 MIA so}ebociem

pee xi ‘
PDA 5 ea ny 1a p—@, S44 ; (ooraiienats:
qa ier. fi) ieges) hs a
tr ets ta a es Heine n
tywow
EB aot
Sens Lae neat eh
ea aeeeeeienaineeanedinnacent ee me
segannoemn
peor tae : a ;
: el + ae : } ¢
nt See gt eee J
ma aa Wearguacaale eile Si tm ae
6. LA ays @ ~@, %
ee ee | i E . a Sp
7 ar 4 sgoultng A =
ae Dba ial mr een VS y +
ee an, Se
: — 5 .]
rts pee oa Abe oe yy + - q
or eee) “ ? ‘ . ¥
away aa @ muetiay +s) ef .

Ss
ehiiees)
ane
& ‘ , 7
> “he. _ = 7
rity Sri
i ee rn - rey:
re, ” 2G . 7 “i _
ah iatta avietog 145 7 Sy
yelp ie tsi fee, * ros
alte, Y Rog 1) he
= e
i Pe ‘ :
7 4 : er
4 «eee - ——— Tt awe i —_— -i
wu
Health care workers and medical students often find epidemiology
dauntingly abstract, even though it provides a vital background to
the work they do. This comprehensive manual provides an acces-
sible introduction to epidemiology and its application in primary
health care. The basic principles and tools are clearly explained and
applied to a wide range of specific topics — from psychiatric to
environmental epidemiology.
This updated manual offers:
ae ereninnnurtehielte guidance on the skills necessary

to demystify epidemiological research and study
» a Strong emphasis on the practical application of
epidemiology to field situations
» examples of research that highlight local health issues
the multidisciplinary approach and knowledge of a
wide range of South African experts
clear guidelines on how to write protocols and
manage statistics
numerous helpful figures and graphs.
This clear and accessible book will be invaluable not only to medical
students and professionals, but also to all others interested in
epidemiology, or in current and future health trends in South Africa.
ISBN 0-19-571308-7
9 "780195 il

Epidemiology A Manual For South Africa

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Epidemiology A Manual For South Africa

Uploaded by

Copyright:

Available Formats

EPIDEMIOLOGY

A MANUAL FOR SOUTH AFRICA

ann aren een em enee

Oxford is a trade mark of Oxford University Press

Epidemiology: A Manual for South Africa

All rights reserved. No part of this publication may be reproduced,

Published by Oxford University Press Southern Africa,

Section A Contextualizing epidemiology

Section B Epidemiological research methods and

Section C Data presentation, analysis, and interpretation

Section E Epidemiology applied to content areas

APPENDIX 1: Standardization (adjustment) of rates 213

Key concepts in epidemiology

General philosophical iissues in

knowledge if repeated attempts to refute it fail. Public health (community health)

Causation of disease ¢ age of onset of hepatitis B virus infection below

are wee Mee acm

Agent(s) Human host

Balance upset by Balance upset by

Adapted from: Leavell H R, Clark E G, 1965.

rs Cer eRe lem ll oeg

a) No prevention b) Secondary and c) Primary prevention

Figure 2.3 Framework for levels of prevention

ONSET OF DISEASE DISEASE OUTCOME

PRIMORDIAL PRIMARY PREVENTION SECONDARY PREVENTION TERTIARY PREVENTION

target lifestyle @ health @ immunization @ screening @ arrest of prevention ¢@ retrain

Table 2.1 Components of a rate: numerator and denominator

the disease rate = 2 i ———— numerator

= 20 per 1000 per year

Table 2.2 Key Ce

Table 2.3 Difference pee rates of incidence and prevalence

Numerator = new cases Numerator = all cases

Denominator = healthy population at risk Denominator = total population at risk to be a case

Sum of person-time of ppeeneian

Measured over a specified time period Measured at a point in time

Prevalence is commonly referred to as prevalence

Sr re CCR Rec mc Cea

Source: Mazur RE, 1995

Table 2.4 Selected demographic indicators for South Africa

Depend- % Female Children Mortality rates life

SA Total 66,7 22 4,30 81 38 116 57,8

*15-19 year old age specific fertility rate

they often reflect socio-economic development,

Selection of health indicators for monitoring

Ethical aspects of epidemiological The increased awareness of the importance of

Figure 3.1 Process ofethical reer y4

General approach Example: Practical use of approach

Ethical theory Utilitarian approach

Ethical rules Obtain ietruth

Figure 3.2 Application of ethical principles to health research

Respect for persons Beneficence Justice

Autonomy Maximization of Non-malevolence Beneficence Right of person Duty of other

@ assessment of competence minimize negative effects on @ benefits of the research must

Conclusion tries, where communities were simply passive

Table 3.1 Common approaches to community participation in health

Approach Initiation Role of Role of Strategies to Orientation

Public Existing Full responsib- None, passive Environmental Completely

Activists and Dominant and Mobilization of Demand redis-

grammes. Approaches to participation differ Participatory research

suggest three main criteria for evaluation. The

dependency scale with each item to incorporate

Initial needs assessment Formation of committee

Leadership of health committee Resource mobilization