Professional Documents
Culture Documents
Epidemiology A Manual For South Africa
Epidemiology A Manual For South Africa
Edited by ee
J]M Katzenellenbogen
G Joubert
SS Abdool Karim
Neen eneen
neem!
OREGON HEALTH SCIENCES UNIVERSITY
LIBRARY
SEP 24 1999
PORTLAND, OREGON
) Epidemiology :
A manual for South Africa
Editedby JM Katzenellenbogen
G Joubert
SS Abdool Karim
Cape Town
OXFORD UNIVERSITY PRESS
SOUTHERN AFRICA
Ugo
Oxford University Press Walton Street, Oxford ox2 6 pp, United Kingdom
Offices in
Oxford, New York
Athens, Bangkok, Calcutta
Cape Town, Chennai, Dar es Salaam, Delhi
Florence, Hong Kong, Istanbul, Karachi, Kuala Lumpur
Madrid, Melbourne, Mexico City, Mumbai
Nairobi, Paris, Singapore, Taipei, Tokyo, Toronto
and associated companies in
Berlin, Ibadan
List of contributors
Acknowledgements
How to use this book
Foreword
Preface
Max Bachmann, MB BCh, DOH, MSc, FFCH, Dale McMurchy, ma, Health Economics Unit,
Department of Community Health, University of Department of Community Health, University of
Cape Town Cape Town
Donald Berry, ms Bch, University of the Carol A Metcalf, mB BCh, BSc (Hons), MPH, Department
Witwatersrand of Epidemiology and Human Papillomavirus
David Bourne, Bsc, B Phil, Department of Research Group, University of Washington, Seattle
Community Health, University of Cape Town Mohamed H Moola, BDs, LDSRCS, DDPH, MSc,
Debbie Bradshaw, Bsc (Hons), MSc, D Phil, Faculty of Dentistry, University of the Western
CERSA, Medical Research Council Cape
David Coetzee, BA, MB BCh, DTM&H, DPH, FFCH, Jonathan E Myers, Bsc, MB BCh, DIM&H, MD,
Department of Community Health, University of Department of Community Health, University of
the Witwatersrand Cape Town
Nicol Coetzee, MB BCh, FFCH (SA), DIM&H, DA, Charles D H Parry, B Soc Sci, BSc Hons, MSc, MA, PhD,
Department of Community Health, University of National Urbanisation & Health Research
Cape Town Programme, Medical Research Council
Judy M Dick, Ba, B Soc Sc, MSc Med, PhD, National Linda M Richter, php, CERSA, Medical Research
Programme for TB Research, Medical Research Council
Council Leslie Swartz, MSc, PhD, Child Guidance Clinic,
Athmanundh Dilraj, Bsc, BSc (Hons), M Med Sc, Cape Town
CERSA, Medical Research Council Donald H Skinner, B Bus Sc, B Soc Sc (Hons), MA,
John S S Gear, BSc, MB BCh, DTM&H, DPH, D Phil, FCP, Trauma Centre for Victims of Violence & Torture
Wits Rural Faculty, University of the ’ Elmar Thomas, MB BCh, B Soc Sc (Hons), MA,
- Witwatersrand Department of Medical Anthropology, University
Margaret N Hoffman, MB BCh, BSc (Hons), of California, Berkeley
Department of Community Health, University of Stephen M Tollman, Bsc, MB BCh, MA, MPH, Health
Cape Town Systems Development Unit, Department of
CarelB IJsselmuiden, MD, FFCH, MPH, DPH, DTM&H, Community Health, University of the
Department of Community Health, University of Witwatersrand
Pretoria, CERSA, Medical Research Council Yasmin E R von Schirnding, BSc, BSc (Hons), MSc,
Georgina Joubert, Ba, MSc, Department of PhD, Greater Johannesburg Transitional
Biostatistics, University of the Orange Free State Metropolitan Council, Department of
Salim S Abdool Karim, MB BCh, MSc, Dip Data, FFCH, Environmental Health, Johannesburg
M Med, CERSA, Medical Research Council Hester van der Walt, Dip Nursing, Midwifery,
Judith M Katzenellenbogen, Bsc, BSc (Hons), MSc, Paediatric Nursing, Community Health, mea,
CERSA, Medical Research Council CERSA, Medical Research Council
Louise Kuhn, MA, MPH, PhD, Gertrude H Sergievsky Petro Wolmarans, Msc, National Research
Center, Columbia University, New York Programme for Nutritional Intervention, Medical
Marietjie L Langenhoven, Msc, National Research Research Council
Programme for Nutritional Intervention, Medical Derek Yach, MBBCh, BSc (Hons), MPH, Policy Advisory
Research Council Coordinating Team, World Health Organization
Catherine Mathews, msc, CERSA, Medical Merrick Zwarenstein, MB BCh, MSc(Med), MSc,
Research Council, Department of Community Health Systems Division, CERSA, Medical
Health, University of Cape Town Research Council
acknowledgements how to use this book
We acknowledge the substantial contribution of This book is divided into sections, each of which
Derek Yach both as co-editor of the 1991 edition addresses a particular aspect of epidemiology.
of the Introductory Manual for Epidemiology in Sections A, B, and C explain the theory and
Southern Africa, as well as for conceptualizing the principles of epidemiology. Here the basic
idea of producing such a manual. abstract ideas and skills of epidemiology are
We thank Neil Cameron, Margaret Hoffman, and described in a way that is designed to be clear and
Marian Jacobs for contributing to the epidemiologi- accessible not only to medical students and
cal manual used in the 1987 workshop at the annual professionals, but also to others who are interest-
Epidemiological Society of Southern Africa con- ed in epidemiology. You will need to read all three
ference; Debbie Bradshaw, Yusuf Chikte, Nicol of these sections, preferably in order, to under-
Coetzee, Di Cooper, Rudi Eggers, Margaret Hoffman, stand how epidemiology works, and before you
Sarie Human, Rod Jackson, Marian Jacobs, Trefor can go on to make full use of Sections D and E.
Jenkins, Neil Myburgh, Sue Naidoo, David Rees, Sections D and E constitute the ‘applied’ part of
Robert Schall, Donald Skinner, Malcolm Steinberg, this manual; they show how the theory and prin-
Krisela Steyn, Mary Lou Thompson, and Ruth ciples explained in the first three sections can be
Watson for reviewing parts of the manual, Tryphina applied in different subject areas of health. So
Masuku, Amelia Mulder, Wynda van Eyssen, and these last two sections demonstrate what
Nazneen Ally for typing, administration, and co- happens when epidemiology is used in specific
ordination, Jenny Altschuler and Sharoni Cohen for disciplines and ‘in the field’. Here you can
selected graphics, Helen Moffett for her valuable browse, choosing to focus on whatever particu-
editorial contribution and guidance, and finally the larly interests you. Or you can look up various
many students who attended our various basic subject areas to get an idea of how epidemiology
epidemiology courses and urged us to produce this can be used in the various disciplines that you
book. study or practice.
Copyright acknowledgements can be found on This manual also introduces and explains the
p. 288. technical terms used in epidemiological study
This book was made possible by the financial and practice. The first time that a term appears in
support of the Medical Research Council. the text, it is defined. If you are browsing and
would like to check the definition of a term, or if
you need to look up a term, turn to the index. The
first page number that is listed for a term will have
MEDICAL RESEARCH COUNCIL the definition of the term (which will be marked
CERSA in bold for easy identification) on that page.
The bibliography contains both references and
further reading for those who wish to explore
topics in more detail.
foreword
‘Catch-up growth’ describes the accelerated the apartheid state began to break down, a
response of a stunted child once the adverse number of gifted young epidemiologists sought
circumstances that caused the stunting are training both within the country and abroad. A
removed. This phenomenon serves as a metaphor second and simultaneous need is to transmit to
for what must happen in the new South Africa ifa all public health workers the basic elements of the
new public health is to break out of the caked and discipline needed to work in the field. To meet
soiled shell that encased the country in the this need, the new generation of epidemiologists
apartheid era. While the effects on the population must spread the knowledge they have acquired
will be felt for generations to come, much that is beyond the narrow bounds of those with whom
new will have to be done to ensure long-term they can make personal contact. With this book,
good health for all South Africans. they can begin the process.
Public health and its basic sciences were among This book takes an essentially practical
the fields most starved during the apartheid era. approach that should make it accessible to
This impoverishment of South African epidemiol- professional health workers. It incorporates
ogy and public health must be dealt with in this substantial applications of epidemiology in the
new and hopeful era. More than that, it must be primary health care arena, reinforcing the fact
quickly remedied. Public health is a key element if that the discipline must make a significant contri-
the historical impoverishment of the whole bution to the planning and evaluation of public
society is to be undone. Public health in South health care programmes. Furthermore, it takes a
Africa therefore has a mission, which can not be step towards raising the level of understanding of
effectively pursued without rebuilding and the results of epidemiological studies so central to
_ deploying the academic disciplines on which the policy analysis.
field must rely. Not only does South Africa need a new cadre of
Epidemiology, according to a recognised defin- epidemiologists, but the incorporation of epide-
ition, is the study of the distribution and the miology training for every kind of health worker.
determinants of states of health in populations, This book represents an important beginning in
with the object of preventing and controlling ill the education that is essential to create a healthier
health. There can be no rational and effective South African future.
public health without a foundation of epidemio-
logicai understanding. It is therefore important to Mervyn Susser
grasp the scope of epidemiological knowledge: Editor: American Journal of Public Health
how it is acquired and evaluated, what uncertain- New York
ties and likelihoods attend it, and how it is tested
and advanced. Rachel Gumbi
A first need is thus to build a new cadre of Chief Director: Human Resources Directorate
epidemiologists. South Africa is fortunate that in Department of Health
the years since the mid-1980s, as the isolation of Pretoria
* ** *
preface
Over the last decade and a half, epidemiology has A chance meeting in August 1994 with a repre-
undergone a sustained resurgence in South Africa. sentative from Oxford University Press resulted in
Increasingly, there has been a recognition of the the production of a fresh version of the manual.
need for epidemiology to provide a research base The new editorial team took note of the criticisms
that can inform policies to transform health in the and comments received from users of the manual.
country. With the co-operation of the contributors, the text
Epidemiological skills in South Africa, however, was revised and extended, and the examples and
were few and the Epidemiological Society of information updated to produce a new book. We
Southern Africa, recognising the broad education- are grateful to all the contributors who, despite
al needs for the discipline, organised training their busy schedules, made the effort to improve
workshops both regionally and nationally to their sections or write new sections.
address this need. Workshops, organised in In this book, as with the first manual, we have
tandem with annual Epidemiological Conferences, aimed to give a comprehensive introduction to
have been particularly successful and have epidemiology, with a specific South African
reached a wide range of health professionals inter- flavour, using local examples and highlighting
ested in epidemiology. The demand for copies and local issues. An effort has been made to further
reprints of the manuals and booklets produced for simplify the language to make the text accessible
participants of these workshops prompted us to to those whose first language is not English. We
produce a more comprehensive manual which have avoided complex formulae as much as
incorporated aspects of earlier material, but with a possible, noting that the manual does not replace
broader epidemiological scope, to coincide with a good introductory statistical text. We have dedi-
the 10th Annual Epidemiological Conference held cated a chapter to qualitative research as well as
in Cape Town in July 1991. citing uses of such research throughout the book
A wide range of contributors ably provided the to highlight how both qualitative and quantitative
material for publication and the technical produc- research are useful and often complementary
tion was done by staff of the Medical Research tools to investigate public health issues.
Council. The manual was launched at the confer- This manual targets the community of health
ence, and complimentary copies were sent to workers — and interested members of the public
institutions where epidemiology was being — who have little or no experience of epide-
taught. Lecturers and other users of the manual miology. We felt that it was important that this
were asked to evaluate it. book meet the needs of all health workers (such as
Increasingly, the manual was being prescribed doctors, nurses, and physiotherapists) who are
for epidemiology courses at universities, nursing starting their first research projects. In order to
colleges, technikons, and since 1992 at the newly meet this need, this book stands as a comprehen-
developing Schools of Public Health. By mid- sive text on epidemiology while retaining a pract-
1994, the 1 000 copies had run out and the editors ical, down-to-earth approach.
had dispersed geographically. The future of the
manual was uncertain, in spite of an obvious Judy Katzenellenbogen
need for a basic local epidemiological text and the Gina Joubert
very positive response by users of the manual. Salim Abdool Karim
* K K *
SECTION A
Contextualizing epidemiology
Introduction
J pay eter negen, JGear, STollman
When you study epidemiology you will find the own ‘bag of research tools’ which can be used to
textbooks are full of surprising facts. Do you know further knowledge in the field.
who described the benefits of fresh fruit in the The research methods and findings have to be
prevention of scurvy? It was Andrew Lind, an open to scrutiny by others and peer review of
officer in the British Royal Navy. What has the research is an essential aspect of the research
Broad Street pump got to do with cholera? It was process.
the source of contaminated water during Broadly speaking, there are two types of
London’s cholera epidemic of the mid-19th research: basic (fundamental) and applied
century. John Snow was the man who recognized research. While both types of research lead to the
the hazard and took the handle off the pump. creation of new knowledge, basic research gener-
Birth and death statistics have been collected for ates fundamentally new knowledge and tech-
centuries and recorded in church records. It was nologies requiring the involvement of highly spe-
Florence Nightingale in Britain who promoted cialized academics from the basic sciences. Basic
their use as a planning tool. Today, birth and mor- scientists are less interested in what can be done
tality data are indicators of changing circum- with the new knowledge: they really pursue the
stances in countriés and communities, and are knowledge for its own sake and may spend years
important weapons in the epidemiologist’s fight working before their research efforts bear fruit.
against disease. Applied research, on the other hand, investigates
South Africa too, had its early pioneers who current problems, and uses existing knowledge
contributed significantly to the control of disease and technology to address the problems.
through critical observation and analysis. It is In general, knowledge and understanding are
essential to develop these characteristics if you advanced by attempts to formally test a proposed
want to become an epidemiologist. It is like being theory. The way theories/hypotheses have been
a detective; the clues are always there. Your chal- tested has changed over time. Ways of establish-
lenge is first to find them and then to interpret ing causal relationships have been debated
what they mean. throughout the history of scientific philosophy.
So what is the essence of epidemiology and Currently, there are two main schools of thought
how can it make a difference to people’s health? on the growth of knowledge. Verificationists
To answer these questions we need to start by support the concept of new research findings
considering what research is and why it is done. being confirmed through repeated studies, that
is, new information is accepted when new
research findings are ‘sufficiently’ verified. Falsifi-
Research cationists, on the other hand, support the con-
Research is the systematic search or inquiry (‘re- cept that all knowledge is provisional until
search’) for knowledge. Scientifically acceptable refuted through research. This means that infor-
methods are used to investigate issues in order to mation, like existing knowledge, cannot be
arrive at valid conclusions. Every discipline has its proven conclusively but can become provisional
3
a contextualizing epidemiology
times, the field has been referred to as ‘commu- providers; and communities. The idea is not that
nity medicine’ or ‘community health’ in some priorities will be set purely at national level, but
countries. In South Africa the term ‘community rather that national priorities will be forrned from
health’ is generally used and the field is still an ‘upward synthesis’ of local, district and provin-
largely under the control of the medical profés- cial concerns.
sion. However, people involved in a wide variety In South Africa, the ENHR framework has been
of disciplines are centrally involved in the field, endorsed by a wide range of organizations and
including nurses, health inspectors, sociologists, institutions as an organizing framework well
psychologists, biostatisticians and epidemiolo- suited to local needs. An ENHR strategy can be
gists. expected to strengthen the capacity of the provin-
Public health programmes require a variety of cial and national departments of health to
skills to ensure adequate planning, implementa- address important yet previously neglected prob-
tion and evaluation of the effort. Core disciplines lenis.
are epidemiology and _ biostatistics; health
services evaluation and administration; beha-
vioural and biological sciences; occupational and
Epidemiology
environmental health; health education and Epidemiology is ‘the study of the distribution and
health economics. determinants of health-related conditions and
events in populations, and the application of this
Public health research study to the control of health problems’ (Last
The findings of the Commission on Health 1988).
Research for Development (1990) indicate the key Descriptive epidemiology describes the occur-
role of research in improving health, especially rence of disease and other health-related charac-
the health of disadvantaged groups. The commis- teristics in human populations in terms of person
sion concluded that health research is a vital (the who: age, sex, social class), place (the where:
investment in the future and that the seeds of geographic location) and time (the when: season,
research can yield a wealth of effective action. time of day/week).
The commission sees health research as serving Analytic epidemiologic studies investigate the
two main purposes. Firstly, country-specific association between a given disease or health
research is essential to determine health prob- status and possible causative or protective fac-
lems, analyse measures for dealing with them and tors. An attempt is made to explain why health-
help decide on actions to achieve health improve- related outcomes occur ‘and whether interven-
ments with limited resources. The second area, tions work.
global health research, relates to the development Epidemiology forms the research arm of public
of new knowledge and technology to cope with health, providing the scientific basis upon which
the major unsolved problems worldwide. This public health policy decisions are made. It pro-
category of research requires global co-operation. vides methodological research skills in order to
The commission called these two purposes — manage public health problems in a more effec-
country-specific and global health research — tive way.
‘essential national health research’ (ENHR): An important distinction to make is the differ-
‘essential’ because they address a country’s prio- ence between the terms ‘the epidemiology of a
rity health problems, and ‘national’ referring to particular disease’ and ‘epidemiology the disci-
the importance of research that is directly tar- pline’. The epidemiology of a particular disease
geted at local or national level problems. While describes the distribution and determinants of
every country should develop a strong capacity to that disease. So for example, the epidemiology of
undertake country-specific research, each coun- HIV infection demonstrates that young adults
try needs to consciously decide for itself the aged 16-30 years have the highest incidence rate
resources it can contribute to the global health of HIV infection, women have a threefold higher
agenda. risk of HIV infection and that approximately 25%
ENHR is, in effect, a strategy for setting prior- of children born to HIV positive women are
ities, and managing and developing a country’s infected with HIV.
resources for health research. It depends crucially Epidemiology the discipline describes particu-
on working partnerships between researchers; lar methodological tools/approaches used to
health service policy-makers, managers and gather information about the prevalence, distri-
5
a contextualizing epidemiology
bution, determinants and course of any condi- are surprised to learn that 50 years ago the
tion. It is epidemiology the discipline that forms country embarked on a series of PHC experi-
the content of this book. ments which gained the attention of practitioners
and researchers worldwide.
The role of epidemiologists This started in 1938 with a nutrition survey
Epidemiologists function at two levels, often which documented, for the first time, the health
simultaneously. On the first level, as ‘pure’ scien- status of South Africa’s African majority. The
tists, they try to find ‘truth’ or the real cause(s) of study, initiated by Harry Gear (then Deputy Chief
disease, encouraging sceptical debate about Medical Officer of the Union), Sidney Kark and
causal inference. At a second level they function Harding Le Riche, concluded that: ‘Diet defi-
as public health officials, where they advise others ciency diseases, syphilis, malaria, bilharzia,
involved in making public health decisions. In tuberculosis, scabies and impetigo, and prevent-
these public health roles, they and the public they able crippling ... form no small array of factors
serve must recognize the crucial need to act in contrary to the maintenance of good health ... No
such a way as to err on the side of public health. amount of juggling can succeed in separating the
This often means taking action in the presence of influence of one as opposed to the others ... The
scientific uncertainty and it also means that in- outstanding fact is that they are all preventable
correct decisions may be made. Epidemiologists ...’ (Kark 1994).
have a key role in ensuring that decisions Faced with this reality, and without a ready-
involving policies and practices are continuously made solution, in 1940 the government initiated a
evaluated and redefined in the light of new rural health unit staffed by Kark, his colleague and
evidence. wife Emily, and Edward and Amelia Jali. Within
five years, work at Pholela provided evidence for a
revolutionary model of health care provision
Epidemiology in South Africa (Slome 1962; Tollman 1994).
For centuries South Africans have been cared for What, then, was so extraordinary about
by herbalists. These skilled practitioners had, by Pholela? The work is impressively documented
observation and analysis, identified plants with and well worth reading (Kark 1962), but we single
therapeutic properties. If we were to learn more out three innovations:
of their ways and techniques, there can be no analysis of the health and health-related issues
doubt that we would describe some of them as the of groups or communities (community diagno-
ancestors of modern epidemiology. sis) parallel with individual patient assessment;
In modernizing, South Africa became exposed use of epidemiology as a tool for daily use, to
to diseases of overcrowding and urbanization. monitor, evaluate and inform health actions;
The mining industry heralded an explosion in recognition that social, cultural and environ-
respiratory disease. Pneumonia was a huge mental factors are crucial determinants of
problem; Sir Spencer Lister, one of the early direc- community health status.
tors of the South African Institute for Medical Re-
search, was a pioneer in pneumococcal vaccine We take the last point for granted, but have yet to
development. The 1918 flu epidemic led the fully incorporate our understanding of these
health planners of the day to lay down strict na- determinants to strengthen our health promotion
tional health measures to control and monitor the efforts, whether the problem is diarrhoea and the
spread of infectious disease. Their rules were use of oral rehydration therapy (ORT) or ischae-
guided by epidemiological principles. Siegfried mic heart disease and exercise programmes.
Annecke helped to dramatically reduce malaria in Using tuberculosis as an example, the near-
South Africa through his observation of the effi- legendary John Cassel gives insight to the under-
cacy of DDT in mosquito control. standing of local customs and values needed if
In the realm of primary health care epidemio- new ideas are to fit with existing cultural frame-
logy, South Africa was a world leader by the mid- works (Cassel 1955).
1940s.
From model to movement: Gluckman, the
Pholela and the roots of primary health care IFCH, and COPC
(PHC) Henry Gluckman, later Minister of Health,
Given South Africa’s painful history, many people chaired the National Health Service Commission.
6
1 _sintroduction
In 1944, this commission recommended a coun- on the epidemiology of rheumatic heart disease, a
trywide network of health centres as the basis for major cause of morbidity and mortality in urban
fundamental health reform in South Africa blacks, was limited. However, the definitive study
(Gluckman 1945). Their recommendation was was conducted by Margie McLaren, who per-
based on the Pholela experience, the pilot study. suaded the clinical cardiologists of Johannesburg
Forty health centres were established. Eight in to join her in a massive survey of Soweto school-
and around Durban formed practice-research children. This survey revealed many cases of
sites for an Institute of Family and Community advanced and previously undetected rheumatic
Health (IFCH). Linked to Natal University, the heart disease. Her work stimulated a number of
Institute trained staff for health centre teams, major attempts at rheumatic heart disease
including a new category — ‘health recorders’ — control.
to maintain community, household and family What about South Africa’s major killers; gastro-
records. enteritis, pneumonia, malnutrition, violence,
Although abruptly and prematurely curtailed tuberculosis, cervical cancer, cardiovascular
by the political events following the change of disease? Did epidemiologists pay enough atten-
government in 1948, such efforts caught the inter- tion to these priorities? Turning the pages of the
national imagination. They showed the link South African Medical Journal during this period
between community epidemiology, primary care tells us that they did not; information about the
and community health. Based on adaptations distribution, causation and control of many of
from many countries — the USA, Israel, Spain, these disorders remained inadequate. We have
Nicaragua, Britain — this approach became virtually no useful baseline information about
known as ‘community oriented primary care’ or how effective our interventions have been.
COPC. As an important research base for the Epidemiology and epidemiologists have let us
World Health Organization and Unicef’s ‘Health down — certainly in the mid-70s, priorities were
for All’ strategies, COPC continues as one of still with the exotic and the intellectually novel.
several broad strands making up the fabric of There are a few notable exceptions: in the areas of
contemporary primary health care (Tollman nutrition and iron metabolism, to mention just
1991). l two, epidemiologists were doing work which
‘Failure to build upon [such] early models cost would make an impact on South Africa’s health
the country dear, not only in financial terms but problems.
also,'and more importantly, in terms of prevent-
able death, disease and disability’ (Yach, Tollman Epidemiology in the 1980s
1993). After all this time, the work done in places Until the 1980s, much of South African epidemio-
such as Alexandra (Alexandra Health Centre logy had been targeted at disease description, had
1991), Mamre (Mamre Community Health been conventional in its adherence to a biomed-
Project 1988), Gelukspan (Bac 1990) and else- ical paradigm, and had steered away from
where is once again inspiring South African confronting South Africa’s greatest disease, apart-
efforts in district health development. heid.
All of this changed in the 1980s. A ‘new’ genera-
South African epidemiology in the 50s, 60s tion of epidemiologists emerged. The thrust of
and 70s this new wave of epidemiology spread across the
From the 1950s to the mid-1970s, the science of country. It was essentially a multi-disciplinary
epidemiology was developed as acritical tool movement, but was somewhat under-repre-
both in the description of disease and in monitor- sented by nurses and environmental health offi-
ing the control of disease. Many diseases were cers. Centres of endeavour were found at univer-
notifiable and research scientists used this sities, research organizations and rural hospitals.
routinely collected data to develop hypotheses Notable activity in the training of nurses in field
about causation and distribution of disease. A few epidemiology took place in the small epidemio-
examples from Gauteng illustrate the kind of logy unit in the Department of Health. This unit,
epidemiological research done during this with limited resources, also conscientiously
period. analysed the routinely collected national data,
Gear’s work on the Coxsackie virus, poliomye- reporting findings monthly in the publication
litis and rickettsial diseases such as tickbite fever, Epidemiological Comments.
had a strong epidemiological foundation. Work The notable thing about the ‘new’ epidemio-
7
a contextualizing epidemiology
logy was its content. It began to address quality and epidemiology in South Africa is being
and provision of care. It confronted unequivocally strengthened by this. Indeed, international orga-
the horrendous inequalities in mortality amongst nizations such as the World Health Organization
South Africa’s population groups. It revealed in- are increasingly utilizing the epidemiological
equalities in per capita expenditure on health expertise in South Africa to strengthen their pro-
between the ‘homelands’ and white South Africa. grammes and efforts.
It pointed to the horrific working conditions
experienced by much of South Africa’s labour The practice of epidemiology in the
force. Cholera epidemics were revealed to be
almost exclusively epidemics within ‘homelands’.
rest of Africa
Immunization rates in white cities were high; in The practice of epidemiology in Africa is, on the
black townships, they were abysmally low. whole, weak. A large proportion of epidemio-
Epidemiology’s torch shone into the dark, logical research in Africa is done by organizations
excluded and forgotten corners of South Africa’s based in Europe and the United States, without
disease cave and what was hidden in these always involving indigenous Africans. There is
corners was disgraceful. Epidemiology brought concern about foreign agencies ‘using’ Africa as a
legitimacy and factual support to the rhetoric of laboratory for research. This can result in the
the politically oppressed. The struggles for health development of undue dependency and failure to
and political rights had been irresistibly joined. increase epidemiological expertise on the conti-
The evidence would not go away, although many nent.
tried to suppress the emerging information. However, some organizations, such as the
Ulma group in Sweden (which has been active in
Coming of age Tanzania, Ethiopia and Somalia in particular)
It is clear that much of the basis for a vibrant have a long-term commitment to their projects.
culture of epidemiology now exists in South In these cases, development of local expertise is
Africa. already evident. The French INSERM group,
The Epidemiology Society of Southern Africa London School of Hygiene and Tropical Medi-
(ESSA), founded in 1982, has become a meeting cine, and the Tropical Disease Research Pro-
place for the learned, the learning and anyone gramme of the WHO have been focusing strongly
else remotely interested in public health research on developing epidemiological expertise in parts
and the measurement sciences. of Africa. The Centers for Disease Control (CDC)
Epidemiology is now being taught in courses have initiated the global Field Epidemiology
dedicated to the subject. These courses, first Training Programme on the African continent
developed in the Cape, are forming the founda- and have shown substantial involvement in 11
tion upon which epidemiology is being taught in countries through the Combating Childhood
the schools of public health which emerged in the Communicable Diseases programme.
mid-1990s. There is a commitment to strengthen- There is also a great need for further formal
ing epidemiological training and practice among training courses in epidemiology. The WHO has
nurses. Occupational health epidemiology flour- been instrumental in instituting two Master of
ishes in Cape Town, Johannesburg and Durban. Public Health (MPH) programmes in Nairobi and
The Centre for Epidemiological Research in Dar-es-Salaam that have a strong epidemio-
Southern Africa (CERSA) has played a unique logical bias. In Southern Africa, the Blair Research
role, providing first a training ground, and then Laboratory focuses on tropical disease research in
ample opportunity for methodological and Zimbabwe, and the WHO Ndola group fulfils a
thematic applications: from urbanization to in- similar role in Zambia. Most countries, however,
fectious diseases to chronic diseases of lifestyle. still lack a core group inside the country to
And health policy and health systems research develop epidemiological expertise.
and development have been identified as critical
to the thrust of health reform. Epidemiological
skills are being drawn upon extensively in the
The 1990s and beyond
development of health policy on the national and This overview presents a chequered history: origi-
regional levels. nal contributions in the 1940s and 50s; consider-
International contact and recognition, minimal able achievement accompanied by a turning
in the pre-democracy years, are now abundant away from harsh reality in the 60s and 70s; and
8
1_sintroduction
highly motivated re-engagement during the 80s For epidemiologists, to nationalize their exper-
and 90s, accompanied by the laying of a strong — tise. There are structures helping to achieve
though still incomplete — foundation. closer collaboration. The emerging Essential
A host of challenges lie ahead. Some of those National Health Research framework, the pro-
that stand out are the following. gressive Primary Health Care Network and the
For departments of community health, nursing Community Health Association of Southern
colleges and the new schools of public health, Africa are suitable structures.
to strengthen their expertise and ensure that @ For CERSA, to become a truly national resource,
essential epidemiological skills are soundly better able to respond to geographically
acquired by the new generation of public peripheral needs, and with its expertise more
health practitioners and researchers. accessible to universities.
¢ For clinical departments and medical faculties, @ For us all, to extend our working relationships
through the tools of clinical epidemiology, to with our neighbours in Southern Africa, as well
improve clinical judgement, understand costs as colleagues in the rest of Africa.
and consequences, and appreciate that quality
of care is both science and art, technique and Epidemiologists have risen above the polarizing
judgement. differences of the past. Today we look forward to
@ For policy-makers, managers and providers, to building on the expertise and the capacity for crit-
cultivate the critical outlook and open-minded- ical analysis we already have, and on a new gene-
ness that recognizes policy analysis and pro- ration of energetic people, whose commitment to
gramme evaluation as integral to sound health justice and equity will ensure they pursue the
practice. truth as best they can in the interests of a healthy
For the epidemiological method, to embrace new South Africa.
more analysis and less description. We know
much about the problems; we still have to get to
grips with what works, what does not and why.
Key concepts in
epidemiology
10
2 = keyconcepts in epidemiology
Other factors related to the host and the environ- does not hinge on the agent alone. Tuberculosis
ment may need to be present for the disease to specifically, and disease in general, cannot be
occur. For example, infection with the TB bacillus attributed to one factor only — hence the accep-
is a necessary but not sufficient cause of tuber- tance of the concept of multiple causation. This
culosis disease, that is, even if the agent is multi-factorial nature of disease can be under-
present, the disease will not definitely occur. stood in terms of the interaction, often complex,
Other factors must be present as well. between three groups of factors: those related to
Factors related to the host which commonly the agent, the host and the environment. The
influence occurrence of disease include age, sex, concept applies as much to infectious diseases as
social class, status, lifestyle, and psychological it does to non-infectious ones, although, as men-
character. tioned above, in non-infectious diseases there
Environmental (extrinsic) factors include, for may be no single agent.
example, the presence of disease-transmitting The interaction of these three groups of factors
vectors and the availability of food (biological largely determines an individual’s state of health
environment); social customs related to living at a given point. The health status of a person may
habits, interpersonal behaviour and attitudes be seen as an equilibrium between the agent-
(social environment); the economic organization host-environment with the host and the agent
of the society and the level and distribution of balancing on a fulcrum of the environment — the
wealth (economic environment); and climatic epidemiological triangle (Figure 2.1). Harmful
conditions and the presence of industrial waste change in any of the factors could lead to a
products (physical environment). change in equilibrium, causing disease. Small
In non-communicable diseases, there is no negative changes can, at times, be compensated
‘agent’ and various concepts have been devised to for, if the existing balance is not too precarious.
understand the interplay of a multitude of risk The epidemiological triangle is particularly
factors which eventually lead to disease. useful for understanding the causation of infec-
tious diseases. Since the application of epidemi-
Models depicting multiple causation ology to chronic and non-infectious diseases,
An understanding of the disease and its causation new models have developed which de-emphasize
FULCRUM
11
a contextualizing epidemiology
the agent or organism (for example, there is no Establish whether the risk factor or exposure
known ‘organism’ for heart disease) by regarding occurred before the disease/outcome. That
it as one component of the environment. The idea is, the temporal sequence of the exposure
of a ‘web of causation’ was developed to depict and disease must be compatible with causa-
the extremely complex chain of events/factors tion.
that contribute to the occurrence of disease @ The stronger the association between the expo-
(MacMahon, Pugh 1970). Another model which sure and the disease, the more likely the expo-
puts less emphasis on the agent is that of the sure is to be a cause of the disease.
‘wheel’, where the biological, social and physical ¢ Ifastudy shows that an increased exposure
environment impinge on the hub or, core of the results in an increased occurrence of the
wheel corresponding to the individual’s genetic disease (for example, if the lung cancer rate
constitution. For different diseases, the different among heavy smokers is higher than among
components may play a more or less prominent light smokers), it meets the dose-response cri-
role in the causation of the disease (Mausner, terion.
Kramer 1985). Results from different studies using different
approaches in different settings should give
Evaluating research evidence to establish similar results (consistency). The reason for in-
causation consistent results should be explained.
While the above description of the factors associ- A plausible explanation of the mechanism by
ated with tuberculosis sounds straightforward, which the causative agent causes the disease
many laboratory, epidemiological, and clinical should be put forward to fulfill the criterion of
investigations had to be undertaken in order to biological plausibility.
arrive at this understanding. In general, an under-
standing of what causes a particular disease More recently, Susser has suggested a hierarchical
depends on an accumulated body of knowledge approach to applying the causal criteria. Rather
based on sound research. than a mechanical application of the use of causal
In the case of a living organism being an agent criteria, the hierarchical approach proposes a
or necessary cause of a disease, the following rational approach which begins with the strength
criteria were postulated by Koch: of association. If an association does indeed exist,
the organism must be present in every case of the next step is to determine the temporal relation-
the disease ship between cause and outcome. Only if the cause
the organism must be isolated and grown in occurs conclusively before the outcome are the
pure culture other criteria applied (Susser 1991).
the organism must, when inoculated into a
susceptible animal, cause the specific disease Conclusion
the organism must then be recovered from the It is essential to know about the different causes
animal and identified. of diseases and their interaction in order to
develop appropriate intervention programmes to
Koch’s postulates are of limited use because prevent or control disease. Health programmes
several diseases do not have animal models, and can have a major impact on disease directly at the
behavioural and environmental causes of disease level of the agent (for example, improved water
cannot fulfil his criteria. supplies decrease agents for diarrhoea), the host
More recently, guidelines have been developed (for example, immunization against measles pre-
for deciding whether a factor causes a disease or vents disease even when overcrowding occurs),
not. The emphasis is on strong research design and/or the environment (for example, improved
with evidence from experimental designs having road design decreases traffic accidents). Health
the most influence (see Chapter 7: Study design). policy and services (curative and preventive) thus
However, because such trials are rarely used to play an important role in determining the final
establish causes of disease in humans, strict crite- health status of the population.
ria are used to evaluate non-experimental (obser-
vational) studies investigating causation.
Sackett, Bradford Hill (Bradford Hill 1965; Sack-
Levels of prevention
ett, Haynes, Tugwell 1985) and others published The following anonymous poem, while by no
guidelines which can be summarized as follows. means a major literary work, contains an import-
12
2 keyconcepts in epidemiology
TELA 7J
SZ
LWW
y
SS
sy
PSssSe
YSSS
j
SOSsS
RSS
Seawe:
So
ant message for those interested in public health. While there seems to be agreement that pre-
vention is better than cure, traditionally very little
It was a dangerous cliff as they freely confessed, is done to actively promote prevention. This can
Though the walk near its crest was so pleasant; be understood when examining the historical
But over its terrible edge there had slipped development of medicine, where orientation
A duke and full many a peasant. toward treatment has dominated. Over the years,
So people said something would have to be activities have been restricted to treating the
done, injured and the sick who present themselves to
But their projects did not tally; health services. Only more recently has there
Some said ‘Put a fence around the edge of the been an appreciation that health responsibilities
cliff, need to be more extensive.
Some ‘An ambulance in the valley’. Before being able to take fuller responsibility in
the control and management of a disease, it is
This poem describes one of the major dilemmas important to understand both the multiple
found in modern health care. Much time and causes of the disease and the natural history of
money is spent providing ambulance and emer- the disease.
gency services at the bottom of the proverbial cliff
(Figure 2.2). Thought is seldom given to whether Natural history of disease
it might be better to go to the top of the cliff, build The natural history of disease refers to the sys-
a fence and put up notices warning people of the tematic description of the course of disease over
dangers of going near the edge — thus identifying time, unaffected by treatment. Several stages
the primary preventive needs of the community. form part of the natural history.
HS
a contextualizing epidemiology
During the susceptibility phase, risk factors disease and the lifestyle associated with this
only are present. should be targeted for prevention. In South
During the preclinical phase, the biological Africa, there are communities who are currently
process of the disease has started. There are undergoing urbanization and cultural transition,
some physical manifestations, but no disease is which should be targeted for primordial preven-
present. tion programmes. Government and legislative
¢ In the’clinical disease phase, signs and symp- support should protect the population from the
toms of disease are present. promotion and sale of unhealthy products, while
Recovery, the presence of chronic disease, dis- encouraging activities in all sectors of the
ablement, and death are all possible outcomes community to promote health (Steyn, Buch
of the disease process. 1992).
Once these stages have been documented, a com- Primary prevention: At this level, interventions
prehensive approach can be planned for the are aimed at healthy people — individuals or
management and control ofa disease. groups. Measures are taken to promote optimum
Epidemiology, through the use of its different health or provide specific protection of target
study designs, contributes substantially to the groups against disease and injury. This can be
generation of knowledge concerning the natural done on a non-personal level, where the risk of
history of diseases. This is done by identifying risk the condition is reduced by engineering out the
factors (stage of susceptibility), evaluating screen- risk (for example, banning asbestos products),
ing programmes (presymptomatic stage) and or on the personal level by prevention (for
describing consequences of disease and treat- example, providing of measles vaccination and
ment (stage of clinical disease and disability). On promotion.
the basis of this information, interventions can be
planned and promoted. Secondary prevention: At this level, measures are
aimed at people who have a disease which has not
Intervention at different stages of disease yet produced symptoms. Thus secondary preven-
Interventions must be on as many levels as pos- tion measures include early diagnosis and
sible. Each level of action is seen as a level of prompt treatment to prevent the development of
prevention. Thus there should be an attempt to clinical disease and complications from the
prevent the disease from occurring at all. Once disease. For example, secondary prevention of
the disease has occurred, it should be identified tuberculosis can take place through screening in
as early as possible and prevented from getting the workplace (for early diagnosis) and prompt
worse. If residual effects of the disease remain, treatment by means of medication.
these should be minimized, and _ disability
prevented wherever possible. The effectiveness of Tertiary prevention: At this level, measures
interventions differs according to the level tar- involve the treatment of the disease/condition in
geted. its later stages and rehabilitation to optimize
Essentially, a framework is needed for prevent- function, thus preventing/minimizing impair-
ative endeavours. A common approach to pre- ment and disability. For example, a tuberculosis
vention divides the process into four parts repre- patient may be treated after going to a doctor with
senting all potential levels of intervention, as clinical symptoms. In most cases, the patient can
applicable. be treated with medication and return to normal
function. In some cases, there may be residual
Primordial prevention: This level aims to curb respiratory problems and rehabilitation may be
the development of unhealthy lifestyle patterns needed, including occupational training in a new
among groups/populations which have not yet type of job.
developed these unhealthy patterns. Health Figure 2.3 provides the outline of the frame-
promotion is thus aimed at populations and work for levels of prevention and gives examples
groups, and attempts to deal with anticipated of each type of activity.
health risks which are not yet present. The classi-
cal example of the need for primordial prevention Conclusion
is in the cardiovascular disease area, where com- Epidemiology provides the findings which are’
munities which have low levels of cardiovascular used to plan interventions on all levels of preven-
14
2 = keyconcepts in epidemiology
Recovery
Healthy state py Preclinical > Clinical Chronic disease
: ae
(pre-disease) disease disease Disablement
Death
tion of a disease. In doing so it forms one of the lation at risk and the period of observation) (Table
pillars of public health, which aims to improve 2.1). Rates express the frequency of some charac-
the health of the population as a whole. teristic per 1 000 (or 10 000 or 100 000) persons in
the population per unit of time. Some key rates
are listed in Table 2.2. For some rates the numera-
Measures of the frequency of health
tor is not actually part of the denominator, for
events example, in the case of the infant mortality rate,
Most epidemiological studies focus on the occur- where the numerator is the number of deaths of
rence or frequency of disease or risk factors. It is children under the age of one ina given year, and
useful to know the numbers of people with the denominator is the number of live births in
certain conditions to determine the case load on the same year. (A child of three months who dies
the health services. However, to be able to in January will contribute to the numerator of that
compare the frequency of disease in different year’s IMR despite having been born in the
groups or in a group over time, one has to calcu- previous year and thus not contributing to the
late rates, since they relate the number of cases to denominator.) Strictly speaking such quanti-
the size of the population at risk in the specified ties are ratios, but the terminology ‘rate’ is
group in the specified period. If one determines firmly entrenched (see Table 2.1). A critical part
that in AreaA there are 50 cases of a disease and in of a rate is the period of observation/study
Area B 500, one cannot conclude that the disease expressed, for example, as ‘per year’ or ‘per
is a bigger problem in Area B. What if there are month’.
only 100 people in Area A compared to 10 000 in
Area B? For comparison, one has to take into Incidence and prevalence
account the size of the populations (and popula- Disease frequency can be measured by incidence
tion structure with respect to age, for example) in or prevalence.
the two areas. Incident cases are the number of new cases
reported during a specified period in a defined
Rates population (Figure 2.4). The epidemiologist can
A rate is calculated by dividing the numerator determine the number of new cases by making
(number of cases) by the denominator (the popu- use of routine surveillance systems (such as noti-
15
a contextualizing epidemiology
If there are two sick people in one year in a population of size 100,
The numerator is a sub-part of the denominator. In the case of a ratio, however, the numerator is not
necessarily part of the denominator. For example, if 20 boys and 25 girls are born, the ratio of boys to
girls is 20:25 or 0,80.
Birth rate
Number of live births during the year x 1.000
Total population
Fertility rate
Number of live births to women 15-49 years of age during the year x 1000
Number of women 15-49 years of age in the population
Death rates
Crude death rate
Total number of deaths during the year
x 1 000
Total population
Cause-specific death rate
Number of deaths due to a specific cause during the year
x 100 000
Total population
Maternal mortality rate
Number of deaths of women due to pregnancy or childbirth during year
x 10 000
Number of births (live and stillbirths) during the year
Infant mortality rate (IMR)
Number of deaths of children under 1 year of age during the year
x 1000
Number of live births during the year
Neonatal death rate
Number of deaths of children under 28 days of age during the year
x 1 000
Number of live births during the year
Perinatal mortality rate
Number of fetal deaths (28+ weeks) and deaths of children under 7 days
x 1 000
Number of fetal deaths (28+ weeks) and live births during the year
fications) but has to take into account the quality late the cumulative incidence or the incidence
of the surveillance system, for example, in terms rate. The cumulative incidence is the proportion
of completeness. Studies which follow individuals of the at-risk population who become diseased in
up for a specified time can also be conducted to a specified period. The at-risk population im this
determine the number of incident cases. case only consists of individuals who are at risk of
To calculatea rate for incidence, one can calcu- becoming new cases, that is, it excludes those
16
2 keyconcepts in epidemiology
Incidence Prevalence
OR
who have the disease at the start of the time numerator. If person 2, on the other hand, is fol-
period. For example, to calculate the cumulative lowed for 3 months, and then develops disease,
incidence of measles in the unvaccinated, the de- this person contributes 0,25.years to the denomi-
nominator excludes all old cases and those who nator and one disease case to the numerator.
have been vaccinated. Person 3, who is followed for 6 months before
Cumulative incidence is: being lost to follow-up, being healthy at that time,
contributes 0,5 years to the denominator and
Number of new cases in a specified time period nothing to the numerator.
The number in the at-risk population at the start The incidence rate is:
of the time period
Number of new cases of a disease during a speci-
However, when cumulative incidences are calcu- fied time period
lated‘ for large populations, for example, for a Total person-time of observation
country as a whole, the denominator consists of
the total population, since excluding those who The incidence rate is thus expressed as the
are already diseased will change the resulting cu- number of cases per person-time.
mulative incidence very little. Prevalent cases, in contrast with incident
The cumulative incidence assumes that the cases, are the number of people who have a
entire population at risk at the beginning of the particular disease at a specific time (Figure 2.4).
time period is followed up for the specified period Prevalence thus focuses on whether a disease is
(for example, one year). However, some individ- present at a specific time and not on when it
uals may be lost to follow-up and thus different occurred. It is also known as point prevalence,
members of the at-risk population may have dif- since it refers to the number of cases at a point
fering lengths of follow-up. in time. Prevalent cases include cases which
The incidence rate (sometimes referred to as developed some time ago and new cases (inci-
incidence density) takes into account the length dent cases), but excludes the cases which have
of time each individual in the at-risk population died or recovered (Figure 2.5).
remained under observation. The numerator is To calculate the prevalence, the numerator is
the number of new cases and the denominator is the number of prevalent cases (that is, all cases at
the sum of the individual time periods of observa- a given time) and the denominator all at-risk
tion till disease onset or loss to follow-up, in terms people in the group investigated (that is, all indi-
of person-time (person-days, person-months or viduals at risk of being a case, thus also including
person-years). For example, if person 1 is follow- old cases) (Table 2.3). For example, to calculate
ed for the whole period of one year, without uterine cancer prevalence, the denominator
developing disease, this person contributes one would be all women, excluding those who have
year to the denominator, and nothing to the had hysterectomies.
ia
a contextualizing epidemiology
Prevalence is:
Figure 2.4 Counting incident and
prevalent cases Number of existing cases
The number in the at-risk population (at a spec-
ific point in time)
Comparison of rates
Epidemiologists often wish to compare disease or
death rates between groups, for example, be-
tween countries, sexes or age groups, or in the
same group over time. To be able to meaningfully
Prevalence: compare such rates, one needs to be sure that any
existing cases observed differences in rates are real and not due
to differences between the groups in the distribu-
tion of other factors. For example, what conclu-
sions can one reach if the overall mortality rate in
Group A is higher than that in Group B, but it is
known that Group A has a larger proportion of
Source: Kibel M A, Wagstaff LA, 1995. elderly people than Group B? Because death rates
increase rapidly with age, older populations have
18
2 = keyconcepts in epidemiology
higher crude mortality rates. If the mortality rate which lead to changes in the number of people
of a population has increased over the last and composition of a population are fertility,
century, is it due to a real change in mortality mortality and migration.
experience, or to changes in the age structure of Population-based rates are essential in epi-
the population? If two populations are similar demiology. While great care is often taken in
with respect to factors related to the disease, one establishing the numerator, too often in the South
could feel confident about comparing crude African context, less attention is given to obtain-
rates. If they differ, however, a comparison of ing an accurate estimate of the denominator —
crude rates may give misleading results. the population size. The primary source of demo-
Standardization (also called adjustment) is a graphic data is the census, which endeavours to
way of removing the effect of differences in the obtain basic data for the whole population. Cen-
composition of various populations and so over- suses are usually conducted once every ten years,
coming the problem of making erroneous com- but since 1980 there has been a South African
parisons based on crude rates. If, for example, age census approximately once every five years,
is related to the outcome of interest (for example, because of large-scale migratory population
heart disease), and the two populations differ movements. In using census data, it is important
with respect to age distribution, one needs to to realize that there is often an undercount, and it
adjust the rates. The standardized rates provide a must be established whether the published
new summary rate which takes into account the census figures have been adjusted or not. Census
differences between the groups. (Details are given data is often supplemented by routine surveys.
in Appendix 1.) It is, however, important to always
examine the subgroup-specific rates (for exam- Population composition
ple, age-specific rates) prior to standardization. If The composition of a population can be
the rates differ markedly between subgroups, described in terms of factors such as age, sex,
standardization will obscure these differences. race/ethnicity, occupation, income, education
Comparing the rates of two groups in a study and literacy. The population composition deter-
requires a different approach to the one discussed mines to a large extent the types of health prob-
here and is dealt with in Chapter 11: An introduc- lems experienced, and which health services have
tion to data presentation, analysis, and interpreta- to be delivered. An important measure of popula-
tion. tion composition is the dependency ratio, which
describes the relationship of the dependent part
of the population to the potentially productive
Demography part of the population. In Table 2.4, the depen-
From the previous section on measuring health dency ratios for South Africa as a whole, as well as
events, it is clear that knowledge of the size and for specified subgroups, are given. These figures
characteristics of a population or community is were calculated by Mazur (1995) from the house-
necessary to be able to interpret health informa- hold survey conducted during the second half of
tion meaningfully. The composition of the popu- 1994 by the University of Cape Town’s Southern
lation also has implications for its pattern of African Labour and Development Research Unit,
health. To plan and deliver health services effect- for the Project for Statistics on Living Standards
ively it is necessary to know what the community and Development.
looks like. For these reasons, there is a close rela- A population pyramid is a useful summary
tionship between public health, epidemiology representation of certain aspects of population
and demography. composition. It is a graphical display of the per-
Demography is the scientific description of the centage composition of a population in terms of
characteristics of populations, and embraces all age and sex. Age is usually grouped in five-year
aspects of population structure and changes intervals. The percentages are represented by
which can be measured numerically. This proportionately drawn horizontal bars. By con-
involves primarily the measurement of the size of vention, the percentages relating to the males are
a population (how many people there are), com- drawn on the left of the pyramid and the females
position (what the characteristics, such as sex and on the right. The age groups are drawn from the
age, of the people are), distribution (where the youngest at the bottom of the pyramid to the
people in the region are), and changes in num- oldest at the top.
bers of people. The main demographic processes The influence of births, deaths and migration
19
a contextualizing epidemiology
on the population composition can be seen from mothers in each age group (for example, 15-19)
the shape of the pyramid. A developing country by the number of women in that age group. In
with high birth rates and low life expectancy will Table 2.4, the percentage of teenagers (15-19)
have a high proportion of young people and thus ever pregnant is given for South Africa as a whole,
a population pyramid with a triangular shape and as well as for various subgroups (Mazur 1995).
a broad base. On the other hand, a developed This figure is an approximation of the 15-19 year
country with higher life expectancy will have a age-specific fertility rate.
higher percentage of people in the older age
groups and thus a population pyramid with a Total fertility rate:
narrower base and steeper sides. Events such as The best single summary index of fertility (if the
famines, wars, or mass migration in the recent age-specific data are available) is the total fertility
past will be reflected by irregularities in the shape rate. It is calculated by adding the age-specific
of the pyramid. Population pyramids of the rural fertility rates together over the reproductive ages.
and urban populations of South Africa are shown If five-year age intervals are used, each rate is first
in Figure 2.6 (Mazur 1995). The rural population multiplied by 5. This represents the number of
clearly has a broader base than the urban popula- children (disregarding mortality) who would be
tion. born to a group of women over the reproductive
period, as they experience current age-specific
Demographic processes fertility rates. It incorporates the age structure of
Three processes determine the changing size of a the reproductive female population, which the
population: fertility (births), mortality (deaths) general fertility rate does not. In Table 2.4, the
and migration. The balance between these three number of children ever born alive to women
determines whether a population increases, de- aged 45-49 is given (Mazur 1995). This is an
creases or remains stable in size. approximation of the total fertility rate.
The growth rate of a population is given by the
following equation: Mortality
Mortality is of great importance in demography
(births-deaths) + (immigration-emigration) as well as epidemiology. In epidemiology, it is
total population often used as a substitute (although a biased one)
for morbidity (disease), since mortality informa-
Fertility tion is collected routinely and is more easily
The determinants of fertility are complex. We will obtainable than morbidity information.
restrict ourselves to some basic measures of fertil- The crude death rate expresses the number of
ity. deaths per 1000 population. As pointed out
earlier, other more specific death rates, for
Crude birth rate: example, by cause or age, may be more useful.
The number of births per 1 000 persons in the Important death rates are listed in Table 2.2. The
population. This measure requires only total infant mortality rate (IMR) is considered to be of
births and total population. It is a crude measure, great importance for public health. It is not only
since it does not take into account the age or sex an indicator of health among the young, but the
structure of the population. WHO has used the IMR as an indicator of the
quality of the social, economic and physical
General fertility rate: environment of the whole community, as well as
The number of births per 1 000 women aged 15- the effectiveness of health services. In South
44 years. In this case, the denominator consists Africa, the IMR varies widely between population
only of the females of child-bearing age in the groups, but decreases in the IMR have occurred
population. Sometimes the age group used is because of improvements in socio-economic
15-49 years. conditions, nutrition and access to health care.
In recent years, the under-5 year mortality rates
Age-specific fertility rate: have received increasing attention. These values
If the numbers of the female population and for South Africa are indicated in Table 2.4 (Mazur
births are known by age of the mother, an age- 1995). These values were obtained using indirect
specific fertility rate can be calculated. It is methods (see Chapter 20: Mortality studies).
obtained by dividing the total number of births to A useful index of overall longevity is life
20
2 keyconcepts in epidemiology
e
&}/o}+
Male Do
oO}
Female
oa ao oO
a woa4
wo
kf
Nm
| nm|<
a)o
ASI o|s/o|s
|wola|ala
oO
Percent
Urban population
Male
-8 a, -4 -2
Percent
expectancy. It is easily understood, as it is have a longer expectation of life than men. This
expressed in years. Life expectancy at birth is, on index is often represented separately for males
average, how many years a person just born can and females.
be expected to live for. It is a theoretical concept An increase in life expectancy can be due to
which imagines a cohort of people just born expe- people living longer, or less people dying in the
riencing the current age-specific mortality rates young age groups. In most countries, increases in
throughout their lives. Because of the differences expectation of life has been brought about mainly
in mortality rates of males and females, women by the reduction of infant mortality.
21
a contextualizing epidemiology
Race
African 71,8 25 5,01 86 43 125 56.5
Coloured 62,6 17 4A7
Indian 48,8 2 3,37
White 44,7 1 2,56
Residence
Rural 25 5.37 94 50 139 54,5
Urban 20 3,80 69 30 96 60,9
Metro 49,5 15 3,37 57 22 78 64,2
Education
0-6 24 4,95 95 51 140 54.2
7s 22 3,51 62 25 85 62,9
9+ 15 2,62 58 23 79 64,0
Expenditure
(monthly, per person)-
<R150 O27 27 5,91 — 89 45 130 55,7
R150-R499 63,1 © 22 4,36 79 Ee 113 58,3
R450+ 3/,9° 4 2,94 63 26 87 62,6
Life expectancies are usually calculated during tion for which services need to be provided is a
census years, when reliable information on popu- major public health priority. Urbanization is also
lation size is available, and reliable mortality rates important to health in that it is often associated
can be calculated. with lifestyle or environmental changes which
impact on health (Yach, Mathews, Buch 1990).
Migration and urbanization Defining a migrant is difficult, as many people do
Migration refers to human movement in a geo- not necessarily remain in the urban area they
graphical sense. Socio-economic and political move to, but shift back and forth between rural
factors lead to the migration of people. Migration and urban regions cyclically.
consists of immigration (people moving into a
specific region) and emigration (people moving Demographic transition
out ofa specific region). Populations are not static. Over time, societies
In recent years, there has been rapid urbaniza- tend to experience decreases in the birth and
tion in South Africa caused by migration to cities. mortality rates. The drop in mortality tends to
The counting of the ever-changing urban popula- occur first, followed by a drop in fertility some
ae
2 = keyconcepts in epidemiology
time later. There is thus a rise in the population in the accessibility of health services.
the intermediate period. At the beginning of such Although the value of a single indicator is lim-
a period of change, the majority of deaths are due ited, the use of a standard set of indicators
to infectious diseases, whereas at the end, the enables public health professionals to compare
majority are due to the chronic diseases of later key health outcomes between areas and over
life. This change in disease pattern is sometimes time. Such routine monitoring of the population’s
called the ‘epidemiological transition’ (Omran health status requires standard, easily collectable
1971). Chronic diseases, often referred to as ‘con- and interpretable measures.
ditions of affluence’, now contribute substantially Health indicators are different from health
to the burden of disease in developing countries, indices, which are composite measures combin-
especially among the poor. South Africa and other ing several individual indicators. An example of a
developing countries may well experience a ‘pro- newly developed health index to determine the
tracted transition’ where the mixture of chronic national burden of disease is the Disability
disease and infection could co-exist for some Adjusted Life Years (DALY). Demographic data is
time, as happened in Mexico (Soberon, Frenk, used in conjunction with information collected
Sepulveda 1986). from community studies of disablement to deter-
mine this index. It is calculated by adding the
Conclusion years lived with a disability (weighted according
Without accurate demographic information and to severity of the disability) to the years lost due to
a knowledge of how to interpret demographic premature mortality. The DALY is thus a compre-
data, an epidemiologist will not be able to evalu- hensive measure of ill health, reflecting both non-
ate the extent or patterns of health problems, or fatal outcomes and mortality, that can be used to
foresee any changes which are likely to take place quantify the extent of disease.
in these. Health indices may initially appear to provide a
more adequate assessment of the health of popu-
lations. However, not all indices are particularly
Health indicators valuable. They can never truly summarize health
in its totality, and can at best address some of its
Measuring the health of populations components. If a single composite measure is
Health is a difficult concept to define because of used, the specific influence of the contributory
its complex and multifaceted nature. For the pur- measures is often lost, and composite measures
poses of determining the health of populations, may be more open to abuse than simple
an important aspect of any definition is its measures.
measurability. Given the holistic nature of health,
no single measure would do justice to the con- Uses of health indicators
cept, although measurement of single com- Health indicators have several uses. They help to
ponents can provide useful information. The identify and quantify health problems. Since
researcher interested in single or multiple com- health status depends on economic factors,
ponents of health can choose from a variety of health indicators may provide insight beyond the
health statistics. health status of the populations. For example,
There are health measurements selected from a
larger pool of health statistics which summarize
Table 2.5 Common sources of
or represent several statistics, or serve as useful
proxy (substitute) measures for the relevant in- data for health indicators
formation. These measurements, called health
indicators, reflect health situations, and are vari- @ Vital events registers, e.g. death tapes
ables which ‘help to measure changes’, especially @ Census
when these changes cannot be measured directly. @ Routine health services records
They usually reflect only partial components of @ Epidemiological surveillance
health, for example, the infant mortality rate @ Sample surveys
focuses only on the health of children under the Disease registers
age of one year. However, the WHO does view this ¢ Alternative sources of data, e.g. church
indicator as also being an indicator of the health records
status of the community as a whole, as well as of
23
a_contextualizing epidemiology
24
General —
philosophical
issues in
epidemiology
these is an independent ethical review. positive responses before the drugs are tested on
To understand the need for an independent large groups of patients — be short-circuited so
ethical review related to research, it is important that AIDS sufferers do not have to wait for poten-
to understand some of the major motives behind tially beneficial drugs until the lengthy clinical
research. To illustrate how these motives may trial process has been completed.
operate in preparing or conducting research,
three examples are given of cases where well- These three examples provide a good insight into
intentioned research was done in an ethically un- the context of research ethics. In the first two
acceptable way. examples, the main function of ethical review
would have been to protect research subjects
EXAMPLE A: from being exposed to risks related to research. In
In a now classic article on bio-medical ethics, example A, risky procedures were being used
Beecher reviewed 22 studies in the New England without subjects’ permission. In example B,
Journal of Medicine in 1966 which in his view en- denial of existing treatment in the first instance,
dangered the health or the life of the study sub- and denial of more effective treatment when it
jects without informing them of the risks or became available during the course of the study,
obtaining their permission. He concluded that posed risks to the study subjects. In example C,
‘unethical or questionably ethical procedures’ the usual ethical requirements that are put in
were not uncommon among researchers. He place to protect people from research risks have
further concluded that the motivations for such come under pressure because of perceived bene-
procedures could be found in pressures on young fits of this research. Demands for the reform of
researchers for personal advancement (Beecher the ethical review process during the 1980s
1966). His article stimulated increased regulation included the suggestion that not only should
of research, including the establishment of com- research subjects be protected from research
pulsory review procedures by institutional ethics risks, but the potential benefits of research should
committees. also be considered in deciding on the ethics of a
proposal. Several trials were halted midway
EXAMPLE B: because the benefits of a new treatment had been
In a similarly classic case, researchers from the shown beyond doubt. This was done so that the
United States Public Health Service conducted an benefit could be applied widely. Protection from
experiment involving over 400 black male resi- research risks is therefore only one reason for
dents of Tuskegee County in the southern state of ethical review. The second major reason is to
Alabama. Between 1932 and 1972, when the study ensure the fair distribution of potential benefits.
was terminated, these men were observed for the Although we focus below on more detailed
effects of untreated syphilis. This study was ini- issues in ethical aspects of research, it is these two
tiated at a time when syphilis treatment was rela- facets that are the real focus of ethical review: the
tively ineffective. Nevertheless, while the men protection from risks related to research and the
were told that they were being ‘tested’, many equitable distribution of potential benefits of
thought this was for ‘rheumatism’ or ‘bad research.
stomach’. They were not told that syphilis treat-
ment was being withheld. Even after penicillin Ethics and ethical reasoning
became available during the 1940s, and was Acomprehensive description of ethics and ethical
found to be effective against syphilis, the men reasoning is given by Beauchamp in his book
were not given this information. They remained Principles of Biomedical Ethics (1989). Here,
under observation without receiving penicillin ‘to we will provide only a brief insight into the area
allow the important scientific process to conti- of ethical reasoning to enable prospective
nue’ (Jones 1993). researchers to understand the processes that
should be followed by review committees to
EXAMPLE C: which research protocols are submitted.
AIDS activists in the United States are demanding Most people have a good grasp of the moral
that the normal route of Phase I, II and III clinical values in society. There is no difficulty, for exam-
trials — which involve animal experimentation ple, in making judgements about corruption,
followed by testing of drugs on small groups of deception, killing, telling the truth, and ensuring
healthy human volunteers to test for toxicity and that research results are made available to the
26
3 general philosophical issues in epidemiology
public. But there are two instances when the use and constructively, they can contribute towards
of ‘common sense morality’ is not sufficient. the development of more stable moral values that
Firstly, new challenges are being provided by are accepted by large sections of society.
each new research protocol, by new therapies, by Ethical reasoning takes place whenever there
new interventions, or by the use of new techno- is a need to provide a moral justification for
logies. Although existing morality can help guide a particular action, for example a particular
us towards appropriate action in known situa- research project or procedure. Such actions may
tions, the new situation may not fit existing moral be justified on the basis of ethical rules, which, in
rules or behaviour. In the recent context, for turn, are based on ethical principles derived from
example, there has been an intense debate in an ethical theory. Telling the truth is a generally
South Africa on whether or not abortion should accepted moral value, and lying, therefore, is
be legalized. We have the technology and medical morally wrong. A researcher who decides to
skills to ensure that abortion can be done in a way deceive research subjects must therefore justify
that guarantees the health of the pregnant his or her decision to engage in an immoral
woman. Given this background, there is a moral action. The researcher may argue that obtaining
dilemma. The dilemma is that there are solid the truth is an accepted moral rule, and that
moral reasons why abortion should be legalized deception must take place, otherwise he or she
(such as a doctor’s duty to optimize the health of cannot obtain the truth. (For example, if we want
the pregnant woman, or society’s duty to uphold to research the quality of care given by practition-
the right of a pregnant women to self-determina- ers to patients or clients, telling the practitioners
tion) while, at the same time, there are equally what is being studied may change the way they
solid reasons why abortion should be prohibited deliver care. We will therefore not be able to find
(such as society’s duty to uphold the right to life of out the truth.) This rule of obtaining the truth can
unborn babies, or society’s duty to protect those be based on one or more ethical principles such
who cannot protect themselves). We have to go as the principle of beneficence (interventions or
beyond ‘common sense’ moral reasoning, and actions, including research, must, on balance, do
to develop a comprehensive understanding of more good than harm). This principle, in turn, is
morality, ethics and ethical reasoning, in order to derived from a more complex body of principles
solve these difficult moral problems. and rules that make up ethical theory (Figure 3.1).
A second series of challenges to ethical reason- The two most prominent types of ethical theory
ing is‘ provided by the differences in morality are the utilitarian and the deontological theories.
between cultures, religions, and ideologies. These In utilitarian theory, the most important under-
challenges are compounded by the absence of a lying value for judging whether an action is ethic-
common basis for making moral judgements. ally right or wrong is the amount of ‘good’ it
Instead, we are slowly developing our own set of produces. Thus this approach prioritizes the
moral values and virtues in a process that is slow ‘greatest benefit to the greatest number of people’
and painful. For example, the decision by the as the main criterion for judging an action.
Constitutional Court to abolish capital punish- Deontological theories hold that actions are to be
ment is a first attempt to emphasize one particu- judged on the basis of certain values, irrespective
lar moral value, namely the right of persons to life. of the consequences for the ‘general good’ they
Yet this decision has elicited extremely emotional produce. All religious moral reasoning falls in this
reactions, especially from those who feel that group, as it generally holds that certain actions
another moral value, namely the right to a safe are right (or wrong), not because of the results
and crime-free environment, should have a they produce, but because the religion’s manifest
higher priority than the right to life. The role of (such as the Bible, Torah, or Koran) states that
the Constitutional Court in this case is similar to such actions are required and, therefore, good.
that of an ethical review committee in the context
of research: to attempt to develop a shared basis Ethical principles in medical and health-
of ethical arguments that can be applied to the related research
individual research proposal and to similar pro- In spite of these differences, there is often agree-
posals in the future. Given the multi-cultural ment between different theories on the moral
heritage of our country, it is likely that there will value of particular actions. These agreements
often be disagreement between reviewers. How- have been reached through ethical review, court
ever, if these differences are discussed publicly judgements, popular debates, and in many other
27
a contextualizing epidemiology
ways. As a consequence, ethical review of medical providing information that allows individuals to
and health-related research has distilled three make autonomous decisions.
main ethical principles against which almost any Secondly, the.principle of respect for persons
proposal can be assessed from an ethical perspec- calls for special measures to be taken to maximize
tive. These principles have provided a very valu- the autonomy of and protection for incompetent
able tool to assist in the ethical review of medical persons. These include people with an intellec-
practice and research. Theyare outlined in Figure tual disability (permanent incompetency) or
3.2 and discussed below. persons under the influence of alcohol or pre-
medication for surgery, or in a state of emotional
The principle of respect for persons shock (temporal incompetency), and persons
Respect for persons has two major components. with reduced autonomy (such as children, pris-
The first one, often called the principle of oners or refugees).
autonomy, relates to the notion that a competent The principle of autonomy is the main principle
individual has the right to self-determination. underlying the process of obtaining informed
Other people, including health and medical prac- consent for medical procedures or for research. To
titioners and researchers, must respect this right allow an autonomous person to make an auton-
to self-determination and have an obligation to omous choice about whether or not to participate
enable the autonomy of persons by, for example, in research, five conditions have to be met.
fed fson how to act with moral justification Refrain sa disclosing to health workers why the
research is being done
Ethical principles
28
3 general philosophical issues in epidemiology
1 The person must be competent to make deci- ensure not only disclosure, but also understand-
sions ing of what is being disclosed.
As outlined above, the competence to make deci- 4 Any decision, whether consent or refusal, needs
sion may be impaired both permanently or tem- to be voluntary
porarily. In general, any adult is regarded as com-
petent unless there is evidence to the contrary. In In other words, there should be no undue influ- -
addition, although children are often declared ence on the prospective participant to consent (or
‘legally competent’ at a certain age (18 in South refuse) to participate. This includes the duty of
Africa), it is obvious that between childhood and researchers not to coerce or manipulate subjects.
adulthood, the child becomes progressively more For example, prospective study subjects taken
competent. This means that there are certain from a health care setting must be told specific-
decisions about research participation or medical ally that refusal to participate in research will not
interventions that a child can make before being lead to a reduction in the health care they will
legally competent. Researchers are therefore receive. If incentives to participate are being con-
encouraged to elicit consent (or refusal) from templated, it would constitute an ‘undue incen-
children who are old enough to be able to make tive’ to offer an unemployed or poor person a
decisions about participation, in addition to significant cash benefit for participation. Simi-
obtaining consent from their parents. larly, research on persons with restricted auton-
omy, for example, prisoners or students (if the
2 There has to be disclosure of information researcher is also their professor), needs to make
special provisions to ensure that their decisions
An autonomous person can only make an auton- are not influenced by coercion. Lastly, manipu-
omous decision if all the relevant facts needed to lating people to participate by over-emphasizing
make a decision are provided. The researcher is, the potential social benefits of the research would
therefore, under an obligation to inform prospec- also infringe on the voluntariness of any decision
tive study participants about: . taken.
what the study is for
@ what will be done to or with the participant 5 Aspecific authorization needs to be given by
¢ what potential risks or discomforts the partici- the participant
pant faces.
After disclosure has taken place, the researcher
The disclosure requirement also explicitly places has an obligation to ask for explicit consent or
researchers under the obligation to inform all refusal to participate, whether written or oral.
study participants of innovations in medicine
that could make participants change their mind The principle of beneficence
about participation. This requirement was In general, this principle requires that the actions
ignored in the Tuskegee study. of health and medical practitioners or researchers
be directed at improving the well-being of
3 There has to be understanding of the patients or study participants, and they should
information not cause harm to the patient or participant.
While this may sound obvious, there are several
If the information being provided in the process important issues in research that violate or may
of full disclosure is phrased in a language or in a potentially violate this principle.
manner that does not allow the person to under- This principle is often divided into two com-
stand the information, then there can be no in- ponents: firstly, a principle of non-malevolence
formed consent. This is especially important for _ (the obligation not to do harm), and secondly, a
researchers in a cross-cultural situation, as is principle of beneficence (the obligation to pre-
often the case in South Africa. Researchers often vent harm, to remove harm, and to do good). For
come from a different social class and cultural purposes of this text, both are considered under
background to study participants. They also come one ethical principle of beneficence.
from a specific discipline with its own profes- Firstly, even a non-invasive questionnaire or
sional jargon. Special care needs to be taken to interview may have detrimental consequences
29
a contextualizing epidemiology
for study subjects. Participants offer time which in less dramatic areas (for example, effectiveness
may result in other priorities not being met (for of a breast-feeding education campaign).
example, interviewing rural women may reduce Lastly, research risks in case of research on
the time they have for water or wood collection or minors (‘legally incompetent persons’) should
child care). Confidentiality of answers may be always be minimal. This is based on the principle
broken, and the participant may suffer as a conse- that only competent adults can make an in-
quence (for example, an opinion about legaliz- formed decision to subject themselves to severe
ation of prostitution may be counted against a risk, and that no one has the right to make such a
church member). Results of a study may detri- decision on behalf of others.
mentally affect the group to which the particip-
ants belong (for example, the initial findings that The principle of justice
HIV was predominantly transmitted through The notion of justice means that people receive
homosexual contact reinforced public stigmati- what is due to them. It includes, therefore, both
zation of homosexual behaviour. Later recogni- the right of the person to demand certain goods,
tion of heterosexual transmission has to some and the duty of others, such as researchers, to
extent countered this). provide these. This principle is extremely com-
Secondly, research may be done for the benefit plex in that it has an application in almost every
of the researcher, not for the benefit of the partici- field of life. In the context of research ethics, this
pants or for society in general. Typically, many principle is probably the least well developed of
students are required to conduct a research the three principles discussed.
project for degree purposes, while in other set- In research, the concept of justice is usually
tings, research is required for promotion. In such interpreted to mean afair distribution of benefits
cases, it is not always clear how the study partici- of the research (‘distributive justice’). For exam-
pants will benefit. ple, although most AIDS/HIV vaccine research
Thirdly, there is often a tension between the being prepared will be conducted in the develop-
researcher’s opinion of what is good for partici- ing world, developing countries (and the popula-
pants, and the participants’ opinion of what is tions originally researched) may not be able to
good for themselves. This is typically the case in afford these very vaccines and will thus not get
situations where foreign researchers come to the benefit of the research.
Africa to conduct studies, on HIV/AIDS for exam- The principle of justice also leads to the inter-
ple. Whereas African priorities, at least in the short pretation that all who stand to benefit from the
term, are usually to develop appropriate preven- research should contribute to its risks and dis-
tive education programmes or to investigate comforts. In other words, research that may
optimal home-care mechanisms, foreign research benefit both the rich and the poor should be con-
often focuses on the biological properties of HIV ducted among both groups, not only the poor.
infection. In such a case, extensive consultation Similarly, although medical students are an easy
with the participants or their representatives is target group for medical researchers, they should
essential to develop a study that will satisfy the not always be the study subjects for research that
needs of both participants and researchers. may have applications for a wider audience. The
Research always demands some inputs, costs, same applies to communities that tend to be-
or discomfort from study participants. For come ‘over-surveyed’ because they happen to be
example, in the case of more invasive research, co-operative, or because they have a well-orga-
such as taking blood specimens, discomfort and nized clinic.
possibly more serious side effects are expected. There are now demands that research be dis-
The ‘do no harm’ interpretation of the benefi- tributed more fairly to make the potential benefits
cence principle must therefore be restated as the more widely available. For example, in the pro-
notion that, on balance, the research must do cess of setting up research, both facilities and
more good than harm. This leads to another human resource development need to be pro-
important notion, namely that the research risks vided. This provides a direct benefit to the group
must be commensurate with the expected bene- being researched. Similarly, in many areas of the
fits. In other words, in the case of research into a world, the only opportunity poor people may
life-threatening condition (for example, cancer have to obtain good quality medical care is if they
treatment) the research risks may be more severe participate in a research project. In South Africa,
than in case of research that pursues knowledge for example, anti-retro-viral medication against
30
3 general philosophical issues in epidemiology
HIV infection is not provided through the public obtained from the prospective study participant.
health services because of its costs. Conse- In the case of minors, the legal parents or carers
quently, only the rich and those in clinical trials are allowed to consent on their behalf. For
testing these drugs will have access to them. research that carries no risk or only minimal risk,
this is usually adequate. Where research is con-
Some specific ethical issues and controversies templated in which a minor may be exposed to
in epidemiological research more than minimal risk, a debate arises as to
The substance and components of ‘informed whether or not proxy consent is acceptable.
consent’ have been discussed under the principle Those who find it unacceptable believe that
of respect for persons. There are, however, some research with more than minimal risks on child-
areas that need to be dealt with in more detail. ren can never be ethical. Because of this extreme
conclusion, many believe that the appropriate
Communal versus individual consent way to approach this dilemma is to ask whether
Informed consent requirements usually refer to the child would consent if he or she were a
obtaining consent from the individuals who are ‘reasonable adult’.
prospective research participants. However, in Similar issues arise in the case of research to be
South Africa and elsewhere, when conducting epi- conducted on temporarily or permanently in-
demiological research, researchers often ask for competent persons, such as people with an intel-
consent from traditional or other local leaders to lectual disability, aged people with neuro-cere-
research ‘their’ communities. This is a difficult bral diseases, or unconscious persons.
issue. Insofar as some cultures are more commu- A special situation arises when an otherwise
nally than individually oriented, it seems appro- competent adult refers the researcher to a third
priate. Yet it clearly reduces the principle of auto- person. This may happen, for example, in the
nomy. If a communal leader gives permission, the case of a married woman who states that ‘her
researcher continues and obtains consent from husband or her children must decide’.
individuals. However, if the leader refuses for Typically, the ‘Western’ approach to this situation
whatever positive or negative reasons, the indi- is to demand first-person informed consent in
viduals concerned can no longer be asked whether spite of the expressed wish. A more culturally
or not they wish to participate. In this case, poten- sensitive way might be to consider that the
tial benefits of research may not accrue to them prospective study participant has exercised the
because of someone else’s decision. right to self-determination by making the
This presents a dilemma, as obtaining consent decision that a ‘significant other person’ should
from leaders appears to violate the right to self- decide.
determination of individuals in the community.
On the other hand, studying a community or Partial disclosure
group without the consent of those in authority The requirement for full disclosure as part of the
may either put participants at risk, or make it im- informed consent process argues against delib-
possible to undertake the research. The imple- erate omission of relevant information and de-
mentation of later health interventions may also ception. Yet there are situations in which full dis-
be compromised. closure will influence the subject of study. For
Where research or interventions deal with example, in a study of the attitudes of health
communities rather than with individuals, it is workers towards HIV-positive patients, the
advisable to seek consent from the ‘community’ results of a questionnaire survey would probably
rather than the individual. In such cases, it makes differ widely according to how the aim of the
sense to ask for consent from legitimate commu- study was explained. Rather than fully explaining
nity leaders. (Examples of such situations could the aim of the study before asking the questions,
include studies on the fluoridation of drinking the participants could be asked to take part in a
water, mass education campaigns on HIV/ general questionnaire on hospital or health
AIDS prevention, or on the influence of infra- matters, and subsequently be asked attitude
structural projects.) questions interspersed with questions on
different matters. In such a case, omission or
Proxy consent as opposed to ‘first person’ apparent deception may be the only way of
informed consent obtaining a true insight, and may therefore be
In general, informed consent refers to consent considered ethical.
3
a contextualizing epidemiology
Written versus oral consent research requires a long relationship between the
The emphasis on written consent, sometimes wit- participants and the researchers, it is more
nessed written consent, stems from the need of appropriate to consider informed consent as an
health workers to protect themselves from the ongoing process. In longitudinal studies and
possibility of future court proceedings instituted long-term trials, for example, it would be wise to
by participants who may claim not to have been continue the process of information provision
adequately informed. A second motivation is that and obtaining consent. The case of Tuskegee is a
a clearly presented consent form will assist the clear example of the need to consider informed
researcher in ensuring that all relevant topics consent as a process rather than as an occasion.
have been covered. A third reason for emphasiz-
ing written consent is that the process of signing Confidentiality and the use of databases or
the consent form will help the participant to medical records
realize that he or she is now aresearch partici- Study participants can usually expect the results
pant. In other words, placing their signature on obtained from their participation to remain
the form brings home the reality of the decision confidential. Efforts to ensure this are essential,
made by the participants. such as not obtaining personal information
The first reason, protection of health workers, is unless it is absolutely necessary, and deleting per-
only a relative one, in that no court will accept a sonal information as soon as there is no longer a
consent form as proof of adequate disclosure. The need for it. However, in studies where large
second motivation to use written consent is not a numbers of participant characteristics are
real motivation, in that one can have a clearly pre- collected, it may be possible to identify persons
pared guideline on what to say to participants even after removal of personal information. For
without the need for a signature. The third moti- example, in a study of a village in a rural area,
vation may be the most important reason for there will be very few male teachers of 35 years old
obtaining written informed consent. Yet it is this who are member of the church choir. In such
very motivation which often causes people such cases, additional measures must be taken to
as illegal immigrants, refugees, sex workers and ensure that individuals cannot be identified.
those with any fear of authority to refuse to Patients entering hospitals or clinics may rea-
participate. This is not because they do not wish sonably expect their records to remain confiden-
to participate in the research project, but because tial. Medical researchers should ensure this. Simi-
they fear they will be harmed by unintended con- larly, use of other databases must guarantee con-
sequences of having their names listed. These fidentiality of data. In South Africa, the study of
fears may be justified, if research records should medical records and other databases is allowed,
be made public by negligence, break of confiden- provided adequate steps are taken to keep data
tiality, or court orders. In cultures where signing confidential. In some European countries,
is considered superfluous once someone has however, use of records and databases is only
given his or her word, insisting on a signature allowed after obtaining written informed consent
could offend participants. from the persons involved.
In general, ethics review committees in South
Africa demand written informed consent even in Use of specimens for other purposes
the above cases, and even if the population There are many laboratories that store speci-
studied is illiterate. This seems irrational, and mens, especially blood specimens, that have
may be based on a routine application of North served their purpose. Originally, consent would
American procedures rather than on well-consid- have been obtained from the research partici-
ered ethical reasoning. Should a researcher there- pants to take the specimen for a specific purpose.
fore not wish to obtain written consent, it is wise Once that has been done, can other research be
to provide a full motivation, including ethical undertaken on these specimens without obtain-
arguments where possible, to the reviewers. ing further consent? It appears that current think-
ing considers this practice to be acceptable, pro-
Informed consent: an occasion versus a process vided the additional research has been submitted
The procedure of ‘getting informed consent’ is once more to an ethical review committee, and
often seen as a one-off action. This may be appro- provided that the results remain unlinked to the
priate if the research is of a one-off nature, such as individuals concerned. Nevertheless, this is not
a cross-sectional survey. However, when the an easy issue. Even if precautions are taken
32
3 general philosophical issues in epidemiology
regarding confidentiality, the research may have but direct feedback to individuals and communi-
negative results for groups of people. ties, or publication through the media, or to rele-
vant government departments, is an essential
Quality of research part of an ethical study proposal.
Low quality research is unethical in that it
exposes participants to the risks or costs of Ethical review of research
research without the chance of future benefit. In the United States, all academic and research
Similarly, simple duplication of research done institutions are required by law to have ethical
elsewhere (for example, to obtain a degree) may review committees. These are called Institutional
be unethical because it will not lead to any new Review Boards, or IRBs, whose members include
knowledge or action, even if the research was health professionals, ethicists, and ‘lay’ commu-
conducted perfectly. However, conducting a nity members. No such legislation exists in South
research project that was done elsewhere because Aftica, but probably all academic and research
there may be sincere doubt as to whether or not it institutions, and even some of the large service-
would work in the local situation can be defended rendering health institutions, do have a commit-
on the basis that it might lead to new knowledge tee charged with the ethical review of research
or action, and therefore costs or risks on the part proposals. Most committees do not, however,
of the participants can be justified. It is good to include ethicists or community members.
distinguish between these two types of repeating Researchers working in institutions without a
research by naming the first ‘duplication’ and the specifically appointed ethics review committee,
second ‘replication’. Duplication of research is or practitioners working on their own who want
not justifiable, and therefore not ethical, whereas to carry out research, can send their proposals to
replication is justifiable, and therefore ethically the review committees of nearby institutions. In
acceptable. addition, the Medical Research Council and the
Medical Association of South Africa provide
Conflicts of interest ethical review services in such cases.
As elsewhere in life, conflicts of interest may From the previous sections of this chapter, it is
occur in research. Often this takes the form of immediately apparent that there are many vague
sponsorship of research by a company that has an areas in the field of bio-ethics. For example, how
interest in a particular outcome of the study, such can we balance research risks and benefits? Who
as in ‘drug trials. It is important to ensure absolute balances them? What is ‘minimal risk’? When is a
independence between researcher and sponsor- person temporarily or permanently incompetent?
ship for the study to be considered ethically At what age can children be expected to make
acceptable. A not-so-obvious conflict of interest decisions concerning non-invasive research?
occurs when research is done for degree and pro- What constitutes full disclosure? When is decep-
motional purposes. The double motives in such tion appropriate?
research may at times be at odds with each other. The purpose of ethics review committees is to
Whereas the research should, for example, be of apply ethical theory and ethical principles to
benefit to those participating, pressures from particular (research) situations and proposals,
promoters or deadlines may force the research and to assist in the development of ethical
process to be modified to serve the student’s aims research. It should be clear that many situations
first and those of the participants second or not at preclude easy answers, and that deliberations
all. should often be extensive. For purposes of trans-
parency and of empowering the researchers (and
Obligation to feedback and publish communities), it would be beneficial if ethics
A last point, often forgotten in the rush to produce review committees would open their delibera-
research reports towards a degree, is that degree tions to at least the researchers, yet this is often
manuscripts tend to gather dust in libraries with- not done. As a researcher, however, one should
out their contents becoming known. Most ethics insist on a full report of the ethical reasoning that
review committees will nowadays insist on an was applied if the research proposal was rejected,
acceptable plan for the dissemination of informa- or substantial modifications were suggested. For-
tion gained from research before approving tunately, most ethics review committees will
research proposals. Publication in relevant peer- enter into debate with prospective researchers in
reviewed journals is an obvious way of doing this, such instances.
33
a contextualizing epidemiology
34
3 general philosophical issues in epidemiology
Health Professionals, Initiate and Used as addi- Projects which Work closely
planning academics control tional manpower include members with power
and to gain of the community structures
public support in order to be
for projects successful
Community Community- Limited role: Dominant, self- Process goals Question the
development elected worker part of group help, voluntary very important commitment of
facilitates co-operation involvement in power struc-
initiation by and develop- economic, health tures to real
community ment of indi- and social issues progress
genous leader-
ship
35
a contextualizing epidemiology
a
refers in general to potential users, beneficiaries motivations for participatory research could vary
or subjects of research. The varying motivations from obtaining more reliable data, to encourag-
for participatory research will determine whether ing people to cooperate in a programme, to giving
‘the community’ refers to the most oppressed and people more control over facilities (Lammerink
disadvantaged in society, the clients of a health 1994).
service, or the health workers in a health project The action component of participatory re-
or service. search could, for example, have as its goal the em-
The concept of participatory research evolved powerment of the community concerned (a
in the early 1970s as an alternative social science process), or the more efficient service delivery for
research method which began to challenge the that community (a product).
notions of neutrality and objectivity on which The extent of community participation in
social science research methodology was based. research can vary from involvement in every step
Various important historical trends contributed through to the other extreme, where researchers
to the development of participatory research use community participation to facilitate data
(Tandon 1994). One of these trends came from collection and the implementation of their own
the practice of adult educators in Third World results. In any one research project there are a
countries, who developed an approach to educa- range of activities in which a variety of partici-
tion in which the learner began to control his or pants can be involved. This has been termed the
her own learning process. These adult educators participatory continuum (Maquire 1994).
then began to develop a similar approach to Many epidemiological studies are carried out
research, in which the researched individuals or with very little involvement from the commu-
community participate in the research process nities or health workers that are the subjects of
and have some control over it. Another influence research. Their involvement seldom extends
was the work of Paulo Freire and Ivan Illich, who beyond being interviewed, answering question-
placed participatory research as an educational naires, occasionally being recruited as field-
process within the framework of an ‘alternative workers, or utilizing some of the results. In con-
pedagogy’ or popular education. During the mid- trast, a participatory research approach could
1970s, with the failures of top-down, expert- mean that:
designed development projects and programmes, the research question and the research priori-
the debate on development began to view the ties are based on the needs of the community as
question of participation as a critical variable in seen and formulated by the community itself;
human development (Tandon 1994). This further the methods of data collection are designed
influenced the evolution of participatory re- with community members and are under-
search. standable and acceptable to the community;
The main aim of participatory research is to @ the data collection is undertaken by commu-
provide a forum for the voiceless and powerless to nity members;
be creative actors and subjects, not merely @ there is community participation in the ana-
‘objects’ in the process of research and know- lysis;
ledge creation. The intention is to provide ways @ the results are distributed in a way that makes it
for ordinary and oppressed people to produce the possible for people from outside the scientific
information they require to understand and to profession to participate in the discussion.
deal with problems in their lives and larger insti-
tutional structures. Full participation of all the people in the com-
However, motivations for participatory re- munity, or health workers in a project, in all
search will vary, in ways similar to those aspects of the research may be clumsy, cumber-
described for community participation in pro- some and not always appropriate. The extent of
grammes in Table 3.1. Where community partici- participation can be decided on by the commu-
pation is seen as a process in which disadvan- nity and may depend on the project’s aims and
taged people work together to overcome prob- resources, and whether there is time for all parti-
lems, challenge the distribution of power and cipants to take part in each stage of the research
resources in society, and gain more control over process. The level of motivation of the group is an
their lives, participatory research is used to important consideration.
enhance their awareness and confidence, and to Full participation may slow down the research
empower their actions (Lammerink 1994). Other process to such an extent that all impetus and
36
3 general philosophical issues in epidemiology
enthusiasm is lost. Progress must be evident to cation of tasks and the process of accountability
maintain the interest and commitment of the need to be clearly and openly communicated
group, while not compromising the process. The during the research.
balance is a delicate one. For specialized tasks, The participatory research process has advan-
the group may need to consider delegating tages which go beyond the research itself. It can
responsibility. Each participant remains account- strengthen the democratic processes in a health
able to the group, and the value of the participa- project or community; it can lead to greater self-
tory approach need not be lost. reliance in the participating group and greater
Participatory methodology and participatory involvement in decision-making; it can encour-
research can be qualitative or quantitative. There age trust, responsibility and equality among all
are no typical participatory research methods. concerned; and it can equip project and com-
Conventional research methods such as struc- munity members with the knowledge, skill and
tured questionnaires can be used in a participa- attitudes needed for working together to tackle
tory way. Other unconventional, accessible their problems and meet their needs. While the
methods of data collection, analysis and report- research process can be empowering, the re-
ing results may need to be created in response to search product itself may also benefit from
the problem, the context, and in response to community involvement, although this is not a
‘where the participants are at’ (Meulenberg- primary motivation for participatory research. It
Buskens 1994). may be a more accurate reflection of local real-
Examples of these are mapping exercises in ities, problems and needs, thus providing a better
which community members create diagrams and grounding for more culturally relevant and thus
pictures (maps) visually illustrating information more effective and meaningful local action
about their community, group discussions, a ‘pic- (Maquire 1994). Thus any action, policy or social
tionnaire’ (see Figure 3.6), group tallying exer- programme emerging from such research is less
cises to collate results, the use of visual images, likely to marginalize these communities, and
radio, photographs, exhibitions and popular more likely to truly serve them (Khanna 1994).
theatre to report results. The participatory One of the requirements for engaging in partici-
researcher needs flexibility and creativity -to patory research is time. One of its major dis-
design innovative, appropriate methods. advantages is that it is often a long, very time-
A participatory approach to research does not consuming process. In participatory research, the
mean that the scientific merit of the methods or researchers may relinquish control over the identi-
the validity of the results needs to be compro- fication of the research question, as well as over re-
mised. The researchers need to facilitate an search funds, and they may experience this as a dis-
educational process in which the community advantage. A further difficulty researchers may ex-
participants are introduced to previously un- perience relates to the process of establishing the
familiar research methods, technologies and involvement of a community, or accountability to a
processes in accessible language, free of scientific community. In deciding which individuals or orga-
jargon. Thus the researcher needs to take on the nizations represent the community’s interests, re-
role of educator or trainer. Time must be set aside searchers may become embroiled in that commu-
for the learning and the participation processes. nity’s political conflicts and power struggles.
In the research process, the researcher too
becomes a learner and may find him- or herself Evaluation of community participation in
changed and challenged in sometimes profound projects
and unsettling ways through the process of Many health projects explicitly state community
engaging in collective investigation, education participation as a goal of the project. Indeed,
and perhaps action (Maquire 1994). community participation is seen as an important
Respect and clear communication are crucial component of the primary health care approach
qualities in the relationship between the re- which has been adopted by many countries,
searchers and the community in the research including South Africa. However, it is usually
process. The resources and experience brought difficult to evaluate how well this aspect of a
by each party into the research partnership are service or project is being implemented.
different. This should be acknowledged, and each A variety of criteria for judging the quality of
group should appreciate and respect what they participation in projects have been put forward
can learn from and share with the other. The allo- by different authors. Cohen and Uphoff (1977)
37
a contextualizing epidemiology
&o
local leaders, government personnel or foreign %,oD
personnel. The last criterion considers how patti- f
.
cipation takes place. A suggested adaptation to %2
this framework is to include an autonomy/ ’ Management
a
Sree CCRC Ce Rm Ce |
community participation
Needs and skills assessment Organization of health committee
Management
Management by committee
Management by staff
Skills development
38
siaquiow
39
general philosophical issues in epidemiology
yj -NUWWOD
}efoid
sowwelsojd
Aju
puoheg
pua}xe
siaquiew
0}
BuldojaAap
Aywnoe
ou 1A
uoNnueu}
‘s|}!4s
seyIWWod pue eis
se
e}s joafoid 0} poy tui)
-dojarep
JUS
juawdojeaap
ynq
0efoid 0} payuuy ue Si[Iys ale Sj|jIys dojaAep S||D¥S dojaaap
sjiI4S
aen!wwoo
jo
Ayyige
4unoode
0}
auly
senuoyjne senuoyine
sejngaiJo
“yes
yoRq
WOd
jyeys UUM-yodai
01 SI Ayyiqejyunoooe
yoofoudSISQUOU
0} Aqiiqequnoooe
aeiwwoo
saljigisuodsas
pauinsse
Jeajoup ‘seyiuwos
Ajjeonew
01 Aijiqejunoooe
‘aayIwWwodD
juaweseuew
JO aury “seyIWWOO QU} 0} yOoRg-Woda OU yoeq suodai eis yels yNOUUM He}js |euoissa}
0} Aq
Aigisuodsai
juowaseue|\
peyesajap
ay} 0} yoeq-uodei YM Ing Hers yoa[oid 0} -oid Aq payeuiwop
Aq
-oyne
uonesejeq
jnq
90Y pe UUM seyIWWOD Aq eenwwoo Aq peyesajep Aja}e|dwo9 jng ‘ye}s
jou Aq
eS
palesejep senijiqisuodsal AjjeussO} SaNiiqisuodses 0} payesajep Ajjeuuo} eis yoofoid
0}
3
jusweseuew
PsAlOAUl
YUM
JOU SI da;IWWWOO
aun nq ‘Ayiqisuodsed
syedxa Jo eis yoofoid/spodxe Juowaeseuew
uonezijnn /Nuoujne yyeoy yyM awos seyiepes] | jyeis eafoid/syedxe
ayeudoiddy juowesgeuew poueus si yse] “yes yoofoid/syedxe /Nuoyine yyeoy
*Aiqisuodsas Juaw /Nuoyne yyWyeoy yi 10 Ayjiqisuodsai
INANOdWOO9
"peseueU-}|as Jo ped Joye0/8 oy}
Aja}e|dwo0o 404 Ayjiqisuodsey Sulye} -eseuew awos aAey ay} 40 Ayjiqisuodsa 8y} S| swweJsoid BOTIWLWOD ay}
seywwo) | ‘peseuew-sjes seyiww0D | Jeped| pue sayIWWOD au} Ajurew JuaWeseue|| au} jo juaweseue; | Aq JUswegeue/\
LIND
Vida
j00foid e jo JUOWOSeUeU OY} UI UO}edIDI}Aed APUNUIWIOD SulyUues 10} PLIOIUD ZS 9/qGeL
a contextualizing epidemiology
Chetty and Owen (1994) adapted and expanded written-up, or if written-up, not read, or if read,
Rifkin’s criteria to allow assessment of participa- not used or acted upon. Only a minuscule propor-
tion using a more detailed method of scoring. tion, if any, of the findings affect policy and they
Rifkin’s five axes were each divided into three are usually a few simple totals’ (Chambers 1989).
components, assessed, and ranked from 1 to 5 As the costs of research escalate, there is grow-
according to predetermined guidelines. This was ing concern about the value society receives in
developed in the South African setting, recogniz- return for the investment it makes in research.
ing the diverse contexts in which community The traditional view that the pursuit of knowledge
participation takes place. for its own sake is the mark of a civilized society,
Figure 3.4 gives the components into which and should be supported as an act of faith, is no
each of Rifkin’s axes were divided and provides a longer unquestioningly accepted. A limited
useful guideline for projects in the initial plan- research budget demands that choices have to be
ning stage, as well as later in the project when made. The current trend in research funding
participation needs to be improved. Table 3.2 focuses positively on areas that are most likely to
describes criteria for ranking each of the three yield a return to society (Twiss 1988).
components along the ‘management’ axis, show- The purpose of epidemiological research is to
ing some of the qualities that allow a project to be improve public health by identifying and redefin-
ranked high on community participation. ing solutions to problems through the generation
When evaluating community participation it is of new knowledge. The generation of new know-
important to identify exactly what aspects and ledge or technologies alone is not an adequate
degree of community participation the specific goal in such an applied discipline.
project or service aims at. Once these have been Most researchers hope their results will be read
identified, qualitative and quantitative methods and acted upon by policy-makers, and assume
can be used to see if the goals have been reached. that the utilization of research information
Chetty and Owen propose the use of semi-struc- depends primarily on the intrinsic worth of that
tured interviews with as many different role information. This assumption has been proved
players as possible in a given project. Qualitative empirically to be false (Weiss, Bucuvalas 1977).
methods in the form of focus groups and in-depth For a research project to make a contribution to
interviews can also be extremely useful. health problems, the researcher needs to be inno-
vative not only in generating new knowledge, but
Conclusion also in disseminating it to potential users (stake-
An approach embodying community participa- holders). Training in epidemiological research
tion can be applied to many public health skills rarely emphasizes any aspects of the imple-
endeavours, including epidemiological studies. mentation of research recommendations, result-
Such an approach demands that researchers ing in limited practical application of research
acknowledge, respect, understand and involve findings in the public health field.
the ‘populations’ they study. A participatory
approach, while not appropriate in all epidemio- The ‘implementation of research’
logical studies, can enhance both the research To ‘implement’ means to ‘bring into effect’. The
results and the implementation of its findings and definition must be interpreted in a flexible way if
recommendations. Public health has been it is to be a useful concept in research. It may be
defined as efforts ‘directed to the maintenance used in a situation when the extension of know-
and improvement of the health of the people ledge leads to changes in policy, behaviour, deci-
through collective or social action’ (Last 1988). sion-making rules or the allocation of resources.
Community participation provides a way of Research may yield a wide range of changes. At
ensuring that epidemiological findings can facil- the simplest level, the change may be in the level
itate such action. of knowledge or understanding of a problem. This
knowledge may precipitate the development of
Planning for the implementation of new skills or lead to a change in the behaviour of
research individuals or groups. The new information may
lead to changes in organizational policy or, at the
Introduction most complex level, a change in individual
‘Much of the material remains unprocessed, or assumptions, values and beliefs. These changes
if processed, unanalysed, or -if analysed, not may infiltrate from the individual to the social
40
3 general philosophical issues in epidemiology
4l
a contextualizing epidemiology
tions. It is very important that the stakeholders sented by HIV vaccine trials is the extent to which
closest to the unit of study be involved; for researchers must educate the study subjects
example, village structures and not regional about ‘safer sex’ practices. The research needs
structures, if the focus is on village level. ‘unsafe sex’ to prove the value of the vaccine, yet
¢ define the implementation objectives at the the principle of beneficence demands that we go
protocol development stage. These differ from out of our way to encourage ‘safer sex’ practices.
general scientific objectives in that they state If you were asked to take part in this trial as a re-
the purpose of the study and what is to be done searcher, how would you solve this dilemma?
with the results. For example, a study of the
prevalence of malnutrition (scientific objec- 2. A research project involving prostitutes in
tive) is only one step towards the true imple- inner-cities needs their co-operation over a
mentation objective — reduction of malnutri- period of time. The researcher has to obtain their
tion in the community. personal details to ensure follow-up, and devel-
@ involve the stakeholders in the research ops an intricate system of recording information
process and keep their interest in the methods so that only she knows the identity of the persons.
and analysis of results. The moral dilemma in this case centres around
@ give feedback regarding the results directly, the researcher. The only way to help improve the
accessibly and promptly to the stakeholders. health of the prostitutes is through the research,
for which personal details are needed. Yet, prosti-
Conclusion tution is illegal, and if a court of law were to order
The issue of how intimately the researcher should the researcher to hand over all records, she would
be involved in facilitating the implementation of then be required to do so, thus breaking her pro-
research recommendations is a controversial mise of confidentiality.
one. In the past, there has been a tendency for What would you do in this case?
epidemiological research to be practised inde-
pendent of any health practice application. There 3. A researcher wishes to investigate the quality of
is a need to link research to policy and many re- care given by pharmacists to children with infan-
search organizations and funders are now insist- tile gastro-enteritis. The questions relate both to
ing that implementation objectives are stated the diagnosis reached and to the therapy pro-
explicitly in epidemiological protocols and fund- vided. Ideally, the researcher would like to com-
ing proposals. Many of the issues involved in pare the quality of care given by pharmacists to
implementing research recommendations are the care given by general practitioners. However,
difficult to negotiate. In attempting to define the this will not be possible for methodological
issues, researchers may take the first step in reasons, as the two professions will require en-
developing a strategy to improve the utilization of tirely different assessment procedures. On asking
research-based information. the opinion of a pharmacist about the study, she
suggests that the study not be conducted at this
time, as it could prejudice the tense debate
between medical practitioners and pharmacists
EXAMPLE 3.1 about providing, and charging, for primary care.
Some ethical dilemmas to stimulate In other words, any negative results of the study
could be used by medical practitioners to argue
ethical reasoning their case for not allowing pharmacists to engage
1. The development of an effective preventive HIV in primary care activities.
vaccine would constitute one of the great medical Clearly, the information on the quality of care
miracles of this or the next century. HIV vaccine delivered by any health worker is pertinent to a
trials are in the process of beginning, and many good health service, and is therefore morally jus-
have proposed developing countries as trial sites. tified on the basis of beneficence. On the other
In any HIV trial, researchers will have to compare hand, the results may lead to an unfair advantage
a vaccinated group with an unvaccinated (or (at least perceived as such by pharmacists) in the
placebo-vaccinated) group for incidence of infec- dispute between the two groups, and may there-
tion with HIV. To become infected with HIV, fore not be justifiable on the basis of the principle
however, one needs to engage in ‘unsafe sexual of justice.
practices’. One of the great moral dilemmas pre- How would you deal with this?
42
3 general philosophical issues in epidemiology
44
3 general philosophical issues in epidemiology
eme kae?
1. Ngawana wa gogo wa mafelelo ona le mengwago
(How old is your child ?)
45
a contextualizing epidemiology
ar ee eM Na ORCC hme LE
s 135 (897%)
46
Literature review _
_ G Joubert,J Katzenellenbc
Sampling oe
G Joubert, J Katzenellenbogen
49
b epidemiological research methods and protocol development
authorship — it must be earned through intel- comparability of results. Protocols are also
lectual contribution and work. submitted to funding agencies (for example the
Medical Research Council) when funds are
sought, and to ethical committees for approval.
The study protocol Universities require protocols before confirming
The study protocol, sometimes called a research the registration of a student for postgraduate
proposal, is a formal written document which is study. The protocol eventually helps in the
prepared prior to the study. It documents the writing up of the study.
reasons for the study and the details of the The protocol is a dynamic document which
methodology and, in doing so, gives a clear plan keeps changing as the planning of the project
of exactly what needs to be done. progresses. A very crude protocol might be pre-
The study protocol has many purposes. It helps sented in a pre-protocol session where colleagues
the researcher focus on critical issues, clarify review the initial ideas and brainstorm their feas-
ideas and keep on track. It also enables outsiders ibility with the researcher. Based on input at these
to understand what is intended so they can sessions, a more thorough and systematic proto-
comment critically either on the written docu- col can be drawn up and critiqued at a later proto-
ment itself, or at peer group protocol meetings col meeting. Changes and adaptations based on
where protocols are reviewed. A clearly written recommendations made by peers and consult-
protocol also enables other researchers to repeat ants should be made so that the benefit of the
the same design in another setting and allows for peer review process is not lost. Key stakeholders
in the research should be invoived in the protocol
development to ensure that there is agreement on
Table 4.1 Major headings ina the importance of the issues being studied, that
chteateted there is acommitment to act on the results, and to
enhance participation. Once the protocol has
Tite,piraeet investigator, co-investi been accepted by an ethics committee it should,
institutional affiliations, and qualifications however, not be changed further. If more changes
are required, these changes have to be approved
Summary/Abstract by the ethics committee.
1 Introduction The process of protocol development can be
1.1 Literature revi W slow if the researcher is inexperienced, if there are
1.2 Motivation for the study (problem) few resources and consultants available, and if
1.3 Purpose the area of research is a new one.
1.4 Specific objectives
41.5 Implementation ee Contents of the protocol
Methods o A protocol should start with a title and authors
2.1 Definition of terms page. An informative (but not cumbersome) title
2.2 Study design which clearly describes the purpose of the study is
2.3 Study population and sampling important. This information is placed on many
databases. Cryptic titles are not desirable. The
principal investigator’s name, qualifications and
institutional affiliation should then be stated.
3.1 Responsibilities of investigators Some funding or reviewing institutions (for exam-
3.2 Responsibilities of staff ple, the National Institutes for Health in the USA)
3.3 Time schedule : do not accept more than one principal invest-
Data management and ae igator, while other institutions are more lenient. A
_ Resources _ list of co-investigators (either in order of import-
5.1 Available resources ance to the study or in alphabetical order) with
5.2 Budget and budget motivation. qualifications and institutional affiliations must
Ethical and legal considerations — also be included.
Reporting of results _ The title page is followed by a brief summary or
References _ abstract which gives the reader a general over-
Appendices view of the proposed study. A short statement of
the purpose of the study, the major research
50
4 planning a research project
to in the aim or topic of the study, and the state the specific format and manner in which
specific objectives, must be defined. This results will be reported to both the public in
ensures that all people reading the protocol general and particular individuals and groups.
have the same understanding. Definitions also Target groups which can facilitate the imple-
help the researcher clarify concepts for him- or mentation of recommendations based on the
herself. research findings should be identified.
The investigator must state which type of 8 References should be included, using an
epidemiological study (study design) will be accepted format.
used to best meet the objectives. ; 9 Appendices should include a copy of the ques-
The study population must be identified tionnaire/data capture sheet and any other
precisely in terms of sex, age, place, condition, items of importance.
and any other relevant factor. If sampling is to
be used, the sampling strategy must be A detailed checklist of points which should be
described in detail. The source of the sampling covered in a protocol is given in Table 4.2. This
frame, and the intended sample size (and checklist was drawn up to ensure that all impor-
reasons for this) should be stated. tant scientific issues would be covered in the
The method of data collection (measurement) protocol and considered by the protocol reviewer.
should be specified, as well as the variables or If a protocol is submitted to a funding agency, a
characteristics to be measured. Measuring detailed budget would also have to be provided.
instruments should be described, and an indi-
cation of their validity and reliability provided. Bias in research
The protocol should report on or state the Bias can enter the research process at any stage,
intention to conduct any pilot studies. These and special attention should be given in the pro-
are mini-studies which test part(s) of the study tocol to outlining how bias will be avoided at the
before the main study. Pilot studies usually various stages.
check the methods (for example, instruments The Chambers Dictionary defines bias as a
or logistics), obtain data to assist in sample size ‘one-sided inclination of the mind’. In research,
estimation and/or test the adequacy of field bias takes on a more technical meaning, but
training. _remains true to the above definition in some
3 The practical logistics of the study must be ways. It is any process which produces results or
carefully planned. These include clear task allo- conclusions that differ from the truth in a sys-
cation, organization of venues, shifts or duties, tematic (one-sided) way.
and transport. A realistic approximate time Bias can enter the research process at any stage:
schedule for carrying out various aspects of the in the stage of reviewing the literature (for
study should be attached. example, the researcher restricts articles to
ol
b epidemiological research methods and protocol development
Protocol title
i LARA
Literature review
Definition of the problem/
hypothesis/research question
Aims and objectives
Implementation objective
Method: Study design
Study population
Sample size
Variables
Measurement
Validity
Reliability
Questionnaire
Coding
Data management
Analysis
Write-up
Pilot study
Project management
Timetabie
Ethics
Additional aspects (specify)
Comments:
52
4 planning aresearch project
those that support a certain view or finding) In this manual, attention is given to bias which
insample selection (for example, selection of can occur in the selection process, data collection
non-comparable groups as cases and controls process, and due to confounding.
may give spurious results) Once present, bias is difficult to deal with and it
in executing an intervention (for example, is well worth the effort to be cautious in the proto-
patients from a control group may unintention- col design phase, so as to avoid bias wherever
ally receive or be influenced by intervention, possible.
thus ‘contaminating’ the results)
in measuring exposures and outcomes (for
example, instruments can over- or under-score
Conclusion
" measurements taken, hiding real differences or However well a study is planned, difficulties will
showing differences where there are none) arise. Public health research does not take place
@ in data analysis (for example deciding on the in ‘an isolated, structured environment like a
significance level after rather than before the laboratory, and thus the chances are quite high
analysis can lead to biased results) that something unforeseen can happen.
in interpreting the data (for example, there may The development and completion of the proto-
be over-interpretation of statistically but not col is merely the beginning of an often rather
clinically significant results) difficult emotional experience for the researcher.
@ in publication of results (for example, publica- Figure 4.2 describes some of the ups and downs
tion of positive results only may give a false that researchers can anticipate during the differ-
impression of the findings). ent phases of research.
The other chapters in this section will describe
Sackett (1979) published a catalogue of some selected elements of the research process in
50 biases which can occur in epidemiological detail. These include literature review, setting
studies. It is less important to know all the differ- objectives for a study, study design, study popula-
ent ‘jokers’ which can invalidate research find- tion and sampling, measurement, data manage-
ings than to know the principal situations in ment and analysis, resources, ethical and legal
which they can arise in the design of the research. considerations, and reporting of data.
©) Submit for ©
Q publication In print
Ecstatic Completion of fieldwork
CS Ready to andparty)
Y collect data
Click The idea Try to implement
“ results/ ,
Rejections recommendations
Initial protocol
development Corrections
Delays
Emotional ©
reaction Adjustments
Implement
Practical action at last
difficulties Peer review
feedback
Critique
cS
No-one interested
Coding a What now? Start writing up
catastrophe 6 @&) mental block
Experience has shown that researchers undergo similar patterns of emotions while involved with any research project.
Each stage of the research process tends to start on a positive note, but inevitably obstacles arise, with each stage having
its own set of difficulties. Problem-solving abilities and persistence are tested to the utmost. Beginner researchers should
note that once the fieldwork is completed, there is still a lot of work to be done. :
The above graphic describes emotional reactions (Y axis) to typical events during the different phases of a study (X
axis). It aims to help the researcher anticipate some of the emotional swings so that when (not if!) they occur, difficulties
can be seen to fit into a common pattern.
58
Literature review
54
5 literature review
reports. There may, in addition, be many reports Systematizing and summarizing the
which are unpublished because they show incon- literature
clusive or negative results. They may be very diffi-
cult to track down, but their absence may bias the A large number of articles can be identified as
review. Relevant official reports, such as reports relevant to a specific topic, and one should there-
by the Medical Officer of Health, or reports pub- fore devise a system of organizing and summariz-
lished by the Medical Research Council, Human ing them before they start piling up. Various ap-
Sciences Research Council, or other bodies proaches can be used. For example, the research
should also be reviewed. topic can be divided into various themes, and ar-
ticles can be grouped into these themes. One
article may, however, fit into more than one
Reading the literature theme, and you could make copies of the article,
While going through the literature, whether or use a system of cross-referencing. Whichever
through a computer-based search or a reading of system you use, it is useful to have a card index
various journals, the researcher will of course system as well. Each article should be numbered
come across articles which are not directly related and the details of authors and publication noted
to the topic of interest. These should not be on the card, as well as keywords and quality of the
discussed in the literature review, since the litera- article. The cards should be ordered alphabetic-
ture review should show one’s knowledge and ally so that you can quickly check whether you
summarize relevant literature, not provide an have an article by a certain author.
exhaustive (and boring!) list of everything that has
ever been written on the subject. The aim is to Conclusion
synthesize relevant literature in a critical way.
From the title of an article, you should be able When writing up the literature review for a proto-
col or paper, you should draw together your
to determine whether the article is at all relevant
to the research topic. If it does seem relevant, you reading, critically summarizing only the really key
issues. In a thesis, the write-up would however be
should carefully read the summary. If it indicates
that the paper is relevant, read the entire article more detailed in order to show afull understand-
ing of the issues. In both instances, the literature
carefully, paying special attention to the Methods
review should describe scientific discoveries
section, to be sure that the Results and Conclu-
which build on past knowledge, thus drawing a
sion have a sound basis. Attention should be
picture of the current state of knowledge and
given to measurement, sampling and issues con-
highlighting gaps in the current state of know-
cerning study design, and articles should be
ledge. This will ensure that the contribution your
classified according to the quality of methods
study might make is clearly contextualized.
used. (Details given in the rest of Section B will
enable you to judge the quality of the methods
used.) Ideally, articles should not be included in
the write-up of the literature review if you did not
read more than just the abstract or summary.
55
Setting
objectives
for research
56
6 setting objectives for research
Consider the following scenario in an urban with arrows pointing to the circles they ‘caused’.
squatter setting: Repeat asking ‘why?’ to each response (brain-
Raa storm) until you run out ofideas.
/ 3-year old This process is repeated for each of the events.
Sipho plays You may return a few times to earlier ‘whys’ as
in stagnant you gain momentum and get new ideas. Some of
water / the events have similar causes, so these common
causes should be linked with arrows.
a " Sipho / mother tries It would be useful for the reader to do this exer-
/ dies on to care for him cise before going on to the next paragraph.
thewayto | ; athome giving | Once the ‘why?’ process has been exhausted, it
hospital _/ anenema _/ is useful to present the analysis in a clear, more
systematic way. This can be done by organizing
Starting with the first event circled (that is, Sipho factors and causes into larger categories such as
playing in stagnant water) ask the question ‘why?’ political factors, socio-economic factors, health
and put your response in a circle with an arrow service factors, disease-related factors and so on.
pointing towards the event. There may be a The choice of categories is up to you, but they
number of reasons why Sipho is playing in stag- should flow from the ‘reasons’ generated in the
nant water, so add these to the diagram. It is problem analysis. You should now be able to give
useful to generate as many reasons as possible. a critical analysis of why Sipho died. Table 6.1
Now for each reason generated, ask ‘why?’ and shows the factors which possibly contributed to
put your second-generation reasons in circles his death.
Preis)
( Rome Mle eile mie mM Met LC tee)
ROR yee ey 141]
Sf
b epidemiological research methods and protocol development
a eee I ee aaa
In the early planning stage of a research project, Vague statements of purpose lead to vague
we often do not have aseries of events to analyse, studies. The purpose is a statement of the exact
but a central issue or problem. The same process ‘destination’ that you want to reach with the
of asking ‘why’ can be undertaken in exploring study: the methods are the ‘road’ you will take to
the central problem, for example, the high inci- get there. Exactness in the purpose requires deep
dence of unplanned teenage pregnancy. consideration of the problem which motivated
It is useful to do part of the problem exploration the study and a purposeful decision to restrict the
with colleagues and to consult the literature — study to one key aspect of the problem or its solu-
outside input always helps you to find further tion.
factors and question your own preconceived ideas. The purpose may be expressed in the form of a
Illustrated are diagrams of analyses of two statement that is as specific as possible, using
health problems which are commonly of concern action verbs (for example, ‘To determine the inci-
in South Africa (Figures 6.1 and 6.2). Note that the dence of diarrhoea among children under five in
same process was undertaken, but that the the summer months in Mamelodi’) or in the form
diagrams represent the end-product after all the of a question (for example, ‘What proportion of
messy and challenging work of asking ‘why?’ has Mamelodi children get diarrhoea during the
been completed. summer months?’).
The problem-exploration process does not Here are some examples of vague versus expli-
automatically suggest a research topic or ques- cit purpose statements.
tion. Instead, it improves understanding of the
problem and the multiple issues which impact on Vague purpose statements:
it. Ithelps to isolate the component of the problem 1 to study the problem of measles in South Africa.
that the researcher wants to study. The exact re- dS to investigate alcohol consumption as a con-
58
setting objectives for research
saul}
(AguOW
duipem
Hu07
JO
9WWI]
1SO9
Ul)
Ajiuuey yo SOAI}elad WO} WOdd
ns
s6nip
se)n6e4|
‘suBis Jo sbpajmouy
Ajddns
Hulse}
jo
AYIUNWIW0d 1004
qualonsuy
uolsiedns
qualoiynsu}
Sal}i[IG!SSOO
JUaW}eaJ} aAIVeUJa}e swojduwAs Jo uononpal (swoydwis ‘subis
souljapin6
yainb Jo asuodsed Ou)
ayelidouddeu|
A\se8) UdIIpUod
jo Ayiqeyeny
UaW}e8/} 0} asuodsaYy jo ssausnolas
891ag
$10}92}
$10}92) 91W/0U099 puke jeJN}jNd-0190$ $10}9} paje|ai-aseasiq
sjuoned gi Suowe 93724 10};NeJop YRIY 0} BUIZNGIA}UOD S10}9By JO Meta ee) CEE Meaes E
b epidemiological research methods and protocol development
St ee CELE eC re Ce aMTE
CYT Eertom rtm cL
Class factors Socio-political factors Service factors
pueeeal=7
Unemployment Apartheid in No sexuality education
Emphasis on unacceptable
Poor schooling methods: limited choice
Adolescent rebellion
Limited planning to
Poor locus of control avoid pregnancy
Defeatism/fatalism
Limited conscious
future planning
60
6 setting objectives for research
specific problem
ed
specific problem Se sseiahie General
General smn A Cr ag Ce
ae result or
problem >[speaiteproblem| 5 study| rest solution
specific problem ERTS | een
inconclusive information on a broad range of resources available and the usefulness of the
questions. The researcher needs to be patient and results. If no descriptive information is available,
systematic, and humble in recognizing that the that is, if there is a need for numbers and frequen-
project will not solve the entire health problem. cies of characteristics related to the problem, one
We will use the Sipho scenario described earlier of the first five studies should be undertaken. If
to illustrate how aparticular public health prob- there is enough basic information, but a need to
lem can give rise to a number of separate studies, understand more about what may be contribut-
each with their own purpose, aim or topic, but ing to the problem, studies asking similar ques-
still relating to the overall problem described. tions to numbers 6, 7 or 8 should be undertaken.
If there is good understanding of the causes of the
Studies arising from the Sipho scenario: illness, then studies should be done to evaluate
1 Aprofile of environmental conditions in the interventions designed to reduce the problem
squatter camp. (numbers 9 to 11).
2 Asituational analysis of health facilities in the
area, including access to primary, secondary Step 3: Setting specific objectives
and tertiary care.
3 Aprofile of childhood mortality in the area, The purpose or aim of a study should be broken
including cause-specific mortality rates. down into clear, mutually exclusive objectives
4 The prevalence of malnutrition among child- which are logically connected. To a large extent,
ren under five in the community. these determine the subsequent planning of the
a A study describing knowledge, attitudes and study. Specific objectives indicate the specific in-
practices of caregivers concerning diarrhoea formation the study must yield and the detailed
and its treatment. research questions that must be answered. Spec-
6 Acomparison of the childhood mortality in ific objectives are often stated in operational
communities with and without piped water. terms, which are easy to apply in practice. They
Socio-economic risk factors for diarrhoea. are also commonly phrased in measurable terms
onMaternal education as arisk factor for death
(in such a way that the answer is a number or rate)
among children who get diarrhoea. and need to be clear, specific and unambiguous.
9 Acomparison of the diarrhoea mortality in a Specific objectives indicate the individual com-
community before and after the introduction ponents of what will be achieved (but not how it is
of piped water into that community. to be done: that is left for the methods section of
10 An evaluation of the effect of acommunity the protocol).
health worker programme focusing on appro- Descriptive studies usually set out to describe
priate treatment by mothers of children with characteristics of the group being investigated,
diarrhoea. and the statistical goal is usually simple data
11 An evaluation of different methods of rehydra- description or estimation of a characteristic in the
tion to determine which is most effective in a study population (see Chapter 7: Study design).
hospital setting. In analytic studies, the objectives are to
examine associations between characteristics
The above represent some of the studies emerg- and possibly to seek new knowledge on the
ing from the problem exploration done in this cause(s) and the natural history of diseases or
case. The exact choice of study depends on the conditions. Intervention studies attempt to deter-
kind of information already available, the interest mine if an intervention is associated with an
and skills of the researcher, the money and improvement (or deterioration) in health status.
61
b epidemiological research methods and protocol development
a
a a a
Thus analytic and intervention studies always project. It helps to keep the researcher on the
involve hypotheses or predictions of a relation- path originally set out, and enables outsiders to
ship between a factor (or factors) and the evaluate the protocol and/or study.
outcome under study. The hypothesis or predic-
tion is statistically tested after relevant informa-
tion has been collected and analysed, and the EXAMPLE 6.1
hypothesis is either rejected or not rejected. Setting research objectives: Sipho’s
The research hypothesis or question is the pre-
diction or supposition which motivates the study.
death
Once the question or hypothesis statement has Step 1: The problem (what is motivating the
been framed carefully, the hypothesis has to be study?)
stated in such a way that the prediction may be
evaluated by appropriate statistics. This new Sipho’s death, which was explored at the begin-
statement of the hypothesis is called the statis- ning of Chapter 6 (see Table 6.1).
tical hypothesis. Thus the research hypothesis or
question leads to the statistical hypothesis, a Step 2: The purpose or aim of the study (overall
statement made in mathematical terms. For topic of a particular study, chosen from a number
example, the research question ‘Is there an asso- of possible topics)
ciation between education level and food hygiene
practice?’ needs to be framed in the appropriate To identify risk factors for diarrhoea among pre-
statistical terminology to enable the use of the school (under the age of six) children in Khaye-
t-test or the calculation of confidence intervals for litsha, Cape Town.
mean differences. Thus the statistical hypothesis
will state that there is no difference in the mean Step 3: Specific objectives (specific information
education level of people who practice ‘good’ the study will yield, linked to the purpose or aim)
versus ‘bad’ hygiene.
The intention to test a hypothesis is stated as a To ascertain which of the following factors are
specific objective, along with other specific objec- associated with diarrhoea.
tives where appropriate. ¢ Factors related to water: distance to nearest
tap, quality of water used
Step 4: Implementation objectives Personal factors: mother’s hygiene practices,
mother’s level of education, mother’s knowl-
It is important to state what will be done with the
edge of causes and prevention of diarrhoea and
research results and what will be done before and
dehydration
during the study to ensure that the results are
@ Environmental factors: method of water
used. This links into ‘why’ the study is being done.
storage, location of toilet, site of refuse disposal
There should be clarity about the implementa-
Socio-economic factors: availability of sugar
tion objectives of the study so that decision-
and salt for oral rehydration solution, ability to
makers, service providers, and possible benefac-
afford transport to health services
tors can join the research team early and ensure
@ Health service factors: distance from home,
that results are useful and used.
opening times, cost of services, consultation of
traditional healer
Conclusion
Exploration of the problem motivating the Step 4: Implementation objectives
research helps the researcher to get a broad, crit-
ical view of the issues related to the central To make recommendations for interventions tar-
problem identified. geted at reducing the incidence and prevalence of
Setting objectives for the study is another diarrhoea based on the results of the study.
important (and often difficult) part of the re-
search process, as it helps to focus the study by Reference:
describing exactly what it hopes to achieve. It Adapted from Vundule C. ‘A study on the risk factors for
helps the researcher break the study into different diarrhoea in Khayelitsha, a peri-urban settlement area in the
parts, and acts as a guide during the design of the Western Cape.’ Epidemiology research report, Community
methods, as well as during later parts of the Health Department, UCT, 1994.
62
6 setting objectives for research
EXAMPLE 6.2
Setting research objectives: the case
of AIDS research
Step 1: The problem. (what is motivating the Step 3: Specific objectives (specific information
study?) the study will yield, linked to the aim)
There is an AIDS epidemic worldwide and in Assume that the aim chosen to be most appro-
Africa. Currently no cure or vaccine for AIDS priate at this stage is:
exists. Behaviour change remains the only imple- to investigate psychological factors associated
mentable intervention area. with behaviour change among high school
students in Soweto.
Step 2: The purpose or aim (overall topic of a par- The following objectives may be stated.
ticular study, chosen from a number of possible ¢ To determine knowledge of HIV transmission
topics) and prevention among individuals in the group
targeted for research
Given the above problem, one of the following To determine attitudes towards people infected
purposes/aims could be appropriate for the with HIV among individuals in the group tar-
study. geted for research
To determine the practice of behaviours related ¢ To investigate the extent to which individuals
to AIDS (in different target groups) in the group targeted for research believe them-
To investigate psychological factors associated selves to be at risk
with behaviour change (in different target To determine attitudes towards particular pre-
groups) ventive behaviours (for example, condom use)
To determine access to condoms (in different @ To determine self-confidence in asserting pre-
target groups) ventive behaviours
To evaluate AIDS education techniques
To determine difficulties in implementing new Step 4: Implementation objectives
AIDS education programmes
@ To determine factors related to the receptive- To make recommendations to education author-
ness (of different groups) to AIDS health educa- ities concerning health education in schools
tion programmes regarding sexuality and AIDS.
63
Study design
64
7 ~~ study design
Descriptive
(describes)
Observational studies
Cross- sectional
Analytical ————————————__>
(explains) > Case-control
se,dana diigisoetigmenies si Cohort
longitudinal study. The study base constitutes the State residents over 25 years of age). As discussed
number of people at risk and the time they are at earlier, the situation regarding a single group is
risk. The study base is typically measured in units described in terms of the size of the problem, who
of ‘person-time’ (for example, person-years is affected, where it is found, or when it occurs.
exposed to a risk factor or treated with a drug. Why do we do such studies? The main use of
Also see Chapter 2: Measures of the frequency of descriptive studies is to give service providers and
health events). planners information that will help them design
In a cohort or case-control study, the study services and allocate resources efficiently. Often
base is all individuals who could potentially have such studies generate questions for further
had the study outcome over the time period of the studies. So, for example, a case series which
study. In a cross-sectional study, the measure- describes the socio-demographic and medical
ments are all taken at the same time, so time does characteristics of a few patients with a similar
not have to be measured. disease may provide the impetus for a more rigor-
The concept of a study base is particularly ous investigation of risk factors in an analytic
useful when designing an analytic or intervention study.
study, or when critically evaluating the methods A descriptive study which sets out to compre-
of a study. If the researcher does not ensure that hensively describe the current state of affairs of a
all outcomes (cases) from the study base are certain aspect of health is also known as a situa-
accounted for, or if controls are not a represent- tional analysis. A community diagnosis which
ative sample of the study base producing the consists of detailed information about health
cases, the results of the study may be biased. problems and perceptions in a community is a
(We suggest that the above section be reread descriptive study. If the community diagnosis
after analytic studies have been covered later in also includes information on the associations
this chapter.) between exposures and disease, it is analytic.
Figure 7.1 summarizes the classification of
study types according to the criterion of the Analytic studies
research question. The rest of the chapter will
give more details on the most common study de- Analytic studies attempt to explain or analyse the
signs used in epidemiology. The broad categories situation by comparing groups, in this way identi-
of study design have already been mentioned fying risk factors for disease. For example, the
above. question ‘Is hypertension in Free State residents
related to obesity/smoking/alcohol use?’ is a
research question which requires an analytic
Descriptive studies study. A number of different analytic study de-
A descriptive study, also referred to as a survey, signs are available to answer this type of question.
sets out to quantify the extent of a problem (for Analytic studies can be done in the form of
example, the prevalence of hypertension in Free cross-sectional, case-control or cohort studies.
66
7 study design
Exposed
Not exposed
These three designs differ with respect to how in- come. But be careful of the joker in the pack! For
dividuals are sampled and the sequence in which example, a superficial look at the results may
exposure and outcome information is collected show that people who have hypertension (expo-
(see Table 7.1). sure) are more likely to have heart disease (dis-
ease) than normotensives and one may deduce
Cross-sectional analytic studies that this is because of hypertension. But this
In across-sectional analytic study, a sample of the finding may be explained by the fact that hyper-
study population is investigated and information tensives tend to be older than normotensives,
is collected on risk factors (exposures) and and older people are more likely to have heart
disease (outcome) at a point in time. The disease. This example illustrates how age can
information sought can be current (how many confound the relationship between blood pres-
cigarettes do you smoke a day at present?) or past sure and heart disease.
(how many cigarettes did you smoke on average In this case, age has distorted the relationship
each day during the past year?). between exposure and disease and is therefore
The study has a descriptive component which called a confounder. Thus when comparing the
enables researchers to calculate the prevalence exposed and unexposed, it is important to con-
of risk factors as well as prevalence of disease. sider, on the basis of knowledge of the subject
The analytic part of the study consists of com- area, whether there are factors which are or could
paring exposure groups with respect to disease be related to exposure as well as disease. Such
presence. The total sample is thus divided into confounders can distort the relationship between
four subgroups (shown in Table 7.1), namely: (A) exposure and disease, masking a real association
those who are exposed and diseased; (B) those or showing an apparent, but spurious associa-
who are exposed and not diseased; (C) those tion. Confounding occurs in all types of studies,
who are unexposed and diseased; and (D) those not just cross-sectional studies. (See Chapter 11
who are unexposed and not diseased. Examples for further discussion on confounding.)
of such cross-classifications are given in Sec- Because cross-sectional studies take a ‘slice’ in
tion C. time, current behaviour (for example, current
Such cross-classification allows statistical smoking habits) is usually obtained. But diseased
comparisons between subgroups, and may people may have stopped smoking because of ill-
show arelationship between exposure and out- ness and so current smoking information will not
67
b epidemiological research methods and protocol development
a
help the researcher to illuminate smoking as a sible, exposure information should thus be valid-
risk factor. It may be more useful to take a ated by objective sources.
smoking history, rather than asking only about Case-control studies are increasingly done as
current smoking habits. In general, when decid- part of larger cohort studies. Such case-control
ing which questions to ask about risk factors, one studies are termed nested case-control studies.
has to consider whether past exposure may be Exposure information on all participants is
more relevant than recent exposure. Neverthe- collected at the start of the study period. During
less, it is usually difficult to assess the temporal follow-up, cases are identified and a sample of the
relationship between exposure and disease in a remainder of the group is selected as controls.
cross-sectional study. The researcher can seldom Exposure status is thus measured before disease
be sure which occurred first — the disease or the status is determined, and recall bias is avoided.
exposure. However, the temporal sequence of Case-control studies are very vulnerable to
exposure and outcome can occasionally be estab- selection bias. The essential requirement is that
lished (for example, exposure during pregnancy, the control group accurately represents the
followed by outcome in the child). source population, that is, those contributing to
If the disease of interest is rare, a very large the study base. The cases (selected according to a
sample has to be studied to be sure of selecting clear case definition) need not be representative
enough diseased people to allow for adequate of all cases in the population, but could be cases
analysis. The cross-sectional analytic study may at one clinic, or cases among factory workers.
turn out to be too costly, due to the large sample Similarly, the controls need not represent the
size required. In such circumstances, one would entire non-diseased population but should come
rather consider doing a case-control study. from the same source population as the cases (for
example from the referral area of the clinic, or
Case-control studies from the same factory).
Essentially in a case-control or case-referent In a case-control study, careful definition of the
study, cases (those with disease) are compared to study base is important to ensure that the cases
appropriately selected controls (without disease) and controls selected are from the same study
to determine whether they differ with respect to base. When the study base (or source population)
exposures. So, for example, the proportion of pas- is defined at the outset, the challenge is to find all
sive smokers among cases of lung cancer is com- the cases that arise from this (primary) study
pared to the proportion of passive smokers base. If the study base comprises all individuals
among controls (without lung cancer). In tradi- living in a particular area over a set period of time,
tional case-control studies, after cases and con- it may be difficult to track down all the cases. On
trols have been selected, exposure information is the other hand, if cases are defined at the outset,
gathered. the challenge is to find a suitable control group
The case-control design can be considered an which represents the source population that gave
efficient sampling technique to measure expo- rise to the cases. A tertiary hospital-based case-
sure-disease associations in a study base (the set control study is an example of a study base being
of persons or person-time from which diseased defined on the basis of the cases recruited into the
subjects become cases). Exposure is only mea- study.
sured in a sample of all subjects in the study base If hospital or clinic cases are selected, one or
(those who were selected as cases and controls two control groups may be used: one hospital-
from the source population). Typically, all the based group and one community group. Hos-
outcomes from the study base (that is, cases) are pital-based controls should not have a condition
included in a case-control study. The controls are related to the disease or exposure being invest-
used to represent the probability of exposure in igated. Neighbourhood controls are examples of
the population at risk of becoming cases. community controls where there is an attempt to
A disadvantage of such case-control studies is control for socio-economic factors. When two
that the retrospectively gathered exposure in- separate control groups are used (this rarely hap-
formation may be inaccurate or biased (measure- pens), the cases and each control group are com-
ment bias). People with disease (cases) may pared separately and results are checked for con-
remember past events more clearly, or, on the sistency.
other hand, may deny certain past behaviours Matching is often used in case-control studies.
which their doctor had warned against. If pos- This means that for each case, one (or more)
68
7 study design
control with similar characteristics is selected. If (normal birthweight) are followed up. The disease
cases and controls are matched individually, a or outcome rates (for example, failure at school)
matched analysis has to be done, otherwise asso- within these two groups are then compared at the
ciations will be underestimated. An alternative to end of a predetermined follow-up period. The
individual matching is group matching, where cohort’s time spent under observation is the
controls are chosen to ensure that the control study base out of which cases arise.
group as a whole is similar to the group of cases The major disadvantages of cohort studies are
with respect to the matching variables. Thus the time and cost considerations. If the disease(s) of
proportion of males amongst the controls will, for interest take a long time to manifest, individuals
example, be the same as among the cases. The have to be followed up for a long period. The
variables that need to be matched are the ones longer the follow-up needed, the more likely you
that are possible confounders. Matching does not are to lose touch with participants, thus produc-
remove confounding, but it does ensure that ing bias in the study. At the start of the study,
numbers are equal in stratified analyses (see steps should be taken to minimize this loss to
Chapter 11). follow-up. One approach is to ask participants to
A case-control study can only investigate one provide the names and addresses of some close
disease, but many exposures can be considered. If friends and relatives through whom they could be
the exposures of interest are rare in the popula- traced. Health personnel can also be co-opted to
tion, however, a case-control study is inefficient. do frequent home visits. It is important to trace a
As stated before, it is an excellent approach if the sample of drop-outs to establish whether they
disease is rare. In recent years, however, the use represent a subgroup which differs from those
of case-control studies has been extended to the who do not drop out. Statistical methods, namely
study of common diseases. New case-control de- survival techniques, exist which enable one to use
signs (new in the way in which the controls are incomplete follow-up information in the calcula-
sampled) have also been described. The selection tion of risk of disease.
of cases for a case-control study can be made If adequate historical records (for example,
from prevalent or incident cases. If incident cases hospital, medical or university alumni databases)
are being recruited, then the controls can be se- exist, one can assemble exposed and non-
lected either after all cases have been recruited, or exposed groups from such records and follow
as each case is recruited. In the latter situation, a them up to the present to determine disease
person originally selected as a control can there- status. Historical records can also be used to
fore become a case at a later stage. Further discus- determine disease outcome. These are examples
sion of these designs, as well as the appropriate of historical cohort studies, in which data collec-
methods of analysis, are given by Rodrigues and tion is non-concurrent.
Kirkwood (1990). A synthesis of issues involved in
the selection of controls in case-control studies is
Experimental studies
given by Wacholder, McLaughlin, Silverman,
Mandel (1992). As outlined before, a study is experimental if the
researcher assigns the exposure to the subjects.
Cohort studies Such studies are also called intervention studies
These studies are also called follow-up studies since they evaluate some intervention aimed at
since healthy individuals with known exposure decreasing or preventing disease.
are followed up for a period of time during which Different intervention studies have different
disease status is determined. These studies can levels of scientific rigour. The main criteria by
investigate the cause of disease as well as the which experimental studies are judged are: 1)
natural history of disease. whether the intervention is compared to some
A cohort is a group which shares a certain char- other group (another intervention or a control
acteristic. For example, a birth cohort is a group group); and 2) the way in which people are alloc-
born during a certain period. Some of these will ated to experimental and control groups. Valid
have had a low birthweight (exposed) and others evaluation of an intervention is generally only
a normal birthweight (unexposed). In a cohort possible if the intervention is compared to some
study, a group of disease-free individuals who are other group. In addition, the researcher should
exposed to a certain risk factor (for example, low follow very strict rules in allocating participants,
birthweight) and a group which is not exposed to ensure that all individuals have the same
69
b epidemiological research methods and protocol development
a ee
chance of being allocated to a given group. patients are randomized to different treatment
Randomized controlled clinical trials (RCTs), (parallel design).
in which patients are assigned to different treat- In RCTs, patients are followed for a specified
ments, are regarded as the gold standard for period to determine outcome (for example, alive/
intervention studies. In a randomized controlled dead, recover/do not recover). The person obser-
clinical trial, a group of patients is selected and ving the outcome after the intervention should be
patients are then randomly allocated (random- unaware which treatment a patient received (so
ized) to different treatments, or treatment and that the observer is ‘blind’ to the exposure). The
placebo groups. (Please refer to Chapter 8 for fur- patient should be similarly unaware, so that nei-
ther discussion of the random _ principle.) ther the patient nor the observer can be biased by
Random allocation in two groups of 15 each is, for knowledge of the treatment received (double
example, done by drawing 15 random numbers blind), especially if the outcome is subjective,
between 1 and 30. These 15 patients are allocated such as the extent of pain experienced on a scale
to one group and the remainder to the other from 0 to 10.
group. Special rules can be followed (for example, The different treatments should be equally
block randomization) to ensure that after a given acceptable according to present knowledge, that
number of patients have been enrolled in the is, patients should not be subjected to an inter-
study, there will be equal numbers in each treat- vention that is known to be inferior or unsafe.
ment group. This is why it is important for an ethics committee
Randomization aims to ensure that patients on to scrutinize the study protocol. Other ethical
different treatments are comparable with respect considerations are that patients should give in-
to baseline characteristics, as well as known and formed consent, all adverse events (for example,
unknown risk factors. The possibility of con- headaches or nausea) should be noted, and it
founding, which can occur in analytical studies, is should be clearly specified when patients will be
thus reduced. For example, the most severely withdrawn from the study in the event of harmful
affected patients should not get one treatment effects being found, or a clear indication of the
and the milder cases another. To ensure that the superiority (or inferiority) of one of the inter-
groups are similar with respect to disease severity, ventions.
prognostic stratification can be used: randomiza- Randomized controlled trials are trials of effi-
tion is done in strata (subgroups) of disease sever- cacy (see Chapter 15: Health systems research). In
ity. It may be expected that people with differing cases where randomized controlled trials are not
characteristics (for example, males and females) possible because of ethical or logistic reasons,
will differ in their response to treatment. In such case-control and follow-up studies can be used
cases, it is important that patients should be to assess interventions. So, for example, case-
stratified by sex at the start of the study, and ran- control studies have been used to assess the effi-
domization then done within each sex. Stratifica- cacy of aspirin in reducing the risk of myocardial
tion can range from stratification in two large infarction.
strata to randomization of matched pairs. The Besides RCTs, experimental epidemiological
main purpose of stratification is to ensure that studies include field trials, in which healthy indi-
important confounders are distributed similarly viduals are assigned to different interventions
in the treatment and control groups. It is particu- aimed at prevention (for example, a new typhoid
larly important in trials with small samples where vaccine), and community intervention trials, in
randomization alone may not ensure this. which groups from the general population are
If the condition being studied is such that after assigned to interventions. Since the risk of devel-
treatment has been withdrawn, the patient re- oping disease may be very low amongst healthy
verts to the disease status he or she had before the individuals, more subjects may be needed for a
study, as in chronic conditions such as asthma, a field trial than for a clinical trial. Furthermore,
cross-over design can be used. Each participant individuals have to be visited in the community.
receives all treatments, in a randomly chosen se- Intervention studies on people in the community
quence, with wash-out periods in between, to are therefore more generally performed through
remove the effect of the previous treatment. community trials. So, for example, in the Coro-
Fewer patients are needed, but since all parti- nary Risk Factor Study (CORIS) conducted by the
cipants undergo all treatments, the study takes MRC, HSRC and Department of Health, three
longer to complete than in the design where southwestern Cape towns each received a differ-
70
7 ~~ study design
investigation. The value of ecologic studies is prescribed, health promotion activities, and
often underrated because of the possibility of referral to secondary and tertiary centres
ecologic fallacy. Within ecologic study designs, it @ Areview of non-clinical activities that impact
is possible to assess the extent of the fallacy and to on care
some extent adjust for this. Furthermore, certain @ Adescriptive survey of staff and patient atti-
studies can only be conducted as ecologic studies tudes towards the service and perceived needs
because the exposure and/or outcome can only
be assessed ecologically, for example, the rela- 3 Aprofile of childhood mortality in the area,
tionship between fluoride levels in water and including cause-specific mortality rates
dental caries. In other instances, ecologic level Descriptive study:
variables are more appropriate and accurate a) Where mortality statistics are well kept:
measures than individual level variables. For Use of routinely collected mortality and
example, the variable ‘poverty’ is an ecologic vari- demographic data to determine mortality
able demarcating a depressed socio-economic rates
neighbourhood, whereas ‘individual income’ b) Where no coherent system of mortality data
provides no information on the general socio- collection exists:
economic conditions of the environment that the @ Ahousehold survey where caregivers report
subject is exposed to. Studies of community- on children in their care and any childhood
based health promotion activities using mass deaths over the previous year
media often require ecologic studies.
4 The prevalence of malnutrition among child-
ren under five in the community
Conclusion Descriptive study:
The research topics given below represent some @ Ahousehold survey in which every child under
of the studies emerging from the problem explo- the age of five has a nutritional assessment
ration done in the previous chapter. Having read
the chapter on study design, you can now decide 5 Astudy describing knowledge, attitudes and
which design would best suit a given research practices of caregivers concerning diarrhoea
topic or question. and its treatment
We suggest that you go through the list on page Descriptive study:
61 and try to allocate a suitable design before @ Asample of caregivers is interviewed using a
reading our comments below. Please note that for questionnaire which determines their know-
some topics, there may be more than one ‘cor- ledge of diarrhoea and its treatment, their atti-
rect’ option and some alternative approaches are tudes, and also the treatments they administer
not described.
6 Acomparison of the childhood mortality in
Studies arising from the Sipho scenario communities with and without piped water
1 Asituational analysis of environmental condi- Ecologic study:
tions in the squatter camp (situational analyses @ This can be done by comparing mortality
often require multiple approaches) statistics from communities with and without
Descriptive study: piped water
@ A full assessment of environmental infrastruc-
ture done by environmental officers 7 Socio-economic risk factors for diarrhoea
@ Adescriptive sample survey of houses to Cohort study:
determine the environmental profile at house- @ Agroup of children are followed up fora
hold level period of time to see if they get diarrhoea. At
the end of the study, a comparison is made of
2 Asituational analysis of health facilities in the diarrhoea rates among children from homes
area with ‘low’ and ‘high’ socio-economic status
Descriptive study: Case-control study:
@ Areview of services offered, staff employed @ Socio-economic variables are compared be-
and expenditure tween cases of diarrhoea attending a PHC
¢ Acollection of descriptive information on clinic and a group of neighbourhood controls
clinic attendance, illness profile, medicines who do not have diarrhoea
72
7 ~~ study design
8 Maternal education as a risk factor for death treatment by mothers of children with diar-
among children who get diarrhoea rhoea. The CHW programme is introduced
Case-control study (hospital or clinic-based): and after a set period of time, various outcome
@ The education level of mothers of children measures (for example, maternal knowledge
who die from diarrhoea is compared to the of ORS, under-five mortality rate, incidence of
education level of mothers of children who get severe dehydration following diarrhoea) are
diarrhoea but do not die compared between the experimental and con-
trol villages
9 Acomparison of the diarrhoea mortality ina
community before and after the introduction of 11 To evaluate which method of rehydration is
piped water into that community most effective in a hospital setting
Before-after study (evaluating an intervention) Randomized controlled clinical trial:
¢@ ‘Children with severe dehydration admitted to
10 An evaluation of the effect ofa community the hospital are randomly allocated to two
health worker (CHW) programme focusing on treatment groups (there is no group that does
appropriate treatment by mothers of children not receive treatment). They are followed up
with diarrhoea and various outcome measures are taken
Community trial:
@ Villages are randomly selected to be recipients
of a CHW programme focusing on appropriate
73
Sampling
74
8 sampling
Random sampling
Table 8.1 Medical doctors listed
Random sampling (not to be confused with hap-
hazard or arbitrary sampling!) is a selection tool in the Cape Peninsula telephone
which can ensure that one’s sample is representa- ‘ directory
tive of the population. This is also known as prob-
ability sampling, since each individual in the Abbas MR
study population has a known chance (probabil- Abdurahman N
ity) of being included in the sample. While the Abdurahman Z
researcher controls the sampling process, she or Abel S,‘Opthalmic -—
he has no control over exactly which individuals Abels CS
are selected. In the end, whether an individual is Abrahams A
selected or not is determined by chance. Abrahams L
In the discussion that follows, it will be Abrahamse C GA, Ophthalmologist
assumed that information is required on indi- Ackovic K L, Gynaecologist
viduals. If, however, information on households Adam A
(water, sanitation, income, density) is needed, AdamOFAS, ree
random samples are similarly drawn from all Adam SA
households. Adam U
To be able to do random sampling, one needs a Adams AA
sampling frame. This is a list or some representa- Adams G, Gastro-enterologist
tion (for example, a map) of the study population, Adams MC -
either in terms of individuals or in terms of groups Adams S
of individuals (for example, households or Adendorff T W J
villages), depending on the specific type of etc.
random sampling used. The items on the sam-
pling frame are called sampling units. It is import-
ant that the sampling frame contains all the indi- in order to investigate their knowledge of emer-
viduals in the study population: a sample can only gency services, you would make a list of all
be representative if the original frame is com- doctors and give each doctor on the list a number
plete. Table 8.1 is an example of a section of a (Dr Abbas is number 1, Dr N Abdurahman
sampling frame consisting of medical doctors number 2, and so on). In the case of a simple
listed in the Cape Peninsula telephone directory. random sample of households having to be se-
It is often fairly difficult to get a good sampling lected in the mapped area, each house is alloc-
frame from which to sample: where would one get ated a number, as shown in Figure 8.1.
a sampling frame for selecting households in a To select those who are to be included in the
large squatter camp? In townships or peri-urban sample, one has to draw random numbers. This
areas, there are often no reliable maps to use as can be done by drawing numbers from a hat,
sampling frames. It is then necessary to explore using a table of random numbers (see Appendix
other possibilities, such as aerial photographs or 4), or by generating random numbers on the
hand-drawn maps to use as a sampling frame. computer. If one wishes to select a sample of 20
Figure 8.1 shows a hand-drawn map of part of from a population of 80, the individuals whose
Khayelitsha (Pick, Cooper, Klopper, Myers, Hoff- numbers correspond to the first 20 random num-
man, Kuhn 1990). bers drawn (between 1 and 80) will be included in
the sample (see Table 8.2).
Simple random sampling
In simple random sampling of individuals, the Stratified random sampling
sampling unit refers to an individual and each If you have some knowledge that strata (sub-
individual in the population has an equal chance groups) of the population differ with regard to the
of being selected for the sample. The researcher measurements being made, you would want
has to draw up a complete list of all individuals in these strata to be represented adequately in the
the population and each individual has to be sample. For example, new peri-urban squatters
given a number. So, in order to choose a simple may be at highest risk for disease. Although they
random sample of medical doctors in Cape Town form only a small proportion of the study popula-
75
b epidemiological research methods and protocol development
| J
Ke oy 8 ey=p sy S52 $53
xB
%
x
BayBreStone]
X% Y 5b o
<2
xn
uy
= XG
a DAM WITH
BAD SMELLIN
xy s
> aM
ed
ess
Dp
xD *KO
X~
PK
on VELISWAS
xIxd x PUB
NOMANB LA) &° bb/ woh es ee
STORE Ge de er ge %569
tion, they can be chosen as a distinct stratum. In proportional stratified sampling. So, in the exam-
the case of the medical doctors, some are general ple of the medical practitioners in Cape Town, if
practitioners and some are specialists, and they 20% of the doctors are specialists, then 20% of the
may have different levels of knowledge about sample must be drawn from the list of specialists.
emergency services. The population should The results of the specialists and the general prac-
therefore be divided into these mutually exclusive titioners are then combined to get the result of the
(that is, no individual should fit into more than group overall.
one stratum) and exhaustive (that is, each indi- However, equal numbers can be chosen from
vidual should be able to find a stratum to fit into) each stratum to facilitate comparisons between
strata. A simple random sample of individuals is strata. This is often needed when there are very
then selected from each stratum. This is called small strata, which, if chosen proportionately, will
stratified random sampling. not have enough sampled individuals to allow
It is thus necessary to have a full list of the indi- statistical comparison. If strata are not chosen
viduals in each stratum. Typical strata are age proportionately to their size in the study popula-
groups, sexes, geographical areas or social class tion, the analysis involved in obtaining overall
categories. Strata should be selected so that the results is more complex. The researcher has to
variation of the characteristic of interest between weigh stratum-specific results proportionately to
strata is maximized and variation within strata obtain overall results (see Appendix 1: Standard-
minimized. Thus there should be little difference ization).
with respect to the characteristic of interest Simple random sampling and stratified ran-
within a subgroup, and large differences between dom sampling are easy to apply in, for example, a
subgroups. school where full class lists can be obtained and
Often the number of individuals selected from grouped in strata according to sex, age or stand-
each stratum is proportional to the size of the ard. If, however, one wishes to study households
stratum in the study population. This is called in a metropolitan area, it may be cumbersome to
76
8 sampling
Os
@@OG+
bers that appear in the table, moving in the
chosen direction until you have selected 20 num-
bers which lie between 1 and 80.
Any number above 80, and numbers which roe)BB
have already been selected, are ignored.
@)
give each household a unique number to enable thus made in terms of time, personnel, and trans-
simple random sampling, and then to travel port.
through the area visiting the selected households This method is especially convenient if there is
which are dispersed all over the area. In such a list of clusters, but not of individuals in each
situations, cluster sampling may be considered. cluster. Many community-based studies are done
using cluster sampling. On a map, households in
Cluster sampling a metropolitan area can easily be grouped into
In cluster sampling, the study population is first clusters of, say, five households each, and clusters
divided into groups or bunches (clusters) of a selected randomly. In Chapter 22: Immunization
certain size. So, for example, the map could be coverage, examples of specific types of cluster
divided into city blocks and the doctors could be samples are given.
divided into clusters according to columns on a One of the most often-used forms of sampling
page in the telephone directory. Clusters and not is stratified cluster sampling. In a study (Yach,
individual units are then selected at random. Typ- Katzenellenbogen, Conradie 1987) which aimed
ical clusters are schools, city blocks or villages. to estimate the infant mortality rate in a rural area
After a random sample of the clusters has been of Ciskei, strata were defined in terms of village
selected, all individuals, or a random sample of type: urban, traditional, resettled and re-reset-
the individuals in the selected clusters, are tled. All villages were classified into one of the
included in the sample. strata and villages (the clusters) were then chosen
This method is convenient in studies involving randomly from each stratum. In small villages, all
large geographical areas. In rural research, households were included in the sample. Large
villages are often treated as clusters. A full list of villages were divided into clusters, some of which
villages is drawn up, a sample of villages is drawn were then randomly selected. Within a house-
at random and individuals within these villages hold, all women of the specified age group were
included in the sample. Considerable savings are studied. This is an example of multistage sam-
Hig
b epidemiological research methods and protocol development
pling: sampling takes place at various stages or depends on the number in the population and the
levels, for example, villages are first selected size of the sample. If one wishes to draw a system-
according to type of village, and thereafter clus- atic sample of 20 individuals from a population of
ters of households within a selected village are 80, the sampling interval is 80/20 = 4, that is every
chosen. fourth person will be selected. To determine the
Cluster sampling does have disadvantages. If random starting point one has to select a random
the characteristics being measured are random number within the first sampling interval, in this
(heterogeneous) within each cluster (for example, case between 1 and 4.
some children are immunized, some not) then It is important that the list or queue is not
the clusters are representative of the population. ordered according to some system or cyclical pat-
The results obtained from the cluster sample tern. If street blocks consist of five houses each, of
would be similar (precise and unbiased) to that which the two corner houses are bigger than the
from a simple random sample, and the cluster others, a systematic sample of every fifth house
sample would have been easier to study than a would not give a representative sample.
random sample. But if people in the same cluster Systematic sampling is particularly practical ifa
tend to be alike (for example, all are immunized) sample of patient records has to be selected.
and different from those in other clusters, then
the cluster sample is much less efficient than a
simple random sample, and results could be
Non-random sampling
biased. It is therefore suggested that the sample Various types of non-random (haphazard) sam-
size of a cluster sample should be at least double pling exist, for example, judgement or quota sam-
that of the corresponding sample size in a study pling. In judgement sampling, the researcher se-
using a simple random sample. Additionally, lects individuals based on his or her own judge-
many clusters with few individuals per cluster ment. Clearly, no matter how objective one tries
should be chosen, rather than few clusters con- to be in choosing a balanced sample, it is impos-
taining many individuals. In general, no fewer sible to say how representative such a sample is.
than 30 clusters should be selected. Furthermore, In quota sampling, the interviewer is given a
it is very important that the analysis takes the description of the types of people who should be
cluster design into account. (See Chapter 22: interviewed and the number needed of each type.
Immunization coverage.) Statistical techniques The selection of individuals is left to the inter-
for doing this adequately are still being devel- viewer and there is no attempt at randomness.
oped. Often consecutive persons are included in a
sample until a certain number has been studied,
Systematic sampling or patients seen over a certain period constitute
Systematic sampling is often done intuitively. A the sample. For example, to study the prevalence
teacher may choose the child in every third desk of major depression in a rural community, all
to go to see the school nurse, or a nursing man- patients attending the clinic in the community
ager may go through every fifth patient’s file to during one calendar month could be included in
see what kind of patients come toa clinic. the sample. Clearly those visiting the clinic may
Thus in systematic sampling, individuals are differ from those who do not visit the clinic. In
selected systematically from some list or order- addition, the prevalence of the disease during
ing, for example, a queue. In the example of the December may differ from that during July. If, to
medical practitioners, every tenth doctor may be address the problems of seasonal variation, it is
selected from the list. Similarly, every third house decided therefore to rather study a sample of
may be selected from the map. patients attending the clinic throughout the year,
The complete population need not be known including the first ten patients each morning, one
before one starts to select the sample. So, for has to consider whether the patients arriving first
example, you could select a random sample from in the morning may differ from those who come
the admissions to a hospital, as people are admit- later (for example, those who are employed prob-
ted. The procedure is random, since the starting ably visit the clinic before work, and may be less
point is selected at random. Each nth individual inclined to have depression than the unemployed
after the starting point is then selected systemat- who come later in the day). These are examples of
ically. bias (see below) to which non-random samples
If the population size is known, the strategy are particularly prone.
78
8 sampling
Non-random sampling is generally not recom- the spectrum one is interested in. Issues which
mended in quantitative studies, since the level of emerge in the interviews or discussion could be
representativeness is questionable and _ infer- quantified or explored further in (quantitative)
ences (generalizations) can thus not be made studies using representative samples.
about the study population. Most statistical pro-
cedures also assume that samples have been ran-
Sampling bias
domly drawn.
There are, however, situations where non- Bias occurs if the sample is not representative of
random sampling is useful and the only way open the study population, that is, if the individuals in
to the researcher. In studying rare diseases, it is the sample’ differ systematically from the study
often very difficult to locate sufficient cases in a population. For example, if you use volunteers as
community-based random sample. Thousands of respondents, you must realize that those who
people may have to be screened at great cost choose to volunteer may be people who are par-
before a handful of cases are found who can be ticularly interested in the research topic, perhaps
interviewed. Snowballing (or networking) is a more so than the study population from which
sampling technique whereby each case is asked they come. Similarly, hospital cases may be more
to name all other people he or she knows with the severely ill than cases in the community. Sackett
disease. By following these people up and obtain- (1979) has outlined a number of biases which may
ing further names from them, a sample is col- occur and it is useful for researchers to familiarize
lected. Similarly, members of a community could themselves with these.
be asked to name all the people they know with Bias can occur even if random sampling is
the disease. Clearly, this sample could be biased, used. If the sampling frame used is not a full list of
since the most isolated cases may not be men- the study population, one may obtain biased
tioned by anyone and thus not included. The results. For example, using telephone directory
cases could also form separate non-overlapping entries as the sampling frame for a city’s inhabit-
networks which would mean that networking ants would lead to a biased sample consisting
would lead only to cases belonging to one net- only of those people who have telephones and
work, which may be very different from the listed numbers. They would probably differ
others. demographically from those who do not have
A similar approach has also been used to reach telephones or prefer to have unlisted telephone
so-called ‘hidden’ populations: hidden because numbers.
their activities are clandestine, for example, intra- Even if the researcher has a complete list of the
venous drug users. Many studies on drug usage study population, it may be difficult to locate cer-
have been done by studying random samples of tain people for interviewing. If you conduct a
drug users who present at some institution (drug community survey by visiting homes during the
treatment programmes, criminal courts), but day, you are likely to find children, the elderly,
those not presenting could be vastly different and unemployed people at home. Special
from those who do. A sampling technique called attempts have to be made to reach the employed
targeted sampling has been developed to reach people who were selected to be in the sample, by
members of ‘hidden’ populations in the commu- going back in the evening or over a weekend. An
nity through a combination of techniques such as individual cannot be excluded from the sample
networking and key informants (Watters and just because he or she is not at home at the time of
Biernacki 1989). A study of sexual behaviour and the interviewer’s visit. Furthermore, such an indi-
condom use among prostitutes serving the mine vidual should not be replaced by a ‘similar’
hostels used snowball sampling to identify the person who happens to be available. The person
subjects. Since prostitution is illegal and a hidden who is at home in the afternoon may be very dif-
activity, this was the only feasible sampling ferent from the one who is not.
method (Jochelson, Mothibeli, Leger 1991). People do refuse to participate, however. Espe-
If the researcher wishes to form a focus group cially in postal surveys where respondents have to
for discussion or do in-depth interviews on a return completed questionnaires by mail, the
topic (see Chapter 19: Qualitative methodology), it initial response rate can be very low. In postal sur-
may be perfectly acceptable (and indeed desir- veys attempts (second and third mailings, incen-
able) not to select a random sample. Purposive
tives for participation) should be made to
sampling allows for selection of individuals from increase the response rate. In any study where the
79
b epidemiological research methods and protocol development
response is less than 100% (the ideal, but a value, a sample of 323 is needed. To be within
response rate of above 80% would also be accept- 10%, one would only need 81 subjects.
able) you should try to obtain demographic in- If the study sets out to determine the signi-
formation on the non-respondents or a random ficance of a difference between groups or the
sample of the non-respondents. In this way, you association between factors, the researcher has to
could investigate whether the respondents and specify how large a clinical difference or associa-
non-respondents differ in any substantial way, tion needs to be detected as statistically signi-
thus introducing bias in the results. ficant. For example, if a difference in blood pres-
sure of 10 mm Hg (clinical difference) has to be
detected as statistically significant, a smaller
Sample size sample will be required than if a difference of 5
When the researcher has decided to select a mm Hg has to be detected as statistically signi-
sample from the study population, the question is ficant. On the basis of knowledge of the subject
how big a sample should be selected. An unneces- matter, the size of a clinically important differ-
sarily large sample entails wastage. On the other ence has to be decided on. Ina large sample, a dif-
hand, if a study is done on a sample which is too ference of 2 mm Hg may be detected as statis-
small, the results produced will be neither useful tically significant, but this may not be a clinically
or conclusive, and a lot of time and money will significant difference. For further discussion on
have been wasted. For example, if a study is done clinical significance, statistical significance and
to determine the prevalence of asthma in a sample size, refer to Chapter 11.
community, a sample size of 10 might yield two Often time, staff and cost constraints are the
asthmatics, giving a result of 2/10 = 20%. Had one most important factors influencing the sample
additional asthmatic been identified, the percent- size chosen. The researcher can only complete a
age would have been 30%. Likewise, if one less sample if the size is practically feasible. The size of
had been identified, the percentage would have the study population and the duration of each
been 10%. We can see that these results are very interview or examination must be taken into
unstable — one extra case swings the percentage account to determine what is practically feasible.
tremendously. If we had a sample of size 200, 40 It is usually necessary to get the help ofa statis-
cases would have yielded the 20% prevalence, but tician or epidemiologist in order to calculate
one or two extra cases (41/200 or 42/200) would the formula-based sample size, and to discuss
hardly have affected the percentage. Thus the whether the study is worth doing if the practically
larger sample size has yielded a result that is feasible size differs from the theoretically calcu-
much more stable and reliable. lated one. Most formulae for calculating sample
Formulae exist for calculating the ‘correct’ size assume that simple random sampling will be
sample size needed, but the researcher has to be used. You should therefore discuss with the statis-
able to provide some information before these tician or epidemiologist which sampling strategy
formulae can be applied. If the study sets out to will be used (for example, cluster sampling), so
estimate the prevalence of a certain characteris- that the sample size can be adjusted.
tic, the researcher has to have an idea of what the Many studies set out to calculate the preva-
prevalence is likely to be (2%, 20% or 50%). If the lence or level not of one variable but of many. For
aim is to estimate the level of a certain variable, example, in a community survey to investigate
for example, blood pressure, you have to consider various possible risk factors for heart disease
the expected level and variability of the charac- (smoking, alcohol usage, obesity, cholesterol
teristic. Similarly, if the difference in risk in two levels) none of the risk factors can be said to be
groups is to be estimated, some idea of the ex- the one of prime interest. So in this case, the for-
pected difference is needed. Such information mulae based on the level and variability ofa single
can be obtained from a pilot study or from the variable are not useful. A statistician or epidemio-
published results of previous studies. In addition, logist should be consulted to help decide on a
you have to consider how precise or stable you sample size which would give some meaningful
want the sample estimates to be. The more stable results.
the estimates must be, the larger the sample size To obtain good estimates for prevalences in
needed. For example, if you expect the prevalence subgroups (for example, of obesity in the age
to be 30%, and would like to be 95% sure that the categories 15-24 and 25-34 years), adequate sam-
sample estimate is within 5% of the population ple sizes are needed in each subgroup. A sample
80
8 sampling
whose size was calculated to give an overall crude To determine which percentage of patients were
estimate will not yield precise subgroup-specific referred from other levels of the health service
estimates. and which percentage could have been treated at
a primary care facility, patients’ records had to be
Conclusion studied. Since the study population is so large, a
sample of records had to be drawn.
Sampling is done in order to study a subgroup ofa The sampling strategy used was as follows. The
population, while still being able to make general- period from March 1988 to February 1989 was
izations that apply to the population from which divided into quarters. One month was randomly
the group was drawn. To minimize the chances of selected from each quarter. From each selected
unrepresentativeness, probability sampling is month, a random sample of days was selected.
used. This requires a sampling frame and a proce- Thereafter, a 20% systematic sample of records
dure which provides randomness in the selection. (every fifth record) of patients attending for the
Often a simple procedure will suffice, but in other first time were chosen. In this way, bias due to
cases, more complex sampling strategies are seasonal, day of the week, or time of the day vari-
required, and a statistician should be consulted. ation was minimized, using a multistaged sam-
pling strategy with stratification, simple random
and systematic sampling.
EXAMPLE 8.1
Sampling strategy to eliminate bias Reference:
Rutkove S B, Abdool Karim S S, Loening WE. ‘Patterns of
Each year, approximately 35 000 patients attend care in an overburdened tertiary hospital outpatients
the Paediatric Outpatient Department at King department.’ South African Medical Journal 1990; 77:476-8.
Edward VIII Hospital in Durban for the first time.
81
Data collection
and measure-
ment
82
9 datacollection and measurement
tionnaire. In all these circumstances, the ques- chooses, for example, ‘What is your opinion on
tionnaires have to be extremely clear and well laid the use of oral contraception?’ The alternative is
out, since untrained people will be completing to have closed questions where the answer is
them. either ‘yes’ or ‘no’, anumber, or a choice of one of
In structured interviews, the interviewers a group of predetermined answer categories.
follow a well-defined structure to prevent them Closed questions encourage quicker, more
from placing their own interpretation on the standardized data collection, but may limit
question. Questions are asked in the same way, responses or inhibit the respondent. At times, a
with the same probes and clarifications, while question may be asked in an open-ended way,
recording is also uniform. but the answer may be placed into a predeter-
The reliability of the information obtained mined category when recording the response to
(stability on repeated measurement) increases the question.
with objectivity and standardization. For example:
In an unstructured interview, an interviewer
has a set of guide questions or themes that must Question: ‘What is your opinion about the use of
be covered during the interview. The sequence, oral contraceptives?’
wording and approach used depends on the Actual response: ‘They disturb the hormones in
interview situation. The respondent is thus given one’s body and don’t work.’
much more freedom to express thoughts and Recorded response:
opinions. The interviewer tries to achieve the (i) approved (ii) disapproved; (iii) indifferent.
study aims without imposing a structure on the
respondent. Reliability decreases with the sub- Questionnaires often have a mixture of open- and
jectivity of the unstructured interview, and thus closed-ended questions. Respondents often have
interviewers need to be more skilled and expe- to indicate on a scale of 3, 4, 5 or 6 (called a Likert
rienced for these types of interviews. (See Chapter Scale) what their attitudes towards some issues
19: Qualitative methodology.) are.
Each method of questioning has advantages For example:
and disadvantages, and the choice of the method
for the project must be practical and appropriate Please indicate your opinion on the following
for that project. Table 9.1 outlines some advan- statement by circling the appropriate number
tages ‘and disadvantages of some of the methods along the line.
discussed above.
A cure will be found for AIDS within the next five
Steps in questionnaire development years.
It is necessary to allow adequate time to develop a Strongly Strongly
questionnaire methodically and thoroughly. The Agree Disagree
following steps provide a guide for questionnaire 1 2 3 4 5 6
development.
In general, it is advisable to have an even-
Step 1: List the variables to be measured numbered scale to avoid excessive selection of
One should aim for a well-conceived, concise the middle value. The greater the number of
questionnaire. Variables to be collected should be values provided, the greater the tendency among
within the scope of the study. Make a short list of respondents to avoid extreme values. Thus it is
variables based on the study objectives stated in not advisable to offer too wide a scale.
the protocol. Many researchers do not analyse all
their information. Ensure that only essential (b) Develop questions which will best elicit the
issues are covered. Unnecessary or unrelated variables to be measured.
questions which create the impression ofa ‘fish- At times, one variable may need a series of two
ing trip’ should be avoided. or three questions to capture the true characteris-
tic. This is often true of a variable that has. a
Step 2: Formulate the questions number of components or facets (for example,
(a) Decide on the questions. socio-economic status) or of a complex variable
Questions can be open-ended where the comprising a few variables (for example, house-
respondent replies in whatever way she or he hold density is the number of individuals in a
83
b epidemiological research methods and protocol development
Advantages Disadvantages
Survey of records:
@ little expense collecting the data format cumbersome as data cellected for
non-research reasons
can be quick (although time often ¢ variables inconsistently defined and recorded
underestimated) by different people at different times
Individual interviews
can be done when respondents have low ¢ interpersonal dynamics may interfere with
literacy and cannot fill in self-administered data collection (e.g. suspicion)
forms
can collect data from people who are interviewer variation affects reliability
otherwise not reachable (e.g. people
from rural or informal areas)
Telephone surveys
@ relatively cheap (no transport and travel @ sampling bias: excludes people with unlisted
time costs) numbers or no phone at all
@ can cover wide area: no need to move @ nonverbal cues absent: affects
between locations communication
Mailed questionnaires
relatively cheap and can cover wide @ generally response rates very low (<30%)
geographical area
84
9 datacollection and measurement
85
b epidemiological research methods and protocol development
sry it gene Al ine 9 me A aa A EA AS AO AI A Re
There is still the danger that someone will data collection process. Whether the question-
give a negative answer when they should not naire is done by interview or by self-administra-
have. In that case, the interviewer can ask from tion, try to make it as clear as possible so that
alist prompting the respondent. The answers errors can be minimized. Here are some guide-
are divided into categories which make it clear lines:
which responses were prompted and which type the questionnaire whenever possible
were not. So, using our example of contra- @ ensure good visual spacing (double spacing
ceptive use: may make the questionnaire too cumbersome,
but may be possible if the print is reduced)
Have you ever used any type of birth control or @ use highlights if needed
family planning? @ indent intelligently
@ print without smudges
Have you ever used any of the following: @ make neat blocks (on computer or using a sten-
unprompted |prompted cil) for ticking answers or filling in numbers
_ Birth control pills yes/no yes/no @ different colour paper may be used for the dif-
(oral contraceptives/ ferent sections of the interview, for example,
‘the pill’) household information on white, demographic
Injections information on green, and environmental in-
Intra-uterine device
won formation on blue paper
(IUD, loop, coil) @ visual markers such as arrows and blocks can
Diaphragm make administering the questionnaire easier.
Foam/jelly/cream
Condom Step 6: Consider the scale of measurement of the
NOSterilization variables
(tubes tied) In considering the process and means of
o Partner’s vasectomy measurement, an awareness of the scale of
9 Other (for example, measurement of the variables (see Chapter 11)
withdrawal) helps the researcher to plan adequately for the
Specify. analysis stage. For example, when collecting in-
formation on heights, measures are done and
It is important to train interviewers to use recorded. Here the observations have numeric
prompts in a uniform way. Record on the ques- meaning, and an average can be calculated. How-
tionnaire whether responses were prompted or ever, when collecting information on blood type,
not. each observation is categorized as O, A, B, or AB,
@ At times, a question may require that the inter- and the number of observations in each category
viewee be shown an example or picture to is counted. Here we cannot find an ‘average’
choose from. Decide when this is necessary and blood type: we have to calculate the percentage of
howit will be done. observations in each category. An understanding
of the scales of measurement is very important, as
Step 4: Determine the sequence of the questions different analyses are suitable for different types
The sequence of the questions must be carefully of variables.
planned to ensure logical flow. Basic demo-
graphic information is usually unthreatening and Step 7: Consider what coding needs to be done
is often asked first. Sensitive questions are usually once the data has been collected
put later in the questionnaire, so that some trust Before the raw information can be analysed
has been built up by the time they are asked. In quantitatively it must be prepared for analysis.
addition, if subjects refuse to answer these sensi- The process by which the information is con-
tive questions, the interviewer has at least col- verted into numbers or categories is called
lected some information. coding. So, for example, if male = 1 and female = 2,
all the respondents who are male are given a
Step 5: Plan the layout and design of the ques- number 1 next to the variable ‘gender’. Note that
tionnaire for the variable ‘gender’ the code ‘1’ is merely a
The visual presentation of the questionnaire is an category, but for the variable ‘age’ the code ‘1’
important aspect which can influence the whole would have numeric meaning.
86
9 datacollection and measurement
SAISOd QAIISOd
e
aAleHaN e 9AINSOd
OAeBoN e QAIISOd
e
aaneban ¢ BAIISOd
¢
aaneban
ISSLNINS
©
JILSONSVIC ADOIOLSIH
«
AWY-X
LSSHO
¢
|EINI|d
UOISNJ9
© JeIIH
Auyyedoupe
SNOILVDILSSANI ynpy
ad aAlssalHOld
Wid
e
aseasip LOAYIG
WALAdS
AYNLINIWNALAdS
e
DIAGNOSTIC CRITERIA
PLACE OF SUPERVISION AT
END OF TREATMENT MOVEMENT OF PATIENT THROUGH TREATMENT SERVICES
Referred by to date
SPUTUM CULTURE
AT 60 DOSE
School
Positive
Domiciliary (sisters) Ne
ia
itce Negative
So
cc
e
cr
rts COMPLIANCE
REASON FOR EXIT cst
oOWw
ec ON EXIT
Cured TB ke
a LNIWLVIYL
STIWLIG
Less than 75%
i =
2
oF 75% or more
Absconded, etc
Transferred out of health region END COMPLIANCE (except fortransfer within health region)
OUTCOME
transferred
stopped
Rx
Date
or health
region
of
out
no. of doses 100
possible no. of Rx days
ogy develops, mainframe computers are being re- If the data analysis of a project is being done
placed by more efficient, smaller machines. within a big institution, special computer
Personal computers (PCs) are (generally) small punchers will punch the information onto the
computers which are suitable for personal use or mainframe. Preparation of the data for computer-
departmental use. While very large data sets ization is important and specific procedures have
cannot be accommodated by PCs, many software to be adhered to. The variables for each individual
programmes are now available which enable need to be coded in such a way that the computer
researchers to do sophisticated analyses on these can receive the information. Usually the answers
machines. A suitable data capture system (for are given numeric codes (for example, male = 1
example, Lotus, dBASE, Epi Info) needs to be and female = 2). There must be blocks for coding
decided upon. The questionnaire must be de- along the right-hand margin of the questionnaire.
signed to ensure easy transfer of information by Only coded information may be placed in this
the researcher from the questionnaire onto the space (usually marked ‘For official use only’) and
PC. this information is then typed into the computer
88
9 datacollection and measurement
by the punchers. Each coding block has a unique ones. Open-ended questions will generate a
number. The coding blocks start from 1 and are variety of responses, which can be used to deter-
numbered consecutively. The coding block num- mine the options for the main study or the codes
bers allow the punchers to find their way around to be used. Common responses will be chosen as
the questionnaire and allow the computer to read the categories that will appear on the final ques-
in specific blocks as specific variables. For exam- tionnaire or coding sheet. Logistical issues, such
ple, the data in block 1 might be allocated for area as the time taken and the suitability of categories
of residence, blocks 2 and 3 for the age of the should also be recorded.
respondent, and so on. At the end of the pilot interview, the responder
Micro hand-held computers are compact port- might be asked how he or she felt about being
able computers which allow for entry cf data at asked questions in this way. Comments on the
the point of data collection. Values are punched timing and the situation should be welcomed.
in by the interviewer, circumventing the need for Suth feedback could be crucial for the main
filling in questionnaires and coding. This method study.
also allows for the rapid analysis of data so that
interim results are available immediately, and the Step 10: Make necessary changes
monitoring of fieldwork is possible. Possible tech- After the pilot study, the interviewers should be
nical problems such as power or battery failure fully debriefed and comments noted. As many of
must be kept in mind and planned for, otherwise the criticisms as possible should be addressed, so
the data collection can become a nightmare. that the final questionnaire benefits fully from the
feedback.
Step 9: Pilot (‘pretest’) the questionnaire The training of the interviewers for the main
A pilot study is a test run of aspects of the main study should also help as a final pilot. Inter-
study. In developing a questionnaire, a series of viewers often pick up problems and should feel
small pilot studies are needed to refine the instru- part of the questionnaire design. In order to bene-
ment. fit from their input, make sure that there is
Pilot studies require an in-depth look at the enough time between the interviewer training
questionnaire with the aim of improving its qua!- and the start of the study to make the final neces-
ity. Usually only a few subjects are chosen per sary changes.
pilot study, for example, 5 to 20 individuals. Remember to check and recheck coding block
In the early stages of developing the question- numbers. A miscount can cause havoc when the
naire, rudimentary questions can be asked of col- data is being punched into the computer.
leagues and friends in order to investigate the
wording and the clarity of the questionnaire.
Glaring problems will be picked up at this stage.
Quality of the data collected
Later pilot studies should be addressed to If the results of an epidemiological study are to be
groups which are increasingly similar to the target sound, the quality of the information collected
population. For example, if the study hopes to needs to be good. How cana researcher evaluate
determine the immunization coverage of chil- the quality of the information?
dren under five years in a particular community, The first, rather intuitive, way of checking data
the pilot studies could take place at the paediatric quality is to look at indicators of the context in
outpatient department of a teaching hospital, which the measurement process took place. The
then at a day hospital covering a similar popula- second is a more formal evaluation of measure-
tion, and lastly using a small community sample. ment error.
At first the researcher must be active in the pilot,
but later the questionnaire should be adminis- Indicators of the context in which measure-
tered by interviewers similar to those to be used ment takes place
in the main study. Before full analysis of the data, it is useful to ex-
During the pilot study, the interviewer should tract information which can give a sense of how
record words and sentences that are not under- and from whom data were obtained. Three types
stood and questions that require prompting or of data are useful for this purpose.
explanation. There should be space on the ques-
tionnaire to write down the respondents’ reac- 1 Data indicating how much data is missing
tions to certain questions, especially sensitive Data may be missing because of failure to locate
89
b epidemiological research methods and protocol development
ee
he 8 Eee
3 Data on the setting in which the information Improving and evaluating reliability
was collected Variation between measures (that is, poor reli-
The environment in which data collection takes ability) can be decreased by addressing the
place can influence the data. It may be important source of the variation.
to note the time, place, and even the temperature, (i) The instrument variation (giving different
as such details may influence the quality of the results for the same individuals) can be
data. It may also be possible for the interviewer to reduced by standardization and calibration of
record a subjective assessment of the interview the instrument. The quality of the instrument
and the quality of the information provided. or questionnaire must be improved. (See
Appendix 2: Standardization of observers and
Formal evaluation of measurement error: instruments.)
reliability and validity (ii) Observer variation can stem from differences
Data is only as good as the measurement instru- among observers or interviewers (inter-
ments used to measure the characteristics. observer variation), as well as differences in
Ideally, we would like to measure ‘the truth’ every the same observer or interviewer on separate
time we measure the characteristics. Any devia- occasions (intra-observer variation).
tion from this constitutes measurement error. These variations between measures can be
Measures are usually considered in terms of reduced by:
their reliability and validity. @ setting exact ways of measuring (standardiza-
Reliability refers to the degree of similarity of tion of measurement or interview)
the information obtained when the measurement @ intensive training periods for all observers and
is repeated on the same subject or the same interviewers
group. Is the same value arrived at every time the supervision and periodic checks on their work
measurement is taken, or do the values vary a lot selection of observers and interviewers along
on repeated administration? similar criteria with reference to age, sex,
Validity refers to the extent to which a measure educational level, and other relevant character-
actually measures what it is meant to measure. istics.
The measure lacks validity if an observer or (iii) Subject variations may be due to biological
instrument measures the characteristic in the inconsistencies (for example, blood pressure
90
9 datacollection and measurement
does not stay the same all day) or inconsisten- can be evaluated by repeating measures of a sub-
cies of memory. Repeated measures make it sample of the study sample, or (in the case of ques-
possible to assess and adjust for biological tionnaires) by asking related questions which, if in
variation. disagreement, will show inconsistencies.
Instrument, observer, and subject variations Statistics used to assess the agreement on
91
b epidemiological research methods and protocol development
i a
E I e
Positive Negative
Positive
Results
yielded by
instrument
being assessed
Negative
A
Sensitivity: (true positive rate)= (A+C)
Percentage of positive test results out of all true positives
BBP
one
Specificity: (true negative rate) = (B+D)
Percentage of negative test results out of all true negatives
A
Positive predictive value = (A+B)
The percentage of positive test results that are truly positive
eet
Negative predictive value = (C+D)
The percentage of negative test results which are truly negative
92
9 datacollection and measurement
assessed, the investigator will determine to what approach is that the researcher understands what
extent ‘family ownership of a motor car’ is associ- is on the questionnaire and can monitor the
ated with school performance. answers. However, it is very important that the
Inconsistent validity may refer to a measure translation is standardized between interviewers
which is valid for one population or group, but during the training sessions. This method is not
might be different in different groups, popula- suitable for in-depth and semi-structured inter-
tions, and environments. Thus questionnaires or views, where exact replies must be recorded and
tests designed for a particular population in the interpreted by the researcher.
northern hemisphere might be unsuitable and in- The second option is for fully translated ques-
valid for populations in developing communities. tionnaires to be used in the interviews. The initial
Questionnaires are particularly vulnerable to translation should be done by someone fluent in
measurement bias, and numerous types have the language, who can judge whether certain
been described in the literature (Sackett 1979). plirases are understandable and acceptable. This
Please refer to Chapter 18: Knowledge, attitude, translation should be proofread by a second
beliefand practice studies. person. The next step is for someone else to re-
translate the questionnaire back into the original
Other issues related to data language. If the re-translation corresponds well to
collection the original questionnaire, the translation is
accepted as adequate. Please note that linguists
Ethics are not necessarily the best people to translate
It is important to get permission from partici- questionnaires. Often health workers who work
pants to be interviewed, after you have explained with people similar to the respondents will have a
the study and method of interview to them. This is better feel for what can be understood by the
called informed consent. person on the street.
Ethically, researchers are required to protect
the identity of study participants. Ideally inform- Interviewers
ation should remain anonymous, that is, names Interviewers should be carefully selected (on the
should not be recorded at all, not even on the basis of, for example, educational level, sex, expe-
questionnaire. However, often some record of the rience), well trained, well supervised, and ade-
respondents’ names needs to be kept so that quately paid. It is important that similar types of
follow-up or even referral is possible. If it is people collect similar information, so that inter-
planned that all participants will be informed of views are as similar as possible. This improves the
their personal results (for example, blood results) reliability of information.
or the overall study results, names and addresses One study might employ different types of
are clearly needed. In such cases, it is ethically people to do different tasks in the data collection
acceptable that information remains confiden- process. A registered nurse might measure
tial, which means that information that can be heights, weights, and blood pressures while a
traced to a particular individual is available only middle-aged woman with high school education
to the researchers. might do the interviews. Yet another person, well
known in the community and with good interper-
Translation sonal skills, might inform people ahead of time of
Clearly, the best way to interview people is to ad- the study and motivate them to participate. (See
dress them in their own language. If the question- Chapter 17: Example 17.2.)
naire is to be used on people who speak a lang-
uage other than that of the researcher, translation Training
will be an issue. Training of staff involved in the data collection is
Interviewers should be fluent in the language of fundamental to good quality information. It is
interview. One option is for the questionnaire well worth spending a few days training inter-
itself to be in a language that the researcher viewers, observers, and motivators so that every-
understands (for example, English), but for the one knows exactly what is expected of them.
interview to be conducted in another language. In During the training period, the nature of the
such a case, the interviewer translates the ques- research, the reason why the research is being
tionnaire, and thus has to be fluent in the lang- done, and the objectives of the study should be
uage of the questionnaire. The advantage of this carefully explained before the tasks are discussed
93
b epidemiological research methods and protocol development
in detail. Cohesion among the team should be forms, and checking that the data was not fudged
facilitated and rules, conditions of employment, all have to be considered. From time to time, the
and other specific instructions explicitly dis- fieldworkers may have to meet with the super-
cussed. visor or co-ordinator to discuss important issues.
There should be clear lines of accountability
and the nature of the supervision should be ex-
Conclusion
plained. For example, the interviewers should
know that they will receive instructions from the Whether data is collected in the community (see
field co-ordinator or supervisor, who will also Chapter 17), the workplace (see Chapter 24), the
check their work for errors. health care setting (see Chapter 15), or by post or
telephone, the measurement tools need to be
Field supervision carefully considered to ensure good quality data.
There should be a clear plan of how supervision The validity and reliability of the data should be
will be done in the field. Site visits, checking of evaluated when describing the results.
94
10 Other issues to
consider in the
protocol
Data management and analysis analysis is planned and available (see Chapter 22:
Immunization coverage).
The proposed analysis of the data should form an The decisions what to measure and how to
integral part of the planning stage of the research measure it are closely linked to the analysis. In
project. On a practical and technical level, the drawing up a questionnaire, you have to consider:
researcher has to decide beforehand who will be how will I analyse this question? At this stage, you
analysing the data, and this person should be should draw up dummy tables (or table shells).
involved from the start. Where and how will the These are tables which indicate how you intend
data will be analysed? Will it be necessary to buy a to represent your data. There should be tables
PC or a special analysis programme (and learn describing sample characteristics (for example,
how to use these)? Will you have to approach pro- age/sex), as well as associations between vari-
fessional biostatisticians, for example, at a uni- ables. These tables should clarify whether you
versity or research institution? Funds may be have included questions on all the factors needed
needed for the analysis (for buying special equip- for the proposed analysis, or whether many
ment or paying for the services of a biostatisti- redundant questions are asked.
cian), and these have to be budgeted for. In set- Once the data has been collected, you can start
ting up a time schedule for the project, you by completing these dummy tables; this gives a
should include a reasonable period for analysis, good starting point for analysis. Researchers who
and not think that the time-consuming part of the have not thought about their analysis throughout
project is completed once the data has been their project planning often find that once the
collected. data has been collected, they do not know what to
More fundamentally, however, the analysis do with it. A project which proceeded well
forms part of each step of the study. It is not just through its field-work stage can grind to a halt at
the final step before write-up. When defining the the analysis stage if limited thought has been
study objectives, you have to decide how to given to the analysis. Dumping the collected data
analyse the data in order to reach the objectives. on a Statistician who has not been involved in the
Is such an analysis at all possible, or would you rest of the project does not solve this problem,
have to reformulate your objectives? It is thus since the researcher is the one who has to tell the
necessary to express objectives in measurable statistician what questions he or she would like to
terms. have answered.
When doing the literature review, you should Sometimes during the analysis phase, the
note the methods of analysis used by other _researcher discovers that some of the intended
researchers. When deciding on study type and analyses cannot be performed (for example,
sampling scheme, you have to consider whether because there is too much missing data or too few
methods are available for the analysis needed. For people with a certain characteristic). This can be
example, if a complex sampling scheme is used, minimized by carefully considering the analysis
you must be sure that the appropriate means of beforehand. Similarly, you may find new associa-
95
b epidemiological research methods and protocol development
‘Table 10.1 Budget for AIDS study — 1 March 1994 to 28 February 1995
A Recurrent expenses
Supplies 15 500
Questionnaires (x 1 000)
Blood collection kits (x 1 000)
Stationery and photocopying
Emergency first aid kit
Travel 37 800
Lease of vehicle (x 12 months)
Vehicle maintenance (40c per km x 18 000 kms)
Use of private vehicle (R1,10 per km x 6 000 kms)
General 3 000
Cost of reprints 1000
Attending local conference 2 000
B Capital expenses
96
10 other issues to consider in the protocol
Report writing and scope of the study are stated. The study is
located in terms of reviewed literature.
Many research projects are never written up as 3 The Study population and Methods section
reports or journal articles. The knowledge gained gives details of the study methods and should
from both the results and the methods is thus not
include:
passed on to others. This is obviously a waste of
study design
resources. All researchers should be encouraged
study population
to make their project available for others (other how the sample was selected
researchers, service providers, and decision-
response rate
makers) to read. This should be done even if the
@ characteristics measured
project was a ‘failure’, for example, if sampling @ methods of data collection
biases were encountered or a poor response was
@ quality of the data
achieved. A description of problems encountered
@ how the data was analysed.
will help other researchers to avoid such prob- 4 The Results section includes written descrip-
lems. tions of the findings, as well as tables and charts
No project can be considered finished before it
to clarify the text. The results of statistical tests
has been written up. The write-up could be in the
should be given.
form of a scientific article, a comprehensive All tables and graphs should be clearly titled,
working paper containing all details, or a special
and totals and figures should make sense.
report to decision-makers/service providers. Everything in the tables and figures should be
Results should also be reported back to inter-
relevant to the topic.
viewers and research participants in an accessible
ao The Discussion provides the chance for the
form. writer to interpret the findings, air opinions,
compare the results with other findings, and
The qualities of a good report point out the shortcomings of the study.
@ The title should clearly explain what the report
6 The Conclusion rounds off the report and gives
is about. It should attract potential readers.
some indication of the scientific and practical
The summary or abstract should inform the
implications of the study.
potential reader in brief what the aims,
7 References should always be included.
methods, population, and results were.
@ The style of the report should be easily under-
standable. Language should be clear and The report has a similar structure to the protocol.
simple, while the structure should have a If the protocol has been prepared carefully, the
logical flow (outlined below). Graphics and writing up flows easily. The results and the dis-
tables should be clear and informative. cussion are the sections which need new effort.
@ The report should be concise with no unneces-
sary information. Digression and waffling have
Conclusion
no place in a good report.
From the above, it is clear that many administra-
The logical structure of areport tive issues need to be considered in the protocol,
1 The Abstract is asummary written after the rest in addition to the methodological issues. These
of the report has been written. administrative issues should receive close atten-
2 The Introduction puts the reader in the picture. tion, since they could sabotage an otherwise well
The motivation for the study and the objectives planned project.
98
Data presentation, analysis,
and interpretation
102
11 anintroduction to data presentation, analysis, and interpretation
8
9 TS
eee
StS
10 ss
22
4
<2
2k <
x2
ZeaxxZ2eix
classified as missing. The aim is to eliminate the data (for example, about the distribution of
errors, not fudge the data. Respondents may give the data). Through graphical display you can
conflicting responses when asked their opinions determine whether your data satisfies these
or attitudes. These are not data errors and cannot assumptions and thus whether the intended
be ‘cleaned’. (If extreme values which are not im- statistical analyses are appropriate. Graphical
plausible occur and cannot be checked for some display makes the data accessible to the re-
reason, statistical techniques which are robust searcher, as well as to the readers when results are
against extreme values can be used in the ana- published. Patterns or relationships detected
lysis. See Summarizing the sample data, below.) through, or questions raised by, exploratory data
Graphical display of the data is described below analysis can be evaluated by formal statistical
and is also useful in the data checking phase. methods (also called confirmatory data analysis).
If data is entered directly onto a PC in the field,
the ‘data entering programme (for example, Epi
Info) should be set up in such a way that implaus- Graphical display of numerical variables
ible values can be discovered immediately. So, for The aim of displaying a numerical variable is to
example, one should specify a plausible range for investigate certain characteristics of the data,
a continuous variable such as height. If a value namely where the central value lies, and what the
which falls outside this range is typed in, the pro- spread and shape (distribution) of the data are.
gramme alerts the data typist that a strange value The distribution shows how many observations
has been entered, and must be corrected. have a given value or lie within a certain range of
Table 11.2 gives a listing of the information values out of all the possible values. As will be dis-
regarding age, sex and certain aspects of smoking cussed later, the shape of the distribution is
history of 10 subjects included in a study of important in deciding on appropriate summary
community members aged 20 years and older. statistics and statistical analyses. A distribution
How many problems can you identify? has a symmetric shape if approximately the same
number of observations lie above and below the
centre value of the distribution. This will be illus-
Exploratory data analysis trated later, using various types of graphical dis-
Before any formal statistical analysis is done, the plays (Figure 11.2).
data (especially numerical variables) should be The following small data sets will be used to
explored through graphical display (also known illustrate basic concepts.
as exploratory data analysis). Such an exploration i) Inahypertension study, diastolic blood pres-
helps the researcher get to know the data. Errors sure was measured on a sample of 16 respond-
or strange values, and patterns and relationships ents. Their values were:
in the data can be detected. Furthermore, many 75, 84, 80, 97, 105, 188, 64, 78, 68, 86, 79, 105, 89,
statistical procedures make assumptions about 88, 93, and 92.
103
ce data presentation, analysis, and interpretation
Skewed to high
values
low
Skewed to low
values
Symmetric =
; could be difficult to
Bimodal C detect in box plot
ii) Two groups of children were scored on apsy- From: Joubert G, Schall R. ‘Some statistical con-
chological test. The scores were (in order): cepts.’ In Kibel M A and Wagstaff L A (eds). Child
Group 1 Group 2 health for all: A manual for Southern Africa, Cape
(n= 24) (n=25) Town: Oxford University Press, 1995.
38 46 46 50 52 41 41 414146
52 55 57 59 62 46 46 48 52 52 Stem and leaf plot
63 63 64 65 69 54 54 58 60 60 In a stem and leaf plot, each observation of a
6971717475 6171747680 numerical variable is plotted. All possible leading
79 81 87 94 84 85 93 104 148 values form the stem, and each data value is
104
11 = anintroduction to data presentation, analysis, and interpretation
represented by writing its trailing digit in the detecting distributional differences between
appropriate row next to the stem, in this way groups. The back-to-back stem and leaf plot of
forming the leaves. Before the plot can be drawn, the psychological data is depicted in Figure 11.3.
the data are ordered from small to large. The value 148 in Group 2 seems to be an outlier
In the blood pressure example, the ordered and should be checked. From the stem and leaf
data set is: plot, it is clear that the values in Group 1 followa
64, 68, 75, 78, 79, 80, 84, 86, 88, 89, 92, 93, 97, 105, symmetric distribution, but the values in Group 2
105, 188 are not symmetric (also called skew), as most of
the values are low.
Since the values range from 64 to 188, the leading
digits which are to form the stem are 6, 7, 8, 9, 10, Box plot
11, 12, 13, 14, 15, 16, 17, 18. The stem andleaf plot Asecond graphical display appropriate for nume-
is then as follows: ri¢al variables is the box plot. A box plot displays
the date in summarized form. Not all points are
18 8 plotted in a box plot, only selected summary
Lt values — namely the median, 25th and 75th
16 percentiles, and the maximum and minimum
15 values.
14 The median (or 50th percentile) is that value
13 which divides the sample values, ordered from
12 small to large, in half, that is, half the values lie
1l above the median, the other half below. If the
10 55 sample size is odd, the middle value in the
9 237 ordered series is the median. If the sample size is
8 04689 even, the median is the average of the two middle
é 589 values. Therefore the median of the 16 blood
6 48 pressure values is the average of the 8th and 9th
values, namely the average of 86 and 88, thus 87.
Any value which lies far away from the other Quartiles divide the sample values in quarters.
values (an outlier) can be detected easily. The The lower quartile is also called the 25th percen-
valué 188 is suspect and should be checked. It tile, and the upper quartile the 75th percentile.
may be a typing error: 88 typed as 188. Apart from The distance between the lower and upper quar-
the extreme value, the values seem to follow a tile is called the interquartile range. For the 16
symmetric distribution. Figure 11.2 depicts what blood pressure values, the lower quartile must lie
different data distributions would look like when between the 4th and 5th values in the ordered
illustrated by a stem and leaf plot. series (values 78 and 79). The 25th percentile is
If groups are to be compared, stem and leaf defined as being a quarter of the way between
plots drawn next to one another are useful for these two values, namely at 78,25. The upper
Figure 11.3 Stem and leaf plot of children’s psychological test scores
GROUP1 GROUP2
(n= 24) (n= 25)
105
c data presentation, analysis, and interpretation
a a a eR
The box plots of the psychological test data are Bivariate plot
shown in Figure 11.4. The symmetry of Group 1is Bivariate or scatter plots are used to plot two
clear, since the median lies approximately half- numerical variables against one another, one
way between the two quartiles. Group 2 is asym- variable on the X axis (horizontal), the other on
metric, as can be seen from the fact that the the Y axis (vertical).
median is much closer to the lower quartile than These plots are especially helpful when one
to the upper quartile. Figure 11.2 depicts how dif- wishes to investigate relationships between vari-
ferent data distributions look when illustrated by ables, and should be drawn before astatistical
a box plot. evaluation of a relationship is done by means of,
for example, correlation or regression. They can
Histogram help to identify errors (Figure 11.5) and clarify
This graphical display is generally used not to relationships between variables (Figure 11.6).
evaluate the distribution of a numerical variable
at the start of the analysis, but rather to summa- Graphical display of categorical variables
rize a number of observations in compact form. A Whereas the graphical display of numerical vari-
histogram is drawn for a numerical variable ables described above is essential before formal
which has been grouped into categories. For statistical analysis can be done, the methods
example, blood pressure can be grouped into 10 described here for categorical variables are pri-
mm Hg groups. Each rectangle in the histogram marily used to present the results. The choice of
can represent either the number of observations method is a matter of personal preference. Bar
or the percentage of observations that fall in that graphs and pie charts can be used to represent
interval. The population pyramid described in categorical variables. The height of the bar in a
the section on demography in Chapter 2 is a spe- bar graph (or the size of the slice in a pie chart)
cial type of histogram used to depict the age and can represent either the number of observations
sex composition of a population. in a given category of the variable, or the percent-
106
11. = anintroduction to data presentation, analysis, and interpretation
44 54 64 74 84 94 104 Weight
in the bivariate plot of height and weight, the case with height = 189 cm and weight = 44 kg stands out as being question-
able. In an investigation of only height or weight, the values for this case would not have seemed strange.
AGE
Year
20 40 60
The relationship between lung function and age is depicted in the following plot. The plot clearly indicates that lung
function values only start to decrease with increasing age after the age of 40 years.
107
c data presentation, analysis, and interpretation
108
11 = anintroduction to data presentation, analysis, and interpretation
So, for the 16 blood pressure values the mean is has a large influence on the mean, pulling it away
(64+ 68+ 75+ 78+ 79 +80 + 84+ 86+ 88+89+92+ from the other four values. If the distribution is
93 + 97+ 105+ 105+ 188) /16=91,9 asymmetric or if there are outliers, you should
rather use the median as measure of central loca-
(The mean has at most one decimal place more tion. The median is called a robust alternative to
than the original values. The above mean should the mean, since it is not sensitive to outliers or
not be given as 91,9375 since it implies spurious ac- asymmetry in the distribution of the data. For the
curacy in the original measurements, namely that values 1, 2, 3, 4, 5, the median is 3. For the values
the original measurements were taken to the fourth 1, 2, 3, 4, 50, the median is also 3, indicating that
decimal place.) the median is not influenced by the extreme value
The mean is sensitive to extreme values, espe- of 50. If the distribution of the data is symmetric,
cially in small samples. For example, if the sample the median and mean will be close together.
consists of five values, namely 1, 2, 3, 4 and 5, the ‘For some variables (especially laboratory deter-
mean is 15/5 = 3. However, if the values are 1, 2, 3, minations), you may encounter censored obser-
4 and 50, the mean is 60/5 = 12 . The high value 50 vations. These are observations for which the
Figure 11.8 Block chart of the percentage of respondents (in four age/
ese RRR Cue roe me it
strategies (a respondent could mention more than one option)
‘| FEMALE > 18
=53
11,3 % als £A18,9%
FEMALE
< 18
n=/74
MALE
= 18
nase
34,4 % 5,7 % 11,4%
MALE < 18
4
31,8 %
Mathews C, Kuhn L, Metcalf C A, Joubert G. Cameron N A. ‘Knowledge, attitudes and beliefs about
AIDS in township school students in Cape Town.’ South African Medical Journal 1990; 78:511-16.
109
c data presentation, analysis, and interpretation
i LCE CL A AAA AA ICC EAT EA A A
ee cat csc ate
To ensure that the graphical display reveals the truth about the data, one should keep the following in mind.
Categorical variables
If bar graphs or pie charts are drawn to reflect percentages in different categories, one should state (in the caption or title
of the graphic) the number out of which percentages were calculated. To represent 1 out of 3 as 33,3%, without stating
that the sample only consisted of 3 individuals, is misleading.
Numerical variables
When drawing scatter plots of two variables, it is important that reasonable scales are used. Reasonable scales are scales
which do not put too much ortoo little emphasis on the actual results. Scales should also start at the nought value if
possible. The following plot seems to indicate that there was abig increase in the percentage of mortality due to cause X
between 1985 and 1986. However, close inspection of the scales reveals that the percentage increased from 10,5% in
1985 to 11,0% in 1986, hardly as dramatic an increase as the plot suggests. It may therefore be a good idea to include the
0 value on the scales.
Percentage
11,0
10,9
10,8
10,7
10,6
10,5
10,4
10,3
10,2
10,1
10,0
1981 1982 1983 1984 1985 1986 1987
Year
actual value is not known, but it is known that the tain value (for example, a scale which weighs
value is greater than a certain maximum (for most respondents as 60 kg), but is of little use as a
example, age greater than 65 years) or smaller measure of central location.
than a certain minimum (for example, a labora- In the case of a skew distribution, a mathe-
tory instrument can detect very low levels of matical transformation of each value (for exam-
cholesterol, lower than a certain minimum, but it ple, the log transformation where the log of each
cannot determine the exact values). In these cases, value is calculated) can be used to make the dis-
since the exact extreme values are not known, the tribution more symmetric. The mean of the trans-
median is the appropriate summary measure. formed values is calculated and then transformed
The mode is the value which occurs most often. back. In the case of the log transformation, the
This may be useful in determining whether a antilog of the mean of the transformed values is
measuring instrument has a preference for a cer- taken. This is known as the geometric mean.
110
11 = anintroduction to data presentation, analysis, and interpretation
For the psychological test scores in Group 1, the graphical display) in order to decide on appro-
mean (64,3) is the appropriate measure of central lo- priate summary statistics. The following type of
cation. For the other group, however, the median has data is often encountered: grams of alcohol con-
to be used. If you want to compare the two groups, sumed weekly were noted in a heart disease risk
you must, of course, use the same summary statistics factor survey. 76% of respondents did not drink
for both groups. If the values of one group are asym- (so grams of alcohol consumed equal zero) and
metric or if there is an extreme value, the locations of the remainder drank between 20 and 1 000 grams.
both groups should be summarized by medians. Is it sensible to summarize the group’s alcohol
consumption with the overall mean of 78,2 g? Per-
Measures of variability (spread) centiles would be more appropriate (median = 0
The variability of a data set is the degree to which and 75th % = 0), or alternatively the researcher
observations in the data set vary from each other could state that 76% did not drink, and then state
with respect to a particular characteristic. For the mean or the median consumption of the
example, consider two samples of 10 children drinkers.
each with ages as follows: For a given numerical variable, the central
value (summarized by the mean or median) may
Group 1: 2, 4, 6, 7, 8, 10, 11, 12, 14, 16 be of little interest. For example, it may be more
Group 2:6, 6, 8, 8, 8, 10, 10, 10, 12, 14 relevant to the researcher to know what percent-
age of the sample has high blood pressure than
The mean and median ages in both groups are 9 what the mean blood pressure is. A numerical
years, but the observations in Group 1 vary much variable can be categorized and then summarized
more than those in Group 2. The variability of as a categorical variable. The choice of categories
Group1is thus greater than that of Group 2. for a numerical variable should make sense clini-
The range (the largest value minus the smallest cally (for example, hypertension is defined as sys-
value, or denoted by indicating the largest and tolic blood pressure of 160 and above or diastolic
smallest values separately) is a measure of vari- blood pressure of 95 and above). The choice can
ability which we might consider using intuitively: depend on the distribution of the data (for exam-
in Group 1, the range is 2 to 16, in Group 2, it is 6 ple, grouping individuals over 65 all together in
to 14. However, the range is not very useful as a one age group if there are only a few who are be-
measure of variability, since it is greatly influ- tween 65 and 74, 75 and 84, and so on), but should
enced by one extremely large or small value. not be made in such a way as to influence the
A commonly used measure of variation is the results in a given direction.
standard deviation. The standard deviation is
defined as: Estimating the population
parameters
¥ (x;-X)?
n-1 A researcher studies a sample in order to make
statements (inference) about the population from
and thus gives an indication of the average dis- which the sample was selected. The summary
tance from the mean. The variance is the square statistics outlined above describe the sample
of the standard deviation. data, but what can be said about the population?
A robust alternative to the standard deviation is A parameter is a characteristic (for example, the
the interquartile range (75th percentile minus 25th mean) of a population. A statistic is a character-
percentile). The standard deviation is used with the istic of a sample of the population, and is an es-
mean, the interquartile range with the median. timate of the corresponding parameter in the
Thus, if the data is asymmetric or if there are out- population (for example, the sample mean es-
liers, the median and interquartile range (or 25th timates the population mean). However, as the
and 75th percentiles) should be used as summary simple example in Table 11.3 indicates, different
measures. samples of the same size randomly selected from
a given population will not necessarily provide
Conclusion the same estimates, nor will they provide es-
Summary statistics should give a true representa- timates which are identical to the population
tion of the data. The distribution of the data parameters. This is due to random variation in
should therefore be considered (through some any sample and is known as sampling error. If the
111
ce data presentation, analysis, and interpretation
The population consists of 50 individuals. Two characteristics are of interest: age and sex
Sample 1 estimates : 4 males out of 10 subjects (40%); mean age 57,8 years
Sample 2 estimates : 8 males out of 10 subjects (80%); mean age 66,2 years
Population parameters:
30 males out of 50 subjects (60%); mean age 64,6 years
112
11 = anintroduction to data presentation, analysis, and interpretation
as an estimate of the population parameter. The includes 68% of the observations, and the area
standard error is calculated using the sample size within 1,96 standard deviations on either side of
and the variability of the data. For example, the the mean includes 95% of the observations.
standard error of the sample mean is ja The Many biological variables, for example, birth-
sample estimate, however, remains the best weight, are in fact approximately normally distri-
single estimate of the population parameter. buted. In small data sets, it may be difficult to
Before we discuss how the standard error can validate the assumption of normality adequately,
be used to make a statement about the popula- and in small samples with skew distributions,
tion parameter, we need to know something normality cannot be assumed. In such cases,
about probability distributions. nonparametric methods of analysis need to be
considered (see Parametric versus nonparametric
Probability distributions methods below).
A probability distribution is a theoretical distribu- Other important probability distributions are
tion which provides information on the probabil- the Binomial distribution for binary variables, the
ity of observing different ranges of values for a Poisson distribution for count data, and the expo-
given variable. A probability distribution can be nential distribution for continuous variables.
thought of as a data distribution based on an in-
finitely large sample. Such probability distribu- Confidence intervals
tions exist for categorical as well as numerical As discussed before, if various samples of the
variables. Probability distributions are commonly same size are drawn from a given target popula-
characterized by their mean and variance. tion, a range of different estimates will be ob-
One of the probability distributions used most tained. These estimates will follow some theo-
often is the Normal (Gaussian) distribution. Most retical distribution. We use this information to
of the commonly used methods of statistical calculate a confidence interval for the population
analysis (for example t-tests, correlation, and parameter, based on our sample estimate.
regression) are based on the assumption that the A confidence interval gives a range of para-
continuous variables which are analysed follow a meter values considered plausible for the popula-
Normal distribution. The Normal distribution. is tion, based on the sample data, and is a useful
bell-shaped and is symmetric around its mean way of describing the (im)precision of the es-
(Figure 11.10). Furthermore, the area within one timate. A confidence interval is calculated by
standard deviation on either side of the mean using the sample estimate, its standard error, the
68% of area
113
ce data presentation, analysis, and interpretation
ei
a a a a
degree of certainty (confidence) we want to asso- about a parameter of one or more populations
ciate with the confidence interval (say 95% or (see Chapter 6: Setting objectives for research). To
99%), and the cutoff value of the appropriate determine whether, for example, two factors are
probability distribution (for example 1,96 in the associated, we need to formulate a null hypo-
case of 4 95% confidence interval based on the thesis (Hg) which can then be tested. The null
Normal distribution). If we calculate a 95% confi- hypothesis is the hypothesis of no association or
dence interval, we can say that, in a series of iden- no difference.
tical studies based on different samples from the A test statistic is then calculated byastatistical
same population, 95% of the 95% confidence formula, using the sample data. When the null hy-
intervals calculated from these studies will pothesis is true (that is, there is no difference or
include the true population parameter. An impre- association), this test statistic follows a certain
cise estimate (with large standard error due to distribution (for example t, x7). By comparing the
small sample size and/or large variability in the test statistic with this distribution, we can deter-
sample data) will lead to a wide confidence inter- mine how likely it is to obtain the observed
val (that is, a wide range of values which are sample data if the null hypothesis were true (that
considered plausible for the population para- is, if there were no association or no difference).
meter). For example, if in a sample of 10 clinic This probability is known as the p-value associ-
attenders, four are found to be HIV positive, the ated with the test statistic. The p-value is the
percentage positive is 40% with a 95% confidence probability of observing the test statistic or a
interval ranging from 12% to 74% (indicated as more extreme result if the null hypothesis is true.
[12% ; 74%] or [12%-74%]). The confidence inter- The p-value is thus the probability of finding an
val is wide since the sample is small. We are thus association or a difference, if there is in reality no
uncertain what the HIV prevalence is. If, however, association or no difference.
the sample consists of 100 and 40 are HIV posi- If the p-value is large (for example, p = 0,80), the
tive, the percentage positive is 40% with a 95% data is consistent with the null hypothesis. This
confidence interval ranging from 30% to 50%, a implies that the null hypothesis cannot be re-
narrower confidence interval than in the case of jected, using the observed data. However, a large
the smaller sample. (See Table 11.4 for a further p-value does not imply that the null hypothesis
example.) can be accepted. For example, the sample may be
Methods for calculating confidence intervals too small to detect the difference that really exists.
for most epidemiological applications (for exam- This is known as the Type II error.
ple, means, differences between means, medians, If the p-value is small (for example p = 0,01) the
proportions, odds ratios, correlation coefficients, data is unlikely to have been as observed if the
and survival time) are described, with examples, null hypothesis is true. In other words, there is
in Statistics with Confidence by Gardner and evidence that there is indeed a significant asso-
Altman (1989). ciation or difference, and the null hypothesis is
rejected.
Hypothesis (significance) testing An arbitrary cut-off point of the p-value,
If the researcher is interested in determining the namely 0,05, is often chosen. This cut-off value is
level of a characteristic (for example, the preva- called the significance level of the test, and refers
lence of HIV infection in the Free State), the size to the probability of rejecting the null hypothesis
of any difference between groups (for example, ifit is in fact true (the Type I error). Ifwe are trying
the mean age difference of mothers with low to prove that an association or difference exists,
birthweight babies and mothers with normal we would like the probability of finding a relation-
birthweight babies), or the strength of an associa- ship or difference when there is in fact none, to be
tion (for example, the risk of heart disease in small. P-values are often classified as ‘significant’
hypertensive patients compared to normotensive if p<0,05 and ‘non-significant’ (often written in
patients), confidence intervals should be calcu- the literature as NS) if p>0,05. This over-simplistic
lated as outlined above. On the other hand, the use of p-values is now strongly discouraged in the
researcher may wish to test some hypothesis: are medical literature. If the reader of an article sees
two factors significantly associated or are two that a difference between two groups was found
groups significantly different? In this case, the re- to be ‘non-significant’, there is no information
searcher is interested in hypothesis testing. whether the p-value might have been 0,06 or 0,60.
A statistical hypothesis is an assumption made Clearly p-values of 0,06 and 0,60 should be inter-
114
11 an introduction to data presentation, analysis, and interpretation
preted differently and both should not be simply the confidence interval is interpreted to decide
classified ‘non-significant’. Exact p-values should whether the differenceis clinically important.
be given in papers so that readers can decide for Whereas a significance test cannot indicate that
themselves whether they agree with the re- there is no difference, a confidence interval can. If
searcher on the ‘significance’ of the results. in the above example, a change of three units or
Significance testing can indicate whether there more is taken to be clinically meaningful, the con-
is a statistically significant difference between fidence interval indicates that there was no clin-
two groups, ora Statistically significant associa- ically meaningful change. If the confidence inter-
tion between two variables. As pointed out above, val had ranged from 2 to 4, we would not be able
a significance test cannot indicate that there is no to draw a conclusion, since the interval includes
difference. values which indicate a clinically meaningful
change, as well as values which indicate that there
Statistical versus clinical/public health was no clinically meaningful change.
significance The decision on how large a difference or
Hypothesis testing determines the statistical change is required for it to be clinically meaning-
significance of an observed effect. If two large ful, is based on knowledge of the subject matter.
samples are compared, small differences between In the case of measurements such as blood pres-
groups may be found to be statistically signific- sure, it may be clear what is clinically meaningful,
ant, although such small differences may not be but in other circumstances it may be less clear-
of clinical or public health importance, or prac- cut. For example, if the prevalence of HIV in one
tical value. Similarly, in small samples, large dif- group is 3%, but 5% in the other, is that a meaning-
ferences may fail to reach statistical significance, ful difference?
although they seem clinically important. (See Confidence intervals and p-values are closely
Table 11.4 for an example of sample size and related: if the difference between the means of
statistical significance.) A researcher thus cannot two groups is significant at the 5% level (that is,
base decision-making purely on statistical signi- p<0,05), then the 95% confidence interval for the
ficance, but must consider the clinical or public difference will exclude the value zero (that is, the
health significance of the results. This can be confidence interval will also indicate that there is
done by calculating confidence intervals and a difference). Confidence intervals are thus some-
interpreting them clinically, that is, interpreting times also used for purposes of statistical signific-
the size of the difference or association in terms of ance (the result is significant, since the null value
its meaning for the patient, disease, or health ser- is not inside the confidence interval). This is not
vice. the purpose of a confidence interval. If the focus
In a follow-up study to determine associations is on estimation (which is most often the case),
between high density lipoprotein (HDL) choles- confidence intervals should be calculated and the
terol and women’s employment (Haertel, Heiss, range of values interpreted clinically to determine
Filipiak, Doering 1992), it was found that the 761 whether the findings are clinically important.
women who had been homemakers during the
first survey and had remained homemakers Parametric versus non-parametric methods
during the second survey had a mean decrease in If we can assume that our data comes from an
HDL cholesterol of 0,99 mg/dl. The standard error underlying distribution such as the Normal distri-
was 0,41 andastatistical test to assess whether bution, statistical methods (hypothesis testing or
this decrease was greater than zero revealed that confidence intervals) based on the assumptions
the decrease was statistically significant (p = of the underlying distribution can be used. These
0,015). The 95% confidence interval for the mean methods are called parametric, since the underly-
decrease was 0,19 to 1,79. There is clearly a de- ing distribution is assumed to come from a para-
crease, but on the basis of clinical knowledge, it metric family of distributions. Nonparametric
has to be decided whether a decrease of at most methods are used if such assumptions cannot be
two units over a period of more than two years is made. For example, the Mann-Whitney U test is
clinically meaningful on a variable which had a the nonparametric equivalent of the two-sample
mean value of 65 at the first visit. The fact that t-test. These distribution-free methods which
such a small difference is statistically significant is make no assumptions about the data distribution
due to the large sample size. A significance test are often ranking techniques, which focus on the
thus indicates whether there is a difference, but order or ranking of observations, and not on their
115
ce data presentation, analysis, and interpretation
116
11 = anintroduction to data presentation, analysis, and interpretation
A/(A+B) = A(C+D) posed, and thus the relative risk and risk differ-
C/(C+D) C(A+B). ence cannot be calculated. It is also not possible
to calculate the odds of disease among the
In this example, it is thus 0,2/0,1 = 2. exposed and unexposed. A case-control study
does however provide information on the odds of
Since risks are probabilities or proportions which exposure among the cases (A/C) and controls
lie between 0 and 1 (or percentages which lie be- (B/D) which can be used to calculate the odds
tween 0% and 100%), the relative risk is always a ratio:
positive number. A relative risk of 1 indicates that
the risk of disease in the exposed group is the A/G cy VAD
same as in the unexposed group. The exposure B/D BC.
and disease are thus not associated. A relative risk
larger than 1 (as in our example) indicates that the From the above, it is clear that it is very important
exposure is a risk factor, whereas a relative risk to identify the type of study in which the exposure
less than 1 indicates that the exposure is protec- and disease information was gathered in order to
tive. know which measure of strength of association
A risk difference, that is, the difference be- can be estimated. In cross-sectional and follow-
tween the risk among exposed and non-exposed up studies, the relative risk or risk difference
can also be calculated: A/(A + B) — C/(C + D). In should be the measure of preference.
our example, the risk difference is 10% or 0,10. The odds ratio and relative risk are dimension-
The risk difference lies between -1 and 1 (or less and have been criticized because of this fea-
-100% and 100% if the risks are expressed as per- ture. Say, for example, the rate among the ex-
centages) with a difference of 0 indicating no posed is 5 per million and 1 per million among the
association. unexposed, then the relative risk is 5 and the risk
A third measure of the strength of the associa- difference 4 per million. If the rate among the ex-
tion is the odds ratio. The chance of disease can posed is 5 per thousand and among the unex-
be measured by the odds, where the odds of posed 1 per thousand, the relative risk is also 5 but
disease among the exposed is A/B (20/80 = 0,25.in the risk difference 4 per thousand, indicating a
our example) and among the non-exposed C/D more serious increase than in the first case, al-
(30/270 = 0,11 in our example). The odds ratio is though the relative risks are the same. It has
then’ therefore been said that the risk difference is the
appropriate measure of the practical magnitude,
(A/B) = AD in terms of public health importance, of an asso-
(C/D) BC. ciation. However, if one risk factor has a higher
relative risk than another, the first risk factor is
In our example the odds ratio is thus 0,25/0,11 = thought to play a more important role in the
Zeek causation of the disease.
The interpretation of the odds ratio is similar to The attributable risk indicates the proportion
that of the relative risk. If the disease is rare (that by which the rate of disease will be reduced if the
is, if A is very small relative to A + B, and C is very exposure were eliminated. It takes into account
small relative to C + D), the odds ratio approx- the relative risk, as well as how common the risk
imates the relative risk. factor is in the population being studied. So,
If we are analysing data from a follow-up (pro- factor A may have a lower relative risk than risk
spective) study (in which groups of exposed and factor B, but because more people are exposed to
non-exposed are sampled and followed up), the risk factor A, the attributable risk will indicate that
chances of disease and therefore the relative risk, factor A is more important to the health of the
risk difference, and odds ratio can be calculated community. The proportion by which the rate of
as for the cross-sectional study. In a case-control disease will be reduced if the exposure were
study, however, a predetermined number of cases ‘ eliminated is higher for factor A. The attributable
(diseased) and controls (not diseased) are risk is therefore of great importance in health
selected and their numbers may not reflect the services planning, namely in deciding which
proportions of diseased and non-diseased in the risk factors to target for prevention and inter-
population. It is therefore not possible to estimate vention. The attributable risk is calculated as
the risk of disease among the exposed and unex- follows:
Ve
c data presentation, analysis, and interpretation
P(RR-1) Controls
1+P(RR-1) Exposed Not exposed
C
where Pis the proportion of the population who a Exposed A B A+B
are affected by the factor. s Not exposed _C D C+D
The risk ratio, risk difference, odds ratio, and e A+C B+D A+B+C+D
attributable risk are estimates and should there- s pairs
fore be reported with standard errors or 95% con-
fidence intervals to indicate the precision of the The odds ratio is then estimated by B/C.
estimates.
More than two levels of exposure
Paired observations If an exposure has more than two levels, an r x 2
The 2 x 2 tables outlined above cannot be used if table can be set up (where r is the number of
observations are paired, for example, in a levels of exposure). For example, daily cigarette
matched case-control study where a control was smoking can be classified as 0, 1-2, 3-4, 5-9, 10
selected for a specific case. The 2 x 2 table is then and more. If the association between the level of
set up as follows, consisting not of the number of smoking and the presence of lung disease is to be
individuals included in the study but the number studied, a 5 x 2 table can be formed. To calculate
of pairs. relative risks or odds ratios, one level of exposure is
chosen as the reference level (generally the level
A study is done on 50 subjects to investigate the association between exposure A and disease B.
The following results are obtained:
present absent
A: present 15 (75%) 5 (25%) 20 (100%)
absent 20 (66%) 10 (33%) 30 (100%)
35 50
Null hypothesis : the two factors are not associated.
Atest statistic which follows the X? distribution is calculated to evaluate the association between the
two categorical variables A and B, and the p-value equals 0,53. One would thus conclude that the two
factors are not associated. If, however, the study was done on 500 subjects and exactly the same con-
figuration of results obtained, namely:
B
present absent
A: present 150 (75%) 50 (25%) 200 (100%)
absent 200 (66%) 100 (33%) 300 (100%)
350 150 500
the test statistic results in a p-value of 0,05, thus there seems to be evidence of an association.
The statistical test measures the significance of the association (which depends on the sample size),
but not the degree of association. The odds ratio, on the other hand, measures the degree or strength
of the association, independent of sample size.
The odds ratio in the 50 subject case is 1,50 with a 95% confidence interval of 0,36 to 6,42. For the
500 subjects, the odds ratio is also 1,50, but the narrower confidence interval of O, “ to 2,28 reflects
the better precision of the odds ratio estimate from the bigger sample.
118
11 = anintroduction to data presentation, analysis, and interpretation
with the least risk of disease is taken as the refer- Males Females
ence, thus in this case, the level of 0 cigarettes) Diseased Not Diseased Not
and each of the other levels is compared to the diseased diseased
reference level in turn (thus forming four 2 x 2 Exposed 20 (45%) 24} 44 | 131 (61%) 84) 215
tables). Not
exposed 36 (33%) 73|109} 48 (45%)
Stratification 56 971153 | 179 143 |322
Extraneous factors, such as sex or age, may compli- RR=1,38 RR=1,36
cate the estimation of the measure of association
between exposure and disease (see Chapter 7: Omitting the confounder sex thus introduced a
Study design). By using a stratified analysis, the re- slightly stronger relationship than actually exists.
searcher attempts to adjust for, or remove the effect Confounders can also mask a real association. For
of, an extraneous factor (confounder) which is re- example, sex can be a confounder in the association
lated to both the exposure and the disease. Rather between smoking and hypertension. In this case,
than one (crude) measure of association, one women have a low prevalence of smoking but a high
would calculate measures of association for each of risk of disease, whereas men have a lower risk of dis-
the strata of the extraneous factor and possibly ease, but a higher prevalence of smoking. It may
compute an adjusted (common) measure. therefore seem as if smokers (because they are pre-
Age is often an extraneous factor which has to be dominantly males) have a lower risk of hypertension
accommodated in the analysis. For example, in a than non-smokers (because they are predominantly
cross-sectional study investigating the association females). S
between hypertension and heart disease, it might After stratification, an adjusted (common) rel-
be found that the hypertensives are significantly ative risk using the Mantel-Haenszel method can
older than the non-hypertensives, and those with be calculated. This technique calculates a com-
heart disease older than those without heart dis- mon relative risk by using the stratum-specific
ease. Is the significant association between hyper- relative risks and the stratum size.
tension and heart disease perhaps only due to the The extraneous factor can also act as an effect
fact that the hypertensives are older people and modifier, in which case the relative risks or odds
therefore more likely to have heart disease? We thus ratios in the different strata are markedly differ-
want to remove the effect of age to get a clear pic- ent. This is also often called interaction, as the
ture of the hypertension and heart disease associa- exposure and disease association interacts with
tion. In this case, we would calculate relative risks the extraneous variable, and depends on which
or odds ratios for separate age strata, for example, stratum of the extraneous variable we are invest-
20-29, 30-39, 40-49, 50-59, 60-69, 70+. igating. In the following example, sex is an effect
Confounders can have various effects on the as- modifier for the relationship between insulin and
sociation between exposure and disease. The triglycerides:
confounder can lead to over- or under-estimation
of the association between exposure and disease. Males Females
In the following example, the association be-
high normal high _ normal
tween obesity (exposure) and hypertension (out-
insulinhigh 16(12%) 113
come) is investigated in a sample of QwaQwa resi-
insulinnormal 14(8%) 160 {174 | 5(5%) 96
dents 45 years and older (Mollentze, Joubert 30 273 303 | 14 129 | 143
1994). The crude relative risk is 1,50. RR= 1,54 RR=4,33
119
ce data presentation, analysis, and interpretation
a ec
120
11 an introduction to data presentation, analysis, and interpretation
pas
x
2
>
—
o
a
2
oO
7 21 Se OR BT at ae 4G
Observer 2
Two observers were asked to score 16 individuals. A strong positive correlation was found when calculating a correlation
coefficient. The graphical display is more informative, however, indicating that observer 2 consistently gave higher scores
than observer 1.
121
c datapresentation, analysis, and interpretation
122
11. = anintroduction to data presentation, analysis, and interpretation
Design features .
1 Was the objective of the study sufficiently described? Yes Unclear No
Conduct of study
5 Was a Satisfactory response rate achieved? Yes No Unclear
Recommendation on paper
10 _ Is the paper of acceptable statistical standard for publication? Yes No
123
12 Integrating
epidemiological
concepts
The relationship between sample ferent samples from the same population differed
size, study design and precision of from the population parameters. Using the same
population data, Table 12.1 shows how the confi-
estimates dence intervals become narrower as the sample
Introduction size increases.
Epidemiological studies are fundamentally about However, the magnitude of the benefit in terms
measurement. Variability is intrinsic to biological of greater precision diminishes as the sample size
systems and the process of sampling from a increases. A critical sample size is reached when
population also introduces variability. In this sec- the standard error does not decrease substan-
tion of the chapter, we deal with the issue of pre- tially, even though the sample size increases. This
cision, sometimes referred to as ‘random error’ or situation, which exemplifies the ‘law of diminish-
‘reliability’. ing returns’, highlights the importance of select-
Precision and validity are two separate con- ing the most appropriate sample size. If the
cepts, since it is possible to be precise yet not sample is too small, precision is compromised,
valid. If, for instance, a scale for measuring weight while if the sample is too big, resources are being
in an anthropometric study was not calibrated, wasted.
and all measurements started from 1 kg instead of At one extreme, when the entire population is
0 kg, the mean weight would not be valid since it sampled (that is, a census is taken), the sample
would be 1 kg more than the true mean weight. mean is the population mean. Generally, increas-
However, when the same child is weighed repeat- ing the sample size also has the effect of moving
edly on the same scale, the scale produces iden- the sample mean closer to the population mean
tical weights, demonstrating that the scale is pre- (Table 12.1).
cise even though the weights are not valid. Preci- More importantly, sample size plays a substan-
sion refers to the degree of variability around an tial role in determining the standard error, since
estimate of a parameter, and is reflected by the the formula to calculate the standard error is the
width of the confidence interval. standard deviation divided by the square root of
The precision of an estimate is affected by sam- the sample size. For example, in the case of the
ple size. The more subjects studied, the greater the mean, the standard error is given by:
degree of precision of the estimate. This is reflected
in narrower confidence intervals as sample size in- ze & (xj-X)?
creases. Precision is also influenced by study \a n=1
design. The degree of precision of an estimate can
be increased if the study is designed efficiently. The large influence of the sample size in the
denominator of the formula above, makes sample
Sample size size an important determinant of the width of the
In Table 11.3 in Chapter 11, an example was pro- confidence interval, and therefore of the level of
vided which showed how the estimates from dif- precision. When preparing a study protocol,
124
12 integrating epidemiological concepts
Sample 1: Individuals: 17, 14, 46, 11, 39, 30, 02, 08, 15, 22
Sample 2: Individuals: (Sample 1) and 37, 10, 45, 33, 36, 07, 26, 09, 42, 25
Sample 3: Individuals: (Sample 2) and 24, 44, 23, 21, 12, 16, 01, 03, 34, 13
There are 60% males in the population and the mean age is 64,6 years.
careful consideration should be given to the a 10% margin of error, requires a sample size of 97
calculation of the sample size (see Chapter 8: if simple random sampling is used, but a sample
Sampling). size of 210 if cluster sampling of 7 per cluster is
used (assumed design effect of 2). However, in
Design considerations certain instances, such as rural areas where the
For a given sample size, it is possible to improve total number of potential subjects is not known,
the tevel of precision in the study by designing the the larger sample size for a cluster survey may be
study differently, namely more efficiently. This cheaper and logistically simpler than a simple
section will present some ways in which precision random sample survey with a smaller sample
may be improved without increasing the sample size.
size. Many of the issues raised in this section are Increasing the proportion of subjects with the
to be considered as trade-offs between the need exposure being studied can improve precision. In
for precision and the cost or feasibility of con- a cross-sectional survey to assess the association
ducting the study. This approach of trying to con- between having pierced ears (for earrings) and
sider the costs of the study on the one hand, and hepatitis B virus infection, the following was
the level of precision on the other, is sometimes found:
referred to as the ‘efficiency’ of the study design.
Greater efficiency is achieved if more precision is Hepatitis B virus infection
obtained with the same number of (or fewer) Yes No
subjects. Pierced ears Yes 6 14 20
The sampling strategy selected when designing No 38 342 380
a study influences the degree of precision. Cluster 44 356 400
samples are prone to distortions due to the simil-
arity between subjects in a cluster (intra-cluster Prevalence of pierced ears = 5%
homogeneity). Therefore, in order to obtain the Relative Risk (RR) = 3 (95% confidence interval:
same degree of precision as a simple random 1,44; 6,25)
sample, larger samples are required. For example, The same study, conducted in a community
a community-based study to assess vaccination where ear piercing was encouraged, produced the
coverage, which is expected to be about 50% with following results:
c¢ data presentation, analysis, and interpretation
ng ng ee a
Hepatitis B virus infection recruiting a case and a control subject is the same,
Yes No altering the ratio of cases to controls is efficient
Pierced ears Yes 30 70 100 and improves precision, as long as the smallest
No 30 270 300 cell in the 2 x 2 table increases. In most instances,
60 340 400 however, recruiting a control is cheaper and
quicker, particularly where the disease is rare. In
Prevalence of pierced ears = 25% this instance, increasing the number of controls
RR=3 (95% confidence interval: 1,91; 4,72) for a given number of cases leads to higher preci-
Both studies had a sample size of 400, an RR of 3 sion. However, the law of diminishing returns
and a 10% prevalence of hepatitis B virus infec- applies and there is a critical point beyond which
tion among subjects without pierced ears. How- a substantial increase in the number of controls
ever, the latter community has a higher preva- has little benefit in terms of improved precision.
lence of pierced ears, thereby leading to a higher
precision in the estimate of the RR as reflected by Conclusion
the narrower confidence intervals. However, it is Precision should not simply be equated with
not simply increasing the prevalence of the expo- sample size. The precision of the estimates ob-
sure (in this example, pierced ears) that improves tained from a sample depends on sample size and
precision; rather, it is the effect that increasing the way the study was designed. For a fixed sam-
the prevalence of the exposure has on increasing ple size, improved efficiency is possible with care-
the value of the smallest cell in the table. In the ful design of the study. While sample size plays an
first table, the smallest cell is 6, while in the important part in determining the level of preci-
second table, the smallest cell is 30. To demon- sion, study design becomes particularly import-
strate this point further, if the prevalence of ant when resources preclude an increase in the
pierced ears was 50%, the following results would sample size.
have been obtained:
Validity and bias: in search of truth
Hepatitis B virus infection
Yes No Introduction
Pierced ears Yes 60 140 200 Bias was described in Chapter 4. This section aims
No 20 180 200 to synthesize information and provide some
80 320 400 additional insights. Bias is a deviation from that
which is correct and true. Bias is distinct from
Prevalence of pierced ears = 50% precision; precision refers to repeatability, while
RR=3 (95% confidence interval :1,88; 4,78) bias refers to systematic deviation from the truth.
In this table, the prevalence of pierced ears Bias is also referred to as ‘systematic error’ as
increases to 50% from 25% in the previous table. opposed to ‘random error’, which is another term
Yet the precision is lower, demonstrating that in- for precision. Validity is the absence of significant
creasing the level of exposure does not uniformly bias.
lead to improved precision. An increase in pre- A small degree of bias may be inevitable in re-
valence has led to a decrease in the size of the search, for example, in almost all studies some of
smallest cell. the selected subjects may choose not to partici-
A study in which a higher proportion of sub- pate (participation bias), with the result that the
jects has the exposure leads to a higher level of sample does not fully represent the study popula-
precision, as long as the smallest value in the cells tion. While the presence or absence of bias should
of the 2 x 2 table increases as well. On the other be evaluated in a study, it is more important to
hand, the disadvantage of purposefully selecting assess the magnitude of the bias and the direction
a community with a higher prevalence of expo- in which it is likely to alter the study findings.
sure in order to improve the precision of the esti- The first step in assessing the validity of a study
mates is that the findings may no longer be estimate is to identify potential biases. The next
generalizable, since the study subjects may no step is to assess whether each bias is large or
longer represent the wider population. small. Then each bias needs to be assessed in
A similar benefit is achieved by altering the terms of whether the bias could reduce the esti-
ratio of cases to controls in order to improve pre- mate, exaggerate the estimate, or do either. The
cision in a case-control study. Where the cost of final step is to identify methods by which each
126
12 = integrating epidemiological concepts
bias could have been reduced (for example, cali- of coffee a day. It also transpired that those who
brating the scale after every 20 subjects have been drank more than four cups of coffee per day also
weighed) and determine if adequate steps were smoked more cigarettes. Since it is known that
taken in the study to reduce bias. smoking is associated with ischaemic heart
The presence of a large bias in a study (for disease, it is possible that heavy coffee drinkers
example, the bias introduced by a low response had more myocardial infarction emergencies be-
rate of only 30%) seriously undermines the valid- cause of their smoking, and not because of their
ity and value of the findings. Every effort should coffee consumption. This is an example of bias
be made to reduce and avoid bias in a study, for due to confounding.
example, by conducting repeat follow-ups with There are many kinds of bias, but all biases can
incentives to increase the response rate. be classified into one of three categories: selec-
tion bias, information bias, and confounding.
Types of bias This section deals with the first two types.
In a case-control study to assess whether passive
smoking during the antenatal period was asso- Selection bias
ciated with stillbirth, mothers who had had still- Representative samples are drawn from a popula-
births were interviewed soon after the birth. A tion in order to make the findings of the study
control group of mothers who had had normal applicable to the population. In some instances,
births were also interviewed soon after birth to the subjects being studied do not represent the
get a history of passive smoking during the ante- population from which they were selected. The
natal period. The verbal histories of exposure two main ways in which selection bias occurs is
were compared to urinary cotinine levels (a bio- through a flaw in the sampling procedure or
logical measure of cigarette smoke exposure used through non-participation of certain subjects,
here as a ‘gold standard’). It was found that many either at the start of the study (non-responders), or
of the women with normal births underestimated during the course of the study (loss to follow-up).
their exposure to passive smoke, while the Sampling bias, which is one form of selection
women with stillbirths overestimated their expo- bias, is covered in Chapter 8. In summary, incom-
sure to passive smoke. This is an example of recall pleteness of the sampling frame and the use of
bias. The women who had just been through the non-random sampling procedures are the main
traumatic experience of having astillbirth were sources of sampling bias. In an opinion poll for
searching for antenatal factors to blame for their the 1994 South African elections, interviews were
loss, with the result that they overestimated conducted at shopping malls and showed similar
smoke exposure. On the other hand, the women levels of support for the two major political
with normal healthy babies were too busy enjoy- parties. The sample was however biased, since
ing their newborns to think back and consider shopping mall samples under-represent the poor
whether they were exposed to smoke during the sections of the community, as well as rural popu-
antenatal period. lations, while urban, middle-class people are
In a cohort study to assess whether obesity is overrepresented. This sample did not represent
associated with hypertension, the blood pressure the voters of South Africa, since the sampling
of obese and non-obese subjects was measured frame did not include all voters.
using the same sphygmomanometer and cuff. In a descriptive survey to assess vaccination
Since this cuff was not big enough for obese arms, coverage in Durban, the register of electricity
there was a systematic exaggeration of the blood consumers was used as the sampling frame and a
pressure levels among those obese subjects with random sample of houses from the register were
large arms. This bias led to an artificially raised visited to obtain information on vaccination
relative risk of the association between obesity coverage of children in the household. This study
and hypertension. This is an example of measure- found a vaccination coverage level which was
ment bias. higher than all previous estimates. This was
In a cross-sectional study of coffee consump- ascribed to selection bias as a result of the exclu-
tion and history of emergency medical care for sion of houses without electricity (for example,
myocardial infarction, it was found that subjects shacks in squatter settlements), where vaccina-
who drank more than four cups of coffee a day tion coverage is known to be low.
had twice as many myocardial infarction emer- In studies which include a representative
gencies as subjects who drank less than one cup sample of homes, non-participation becomes an
127
c data presentation, analysis, and interpretation
ee
issue when several homes are too difficult to antibodies. The test is 99% sensitive and 98% spe-
reach, or no one is found at home, even with cific. This means that for every 100 HIV infected
repeat visits. The exclusion or replacement of individuals who are tested, one will be misclassi-
these homes can lead to a biased sample, since fied as HIV negative. Similarly, for every 100 truly
these very hard-to-reach homes often occur HIV negative subjects, two will be misclassified as
where health status is poor and access to health HIV positive (see below).
services is difficult. From a practical point of view,
the study protocol should include a procedure for True status
dealing with hard-to-reach and locked homes. HIV+ HIV-
For example, after two visits, a non-responding ELISA result HIV+ 99 2 101
home may be replaced with the next closest HIvV- 1 98 99
house. The extent of replacement must be docu- 100 100 200
mented and included in the discussion of the
main study findings. Sensitivity = 99%
In a community-based cohort study, 1 000 Specificity = 98%
adults were followed up annually to assess the False positive rate = 2%
incidence rate of ischaemic heart disease. It was False negative rate = 1%
found that over 10 years, 30% of the subjects
moved out of the study area — many moving to In an analytic study, the exposure variable, the
wealthier suburbs. This 30% of subjects who were outcome variable, and/or the related variables
lost to follow-up left behind a sample which was could be misclassified. (Misclassification of
biased. The bias ocurred due to the selection related variables such as confounders is beyond
process of who left; in this case, most of the the scope of this book; more information on this
upwardly mobile business-orientated people, can be obtained from Ahlbom and Norell (1990):
mostly smokers with type A personalities, left the Introduction to Modern Epidemiology.) When the
area. Ischaemic heart disease is more common in degree of misclassification of the exposure vari-
this group, with the result that the cohort study able is influenced by the outcome variable, differ-
would underestimate the incidence rate for ential misclassification of exposure is said to
ischaemic heart disease if only the remaining 70% have occurred. When the degree of misclassifica-
of subjects were studied. tion of the outcome variable is influenced by the
Selection biases are often subtle and it is exposure variable, differential misclassification
important to pay particular attention to the sam- of outcome has occurred. Non-differential mis-
pling design in order to reduce selection bias to a classification occurs when the degree of mis-
minimum. classification of either the exposure or outcome
variable is not dependent on the other.
Information bias
Information bias refers to systematic inaccuracies Non-differential misclassification
in measurement, recording, management, and Differential and non-differential misclassification
analysis of data. Such inaccuracies lead to mis- can have markedly different effects on the esti-
classification, for example, subjects who have mates obtained in a study. Non-differential mis-
been exposed to the risk factor being studied may classification of either exposure or outcome
be misclassified as unexposed. Misclassification usually biases the study estimate towards the
occurs when the measurement ascribed to a sub- null, that is no effect or RR=1.
ject is incorrect and the margin of error is suffi- Extending the previous example to condoms
ciently large to lead to the subject being placed in and HIV infection, in a study of 10 000 subjects
a different category during the data analysis. the following results were obtained:
Misclassification is most simply understood in
terms of the sensitivity and specificity of the ELISA
measurement method used. (See Figure 9.7 in HIV+ HIV-—
Chapter 9 for more details on sensitivity and Condom use Yes 100 1900 2 000
specificity.) For example, in a study to assess No 800 7200 8000
whether condoms protect against HIV infection, 900 9100 10000
the presence or absence of HIV infection is deter-
mined by an immunological test (ELISA) for HIV RR=0,50 (95% confidence interval: 0,41; 0,61)
128
12 integrating epidemiological concepts
Based on the HIV ELISA results, there appears misclassification should be minimized by using
to bea substantial protective effect of condoms in the most accurate measurement methods.
this study, since 5% of condom users and 10% of
non-users were HIV positive. Differential misclassification
Since the level of inaccuracy of the ELISA result Differential misclassification of exposure occurs
is not dependent upon whether condoms were when the degree of misclassification of the expo-
used, the HIV status has been non-differentially sure variable varies across strata of the outcome
misclassified. The misclassification of HIV status variable. Similarly, differential misclassification
can be crudely taken into account by adjusting for of outcome occurs when the degree of misclassi-
99% sensitivity and 98% specificity of the ELISA fication of the outcome variable varies across
test. strata of the exposure variable. Differential mis-
classification can bias the study estimate towards
ELISA or away from the null, unlike non-differential
HIV+~ HIV-— misclassification, which usually biases the study
Condom use Yes 63 1937 2000 estimate towards the null.
No 664 7336 8000 Earlier in this chapter, an example of a case-
727 9273 10000 control study to investigate the association be-
tween stillbirths and passive smoking was used to
RR =0,38 (95% confidence interval: 0,29; 0,49) demonstrate information bias. This investigation
The value of 63 for the first cell in the above included 100 women who had experienced still-
table is derived by adjusting as follows: births and 200 women with healthy newborns.
@ the sensitivity of 99% means that 1 of every 100 The results, based on a verbal history to assess
truly HIV infected has been incorrectly labelled passive smoking, were as follows:
HIV negative; adjusting for this among condom
users results in 101 HIV positive and 1 899 HIV Stillbirth
negative. Yes ___No
@ the specificity of 98% means that 38 (2% of Verbalhistoryof Yes 30 40 70
1 899) HIV negative condom users have been passive smoking No (0. et60230
incorrectly classified as HIV positive; adjusting 100 200 #300
for this results in 63 (101 minus 38) HIV positive
and 1 937 HIV negative condom users. OR = 1,71 (95% confidence interval: 0,95; 3,08)
repeat the above steps for condom non-users. An association was found, although not statist-
ically significant, between passive smoking and
The above adjustment to remove the non-differ- stillbirths when verbal history was used to
ential misclassification emanating from the inac- measure passive smoking. In addition, urinary
curacy of the Elisa test, results in an even larger cotinine was measured to corroborate the history;
protective effect. That is, the non-differential mis- and produced the following results:
classification of HIV status biased the RR towards
the null. Even though the HIV Elisa test has a high Stillbirth
degree of accuracy, namely 99% sensitive and Yes _No
98% specific, the effect on the RR is substantial, Urinarycotinine Yes 25 50 75
moving it from 0,38 to 0,5. No Co 150 225
Most measurements undertaken in a study are 100 200 #300
not 100% accurate, so there is some degree of in-
accuracy. As long as this measurement inac- OR = 1 (95% confidence interval: 0,55; 1,8)
curacy leads to non-differential misclassification, The biological measurement of passive smok-
any non-null finding in a study could not have ing revealed that five women with stillbirths had
been the result of this measurement bias, since overestimated their exposure to passive smoke,
non-differential misclassification usually biases while 10 women with healthy newborns had
towards the null. In statistical terms, non-differ- underestimated their exposure. This is an exam-
ential misclassification generally leads to a Type ple of differential misclassification; the misclassi-
I] error and not a Type |error. In order to increase fication of exposure differs across the outcome
the probability of finding an association where categories. It is noteworthy that only 5% of the
one truly exists, the amount of non-differential subjects had been misclassified, yet the effect on
29
ce data presentation, analysis, and interpretation
a
the odds ratio is substantial — the odds ratio of 1 ence of asthma among both those who had been
is biased to 1,71. exposed and those who had not been exposed to
In a cross-sectional study investigating the rela- passive smoke. However, the overestimation was
tionship between passive smoking and asthma substantially higher among subjects who had
(measured by peak expiratory flow rate), 1 000 been exposed to passive smoke, resulting in dif-
children were investigated and the following ferential misclassification. The effect of the bias in
results were found: this study was to exaggerate the association be-
tween passive smoking and-asthma.
Asthma Misclassification is often difficult to avoid be-
Yes No cause of inherent inaccuracies in many measure-
Passivesmoking Yes 160 240 400 ment tools. When misclassification cannot be
No 60 540 600 avoided, the design of the study should ensure
220 780 #1000 that the misclassification arising from measure-
ment inaccuracy is non-differential.
RR =4 (95% confidence interval: 3,06; 5,23)
It was later learned that the fieldworker who Conclusion
performed the peak expiratory flow rate measure- Selection bias produces results that may not
ment knew the passive smoking history of the be generalizable to the study population. In
subjects. There was concern that he may have analytic studies, a bias in the selection of one
biased the peak flow measurement by making group, for example, the control group in a case-
more effort when performing the peak flow control study, can produce results which are not
measurement on subjects who had a history of valid.
passive smoking. To assess this potential bias, the Information bias manifests in misclassification,
measurements were performed again by another which can be interpreted by the sensitivity and
fieldworker, who was blinded to the subject’s specificity of the measurement. When the mis-
smoking history status. The repeat study pro- classification is non-differential, the study esti-
duced the following results: mate is usually biased towards the null. Should a
study demonstrate an association in the presence
Asthma of non-differential misclassification, then the
Yes- No association is probably much stronger. Differen-
Passivesmoking Yes 100 300 400 tial misclassification can bias the study estimate
No 50 550 600 either towards or away from the null.
150 850 1000 Selection and information bias call into ques-
tion the validity of the study findings. Every
RR=3 (95% confidence interval: 2,19; 4,11) attempt should be made to reduce both these
The first fieldworker overestimated the preval- biasés to a minimum.
130
1 418 Knowledge, attitude, belief, and
practice poe er
4 19 Quaativemethodology a"
introduction - /
_ D Skinner, H van der Walt
ey
og
a
RES
13 The use of
routinely
available data in
epidemiological
studies
133
d common approaches in epidemiology
MORTALITY
— national Death reports by Central Statistical Summary data
Services (yearly) Excluding TBVC* until 1994
Under-registration and misclassification
of cause for Africans in particular
MORBIDITY
— national Notifiable (mainly infectious) conditions Summary data
reported in Epidemiological Comments Under-reporting
of Dept of Health (monthly) Regional variation in diagnosis
reporting practices
Cancer register kept by SAIMR Histology based, i.e. only cases
(yearly report) confirmed by histology
Virology surveillance from all RSA Under-reported
laboratories reported by the National Only public laboratories
Institute for Virology (monthly)
SASPREN: surveillance conducted Not a random sample of practitioners
by volunteer primary care practitioners
— local Notifiable conditions reported by local Summary data
authorities (Annual reports) Selected urban areas only
Under-reporting
Hospital discharge records No knowledge of the exact population data
Diagnoses not standardized
POPULATION
— national Census reports and data by Central Excluding TBVC until 1994
Statistical Services (5—10 yearly) Under-enumeration
Intercensual estimates by Central Excluding TBVC until 1994
Statistical Services (periodically) Under-enumeration
HSRC estimates and projections
(irregularly)
Population register (ongoing) Untested
Development Bank reports on Summary data
‘homeland’ areas (irregularly) Under-enumeration
134
13 = the use of routinely available data in epidemiological studies
BIRTHS
— national Central Statistical Service reports Excluding TBVC until 1994
(yearly) Under-registration
Out of date
Population register (ongoing) Untested
— local Annual reports by Medical Officers Selected urban areas only
of Health (yearly) Under-reporting
HEALTH CARE
South African Medical and Dental Postal address and not address of
Council's register of doctors and practice
dentists (ongoing) (1975, 1983, 1986, Inconsistent inclusion of practitioners
1989 at MRC) outside RSA
Census of doctors by Central Statistical
Services (1979, 1983)
South African Nursing Council’s Inconsistent inclusion of
register of enrolled and registered non-practising nurses
nurses (angoing) (1983, Postal address and not address of
1986, 1988, 1989 at MRC) practice
South African Hospital and Nursing Summary data
Yearbook including hospital and
clinical resources (yearly)
RAMS — register of medical aid Limited information
doctors (ongoing)
RAMS — medical aid claims report
and data on tape (ongoing)
ReHMIS — survey of resources in Untested
the public sector
damage of previous apartheid policies, it will be through a disclaimer justifying the inclusion of
important to monitor progress towards equity in population group in their analysis. Debates have
health services and health status. It will therefore arisen over the naming of certain categories. In
be necessary to continue to record population the past, the use of ‘coloured’ rather than colour-
group for certain data. However, at the same time, ed, and African rather than black usually reflected
it is important that ethnicity, education, occupa- dissent from official apartheid terminology.
tion, and income variables are used to reflect the
cultural and socio-economic stratification of
South African society. These variables need to be
Mortality data
included in the routine information systems. In the past, all registered deaths in the RSA ex-
Researchers who do not wish to promote a cluding Transkei, Bophuthatswana, Venda, and
racial categorization often express their views Ciskei were collected by the Central Statistical
135
d= common approaches in epidemiology
a a a re
Services and published annually by age, sex, deaths in a specific age/sex group are due to the
population group, and cause of death, as well as cause of death of interest. So, for example, it was
by magisterial district and occupational group. found that 7,8% of all deaths in children under 15
Since the- repeal of the Population Registration years during 1981-5 were due to injuries (non-
Act during 1991, this data has not been collected natural causes) (Kibel, Bradshaw, Joubert 1990).
according to population group. During 1994, the When calculating mortality rates, in addition to
independent homelands were re-incorporated the mortality information, we need information
into South Africa and their data is now being on the size and composition of the population.
processed by the Central Statistical Services. The Using such information, it was determined that
data available before and after 1994 is thus not annually 55,3 per 100 000 coloured children aged
strictly comparable. five to nine died due to injuries during 1981-5
The underlying cause of death is coded accord- (Kibel et al 1990). Mortality obviously looks at the
ing to the International Classification of Diseases most extreme outcome on the health/disease
(ICD) into one of 999 causes. These causes are spectrum, but in the absence of national morbid-
grouped into 17 main chapters (for example, neo- ity data, it provides useful information. (Also see
plasms, circulatory). The ICD is revised and Cha 20:pte r
Mortality studies.)
expanded occasionally. During the 1980s and the
first half of the 1990s, the ninth revision of the ICD Population data (denominator)
was the standard one. It has subsequently been
replaced by the 10th revision. The ICD is used National censuses are usually conducted every
internationally and provides a basis for inter- five years. Any census has the problem of possible
national comparison. The quality of the classifi- under-enumeration. It has been estimated that
cation and of the reporting are crucial in deter- the 1991 census under-enumerated the African
mining the quality of the data. population by as much as 17% (Central Statistical
Although death registration is compulsory and Services 1992). In the other population groups,
permission for burial can only be obtained if the these figures are somewhat lower. Of course, the
death is registered, it has been estimated that as extent of under-enumeration differs geographic-
many as 50% of African deaths are not registered ally, but this has not been estimated. It is thus
(Botha and Bradshaw 1985), and thus not re- problematic to calculate rates for Africans, espe-
flected in the national mortality figures. Since we cially in the case of mortality where the numer-
cannot assume that underregistration is uniform ator data (mortality figures) are also inaccurate.
with regard to cause, age, sex, or area, this data For the African population, it is best to use locally
must be used cautiously. In addition, it has been available metropolitan or regionally collected
found that up to 20% of registered African deaths data (such as the African population estimates
are coded as being due to ‘symptoms, signs, and which the HSRC provided for the greater Cape
ill-defined conditions’. The majority of such Town area in 1988).
death certificates are completed by police officers As health services move to district manage-
(Van der Merwe, Yach, Metcalf 1991) and head- ment, there is an increasing need for an author-
men in the rural areas. This hampers the inter- itative source of district level population projec-
pretation that can be made regarding the other tions for the years after a census.
causes of death in this group.
The Births and Deaths Act was revised in 1992
Births
and the medical certificate changed so that
details of the cause of a non-natural (external This information is crucial for determining the
cause) death need not be submitted. This has infant mortality rate. National data are only avail-
been disastrous for surveillance of injury-related able some years after collection, and in the case of
deaths. Authorities such as the Western Cape Africans, are particularly incomplete. Most large
Regional Services Council have therefore set up cities include the number of births, based on noti-
their own surveillance system by collecting this fications, in their annual reports.
data at the mortuaries.
To gain insight from the published mortality
Morbidity data
data, we can calculate proportional mortality or
mortality rates. In the case of proportional Certain diseases, mostly of an infectious nature,
mortality, one calculates what proportion of all are notifiable to the Department of Health. The
136
13 = the use of routinely available data in epidemiological studies
major diseases include tuberculosis, hepatitis B, (for example, doctors to population ratio). This
measles, malaria and typhoid. Unfortunately, can be used to describe the distribution of avail-
health professionals do not always notify the able resources (see Example 13.2).
authorities of these conditions, and deaths are Various budgets can be used to gather inform-
particularly poorly reported. When considering ation about health care expenditures in the public
deaths due to notifiable conditions, it is advisable sector, and data from RAMS medical aid claims
to use the registered mortality data. can be used for information about expenditure on
A national cancer registry based on histological services. It is, however, extremely difficult to
diagnoses has been created by the South African obtain information regarding private consump-
Institute for Medical Research. Annual reports are tion of pharmaceuticals. Health care expenditure
published giving incidence rates by age, sex, can be expressed either as a total amount, or per
population group and site of the cancer. Disease- capita.
specific registers are very important for the sur- * ReHMIS (Regional Health Management In-
veillance of the disease, as they can provide more formation System) is a national survey of the
accurate measures of incidence than mortality resources in the public health service (REHMIS
data. System Documentation 1994). This was initiated
Many records are kept by hospital administra- to help in the restructuring of the health service,
tions. Not much epidemiological use has been and is expected to be repeated on a regular basis.
made of South African hospital records. Analyses
of records from Groote Schuur and Red Cross Conclusion
Children’s hospitals have shown the potential
value of such sources, as long as the quality of dis- The epidemiologist has an important role to play
charge diagnoses can be standardized (Henley, at all levels of the health service by assessing the
Smit, Roux, Zwarenstein 1991). quality of routinely available data and meaning-
Clinic records also provide a rich source of fully interpreting such data. By letting the data
information. Much of this information is not providers know how the data is used and what its
generally collated, and anyone who wishes to limitations are, it is possible to change the data
analyse such data may have to spend a lot of time collection procedure so that it yields better qual-
extracting the needed information from patient ity data. The data providers are generally willing
records. Example 13.1 describes how clinic to discuss ways of improving the routinely col-
records were used to investigate methodological lected data once they realize that someone is
issues regarding the measurement of treatment making use of it. Feedback should also be given to
compliance. the staff who collect routine data, particularly at
The South African Sentinel Practitioner Re- local level. These staff are often unaware of what
search Network (SASPREN) is a network of pri- happens to the information, and they may feel
mary care practitioners who have developed a they are wasting their time filling in endless forms
surveillance system (Volmink, Furman 1991). A that no one ever looks at. Feedback may motivate
pilot project has demonstrated the feasibility and them and thus result in better quality inform-
value of the routine collection and processing of ation. In addition, they can describe the practical
selected data, such as trends in hypertension and problems they experience in collecting the in-
depression. formation, and solutions can be found jointly.
pliance information was collected from the pa- The Hospital and Nursing Yearbook, statistical
tients’ treatment cards. reports from the provinces and the Central Stat-
On the last day of acceptance into the group (30 istical Services, as well as reports from the Na-
June 1987), a cross-sectional survey was con- tional Association of Private Hospitals were used
ducted identifying those patients in the group as sources of hospital data. For the clinic data, the
who were still on the clinic treatment list. Compli- Yearbook, reports from the regional directors of
ance and associations between demographic the then Department of National Health and
variables and compliance were measured at this Population Development, timetables from clinics
point (cross-sectional study, compliance to date), under various local authorities, and the statistical
as well as at the end of treatment (longitudinal abstracts from the Development Bank of South-
study, end compliance). A patient was defined as ern Africa were used.
compliant if he or she had attended on 75% or To verify hospital information, the 1988 Hos-
more of possible treatment days. In the longitudi- pital and Nursing Yearbook was regarded as the
nal survey, a time lag of some months was anti- most comprehensive source. All other sources, as
cipated until all patients had completed their well as other editions of the Yearbook, were com-
treatment. To contain the study, a cut-off point pared to it. For a random sample of hospitals, in-
was defined after which compliance was impos- formation was confirmed telephonically with
sible (160 days after entry into the cohort, since hospital administrators. The hospital data of the
treatment consists of 120 treatment days). Yearbook was found to be comprehensive, but
The cross-sectional study underestimated non- errors were found in totals of figures. Hospital bed
compliance: 19,5% compared to 40,3% obtained data of the various sources did not differ mark-
in the longitudinal study. The cross-sectional edly. The hospital data was updated using the
result was biased, as this study only included 1994 Yearbook.
treatment survivors, and measured compliance No data source was found to have a compre-
of all patients on a particular day, irrespective of hensive list of clinics within the provinces. There-
the time since notification. fore, exhaustive telephone contact with local
In both analyses it was found that children were authorities was made to improve the information.
less compliant than adults. The provincial distribution of hospital beds per
1000 population is shown in Table 13.2. The
Reference: report points out that the total number of private
Youngleson S M, Joubert G. ‘A comparison of cross-sectional and public sector beds does not accurately reflect
and longitudinal survey methods in measuring patient com- the number of beds accessible to the general
pliance with tuberculosis treatment.’ South African Medical population. Therefore, the number of public
Journal 1991; 80(7):331-5. sector beds (referral, general, and special) as well
as acute public beds (only referral and general)
per 1 000 population was also calculated by pro-
vince. The provinces were fairly comparable with
EXAMPLE 13.2 respect to acute public beds, apart from Mpuma-
langa which had 1,5 acute beds per 1 000 popula-
Health facilities in the nine provinces
tion (compared to the other provinces, which
of South Africa ranged from 2,0 to 2,5 per 1 000 population).
Since the provision and distribution of health ser- The distribution of clinics is shown in Table
vices is an important aspect of the development 13.3. Some provinces do worse than the WHO’s
of a national health plan, this study was done to recommended 10000 people per clinic. This
collate, verify, and analyse data on the number information has not been routinely analysed
and types of hospital beds (Table 13.2) and the before, and the quality is thus not known.
number of fixed clinics (Table 13.3), and relate The report recommends that comprehensive
these to the population sizes in each of the nine analyses should be carried out at regional and
provinces of South Africa. A comprehensive data- subregional level, taking into account, for exam-
base of this information was compiled for the ple, population density and growth, location and
period 1988-1993. accessibility of services (Chetty 1995).
138
13 the use of routinely available data in epidemiological studies
Table 13.2 Distribution of total public and private hospital beds per 1 itil)
population, South Africa 1993
Reference:
Medical Journal 1995;
Chetty K S. ‘An integrated analysis of health facilities in the nine provinces of South Africa.’ South African
85:245-50.
139
14 Disease
surveillance
140
14 disease surveillance
Se Se
a a
Sram meMC em Re mE tl
Incidence rate
schoolchildren to remain at home for seven days veillance often stimulate health-related research.
after treatment or three weeks if appropriate
treatment was not given, treatment for contacts Main purposes of surveillance:
(erythromycin for upper respiratory tract infec- @ Morbidity and mortality reporting
tion) and strict adherence to immunization ¢@ Documenting distribution and spread of
schédules regarding diphtheria, pertussis, and diseases
tetanus (DPT). Whooping cough had been made @ Establishing long-term trends in disease occur-
notifiable in the city of Cape Town in 1974, unlike rence
the rest of the country, where it was not notifiable. Detection of epidemics
In 1994, pertussis was made notifiable nationally. @ Identifying high-risk groups or areas
Surveillance data is useful in evaluating inter- @ Estimation of magnitude of the health problem
vention measures. A mass measles immunization Facilitating planning of control and prevention
campaign in 1990 was evaluated using surveil- measures
lance data on measles admissions to a tertiary Evaluation of intervention measures
hospital. An initial short-term evaluation (six @ Resource allocation in public health planning
months post-campaign) found the campaign to @ Setting research priorities
be effective in reducing measles incidence (Figure @ Archival information for describing the natural
14.2) (Abdool Karim S$ S, Abdool Karim Q, Cha- history of diseases.
mane 1991).
However, when a second evaluation was done Surveillance data can also be used to direct
using the same surveillance data over a longer resources for swift and effective control, as well as
period (21 months post-campaign), it was found to estimate the magnitude of the health problem
that the effect of the campaign had been lost (in the long term).
(Abdool Karim S S, Abdool Karim Q, Dilraj, Cha-
mane 1993). The prime reason for this was that Methods of surveillance
the high level of vaccination coverage achieved
during the campaign had not been sustained. Surveillance data is typically obtained through
The main purposes of surveillance are sum- health provider-initiated reports, such as noti-
marized below. Unusual events detected by sur- fication forms filled in by nurses (passive surveil-
141
d common approaches in epidemiology
Pre-campaign Post-campaign
N J M M J $ M M
1989 1990 Month 1991
Source: Abdool Karim S S, Abdool Karim Q, Dilraj R, Chamane M, 1993.
lance) or health department-solicited reports, veillance systems may be used to support or com-
such as the recruitment of throat swab specimens plement each other.
from health centres by the National Institute for
Virology for the isolation and monitoring of Routine surveillance
influenza viruses (McAnerney, Johnson, Schoub Routine surveillance means that every case of a
1994) (active surveillance). Passive surveillance is particular condition seen must be reported and
good for conditions that have clear symptomato- counted. Routinely reported data is typically
logy (and few asymptomatic infections), such as obtained via passive surveillance. A typical exam-
measles. Active surveillance is useful for diseases ple of routine surveillance is notifiable disease
which can be easily missed — it reminds health reporting. The main reason for making diseases
care providers to be on the lookout for these con- notifiable is to control those diseases which con-
ditions. A good example of a disease that needs stitute a danger to public health. Thirty-six condi-
active surveillance is malaria, where it is import- tions are notifiable in South Africa, and these are
ant in controlling malaria to detect asymptomatic mainly infectious diseases. A current list of notifi-
malaria carriers, in order to reduce the prevalence able conditions is readily available in Epidemio-
of parasite carriers. logical Comments, a publication of the Depart-
Passive surveillance is much less costly than ment of Health. The 1994/95 list of notifiable con-
active surveillance and is therefore used most ditions is shown in Table 14.1. Notifiable disease
often. reporting is the main routine system for morbid-
Methods of disease surveillance include rou- ity data collection in South Africa.
tine reporting, sentinel reporting, special surveys If a condition is notifiable, the notification to
or programmes, and outbreak investigations. health authorities is a statutory obligation, the
These are discussed in turn below. Several sur- regulations of which are contained in the Health
142
14 = disease surveillance
surveillance rather than with total population While data from sentinel sites, such as ante-
passive surveillance. Data from well-selected sites natal and sexually transmitted diseases clinics,
such as specific hospitals or health centres is have traditionally been used to estimate the pre-
available more quickly, is more reliable, and costs valence of HIV, special population-based HIV
much less than data for a whole region. This in- sero-surveys conducted on several occasions in
formation can act as an early warning system. For conjunction with the malaria programme have
diseases with a relatively high incidence, such as also been useful in estimating the HIV prevalence
tuberculosis, health centres (TB clinics) could be in rural KwaZulu-Natal (see Kustner, Swane-
selected as sentinel reporting sites. For less fre- velder, Van Middelkoop 1994, and Abdool Karim
quently occurring diseases requiring special care, Q, Abdool KarimSS, Singh, Short, Ngxongo 1992).
for example, neonatal tetanus, tertiary hospitals
could be used. An example of a sentinel hospital is Outbreak surveillance
Clairwood Hospital in Durban, as it is a referral Without surveillance it would be difficult to
hospital for several infectious diseases, in particu- detect outbreaks of disease — if the baseline pre-
lar measles. Admission data from this hospital has valence of disease is not known, then epidemics
been used with reasonable accuracy to depict the cannot be recognized. Many epidemics can go
trend in measles incidence that occurs in the undetected if no surveillance is in place. When a
whole province of KwaZulu-Natal. disease outbreak occurs, investigation becomes a
Other sentinel surveillance systems include the high priority. Outbreak investigations are under-
influenza surveillance system run by the National taken to establish the source and/or cause of the
Institute of Virology, the Medical Research Coun- outbreak. The main aim of an outbreak investig-
cil’s trauma surveillance system in the Cape and ation is to quickly control the outbreak with tar-
KwaZulu-Natal, the South African General Practi- geted interventions. It also provides important in-
tioner Network, and the National Cancer Registry. formation on vaccine efficacy, age-specific attack
rates and age-specific case fatality rates. The
Special surveys/special surveillance increase in measles incidence in South Africa in
programmes the 1992-4 period has been largely due to out-
Periodically, new or unusual disease problems breaks in older children (over five years) who had
require special surveillance programmes to be set been previously vaccinated. An investigation of a
up. With the emergence of multi-drug resistance measles outbreak in a Cape Town school revealed
and the greater risk of tuberculosis among those an attack rate of 7,6% (25/329 children), immuni-
with HIV/AIDS, a national tuberculosis pro- zation coverage (at least 1 dose) of 91% and vac-
gramme was initiated to document cases and cine efficacy of 79% (Coetzee, Hussey, Visser,
treat tuberculosis more accurately. A National TB Barron, Keen 1994). Outbreak investigations can
Register was initiated in 1994 to document in- also identify certain social, ethnic, or religious
formation on every case of TB in South Africa. clusters of susceptible individuals.
Since malaria is endemic to northern KwaZulu- The steps in conducting outbreak investiga-
Natal and sections of Mpumalanga, a special pro- tions are covered in Chapter 21 of this book. The
gramme, the Malaria Control Programme, was importance of surveillance in the post-outbreak
developed to address this problem in this region. phase, as a tool for evaluating preventive inter-
Surveillance agents of this programme visit vention during the outbreak, should not be over-
almost all houses approximately once every six looked.
weeks and collect clinical data as well as blood for
parasite microscopy. In this way, effective surveil-
Problems in surveillance
lance of malaria in this region is maintained. The
malaria programme is one of the few examples of Health care providers, health officials and policy-
active surveillance undertaken by the Depart- makers need rapid access to accurate, timely data
ment of Health in South Africa. in a useful form. Lack of such data results in poor
Another example is sewer pads for active sur- planning, unnecessary illness, mortality, and
veillance for cholera in water sources. To find out economic costs.
whether cholera is present in the community, it is There have been many criticisms of passive
more cost-effective to place a pad which retains surveillance. Under-reporting has often been
microorganisms from sewage than it is to screen cited as the major problem with passive surveil-
members of the community. lance. Consequently, a study to determine the
144
14 = disease surveillance
tte cn pee sen sc A
ei
reasons for under-reporting of notifiable condi- ably because unqualified staff fill in the forms
tions was conducted amongst doctors at King without adequate supervision.
Edward VIII Hospital in Durban (Abdool Karim, Since one of the main uses of surveillance data
Dilraj 1996). Some of the reasons for under- is to institute swift disease control efforts, time-
reporting highlighted in that study are discussed liness of reporting is important. However, admin-
here. istrative delays often hamper timely and effective
@ There is poor knowledge of conditions that are disease control efforts.
notifiable. Although 35 conditions were report- Surveillance programmes are very seldom sub-
able in the year of the study, doctors correctly ject to regular evaluation or validation. Despite
mentioned an average of only six of them. the problem of under-reporting in passive sur-
@ Health workers are not aware that any person veillance, it is still possible to obtain meaningful
who is legally competent to diagnose the condi- incidence rates and trends from notification data,
tion is required by law to report it. provided that both the level of under-reporting
@ The notification form is too complicated, as too and the consistency of these levels are estimated.
many details are requested. This discourages Such an approach was used to validate hepatitis B
reporting, or results in poor-quality data being reporting (Abdool Karim SS, Abdool Karim Q
sent in from service level. This in turn compli- 1991; see Example 14.1).
cates analysis of data at higher levels of the Active surveillance is an effective form of data
health services. collection because completeness and timeliness
@ There is lack of feedback on reported cases. The can be ensured, but it is not implemented widely
analysis of data received at every level of report- because of the high cost involved. Where speci-
ing is not fed back timeously to those who do mens are essential for diagnosis in asymptomatic
the reporting. If and when feedback occurs, it is cases, laboratory cost is an added expense.
often not done ina useful and accessible form. Depending on the disease condition under active
Without feedback, health workers adopt the surveillance, telephone and transport expenses
attitude that it makes no difference whether or could add to the cost. A further limitation of active
not they report or what they report. National surveillance is that the size of the area may be too
publications of the Department of Health such small for proper estimation of incidence rates.
as Epidemiological Comments, and provincial For example, for cancer, which has an incidence
newsletters such as Epidemiology Focus in of 150/100 000, a large area is needed to get stable
KwaZulu-Natal, address this problem to some rates.
extent. Data on isolations and antibiotic resist- In sentinel surveillance, the selection of sites is
ance of several bacterial organisms are pub- crucial. If sites are not well chosen, the introduc-
lished in the South African Medical Journal. tion of bias will make the results unrepresentative
This feedback is provided by the Antibiotic of the total population. People who choose to
Study Group of South Africa, which collates the make use of these services and who are included
data from seven academic hospitals through- in the surveillance may be different from those
out the country. who use other services. Sentinel surveillance is
@ There is lack of training in surveillance at also inappropriate for monitoring rare diseases,
undergraduate level and continuing medical such as polio. Since a single case of polio is re-
education. The medical schools and nursing garded as an epidemic, the consequence of miss-
colleges need to include the notification ing one case is enormous. Thus every health ser-
process and the importance of notification for vice has to be involved in the monitoring in order
disease control and health planning in their to detect every single case of suspected polio.
undergraduate training.
@ At understaffed institutions, the doctors are too Conclusion
busy to spendalot of time filling in the notifica-
tion form. In such instances, other hospital per- While the major criticism of passive surveillance
sonnel should be used to support the doctors in is under-reporting, it is not always critical to
their reporting function. obtain complete counts of most diseases in order
to plan and undertake disease control interven-
Completeness of reporting is another problem tions. When the incidence of a disease is high, the
with routine surveillance, in that returns are sent trends instead of the complete counts of the cases
incomplete to the regional offices. This is prob- are adequate to plan and institute disease control
145
d= common approaches in epidemiology
efforts. This is provided that the under-reporting utilization of the virology laboratory service was
rates are reasonably consistent and the under- used as a basis of comparison to validate the
lying reporting mechanism does not vary unpre- hepatitis B notification data.
dictably. The level of under-notification was estimated
On the other end of the spectrum, diseases tar- by comparing the number of notifications for
geted for eradication, such as polio, demand sur- hepatitis B received by the Department of Health
veillance systems that are sensitive enough to with the number of positive hepatitis B laboratory
rapidly detect every case of the target disease. For results submitted to the NIV for the period
polio eradication, use of the case definition based 1985-8. Virologists estimated that approximately
on acute flaccid paralysis (AFP) enables the sur- 10% of positive laboratory results were repeat
veillance system to detect virtually all cases of tests, and the number of positive results was
paralytic polio. The choice of a case definition can therefore adjusted by this figure. It was found that
thus determine the sensitivity of the surveillance hepatitis B was grossly under-reported. Nation-
system for the disease in question. Effective ally, on average only one case of hepatitis B was
surveillance systems have detected AFP rates of notified for every seven positive hepatitis B labor-
1/100 000 and 0,45/100 000 children aged under atory results. There was a marked difference in
five years in the Americas and Tunisia respect- the regional notification : laboratory result ratios,
ively. In such instances, zero reporting is very ranging from 1:3 in the Cape to 1:17 in the Trans-
important when there are no cases. vaal.
It should be borne in mind that for vaccine The consistency of under-notification over
preventable diseases, surveillance should include time was fairly good nationally, ranging from 1:7
not only disease data, but also vaccination cover- to 1:9 over the four-year study period. The con-
age and vaccine efficacy. In vector-borne diseases sistency of under-notification varied greatly
such as malaria, surveillance of the vector popu- according to region, however. The Cape region
lations (distribution and species) is also part of had the highest and most constant level (1:3) in
the surveillance system. each year of the study, while the former Transvaal
While the technology for good surveillance sys- at the other extreme had very low and erratic
tems is available, the necessary commitment to levels ranging between 1:14 and 1:25 over this
apply these for the benefit of public health is not period.
always present. Surveillance systems should be The marked differences in the level of notifica-
evaluated to assess the accuracy and complete- tion between regions may spuriously create
ness of reporting, as well as the timeliness of considerable inter-region differences in incid-
results and the usefulness of the results to policy- ence rates. It is essential therefore that differences
makers. in the reporting level be taken into consideration
when making regional comparisons of hepatitis B
incidence rates calculated from notification data.
EXAMPLE 14.1 On the other hand, the small differences between
Validation of the hepatitis B report- the national reporting ratios for each of the study
years indicate that the analysis of hepatitis B noti-
ing system in South Africa fications could provide useful information on
Since the different types of viral hepatitis infec- trends in the incidence of this disease at a na-
tions are clinically indistinguishable, serology is tional level.
essential for diagnostic purposes. Consequently,
clinicians with access to laboratory facilities tend Reference:
to use them, and the laboratory data includes a Abdool Karim, S S, Abdool Karim, Q. ‘Under-reporting in
large proportion of the clinical cases of hepatitis B hepatitis B notifications.’ South African Medical Journal
being seen in the health care service. The high 1991; 79:242-4.
146
15 Health systems
research (HSR)
147
d= common approaches in epidemiology
and values are turned into health and health care action to achieve these goals. Once the goals of
behaviours. The cultural beliefs about health and the entire health system or a part of it are defined,
health care form the basis for health seeking and policies for achieving these goals must be se-
health care delivery behaviour. The societal lected. Policy evaluations assess whether the poli-
framework includes the family and the commu- cies of stakeholders are consistent with each
nity, while the institutional framework includes other and with health goals, and whether or not
other sectors, such as the education system, they are being implemented.
which impact on health and the health care A large NGO is funding a national project which
system. aims to improve the partnership between organi-
The health care system is the collection of zations training health professionals, the com-
people (clinical and support staff, communities, munities to be served, and the health services
families, and patients), things (buildings and authorities, by providing support to conduct
vehicles, drugs and money, timetables, rules and more student training in clinics within the com-
plans) and events (consultations, procedures, munities concerned (Zwarenstein, Lewin 1995).
meetings, and evaluations) that come together (as An evaluation is planned which will assess the
health care) to provide assistance for illness, ill- perceptions of the project aim held by each of
ness prevention, and health promotion. Although these partners. Descriptions of the number of stu-
HSR offers useful insights for research on the dents spending time in community clinics will
entire health system, in this chapter we empha- help to showif the policy is being implemented.
size research on the health care system. The infrastructure of the health service consists
of the physical and human inputs, and the proc-
The stages of health care delivery edural framework which forms the underlying
Health care can be thought of as if it were an foundation of the service. Evaluations of inputs
industrial process, which starts with production may assess the distribution and amount of long-
plans (policy), builds a factory and develops rules lasting inputs, such as buildings. Another type of
and procedures for running it, adds raw materials structural evaluation may assess the procedural
(inputs), combines them in processes to produce framework — such as the quality of the training,
health care of different kinds (outputs), which facilities, administrative and technical proce-
may lead to improved health (outcome). dures needed to provide care — against existing
This production line is permanent, but not un- norms, previous measurements, or other com-
changing. A well-managed health care system is parable areas, services, or facilities.
repeatedly re-examined and improved. HSR can A study was done to evaluate the quality of
contribute to this repeated cycle of planning, pro- health care provision in a sample of public and
duction, and evaluation, followed by improved private sector primary care sites (Beattie, Rispel,
planning and production. Broomberg, Price, Cabral 1995). Several indica-
Policy development is the first stage of health tors of quality of care were measured. These were
care delivery. Policy is the vision of a government assessed using indicators for the condition of the
or organization, stated as goals and plans of buildings and grounds, privacy for examination
148
15 health systems research
and changing, adequacy of space, and presence tended regularly, and whether the care that they
of patient toilets. These assessments were con- received (for example, regular eye testing to look
ducted by an external team of evaluators. for early, treatable complications of the disease)
Inputs are the people, material goods, and met internationally accepted guidelines.
financial resources which go into making up the Outputs are the items or units of care which are
health care system. They include both the long- produced in the interaction between health care
lasting capital inputs (fixed facilities such as clinic processes and inputs. Output evaluations fre-
or hospital buildings, durable capital equipment quently tell us how much care is being produced.
such as ambulances and X-ray machines) and the We may want to know if these outputs are being
shorter duration inputs paid for with running produced with a minimum of waste, which
costs (including equipment and consumables, requires a comparison of inputs and outputs.
such as scalpels, intravenous catheters, ban- Routine output evaluation of immunization
dages, drugs, and stationery, and personnel, such programmes can be based on the number of
as nurses, doctors, administrators, and cleaners). immunizations conducted ata site by bringing in
Input evaluations compare the inputs provided a denominator, the number of children of immu-
against existing norms or standards, previous nizable age. This will measure coverage as a pro-
measurements, or comparable areas and ser- portion of eligible children, and is reported
vices. They may also be analysed to see whether annually in most cities’ Medical Officer of Health
resources are being distributed fairly (equity). reports.
Where comparison between areas or over time is Outcomes are health states of people, groups,
the main purpose, an indicator might be selected or communities that result from interaction with
from among all the inputs to illustrate the com- the health system. Outcome evaluations check
parison. One example of such an indicator is ‘hos- that the intervention was effective, that the
pital beds’. Inputs are measured to get an idea of desired health goal was achieved, and that the
the cost of an element of health care. The various patient, client, or community obtained preven-
inputs are often summarized by their total money tion, amelioration, cure, or rehabilitation of a
value. particular problem.
A major problem in national health planning in Continuing the immunization example above,
South Africa is the absence of expenditure data. the outcome of improved immunization coverage
The National Health Expenditure review (Health is decreased incidence of the diseases against
Systems Trust 1995) has added up how much which immunization was conducted. A study was
money is being spent on what kinds of health done of the trends in admissions to a major infec-
care. This understanding of total financial input tious diseases hospital, to assess the impact of a
into the health sector is a great help in national measles immunization programme on the
planning. Data on physical resources, such as disease (Abdool Karim S S, Abdool Karim Q,
staff and beds, is also described by province and Chamane 1991). It showed an initial fall in admis-
by magisterial district. Thus, for the first time, sions, followed by a gradual rise to previous levels
resources can be allocated fairly according to geo- (see Chapter 14: Disease surveillance).
graphical area. This helps in planning the redistri-
bution of resources to ensure increasing equity. Health care goals: the elements of quality
Processes are the complex interactions which The idea of quality is more complicated in health
occur in the delivery of health care, and which care than in most industries, because so many
take place when infrastructure and other inputs elements are included in the notion of quality
are applied to communities or individual patients (Table 15.1). Each element of quality can offer a
with health problems. Process evaluations assess starting point from which to develop a standard
the way in which care is provided, and whether it against which a health care activity can be evalu-
meets any institutional guidelines which may ated.
exist. Although quality is sometimes used as a
A study of the quality of diabetes care in public - synonym
ynonym for single
g elements, it is also an overall
sector primary care clinics was done in the term for all of these elements. Quality care offers
Western Cape (Levitt, Bradshaw, Zwarenstein, all of the elements described.
Bawa, Katzenellenbogen, Dopfmer [1996]). This The table excludes a number of elements, such
study assessed a representative sample of patient as cost. Cost, the amount of resources used to
records to see what proportion of patients at- offer a service measured in money terms, is dealt
149
d= common approaches in epidemiology
ELEMENT DESCRIPTION
Comprehensiveness The ability of the service to meet the wide range of needs of individuals and
communities
Appropriateness The level and range of care given the needs of the patient and community and
the resources available
Acceptability The match between patient and community expectations and the care provided
Coverage The proportion of the target population successfully in contact with the health
service
Adherence How well treatment regimens are followed — determined by clinical, patient,
and community factors
Continuity The potential for continued interaction between the health service and pa-
tients or communities, such as through defaulter tracing programmes, out-
reach, and a coherent record-keeping system
Relations The quality of the social interaction between health workers and patients
Efficiency _ The relationship between input and output: maximizing the outputs for a given
Input
Equity The availability of equal care for people with equal needs
with in Chapter 30: Health economics. about the distribution of input, process, or output
Availability, affordability, accessibility, and per capita, in different areas, or by different social
acceptability are terms for different aspects of the groups. We may compare demand with need, or
‘fit’ between the health care supplied and the utilization with need as a way to identify commu-
potential users’ needs and wants. Availability nities with excess or under-utilization. Commu-
describes the degree to which supply of services nities should have equal use for equal need. See
and resources meets the theoretical need. Chapter 30 for further discussion of equity.
Affordability is an economic concept: to what Efficacy, effectiveness, and efficiency refer to
extent do the prices of services match the clients’ different aspects of the concept ‘how well does
income and willingness to pay? Accessibility this care work?’
takes into account transport, travel time, and Efficacy studies ask the question: Does the
distance. Acceptability describes users’ percep- health care intervention (usually a drug) work
tions of the appropriateness of services, and when given under ideal circumstances (a care-
emphasizes communication and mutual respect fully carried out randomized double blind con-
between staff and patients, continuity of care, trolled trial with complying patients)?
and convenience. Effectiveness describes the ability of the inter-
Equity studies try to find out if a health service vention to work in the messy real world, with
is fairly shared out in proportion to need. These ordinary patients under normal health service
studies are usually based on routine information conditions.
150
15 health systems research
Efficiency is an economic term with two broad study is to be done). This will be illustrated with
meanings. an example of a health care activity, and the HSR
Technical efficiency involves minimization of which could be carried out to evaluate and plan
resources (such as staff and time) used in under- improvements to it.
taking a particular task. This is often evaluated One of the most cost effective interventions in
using operations research methods. Allocative all of health care is immunization against child-
efficiency asks whether a particular health service hood illnesses, especially in impoverished com-
approach is the best way to use resources to max- munities, where immunity is poor and severe
imum effect. Here the term is used for studies complications of otherwise mild diseases are
which ask either ‘Is this the cheapest way to common. Several different HSR studies could be
achieve these results?’ or, ‘Is this health pro- useful in improving immunization programmes.
gramme worth implementing in comparison with We will illustrate a few of the most important
other uses of these resources?’ These studies questions that managers may wish to answer
usually require help from an economist (see about immunization, categorizing them accord-
Chapter 30: Health economics). ing to the structure outlined in Table 15.2.
Often, immunization programmes have been
An integrated view of HSR: linking operating for some time, but their success is un-
known. A study looking at the output (stage) of
research design with sectors, levels,
the immunization programme would be useful.
units, stages and elements of care In this study, coverage (element) would be appro-
HSR can address many different questions. In priate to assess success. One approach would be
order to understand the possibilities, it may be to do a descriptive, random community sample
useful to see health care as a complicated system survey (study design) in one district (level) of chil-
in which different sectors operate (public and pri- dren (unit) to see if they were immunized appro-
vate), each with several levels (first contact, refer- priately for age. This study thus determines out-
ral). Health care delivery consists of several stages put per population.
(policy, process, outcome), is organized into dif- We immunize children because we want to
ferent units (clinic, carer, consultation, client) protect them against a particular disease, say
and can be evaluated according to performance measles. The desired outcome (stage), then, is a
in any of several elements (equity, effectiveness, fall in incidence of the disease, measles, and a
accessibility). reduction in deaths caused by it. Another
Any HSR study can be categorized and placed approach would be to look at time trends (study
by selecting one description from each column in design similar to a before and after study, but over
Table 15.2. It shows some examples of descrip- a longer period) in reported measles cases, with
tions, but others might be added. Any study can the assumption that a fall is due to the immuniza-
be fully described by stating the sector and level tion programme. This is a study of effectiveness
of the health system to which it refers, the unit, (element); the unit of study (children) remains
stage and particular element of health care deliv- the same, while the level (district) could stay the
ery to be studied, and the study design (how the same or change. If the level was raised (provincial
eae
Input Effectiveness
study deste
Rapid appraisal
Public Ward Carer
Private (for profit) District Clinic Process Equity Descriptive
Private (not for profit) Region Hospital Output Cost Analytic
Province Contact | Outcome Efficiency Before-after
National Script Coverage Cross-sectional
International Patient Satisfaction Case-control
Individual Need Cohort
Community Trial
151
d= common approaches in epidemiology
or national), the main purpose of the study might problem identified by a health service purchaser
be to assess equity (element) between areas. (in some countries, health ministries have
What would a manager do if coverage was created authorities which ‘buy’ health services
found to be good (the output stage of the pro- from providers on behalf of the government),
gramme is successful) but measles cases or provider, or user. The problem is often tackled
deaths were still being recorded (the programme in such a way that more issues are taken into
is failing to achieve the desired outcome)? This account than appeared in the original problem
implies that either the inputs or the processes of statement. In other words, a comprehensive
immunization (stages) are deficient in some way, understanding of health and health care in a
and studies of these should be done urgently. social, political, and economic context is applied
Since protection may fail if the vaccine is not kept to research.
cold (a subsidiary goal of the measles immuniza- Because it is problem-based, it is also problem-
tion programme), evaluation of the process of solving — often providing cure, but sometimes
vaccine storage needs to be conducted. prevention. If patients are dissatisfied with care,
Other studies could also be useful: immuniza- service providers need to know quickly what the
tion has to be given in certain ways, which could main problems are, and how these can be rem-
be evaluated, as could the success of the training edied. If the research is only made ayailable in
received by nurses in administering the immu- two or three years time (the usual gap between
nization. (Both of these are process evaluations of beginning and publishing research), patients will
quality, the first of the unit contact, the second of have moved on or died. Often such studies must
the carer.) Do patients believe in and want be quick and their questions must be very
measles immunization (the element of accept- focused. Much of the skill in HSR lies in making
ability)? trade-offs between speed of answer on the one
While the child is being immunized against hand, and depth of information on the other.
measles, are its other health problems attended to Solutions also have to be relevant to local needs
(the element of comprehensiveness)? Do staff and circumstances. This requires more local
relate well to patients (the element of relations)? knowledge than any outsider can have, and so,
given that HSR skills are scarce, projects are often
Characteristics of HSR conducted as collaborative research, where a
HSR is different from most epidemiological re- skilled consultant collaborates with an insider
search in its approach and the way it uses differ- who needs the results of the research to improve
ent methods. The characteristics of HSR are listed his or her health care provision. The research
below: users may then conduct the study with help from
¢@ HSR focuses on finding implementable solu- a consultant. The ability to share research skills is
tions to priority problems. an absolute requirement, and itself an important
@ HSR findings help to improve services which goal of HSR.
lead to better health.
HSR provides usable timely information to Methodology
decision-makers at all levels. HSR problems are usually complex, and may
HSR is participatory, involving concerned need a combination of several methods from
parties in all stages of research. multiple disciplines. An HSR project will often
@ HSR demystifies research and decentralizes it begin with a situation analysis. This may help to
from high-level institutions. formulate the problem to be solved. From this, a
@ HSR can be done by people in services as part of number of questions might be generated. These
their daily activities. questions could be addressed by qualitative or
¢@ HSR emphasizes simple, fast, low-cost study quantitative methods.
designs. Qualitative approaches drawn from the social
@ HSR is multi-disciplinary, and uses multiple sciences include depth and focus group inter-
methods. views and text analysis. Thus an HSR study might
@ HSR is iterative: it follows the chain of action/ include depth interviews to understand patient or
evaluate/plan/new action/ re-evaluate/re-plan. staff perceptions of a service or a care problem, or
focus group interviews to allow different percep-
Approach tions of a problem to be discussed.
HSR is problem-based, usually responding to a HSR studies create a number of specific diffi-
Wa2
15 health systems research
153
d= = common approaches in epidemiology
ee a a i ST
tained, that issue may need to be put on hold for as to influence decisions, or not implementable.
some time, until it rises higher on the agenda. The
exact evaluation question might also change over 4 Is your chosen research problem important?
time, to match the changing needs for inform-
ation. Health care programmes may also change, Research skills and resources are scarce, and
both in personnel and methods of operation. should be focused on important problems. A
Evaluations should thus also be repeated and problem is important if many people are affected,
ideally integrated into the process of health care if the diseases involved are serious, with major
delivery, to be conducted regularly by those who health consequences, or if many people use the
deliver care. health care services that are identified for evalu-
ation. If the costs of care, or lack of it, are high and
2 Do you understand the context? if policies are about to change, the issue becomes
more important. Sometimes a question is of aca-
Before an evaluation is possible, the environment demic interest only. If, for example, no alternative
should also be understood: the background, the policy option exists, there is little point evaluating
surroundings, or the context in which the prob- policy options. Also, are you sure that research is
lem is defined. The objective and subjective necessary? Are you sure that adequate answers
health needs of the community served by the are not already available?
health system are an important part of this con-
text. The community and the society, its politics, 5 Is your research politically feasible?
its culture, its economy, the available resources,
its social development, and the degree of partici- The questions for an HSR study should be
pation in decisions are all important aspects of discussed with and reflect the priorities of man-
the environment which should be understood agers and providers of care, as well as the con-
before embarking on an HSR study. The health cerns of the community in which the service
system itself is part of this environment. The exists. If any stakeholder groups are hostile to the
researcher needs to understand where the prob- research, or feel it is irrelevant, success is unlikely.
lem and service under study fits into the picture Either the research itself will fail or the recom-
of the structure and functioning of the whole mendations will be out of touch with key stake-
health system: its size, its place in the organiza- holders’ needs, and therefore not implemented.
tion, the organizational culture, the cost of care, But, on the other hand, there must be enough un-
the skill and experience of staff, their relations certainty among all stakeholders to allow space
with each other, with managers, politicians, for research to influence policy. If minds are
users, the community, and other services. The made up already, this may not happen.
‘target’ population and the planned activities and
expected benefits of the health care programme 6 Can your research be done in the available
under study should be clear. These details are time?
usually found by studying the goals and object-
ives of the programme closely. If these are not Policy-makers have shorter deadlines than
understood, you should probably undertake a researchers; often a decision has to be taken fast
situation analysis before starting a specific HSR for budgetary or public pressure reasons. In this
study. situation, an HSR researcher needs to assess if the
research can be completed in time to influence
3 Is your research question specific? the decision. If not, a simpler, quicker, less
acceptable research design may be needed, or a
Evaluation questions need to be specific. Is your longer-term question needs to be formulated.
question understood by all participants in the
same way? 7 Will your answer be heard?
Does the question specify a sector, level, stage
of care, unit of study, and issue which are to be Sometimes the perceived quality of an answer is
investigated? Vague questions result in vague affected by whose answer it is. Will the research
projects which may never finish. Even if the pro- team have the credibility to produce an accept-
ject does reach an end point, the answers may be able answer? If not, you should consider collab-
vague, open to re-interpretation, not phrased so orating with people whose credibility is greater.
154
15 health systems research
155
d common approaches in epidemiology
ee
ee
Kuhn L, Zwarenstein M. ‘Evaluation ofa Village Health and establish broad priorities for improvements.
Worker Programme: The Use of Village Health Worker It relied heavily on qualitative methodology to
Retained records.’ International Journal of Epidemiology understand service providers’ perceptions, and
1990; 19(3):11-18. simple routine data sources to estimate the load
Yach D, Hoogendoorn L, Von Schirnding YE R. ‘Village and available capacity of the staff to take on more
health workers are able to teach mothers how to safely pre- work.
pare sugar/salt solutions.’ Paediatric and Perinatal Epidemi- The researcher interviewed experienced staff,
ology 1987; 1:153-61. held group discussions with patients, staff, NGO
Yach D, Katzenellenbogen J, Conradie H. ‘Infant Mortality staff, and representatives of political organiza-
Rate Study: Hewu District.’ South African Journal of Science tions to gain insights into the running of the CHC.
1987; 83:417-21. He examined and re-analysed some routine data,
reviewed previous research on similar health ser-
vices in Khayelitsha and elsewhere, and observed
EXAMPLE 15.2 the functioning of CHC services by walking
Evaluating health care to help imple- around at different times of the day and week.
The CHC was clearly busy, but it had under-
ment change: the Khayelitsha studies utilized resources available, especially as staff
Sometimes an HSR collaboration must first find members were not always busy during after-
the questions that need to be answered through noons. There were gaps in services; for example,
an overview or situation analysis, and then look sick children were not always treated if they
for appropriate methods of answering them. This arrived for immunization, and children who
example describes how research contributed to arrived ill and also needed immunization often
the process of trying to improve the quality of care received care, but not immunization. There were
in a community health centre (CHC) in a large also overlaps; for example, patients with STDs
peri-urban settlement in Cape Town. were being treated by all three authorities, in
The CHC treated about 1 000 patients oe day, three different places, at different times, by differ-
and had more than 150 staff, employed by three ent staff, using different methods. The study also
different health authorities. The local authority collected many ideas for improving the services
was responsible for running specialized clinics for using existing resources.
sexually transmitted diseases (STDs), family plan- The situation analysis was presented, circu-
ning (FP), tuberculosis, and preventive and pro- lated, and widely discussed. It recommended that
motive care for well children under six. They felt the managers of separate authorities working in
that fragmentation of health authorities had the CHC should initially focus on combining cur-
caused gaps and overlaps in services provided ative and preventive services for children under
within one CHC, and wanted to co-ordinate their six years old, to eliminate overlaps and gaps.
services with those provided by the other author- While the integration of preventive and
ities, especially the provision of curative care for curative services was being planned, managers
ill children. Community organizations had also focused on the problem of long periods of waiting
criticized the quality of care and demanded im- for care, which had been identified in the situa-
provements. tion analysis, having attracted frequent patient
However, nobody had studied what the various complaints.
problems were, how severe they were, what was
causing them, or how they could be solved. The Waiting times
local authority thus asked a researcher to conduct The researcher designed a study in collaboration
a situation analysis, or rapid appraisal of the CHC, with management to measure how long patients
working part-time over two months. The situ- spent at the CHC curative and preventive clinics.
ation analysis tried to identify overlaps and gaps The study also aimed to find out when (times of
in the services provided by different authorities, the day, and days of the week) waiting times were
and to collect problems, their main causes and longest, and whether the flow of patients and
solutions as perceived by staff, management, work processes could be improved. One specific
patients, and local organizations. Qualitative and suggestion had been that patients receive their
quantitative methods were used. drugs during the consultation, without having to
This study was a multi-method overview of a wait at the pharmacy. The study needed to assess
given health setting, to identify priority problems, whether this would save time. The idea was that
156
15 health systems research
after a first study, improvements would be made hours of the week. Most patients arrived early,
to reduce waiting times, and then the study could causing crowding in the mornings, and an empty
be repeated. CHC in the afternoons. Patients waited only 15
The study hired a fieldworker who stood at the minutes at the pharmacy, which did not justify
only gate to the CHC for a week, noting patients’ changing the way drugs were issued. The patient-
entry and exit times. Each patient was given a slip flow mapping showed a number of stops where
of paper with the time of entry, which was taken patients were waiting unnecessarily (for example,
back at time of exit. Information on which cur- they could be weighed while waiting for a health
ative or preventive clinics were visited were also education session) or that could be cut out. The
entered by staff. Also, on three days, at different follow-up survey, after the preventive clinic had
times each day, a fieldworker recorded the times been streamlined, but the curative clinic had not
at which patients arrived at the pharmacy, and changed, showed that average waiting times in
the times at which they left with their drugs. The the curative clinic increased to 5,1 hours, but
researcher developed a flow diagram of how dropped to 1,5 hours in the preventive clinic.
patients moved though the clinic, noting every The study thus confirmed that waiting times
point at which patients were served, or waited to were a problem, and that reorganizing the pre-
be served. This survey was repeated in exactly the ventive clinic reduced the problem, but that
same way (excluding the pharmacy study) a year changing the way of issuing drugs would not solve
later. the problem.
The study showed that waiting times were very
long (an average of 4,1 hours in the curative clinic References: _
and 2,6 hours in the preventive clinic). Waiting Bachmann M. Site B Health Centre Khayelitsha: Situation
times varied a lot in the preventive clinic because Analysis and Recommendations. Community Health
immunization was only provided during a few Department, UCT, February 1993.
157
16 Rapid
epidemiological
assessment
158
159
rapid epidemiological assessment
sulpunojuoo
SW3|qO1d ajqeljas ‘Wueloie ‘pidey
YUM uonenjene pue SuLoUO/\ Spoujelw jO1]U09 8seg
uonenjers pue ZuLOUO|\
JOYUUOW 0}
yoinb pue ‘deayd ‘ajdwis aie JUusWSSsesse
YdIUM pasn sJozeoipul AxOld YSU pue snjeys YjjeoH suojeoipul AUNWWOD
pea}}09
SISOUBBIP JO9IOOU! Ul si eJep OU aJ8Y4M UJeaP JO
sunjnsai suajqoid |je0ay SaSned JO SJOJEDIPU! S@AID saipnys ALjeOW saisdoyne jeqia\
%OL Ueyy
1912313 9q pjnoyus saiyioes senbiuyoe}
uyyesy ye aje1 AaAijoq deayo pue ajdwis saipnys AUeUO Yyuig SuIpeda1d4
16
goue||ionuns
aniejuesaidas aq 9uljNOJ 0} pasedwoo Ayeyow pue
jou Aew says Jo ajdwes pauinbai seoinosei ssa] Aypiquoww Jo saipnis aouejsjlaAins jaunUaS spoujew soue]|I@AINS
JOPYUOW OF
yoinb pue ‘deayo ‘ajduis ae juaLussesse YSU quawussasse yieay
YdIUM pasn SJoyedIpUl AXOJd pue snjejs jejuawWUOIIAUA jequdwuOJAUS pidey
pesinbas aq jou Aew suljdwes saipnys
aanejuesaidal anneyjuenb suljonpuoo juaussesse
aq jou Aew sijnsey | uonewuojul yjdep-ul SepIAdld aJ0joq Juawssessy o1ydeisouyje pidey
pejeoljduo0o
anbiuyoe} suljdues pouinbe azis ajdwes |jews UONeN|eAS pue SULIOWUO|\ aoueinsse Aqijenb 107
(aouajeAaid)
suljdwes
spoyjew
aAnduoseq
salpnys
Asea Ajeolsiso7
Shavins
uones!unwiwu!
|jBWS
junoode OU! Uaye} eq
eae
pue
ysnw ZuLesnjo Jo s}oeyA pauinbes you swesy Buljdues uo swiweigoid papuedxy
SA9DVLNVAGVSIG ASAUNS 40 AdAL
TTS BUC woe LEE SER Tea ht: ieat
d common approaches in epidemiology
is well suited to settings in which a sampling criteria. Both adaptations required very small
frame is not readily available (Henderson, Sunda- sample sizes and have been useful in identifying
resan 1982). In the 30 x 7 cluster sampling tech- areas urgently needing attention.
nique, 30 clusters and a starting point in each Although LQAS requires relatively small sample
cluster are chosen randomly, while subsequent sizes, the sampling technique is more compli-
houses are chosen according to fixed easy-to- cated than the EPI method. A shortcoming of the
follow rules until seven children of the appro- method compared to the EPI approach is that it
priate age are found. requires a sampling frame from which lots are
The use of this simple approach can be general- randomly chosen. In addition, the method does
ized to many other health service-related prob- not provide an estimate of the variable of interest
lems. However the effects of clustering (and the for the lot sampled. For example, it will not pro-
associated standard error) must be taken into vide an immunization rate or contraceptive utili-
account during the analysis (Bennett, Woods, zation rate. This is because the samples are too
Liyanage, Smith 1991; Coetzee, Ferrinho, Reinach small to provide acceptable confidence limits,
1993). and can only classify a lot as acceptable or not
acceptable.
Lot Quality Assurance Sampling (LQAS) Thus LQAS has limited use in estimating overall
Lot Quality Assurance Sampling (LQAS) has its coverage rates. EPI methods are preferable if the
origin in industrial quality control where a certain major interest is a community-wide coverage
number of items (for example, tins) in a batch or figure.
lot is tested. On the basis of the number of defect-
ive items, the whole batch is judged as being Rapid ethnographic assessment
defective or acceptable. Since testing is costly, Rapid ethnographic assessment can be used to
one would like to test as few items as possible. obtain important in-depth anthropological data
The researcher has to take into account two pos- about cultural and other factors that influence
sible risks: the risk of classifying a lot with less health-seeking behaviour, without requiring ex-
than a specified threshold of defective items as pensive large-scale studies.
being defective (‘provider risk’); and the risk of By the use of focus groups, health care provider
classifying a lot with more defective items than a interviews and natural group interviews (groups
certain threshold as acceptable (‘consumer risk’). assembled not for the purpose of the study but
The technique is increasingly being used in the who are formed ‘naturally’, such as roadside food
health care setting for health programme evalu- sellers, mothers waiting at a clinic), important
ation and continual monitoring (Lemeshow, information is obtained. They can provide in-
Taber 1991). When applied to health services sights into motivation, attitudes, feelings, and
management, an area under a particular manager behaviour related to the use of health facilities.
could be divided into subunits which are called Simple self-administered questionnaires can also
lots. An ideal lot is the smallest unit that could be distributed through an existing administrative
provide meaningful information when evaluating system. Rapid ethnographic assessment has been
a programme. Each ‘lot’ (which could be the geo- used in this way to identify maternal factors that
graphical terrain of a community health nurse) is influence the use of preventive health services
sampled and some characteristic of interest is and treatment compliance.
measured, for example, the use of contraception Often such qualitative assessments are the best
by women 12 to 49 years of age. If the lot is way of forming the basis for more substantial
rejected, some management initiative is insti- quantitative studies. Such assessments are crit-
tuted in that sub-area. This means that resources ical prior to designing interventions.
do not have to be expended in sub-areas that are
actually doing well and: can rather be applied Rapid environmental health assessment
where most needed. Small area surveys are also useful in rapid en-
Recently the technique has been adapted to vironmental health assessment. They were ori-
assess immunization coverage, and the quality of ginally developed for assessing environmental
care and adequacy of health services provided by health factor needs, for example, in the peri-
local primary health care units (Lanata, Black urban areas of Zaire (Bertrand, Mock, Franklin
1991). Coverage is judged adequate or inadequate 1988). The aim was to determine environmental
by comparing it with a series of predetermined indicators which could be simply, cheaply, and
160
16 rapid epidemiological assessment
quickly monitored as measures of community The use of sentinel surveillance could be ex-
health. Three classes of indicators were consid- panded to include a list of tracer conditions (that
ered: is, selected diseases of importance) reflecting a
@ those associated with the individual range of changes that need to be monitored and
@ those associated with the dwelling demand an immediate public health response.
@ those related to the broader social and physical Sentinel health event surveillance has been used
environment. for infectious diseases (for example, poliomyelitis
and acute flaccid paralysis) as well as in occupa-
Socio-economic status is defined according to tional health (for example, a report of malignancy
local circumstances. For example, in Zaire, a of the lung in a chemical worker should trigger an
quality of life scale was developed using items in- evaluation for possible work site carcinogens).
cluding the presence of electricity, type of water
supply, and type of food preparation. The assess- Preceding birth technique
ments are useful for defining the relationship Mortality data are not strictly classified as surveil-
between child health status and physical/social lance, but do provide important information for
environmental factors, and have their main use in health surveillance. Although infant and child-
peri-urban areas where there are informal settle- hood mortality are sensitive indicators of social
ments. welfare and the relative health of a population,
mortality surveillance is a problem in most devel-
Surveillance methods oping countries. This is because neither the
Surveillance refers to the need for ‘continual numerators (the deaths by-age) nor the denom-
watchfulness’ of the distribution and trends of inator (the population at risk) for conventional
selected health outcomes with a view to acting death rates are accurate or even available.
when certain boundaries are passed. Surveillance Besides the use of sentinel surveillance to
methods are needed to continually monitor decrease the problem of inaccurate reporting, an
change. This is particularly important during a alternative method of assessing childhood mor-
period of rapid socio-economic growth, urban- tality rates has been developed by the demo-
ization, and political change, when the impact.of graphers Brass and McCrae. It is called the ‘pre-
several variables, from a change in the bread price ceding birth technique’ and has been tried in
to an increase in the level of political violence, many developing countries (Hill, David 1988). All
could have major impacts on health. Most sys- it requires is a question to mothers at delivery
tems collect too much data, however, (often with about whether or not the child preceding the cur-
loss of quality and accuracy) and analysis or rent delivery is still alive.
action is not timely, or is based on too small a pro- This provides a useful simple method, requir-
portion of data collected. In addition, the best ing limited resources, of monitoring the child-
surveillance is usually found where the risk is hood mortality of an area at delivery facilities in
least! (See also Chapter 14.) that area. If the delivery rate at health facilities is
above 70%, the method has been shown to accu-
Sentinel surveillance rately reflect the true childhood mortality rate of
Because of the lack of resources and the burden of the community.
routine surveillance at all health facilities, a sam-
ple of facilities or sentinel surveillance sites can Verbal autopsies
be identified in order to measure the health status To overcome the problem of the high proportion
of the population without studying the entire of ill-defined deaths in developing countries,
population. retrospective maternal interviews (verbal autop-
An example of this is the influenza early warn- sies) have been developed in a standardized way
ing system used by the National Institute of Virel- to assist in determining cause of death. Algo-
ogy in South Africa. Absenteeism surveillance is rithms (a’series of questions which allow a dia-
carried out in a number of schools prior to deter- - gnosis to be made easily) for commonly occurring
mining the activity of the influenza virus on an childhood conditions have been tested in a
annual basis. This surveillance system enables number of settings (Kalter 1990). In general,
the authorities to determine the current virus verbal autopsies were shown to be able to diag-
type so that an appropriate vaccine can be devel- nose major illnesses contributing to death in chil-
oped and deployed. dren with acceptable sensitivity and specificity.
161
d common approaches in epidemiology
eee ee ——————E———— ee
This approach is particularly important in reduc- of exposure to improved water supply and sanita-
ing uncertainty about the causes of death that are tion on diarrhoeal disease (De Zoysa, Feachem
lurking inside the ‘ill-defined’ categories, as well 1991).
as validating the diagnoses given by police offi- Similar methods can be used to assess the
Cers. effectiveness (rather than efficacy) of vaccination.
Cases (with disease) are compared to controls
Community indicators of risk or health status with respect to their vaccination status. However,
Several approaches are used to identify ‘at risk’ one shortcoming of such a study can be that the
populations towards which public health efforts vaccinated group may differ from the non-vac-
or health services need to be directed. Groups as cinated group in ways which are related both to
opposed to individuals are the target of the study. the risk of disease and vaccine status. For exam-
One approach is to use proxy measures of health ple, poorer people are more likely to get tuber-
status to determine which communities are at culosis and also not to be vaccinated (Comstock
risk. 1990). The impact of this bias can be assessed and
For example, the nutritional status of primary can be minimized by selecting appropriate con-
schoolchildren is a useful indicator of community trols (for example, by using household controls to
malnutrition (if school attendance is high). Sim- limit socio-economic bias).
ilarly, night blindness is a useful proxy for vitamin The case-control design can also be used in the
A deficiency. Tracer conditions, for example diar- evaluation of the Pap test in preventing cervical
rhoea, can be used to evaluate the quality of pri- cancer through early case detection (cases are
mary health care in a community. Other indic- those who have cervical cancer) and the health
ators of health service functioning have been impact of family planning practice (cases are
developed. For example, in missed opportunity those classified by some standard means as
studies, each contact a client makes with the having ‘poor health’).
health service is seen as an opportunity for health Case-control studies nested inside large popu-
workers to intervene more comprehensively on lation-based surveys can also be very effective.
issues other than the original reason for the visit.
Missed opportunity studies are thus able to Conclusion
rapidly assess the extent to which, for example,
curative services perform preventive functions REA is not a new discipline, but it represents a
(see Example 16.2). new approach to epidemiological research. It
draws on many well-known methods and also
Case-control methods for evaluation of stresses speed and simplicity, adaptation to local
health services conditions, and the need to obtain information
Traditionally, the case-control design has been timeously at a level of precision demanded by ser-
restricted to risk-factor studies of chronic vice providers. The strengths and weaknesses of
diseases. Most health service utilization studies each method need to be identified so that it is
have been based on large population-based stud- always used appropriately. As with any epidemio-
ies which are expensive and may take a long time logical study, rapid assessments should be con-
to complete. Case-control studies are more rapid ducted according to high standards.
and efficient. They can provide reliable results
and are an effective means of evaluating health
programmes, products, and procedures, pro- EXAMPLE 16.1
vided attention is paid to confounding. Accessibility of condoms to teenagers
In a study in Columbia, users were compared to
non-users (as outcomes) in a child health service
visiting family planning services
study (Selwyn 1978). The non-user group is By distributing condoms and promoting their
usually not included in studies of health services. use, Clinics that provide family planning services
It is, however, important to study the character- can play an important role in preventing the
istics of non-users. spread of AIDS. To assess the accessibility of con-
In a Cebu study, children who were brought to doms to teenagers at family planning services in
the clinic with diarrhoea (cases) were compared Durban, a simple random sample of 12 clinics run
to children who were brought in with respiratory by the Natal Provincial Administration and the
complaints (controls) in order to assess the effect Durban City Health Department was selected.
162
16 rapid epidemiological assessment
The fieldwork was conducted during April and the percentage of children leaving curative health
May 1991 by four trained and supervised teenage facilities in the Western Cape who do not have
fieldworkers, who visited clinics on different days evidence of measles immunization, a study was
and at different times. They kept detailed notes of carried out over a two-week period from the end
their experiences, and these notes were subse- of January 1990, immediately before the start of
quently analysed for content. the national measles immunization campaign.
Clinics were found to be accessible to people The sample consisted of two major teaching hos-
using public transport, but some were difficult to pitals in the greater Cape Town area, two second-
locate due to lack of signposts. Fieldworkers (par- ary referral hospitals (one in central Cape Town,
ticularly the two female fieldworkers) were em- the other in a small country town), and four day-
barrassed and intimidated by security guards at hospitals (the latter serving communities of low
the clinics. Clinic staff were generally efficient socio-economic status).
and welcoming, but did not give a high priority to All parents or carers leaving the hospitals with
condom distribution, and gave them ina setting child patients aged 6 to 59 months were inter-
which lacked privacy. Information on how to use viewed at the exit from the hospital. Approx-
condoms, if given at all, was supplied by means of imately 1000 such exit interviews were con-
pamphlets. Information on AIDS was rarely ducted. Interviews were conducted by trained
offered. fieldworkers who were not health service em-
The recommendations of the study include the ployees. Only a few senior staff members at the
following: selected hospitals were informed of the purpose
@ signs providing directions to clinics should be of the study. ,
put up; In 47% of cases, the health personnel at the
adequate stocks of condoms should be kept; hospital had not asked to see the child’s health
@ privacy should be ensured; card. If the fact that the carer could provide no
@ clinic staff should use the opportunity to dis- documentation of measles immunization is taken
cuss AIDS prevention with teenagers attending as indicating that the child had not been immun-
the clinic. ized, more than 60% of children were missed
opportunities (that is, they were in a setting in
Reference: which they could have been vaccinated but were
Abdool Karim Q, Preston-Whyte E, Abdool Karim SS. not). If no documentation was available and no
‘Teenagers seeking condoms at family planning services. history of measles immunization could be pro-
Part I. Auser’s perspective.’ South African Medical Journal vided by the carer, between 2% and 39% of chil-
1992 82;356-9. dren (depending on the type of hospital and its
location) were missed opportunities.
Similar studies could be applied to many other
aspects of prevention that could be practised
EXAMPLE 16.2 cheaply and easily in the curative sector. These
Missed opportunities for measles studies can be carried out over a 24- to 48-hour
period and have a large implementation
immunization potential.
Measles is still a major cause of childhood mortal-
ity and morbidity in South Africa. The WHO Reference:
recommends that all contacts with health ser- Yach D, Metcalf C, Lachman P, Hussey G, Subotsky E, Blig-
vices, for curative as well as preventive health naut R, Flisher AJ, Schaaf H S, Cameron N. ‘Missed oppor-
care, should be used to immunize children who tunities for measles immunization in selected Western Cape
do not have proof of immunization. To determine hospitals.’ South African Medical Journal 1991: 79;437-9.
163
17 Community
studies
164
17 community studies
when the number of households selected per sub- tions. Others might require a physical examina-
urb reflects the correct proportions per suburb, tion. All this needs to be taken into account before
differential household sizes between suburbs the exact location is determined. Equipment
may produce disproportional sampling at the must be light and convenient to transport. If data
level of the individual. That is, if one suburb has is to be collected at central venues, these must be
larger household sizes on average than another, acceptable to community members. If for any
these suburbs may be over- or under-represented reason there is a stigma attached to a particular
in the sample. venue, the response rate will drop and the validity
In general, sampling in a community setting is of the study will be compromised.
often complex and difficult, and may involve
many stages (see Examples 17.1 and 17.2).
Fieldworkers
Selection plans need to be predetermined so
that no selection bias occurs in the field. For Fieldworkers are the key to the success of a
example, if on a selected plot only one household community study. They may be interviewers or
(out of, say, four) must be studied, the convention recruiters, that is, individuals who motivate
may be to spin a bottle from the middle of the plot people to come to the central venue where others
and choose the house to which the spun bottle do the interviewing. Yet another group may do
points. Likewise, if sampling is done in stages, the physical measurements or observe features in the
household might be selected first and then one participants. Most commonly, fieldworkers in a
individual within that household may be ran- community study enter people’s homes and
domly selected for interviewing or measurement interview them there. s
by drawing numbers out of a bag. The researcher It is often helpful to have recruiters and/or
must ensure that sampling methods are not cul- interviewers from the community itself. Such
turally offensive. Some community members fieldworkers understand the community, speak
may become suspicious that ‘witchcraft’ is being the same language, understand what is con-
performed if the research involves throwing dice, sidered a stigma by the community, and could be
spinning a bottle, or picking names from a bag. useful in motivating people to participate. The
In some situations, a survey may include all the use of local interviewers, however, may put
people in an area, with no sampling taking place. people off. If the data being collected is sensitive
This occurs when the community is relatively in nature, community members might not feel
smal or when one of the purposes of the study is comfortable giving this information to people
to provide a full census of the population. they would occasionally meet in the street. There
might be competition for jobs and other
resources in that particular community, causing
Data collection resentment towards an interviewer who has
Data collection may occur in people’s homes, at a gained employment from the study.
central venue in the community, or even outside Researchers should ensure that the study is not
the community. Participants often have to be manipulated to benefit an individual or a particu-
motivated and encouraged to participate in the lar group. In a community where there are reli-
study. Many individuals may refuse to parti- gious, political, or other divisions, it is not advis-
cipate, and this must be taken into account able to choose people from only one camp as
during the planning stage of the study. This does interviewers. Data collection is a neutral process
not mean increasing the sample size, but implies and should not alienate any potential particip-
planning strategies to increase the willingness of ants. Interviewers need to be extremely sensitive
individuals to participate. In the community, to avoid further polarization of divided commu-
people are not a captive audience and therefore nities.
need to be met on their own ground and on their Careful attention needs to be given to issues
own terms. After-hour visits, repeat visits, and such as how the interviewers introduce them-
appointments with absent household members selves, how they dress (for example, do nurses
can help improve the response rate. wear uniforms or not?), and what security precau-
The location where data is collected also deter- tions they take.
mines exactly what information can be collected. Another fieldwork issue is that data collection
Some types of surveys require that the parti- in the community is often extremely hard work. If
cipants have privacy when they answer ques- people are not at home, repeat visits (often after
165
d= common approaches in epidemiology
166
17 community studies
notices were not delivered due to problems with The study was initiated by the District Medical
postal services in squatter areas. Officer of Health, who approached headmen in
This example is one of a typical complex urban- the area for permission. Villages, stratified by
based community study. The logistical problems village type (traditional, resettlement, and urban),
of covering multiple areas, bringing people to a were randomly chosen, with all households in
central venue, using relatively invasive tech- selected villages being included in the sample.
niques (taking blood), and following up people Village health workers (VHWs) who were part of
identified as positive are exemplified in this the district health service were trained to admin-
study. The fact that the study was conducted in a ister questionnaires to all women of childbearing
politically turbulent period exacerbated many age in their villages. Focusing on children born in
problems. The researchers identified a need for the previous five years, the infant mortality rate
extra training of researchers in the future in pro- was found to be 42 per 1 000; infants who were at
ject management, including conflict resolution greatest risk were those who were one of a twin,
and financial management. born at home, or living in a resettlement village.
Gastroenteritis was the most common cause of
Reference: death.
Levitt N S, Katzenellenbogen J M, Bradshaw D, Hoffman M Feedback in the form of pie charts, histograms,
N, Bonnici B. ‘The prevalence and identification of risk and other illustrative material was given to the
factors for NIDDM in urban Africans in Cape Town, South VHWs at a meeting. Discussion of the results led
Africa.’ Diabetes Care 1993;16(4):601-7. to improved VHW understanding of the issues
and enabled VHWSs to integrate their new know-
ledge and experience into their work.
This study illustrates a typical rural commu-
EXAMPLE 17.2 nity-based epidemiological study where existing
infrastructure was used to the maximum, and ser-
The profile of infant mortality in vices strengthened by the long-term involvement
Hewu, Ciskei (1986) of outside researchers.
In 1986, a community survey was conducted by
researchers of the MRC in the Hewu district of the Reference:
Ciskei to determine the infant mortality rate in Yach D, Katzenellenbogen J, Conradie H. ‘Infant mortality
the region and to look at risk factors for infant rate study: Hewu District.’ South African Journal of Science
deaths. 1987; 83:417-21.
168
18 Knowledge,
attitude, belief,
and practice
(KABP) surveys
170
18 knowledge, attitude, belief, and practice (kabp) surveys
172
18 knowledge, attitude, belief, and practice (kabp) surveys
the competing influence of the opinion of a The reliability and validity of KABP surveys
woman’s male partner may override the woman’s Where they have been specifically examined, the
own opinions about unprotected sexual inter- reliability and validity of some existing KABP sur-
course. Under certain circumstances, situational veys have not been as good as frequently assumed
factors, such as drug or alcohol use, economic (Okediji 1973; Stone, Campbell 1984). For exam-
imperatives, environmental factors, peer pres- ple, a study from Bangladesh showed that one-
sure, and social norms may play a greater role in fifth of men and one-third of women who used
predicting behaviour than the individual psycho- contraceptives in a sample denied this in a subse-
logical variables usually included in a KABP quent KAP survey (Bulmer, Warwick 1983).
survey. The availability of existing KABP survey ques-
tionnaires may encourage their uncritical use in
The rigidity of KABP questionnaires new settings. The re-use of questionnaires used in
KABP surveys rely on pre-coded questions. They other studies in new settings is useful to facilitate
may, therefore, provide answers many respond- comparisons across studies, but should not stop
ents might not otherwise be capable of formulat- extensive piloting to iron out the problems of
ing for themselves or find suitable for themselves. using the questionnaire in the novel setting.
It has been pointed out that KABP surveys assume Furthermore, the reliability and validity of the
the respondent already has a well-formed evalu- questionnaires used should be tested through
ative opinion on the subject in mind (that is, they methods such as repeat questions within the
have thought about the subject and have views questionnaire, intensive probing, combining
about it) and merely needs to access this response open and closed questions, repeat interviews
and report it to the interviewer. From this point of with single respondents, interviews with altern-
view, the attitude is assumed to be accessible and ative sources (for example, husbands, if wives
spontaneous (Tanur 1992). However, when the were the initial primary target), and so on.
respondent has no prior attitude (when they have
not thought about the topic or their relation to it), Simplistic approaches to KABP surveys
they may have to construct a response on the No one would deny that the relationships be-
spot. In this case, the attitude may be transient, tween health-related behaviour and the charac-
unstable, and somewhat meaningless for predict- teristics measured by KABP surveys are complex
ing behaviour assumed to be related to the atti- and difficult to quantify. However, KABP surveys
tude or belief. Attitude and belief questions in are sometimes substantially over-interpreted. For
KABP studies are seldom preceded by enquiries example, many KABP studies of family planning
about how much respondents have thought and HIV prevention are based on interviews with
about the matter beforehand, if at all. Some individual women of reproductive age. This im-
responses to KABP questionnaires may be efforts plies that sexual and reproductive decisions are
at politeness to apparently irrelevant queries, an individual matter, which is largely controlled
especially in developing countries where subjects by women. Simply interpreting responses indi-
are unaccustomed to being interviewed, and may cating a willingness to prevent HIV transmission
be overwhelmed by the presence of a stranger in or prevent pregnancy to mean that condoms or
their home and the pressure to be hospitable. birth control methods will be used by these
KABP surveys may neglect to collect inform- women, and that they are probably the best
ation on attitudes and beliefs which are most method for dealing with the problem, may over-
important to the respondents, if not originally look the many social and other impediments that
conceived as important in the survey. As the con- bar these women from actually carrying out their
tent of KABP surveys is defined by researchers, stated intent. Further interviews with women’s
they may overlook issues important to the partners may also be informative.
respondents. This is most likely to occur when the Another example which shows that the know-
researchers are unfamiliar with the social and cul- ledge, attitudes, beliefs, and practices of others
tural conditions of the communities being stud- may be more important than that of the people
ied (Ratcliffe 1976; Stone and Campbell 1984), whose health is directly affected, comes from a
and particularly when the cultural and social gap South African study. In this study, knowledge,
between the researcher and the researched com- attitudes, and practices with regard to hyperten-
munity is wide. sion and its management were investigated
among family practitioners (Pick, Steyn 1992).
173
d= common approaches in epidemiology
Family practitioners are likely to be primarily survey. Extensive exploratory research and pre-
responsible for ensuring appropriate detection testing, some of which needs to make use of
and management of hypertension among their qualitative methods, is essential for the com-
patients. Therefore, hypertension management petent design, execution, and interpretation of
aids to assist family practitioners may be more KABP surveys.
effective in controlling high blood pressure in the
community than simply directing education at Conclusion
the community itself.
KABP surveys may provide useful baseline in-
formation when designing health programmes
Problems of applying KABP surveys that aim to influence personal behaviour, and re-
in developing countries and cross- peated KABP surveys may provide useful inform-
cultural settings ation about the effectiveness of programmes.
How the information from a KABP survey can be
Many problems of surveys become exaggerated
used in the design, implementation, or modifica-
when surveys are applied in developing coun-
tion of health programmes needs to be planned
tries, and especially among largely poor and une-
ducated populations. It is not so much that there
from the outset. Too often KABP studies are done
are problems which are unique to these circum-
without bridging the gap to implementation and
building this information into intervention. It is
stances, but that they are more frequent and
essential that educationalists, programme man-
severe (Bulmer, Warwick 1983).
agers, programme participants, social scientists
Translation and health workers are involved in the research
It is difficult to ensure that translated question- process.
naires truly represent the original. This is espe- Several possible problems in the design and
cially important if a study uses questionnaires in execution of these surveys may threaten the
more than one language. Translation poses fur- validity of the research findings. It is important to
ther problems when the target language(s) lacks bear this in mind, and to interpret data critically
the vocabulary suited to the goals of the ques- and cautiously. To clarify the role of the many
tionnaire. In many instances, these complex lin- social and psychological factors in public health,
guistic issues are left to interviewers to navigate and to better assist the development of health
on the spot. This may place unrealistic demands promoting programmes, data from KABP surveys
on interviewers. To evaluate comparability, it may need to be combined with further inform-
may be necessary to translate the original into the ation, including that arising from in-depth and
target language, and then to get someone else to qualitative research.
back-translate the new questionnaire into the
original language. Discussion of the translated
and the original version of the questionnaire by EXAMPLE 18.1
bilingual people may be of further benefit. The use of KABP surveys in a school
Socio-cultural context
setting
A proper understanding of the socio-cultural con- A KABP survey of 377 high school students in
text is essential to being able to conduct research Cape Town found that students had a poor and
(see also Chapter 26: Psychiatric epidemiology). superficial knowledge of how HIV was trans-
Pilot and pre-testing of basic ideas, question- mitted, and had fragmentary and inaccurate in-
naires, and field strategy is essential for achieving formation about how to protect themselves. Few
accessibility to the population and discovering recognized AIDS as a threat to themselves or their
relevant cultural practice and expression. It is community, and people with AIDS were viewed
essential to involve individuals from local com- with hostility. Three-quarters of the students
munities in all stages of the research. were sexually active, and almost all expressed a
Qualitative research procedures, such as strong desire to learn more about the disease
anthropological methods, in-depth or unstruc- (Mathews, Kuhn, Metcalf, Joubert, Cameron
tured interviewing, participant observation, and 1990). This descriptive information was used to
focus groups, may be a useful starting point in lobby support for school-based AIDS education
formulating the questions asked in a KABP programmes, which at that time had received
174
18 _ knowledge, attitude, belief, and practice (kabp) surveys
little attention.
The information also formed the basis of the
development of a pilot education programme.
KABP surveys were undertaken before and after
this pilot programme was implemented, and at
similar intervals at a comparison school. These
surveys showed that there were significant
increases in students’ knowledge of HIV transmis-
sion and prevention, and significant increases in
students’ reported acceptance of people with
AIDS following the programme. However, there
were no significant changes in behavioural inten-
tions. Again, this information was used to garner
support for further school-based AIDS education
programmes, and to modify and improve the pro-
gramme.
Reference:
Kuhn L, Steinberg M, Mathews C. ‘Participation of the
school community in AIDS education: An evaluation ofa
high school programme in South Africa.’ Aids Care 1994;
6:161-71.
Reference:
Martin G, Steyn K, Yach D. ‘Beliefs about smoking and
health and attitudes towards tobacco control measures.’
South African Medical Journal 1992; 82:241-5.
175
19 Qualitative
methodology:
an introduction
sophy concerning how behaviours and social pro- to get an in-depth sense of what people think of
cesses are determined. This philosophy argues a particular object or event.
that attempts to explain the social world in terms 4 The in-depth interaction makes this approach
of overt behaviour miss the most vital part of it, particularly appropriate when used as part ofa
namely that each behaviour or action carries process to establish some form of action or
meaning which needs to be explored. People and campaign in a community.
176
19 qualitative methodology: an introduction
Methods of data collection ing other points if they seem important during the
course of the interview. No sequence is imposed,
Research objectives are formulated broadly and and each point can be the beginning of a series of
not framed around variables which are defined in probing questions. The greater the level of struc-
rigorous measurable dimensions, or in terms of ture provided, the less spontaneity the inter-
specific hypotheses. The emphasis is on invest- viewee is allowed.
igating the variables in their complexity and in An appointment should be organized well in
context. Overviews of the three most commonly advance at a mutually acceptable place, so that
used methods of data collection will be provided time is allocated and the respondent can prepare.
here to give a flavour of the research style. It is When organizing the meeting, the nature of the
important to note that the style of the research research should be explained. A person’s unwill-
method used has to be adapted to the situation, ingness to be interviewed must be respected, but
and the demands of the research question. reservations should be discussed if possible.
The initial phase of the interview is very impor-
Depth interviews tant, as this is when trust is established or lost.
This is probably the most common form of qualit- Openness and frankness about the nature of the
ative data collection. It generally takes the form of research will help to gain trust at this point. Con-
a discussion between the interviewer and inter- firm that the person is agreeable to having the
viewee about the area of research. The interview interview taped, reaffirm the nature of the re-
is commonly recorded on a tape recorder (with search, and re-establish consent. Privacy and
permission from the interviewee). The inter- quiet should be ensured where possible.
viewer has to direct the discussion to some extent The interviewer asks an open question, and
so that the required information can be obtained, then by the use of summarizing techniques and
but respondents are allowed to talk and cover the questions of clarification, draws out the meaning
area in their own terms and from their own and sense of reality that the interviewee places on
perspective. This method is generally used when the issues under question. Summarizing involves
detailed information is needed from individuals. reflecting back to the interviewee what was said in
Examples of its use would be to determine how such a way as to provide an immediate check on
TB patients feel about a new treatment plan, to the interviewer’s understanding, and to allow the
assess the illness experience of a person with interviewee to further develop his or her thoughts
cancer, or to find out how STD patients feel about on the question. Clarification questions serve to
the idea of using condoms. get explanations on issues raised by the inter-
The major advantage is that it gives the re- viewee about which the researcher is unclear.
spondent the opportunity for personal explana- The atmosphere should be kept cordial and
tion and detailed responses. In addition, the indi- open. An interview should preferably not last
vidual focus allows the interviewer to draw out more than an hour in most cases. It can be carried
more detail while the respondent is talking and on later if more time is required.
thinking about the subject. The major disadvan- The interview needs to be concluded with great
tage is that, compared to the other qualitative discretion and care. Any emotions and hopes that
methods, the respondent is more removed from the interview raises should be respected and
his or her own context and may feel threatened. talked through, as appropriate. At the end of the
This could result in bias. interview, the respondent must be told where the
Considerable preparation must take place research will be going from this point, should be
before going into the field. The researcher needs reassured about confidentiality, and, where pos-
to be clear about the purpose of the research, sible, the offer of later feedback should be made.
have a good knowledge of the setting in which the
research takes place, and have read widely in the Focus group interviews
area to be researched. Interviewers may also have The focus group method involves a number of
to prepare themselves emotionally, as they often people meeting in a group in which the parti-
have to deal with difficult and emotive issues. cipants talk to one another under the guidance of
Most commonly, the interviewer will enter the a facilitator. The purpose is to generate relevant
field with alist of points to be covered in the inter- ideas and information around a pre-arranged
view. These points give direction, but ideally topic. The aim of focus groups is to provide
should not inhibit the interviewer from discuss- insights into the attitudes, perceptions, and
177
d= = common approaches in epidemiology
ee — ee
178
19 = qualitative methodology: an introduction
(occupational health setting), and in investig- details of the future research plans given. The
ations of infectious diseases (such as AIDS), it researcher needs to be clear about when to leave
may determine indicators of risky patterns of the field by either setting a date or a point in terms
sexual behaviour. The data collection covers a of the quantity of data obtained, as it is tempting
range of activities, including observing, listening, to stay in the field indefinitely.
asking questions, and recording, which can be
done either covertly or overtly. Other methods
The first stage in a participant observation is to There are a number of other qualitative methods
select and get access to the site of the research. which are available to the researcher, but are less
The researcher needs to be free to move around commonly used.
easily and participate in the activities of the Researchers often keep research diaries to
community or group, while preferably remaining supplement the data collected. Research diaries
relatively inconspicuous. The process of gaining utifize valuable information picked up while the
access is vitally important, as it determines in researcher is interacting in the research setting.
many ways how responsive the community or The use of documentary evidence includes the
group is likely to be to the researcher. The nature analysis of photographs, tapes, life histories, and
of the research should be explained to the re- old documents. Other more experiential methods
search subjects as far as possible, especially how would include activities such as role play, project-
the information will be used. Where possible, ive techniques and visualization exercises. While
access should not only be negotiated with those all the methods of data collection discussed in
in authority, but also with individuals at different this chapter are often used individually, they may
levels, so that a broader understanding can be also be used in combination. The different meth-
achieved. A number of techniques to facilitate the ods are used to assess different types of inform-
process of gaining access are covered in the refer- ation and to back up findings found in one of the
enced texts. other research methods.
Once in the field, the researcher has to decide
to what extent to participate in the activities of the
community. Some researchers merely act as Other methodological issues
observers and try to maintain a distance from the
activities of the community, while others seek to Sampling
get involved as much as possible. This is both a The usual epidemiological practice of formal
practical and an ethical decision. random sampling with ‘concern for adequate
Collecting field notes can be difficult. While sample size is inappropriate in qualitative stud-
it will be impossible to record everything that ies. The depth of the data gathered is the primary
happens, it is vital that very detailed data record- goal, rather than statistical inference. Within
ing takes place. This can be done in a range of qualitative methods, greater emphasis is placed
ways, most commonly field notes, audio taping, on purposive sampling. This is a sampling
and video taping. Field notes should consist of method in which the researcher deliberately
relatively concrete, complete, accurate, and chooses respondents or settings in order to en-
detailed descriptions of social processes and their sure that the sample covers the full range of pos-
contexts. sible characteristics. In this approach to sam-
One particular issue concerning field notes is pling, the researcher purposively tries to obtain a
that the subjects being observed may be threat- sample that represents all important subgroups
ened by the note-taking, and this could in turn of the population, by targeting specific sectors.
compromise the research. In such instances, only This includes sampling deviant or extreme cases,
brief notes should be made as unobtrusively as typical cases, critical cases, politically important
possible and the fuller version should be done ata or sensitive cases, and sampling to maximize the
later stage from memory, when the researcher is variation. It is hoped that the diversity of the
alone. target group, as well as what is typical can be
Finally, when leaving the field, the researcher covered in this way. Haphazardly selecting
must be aware of the powerful social connections people for convenience is discouraged, as the
made in this type of research, and the effect that results can be heavily biased (Patton 1980).
the research process has had on the community. Sample size is often determined by logistical
Full and proper goodbyes need to be said and issues of available time and funding. Two poten-
179
d= common approaches in epidemiology
Date pile, Te IEE A SR Re re eS
tial ways of defining sample size in the field are sions could be that they are unerotic, that they are
firstly to continue in the field until the researcher embarrassing to use, or that they are difficult to
has enough information to answer the initial re- obtain. :
search questions, or until no more or very little As the data collection continues and the codes
new information is being gained from successive become more complex, there is a need to start
interviews. The second method is to use the initial summarizing and to begin making some sense of
interviews to establish hypotheses and use suc- the data. The use of theoretical memos is import-
cessive interviews to validate or adjust these ant and should begin early on. These are brief
hypotheses. Interviews will continue until the notes of potential hypotheses or observations
hypotheses require no further adaptations in that can be made while coding and examining the
terms of the new information being gathered, and data. All notes should be filed away to be used as
the researcher feels that they adequately explain a resource later on when doing the final inter-
what is being researched. pretation of the data. During the project, the
notes will become more complex and closer to
Analysis of qualitative data the final interpretations that will need to be
The development of methods for analysis has made. From all of the above, the key issues and
lagged behind the development of data collec- important factors need to be drawn out and the
tion, but some important texts have appeared final interpretations stated. It is important to
(Miles and Huberman 1984; Strauss 1987). It is relate the final interpretations to existing theories
important, however, that the researcher does not about the subject to obtain greater depth and
blindly follow others’ practices, but adapts the understanding.
analytic procedures to the demands of the situ- A number of techniques may be used to verify
ation and the data. While there are no fixed rules and interpret results. These include the triangu-
for the analysis of qualitative data, there are some lation method, in which the results using differ-
key aspects that need to be noted. ent methods are compared to see if they comple-
The general term often used for the process is ment each other. Another method is to set up ini-
content analysis, which basically means that the tial hypotheses and then to search the data for
data is explored in detail for common themes and disconfirmatory results. If any such results are |
these are then established into units of meaning found, then the hypotheses may need .to be
or codes. At a later stage, these codes become the adapted to take them into account. These help to
basis of further analysis in terms of their content make qualitative results generalizable. Validity
and meaning. Although this is costly and time- and reliability can be further enhanced by com-
consuming, taped material should be typed or paring results between interviewers, by getting a
written out (transcribed), as the written word is number of people to analyse the same sections of
the basic medium for analysis. The task of tran- the same material, or by relating the results to the
scribing should not be underestimated. Depend- theory on which the research was based.
ing on the quality of the tape recording, it can take
four to eight hours to transcribe a one-hour tape. Training
Every single word that is said by both the inter- This chapter is a brief overview of qualitative
viewer and the interviewee has to be written methods and cannot be considered a training
down. From early on in the data collection pro- guide. The references for this chapter will give
cess, coding should begin so that the key areas some additional guidelines on theory and tech-
can be drawn out to re-inform the data collection niques, but do need to be backed up with experi-
process. ential learning and support. Initial training can
Coding consists of searching the data for often be passed on via a short set of workshops by
common themes which can be established as . someone who is skilled in this area. It is only in
categories into which later information can be the process of working with others that the tech-
inserted. Over time, this coding should move niques can really be known and mastered. Key to
from the original crude codes to becoming more this is the fact that the researcher as a person is
complex. At this point, codes should start to over- the research instrument for qualitative methods.
lap, so more complex codes may need to be devel- Just as the questionnaire is the instrument in a
oped and subdimensions of these codes created. survey, the researcher is the means by which in-
For example, if the general code concerns atti- formation is collected in qualitative techniques.
tudes towards the use of condoms, the subdimen- For this reason, a lot of hard work needs to go into
180
19 qualitative methodology: an introduction
getting yourself ready for the task, even though ion of skills. Epidemiology has traditionally
many skills are simply extensions of existing focused on the use of quantitative data, which
social skills. Short courses on qualitative methods allows for focused projects and the testing of pre-
are offered by most research training institutions. set hypotheses. The qualitative method provides
Additional suggestions. for training include the research opportunities which extend the amount
establishment of study groups within institutions of information that may be gathered, particularly
to learn about the methods. Another would be to as it provides a greater level of understanding of
work alongside more senior and experienced the processes that affect the results we seek in the
researchers. health field. Interest in the area is increasing, but
it remains an undeveloped field in epidemiology.
Advantages and disadvantages of
qualitative methods
Advantages of qualitative methods
@ Since qualitative research is done within EXAMPLE 19.1
people’s contexts, it can produce results that Depth interviews to evaluate a
directly represent how people feel. During this community health worker project
process, the researcher will also get a closer
feeling of the general social functioning of the Depth interviews were used as one of the research
person or the community. methods for evaluating a community health
@ The results obtained are often more accessible, worker (CHW) project in Khayelitsha near Cape
as they are descriptions of real situations rather Town. A sample of 10 community members who
than statistical measures or diagrams, which had been visited by a CHW during the previous two
are often beyond people’s experience. weeks were interviewed in their own homes. Inter-
@ Innewareas of research, qualitative methods viewing techniques such as clarifications and
can be useful for the production of new ideas. reflective summaries were used to allow respond-
They can also provide information for other ents to explore how they felt about the visit by the
studies. CHW. The interview schedule was built around a
@ Ifnew information appears during the research single main question: ‘Please tell me about the last
precess, there is space within the research visit of the CHW.’ Prompts were prepared to elicit
structure to explore it. information that was not volunteered, for instance,
The ‘Hawthorn effect’ is used to describe the the purpose of the CHW visit, how the respondent
effect of being watched or observed: people felt about the visit, the health problems experi-
often change their behaviour when they think enced by the respondent, whether the CHW asked
they are being observed. In qualitative re- about these problems, and the information or ser-
search, a skilled researcher will be able to use vice rendered by the CHW. With the permission of
the greater flexibility of the qualitative method the respondents, the interviews were tape-
to get around this problem by using probes and recorded.
picking up on non-verbal cues. On analysis it was found that community mem-
bers had volunteered examples of social and per-
Disadvantages of qualitative methods sonal issues that they had discussed with CHWs.
@ There are often problems with the analysis of Comments like ‘I feel very happy to talk to Nom-
the information, particularly due to researcher philo about my problems and my secrets that are
subjectivity and bias. worrying me. To talk to her gives me great relief’
@ The process of transcribing and analysing data and ‘Nomphilo is like a sister to me’ enabled re-
can be time-consuming and costly. searchers to understand the texture of the rela-
@ The material acquired may also not be easily gen- tionship between the community members and
eralizable, due to the small sample size, and the the CHWs.
non-random sampling technique often used.
Reference: :
Matthews C, Van der Walt H, Barron P. ‘A shotgun marriage:
Conclusion Qualitative evaluation of a community health worker project
Qualitative research methods are a useful and in Khayelitsha.’ South African Medical Journal 1994;
valuable addition to an epidemiologist’s collect- 84(10):659-63.
181
d= common approaches in epidemiology
3. Data gathering Clinic nurses (7) Focus group discussion Skills in focus group |
‘How do patients Data analysis discussions. Listening
experience TB?’ gives new insight
182
19 qualitative methodology: an introduction
re wR Ut mC it oD
COUGH A LOT ? YOU SWEs
man en kinders. :
bYOU] MAYEHAVEST. Bo) TU GLY Cia wae
VT YOUR CLINE G8 DOCTOR aut youd CL 08 DOCTOR
AS SOON AS POSE ad SOOM AS MORALE
.
HII
ALIVITINILIILIIN
a F
d= common approaches in epidemiology
es a a eS SS Se SSS eel
ve oe ot
first read the booklet and then say what they walked past him all the time, and a student nurse
thought and felt about the booklet. came to his bed, read his bed letter, paged
Their response was enthusiastic and they through his file, and apparently checked the in-
strongly identified with the booklet. Some of the formation against the computer printout in her
patients said ‘This could have been my life’. This hand.
feedback convinced the management of the TB 08:42 Two student nurses were standing next to
Control Programme that the booklet fulfilled a the patient’s bed, talking to each other. A staff
patient need and they decided to use it as one of nurse and student nurse came to tidy up the
the means to support and encourage patients to patient’s bed. As they moved him around, he
adhere to their treatment. groaned at times. Although they spoke to each
other, they never addressed him.
Reference: 09:11 A student nurse came to take the patient’s
Van der Walt H. ‘The role of participatory research to blood pressure. The patient was lying down with
enhance patient adherence to tuberculosis treatment’ in his blankets pulled over his head. The student
Proceedings of the International Symposium on Participatory nurse pulled the blankets away, took his arm and
Research in Health Promotion. Compiled by Korrie de started to take his blood pressure. She was speak-
Koning, Liverpool School of Tropical Medicine, August 1994. ing in Xhosa to the staff nurse who was at the
opposite bed, taking patients’ temperatures.
When she had finished, she made a note of the
EXAMPLE 19.3 blood pressure in the patient’s file, washed her
hands, and walked on.
An intervention study to determine 09:15 The staff nurse reached the patient’s bed-
levels of care in a hospital side and said: ‘I must take your temperature.’ She
During an intervention study which focused on shook the thermometer and put it in the patient’s
the levels of care in a large tertiary urban hospital, mouth, saying ‘Okay, close your mouth.’ She
a medical anthropologist used participatory stood staring out of the window as she stood
observation to examine the communication be- holding his wrist but said nothing further. She
tween nurses and doctors, and between health took the thermometer out and the patient asked
care workers and patients. The researcher and her ‘How much is it?’ ‘Thirty seven,’ she answered.
research assistant alternated on the daily 07:00 to ‘Thirty seven?’ the patient repeated. The staff
16:00 shifts between the intervention and control nurse smiled, wrote the temperature down on the
wards. Their observations included routine activ- chart and walked away.
ities of nursing and medical staff, ward rounds, 09:20 A second student nurse came to the
and meetings related to the intervention. In addi- patient, apparently wanting to take his blood
tion, the researchers used individual and group pressure. ‘Hulle het dit al gevat’ (‘They have taken
interviews. The data gave a description of the it already’) he said. The nurse ignored him, took
organizational culture on the wards; for instance, his arm, and took his blood pressure again. After-
the differences in autonomy and authority be- wards she stood staring at his file for a long time,
tween nurses and doctors, gender and class dif- and then wrote something in it and walked away.
ferences between doctors and nurses, verbal and During those two hours the patient received
nonverbal communication between health care two orders, asked one question, and gave a nurse
workers, and between health care workers and information that an observation had already been
patients. done. She chose to ignore him.
The following example is a description of an
observation which illustrates a typical interaction Reference:
between nurse and patient. Gibson, D. ‘Communication between health care workers
One patient was observed for about two con- per se and between health care workers and patients: Level
secutive hours from 07:15 when the patient was of Care Intervention Study, Qualitative findings.’ Unpub-
lying in bed, groaning from time to time. Nurses lished report, MRC, 1994.
184
SECTION
Mortality Studies:
_ D Bradshaw
Outbreak investigations
SMetcalf, N Coetzee —
2 Estimating amunieation esverage
N Coetzee, D Berry, D Yach
Environmenta epidemiology:
Y von Schirnding .
Occupational epidemiology
JMyers .
Disability studies
J Katzenellenbogen
Psychiatric epidemiology
C Parry, L Swartz
Anthropometric studies
G Joubert
Assessment of dietary intake
M Langenhoven, P Wolmarans
Measurements in dental epidemiology
M Moola
Health economics
D McMurchy, E Thomas
20 Mortality studies
- - White
— Coloured
-- Asian
— Urban Black
Oo
Oo
Oo
w
~
@
—
SO
fee
35 - 44 45 - 54 55 - 64 65 + Age
effect of the age and sex differences. The results population. The SMR for Group A compared to
are known as standardized rates. A ‘directly’ stan- Group B as a standard would be 1,35. The
dardized rate is the average rate of the study observed number of deaths was 620, and we would
population that would occur if the study popula- expect 458 deaths if the age-specific rates from
tion were to have the age and sex distribution of Group B occurred in the population of Group A.
the selected standard population. In the hypothet- It can be seen from Appendix 1 that the value of
ical example in Appendix 1, the age-standardized a relative index depends on the method used to
rate for Group B is 150 per 100 000, compared to calculate it. While the nature of the relationship
190 per 100 000 in Group A. Calculating the age- will remain the same, the magnitude also de-
standardized rates allows one to compare the two pends on what population was selected as the
summaty rates after having removed the effect of standard. It is crucial to report details of the stan-
the age differences in the two populations. dard population and the method of standardiza-
tion when reporting standardized rates.
Relative rates Directly standardized rates have the advantage
A relative index that compares the mortality in two that when the same standard population has
populations can be calculated. The comparative been used for the standardization, the rates may
mortality factor (CMF) is the ratio of the ‘directly’ be compared to one another. This is because the
standardized rate of the one population divided by age and sex distribution for each CMF has been
that of the other population. In Appendix 1, the adjusted in the identical way. Unfortunately,
CMF when comparing group A to Group B is 127%. when the same standard population is used for
The age-adjusted mortality in GroupAis 1,27 times indirect standardization, the SMRs are not com-
higher than the age-adjusted mortality in Group B. parable to one another, as the age and sex distri-
On the other hand, the standardized mortality butions have been adjusted differently for each
ratio (SMR) is based on ‘indirect’ standardization SMR. However, the sampling error for the SMR is
and is used when the category-specific rates (for generally smaller than the sampling error in the
example, age- and sex-specific rates) cannot be case of the CMF, because only the total number of
calculated. In Appendix 1, the SMR is the number deaths need to be estimated, compared to all the
of observed deaths divided by the number that specific rates in the case of the CMF. The variabil-
would be expected if the category-specific rates of ity of an SMR will therefore be smaller than that of
the standard population occurred in the study a CMF, which makes it a useful index when the
188
20 = mortality studies
numbers of deaths in categories are relatively which occurred a long time ago). Since popula-
small. Since accurate data on the age distribution tions are often very mobile, it is also unclear what
of the population is often not available in South groups the respondents in a survey represent.
Africa, the SMR is generally used to analyse data. In another indirect method, the McCrae and
Brass method, women are asked at the time of
Key mortality rates used as indicators of health delivery about the outcome of their previous
The infant mortality rate (IMR) is a particularly delivery. This clinic-based approach is much
valuable age-specific mortality indicator. It is de- more practicable than a field survey. However,
fined as the number of deaths of children under the potential biases in this method are unknown
the age of one year divided by the number of in South Africa.
births in that year. It is used not only as an indic- Although the infant mortality rate is widely
ator of the health status of children under one used, many demographers are now proposing
year, but is also used to reflect the health status of that a childhood mortality rate, which reflects the
the whole population. Clearly, it is easier to moni- chance of dying over a few years, would be a better
tor this rate than the mortality rates over all ages. index to estimate, as it is less subject to error than
However, in South Africa there is insufficient in- the infant mortality rate. The childhood mortality
formation to calculate the IMR for all groups. rate usually considers children up to the age of
Many births and deaths are not registered. Rou- five years and is known as the Under 5 Mortality
tinely available data can thus not be used to cal- Rate (U5MR). It is also important to focus on the
culate nationwide IMRs. U5MR rather than the IMR as the burden of high
For this reason, surveys are conducted to col- risk of death shifts from infants to toddlers.
lect this information. Information is collected In South Africa, the importance of adult mortal-
from women included in the survey on the ity is increasingly becoming apparent. The 45Q15
number of children they have had in the past five is a very useful summary index of adult mortality.
(or two or 10) years, and of those, how many have Based on a life table calculation, it is the probabil-
died. This data is grouped into a cohort of births ity of a 15-year-old individual dying before reach-
and the probability of dying in each time-period is ing the age of 60. In 1985, it was estimated that
calculated. This is known asalife table analysis, 40% of 15-year-old South African males and 23%
and is used to estimate the IMR or the childhood of 15-year-old South African females would die
mortality rate (the probability of a child dying before the age of 60. These rates of adult mortality
before the age of five). Surveys can also be done to are extremely high, and are comparable with
collect information which can be used to estimate those in the rest of sub-Saharan Africa.
the IMR through indirect demographic tech-
niques. These do not collect the number of births Determining cause of death
and infant deaths that occur during the study In the health context, it is important to investigate
period, as these are subject to various biases when the causes of death. Unfortunately, the death
collected directly, and require very large samples. certificate used in South Africa in the 1980s and
Instead, these surveys depend on the relationship 1990s does not facilitate the systematic collection
between some more easily obtained data and the of information needed to determine the under-
desired mortality indicator. Indirect methods lying cause of death. No distinction is made on
provide estimates of IMR which relate to a prior the certificate between the direct cause of death
period. The most commonly used example of an and the underlying cause, which means that the
indirect method, the Brass method, is to survey a certifiers have to use their discretion as to which
sample of women and ask them how many chil- cause they report. Causes are coded according to
dren they have ever had, and how many of those the International Classification of Diseases. The
children have died. The proportion of those dead, underlying cause of death is of public health im-
according to the age of mother, can be converted portance, since it is the cause at which prevention
into an infant mortality rate on the basis of model should be aimed.
life tables. The advantage of the Brass method is If the size and age distribution of the popula-
that it does not require the dates of birth or death tion is known, an analysis of the cause of death
of the children, but relies rather on the mother’s can be based on age- and cause-specific rates, or
age. Both these survey methods have problems age-adjusted rates by cause. In the case of the
with recall bias and proxy reporting (some house- denominator (that is, population figures) not
hold members may not be aware of infant deaths being known or being unreliable, it is possible to
189
e epidemiology applied to content areas
assess the relative importance of different causes The disability adjusted life year (DALY) is a
of death by considering the proportional mortal- newly developed index that combines the years
ity. lived with a disability (weighted according to the
As in the case of mortality rates, the cause of severity) with the years of life lost due to premature
death pattern is influenced by age and sex. For death. This index uses data from morbidity and dis-
comparative purposes, it might be necessary to ability surveys in conjunction with mortality data.
calculate the proportion of deaths due to a part-
icular cause within each age and sex group. When Determining geographic variations
there are numerous age/sex categories, a method The geographic variation in mortality is often
of summarizing the information is necessary. One investigated to identify factors related to the
index is the Proportional Mortality Ratio (PMR). disease. The magisterial district of residence of
This is calculated by taking the total number of each person who dies is available from routinely
observed deaths due to a particular cause, and collected mortality data. This allows rates over the
dividing it by the total number of deaths expected whole country to be calculated at the level of
due to that cause if the proportion for each magisterial district. An example of such a study
age/sex group is that of the standard population. indicated that the rates of certain cancers were
Sometimes it is difficult to interpret the PMR. For unusually high in the areas around Prieska in the
example, if the PMR is high for a particular cause Northern Cape, where asbestos is found, thus
of death, it is not possible to distinguish whether implicating asbestos as a cause of the cancer
the death rate due to this cause of death is high or (Botha, Irwig, Strebel 1986).
the rates for the other causes are low, making the Indirect standardization is often incorrectly
proportion appear high. applied in such analyses. If the population data is
The potential years of life lost (PYLL) is also a available, it would be better to use direct standard-
summary index. This index does not attempt to ization, as the SMR for one magisterial district is
standardize for age, but actually utilizes the differ- not comparable to the SMR for another. Further-
ences in the age structure of the deaths. The total more, it must be borne in mind that the coding of
number of years of life lost by a population can be magisterial district information might not be con-
calculated by totalling the years that each death is sistent; for example, some deaths of Wynberg resi-
‘premature’ (for example, the number of years dents have been coded as occurring in Cape Town.
that each death occurs before the age of 65, or any
other selected age cut-off point). In order to calcu- Determining trends over time
late the total PYLL, the number of deaths at a par- It is important to assess whether mortality levels
ticular age is multiplied by the difference between change over time. The first step in such an ana-
that age and the selected age cut-off. In South lysis is to examine graphs showing the mortality
Africa, it is reasonable to consider 65 as a suitable rates over time. If possible, age-specific rates or
cut-off age, as this is close to the life expectancy at standardized rates should be investigated to
birth. Mathematically, the PYLL is expressed as check that apparent changes in mortality rates are
64 not due to changes in the age structure over time.
PYLL => d,(65-x), where d, is the number of Cervical cancer mortality trends in South Africa
from 1949 to 1990 are shown in Figure 20.2 (Bailie,
deaths occurring at age x. Selvey, Bourne, Bradshaw 1996). The world age
standard was used as standard population. It can
The total potential years of life lost can be divided be seen that the age-standardized rate for white
according to the underlying cause of death, and women declined after the mid-1960s, while that
the relative importance of different causes of of coloured women rose. The rate for Asian
death can be measured by taking the potential women fluctuated from year to year because of
years of life lost for a particular cause relative to the relatively small number of deaths each year.
the total potential years of life lost. In this index, it However, it also shows a downward trend from
is clear that the deaths that occur in younger age 1970. The data for African women was not avail-
groups are given greater weighting than the able for this period.
deaths that occur in older age groups. This gives Many factors are related to changes in cervical
the PYLL particular importance in public health, cancer mortality, but the introduction of cytolo-
where there might be more incentive to reduce gical screening (PAP smears) during the 1960s
premature mortality. and 1970s has had an impact in many developing
190
20 = mortality studies
000
/1Mortality
rate
Birth cohorts
1941 - 1950
1931 - 1940
1921 - 1930
1911 - 1920
000
/1Mortality
rate 1901 - 1910
1891 - 1900
1881 - 1890
F
OF
koe
+ 1871 - 1880
Age of death
191
e epidemiology applied to content areas
Figure 20.4 Cancer of the cervix: mortality rates for birth cohorts — coloureds
Birth cohorts
“7 1951
-1960
°So “?~ 1941 - 1950
S
a
2
wo
~ 4931
-1940
= O 1921 - 1930
os
4c
oO
=| AX 1911 - 1920
-@ 1901
-1910
1891 - 1900
KZ
ee: a if | I L 1 L
20 25 30 35 49 45 50 59 60 65 70
Age of death
countries. A cohort analysis of the South African One method of summarizing cohort mortality
data is shown in Figures 20.3 and 20.4. These information is to use the life table technique. For
show that the pattern of mortality in successive each age, the probability of dying is calculated by
birth cohorts for white women has dropped and is taking the number of deaths in that age group
consistent with a reduction in mortality following relative to the average number of individuals in
the advent of cytological screening in the 1960s. that age group. From this cohort life table, the
The same trend is not evident in the birth cohort expectation of life at any age can easily be calcu-
analysis of the mortality for coloured women, lated. The expectation of life at birth is commonly
where there is no evidence ofa drop. used as a summary index to describe the overall
When analysing the trends in mortality rates, it is mortality pattern.
important to distinguish between a cohort effect It is important to be aware of possible changes
— the effect of mortality on an entire population in coding practices which might account for
born at the same time — andaperiod effect, changes in the mortality level (Example 20.3). The
which is a change in mortality patterns over time. type of analysis will depend on the variability in
Statistical models can be used to extract the three the data and the patterns seen. For example, the
components from trend data: age, period, and analysis of the trend in diarrhoea will require the
cohort effects. seasonal variation (high summer rates, low winter
Another way of analysing mortality data is to rates) to be taken into account through a special
select a cohort at birth and follow their mortality statistical procedure called time series analysis.
pattern through life. This is often used in the form Comparisons in proportions over time are very
of a historical cohort analysis, where historical difficult to interpret, as apparent changes in one
records are investigated to retrieve mortality cause might be due to changes in a different
information. Records might be those routinely cause. For example, the proportion of deaths due
available through the registration system, or might to circulatory diseases in white males dropped
be obtained from less conventional sources such between 1970 and 1984, but this could be due to
as church records, immigration records, or special the increase in injury-related deaths rather than
interest collections. Data collected in this way to a decrease in circulatory diseases.
needs to be assessed carefully for completeness.
192
20 = mortality studies
10 20 30 percent
30 percent
194
20 mortality studies
Figure 20.7 Number of deaths per year due to ICD codes 412 and 414 ex-
pressed as a percentage of the average number for the period 1968-1984
t
eo
Asymptomatic IHD
m
ob
©
o>
=
hw
®
>
°o
®
D
f©
ad
®
=>
wo
-
o seem Coloureds
~
Cc —~-— Asians
®
2
®D
a
Chronic IHD
am 412
the survey, and data was collected from 7 792 classification during the period 1968 to 1984, the
women of child-bearing age. From the replies to numbers of deaths for a given cause in each year
questions about their children born in the preced- for whites, coloureds, and Asians were separately
ing five years, it was estimated that 41 per 1 000 expressed as percentages of the average number
liveborn children died in the period 1981-1986. of deaths due to that cause in that population
Using information on all children ever born, group over the whole period.
Brass’s indirect demographic technique indic- The graph (Figure 20.7) of the trend in the mor-
ated that for the same period, the IMR ranged tality due to two codes related to ischaemic heart
from 80-100 per 1 000 live births. disease (IHD) shows that between 1980 and 1983
The discrepancy in the estimates could be there was a marked change in the coding practice.
attributed either to recall bias, or bias due to The sharp drop in deaths categorized as due to
proxy reporting (in the case of the survey), or to chronic IHD (ICD 412) was matched by an in-
the model life table being inappropriate (in the crease in deaths due to asymptomatic IHD (ICD
case of the indirect estimate). 414). Clearly, researchers who focus on trends
observed in mortality in one specific selected
Reference: cause, should also look at trends in all closely
Yach D, Katzenellenbogen J, Conradie H. ‘Ciskei infant related causes to ensure that observed changes
mortality study: Hewu district.’ South African Journal of are real and not due to coding changes.
Science 83:416-21.
EXAMPLE 20.3
Achange in mortality coding practice
Reference:
Bradshaw D, Botha H, Joubert G, Pretorius J P, Van WykR,
in RSA Yach D. Review of South African Mortality (1984). Medical
To evaluate the consistency of the cause of death Research Council, Technical Report No 1, Parow, 1987.
195
21 Outbreak
investigations
196
21 ~~ outbreak investigations
carry rabies), and that insects (for example, mos- These possible explanations must be consid-
quitos carrying malaria) and arthropods (for ered and excluded each time a potential outbreak
example, ticks carrying types of fever) can transmit has been detected. For example, a spurious out-
infection. The following section outlines a basic break of meningococcal meningitis occurred in a
stepwise approach to outbreak investigations. rural town when a new hospital staff member
based diagnoses on microscopy of scrapings from
Stepwise approach to outbreak skin haemorrhages in suspected cases. This diag-
nostic method is particularly inaccurate, and 70%
investigation of cases notified from this hospital turned out to
The efficient use of time is important in outbreak be conditions other than meningococcal menin-
investigations. Effective control measures should gitis (Turk 1993).
be instituted as quickly as possible to minimize Once an outbreak has been identified and
the severity and public health impact of the proved not to be merely an artifact of reporting or
outbreak. This is best achieved by teamwork and case ascertainment, a definite diagnosis of the ill-
by following a systematic problem-orientated ness must be made. This will involve taking good
approach, being logical and focused, and con- clinical histories, examining patients, and obtain-
tinually re-evaluating hypotheses as new inform- ing laboratory test results (where appropriate and
ation comes to light. The emphasis given to the feasible) at the outset of the investigation (see
different steps below will vary depending on the Example 21.2).
extent to which the causative agent, the source of To ensure uniformity in case detection, a case
exposure, and other key determinants are known definition should be established. The case defini-
or suspected in the initial stages of the investiga- tion should be as objective and specific as
tion. In practice, there is overlap between the possible while being easy to apply. It may com-
steps listed, and several steps are done concur- prise symptoms and signs (clinical definition)
rently, partially due to time constraints. and/or laboratory investigations (laboratory
confirmed case), depending on the disease under
Detecting outbreaks investigation and access to laboratory tests.
The most important surveillance system | is When symptoms such as diarrhoea or fever
disease notification, which is a statutory obliga- make up part of the definition, they need to be
tion on health workers to notify the health clearly defined for the purposes of the invest-
authorities of patients with certain conditions. igation (for example, ‘fever’ could be defined
However, it is likely that many outbreaks are not as a sublingual temperature of at least 38,5 °C;
detected, while others are not detected in time to ‘diarrhoea’ could be defined as the occurrence of
carry out a timely investigation because of delays three or more stools of liquid consistency per
in reporting and failure of clinicians to notify day for at least three successive days). If the out-
cases. Some diseases which have been respons- break is due to a recognized condition, reference
ible for outbreaks locally were not notifiable at should be made to published case definitions
the time (for example, whooping cough and (for example, the Morbidity and Mortality
shigellosis). Laboratory-based surveillance of Weekly Report 1990 supplement), which serve
microbial isolates and serology results are also as a basis for standardization in comparisons
important for the detection of outbreaks, particu- between different outbreaks.
larly hospital (nosocomial) outbreaks, and out- It is useful to have a broad or lenient working
breaks due to non-notifiable conditions. case definition (with high sensitivity) to identify
potential cases who need further evaluation (to
Confirming the diagnosis minimize the chance of missing cases), and to
Initial reports of disease outbreaks are sometimes have a more rigorous case definition (with high
unreliable and may provide misleading inform- specificity) for confirmed cases. It is often useful
ation about the causative agent. Apparent in- to have an intermediate type of definition of
creases in disease incidence may also result from ‘probable cases’ for suspected cases who do not
improved case detection, improved reporting, meet all the criteria for confirmed cases. For fur-
changes in the diagnostic criteria, changes in ther details on these different levels of case defini-
population size or composition, or false positive tion, see the Morbidity and Mortality Weekly
laboratory tests. Report 1990 supplement.
197
e epidemiology applied to content areas
198
21 ~~ outbreak investigations
titioners, school absenteeism records, disease pital, or limited to a small geographic area, it is
notifications, and death records. A decision useful to plot the spatial location of cases on a
should be taken as to which sources to use. spot map (Figure 21.2). This helps to demonstrate
A detailed description of cases is given accord- clustering of cases by place and may provide clues
ing to ‘person, time, and place’. This summary of about the source of exposure.
cases is useful for defining who is at risk of con- The scale of the spot map will depend on the in-
tracting disease, as well as formulating hypo- vestigation and the geographical extent of the epi-
theses as to the likely sources of exposure, and demic. If the cases are associated with an institu-
modes of transmission of the disease. tion such as a school or hospital, they should be
plotted by location within the building (that is,
Person classroom or ward), and if they are confined to a
Cases should be summarized in terms of key fac- geographic area, they should be plotted on a
tors such as age, sex, occupation, ethnicity, socio- large-scale map showing features such as roads
economic status, and immunization status (as and rivers. The location of potential sources of
well as other disease-specific relevant exposures). exposure to infection, such as sources of drinking
water and ablution facilities, should be shown on
Time the spot map.
The time course of the epidemic should be stud-
ied by drawing up an epidemic curve, that is, a Ancillary environmental and microbiological
histogram with the number of cases (on the Y investigations
axis) plotted against the date of onset (on the X Depending on the nature’ of the outbreak and
axis). The shape of the epidemic curve is useful in whether or not the microbial agent responsible is
determining the nature of the source of infection. normally endemic in the population affected by
A common exposure at a single point in time the outbreak, it is often useful to do environ-
(termed a ‘point source epidemic’) results in an mental and microbiological investigations to
epidemic curve with a steep upward curve with a complement epidemiologic investigations.
well-defined peak and a relatively short time Evidence from environmental investigations
course (Figure 21.1). In contrast to this, exposures complements epidemiologic evidence, and, if one
occurring over a period of time and conditions is lucky, may confirm the source of infection. This
which are readily spread from person to person, aspect of the outbreak investigation should be
are characterized by a broader, shallower epi- overseen by someone with expertise in this kind
demic curve, consisting of a less steep upward of investigation, such as a _ microbiologist,
curve, with one or more broad peaks occurring hospital infection control officer, environmental
over a protracted period of time. The date of any scientist, or environmental health officer.
community events which could be related to the Environmental specimens of possible sources
exposure should also be shown on the X axis of of infection (for example, food, drinking water,
the epidemic curve, as this may yield important swimming water) should be taken for micro-
clues to the point at which infection was intro- biological investigation to provide evidence of the
duced into the population. This may in turn pro- source of exposure.
vide clues about the source of exposure. If you have access to microbiologists working in
The epidemic curve will also show whether the specialized research laboratories such as the
number of new cases is still on the rise or whether South African Institute for Medical Research, the
it is declining, and so will help to indicate the National Institute for Virology, or universities
urgency of instituting control measures. (It is not with medical schools, it may be useful (depending
uncommon to find that the outbreak is already on on the organism under investigation) to request
the decline, in which case the investigation may that a microbiologist do specialized tests (for
be less urgent, as additional control measures example, ‘DNA fingerprinting’), using mole-
may not be needed.) cular biology techniques, to characterize the
organism in specimens from patients and
Place environmental specimens. The use of ‘molecular
Data collected should include home address, ad- epidemiology’ may be useful in confirming or
dress at work or school, location of any function refuting epidemiologic links between cases,
attended, and recent travel history. If the out- and in confirming environmental sources of the
break is in an institution such as a school or hos- outbreak strain.
199
e . epidemiology applied to content areas
a
o
a
oS
°
—
So
—
o
2
Ee
S
=
Pump handle removed
EP Labdataet aban
HORSAtie ecm COAT eZ eS 2 4 6 8 18 20 22 24 26 28 30
August Date of onset September
Source: Snow J. On the mode of communication of cholera. London, 1855. Reprinted in: Frost WH (ed). Snow on Cholera.
New York: Commonwealth Fund (1936).
A) =) oa®
am N el “ ” Bsi £ A] =m ° ed fe]eat & a] ” weeoecH Q ea} 6 c=)ct)& odel@ te)+ a
ie
201
e epidemiology applied to content areas
study of the entire population at risk. All people linked to disease (case) status if the magnitude of
(or as many people as possible) who ate at the the exposure-disease association (either the rela-
suspected place or function during the suspected tive risk or the.odds ratio) is large in relation to
exposure period should be interviewed so that the other exposures considered.
specific attack rate and the relative risk (rate ratio) If the outbreak is due to one of the Expanded
associated with eating each food item can be cal- Programme on Immunization (EPI) target dis-
culated. A case-control design is used when it is eases against which infants are routinely immun-
not possible to define or to study all people at risk ized (for example, poliomyelitis, measles, whoop-
of becoming a case. The magnitude of the asso- ing cough), you should investigate the possibility
ciations between exposures of interest and that the outbreak is due to inadequate vaccine
disease (case) status is the most important type of coverage and/or to vaccine failure (that is, poor
evidence provided by analytical studies, and can vaccine efficacy). Guidelines on carrying out vac-
be used to assess which exposure is most likely to cine efficacy studies in an outbreak setting are
be the source of infection. In a cohort study, this provided in the reference by Orenstein et al
is expressed as relative risks, while in a case- (1985), and some of these principles are illus-
control study, it is expressed as odds ratios. trated in the reference by Coetzee et al (1994) de-
In outbreaks in large populations or where scribing an investigation of a measles outbreak in
there is not a clearly defined group of people Cape Town.
among whom the majority of cases originated, it
is necessary to do a case-control study. A number Data collection
of sources may be used to select appropriate con- If feasible, each case and control (or family
trols. Suggested sources include: people attend- member of each case, if the case is a child or has
ing the same health facility or family practice, died) should be interviewed using a standardized
pupils from the same school(s), or people from questionnaire. It is imperative that the method of
the same workplace or neighbourhood as the data collection is uniform, both among cases and
cases. As only a sample of people with each expo- between cases and controls. For example, per-
sure of interest is studied, the rate of disease asso- sonal interviews of cases and telephonic inter-
ciated with each exposure cannot be calculated, views of controls could influence the replies to
and odds ratios are used instead of relative risks questions. For the same reason, a structured stan-
as the measure of effect. Controls must be free of dardized questionnaire is essential, as the actual
the disease in question, and must have a similar questions asked and the way they are asked may
potential for exposure to the risk factors under influence answers. Items to consider in drawing
investigation as the cases. up the questionnaire are listed in Table 21.2.
If the disease is strongly associated with certain Items should be selected from this list by con-
characteristics such as age and sex, it may be use- sidering what is known of the epidemiology of the
ful to match cases and controls to limit confound- condition being investigated, omitting items
ing by these factors. Matching should only be which are not potentially relevant to the invest-
considered if the matching factor is strongly asso- igation.
ciated with disease status, but the matching Medical records are generally not a suitable
factor is not itself a factor of interest. It is advis- substitute for case interviews, as they tend to be
able to consult an epidemiologist or a biostatis- clinically orientated, and epidemiologically relev-
tician if matching is used, as it is necessary to take ant information on exposures of interest is often
matching factors into account in the analysis. omitted from the record. However, they are useful
In outbreaks affecting a small number of cases, as supplementary sources of information, espe-
it may be useful to have more controls than cases cially for providing data on the results of dia-
to improve the precision of the odds ratio estim- gnostic tests and procedures.
ates. Up to four controls (matched or unmatched) Descriptive information should be entered
may be selected for each case. In outbreak invest- onto a computer as soon as possible in order to
igations using eithe, a case-control or cohort create a basic list summarizing key attributes (for
design, the number of cases is often too small to example, age, sex, date of onset, area of resi-
detect statistically significant associations, even dence) of cases. A laptop computer is useful if
for causal exposures. However, the lack of statisti- available, especially if the investigation is being
cal significance (p-values >0,05) should not deter done in arural area. This means that information
you from inferring that an exposure is causally can be entered and analysed on the spot while the
202
21 _~=soutbreak investigations
should be given to writing the investigation up for woman had used water from the Broad Street
publication. Publication of the findings is useful if pump, and that the brewery had its own well
there is the potential for similar outbreaks in which supplied the workers with drinking water.
other localities, or as a reference if innovative This outbreak had the characteristics of a point
epidemiological detective work and analytic source epidemic (Figure 21.1). The well in Broad
study designs were required to ‘solve’ the out- Street was no longer contaminated by the time
break. the pump handle was removed, as the epidemic
The investigation of disease outbreaks is one of was almost over by this time.
the most practical and useful applications of
epidemiology. A thorough investigation that is Reference:
clearly communicated and well documented SnowJ. ‘On the mode of communication of cholera.’
contributes significantly to progress in public London, 1855. Reprinted in: Buck C, Llopis A, Najera E,
health delivery and clinical practice. Terris M (eds). The Challenge of Epidemiology: Issues and
Selected Readings. Washington DC: Pan American Health
Organization, 1988.
EXAMPLE 21.1
John Snow’s investigation of cholera EXAMPLE 21.2
in the Broad Street area of London, An investigation of Legionnaire’s
1854 disease in a Johannesburg hospital
In 1854, Dr John Snow investigated an outbreak of Between November 1985 and February 1986, 12
cholera in the area around Broad Street, London. cases of Legionnaire’s disease were identified at a
Prior experience of cholera epidemics in 1831-2 Johannesburg hospital. Based on the knowledge
and 1848-9 had led him to believe that cholera that the incubation period is two to 10 days, it was
was transmitted by contaminated drinking water, determined that two cases definitely acquired the
but his theory did not have much support at the disease in hospital, while five other cases might
time. An outbreak of cholera in his parish in 1854 have acquired the disease in hospital. Investiga-
gave him the opportunity to test this hypothesis. tions were carried out to locate a hospital source.
In his initial investigation, he obtainedalist of all A spot map was drawn of all hospital cases by
reported cholera deaths in the three affected sub- ward, and as water is known to bea reservoir for
districts the week ending 2 September from the L. pneumophila, the causative agent, water was
General Register Office. He plotted them on a cultured from all hot water outlets in the hospital.
street map and found remarkable clustering in Although L. pneumophila was cultured in water
the area around the Broad Street pump (Figure from three outlets, the distribution of these out-
21.2). He took specimens of water from the Broad lets did not correspond to the distribution of
Street pump daily and noted that the water devel- cases, so they were disregarded as the source. It
oped a surface film and an offensive smell if left to was noted that six of the seven potentially hos-
stand for a few days. He persuaded the parish pital-acquired cases had been ventilated.
Board of Guardians to remove the pump handle Attack rates in intensive care unit (ICU) pa-
on 8 September to prevent further cases from this tients were calculated. according to whether or
source. A more extensive investigation in Novem- not they had been on a ventilator. The attack rate
ber that year, comprising a door-to-door survey was 35% in ventilated patients, compared to 2%
of all households and businesses in the area, in ICU patients who were not ventilated. This im-
further implicated the Broad Street pump: of the plicated ventilators as the most likely source of
Broad Street residents, the cholera attack rate was cases who acquired the disease in hospital, a find-
80/137 (58%) in people who were reported to ing in keeping with known modes of transmission
have drunk water from the Broad Street pump, of this condition.
compared to 20/297 (7%) of those who did not
drink water from this source. Notable exceptions Reference:
also incriminated the pump as the source: one Strebel P M, Ramos J M, Eidelman IJ etal. ‘Legionnaires’
case occurred in a woman who lived outside the disease in a Johannesburg teaching hospital: Investigation
area, while 70 workers in a Broad Street brewery and control of an outbreak.’ South African Medical Journal
were unaffected. Investigation revealed that the 1988; 73:329-33.
204
22 Estimating
immunization
coverage
Numerator Denominator
chronologically and dates of vaccination entered interview and who are therefore excluded from
across the page. Any additional children not born the survey may represent a select group whose
in the clinic catchment area are entered when immunization differs from those who are at
they present to the clinic for services. Children home;
that move out of the area are deleted from the immunization records may not always be avail-
register. These registers are used to trace de- able;
faulters, but also provide a ready means of calcu- the complex study design may affect results.
lating clinic-specific coverage for any birth
cohort. The advantage over the administrative
method is that the numerator arises directly out EPI sampling
of the denominator, yielding real coverage per- The Expanded Programme on Immunization
centages which cannot exceed 100% (Table 22.2). (EPI of the WHO has distinguished itself in the
These estimates, however, only reflect coverage application of a practical sampling technique,
in patients who are known to the clinic, and extra- uniquely designed for coverage assessment. The
polation to the population is possible only in EPI methodology was developed with a very spe-
stable areas where the clinic is easily accessible to cific purpose in mind — that of enabling health
the whole community, and is used by the whole service staff (as opposed to researchers) to obtain
community. a reliable coverage figure (a simple proportion)
(Henderson, Sundaresan 1982; Lemeshow,
Robinson 1985).
Community-based surveys The derivation of the 30 x 7 sample (that is, 30
A survey which goes into the community to deter- clusters of 7 children) was arrived at by fairly
mine coverage has the inherent advantage of straightforward extrapolation from simple ran-
measuring the numerator (those immunized) and dom sampling (see Appendix 4).
denominator (both the immunized and non- The WHO EPI manual Evaluate Vaccination
immunized) at the same time. Although signi- Coverage has proven extremely useful as a prac-
ficantly more expensive, community-based sur- tical guide to the techniques involved. First, the
veys can provide coverage estimates that are population is divided into convenient groups or
more representative than those arising from rou- clusters (villages, street blocks). These divisions
tine sources (Table 22.2). are then weighted according to estimated popula-
However, biases and other obstacles to accu- tion size. Thirty clusters are then selected, using
rate assessment can still occur; for example: simple random sampling. Within each selected
children who are not at home at the time of the cluster, a starting point for the survey is then
206
22 = estimating immunization coverage
(Adapted from a presentation by Dr P Strebel at the National EPI review seminar in Pretoria on 21 September 1994.)
proportion of recent migrants (see Example 22.2). items are children and the lots areas. This method
Lot quality assurance sampling (see below) also allows one to determine whether immunization
provides a useful method of determining sub- coverage in a small area is acceptable or not, but
region coverage status. does not estimate the actual coverage rate. It is
The EPI sampling technique was originally de- ideally suited for smaller urban areas where the
veloped for rural areas in developing countries base population of different strata is too small for
and is not always ideal. Where relatively few chil- the EPI cluster method (Rosero-Bixby, Grimaldo,
dren of the appropriate age group are present ina Raabe 1990; Lanata, Black 1991; Cutts 1991, 1994).
community, the standard technique of going
door-to-door may prove very tedious. In densely House-to-house sentinel population
populated urban and peri-urban areas, simple investigation
random or systematic sampling is feasible. Ad- In urban areas with limited resources, lack of
aptations to the EPI sampling technique have available expertise, and poor census data, health
been successfully implemented, and are de- services may focus on specific problem zones
scribed in the Examples. where most cases of disease occur (sentinel popu-
A useful adjustment to the EPI sampling lations). Door-to-door investigation of all chil-
methodology has been Kok’s modification used dren in these defined areas can be undertaken
in Kenya for dispersed populations (Kok 1986). It rapidly (as during an outbreak), or at convenient
uses primary school lists as a sampling frame to intervals by health service staff with minimal
target areas where children live, and then goes to training. As no sampling is required and analysis
neighbouring households, so avoiding the direct is done by hand, this approach to estimating im-
bias of selecting children who have siblings in munization coverage in problem areas has
school. Kok’s original assumptions were the fol- proven to be less costly than EPI cluster sample
lowing: surveys of equivalent size (Weeks, Barenzi,
@ the early primary school child is ‘the most Wayira 1992).
randomly and proportionally distributed regis-
tered sampling unit in the community’; Measurement considerations
¢ school attendance should be (‘arbitrarily’) not
less than 70% (this may be a problem in isolated Attention must be given to fieldworker training,
rural areas of South Africa); and questionnaire design, and definition (determina-
@ there should not be certain groups who are tion) of vaccination status. Questionnaires which
excluded from schooling. are unambiguous and generate useful variables
should be used. Variables of interest are receipt or
This method, which was tested in the Machakos not of vaccination, type, number, and age at
district of Kenya, has proven a useful tool for EPI which vaccines were received. Age should be
internationally, and some South African re- determined using date of birth. If date of birth is
searchers have put it into practice. unknown, the researcher should be aware that
Another modification involves the use of primi- there may be a preference for a given month (for
parous mothers for assessment of vaccination example, January) in the responses. Any age in-
coverage. Though the reason for choosing primi- accuracies could have an important effect on age-
paras is not apparent, the method does seem to specific immunization coverage calculations. The
have potential merit if extended to include all training manual for EPI coverage methodology
mothers giving birth at a particular health facility. recommends surveying children between 12 and
The conditions under which this method would 24 months of age as a measure of immunization
give reliable results would be if the home delivery efforts over the preceding year. Information on
rate was low, and follow-up excellent. where vaccination took place may be useful from
amanagement point of view.
Lot Quality Assurance Sampling (LQAS) As in all studies, fieldworkers should be well
Details of this sampling method are given in trained and motivated before data collection
Chapter 16. The basic principle is that a certain begins, and periodically checked while in the
number of items from a lot (batch) is tested and field. Careful documentation of information on
the whole batch ‘rejected’ or ‘accepted’ on the vaccination cards is vital, and workers should
basis of the number of defectives in the chosen know exactly what each entry means. The import-
items. In the case of immunization coverage, the ance of accurate recording should be empha-
208
22 _~=—sestimating immunization coverage
sized, as should the need to encourage parents to recommended that researchers make the effort to
find cards which they have ‘temporarily mis- familiarize themselves with the problem (Hen-
placed’. The response rate should be improved by derson, Sundaresan 1982). A minor adjustment
revisiting locked dwellings at a later stage. For this can be made to the chi-square test that consider-
purpose, and also for quality control inspection, ably improves the accuracy of the test.
the name and address of each family should be Work done at the MRC has concentrated on a
carefully recorded. method called the ‘bootstrap’ technique (Rei-
Vaccination status can be determined by vac- nach, Groeneveld, Neethling 1990). This method
cination information on the Road to Health card, requires much computing for each data set, as it
clinic records, in some cases school records, involves numerous resamplings (with replace-
physical signs such as BCG scar, or a verbal report ment) from the data. Unbiased estimates of the
by the respondent (usually the mother). Ignoring population variance (or any other parameter) can
the subjective verbal report clearly leads to an be ‘obtained. Methods are now being sought to
underestimation of coverage in absolute terms; streamline this technique.
yet the advantage of having a reproducible mea- In short, if the researcher is interested in ana-
surement cannot be overemphasized. Depending lysing anything beyond simple coverage propor-
on how immunization status is defined, different tions from a cluster sample, detailed statistical
coverage estimates may be derived. In most sur- assistance will be required during both design
veys, immunization coverage is estimated using and analysis. For more detailed information
all children with vaccination information accord- about the methods described above, refer to
ing to a health card as the numerator and all chil- Berry and Yach (1991), and for their application in
dren as the denominator. This can be considered South Africa, see Berry, Yach, Hennink 1991,
a worst-case estimate (since all children for Coetzee et al 1990, and Coetzee, Ferrinho,
whom a card cannot be shown are categorized as Reinach 1993.
not immunized). A best-case estimate would be
using the same numerator as above, and all chil- Conclusion
dren with cards as the denominator. eee
The regular estimation of immunization coverage
both these figures should be reported.
by health services forms the basis of immuniza-
Immunization data should be collected so that
tion programme evaluation. Only if these es-
coverage can be calculated for individual anti-
timates provide accurate data on specific sub-
gens, and overall coverage for immunizations
regions can they enable programme managers to
given on time (within 28 days of due date accord-
direct preventive efforts to areas with the greatest
ing to schedule). Coverage should also be calcu-
lated according to immunization status at 12 potential for disease transmission.
months of age, and at the time of the survey. The
standardization of coverage calculations accord-
ing to these criteria will facilitate comparisons be- EXAMPLE 22.1
tween areas and over time (Ferrinho, Buch 1991). Adaptations to the EPI sampling
strategy in the Mosvold health ward
Analysis and the Free State
When complex sampling methods (such as clus- A study to determine vaccination coverage in
ter sampling) are used, there is a need to correct Northern KwaZulu-Natal involved the use of a
for the design effects at the analysis stage. Simple 1:50 000 grid map. Numbers were allocated to
quantities and proportions are minimally biased each 5 x 5,5 kilometre square in proportion to
by most complex designs. However, the calcula- homestead density. These figures were then used
tion of confidence intervals depends on the to ensure that probability of inclusion in the study
design. would be proportional to population size. Sam-
Similarly, using tests and measures based on "pling began at a point on the road nearest to the
simple random sampling — such as the chi- centre of the square. This particular version of the
square test for association and the odds ratio as a EPI-type study is very useful in rural settings with
measure of the strength of association — may widespread populations.
give biased results because of the complex design. Chapman made use of a very similar method in
The differences can be dramatic, and it is strongly his survey of Free State farms to determine pri-
209
e epidemiology applied to content areas
ceria ripen | ae atin een A RR a peA Ot OR
mary health care coverage and the health status fied using standard EPI methodology. The larger
of Africans living on farms. The first stage of the sample of 450 (45 clusters of 10 each) allowed for
study used census data to allocate a number of post-sampling- stratification into formal and
clusters to broad areas. A 1:250 000 map was used informal dwellings.
for the second stage, but here millimetres along Immunization coverage was lowest in children
vertical and horizontal axes were used for select- from informal dwellings. Because the average
ing a random starting point. The farm located number of people is higher in informal dwellings,
where this point fell would be visited, and dwell- the use of dwellings as uniform sampling units
ings would be sampled consecutively, starting at probably resulted in under-representation of
the one nearest the farmhouse. An improvement these households.
may have been to use a more random method of In Khayelitsha, coverage was estimated in 12 to
selecting the first house. 23-month-old children living in three geo-
graphically distinct strata: permanent housing,
References: temporary housing, and a transit camp.
Buchmann EJ, Ngesi N, Tembe R, GearJ SS, lJsselmuiden C A sampling frame was developed by mapping
B. ‘Vaccination status of children aged 12-23 months in the the houses in each stratum during community
Mosvold health ward of KwaZulu.’ South African Medical surveys preceding the coverage survey. Stratified
Journal 1987: 72:337-8. multistage sampling selected 46 cluster starting
Chapman RD. ‘The Orange Free State experience. Deter- points, with allocation proportional to the estim-
mining both primary health care coverage and the health ated number of houses per stratum. From each
status of blacks living on farms.’ CHASA Journal 1990 starting point, 10 children were selected using
1(1):30-36. standard EPI methodology. The larger sample
size (46 clusters of 10 children each) allowed for
the estimation of stratum-specific coverage. Chil-
EXAMPLE 22.2 dren living in temporary housing had the lowest
Immunization coverage estimation immunization coverage, and were most likely to
be recent immigrants from the rural Eastern Cape.
adapted for use in peri-urban
settings
A study to evaluate the success of an outreach References:
immunization programme in Alexandra town- Coetzee DJ, Ferrinho P, Reinach S G. ‘A vaccination survey
ship used aerial photographs to define a sampling using EPI methodology to evaluate the impact ofa child
frame. The whole township was divided into 108 health outreach programme in an urban area of South
blocks (120-150 houses each), of which 45 were Africa.’ WHO Bulletin 1993; 71(1):33-9.
randomly selected. Within each of these blocks, a Coetzee N, Yach D, Blignaut R, Fisher S A. ‘Measles vaccina-
random starting point was selected, from which tion coverage and its determinants in a rapidly growing peri-
point 10 children (12-23 months old) were identi- urban area.’ South African Medical Journal 1990; 78:733-7.
210
23 Environmental
epidemiology
211
e epidemiology applied to content areas
The nature of environmental The actual harm that can result from an expos-
diseases ure is related both to its toxicity and the cumu-
lative dose to the target organ. The actual effective
Despite well-documented cases of environmental dose is the amount of toxic material which
disasters, accidents, and spills (for example, reaches the critical target organ in the body. It
Bhopal, where tens of thousands of people were depends on the concentration in the environ-
poisoned in 1984 by methyl isocyanates, or Spain, ment, as well as on factors related to uptake, such
where 13 000 people were poisoned in 1981 by as the route of entry into the body and the phys-
chemically contaminated cooking oil), the contri- ical and chemical forms of the toxic substance.
bution of environmental exposures to the causa- Almost any chemical could cause harm if taken
tion of many diseases often goes unrecognized. into the body in sufficiently large amounts. How-
ever, in environmental health, one is often con-
Environmental health outcomes cerned with those chemicals which may have
adverse health effects in small doses.
Types of outcomes In conducting risk assessments necessary for
Outcomes (effects) in environmental health stud- setting safety standards, it is particularly import-
ies may include death, specific defined diseases ant to establish the nature of dose-response rela-
(for example, lung cancer), disease categories tionships (or exposure-response relationships).
(respiratory illnesses such as pneumonia, Whilst toxicology has an important role to play,
asthma, and bronchitis), symptom complexes problems involved in extrapolating from animals
(coughing and wheezing) and biochemical/ to humans, and from high-dose to low-dose situ-
physiological changes which may not necessarily ations, make it necessary to rely on well-
result in symptoms. In fact, a whole hierarchy of conducted epidemiological studies.
health effects may occur, ranging from minor In general, the toxic effects of pollutants can be
temporary ailments through to acute illnesses divided into broad classes such as acute toxicity,
and chronic disease, with relatively resistant and chronic toxicity, cancer effects (carcinogenicity),
susceptible people at either extreme of the distri- birth defects caused by substances (terato-
bution. Thus there may be different exposure- genicity), and genetic effects (mutagenicity). One
effect relationships for different subsets in the distinguishing characteristic between these cate-
population. gories is the fact or assumption that dose thresh-
Ill-health effects of environmental exposures olds exist for acute and toxic effects, but these
may occur over a short or long period, they may thresholds do not exist, or have not been demon-
be reversible or irreversible, they may increase or strated for carcinogenic effects.
decrease over time, and they may be continuous
or temporary. Effects may be acute — for exam- Detection of environmental health outcomes
ple, they may follow quite soon after exposure Case detection in environmental studies can be
(often a single major dose) to an irritant. More subject to selection bias if members of a group are
often, however, chronic disease occurs as a result aware of their exposure status and its association
of cumulative exposure to complex mixes over with the outcome of interest. Self-reports of
long periods. Examples include asbestos expos- symptoms are often used as a health outcome in
ure and mesothelioma, and radiation exposure studies with uncertain exposures and few or no
and leukemia. The dispersal of the population at objectively measurable health effects. Potentially
risk over time, and the long incubation period, exposed groups may report outcomes at an in-
make it difficult to reconstruct exposures. For creased rate compared to a non-exposed group,
example, cancer from exposure to asbestos may which may result in over-reporting of cases
only develop some 20 years or more after the ini- among the exposed group. Examples of such situ-
tial exposure, during which time the original ex- ations include communities worried about toxic
posed population may have moved and become waste-site exposures, and office workers con-
exposed to other risk factors. Tracing the original cerned about indoor air quality. In such in-
population in such cases can be extremely diffi- stances, it is necessary to distinguish between
cult and sometimes impossible. Acute health demographic and other factors known to be re-
effects are thus usually easier to detect than lated to symptom reporting, over-reporting of
chronic health effects, which may be difficult to symptoms due to environmental worry, in-
relate to specific hazards or sources. creased symptoms due to stress caused by
Ze,
23 environmental epidemiology
environmental worry, and increased symptoms measurement of the pollution levels currently
due to environmental exposures. emanating fromatoxic waste site will reflect the
Under-reporting of cases (particularly among current exposure situation, but exposure 10 years
rural Africans in South Africa) could also produce earlier may have been quite different (because
spurious results in environmental epidemiologic different materials were being dumped, or be-
studies, depending on which exposures and cause of subsequent environmental modification
which geographic areas are studied. and migration of the original pollutants). Prob-
Frequently, when we talk about low-level lems may also arise because the measurement
environmental exposure, we are concerned with technology may have changed over time, with
effects which may occur in a very small percent- current technology increasingly able to measure
age of the population only. This may necessitate lower levels.
very large-scale epidemiological studies. In such
cases, it may be advantageous to carry out multi- Multiple agents
centre or even multi-country studies. To increase A large number of causes may contribute to most
the probability of detecting a real effect in a small environmental diseases. The various contributing
population, researchers could follow the popula- causes (confounding factors) may obscure the
tion up for many years to accrue sufficient contribution of any one particular factor. For
person-years of risk. A further alternative is to example, the contribution of sources in the
investigate a number of disease symptoms re- domestic (home) environment, the local or com-
lated to a particular exposure. munity environment, and the regional environ-
In other cases, small area studies are needed, ment (climatic zone, geographic latitude and
particularly where clusters of rare diseases are longitude) may need to be assessed. It is impor-
investigated with respect to their possible links tant to assess an individual’s total exposure in the
with environmental conditions. Routine data on various environments in which exposure may
health outcomes may be very limited, which take place, as well as to identify other substances
means that other data sources must be used. Also, that may modify its effects.
available data may only cover large geographical
areas, which are usually not subject to undue ex- Interactive effects
posure to environmental contamination from a Low-level chemical exposures in particular are
localized source. often factors which play a contributory rather
than a primary role in the causation of disease.
Exposures may be interactive, resulting in a
Exposure estimation reductive effect, an additive effect, or a synergistic
effect (where the combined effect is greater than
General exposure issues the sum of the individual effects). For example,
The validity of studies in the field of environ- the incidence of lung cancer is very high in ura-
mental epidemiology depends to a large extent on nium miners and asbestos workers who are
the assessment of exposure. This is one of the smokers, but may be less significantly elevated
most difficult areas in this type of research, due to among workers with similar occupational expos-
the problems involved in the definition and ures who are not smokers. A number of host and
measurement of exposure, as there will usually be environmental factors can modify the effects of
a degree of misclassification of exposure. If the toxic substances.
probability of misclassification is the same for
those who develop a specific disease and those Multiple sources and pathways
who do not, the effect will usually be to dilute the In environmental research, multiple sources and
differences between the groups. pathways of exposure are frequently encountered
There are two dimensions to estimating expos- (Figure 23.1 illustrates the multiple sources and
ure, namely the concentration level and the dura- pathways of lead exposure). One difficulty is the
tion of exposure. In investigating acute effects, ‘general lack of knowledge about many aspects of
the current exposure concentration level may be the behaviour of environmental pollutants. .In
adequate, but in studying chronic effects (after a order to estimate exposure accurately, it is
long exposure period, or long latency period), important to know how toxic agents enter into the
past exposure concentration levels are important, environment, whether they become dispersed or
as well as the duration of exposure. For example, reconcentrated, and what the pathways of human
213
e epidemiology applied to content areas
Direct inhalation
!
Route (2) Food
od
exposure are. Exposure to chemical substances at the outlet of a sewerage works. This, however,
are normally mediated by complex environ- may have little significance in terms of people
mental pathways, and more than one route may who might actually be affected by such emissions,
contribute to human uptake. Often one pathway since they are unlikely to live in the immediately
contributes the major proportion of the pollutant. exposed area. On the other hand, ambient air
Should this critical pathway not be identified, the quality monitors may be placed at the top of
multiple pathways contributing to the total ex- multistorey buildings where air quality may be
posure must be carefully assessed. Adequate con- considerably better than at the actual breathing
trol measures can only be applied when the rela- level of people on the ground. Normally only a
tive importance of the various routes of exposure few areas and a few substances are monitored.
has been established. What is generally required for epidemiological
studies, however, is detailed knowledge of tem-
poral and spatial variability of concentrations of
Environmental monitoring exposures. In the case of many pollutants, there is
Research data is often obtained from local, a sharp decrease in the concentration level as one
provincial, or national monitoring programmes, moves away from the source. There may also be
which are normally carried out for regulatory pur- vertical variations in the source, and, as illus-
poses and not for exposure-related studies. Regu- trated in the example above, air sampling points
latory environmental data is collected with the placed considerably above breathing level may be
aim of detecting peak levels to assess compliance safe from vandalism, but are inappropriate for
with standards or guidelines. Data collection is use in population exposure studies.
often done at places of little relevance in terms of When variations in pollutant levels are consid-
human exposure. For example, an air monitor erable (for example, in most air pollution studies),
may be placed in the immediate vicinity of a par- it may be more appropriate to use personal air
ticular factory, or water quality may be monitored samplers or filter badges rather than stationary
214
23 = environmental epidemiology
artigw ea Ce Dice)
2000
1600
1200
800
12
Hour of day
air samplers. These have the advantage of being the average exposure level may be important,
mobile and have the potential to measure an indi- while in another, the use of peak values may be
vidual’s total exposure. If large populations are more relevant (for example, in assessing noise ex-
being monitored, however, such samplers may posure). Cumulative exposures may be signific-
not be practical on a large scale, and exposures ant in the assessment of, for example, radiation
can be characterized at group level using station- exposure, where multiple exposures may be
ary samplers. largely cumulative.
As spatial variations in pollution levels pose
specific sampling problems in terms of where to Biological monitoring
sample, so variations in the level of pollution over There is a growing interest and increasing re-
time (for example, seasonal and daily variations) search into biological and biochemical markers of
pose problems of when and how often to sample. exposure (for example, DNA adducts), which
Pollution levels in Soweto, for example (see should improve the effectiveness of exposure
Figure 23.2), are highest in the early morning and assessments in the future. Biological monitoring
evening, when coal-burning activities peak. Thus aims to estimate the level of specific chemicals at
temporal variations in the level of the pollutant the critical target sites where their toxic effects are
will have implications for the frequency and type initiated. This level is the biologically effective
of monitoring. Should measurements be made, dose or the internal exposure level. Concentra-
for example, on a continuous basis, several times tions of chemicals in body fluids or excreta may
a day, once a week, monthly, or at some other be measured, as can effects such as mutations,
time interval? The biological variation of the chromosomal aberrations, or sister chromatid ex-
pollutant in the human system, for example, changes induced by chemicals in cells and tissues
whether or not rapid absorption and excretion of exposed individuals. Blood and urine are com-
takes place, complicates the matter further. monly used for biological monitoring, as well
Ultimately, the exposure data for the study must as hair (for example, to detect mercury or lead
be summarized: the choice of an appropriate concentrations), nails (arsenic), teeth, bone
summary measure may be critical to the ultimate (lead), faeces and breast milk (organochlorine
understanding of the exposure. In one instance, pesticides).
215
e epidemiology applied to content areas
216
23 = environmental epidemiology
by telephone, to ascertain whether any gastro- discipline which has undergone rapid develop-
intestinal, respiratory, or related symptoms had ment in recent years because of increasing con-
developed in the two or three days after the beach cern about deteriorating environmental quality
outing. Illness rates were examined in relation to and its effect on human health. Environmental
swimmers and non-swimmers exposed to varying epidemiological studies have become increas-
degrees of water quality, assessed by various ingly more sophisticated and refined, with more
microbiological indicators. The study will be used emphasis and attention given to exposure assess-
to set recreational water quality standards for ment, health outcome definition, and study
South Africa (Von Schirnding, Strauss, Kfir, Ro- design, as discussed above. While some studies
bertson, Fattal, Mathee, Franck, Cabelli 1993). and approaches are complex and costly to con-
Experimental studies are rarely conducted in duct, demanding expensive environmental and/
environmental epidemiology nowadays, except or biological monitoring programmes, this is not
in situations where it is possible to try out some always the case. Much progress has been made
corrective action in reducing exposure. Such an with relatively simple ‘descriptive’ studies, which
intervention might involve the treatment of a have led to important observations and theories
water supply, or the addition of scrubbers to a about the nature of environmental exposures and
plant stack, or the removal of soil and control of associated outcomes. It is hoped that this field of
dust in mitigating environmental lead exposure. epidemiology will continue to receive the neces-
sary support in the future, as it is critical that the
results of such research be used to form the foun-
Conclusion
dation of future regulatory policies.
Environmental epidemiology is an emerging
ONE
24 Occupational
epidemiology
218
24 occupational epidemiology
Sampling
Methodological issues in occupa- Sampling is mostly unnecessary due to the small
tional epidemiology size of surveyed populations, which usually
number several hundreds, and in some cases, a
Study design few thousand. Should it be necessary, a sampling
All survey designs are used, but some are particu- frame is always readily available, subject to diffi-
larly common and appropriate. The commonest culties in tracing workers who have left work for
type of study is the cross-sectional or prevalence various reasons.
study. This is particularly so in developing coun-
tries such as South Africa, where it is difficult to Measurement
retain contact with a cohort because of the in- The ability to characterize exposure accurately is
stability of working (often migrant) populations the great strength of occupational epidemiology.
with low levels of skill. Compared with much general epidemiology, ex-
Case-control studies are also common, but posures are very evident. It is also usually possible
these and cohort studies are not frequently en- to get some measurable handle on these expos-
countered in South Africa. One variant of the ures. Exposure categorization into qualitative
latter is the retrospective cohort study which is (yes/no), semi-quantitative (high/medium/low/
often found in occupational epidemiology, as it no) and quantitative variables is almost always
meets the criterion that exposure information (in possible.
this case, workplace records) must exist prior to Exposure indices can thus be generated. This
defining the historical cohort. Intervention stud- allows quantitative risk assessments to be made
ies are also easy to do and are becoming popular by generating measures of association between
due to increasing emphasis on evaluation .of exposure and outcome and also gradations of
occupational health measures and services. association, in particular exposure-effect (con-
Experimental studies in most areas of occupa- tinuous outcome variable) and exposure-
tional health are prohibited by ethical considera- response (categorical outcome variable) relation-
tions, although some laboratory experimentation ships.
is done on healthy volunteers to gauge acute Some caution is needed with respect to expos-
effects of common occupational exposures. Some ure. Internationally, serious industrial hygiene
people, however, feel that ‘experiments’ go on all practices only date back a few decades, and South
the time in the workplace in the sense that Africa is particularly far behind in this regard.
workers are knowingly, and with legal sanction, Where exposure measurements are available in
repeatedly exposed to known toxic and carcino- industry, they often focus exclusively on high ex-
genic substances. Two examples are the carcino- posure areas. This results from the typical indus-
gen asbestos and the asthma-causing agent trial hygiene approach, which is concerned prin-
toluene disocyanate, to which spray painters are cipally with whether environmental exposures
exposed. are below legislated occupational exposure limits.
One factor that particularly strengthens survey Such compliance-oriented methods are not ideal
design in occupational epidemiology is the ready for accurate estimation of exposure levels.
availability of control groups. These are mostly Despite these problems, health effects abound
internal (from the same factory population) con- because exposures are usually so gross in devel-
trols and are generated on the basis of objective oping countries, and may be detected relatively
exposure measurements. Internal controls are easily, even in rather small groupings. Exposure
generally superior to external controls, as control data, however imperfect, may exist in company
and index group subjects are relatively homo- records, or may be reconstructed with relative
geneous with respect to most factors other than ease by performing current measurements during
the exposure of interest. unchanged production processes, or by simula-
The identifiable nature of the group is another tion of exposure conditions in different historical
strength of occupational epidemiology. Hypo- phases of production.
219
e@ epidemiology applied to content areas
Precision is generally not a problem due to the estimation by the long induction periods charac-
possibility of easy and repeated access to persons, teristic of most occupational diseases. This takes
records and processes, without serious logistic on particular importance because so few longit-
implications in taking multiple measurements. udinal studies are done and there is a heavy reli-
ance on historical exposure data. Recall bias is a
Qualitative methods form of the more general information bias.
Qualitative research is becoming increasingly Strong confounding biases frequently operate
important in the workplace. It may be used to in industry. There is a common tendency for
determine perceptions of risk and to help prior- those most heavily exposed to hazards in the
itize problems and solutions. It may also be used workplace to have unhealthier lifestyles, which
to identify sources of potential resistance to frequently cause the same adverse health out-
strategies for improving occupational health comes as the workplace exposure. Workers ex-
practice, as well as auditing occupational health posed to dusty workplace environments tend in
services. Such methods are particularly appro- general to be heavy smokers as well. Both these
priate in the context of different workplace cul- exposures cause lung diseases. Another con-
tures, notably those of management and workers. founding factor which may be associated with
Groups within the workplace may have very dif- dust exposure and respiratory ill-health is pre-
ferent ideas about risk and priorities. In these and vious or current tuberculosis infection.
other ways, qualitative methods can help shape Internal validity of occupational epidemiology
what is measured and how it is measured in the studies is usually very good, while external valid-
study of work-related phenomena. ity or generalizability is more complicated due to
the obscure object of inference. For example, ina
Biases community-based survey, ifa sample from a large
Occupational studies are subject to all the same general population is selected and a determina-
generic biases as other epidemiologic studies. tion made of dust and fume exposure at work,
Certain biases appear to be more specific to occu- together with respiratory symptoms and signs
pational epidemiology, such as the ‘healthy and pulmonary function decreases, estimates of
worker effect’, particularly for cross-sectional the ‘true’ association in the general population
studies. In South Africa, because unemployment from which the sample was drawn can be deter-
is such a problem, workers tend to stay with a job mined. In occupational health, on the other hand,
even if it is affecting their health. Health effects in an entire workforce is studied. But what do the
most industries tend to be gradual and chronic measures of association obtained in such a study
and do not lead easily to workers departing. This estimate? What is being estimated is the true
is, however, just a form of the more general se- parameter of association in a hypothetical popu-
lection bias; subjects who are part of the group se- lation of all similar workers working in similar
lected for study tend not to include those who circumstances!
have left because of ill-health due to the exposure
of interest, or some other associated exposure at Analysis
work, and who are thereby lost to follow-up. The Various problems arise as a result of the relatively
group that is followed appears ‘healthier’ because small number of subjects in a typical study.
it consists of a disproportionate number of those Simple stratified analysis is not practical in these
who have survived ill-health caused by the study groups if it is necessary to look beyond the
exposure. This bias is very difficult to adjust for exposure (for example, dust), the outcome of
and bedevils most studies, whatever the design, interest (for example, chronic obstructive pul-
where follow-up is incomplete. Another form of monary disease) and one other covariate (for
this relates to the use of controls from the general example, smoking), and the number of subjects in
population who are by definition ‘sicker’, in that each stratum can thus become very small. Other
they may not be employed or employable. Use of relevant factors may be age, previous tuberculosis
such controls will dilute the effect of the pre- infection, and environmental exposure to sul-
sumed exposure. phur dioxide or domestic fuel combustion pro-
Another specific problem relates to recall bias ducts.
for exposure, which may be driven in the direc- However, there has been substantial devel-
tion of overestimation in the hope of obtaining opment in readily available statistical modelling
compensation, or in the direction of under- methods, which allow more precise parameter
220
24 occupational epidemiology
estimation from relatively sparse data than strati- mesothelioma, which is almost exclusively
fication analysis offers. caused by asbestos). The adverse health out-
Multivariate methods are well represented in comes typically studied may be produced by
occupational epidemiology literature. One limita- many different causative agents, both non-
tion of these multivariate methods is that in- occupational and occupational, in many differ-
creased precision is achieved at the expense of ent industries. Lack of specificity has had some
possible increased bias of parameter estimates. difficult consequences for those disabled by
The choice ofa particular model always involves occupational diseases. This is partly due to the
assumptions about the shape of the exposure- persistence of the old-fashioned notion of indus-
effect relationship, which may be incorrect, trial disease having specific causation. Modern
giving rise to a biased estimate. concepts of work-related diseases that have mul-
Indirect standardization (see Appendix 1: Stan- tiple and complex etiologies have been slow to
dardization), which yields the Standardized take on in the minds of health professionals and
Mortality (or morbidity) ratio (SMR), is the most administrators in state departments responsible
frequently used comparative measure in longit- for compensation. Attribution of risk for occupa-
udinal occupational mortality and cancer incid- tional disease to particular exposures or condi-
ence studies, because it provides a more stable tions is therefore an important part of occupa-
estimate when the study groups are relatively tional epidemiology. Simple stratified analysis
small. This has the disadvantage that it may not can yield estimates of attributable proportion or
be used for comparisons between different ex- etiologic fraction due to the effect of the exposure
posure categories within the study group, but in question. This has also taken a step forward
only between the index exposure category and since the advent of modern computerized statis-
the standard population from which the standard tical methods for investigating the partial effects
rates are drawn. This unfortunately means loss of of some occupational risk factors, while adjust-
ability to accurately and meaningfully estimate ing for several other occupational and non-
the exposure-outcome relationship in a situation occupational characteristics and exposures.
where categorization of workers by exposure is
possible. . Ethics
Association Ethical issues are generally more complex in
Applying Bradford Hill’s criteria for establishing occupational epidemiology than in general epi-
causality of an association (Bradford Hill 1965), demiology, and questions of confidentiality and
causal interpretation is enhanced in occupa- use of information generated are best addressed
tional epidemiology by strong associations and before conducting research.
exposure-response relationships (biological gra- These considerations favour participatory re-
dient). Additionally, because occupational search designs and clearly understood agree-
health research is efficient in terms of numbers ments, preferably in the form of written contracts,
of subjects.and resources expended, establishing between relevant parties at the workplace and re-
the consistency of findings by being able to searchers. For instance, there is considerable
repeat the study in other circumstances is en- potential for victimization of workers, or more
hanced. Time ordering of exposure-disease subtle forms of discrimination (including job
events is usually no problem in occupational loss) for those who participate in surveys. This
epidemiology. It is unlikely that a lung disease could be due to the discovery of occupational or
would cause someone to expose themselves to coincidental non-occupational illness. The re-
dust — the contrary would be more likely to searcher should, however, retain a clear right to
apply. Biological plausibility is usually provided publication of results.
by a well-developed international toxicology The application of cost-benefit analysis raises
database and existing theories of the mechanism important questions related to the recipient of
of disease causation. benefits and the payer of costs. These have fre-
On the other hand, specificity (where one quently been seen as different groups — workers
cause is uniquely responsible for one adverse pay costs and industry reaps the benefit. It is
health outcome), is not a strong suit in occupa- important that cost and benefit be derived for the
tional epidemiology (one notable exception is same subject.
221
e@ epidemiology applied to content areas
Resources and logistical issues sion, participative labour policies from factory
floor to parliament, and strong state involvement
Resources and funding possibilities present cer-
in occupational epidemiology and occupational
tain difficulties. They are more plentiful within
health generally. The Scandinavian countries are
the private sector, but with this relative abun-
the best example, and have occupational health
dance comes increased control over the epidemi-
systems that are light years ahead of anywhere
ologist. In particular, there is a strong ‘negative
else. By contrast, some developed countries such
findings publication bias’ as opposed to the ‘posi-
as the USA and Britain are sometimes hardly any
tive findings publication bias’ typical of general
better than South Africa when it comes to imple-
epidemiology. Many occupational health profes-
menting occupational epidemiology knowledge.
sionals working in industry are formally pro-
scribed by contract from publishing findings in
the scientific literature without express permis- Conclusion
sion from their employers. This once again raises The workplace has much to offer as an applica-
ethical problems. Occupational health does not tion area for epidemiology. The relatively well-
usually enjoy a high priority with public funding controlled occupational setting, well-defined
agencies, which tend to see occupational health study groups and exposures, industrial dyna-
as a private sector responsibility, thus posing mism and continual change provide many stimu-
additional difficulties for independent occupa- lating challenges. Because of these factors, de-
tional epidemiology research. spite the difficult ethical and social issues that
Access to problem areas is difficult and usually confront the epidemiologist, the workplace is
dependent on management goodwill. Where likely to continue to be a powerhouse of applica-
access is not blocked by those with vested inter- tion and development of epidemiologic method
ests, a participatory research methodology can and knowledge.
considerably enhance the quality of data and the
interpretation of results, by tapping the implicit
knowledge of workers about exposure, the inter-
pretation of analyses, and most importantly, the EXAMPLE 24.1
implementation of recommendations. Further- Brickfields study
more, participatory research methods are ideally
Brickmaking is a dusty process with potential
suited to an environment with an existing organ-
adverse respiratory effects. Brick dust is addition-
izational infrastructure (such as trade unions or
company health and safety structures) involved ally known to contain silica which causes lung
scarring or pneumoconiosis. Until this study, no
in the promotion of health and the prevention of
disease. Data may be cross-validated from differ- data existed for South African brickfields, which is
ent sources, and enthusiasm can be engendered fairly typical of much of South African industry. In
for the research process. This can help to lighten 1985, a cross-sectional analytic study was con-
the respondent burden, minimizing the provision ducted on all active workers (some 560 of them) in
of false information, removing fear of victimiza- four brickfields in the Western Cape over a two-
tion (in the case of TB, for example), and height- week period in order to determine the relation-
ening the possibility of efficient implementation ship between various dust components and res-
of recommendations. piratory disease in this setting.
Access for this study was gained as a result of
industrial conflict which had led to a strike. The
Implementation of findings health survey was one of the management con-
Implementation possibilities are potentially cessions to workers who had failed to obtain wage
strong, but depend critically upon socio-eco- increases.
nomic factors. Experience in occupational epi- Workers’ perceptions were essential in helping
demiology provides much insight into the limits to design a measurement strategy for dust at
of implementation of epidemiologic findings different points in the production process. Man-
generally, and highlights some of the social con- agement, medical staff, and workers all had
straints upon the development of occupational different subjective perceptions of dustiness at
epidemiology as a science. Implementation is various points. Additionally, the content of offi-
institutionalized in only relatively few countries. cial job descriptions varied widely within the
These countries have high degrees of social cohe- same job category when detailed information was
222
24 occupational epidemiology
obtained from workers. Details on exposures tive causative variables (exposures), while allow-
were collected qualitatively in pilot studies so as ing for simultaneous adjustment by known or
to ensure good quantitative measurement for the suspected confounding factors (age, smoking).
main study. Categorical outcomes (presence or absence of
Various components of dust in the atmosphere symptoms or pneumoconiosis) were analysed by
of the workplace could be identified and meas- multiple logistic regression and provided odds
ured, including total dust, respirable dust, and the ratios for dust exposure adjusted for confounding
respirable free silica fraction. This made it pos- factors.
sible to construct various dust exposure indices It was possible to show that neither smoking
for both current and cumulative exposures in nor age could explain excess respiratory symp-
workplaces for which dust profiles had been toms, X-ray pneumoconiosis, or decreased lung
gathered. A detailed interview schedule was ad- function. After adjusting for these known causes
ministered to all workers to obtain occupational, of ill health, there was a persistent residual effect
social, and health history data. Medical examina- of dust, the magnitude of which could be deter-
tions, chest X-rays, and lung functions tests were mined.
performed. Causal interpretation of the associations found
Occupational exposure histories were valid- in this study between silica dust exposure in the
ated against company employment records. As workplace and lung scarring was supported by
workers had little sense of which exposures the application of Bradford Hill’s criteria — most
caused which adverse health effects, they were of which were met.
unlikely to have provided biased data leading to There was no immediate implementation of
false associations. findings from this research. Due to industrial ten-
Outcome measures included continuous vari- sions during and after the study period, the
ables relating to lung function as well as categor- results were received with some hostility by man-
ical variables related to X-ray findings of lung agement. Conflict continued and some of the
scarring and respiratory symptoms. plants closed.
Ethical problems could have arisen because Nine years later, the principal author met the
many study subjects had very poor lung function, group medical officer for the brick company con-
and if their individual results had been made cerned at an occupational health conference and
available to management in the course of the discovered that many occupational health im-
study, they could have been discriminated provements had been introduced and that the
against. Therefore, before the study began, the results of the study had percolated slowly and in-
main stakeholders agreed that no one would be directly into improved practice over time.
retrenched on medical grounds, and all those
with conditions requiring medical attention References:
would be appropriately referred. Workers with Myers J E, Lewis P, Hofmeyr W. ‘Respiratory health of brick-
health problems were transferred to less dusty workers in Cape Town, South Africa: background, aims and
work, and workers’ compensation was initiated dust exposure determinations.’ Scandinavian Journal of
for deserving cases. Work Environment and Health 1989; 15(3):180-87.
In this case, no one was victimized, and the Myers J E, CornellJ E. ‘Respiratory health of brickworkers in
existence of a trade union representing workers Cape Town, South Africa: symptoms, signs and pulmonary
provided a safer environment for the researchers function abnormalities.’ Scandinavian Journal of Work
from an ethical point of view. Environment and Health 1989; 15(3):188-94.
During the analysis of the results, multiple Myers J E, Garisch D, Louw SJ. ‘Respiratory health of brick-
linear regression for continuous variable out- workers in Cape Town, South Africa: radiographic abnor-
comes provided partial correlation coefficients as malities.’ Scandinavian Journal of Work Environment and
measures of the explanatory power of presump- Health 1989; 15(3):195-7.
223
25 Disability studies
224
25 = disability studies
225
e epidemiology applied to content areas
_ to obtain a representative sample. Clusters of conducted in South Africa over the last five to 10
houses are selected and individuals within these years have used this framework.
households are interviewed. Strictly speaking, Usually the researcher needs to decide which
only one person per household should be inter- domain of disablement will be the primary focus
viewed after selection from among the others in of the proposed study. Will a person be defined as
the household by random techniques. However, ‘disabled’ if she or he has an impairment (such as
it is common for all the individuals within a inflamed joints) or a disability (for example, a
household to be studied, creating a second level locomotor disability) of some kind?
cluster (that is, the family), in that family mem- If, for example, a study sets out to measure dis-
bers are not selected independently. Given that ability, the research can investigate all types of
illnesses and disabilities may run in families, this disability or focus only on one (for example, loco-
approach may introduce a bias. Nevertheless, the motor disability). In general, surveys with a lim-
logistical gain of boosting sample size by count- ited focus are more common because specific
ing all the individuals in the households often professions (such as physiotherapists), medical
necessitates this approach. areas of specialization (such as orthopaedics), or
In Canada and New Zealand, community- agencies (such as the Organization for the Phys-
based prevalence was estimated using a national ically Disabled) may initiate the investigation.
census as the sampling frame. Everyone in the Clearly a limited focus is much easier and the
country was asked a screening question, and a screening questions (questions asked of all people
sample of positive and negative responders to the sampled in order to determine who needs to be
screening question were followed up to confirm investigated further) need to cover limited pos-
disability status. In the follow-up phase, a sub- sibilities only. Each particular disability may have
stantial proportion of positive responses con- its own series of screening questions, for example,
verted to negative, and vice versa. visual disabilities (‘Is there anyone in this house-
In studies where only a rough estimate of the hold who has difficulty seeing?’ plus others which
prevalence is needed, but more details are flow from this question) and locomotor disabil-
needed concerning needs and diagnostic/sever- ities (‘Is there anyone in this household who has
ity breakdown of the disabled in the community, difficulty moving?’). However, a drawback of sur-
service providers can attempt to identify all the veys with a limited focus is that sample sizes have
disabled cases by compiling a disability register. to be much larger to obtain precise estimates.
This is done by asking all key informants, This is because disablement in general is an un-
community bodies (for example, churches and common condition, and specific functional prob-
schools), and health services for the names of all lems such as communication disorders are even
the people with disability they know of. People more uncommon.
with disability and their families, in turn, provide Crude disablement rates alone are not a par-
the names of other people with disability. This ticularly useful measure. A crude rate of 12% gives
method is called networking and is excellent for us a figure but tells us very little that could be used
case finding, although not necessarily representa- in policy and service development. Impairment-
tive of all the people with disability in the com- specific rates (for example, the proportion of
munity. people in the population with amputation-
related disability) provide insight into the causes
of disablement, while age-specific rates provide
Measurement of disease
insight into the ages affected and allow for mean-
consequences ingful comparisons between populations.
228
25 = disability studies
order to broaden its service and identify disabil- with mild disability who had no occupational
ity-related problems. A proportional stratified handicap.
cluster sampling design was used with a sample All individuals (12,9%) who fell into the dis-
size of 500 households. The 82% household re- abled category from the first screening question
sponse rate yielded 2 424 individuals in 472 were followed up by experienced physiothera-
households. pists or occupational therapists, with 85% being
During Phase 1 of the data collection, rehabili- traced. The screening questions were repeated,
tation assistants interviewed a responsible adult and an in-depth interview concerning disability-
from each household. A census was taken of each related information was done on those who re-
household and two sets of screening questions ported to be unable or limited in the age-specific
were asked about household inhabitants. The key activities. A crude disability rate of 3,9% (95%
first question screened for all the categories of confidence limit 2,8% to 4,9%) was found. The
disability (such as movement, strange behaviour, authors considered the figure of 12,9% to approx-
or hearing) and also enquired about regular imate the impairment rate in the community, and
treatment and appointments for health prob- the figure of 3,9% to be the rate of moderate to
lems. If a positive response was given to any of severe disability.
these questions, a second screening question
was asked about the person’s ability to perform Reference:
key activities specific for the person’s age (for Katzenellenbogen J, Joubert G, Rendall, K, Coetzee T.
example, in the case of adults — ability to work; ‘Methodological issues in a disablement prevalence study:
in the case of children — ability to play). The Mitchells Plain, South Africa.’ Disability and Rehabilitation
second question tended to screen out people 1995; 17:350-57.
229
26 Psychiatric
epidemiology
230
26 psychiatric epidemiology
Options
Time factors and constraints Time frame for inquiry (e.g. lifetime, past
month); time needed for test administration
Prior application in study areas Has instrument been translated into language
of population of interest? Has it been
calibrated on that population?
231
e epidemiology applied to content areas
identifies possible cases. In the second stage, (such as functional impairments, disabilities, and
‘high-risk’ individuals are re-interviewed so that handicaps).
they may be confirmed as actual cases. Ifa sample Initiatives to- make psychiatric epidemiology
of those found to be negative on the initial screen- more valid have gone beyond calls for expanding
ing is also re-interviewed, the second stage also the focus of inquiry to investigating the way in
permits assessment of the validity of the screen- which people are classified. The traditional, cate-
ing instrument in the study population. gorical approach to classifying psychopathology
A number of instruments are available for is coming under increasing scrutiny. An altern-
epidemiologists to ascertain the psychiatric ative (though less commonly used strategy), is to
status of child, adolescent, and adult populations. adopt a dimensional approach where the focus is
One of the major factors influencing the choice of more on symptoms and symptom clusters than
outcome measures, and hence instruments, is the individual diagnoses. Other critics (for example,
research question. Is the researcher studying a Miller and Swartz [1992]), have argued that com-
particular age group (such as the elderly)? Is he or bining conventional epidemiological methods
she interested in studying a single type of dys- with qualitative methods will help to overcome
function (such as depression) or a range of prob- the limitations of using conventional epidemi-
lems? Is she or he interested in whether the ological methods to investigate the depth and
person or group under study has ever experi- complexity of psychological phenomena.
enced certain kinds of problems or symptoms
(lifetime prevalence) or whether they are cur- Measurement of exposures
rently experiencing them (current prevalence)? A In the mental health field, there are a number of
number of factors influencing the choice of which exposures which are associated with psychiatric
instrument to use are indicated in Table 26.1. illness.
Various screening instruments and structured
diagnostic interviews currently used to determine The effects of social and family arrangements
the psychiatric status of child and adult popula- Social support and network structures may help a
tions in developing countries are listed in tables person function longer and more productively
26.2 and 26.3. The tables also indicate the primary than if that person had been socially isolated.
application for which the instrument was de- Conversely, very stressful social or family circum-
signed, the general focus and type of instrument stances may aggravate psychiatric disorders.
(that is, type of population to be studied and There is a large literature on the specific ways in
whether the instrument is to be used for screen- which social networks and support structures
ing or a second-stage assessment), and other operate, and how they affect the course of dis-
issues which determine choice of instrument. For orders. The literature on what is known as ‘ex-
a more detailed discussion of commonly used pressed emotion’ (EE) has shown that people
psychiatric epidemiological instruments, please with psychiatric problems are more likely to re-
refer to Bech et al (1993), Tansella, de Girolamo, lapse in situations where members of their family
Sartorius (1992), and the WHO Division of Mental are Over-involved with them and over-critical of
Health (1993). them.
Various researchers have stressed that psy- Part of the challenge psychiatric epidemi-
chiatric epidemiology needs to go beyond psy- ologists face when trying to take account of the
chiatric symptomatology in case identification. role social and family arrangements play in affect-
Marsella (1992), for example, recommends that ing the course of illness, lies in separating the ill-
case definition should also include psychosocial ness itself from the factors which may affect it.
problems such as family violence, divorce, and When an association is demonstrated between
racism. He argues that it may be more helpful to social isolation and relapse, for example, it can be
examine various clinical parameters of mental because people isolate themselves as a result of
disorders and social pathologies than to focus on relapse, or because the social isolation has trig-
diagnoses. In particular, he stresses that greater gered the relapse. We can begin to speak of isola-
attention should be paid to studying symptom tion as a risk factor for relapse only once we are
parameters (for example, frequency, severity, and clear that the isolation is not in fact part of the ill-
duration of symptoms and problem behaviours), ness. One way this can be done methodologically
clinical parameters (for example, onset, course, is to demonstrate that the isolation predates any
and outcome), and psychosocial parameters changes in the person’s behaviour that suggest
232
233
psychiatric epidemiology
Aayeiyoksdoinen ulJUBWSSassy Jed!UI|D 10} ajnNDeUDS = NVOS ‘MaIAId}U| O1SOUZeIG jeuoneUsaju| ayisOdwoy = |qIO
Jes = OUS
‘ginPeyoS MeIALe}Uj ONSOUSEIG = SIC ‘a|NPEYoS MaIAE}U jed!UI|D = S| ‘UOeUIWeXA aJe}S JUeSeld = 4Sd ‘QNBUUONSENO UINCSH [219USH = OHH ‘asleuuonsend suiuoday
‘\sISojoyoAsd Jo IseIYOASd e Aq apelu BUleg SeSOUBEIp YIM ‘SIOMOIMEIU! Ae| pauies posn saey seipnisx
aes
Bisqpjop
OL6T
T86T
SUIgOY
90uU9190jO1
wiyywosyje
O86T
O66T 6867
Jajyndwog
18 vl6T e189 B8UIM| Z?LET B19qgPI0D ‘je 12 Bulpsey eu
pais}siulwpe
SuIgoYy
‘/2
‘/2
‘2
38
SUIM
12
je
12
SOA
SOA
SOX
SOA
‘SI@MAINOIU!
ON
SIBMIIN9IU!
SIOMIIN9\U!
SIOMIIMOIUI
~SUBIOIUI|D
(UOISIOApoures|Ayjensn
suelojulo
—jjes
posiape
suelojuljo
JEOIUI]D
poules|
{4s
peules|
pouies|
pouresp
poulel
poures
26
}aAIT
suuejqoid
Ae}
Ae}
Ae}
Ae}
Jo
|
onoyodsd
|
Ja}SIUIWIpe
|
spuadap)
suosJad
|-OE/"SUIW
yyM
é0Z
“xoIddy
IST
OZ—-OT
10}
O6-09
OZ-OT
09
06-09
uo
“SUILU
“SUI
OUI]
GT—G
“SUI
“SUIW
“SUIW
“SUI
G7
0}
OF
asgejs |
agejs puz Ajurew agels puz Ajurew aes puoses
pug
Suluaesos SUIUdBINS ageis puz | Ajulewi SUIUddIOS SUIUDBIOS _ /sulusead9g
ee
SUUI}OHI|
Aynbul
‘yUOU 1SP|
UsNO
SyoeM
yjuoW
owed
‘gposida SUIOsI| sy}UOWW Yeem Bulped
OUT
seg]
sed
Ma}
40}
Juasalg | —Syeem Z UIUUM | XIS sed/JUaUND -aid/USIND
|
suonsenb (09'0€'8Z‘eT Sule}
aAISUBPY wo dwhs 838z O@AISUS}XJ eAisualxXy OvT “3°9) SOURA GZ‘vz% jo s9qUINN
S$souj|l siapJosip SJapsJOSIp
onoyoksd | Ayyeuosied ysow | Ayjeuosiad ysow SJOPJOSIP SJapsosip SJOpsOSIp
jeuoljoun "joxe SlapsJosip | *|oxe ‘SJapJOSIP ouyelyoAsd JEJUBW BJOAVS SJapJOSsip onoyoAsd-uou
Jo onOINEN | OUJeIYOASC JOfe- | dUye!YyOASd JoleW\ jo asuey 0} a}eu9poW onoyodsd-uon pue onoyoksd snoo4
suonejndod suonejndod suonejndod saujunos
jeojuljo-uoU jeoluljo suonejndod | sojoejd jeuoues suorjeindod suonejndod suidojaAop uoljeojdde
pue jeolul|o pue jeJouas Aywunuiwog | pue Aylunwuiwog jeoiuijg | sonoeid jesouey} ul ssulnes OHd Auewiid
STC CRMC EPC PPCPt de: yay LE
e
| (| dee) om:
M PELe MSP
234
PCC COEUR UES) an
Aiewiid uojjeojdde
OHd SSUINSS
Ul OHd SSUIIOS
Ul jeiaue5
pue Jedjuljo jeiauey suoneindod jeisuey suoneindod
sUIdO}esAepSOLJUNOD SuIdo|aAepS9JUNOD suonejindod
OU]
OF 4d9}S/UJWIPe
|qd OT—-G “SUIW “Sul O9-Gr “SUI Ja8u0]) -O OQ-GY “SUI *O QOG—-O9 “SUIW
4! PIIYO
SI} (4OIS -d Q6-09 “SUI :d Q6-O9 “SUI
epidemiology applied to content areas
oul) ewe
10) Auinbul
| queuing ‘sed Juadino |ea one Sed 9/JedA sujuoW ysed/juaung
JaART
Jo JEdIUI[D pautes
Ae| SUBIDIUND paurex/sueiiu
Ae] | pouies,
Ae| SUBIDIUND
jIMs paaaiyoe SIOMAIAO]UI SIOMAIAJOU! SJOMAIAO1UI
JOMOIAIOU|
suo}}eAlasgo
wWy}OMe
483ndwWog
LL6L O86T
qualed = dq ‘pilyo = Ox
‘UBIPIIUD JO} BINPAYOS MaIM9}U] ONSOUBEIG = OSIG ‘UaP|IYO 10) Ma|AJa}U ONSOUBeI = VOI ‘USIPIIUD 10} MalMeyU] dn-mo}jo4 = O[4 {UUpIUD osleUUONSENd YoRasey = DY
UOISI9A |BOIBO|OlWapidy-elussydoziyoS pue Ssispsosiq eANIeYy J0J o[NpeUuds-alpply = J-SavS—y
26 psychiatric epidemiology
the possibility of relapse (that is, temporal rela- pressed. Recall bias can, however, be fully dealt
tionship between outcome and exposure). The with in psychiatric epidemiology, as in other
insidious nature of onset and relapse in mental branches of epidemiology, by the design of
illness, and hence the difficulty of pinpointing prospective studies.
onset of disease or of a particular phase in a The fact that different people can experience
disease, requires very careful recording of what are apparently the same external events in
changes in behaviour, as well as an appreciation very different ways leads the enquiry in two diver-
of the ways in which person and context are inter- gent directions. The first of these concerns indi-
twined. vidual vulnerability; the second concerns the
influence of broader cultural and social arrange-
Stressful life events ments on the ways in which communities under-
Intuitively, one might assume that unpleasant stand, manage, and respond to circumstances.
experiences or events would contribute to the The following two sections will deal with these
onset or exacerbation of psychiatric disorder, and questions.
research bears out this view.
Early work by Holmes and Rahe (1967) in the Individual vulnerability
1960s, however, demonstrated that it may not be A person may be vulnerable to disorder on the
the quality (positive or negative) of events which basis of a range of factors intrinsic to that person,
contributes to the course of disorder, but rather for example, the way they were brought up, phys-
the amount of change or readjustment which ical health, stressful events, and genetic factors
events cause in people’s lives. Thus, in the many (Kaplan, Sadock 1988). At first it may appear con-
scales that have been developed to measure life ceptually difficult to separate these factors from
events and life change, there are predictable the disorder itself, and there may also be some
items, such as the death of a spouse and deten- circularity in reasoning.
tion in jail, as well as items such as marriage, birth For example, if it can be shown that people who
of a child, and so on. There is ongoing debate are habitually suspicious are more prone than
about which sorts of events can predict the course others to a psychiatric disorder typified by exag-
of illness. In the case of depression, for example, gerated suspicion, have we demonstrated any-
events involving loss may be more important thing at all? In spite of these difficulties, it can be
than in other disorders. demonstrated empirically that there is no inevit-
The same event may have different meanings able link between any given personality style and
and implications for different people (Brown, psychiatric disorder.
Harris 1978). The view that individuals’ personal The two dominant contemporary approaches
experience or appraisal of events is more crucial to classification in psychiatry (the Diagnostic and
than the events themselves has considerable Statistical Manual of Mental Disorders, fourth
methodological implications. If a researcher is inte- edition, of the American Psychiatric Association,
rested only in events, these can be assessed fairly and the International Classification of Diseases,
quickly using a checklist, but if appraisal is of inter- tenth edition, of the WHO) have attempted to
est, then a detailed and relatively individualized in- deal conceptually with separating clinical dis-
terview format has to be developed. order and other factors intrinsic to the affected
One important methodological advantage of a person by classifying the clinical disorder on one
reasonably in-depth interview is that, apart from axis and personality disorders (and mental han-
providing access to the respondent's perceptions dicap) on other axes.
of events, it also helps clarify whether stressful The separate focus on personality disorder and
events occurred before or after the onset or on personality style (although not uncontrover-
exacerbation of symptoms. As we have seen sial) is useful both in psychiatric epidemiology
before, this is a key question for developing and more especially in studies of chronic diseases
models of cause of disorder. A further advantage of lifestyle, where issues of behaviourial change
of careful questioning in this area is that it facilit- and adherence to treatment are vital.
ates an exploration of recall bias in the reporting Often the concern in such studies is not with
of events. A depressed person, for example, may psychopathology as such, but with factors
recall many unpleasant experiences which would which may contribute to disease and affect re-
not be remembered by a person who is not de- covery.
235
e epidemiology applied to content areas
Broader social, economic, and cultural whether South Africans as a whole suffered from
influences higher rates of anxiety than people elsewhere,
From an intuitive point of view, it seems logical to and an attempt was made to explain differences
suppose that rates of psychiatric disorder would in rates with reference to apartheid (Swartz 1987).
increase in proportion to actual physical hardship The problem with this type of analysis is that it
experienced by people. Psychiatric epidemi- assumes a single causal pathway, and ignores a
ologists have explored the relationships between whole host of other possible interpretations of the
resource allocation to groups and populations, data — including the possibility that an inter-
and psychopathology. Though the evidence is not nationally used scale may work differently in
uniform, it does appear that there are relation- South Africa for cultural reasons.
ships between patterns of pathology and broad _In a rapidly changing world where diversity is
socio-economic factors (Blackburn 1991; Dawes, experienced in every country, there has been con-
Donald 1994). A major challenge in such research siderable interest in the effect of cultural factors
is to develop models capable of explaining the on psychopathology. Some early work in this field
mechanisms whereby broad social factors impact was clearly racist — for example, it was once
on psychopathology. thought that African people were incapable of
The psychiatric epidemiology of apartheid — feeling depressed, as they lacked the necessary in-
specifically the relationship between apartheid sight for reflection and concern about their lives
and mental health — can be viewed as one exam- (Littlewood, Lipsedge 1989). This flies in the face
ple of concern about broad social factors. Surpris- of practically all contemporary epidemiological
ingly little epidemiological work has systematic- evidence, which shows high rates of depression in
ally addressed the psychopathological effects of Africa (Littlewood, Lipsedge 1989).
apartheid (Swartz 1987). The challenges of re- The extent to which cultural arrangements
search in this area at a time of national recon- affect mental disorder, and how, is an area receiv-
struction in South Africa are greater than ever ing increasing attention. There is considerable
before. debate, for example, on whether schizophrenia
Other broad social factors that may influence has a better outcome in the Third World than
the mental health of populations can be seen as elsewhere, as early epidemiological evidence
intrinsically undesirable (such as war and civil seemed to suggest. These debates are part of a far
strife). Others can be viewed as broad and com- larger challenge to psychiatric epidemiology as a
plex social movements whose effects need to be whole, and this challenge is dealt with in the next
explored. Urbanization, for example, is a global section.
phenomenon leading to massive changes in
social networks and human relationships, and the Issues of validity in psychiatric
effects of urbanization on mental health are epidemiological research
beginning to be explored. Threats to the validity (accuracy) of epidemi-
Part of the problem in assessing the effects of ological investigations arise from failure to assess
apartheid and urbanization on mental health is the validity of the instruments, to calibrate them,
that it is very difficult to tease out particular ex- and to assess their reliability (repeatability). Psy-
posures for particular persons and to link these to chiatric epidemiologists have also faced diffi-
specific outcomes. There is also such a wide range culties in finding an appropriate way to define
of experience of broad social phenomena such as and measure. disorder, particularly in non-
apartheid, that it is often difficult to define pre- Western settings. These difficulties demonstrate
cisely what is meant by the term. It may be more how it is possible for a gap to exist between what
worthwhile from a methodological point of view is the ‘truth’, and what is measured.
to focus more narrowly on a phenomenon which In general, the validity of,instruments may be
can be measured precisely, and which can either enhanced by conducting medical examinations
serve as a proxy for or be an example of a broader to check whether physical disorders are causing
social phenomenon. It is of course possible to the mental disturbance. The use of multiple
compare the mental health of people living under sources of data, to double check information or to
certain social conditions with those living under fillin gaps, also improves the quality of the data.
other social conditions through ecological de- The validity of screening instruments can also
signs, but such designs are weak. For example, be enhanced by following up a certain percentage
some decades ago there was some interest in of screen negatives, and modifying morbidity es-
236
26 psychiatric epidemiology
timates on the basis of the number of screen nega- (Gillis, Elk, Ben-Arie, Teggin 1982).
tives who turn out to be cases. Reliability is as A number of strategies have been proposed for
important as validity, and it is crucial that inter- handling these difficulties. Before undertaking an
observer reliability be established (Swanson- epidemiological study, it is essential that re-
Fisher, Martin 1981). searchers learn about abnormal behaviour pat-
The validity of instruments developed in terns and the socio-cultural context within which
Europe and North America may be compromised such behaviour occurs (Draguns 1984). This
by application in a new context. Researchers fre- could be achieved, for example, by conducting
quently rely on cut-offs which have not been anthropological studies that try to understand the
adjusted for the population under study. This can cultural context of and the reasons for specific
result in inaccurate estimates of psychiatric activities. Based upon such an anthropological
morbidity. investigation, it might be useful to supplement
Besides the general issues of validity, there are ingtruments by adding questions about somatic
at least two major threats to the validity of psychi- expressions of mental illness or specific cultural
atric epidemiological instruments when applied phenomena. These questions can be analysed
in developing countries. The first arises from cul- separately. Other strategies include using person-
tural differences, and the second from the lang- nel familiar with subtle cultural nuances as inter-
uage translation process. viewers or interpreters, asking subjects to explain
Epidemiology is in itself a cultural enterprise, some of their responses, rating a symptom as
embodying Western notions of disease and basic negative whenever there is any doubt about it
Western assumptions about ways that people being positive, conducting physical examinations
function — the split between body and mind, and to make allowances for the influence of physical
between emotions and intellect, are just two exam- health on mental status, and modifying the con-
ples. Psychiatric epidemiology is thus a Western tent of certain questions to make them more rele-
product which tends to take as a given Western vant to interviewees’ everyday experience.
ways of seeing the world, Western classification Translation of psychological tests into African
systems, and Western healing systems. Most languages can also affect the validity of epidemi-
people in the world, however, do not conceptual- ological studies. This occurs primarily because
ize or experience their illness in this way. languages differ in the way they express inner dis-
This has important consequences, the most tress such as depression and anxiety (Swartz,
serious being how to judge whether behavi- Ben-Arie, Teggin 1985). The first step in reducing
our is indeed pathological or not. When measur- the threat to validity cdused by the translation
ing psychiatric morbidity, problems arise from process is to translate the instrument in such a
the confusion between culturally distinctive way that the concept behind each item is ade-
behaviour and psychopathological manifesta- quately conveyed, incorporating local idioms of
tions. For example, in some African settings appa- distress and, if necessary, asking a series of sep-
rent ‘hallucinations’ and ‘delusions’ may occur arate questions rather than a single question. The
even among psychiatrically healthy people and second step is to get someone to do an independ-
may result from encounters with ancestors whose ent translation back into the original language.
influence is believed to be tangible (Gillis, Ben- The third step is to straighten out the differences
Arie, Elk, Teggin 1982). between the original and the back-translation.
A second and related problem stems from the Another set of strategies can be applied during
way that the specific content of a psychiatric test the interview. Where interpreters are used, they
may culturally influence responses. For example, should preferably be from a similar cultural back-
questions about experiences such as watching TV ground to the respondents. They should be fluent
or reading newspapers may result in biased re- in the interviewee’s language, have mental health
sponses, as respondents may not have access to expertise, and be skilled in the use of the instru-
these objects. ment in question. The interview process should be
Thirdly, the format of psychiatric tests may constantly monitored by evaluating the relevance
influence responses. For example, the interrog- of answers and observing interviewee’s non-verbal
ative style of most standardized tests may be behaviour. It may even be necessary to ask inter-
foreign to the expectations and practice of many viewees to paraphrase some questions to deter-
Africans, and verbal and non-verbal responses mine how they understand them, or to rephrase
may not reflect the true state of the interviewee ‘difficult’ questions using standard alternatives.
Basil
e@ epidemiology applied to content areas
It will therefore be helpful to conduct pilot only on major recognized psychiatric illnesses
studies comparing the results with assessments (such as schizophrenia) and major affective dis-
made by clinicians, comparing the distribution of orders (disorders of mood). These illnesses are
scores between normal and patient samples, and important in the public health field, as they are
to pre-test the translation. The validity of an distressing, and may be chronic and place a
instrument in a cross-cultural setting may also be burden of care on family and community mem-
assessed by conducting formal statistical assess- bers and the health services. However, psychiatric
ment (for example, factor analysis) comparing epidemiology should and does go beyond the
scores obtained in a cross-cultural setting and study of the major psychiatric conditions. It is
studies elsewhere. often stated that psychological factors are
responsible for much of the work that has to be
done at primary health care level. The implication
Psychiatric epidemiology in the (spoken or unspoken) is that these factors waste
primary health care context primary health care resources. The first task of an
approach to psychiatric epidemiology at this level
General approach is to sort out a series of questions about psycho-
Following the Alma Ata declaration, there has logical factors in primary health care, and to
been increasing interest and emphasis on inte- decide which of these questions is relevant to the
grating mental health services into primary setting being studied.
health care (PHC). This has important implica- A central question which has been asked in psy-
tions for psychiatric epidemiology. What does it chiatric epidemiology concerns the paths people
mean to integrate mental health services into pri- follow on their journey to becoming known psy-
mary health care? At one important level, it chiatric patients. Table 26.4, reproduced from
means treating obvious or known cases of mental Goldberg and Huxley (1992), shows that in Britain
disorder as far as possible at the primary level and not all people who suffer from mental disorder
in an integrated way. At another, though, it means (defined as having an episode lasting at least two
rethinking the ways in which we understand weeks in a calendar year) consult primary care
health and illness behaviour (that is, the ways in services, and that there is a progressive filtering
which people interpret, respond to, and act on by which people become users of mental illness
their experiences of ill health) and reorientating services and, even more rarely, psychiatric in-
PHC programmes to accommodate these factors. patients.
Anyone with clinical experience in any area of
health care will be familiar with patients whose Of central importance in Goldberg and Huxley’s
behaviour does not easily fit with the health care work are two observations.
system. These may include people who come for Not everyone suffering from mental disorder
help when no physical problem can be found, comes to the attention of mental illness ser-
people who experience pain and discomfort out vices, or even to the attention of the health ser-
of proportion to their pathophysiology, people vices in general.
who do not take medication as prescribed, people The process whereby one becomes a patient in
who continue with lifestyles which are bad for mental illness services depends only partly on
their health, people who abuse their bodies with the symptoms from which one suffers.
cigarettes, alcohol, and addictive drugs, parents
who seem to have their children’s best interests at As Goldberg and Huxley note, the filters between
heart, but who fail to do what is necessary to keep different levels in their model depend partly on
the children healthy, and so on. The list is endless. how people who experience symptoms under-
The AIDS epidemic has convinced many sceptical stand and act on them (illness behaviour), clini-
health care professionals that serious attention cian competence (ability to detect disorder), and
must be given to psychosocial factors as part of patterns of referral and resources available (fefer-
any primary health care programme. ral to mental illness services and admission to
Psychiatric epidemiology has much to offer any psychiatric hospitals).
health worker or researcher in the quest to under- - In order to tease out the different contributors
stand and influence patterns of health and illness. of paths to psychiatric care and status as a psychi-
Psychiatric epidemiology is, however, narrowly atric patient, we need to be able to answer ques-
understood and is sometimes thought to focus tions both about what constitutes a psychiatric
238
26 psychiatric epidemiology
where (especially in Cape Town) at any given time range of ways, both quantitatively and qualit-
to assess directly, for example, level of care atively. The issues of validity so central to assess-
received. ment in psychiatric epidemiology were especially
The various studies in Mamre allowed for a clear in the cross-sectional, community-based
spectrum of questions to be addressed in a wide survey, which is described briefly.
240
26 psychiatric epidemiology
Randomly selected adults were assessed methods in South Africa (Rumble, Swartz, Parry,
using the Self-Reporting Questionnaire (SRQ) as Zwarenstein, [in press]).
a first-stage screen, and the Present State Exam-
ination (PSE-9) as the second-stage criterion.
Conclusion '
Demographic, health care utilization, and sub-
stance abuse data were also collected. Using the Psychiatric epidemiology in South Africa (and in
PSE-9 CATEGO Index of Definition of 5, the the developing world generally) is at an exciting
weighted prevalence of psychiatric morbidity stage. Methodologically, designs that have been
was 27,1%, the majority of cases being given a used up to this point have been rather unsophist-
tentative diagnosis of depressive or anxiety dis- icated. Part of the challenge now is to move to
order. Even though the ‘CATEGO’ algorithm was more sophisticated designs, including those
initially chosen as a standard for assessing which assess the efficacy of interventions. Metho-
whether a person met the criteria for classifica- dologically, much remains to be done in South
tion as a case of psychiatric morbidity, it Africa in terms of developing psychiatric epi-
became clear that this was not appropriate, as a demiology in the direction of analytical and inter-
substantial proportion of cases were identified vention studies. Questions of validity are also cru-
by CATEGO as having paranoid disorders. It cial to consider.
emerged on careful re-analysis of the data that Concerned as it is with the difficult-to-measure
what may be culturally appropriate ways of ex- but crucial realities of people’s daily lives, psychi-
plaining disorder in Mamre (such as believing atric epidemiology has been forced to grapple
that it was caused by the bad deeds of others) with how one makes as accurate an assessment of
was being rated as part of a paranoid syndrome those lives as possible. Many of the issues which
on the PSE. It was therefore necessary to rein- face psychiatric epidemiology are therefore rel-
terpret this type of phenomenon in the light of evant to epidemiology in general. One of the most
knowledge of local cultural beliefs. The SRQ’s important of these is the need for epidemiology to
weighted sensitivity and specificity were 49% be informed by local practices.
and 82% respectively. Sensitivity was therefore Modern psychiatric epidemiology is taking in-
very low, and further research on criterion va- creasing account of the reality that there are
lidity of the SRQ in South Africa is needed. Over- many perspectives on disorder and its treatment.
all, the SRQ correctly identified 67% of cases Its methods are being combined with qualitative
and non-cases. No demographic variables pre- and anthropological approaches, as we have
dicted psychiatric morbidity, but there was an seen. It is also important, however, to recognize
indirect link between morbidity and primary the extent to which psychiatric epidemiology’s
health care utilization. In summary, the study engagement with a diverse world has helped
provides useful information about prevalence, focus our attention on the nature of the epidemi-
and important lines of enquiry regarding the ological enterprise itself.
refinement of psychiatric epidemiological
241
@ epidemiology applied to content areas
242
2i Anthropometric
studies
the NCHS-tabulated 3rd percentile of weight for above the expected on the NCHS tables. Gross
age is 14 kg; that is, in the reference population, childhood obesity is associated with orthopaedic
only 3% of girls aged five weighed 14 kg or less. If problems and overweight with psychological prob-
we examine a girl of five and find that she weighs lems. Most fat babies lose their fat in childhood, but
13,5 kg, we would say that she falls ‘below the 3rd obese children tend to become obese adults.
percentile weight for age’. By referring to the
reference population, we can determine which Interpretation
children or what proportion of children are The three indicators each reflect a different
undernourished. Different definitions of under- aspect of nutrition. A child who is low on height
nutrition exist; often the 3rd percentile or the 5th for age is stunted and has a history of undernutri-
percentile are used as the cut-off level below tion (that is, chronic undernutrition). If this child
which undernutrition is diagnosed. More re- has a normal weight for height, it is currently
cently, the Nutrition Unit of the WHO has recom- being normally fed. Weight for height reflects cur-
mended the use of the median (the 50th per- rent nutritional status. A child could have a low
centile) minus 2 x standard deviation as the cut- weight for age if it is stunted (has low height for
off. (The values given by this cutpoint are similar age) or thin (has low weight for height) or both.
to those obtained when using the 3rd percentile.)
In the WHO publication Measuring Change in Other measures of nutritional status
Nutritional Status (1983), various percentiles, as Mid-upper arm circumference can be used as an
well as the median minus selected multiples of indicator of wasting. Mild to moderate cases of
the standard deviation, are tabulated for the malnutrition will not be detected by this
NCHS reference population. A researcher can measurement, but under famine conditions it is a
thus manually look up a cut-off point and classify good measure to use. In some research studies,
an individual child on that basis. The data has skinfold thicknesses are also measured. (Head
also been computerized, and a computer pro- circumference is generally used to measure brain
gramme, Epi Info, can be used to categorize chil- growth, not nutritional status.)
dren or summarize anthropometric data. A
child’s growth can thus be monitored by plotting Assessing the nutritional status of
his or her weight at consecutive ages, for example.
This is commonly done at well-baby clinics, adults
where weights are plotted on the child’s Road to To be able to define obesity accurately, one needs
Health chart (Figure 27.1 shows the chart for the to be able to measure body fat. Body fat can be
Birth-to-ten project). In this way, any faltering in measured by the seldom-used technique of
weight can be detected and prompt action taken. underwater weighing, which is costly and cum-
Not only children who fall below the 3rd _per- bersome. More easily measurable characteristics
centile, but any child who shows a decline in therefore have to be used as indicators of body
weight for age, is seen as possibly being at risk. fat. Many studies have been conducted to assess
Another measure which is sometimes used is the value of skinfold thicknesses and various
that of the percentage of expected value for a weight-height relationships as predictors of body
given indicator. For a given age or sex, the median fat. Skinfold measurements (biceps, triceps, sub-
of the indicator in the reference population is scapular, and suprailiac) have generally been
taken as the expected value, and the observed found to correlate well with body fat measured by
value can thus be expressed as a percentage of underwater weighing, and formulae exist for con-
expected value. The five-year-old girl has an verting skinfold thicknesses to an estimate for
expected weight for age of 17,7 kg (that is, the body fat. For accurate determinations of skinfold
median weight for age for a five-year-old girl is measurements, researchers need to use standard-
17,7 kg), so her weight of 13,5 kg is 76,3% of ized callipers, and to be well-trained in the
expected. In this context, 80% of expected is often methodology for these measurements, as meas-
used as the cut-off level below which undernutri- urement variation is often large. Age- and sex-
tion is diagnosed. specific reference data for skinfold thicknesses is
Obesity in children receives less attention than available from the Health and Nutritional Exam-
undernutrition because of uncertainties about the ination Survey (HANES) of the United States.
long-term dangers of childhood obesity. Obesity The indicator most often used to assess the
can be defined as a weight for height which is 20% nutritional status of adults, however, is the Body
244
245
anthropometric studies
winyepayoogay,
Yul
ayep
apueew syjuow
BGujpaoauem
I ueA ajseyiy!SSe|y (Spuom p - ¢) Sedua}UeS
UONLINWJEPUN Jo UONeOISSeID (aps00M p - ) aUUIS
yates FETC TERET
= HP = ee
oe 0 hsRe =
ea ee
27
POO
Peete De te ete | erate?
4
pe pe A ee
Pee a bk |)see aT. Gee Peter otot
ee = ee wi | Le Boe eR 2 Si ale!
eit |eee ba dole
rt [\:
aH
OL
beadBe
ary:
Lda
AG Se al SE Ee EE el
ro
oh
Zz
ry
|
.
1a
|\|
a
\
=
\
ETT TrineTl
‘SIEKTES EN VOORVALLE
Ye
\\
hI
4
a
|
Nt
eta eee Pete Vel
¢ eile
ela& a!
seer
eee Pee El
eee EH TCO
UU
sgt
BOUIBJWNDIII
oO iim VSEDPEES
soe
SCE MT
(wo)
PRAY
sehiesdes
qi co a ie se Cre tT tte is te Fe2 sere
SUIWEYA
uo4|
pue
oe se | ieult
leipat eis peiblilieet a oS
aJ0q96 eqeq @MNN GN aU xNIqGa6 aIN X HINGED A ONINNYIdIASNISID WO Aqeq MAN GN Pssn JON x Pas) A ONINNWId ATIWVS
MI
nan
TANT
POEM Le mee UL iewe ee
e@ epidemiology applied to content areas
Mass Index (BMI), also known as the Quetelet age): 44,3% of the boys and 44,1% of the girls.
index. To be able to calculate this index, we need However, less than 4% of the boys and girls
to determine a subject’s height and weight (once showed signs of acute malnutrition (fell below
again ensuring that this is done in a standardized third percentile of weight for height). It was found
way). that significantly more children from informal
BML is defined as: weight (in kg) dwellings had chronic malnutrition than children
height (in m) x height (in m) from formal dwellings: 53,2% compared to 39,5%.
BMI values can be categorized into undernour- To determine the reliability of the measure-
ished (women with BMI <19, men with BMI <20), ments, a random sample of 45 children were
normal (women with BMI 19-<24, men with BMI measured and weighed again. Poor reliability was
20-<25), overweight (women with BMI 24-<30, found, possibly due to the difficulty of measuring
men with BMI 25-<30) and obese (BMI 30 or more children aged 12-23 months in informal settle-
for both sexes). ments.
The distribution of fat over the body is also Weight for age results were compared with in-
important when assessing the nutritional status formation routinely documented on master cards
of adults. Individuals can have a gynoid shape (fat at the Alexandra Health Care Centre well-baby
is localized in the lower part of the body; pear- clinic from June 1990 to December 1990. This
shaped) or an android shape (fat is localized in source indicated that 5,5% of children were below
the upper part of the body; apple-shaped). These the third percentile of weight for age, compared
different shapes have been found to be associated to 17,7% of the boys and 9,3% of the girls in the
with different diseases. So, for example, people community survey. The researchers concluded
with an android shape are more at risk for that this may indicate that the health centre is not
diabetes, ischaemic heart disease, and hyperten- reaching those children in greatest need.
sion. Measurements of waist, hip, buttocks, and Recommendations made for nutrition pro-
thigh circumference can be used to determine gramme development include that an evaluation
which shape an individual has. should be made of who attends the health ser-
vices, and how to attract those in greatest need.
Conclusion Furthermore, health workers should be trained to
measure anthropometry accurately.
Nutritional status is both an important risk factor
and health outcome to measure and monitor. Reference:
Strict guidelines on how measurements are made Coetzee DJ, Ferrinho P. ‘Nutritional status of Alexandra
and interpreted have te be followed to ensure that township. Clinic-based data and a community survey.’
valid results and conclusions are drawn. South African Medical Journal 1994; 84(7):413-15.
246
27 anthropometric studies
247
28 Assessment of
dietary intake
249
e@ epidemiology applied to content areas
dedicated people qualify as suitable respondents. search methodology indicates that data from at
Because onlya select group of respondents are least 50 respondents is needed to describe the
suitable, random sampling cannot be done and ‘usual’ intake-for a group. Thus subgroups (for
the representativeness of samples is usually example, age or sex groups), have to have at least
limited. 50 participants. More than one 24-hour recall by
The researcher needs to train the respondent the same respondent, for example, for two week-
and give detailed instructions on the weighing days and one weekend day, cancel some of the
and recording of foods and drinks taken, as well disadvantages of the method.
as training on how to deal with combined dishes
such as ‘potjiekos’, samp and beans, pizza, Quantified food frequency
samoosas, and how to manage recording of meals The interviewer has a structured questionnaire,
not eaten at home. Records need to be checked and the respondent recalls how often and in what
often (daily in some studies). quantities the foods listed were eaten over a
specified period of time; for example, the last
Estimated records month, three or six months, or the last year. This
All foods and drinks are to be recorded on record is a long interview which needs an experienced
sheets in terms of household measures (such as interviewer. Visual aids are used to assist in quan-
spoon- or cupfuls), or in dimensions (for exam- tifying intake. In a Western-type mixed diet, this
ple, one meat patty, 8 cm diameter, 1 cm thick). method tends to overestimate intake because the
The researcher needs to train the respondent on wide variety of different foods means that a large
how to quantify food intake. In addition, written number of foods is recorded. Respondents are
instructions are usually part of the record sheets. usually unable to compensate in their reporting
Records are usually kept for seven days, but the for substitution (where some foods replace others
period is determined by the purpose of the study. during a specified time period). The quantified
For example, for representative information on food frequency gives a better indication of usual
nutrients for which intake can vary greatly food intake than the 24-hour recall. The re-
depending on food choice (such as vitamin A or searcher needs insight into the eating habits of
cholesterol), nine days or 28 days of record-keep- the research population, to ensure that often-
ing might be necessary. The estimated record used foods are included in the questionnaire.
method puts a lesser burden on the respondent
than the weighed record method. Food frequency method (qualitative food intake
In some studies, duplicate portions of all foods data)
eaten are to be assembled by the respondent for The interviewer uses a structured questionnaire
chemical assessment of the nutrient content of to fill in the number of times specific food items
the total food intake. Respondents find it difficult were eaten over a specified period of time. The
to dispose of valued food items such as alcoholic quantity of food eaten is not asked. Such inform-
drinks, cake, steak, and slabs of chocolate in such ation can be used to monitor respondents’ dietary
a way! intake in a study (for example, data to prove that
diet changed or remained unchanged during an
Recall of food eaten in past experiment), or to expose differences in the fre-
quency of intake between groups, for example
24-hour recall age, sex, Or experimental groups. Data from this
Instead of writing down dietary intake on record method cannot be used to calculate intake of nu-
sheets, the respondent is interviewed on all food trients.
intake for the previous day, that is, a 24-hour
period. The interviewer needs to be experienced Dietary histories
and must use visual aids to assist the respondent The term ‘dietary history method’ is often used,
in quantifying food intake. These can include sometimes incorrectly. According to interna-
food models, real food, and spoons of varying tional definition, a dietary history consists of a
sizes. combination of a 24-hour recall with a meal pat-
This method relies on the memory of the tern (that is, how many food intake periods per
respondent and intake is usually underestimated. day, and what foods are usually consumed at each
‘Yesterday’s’ intake is not representative of an of these periods) over an extended period of time
individual’s usual intake, and dietary study re- with food quantities specified in household
250
28 assessment of dietary intake
measures, as well as a checklist of foods usually of portions (portion size specified) of foods eaten
consumed. from the five basic food groups (milk, meat and
alternatives, vegetables and fruit, cereals, and
Important aspects of data collection fats) can be calculated. For example, these could
In dietary studies it is important to take note of calculate that children under five are taking only
the following aspects. 215 ml milk daily, compared to the recommenda-
@ Respondents must be trained to keep records. tion of 500 ml milk daily, or that adults are having
Correct reading of the scale and the discipline two portions of vegetables and fruit daily instead
not to estimate a weight are of crucial import- of the recommended minimum of four portions.
ance in the weighed record method. For the es-
timated record method, respondents need to Calculation in terms of nutrients
be taught in how much detail to record and For this, one needs information on food composi-
how quantities should be estimated and tion. This can be obtained by chemical analysis
recorded. (depending on the aim of the study, but not often
@ The training of the interviewers for the recall done, as it is very expensive) or from food com-
methods must be standardized. It is of great position tables (commonly used). Food composi-
importance that quantifying of intake is ‘stan- tion tables usually give the amount of nutrients in
dardized’ if more than one interviewer is used. a specified amount of food, for example, per 100 g
This implies training the fieldworkers together, of food. There are 3 g protein and 120 mg calcium
with food models and real food available to in 100 g milk, for example. The Medical Research
visualize quantities, and to reach mutual Council has compiled Food Composition Tables
definitions, even if these definitions apply only from American, British, and (where available)
to one specific study (for example, the size ofa South African food composition data. In the 1991
‘medium’ apple, or a ‘heaped’ spoonful). version of the Food Composition Tables, only
18% were South African values.
Validity and reproducibility Dietary data can be analysed by hand, but this is
very cumbersome. The quality of the results is at
Several articles have been published reviewing
risk because so many calculations need to be made.
the limitations of the various methods for assess- Dietary data can also be analysed by computer; and
ing food consumption. There is no ‘gold standard’ programmes such as SAS can be used. For comput-
for the validation of dietary study methods. One
erization, dietary intake data needs to be coded.
method may be compared with another method The MRC Food Quantities Manual is very useful for
(relative validity), or a new method may be converting food reported in household measures
validated against an external criterion such as a into grams of food consumed. Two software pack-
biological marker. Repeatability (measuring the ages for personal computers, containing the MRC
same subject under the same conditions) is influ- food composition database, are available. Food-
enced by the day-to-day variation in the subject’s finder is a nutrient analysis programme which can
food intake, as well as by seasonal variation and be used for small scale research, while Foodfundi
other factors. Professional is a nutrient analysis and counselling
programme for the therapeutic consultant. For
Calculation of intake more information on these products, contact the
Dietary data is usually presented either in terms authors.
of food intake or nutrient intake. The exception is
the qualitative food frequency method, where in- Standards for interpretation of
take is not quantified and nutrient intake thus
dietary data
cannot be calculated.
Food intake, expressed in number of portions
Calculation in terms of foods consumed from different food groups, can be
This, in its simplest form, involves counting (by - compared with recommendations for healthy
hand or computer) of the frequency with which eating, to describe adequacy of the diet. Nutrient
specific foods are eaten. Computer programmes data can be compared with recommendations for
have been developed at the National Research energy, protein, vitamin, and mineral intake.
Programme for Nutritional Intervention of the South Africa does not have its own recommenda-
Medical Research Council by which the number tions for these, but uses those of the United States
251
e@ epidemiology applied to content areas
of America (the US Recommended Dietary Allow- Intensive training of the interviewers was
ances [RDA], revised 1989), or the WHO (1985), or spread over five days and conducted by dietitians.
Britain (1991). Cut-off points such as <75%, <67%, Included in the training were general aspects of
<50%, >=100% and >=120% of the standard are interviewing techniques; awareness of the subtle
used to describe the adequacy of nutrient intake. differences in foods (for example, full cream, low
South Africa has its own dietary guidelines for fat, skim milk; butter, brick margarine, tub mar-
the prevention of coronary heart disease: less garine); awareness of food preparation methods
than 30% of energy should come from fat, and and ways of quantifying food intake, including
carbohydrate should contribute at least 55% of the use of food models to describe volume or size
energy. of portions.
The information was coded bya trained coder
Conclusion with knowledge of the scope of the food composi-
tion tables, and punched into a mainframe com-
Several methods are available to assess dietary puter for analysis. The results provided a profile
intake. Data can be described in terms of foods or of the dietary pattern in an urban African com-
nutrients, and standards for evaluation are avail- munity.
able. Dietary studies are not easy to execute or to
analyse; therefore, where evaluation of the diet
References
seems to be an essential part of a study protocol,
it is recommended that: Bourne LT, Langenhoven M L, Steyn K, Jooste P L,
LaubscherJ A, Van der Vyver E. ‘Nutrient intake in the
adietitian, knowledgeable in dietary research urban African population of the Cape Peninsula, South
methodology, is included as part of the re- Africa. The Brisk Study.’ Central African Journal of
search team; Medicine 1993; 39:238-47.
abaseline period (usually at least two weeks), Bourne LT, Langenhoven M L, Steyn K, Jooste PL,
NesamvuniA E, Laubscher]J A. ‘The food and meal pattern
during which respondents follow their usual in the urban African population of the Cape Peninsula,
diet, is included in dietary intervention studies South Africa; the Brisk Study.’ Central African Journal of
in order to provide for the placebo effect; Medicine 1994; 40:140-48.
@ depending on the design of the study, a refer- Bourne LT, Langenhoven M L, Steyn K, Jooste P L,
Laubscher J A, Bourne D E. ‘Nutritional status of 3-6 year-
ence group is included in dietary intervention
old African children in the Cape Peninsula.’ East African
studies to provide for possible influences such Medical Journal 1994; 71:695-702.
as the placebo effect and seasonal variation on
the outcome of the study.
EXAMPLE 28.2
full cream milk = 0006 b Day of the week: Monday = 1; Tuesday = 2; etc.
apple = 7001 c Meal of the day e.g. breakfast = 1; in-between =
2Petc;
Computer coding sheets containing 80 blocks d The last block is used for the card number. If
per line can be used, or a special coding sheet (see more than 80 spaces are required for coding
example below) can be used. one meal, the identification line is repeated on
the next coding sheet and the card number will
CODING SHEET then be 2.
a a a e e b c d e Additional information can be coded, for exam-
4 0 0 2 0 0 6 0 ple, gender, experimental period, type of diet,
6 5 0 8 0 0 1 0 etc.
0 0 0 6 0 2 5 0
7 0 0 1 0 1 5 0 Second to 10th line:
In each line, the first four blocks are used for the
Explanation: food code and the last four blocks for the amount
The first line of the coding sheet can be used for of food eaten.
identification, e.g.: This information can then be computerized for
a First three or more blocks: code number of analysis of the data.
study respondent;
2S
29 Measurements
in dental
epidemiology
cal value describing the relative status of a popu- standard deviation and standard error.
lation on a graduated scale with definite upper The DMFS index is increasingly being used, as it
and lower limits, which is designed to permit and allows for the monitoring and evaluation of trends
facilitate comparison with other population clas- in the distribution and progression of dental caries,
sified by the same criteria and methods.’ and can therefore be used as a tool to evaluate the
quality of care. The index indicates the distribution
of-either caries incidence or treatment on each
Measurement of dental caries tooth and surface in an individual’s mouth. Dental
Dental caries indices planners are currently using the measures of this
The criteria and methodology for the diagnosis of index to plan future needs for oral health care, and
dental caries and the definition of related treat- as a quality assurance measure.
ment needs of individual teeth are well defined
(WHO, 1987). Four indices are commonly used to dinft
measure prevalence, incidence, and severity of
This is an index for primary teeth and is recorded in
a similar way to the DMFT except that it is only used
dental caries. They are:
for children of five years and under. The index can
@ DMFT for permanent teeth
range from zero to amaximum of 20.
@ DMEFS for permanent teeth surfaces
¢@ dmft for primary teeth
¢@ dmfs for primary teeth surfaces.
dmfs
The dmfs measures the status of each tooth surface in
the primary dentition. This score can range from zero
DMFT to 88. Recording of dmfs and dmft from the age of six
The DMFT index measures the status of perma- years and upwards is problematic, as it is difficult to
nent teeth in young people and adults. Each tooth ascertain whether a primary tooth has been exfoliat-
in the mouth is examined using set criteria, and ed or extracted when looking at missing teeth.
teeth are classified as decayed (D), missing (M),
filled (F), or sound. The sum of decayed, missing, Comments on dental caries indices
and filled teeth gives the DMFT score for the indi- These indices record past caries history and treat-
vidual. DMFT scores can thus range from zero to ment received by individuals. Certain difficulties are
32 units. Teeth which are unerupted are not experienced when allocating scores to extracted teeth
scored as missing, but described as unerupted (teeth that are missing) due to caries, as the tooth loss
teeth, and thus not reflected in the score. may be due to periodontal disease. The DMFT index
In epidemiological surveys on a population, each reaches a saturation in older age groups, thus pre-
individual’s DMFT score is obtained and a mean venting registration of further attack. When using the
value with standard deviation or standard error is DMEFS index, the allocation of a score to extracted
calculated for the population. Distribution of teeth is a problem as the tooth may have been at-
scores often deviates from the normal, and the tacked on one surface only, even though its extraction
median, 25th, and 75th percentiles should then be score is four or five surfaces (depending on the tooth
used as summary statistics. type). The other disadvantage of the indices is that
each tooth is scored the same regardless of the severi-
DMFS ty of the lesion. The index merely notes whether a
The DMEFS index measures the status of each tooth tooth or surface of a tooth is decayed, filled, or miss-
surface of permanent dentition. It is computed in ing. The indices do not differentiate between arrested
the same way as the DMFT, except that the unit of caries and active caries, or between small lesions
measurement is the surface of the tooth rather than detected clinically or by radiographs. Furthermore,
the tooth itself. In the adult permanent dentition, a occlusal caries have become difficult to diagnose
total of 144 surfaces is recorded for this index: five clinically because of the impact of fluorides on teeth.
surfaces per tooth, except for the canines and inci- There is recent evidence that the diagnosis of secon-
sors which have only four. Thus, it is possible to have dary caries along the wall lesion in restored teeth is
a DMES score ranging from zero to a maximum of difficult to make. It is only certain when there is sub-
144. Any new lesion or filling can be identified and sequent tooth cavitation of the filling or tooth.
progression recorded. This index is therefore more Caries indices give an indication of past and treat-
sensitive than the DMFT, and is able to reveal small- ment experience. In longitudinal studies, they can be
er changes in caries progression. The increased re- used to measure the rate of caries attack and, in clini-
cording units of this index allow for more variability cal studies and preventive programmes, the efficacy
in the individual and mean scores, thus affecting the of preventive agents (toothpaste, for example).
255
e epidemiology applied to content areas
Gingival Index (Loé 1967) The PDI thus contains a gingivitis index in scores
This index involves numerical scoring of four 1, 2, and 3, and a measure of attachment loss
areas — mesial, distal, facial/buccal, and lingual — independent of gingivitis in scores 4, 5, and 6.
of selected teeth. It is a partial recording system;
six teeth are selected for examination, namely 16, Community Periodontal Index of Treatment
12, 24, 34, 42, and 46. The scores from the four Needs (CPITN)
areas are added together and averaged to give the The WHO and the International Dental Feder-
Gl for the tooth; these are in turn added and ation (FDI) have recommended the use in epide-
divided by the number of teeth measured to miological surveys of a periodontal index which
express a mean for the individual. The score is measures both the level of periodontal disease
attributed according to the following criteria: and the need for treatment. This index is known
0 = normal gingiva; no bleeding on probing as the Community Periodontal Index of Treat-
1 = mild inflammation, slight change in colour, ment Needs (CPITN).
slight alteration of gingival surface, no Three indicators of periodontal status are mea-
bleeding sured in this assessment:
2 = moderate inflammation, erythema, swelling 1 presence or absence of gingival bleeding
3 = severe inflammation, severe erythema and 2 supra- or subgingival calculus
swelling, tendencies towards spontaneous 3 periodontal pockets, subdivided into shallow —
haemorrhage, some ulceration. (4-5 mm) and deep (6+ mm).
260
30 Health
economics
261
e@ epidemiology applied to content areas
Self-interest (producers Consumers may not always seek to maximize utility; those who
seek to maximize profits and require health care may be unwilling or unable to seek it. Thus,
consumers seek to maximize health is sometimes considered a merit good; one which warrants
satisfaction gained from government intervention to ensure that it is not under-consumed.
consuming a good or utility)
Predictability of need and ll-health is often unforseeable, and hence the demand for health
cost care and the related costs are difficult to predict.
Full information regarding The supplier of health care usually holds a monopoly on
the product information, which limits competition and may encourage
monopolistic behaviour.
The ability of the consumer Health workers have greater understanding and access to
to assess fully the benefits information than the patient, and thus are directly involved in the
and opportunity cost of a patient’s decision to consume health care. This confers monopoly
product — consumer powers on the provider and hinders the consumer's ability to make
sovereignty an informed choice of provider. This hinders consumer sovereignty.
- Ahomogeneous product Neither health care nor its suppliers (public, private, NGOs)
are homogeneous.
There are many sellers and The long training period for health professionals and professional
they can easily enter and licensing bodies create barriers to easy entry into the health
leave the health care market care market.
(no barriers to entry)
No substantial fall in unit Several components of health care provision lead to economies
costs with larger-scale pro- of scale and natural monopolies. Cases in which only a few
duction — or no significant suppliers exist (such as hospitals and pharmaceutical
economies of scale — that companies) negate perfect competition.
would encourage monopolies
No spillover effects When health interventions affect society at large, rather than just
(externalities) resulting the individual receiving care, they are considered public goods.
from production or Externalities (spillover effects) are benefits or costs that have
consumption important implications for economic analysis. Being treated for
gonorrhoea, for example, benefits not only the persons involved,
but also decreases their sexual partners’ chances of getting
the disease. If society is better off because it knows that the sick
are being well cared for, this is a spillover effect of health care
(a caring externality).
263
@ epidemiology applied to content areas
interventions are deemed effective when they In general, if resources are not being allocated
result in improved health status. to those with the greatest needs, the efficiency of
In epidemiology, the evaluation of effectiveness the service is reduced. This is an important issue
can involve the measurement of health outputs or to consider in a decision-making or policy con-
outcomes resulting from an environmental text. Thus the situation in South Africa has not
change or a specific health care intervention. only been inequitable, but inefficient. In this con-
Whereas epidemiology compares the outcomes text, an intervention that is technically efficient is
of various interventions, an economic evaluation also equitable. Any attempt to extend services to
of effectiveness compares the cost of these out- those in greatest need will improve the efficiency
comes. The results from various epidemiological of the health care system.
studies (such as randomized control trials and If the goal is equity of access and utilization of
case-control studies) that measure effect are used health services, efficient use of scarce resources
for this type of economic evaluation. For exam- may have to be sacrificed for equity gains. While a
ple, the value of resources put into health care can 100% immunization rate is an equitable goal, a
be compared with indicators of effectiveness such slightly lower rate of 90% may be more efficient. A
as a change in blood pressure or years of life lot of resources would have to be used to achieve
gained. Thus, indicators of effectiveness link the the last 10 percent. Both national governments
results of epidemiological studies to economic and local health policy-makers are faced with
analyses of efficiency. such trade-offs between equity and efficiency.
Note that effectiveness (that is, it works) does not These decisions at the margins are often the focus
automatically mean that an intervention is efficient of health care debate.
(it works and is the best value for money). However,
technical, economic, and allocative efficiency are
contingent on effectiveness.
Economic evaluation
Economic evaluation considers two components
Equity of the production process for any given activity,
When assessing the economic objectives of health namely inputs and outputs. Inputs are resources
care, we must ask to what extent the allocation of such as labour and equipment that are put into
health care is equitable and reflects society’s will. producing goods and services, in this case health
While equity is usually a specified policy for health in care. Outputs are the results of production, while
most countries, it is not a straightforward notion. the process describes the transforming of inputs
Most simply put, it questions whether some mem- into outputs — the final result of the production
bers of society are willing to sacrifice some health process. In economics, the relationship between
gain in exchange fora ‘fairer’ distribution of health these components is analysed within the frame-
gain for all. work of the production function, which repre-
There are two dimensions of equity to consider. sents the relationship between inputs and out-
Horizontal equity means equal treatment of puts. The production function would indicate
equals; this would require that all those with similar that the output of one cured TB patient is de-
health care needs be given equal opportunity to pendent on (or a function of) the interaction be-
access and treatment. For example, in South Africa, tween inputs such as staff, medicines, and equip-
there has been much debate on how to ensure that ment.
in each province everyone has equal access to PHC. The health sector presents special problems for
Vertical equity means unequal treatment for un- economic analysis because it does not produce
equals; it implies equality in service utilization and simple units of output, but several different types
positive discrimination towards those who are less of outputs simultaneously. Health cannot be
willing or able to use health services. Thus, those expressed as a simple variable; thus various
with greater needs (more severe illnesses, inability output measures are expressed as a proxy for
to pay, and so on) would obtain concessional ‘health’. Moreover, a number of different inputs
access. For example, more resources for TB preven- are used in the production of health; these can
tion and control might be directed toward an area often be difficult to identify and quantify. The fol-
with a very high prevalence of TB, or lower-income lowing sections on inputs and outputs will pro-
groups may have their treatment and transport to vide researchers with some useful tools to use in
the services subsidized. economic evaluation.
264
30 health economics
However, the patient often has further costs not reflect their presence, government intervention
directly related to the provision of the good or ser- may be warranted. For example, if fees are set at
vice itself. These indirect costs borne outside the market rates, individuals with gonorrhoea may
health sector and incurred not only by patients, well decide not to go to a STD clinic. Given the
but also by others, may include time lost from implications of this, a good case could be made
work, or the consumption of something else fore- for state subsidization of such services. Some
gone. For example, the process of having one’s commodities or services would otherwise not be
child immunized against measles includes wait- produeéed at all, whereas others would be pro-
ing time (an opportunity cost) and subjective duced on an inappropriate scale. It is important
costs, which are often intangible and difficult to to consider these if we seek to reflect the true cost
measure. The economic value of these subjective or benefit of an intervention.
costs (pain, changes in normal daily activity,
mobility, apprehension, anxiety, and decreased Measures of cost
social performance) are included in the category To ascertain the unit cost or average cost of an
of indirect costs. The section below on Quality immunization service, for example, one must
Adjusted Life Years (QALYs) looks at these issues divide the total cost by the number of immuniza-
in detail. tions done. For a given number of immuniza-
Table 30.3 gives an example of a breakdown of tions, this establishes the average or mean cost
direct and indirect costs. In this case, it shows the per immunization.
direct costs (hospital, TB, drugs, outpatients) of The marginal cost of a service or good is the
HIV/AIDS in Lesotho from 1994-8. Indirect costs cost of producing (or providing) one extra unit of
are expressed as the value of productive years of that good or service. One does net use fixed costs
life lost as a result of AIDS morbidity and mortal- (for example, buildings) in determining the mar-
ity. ginal cost, but looks exclusively at changes in vari-
. The social cost or benefit of any one interven- able costs. The marginal cost of extending im-
tion is the sum of the cost or benefit to the indi- munization coverage to one more person may be
vidual and any additional cost or benefit to small (if that person lives on a route that is already
society. Since these additional benefits may be covered by the service), or large (if that individual
quite extensive, and market mechanisms do not lives some distance away from the normal route
1994 |2545 843 [4186546 |87616 |294 365 7114369 |6240676 | 24441810 | 30682486
1996 |7 703489 |7 525332 |268717 |630673 | 16128211 | 12 311 611 | 75875013 | 88186624
1997 14 603 169 |9 255 230 |409606 |844902 | 22112907 | 15 794 932 | 115 718 702 : 131 513 634
1998 | 16 263 122 |10 891 143/580 967 |1078870 28 814 102 | 19 209 401 | 164 094 254 | 183 303 655
266
30 ~=health economics
taken). This cost is an important one to consider This is particularly important ifan economic
when assessing whether to expand or contract a evaluation that compares different treatment
service. options is to be carried out. Alternatives can be
It is often difficult to calculate the marginal cost identified from the clinical literature, in discus-
unless detailed data is available. The researcher sions with experts in the field, and from present
could calculate the additional cost of extending a providers of a given service. It may be necessary
service to a group of people (for example, an addi- to construct a number of possible alternative
tional 100 patients) rather than calculating the models for a given service. Note that this step
cost of immunizing one more person. This is may be unnecessary if one is simply costing an
called incremental cost. Calculating the cost of intervention and a formal economic evaluation
extending an immunization service to a rural is not to be carried out.
village not previously covered will give one the
Assemble data and calculate/estimate costs
incremental cost of providing this service. The
involved
incremental unit cost (incremental cost divided
by village population) can be used as an altern- Sources for cost data may include accounts,
ative to the marginal cost. budgets, patient folders, management inform-
ation, staff employment records. In some cases,
Acosting framework: tools and tips for economic only total cost data are available and it may be
evaluation necessary to estimate costs based on the best
In undertaking a formal costing study of a given information available (staffinterviews might be
service, all costs must be taken into considera- useful). A way of minimizing imprecision is to
tion. There are a number of accessible texts that discount for future costs, and to conduct sensi-
address the process of costing (for example, tivity analyses (see pp. 270, 272).
Creese, Parker [n.d.]), and only a brief overview
@ Exclude costs
will be given here.
Some costs (for example, externalities) may be
Identify all elements involved in the interven-
irrelevant to the analysis performed, or too
tion being evaluated
expensive or time-consuming to collectina
This is achieved most easily by charting all comprehensive way. The analyst needs to
activities that are part of a given health care decide what to include in the costing on the
delivery process (for example, a clinic or an basis of its feasibility and likely implications.
immunization service). Costs may need to be For example, he or she: might decide to exclude
broken down into detailed categories: what, for indirect costs when costing a primary health
example, are the instruments needed, as well as care clinic.
the staff, building, equipment and vehicle
Allocate money values to the costs
requirements for immunizing the inhabitants
of a district against measles? What are the direct While charges may be equivalent to costs ina
and indirect costs to the patient? A useful perfect market, this is rarely so in the health
checklist derived from Luce and Elixhauser care sector where cross-subsidization, shifting
(1990) is the following: of costs (by transfer payments), and imperfect
— direct personal costs: overheads, staff market forces skew charges significantly. Bear-
salaries, appliances, medication, transporta- ing this in mind, it is often feasible to use a
tion, rehabilitation, etc; given charge as a proxy for cost, unless it is
— direct non-personal costs: support services, evident that the error produced is substantial. A
research and development, health education, more accurate alternative consists of calcu-
surveillance and control, etc; lating total cost by adding together the costs of
— indirect costs (wages and time): lost all components in the production process. This
productivity, income and leisure time, home is particularly relevant in studies of the public
caregivers, childcare, etc; sector where fees are unequal to costs.
— indirect costs (intangible): psychosocial costs, @ Beware of double counting
pain, changes in daily living patterns, etc.
A common error made in costing exercises is
@ Identify the possible alternatives to the inter- counting the same cost twice under different
vention being evaluated headings. For example, the cost of medication
267
e@ epidemiology applied to content areas
may have been entered both as a user charge employed elsewhere in the health sector. Further-
(to the patient) and in terms of its cost to the more, shadow pricing can also be used to reflect
clinic (procurement, storage, preparation, the social cost of an intervention.
dispensing, administration, monitoring, treat-
ment for adverse effects, etc). This will result in Outputs
an inflated total cost. A similar error can be Output indicators aim to measure changes in
made by allocating the same cost both to direct health status. No indicator is without short-
and indirect costs columns. comings, and all are value-laden and difficult to
measure. In economic evaluations of health care
Discounting interventions, these indicators are weighed
The implementation and impact of a health inter- against the cost of the programme or service.
vention is likely to occur over time. Thus, we are
likely to compare its costs and benefits over a Putting value on human life
number of years. If we must choose between two A common approach used to establish the value
or mote services or’projects, the net benefits of of an intervention places a monetary value on
each should be compared. It is necessary to use human life. Often this is measured in terms of
discounting in such an economic evaluation, in years of life gained. These monetary values can be
order to account for society’s desire for short- used to measure ‘benefits’ in cost-benefit ana-
term results. Discounting deflates the value of lysis. In general, there are three methods com-
money in the future to account for this positive monly used to value life and well-being (Mooney
time preference. As*the net benefits across all 1992):
years are compared in terms of their value at the The implied values approach takes into con-
time of evaluation, discounting then favours sideration the value that the government or soci-
those projects that provide them sooner. The pre- ety attaches to well-being or a life saved. These
sent value of net benefits is determined using a costs can be implied from the marginal cost of a
formula (Luce, Elixhauser 1990) which utilizes the preventative intervention or from the cost of
discount rate available from the Reserve Bank. introducing life-saving high technology.
Discounting should not be perceived as an Measurements are based on existing policies and
adjustment for inflation. value systems. For example, the marginal cost to
the state of expanding neonatal intensive care for
Shadow pricing @ premature infants could be used as a proxy for the
Price often does not reflect economic or oppor- value of an infant’s life.
tunity cost adequately. For example, the price of The human capital approach equates the value
goods purchased from abroad may not reflect of life to an individual’s lifetime productive capa-
their real value if the exchange rate is over-valued. city measured in terms of his or her income earn-
Economic distortions such as tariffs, subsidies, ings. Specifically, this is average discounted
import and export taxes also distort price. Like- labour costs adjusted according to the country’s
wise, the wage rate may not reflect the true value employment rate.
of labour: A limited number of doctors, repres- Finally, the consumer’s willingness to pay
ented by a professional body, may drive the price places a value on the amount she or he is pre-
of their labour above its market value. Conversely, pared to pay to reduce the risk of mortality or
if the state is the sole purchaser (monopsonist) of morbidity. The value of a health intervention is
nurses’ labour, it may keep wages below their true thus based on the amount that the consumer is
market value. Ultimately, the value of labour willing to pay for it.
should be assessed based on its availability All three of these approaches are proxies and
(elasticity). each presents its own set of problems. The
Shadow pricing allows prices to be adjusted to human capital approach equates life to livelihood
reflect their true economic value. In the case of and fails to encompass those who are not form-
labour, if there is high involuntary unemploy- ally employed. The implied values approach,
ment, the shadow price of unskilled labour may which uses the cost of mortality-reducing inter-
approach zero. If, on the other hand, there is a ventions introduced by the public sector as a
limited supply of highly skilled labour, the proxy for society’s value of life, is based on the
shadow price would represent the opportunity values of current decision-makers (McQuire,
cost of that individual’s labour if he or she were Henderson, Mooney 1992). The ‘willingness to
268
30 health economics
pay’ approach may be the most economically This model has been criticised because it is
sound because it assumes consumer sovereignty, based on trade-off decisions usually made under
but there are shortcomings in this approach. hypothetical situations; it looks exclusively at the
movement from good health to bad; its valuations
Health status indicators assume that an additional year of suffering is
Health status indicators are valuable when com- equivalent to the previous one; and QALYs do not
paring health status both across time and cross- measure externalities associated with health care.
Nonetheless, the introduction of QALYs into
sectionally. Mortality and morbidity rates are the
health economics represents an attempt to in-
most frequently used measures of health output.
These indicators are vague and those working in corporate intangible costs into the discipline.
the health field have sought to find more precise
Disability adjusted life years (DALYs)
ones. These include: disease-specific measure-
DALYs were developed by the World Bank to
ments which quantify clinical change (for exam-
measure the burden of disease (World Bank
ple, changes in cholesterol levels); measures of
1993). They measure the potential years of life lost
dysfunction (number of productive working days
due to disability or death ata given age. While
lost to illness or inability to perform certain tasks);
they are similar to QALYs, they explicitly include
health profiles which give individuals a rating in
disease type (using disease-specific severity
identified health ‘domains’ according to levels of
weightings), focus exclusively on disability, and
disability and distress (such as the Nottingham
are weighted for age based on the individual’s
Health Profile); and global health indicators
predicted productive value. Many of the criti-
which measure states of health (utilities) relative
cisms of QALYs are equally applicable to DALYs.
to each other on an interval scale of 0-1 (death —
DALYs have also been faulted for giving an age
perfect health).
weighting based solely on the individual’s
assumed productive capacity, without taking into
Quality adjusted life years (QALYs) account other socio-cultural variables.
The health status indicators covered above are
limited by their inability to measure the effect of
the duration of ill health. One methodology that
Economic evaluation methods
attempts to account for this is the QALY or the The economic evaluation of health-related inter-
Quality Adjusted Life Year. In this model, the ventions looks at how inputs relate to outputs, or
number of years of life gained from a medical the complete production function. Essentially, it
intervention are adjusted to account for quality of compares costs to consequences. Good cost and
life — disability and distress levels. This is espe- epidemiological data is crucial to economic
cially relevant to medical interventions that have evaluation. The results of this type of evaluation
the potential to incapacitate the patient. Like the are only as good as the information used. De-
global health indicators mentioned above, this cision-makers need to make decisions on how to
model depends on a single index to value levels of derive maximum benefit from a fixed budget.
health, and measures quality on a scale of 0-1 Economic evaluation is one tool that may provide
using health rating scales (Kind, Rosser, Williams a framework for these choices.
1983), time trade-offs between disability and A number of types of economic evaluation
years of life, or ‘standard gamble’ — the trade-off exist. Different ones are appropriate for different
between chronic illness and the chance of death contexts, and they differ from one another in the
due to a potentially hazardous treatment (Tor- extent of their analysis and .their evaluation of
rance 1987). QALYs extend beyond global health outcome. Four methods are briefly described
indicators to weight the index relative to years of below. Cost-minimization analysis is used to find
life gained. Ultimately, this approach can be used the least costly way of implementing a project-or
to measure the trade-off between being relatively programme. Cost-benefit analysis is used to
healthy over a short period of time against being determine whether a programme should be,intro-
in poor health over a long period of time. It can duced or not. And cost-effectiveness and cost-
also be used to assess the relative impact of differ- utility analyses compare programmes to deter-
ent health interventions when the costs of these mine which produces the desired outcome at the
interventions are expressed in terms of the least cost. In cost-utility analysis, the outcome is
number of QALYs they produce. expressed in terms of QALYs.
269
e epidemiology applied to content areas
detail necessary in a study. In some cases, results studies may reflect the most efficient path to
may be reliable using fairly rough data, whereas follow, they do not necessarily provide the most
in other cases more specific indicators are neces- equitable solutions. Their value stems mostly
sary. from the fact that they ‘make health care planning
comprehensive rather than partial, systematic
rather than piecemeal, and value judgements
Conclusion explicit rather than implicit’ (McQuire, Hender-
One must bear in mind that not all costs are easily son, Mooney 1992).
quantifiable, and may have effects which are not
purely financial. Economic evaluations are also
not free of the analyst’s value judgements. They EXAMPLE 30.1
require co-operation between clients, health Economic impact of the AIDS
workers, health managers, epidemiologists, and
economists. Furthermore, evaluations are costly
epidemic in SA
and time-consuming and are not presently used This study assesseds the direct and indirect costs
extensively in the health sector outside of narrow of AIDS to South Africa. Direct costs included
fields such as the evaluation of pharmaceuticals. direct personal costs (such as hospitalization and
Therefore, the results of economic evaluation drugs for HIV/AIDS patients) and non-personal
should be seen as no more than atool for de- costs (testing, research, and prevention pro-
cision-makers. In the provision of health care, grammes). Direct personal costs were estimated
there is a need for a balanced interplay between using a hypothetical model for utilization of
economic appraisal and considerations of equity, health care resources at each stage of the disease.
ethics, and political expediency. While evaluation The major cost component of this section was
271
e@ epidemiology applied to content areas
found to be hospitalization. The model generated benefits of a vaccine for Haemophilus influenzae
cost estimates for the public and private sectors type B infection (Hib) in the 1992 Cape Town
and for the country as a whole. Non-personal birth cohort (n = 46 537). The risk of disease over a
costs were calculated based on estimates of the five-year period was calculated to be 666 to
total number of HIV tests conducted, and the 100 000 births. Net benefits are the difference be-
total annual cost of research and education. It tween the costs of using or not using a HibTITER.
was estimated that the direct cost (both personal Costs were the sum of direct medical costs (per-
and non-personal) of HIV/AIDS to South Africa in sonal and non-personal) and indirect costs. The
1995 would be between R4713 million and bulk of direct benefits (avoidable disease costs)
R10 007 million. Indirect costs were based on the were the hospitalization of patients with menin-
total number of work years lost. The indirect cost gitis, pneumonia, and septicaemia. Indirect costs
of HIV/AIDS for 1995 was estimated at R1 887 and benefits were measured using the human
million (after the figures were adjusted for a 50% capital and willingness to pay approaches. Costs
replacement rate of disabled labour). The study and benefits were discounted through time at a
then went on to predict the economic impact of rate of 2%. Unvaccinated disease patterns and
HIV/AIDS on GNP; it found a 0,12% loss of earn- vaccine efficacy data were compiled from clinical
ings for 1995 and a 1,5% loss of earnings by 2005. data. Sensitivity analyses were conducted on a
Thereafter, various scenarios of service mix were number of assumptions; coverage was assessed at
simulated to establish their impact on costs. The 85%, 90% and 95%. The value of life estimates and
study concluded that AIDS could have a profound the discount rate chosen were found to have the
impact on health care resources, seriously hinder greatest impact on the model's results. Net bene-
economic growth in the medium term, and pro- fit was measured as the difference between cost of
duce numerous other indirect costs. The results disease averted and the cost of the vaccination
of such a costing exercise can be used to show programme itself. It was estimated that the costs
policy-makers the importance of early preventa- of introducing the vaccine to the 1992 Cape Town
tive measures and can assist them with their long- birth cohort would have been R8,3 million, while
term financial plans. the economic cost of Hib disease was projected at
R10,7 million to R11,8 million. Had the vaccine
Reference: been introduced, the net benefit (costs saved)
Broomberg, J, Steinberg M, Masobe P, Behr G. ‘The Eco- would have ranged from R2,4 million to R3,5 mil-
nomic Impact of the AIDS Epidemic in South Africa’ in Bar- lion. This example shows how economic evalu-
nett T, Blaikie P (eds). AIDS in Africa; Its Present and Future ation can be used to motivate for a particular
Impact, London: Belhaven Press 1992. health intervention. It can be used to show that
even if the costs upfront appear to be substantial,
the cost of doing nothing may be greater.
EXAMPLE 30.2
Costs and benefits of vaccination Reference:
programme for Hib disease Hussey, GD, Lasser M L, Reekie W D. ‘The costs and benefits
ofa vaccination programme for Haemophilus influenza type
In this study, the researchers conducted a cost- B disease.’ Southern African Medical Journal 1995; 85(1):
benefit analysis to measure the hypothetical 20-25.
272
appendix
Group A
Age group Population size Proportion of Number of deaths Mortality rate
population in in age group in age group
age group
Group B
Age group Population size Proportion of Number of deaths Mortality rate
population in in age group in age group
age group
the instruments. For example, the WHO (1983) stip- contact with the heels and the length read to the
ulates that crown to heel lengths of children under nearest 0,1 cm.’
two years of age should be determined using a After such training, and before field work starts,
wooden length-board in the following way: the consistency and accuracy of the observers’
‘The infant is laid on the board, which is itself readings should be investigated. Each observer
on a flat surface. The head is positioned firmly should make repeat measurements on some sub-
against the fixed headboard, with the eyes looking jects. These subjects should be similar to those who
vertically. The knees are extended, usually by firm will be measured in the study. In this way, one can
pressure applied by an assistant, and the feet are determine whether an observer measures consis-
flexed at right angles to the lower legs. The tently. However, a consistent observer may be
upright sliding footpiece is moved to obtain firm measuring consistently incorrectly (for example,
274
appendix
276
bibliography
bibliography
Section A: Contextualizing epidemiology 2 Key concepts in epidemiology
1 Introduction References
Beaglehole R, Bonita R, Kjellstrom T. Basic epidemiology.
References Geneva: World Health Organization 1993.
‘Alexandra Health Centre.’ Special edition South African Bradford Hill A. ‘The environment and disease: Association
Medical Journal 1991; 80:4. or causation?’ Proceedings of the Royal Society ofMedicine
Bac M. ‘PHCin practice.’ CHASA-in-Action: Optimal Health 1965; 58:295-300.
for All. 1st International Conference, 18-20 June 1990. Goldberg M, Dab W. ‘Complex indexes for measuring a
Cassel J A. ‘A comprehensive health program among South complex phenomenon.’ Chapter 7.5 in Abelin T,
African Zulus.’ In Paul B D (ed.) Health, culture and Brzezinski ZJ, Carstairs V D L (eds). Measurementin
community. New York: Russell Sage Foundation health promotion and protection. WHO Regional
1955; 15-41. Publications, European Series: No 22 (1987). *
Commission on Health Research for Development. Jeffe A M. ‘Health status indicators: their utility in chronic
Health research: Essential link to equity in development. disease evaluation research.’ Journal of Chronic Diseases
1980; 33:567-79.
New York: Oxford University Press 1990.
KarkS L, Steuart GW, (eds)..A practice of social medicine. Jinabhai C C, Coovadia H M, Abdool Karim SS. ‘Socio-
Edinburgh: E & S Livingstone 1962. medical indicators of health in South Africa.’ International
Journal of Health Services 1986; 16:163-76.
KarkS L, Le Riche H. ‘A health study of South African Bantu
Leavell H R, Clark EG. Preventive medicine for the doctor in
school children. Report on a somatometric and clinical
his community. New York: McGraw Hill Book Company
study conducted during 1938-39.’ Manpower. Scientific
1965.
Journal Devoted to Manpower Research in South Africa
Lilienfeld A M, Lilienfeld D E. Foundations of epidemiology
1994; 3:2-141.
(2nd edn). Oxford: Oxford University Press 1980.
LastJ M. A dictionary of epidemiology (2nd edn). New York:
MausnerJ S, Kramer S. Epidemiology —An introductory text.
Oxford University Press 1988.
Philadelphia: W B Saunders Company 1985.
‘Mamre Community Health Project.’ Special edition South MacMohonB, Pugh T F. Epidemiological principles and
African Medical Journal 1988; 74:7. methods (Chapter 2). Boston: Little, Brown and Co. 1970.
‘Report of the National Health Services Commission on the Mazur RE. Population structure, fertility and childhood
Provision of an Organized National Health Service for All mortality in South Africa: Lessons to be learned from
Sections of the People of the Union of South Africa 1942- analysis of the Poverty survey. Tygerberg: Medical Research
1944 [The Gluckman Report].’ Cape Town, Republic of Council 1995.
South Africa: South African Government Printer 1945. Noack H, Abelin T. ‘Conceptual and methodologic aspects of
Rothman K. Modern epidemiology. Boston: Little, Brown and measurement in health and health promotion.’ Chapter 5
Co. 1986. in Measurement in health promotion and protection edited
Slome C. ‘Community health in rural Pholela.’ In Kark S L, by Abelin T, Brzezinski ZJ, Carstairs VD L. WHO Regional
Steuart G W (eds). A practice of social medicine. Edinburgh: Publications, European Series: No 22 (1987).
Eand S Livingstone 1962; 260-91. Omran A R. ‘The epidemiological transition: a theory of the
Tollman S. ‘The Pholela Health Centre — The origins of epidemiology of population change.’ Millbank Memorial
community-oriented Primary Health Care (COPC).’ Fund Quarterly 1971; 49: 509-39.
South African Medical Journal 1994; 84:653-8. Rothman K. Modern epidemiology. Boston: Little, Brown and
Tollman S. ‘Community-oriented primary care: origins, Co. 1986.
evolution, application.’ Social Science and Medicine 1991; Sackett D, Haynes R B, Tugwell P. Clinical epidemiology—
32:633-42. A basic science for clinical medicine. Boston: Little, Brown
Yach D, Tollman S. ‘Public Health Initiatives in South Africa and Co. 1985.
in the 1940s and 1950s: Lessons for a Post-Apartheid Era.’ Schach E. ‘Alternative ways of presenting statistics on
Annals of the Journal of Public Health 1993; sickness, disability and death.’ Chapter 11 in Measurement
83:1 043-50. in health promotion and protection edited by Abelin T,
Brzezinski ZJ, Carstairs VD L. WHO Regional ee
Further reading European Series: No 22 (1987): 213-31.
Soberon G, FrenkJ,Sepulveda J. ‘The health care retdimitin
Hanlon JJ, Pickett GE. Public health, administration and
Mexico: before and after the 1985 earthquake.’ American
practice. Missouri: The C V Mosby Company 1979.
Journal of Public Health 1986; 76:673-80.
Parry C DH, Tollman SM, Yach D. ‘Towards an essential
Steyn K, Buch E. Chronic diseases of lifestyle in South Africa:
national health research strategy for South Africa.’
A policy proposal. Medical Research Council 1992.
(Opinion) South African Medical Journal 1992; Susser M. ‘What is a cause and how do we know one?
82:299-300.
A grammar for pragmatic epidemiology.’ American
Tollman S M. ‘The Commission on Health Research for Journal of Epidemiology 1991; 133:635-48.
Development: Its work and relevance for South Africa.’ World Health Organization. Development ofIndicators for
Urbanisation and Health Newsletter CERSA, Medical Monitoring Progress towards Health for All by the Year
Research Council, April 1991. 2000. Geneva 1981.
Wall S. ‘University of Umea Department of Epidemiology Yach D, Mathews C, Buch E. ‘Urbanisation and health:
and Health Care Research Annual Report 1989.’ University methodological difficulties in undertaking epidemiologi-
of Umea, Sweden, April 1990. cal research in developing countries.’ Social Science and
Medicine 1990; 31:507-14.
214,
epidemiology: a manual for south africa
Working Paper, Department of Social Anthropology, Steineck G, Ahlbom A. ‘A definition of bias founded on the
University of Cape Town 1989. concept of the study base.’ Epidemiology 1992; 3(6):477-82.
Rifkin S B. ‘Health planning and community participation.’
World Health Forum 1986; 7:156-62. 5 Literature review
Rifkin S B. ‘Lessons from community participation in health
programmes.’ Health Policy and Planning 1987; 1(3): 240-49. Further reading
Rothman J. Using research in organisations:A Guide to
successful application. Sage Publications 1980. Sackett D L, Haynes RB, Guyatt G H, Tugwell P. Clinical
Titchen A, Binnie A. ‘Action research: a strategy for theory epidemiology:A basic science for clinical medicine (2nd
generation and testing.’ International Journal of Nursing edn). Chapters 11, 12 and 13. Boston: Little, Brown and
Studies 1994; 31:(1)1-12. Co. 1991.
Twiss B. ‘Exploitation of science through technology and
innovation.” RSA 2000 1988; 10(1). Human Sciences 6 Setting objectives for research
Research Council.
Van der Walt H, Hoogendoom L. ‘Training for self-evalua- References
tion: Experiences at Ithusheng Health Centre, South Werner D, Bower B. Helping health workers learn
Africa.’ Learning for Health, Liverpool School of Tropical pp. 26.2-26.7. Palo Alto: Hesperian Foundation 1991.
Medicine, Education Resource Group Mar-Aug 1995; 7:8-12. Varkevisser C M, Pathmanathan I, Brownlee A. Designing
Van der Walt H. ‘A rural community health project for the and conducting HSR Projects HSR Training Series.
aged in Genadendal, South Africa.’ M Ed. dissertation. Volume 2 Part 1. International Development Research
Department of Adult and Higher Education, University of Centre (Canada) and World Health Organization (Geneva)
Manchester 1983. 19Q1:
Van der Walt H. ‘The role of participatory research to
enhance patient adherence to tuberculosis treatment.’ 7 Study design
In: Proceedings of the International Symposium on
Participatory Research in Health Promotion. Compiled by References
Korrie de Koning, Liverpool School of Tropical Medicine,
August 1994. Rodrigues L, Kirkwood B R. ‘Case-control designs in the
Veatch R M (ed). Cross cultural perspectives in medical ethics: study of common diseases: Updates on the demise of the
readings. Boston: Jones and Bartlett Publishers 1989. rare disease assumption and the choice of sampling
Walters S. ‘Participatory research — Issues and questions scheme for controls.’ International Journal ofEpidemiology
(working paper).’ In: Research Methods for Higher Degrees. 1990; 19:205-13.
Cape Town: Faculty of Education, University of Cape Rossouw] E, Jooste P L, Chalton D O, Jordaan ER,
Town 1983; 102-14. 4 : Langenhoven M L, Jordaan P C, Steyn M, Swanepoel AS,
Weiss C H, Bucuvalas MJ. ‘The challenge of social research Rossouw LJ. Community-based intervention: the
to decision-making.’ In Weiss C H (ed) Using social science Coronary Risk Factor Study (CORIS) International Journal
research in public policy-making. Lexington, MA: DC of Epidemiology 1993; 22:428-38.
Health 1977.
Wacholder S, McLaughlin J K, Silverman DT, MandelJ S.
Werner D. ‘Empowerment and health.’ Contact April 1980; ‘Selection of controls in case-control studies.’ American
102:1-9. Journal ofEpidemiology 1992; 135(9):1019-50.
Werner D. ‘Health care and human dignity — a subjective
look at community-based rural health programmes in Further reading
Latin America.’ Contact Aug 1980; 57:2-16. LilienfeldA M, Lilienfeld D. Foundations of epidemiology
Yach D, Dick J. ‘Implementation of research: The key to (2nd edn). Oxford: Oxford University Press 1980.
closing the gap between public health knowledge and Rothman KJ. Modern epidemiology New York: Little, Brown
action.’ Journal of Comprehensive Health 1991; 2(1):15-20. and Co. 1986.
Sackett D L, Haynes RB, Guyatt GH, Tugwell P. Clinical
epidemiology:A basic science forclinical medicine
Section B: Epidemiological research methods (2nd edn). Boston: Little, Brown and Co. 1991.
and protocol development
8 Sampling
4 Planning a research project
Reference References
Sackett D L. ‘Bias in analytic research.’ Journal of Chronic Jochelson K, Mothibeli M, LegerJ P. ‘Human
Immunodeficiency Virus and migrant labour in South
Diseases 1979; 32:51-63.
Africa.’ International Journal of Health Services 1991;
21:157-73.
Further reading PickW M, Cooper D, KlopperJ ML, MyersJ E, Hoffman MN,
Abrahamson J H. Survey methods in community medicine Kuhn LA. ‘A study of the effects of urbanisation on the
(3rd edn). New York: Longman Publishers 1988. health of women in Khayelitsha, Cape Town. Working
BothaJ, Yach D. ‘Epidemiological research methods (VI). Paper No. 1: rationale and methods.’ Cape Town:
Planning a research project.’ South African Medical Department of Community Health, University of Cape
Journal 1987; 72:563-6. Town 1990.
Hulley S B, Cummings S R. Designed clinical research: an Rutkove S B, Abdool Karim S S, Loening
W E. ‘Patterns of
epidemiological approach Baltimore: Williams and care in an overburdened tertiary hospital outpatients
Wilkens 1988. department.’ South African Medical Journal 1990; 77:476-8.
2
epidemiology: a manual for south africa
Sackett D LS. ‘Bias in analytic studies.’ Journal of Chronic ment.’ The Lancet 1986; 307-10.
Diseases. 1979; 32:51-3. Gardner MJ, Altman DG (eds.). Statistics with confidence
Watters J K, Biernacki P. ‘Targeted sampling: Options for the — Confidence intervals and statistical guidelines.
study of hidden populations.’ Social Problems 1989; London: British Medical Journal 1989.
36(4):416-30. Haertel U, Heiss G, Filipiak B, Doering A. ‘Cross-sectional
Yach D, Katzenellenbogen J, Conradie H. ‘Ciskei infant and longitudinal associations between high density
mortality study: Hewu district.’ South African Journal of lipoprotein cholesterol and women’s employment.’
Science 1987; 83:416-21. American Journal ofEpidemiology 1992; 135(1):68-78.
Mollentze W F, Joubert G. ‘Coronary risk factor study of
Further reading black rural and urban populations in the Free State.’
Biernacki P, Waldorf D. ‘Snowball sampling: Problems and University of the Orange Free State 1994. Unpublished
techniques of chain referral sampling.’ Social Methods data.
and Research 1981; 10:141-63.
Cohen J. Statistical power analysis for the behavioural Further reading
sciences. New York: Academic Press 1969. Altman D G. Practical statistics for medical research. London:
Donner A, Brown KS, Brasher P. ‘A methodological review of Chapman and Hall 1991.
non-therapeutic intervention trials employing cluster Anderson A, Richter L M. ‘““Birth-to-Ten”: error detection
sampling 1979-1989.’ International Journal of and correction in a longitudinal address database.’
Epidemiology 1990; 19(4):795-800. Southern African Journal ofEpidemiology and Infection
Goodman LA. ‘Snowball sampling.’ Annals ofMathematical 1994, 9:16-19.
Statistics 1961; 32:148-70. Chambers J M, Cleveland WS, Kleiner B, Tukey P A. Graphical
Levy PS, Lemeshow S. Sampling for health professionals. methods for data analysis. Boston: Duxbury Press 1983.
Belmont (California): Lifetime Learning Publications 1981. Draper N, Smith H. Applied regression analysis. New York:
Stoker DJ. ‘Basic sampling methods.’ In SchnetlerJ (ed). John Wiley 1981.
Survey methods and practice. Pretoria: Human Sciences Fleiss J L. Statistical methods for rates and proportions.
Research Council 1989. New York: John Wiley 1981.
Greenland S. ‘Randomisation, statistics and causal inference. ,
9 Datacollection and measurement Epidemiology 1990; 1(6):421-29.
Hoaglin D C, Mosteller F, Tukey JW. Understanding robust
Further reading and exploratory data analysis. New York: John Wiley 1983.
AbrahamsonJ H. Survey methods in community medicine Hosmer D W, Lemeshow S. Applied logistic regression.
(4th edn). Edinburgh: Churchill Livingstone 1990. New York: John Wiley 1989.
Joubert G, Hoffman M. ‘Data quality in a community health Jooste P L, Jordaan E, Van Eck M. ‘Error detection and
survey — the experience with the Mamre Community correction for improving the quality of epidemiological
Health project.’ Southern African Journal ofEpidemiology research data: Experience gained from a cardiovascular
and Infection 1990; 5(4):79-82. community study.’ Southern African Journal of
RezlerA G. The assessment ofattitudes. WHO/EDUC/712/ Epidemiology and Infection 1988; 3:12-16.
152. Geneva: World Health Organization. Kahn HA. An introduction to epidemiologic methods.
Sackett D LS. ‘Bias in analytic studies.’ New York: Oxford University Press 1983.
Journal of Chronic Diseases 1979; 32:51-63. Lee ET. Statistical methods for survival data analysis.
Woodward CA, Chambers LW. Guide to questionnaire Belmont, California: Wadsworth 1980.
construction and question writing (1st edn). Mainland D. ‘Statistical ritual in clinical journals: Is there a
Canadian Public Health Association 1980. cure? —I. British Medical Journal 1984; 288:841-3.
Mainland D. ‘Statistical ritual in clinical journals : Is there a
cure? —II.’ British Medical Journal 1984; 288:920-22.
10 Other issues to consider in the protocol Rosner B. Fundamentals of biostatistics. Boston:
Duxbury Press 1982.
Further reading
Rothman KJ. Modern epidemiology. New York:
MRC News. Special issue: Compilation of articles of general Little, Brown and Co. 1986.
interest 1989; 20(5). (This issue contains articles on, for Siegel S. Non-parametric statistics for the behavioural
example, how to write scientific papers, how to present sciences. New York: McGraw-Hill 1956.
research results visually, the roles of authors and editors, Tufte E R. The visual display ofquantitative information.
and ethical issues in publishing research.) Connecticut: Graphics Press 1983.
Also refer to references and further readings given for Chapter
3 and those given in Section C. 12 Integrating epidemiological concepts
Okediji F. ‘Theoretical and methodological critique of Bradshaw D, Laubscher R, Schneider M. ‘Estimated Cause of
surveys of knowledge, attitudes and practice of family Death Profiles for the nine new provinces based on 1990
planning in Africa.’ International Review ofModern data.’ South African Medical Research Council, Technical
Sociology 1973; 3:1-11. Report, Parow, 1995.
Pick WM, Steyn K. ‘Hypertension in family practice.
Astudy of knowledge, attitudes and practices.’ Further reading
South African Medical Journal 1992; 82:257-9. Alderson M R. Mortality, morbidity and health statistics.
Population Council. A manual for surveys offertility and
USA: Stockton Press 1988.
family planning: Knowledge, attitude and practice.
Bradshaw D, Botha H, Joubert G, Pretorius J P, Van Wyk R,
New York: Population Council 1972.
Yach D. Review of South African Mortality (1984). Medical
Ratcliffe J. ‘Analyst biases in KAP surveys: A cross-cultural Research Council, Technical Report No 1, Parow, 1987.
comparison.’ Studies in Family Planning 1976; 7:322-30.
ManualX. Indirect techniques for demographic estimation.
Stone L, Campbell J. ‘The use and misuse of surveys in inter- Department of International Economic and Social Affairs.
national development: An experiment from Nepal.’ Population Studies, No. 81. United Nations, New York 1983.
Human Organization 1984; 43:27-37.
TanurJ M (ed). Questions about questions: Inquiries into the
cognitive bases of surveys. New York: Russell Sage 24 Outbreak investigations
Foundation 1992.
Westoff C. ‘Is the KAP-gap real?’ Population and References
Development Review 1988; 14:225-32. Coetzee N, Hussey GD, Visser G, Barron P, Keen A. ‘The 1992
measles epidemic in Cape Town — a changing epidemiol-
19 Qualitative methodology: an introduction ogical pattern.’ South African Medical Journal 1994;
84:145-9.
Reference Coetzee N, Malusi S, MoetiS. ‘Lack of reporting on disease
outbreaks and epidemics.’ South African Medical Journal
Patton M Q. Qualitative evaluation methods. Beverly Hills: 1993; 83:531-2. a
Sage Publications 1980. Gear, J S, Cassell GA, Gear AJ, Trappler B, Clausen L, Meyers
AM, KewMC, Bothwell T H, Sher R, Miller GB, Schneider
Further reading J, Koornhof HJ, Gomperts ED, Isaacson M, GearJ H.
Basch CE. ‘Focus Group Interview: An under-utilised ‘Outbreak of Marburg virus disease in Johannesburg.’
research technique for improving theory and practice in British Medical Journal Nov 29 1975; 4(5 995):489-93.
health education.’ Health Education Quarterly 1987; TurkI. ‘A meningococcal meningitis epidemic really?’
—
14(4):411-48. Epidemiological Comments 1993; 20:78-9.
Bell J. Doing your research project. Milton Keynes:
Open University Press 1987. Further reading
Bernard H R. Research methods in anthropology.
Qualitative and quantitative approaches. Newbury Park, Benenson AS (ed). Control of communicable diseases
manual (16th edn). Washington DC: American Public
California: Sage Publications 1994.
Health Association 1995.
Eng E, Glik D, Parker K. ‘Focus-group method: Effects on
Centers for Disease Control. ‘Case definitions for public
village-agency collaboration for child survival.’
health surveillance.’ Morbidity and Mortality Weekly
Health Policy and Planning 1990; 5(1):67-76.
Report October 19 1990; 39:RR-13.
Miles M B, Huberman A M. Qualitative data analysis:
Dwyer DM, Strickler H, Goodman RA, Armenian H K.
Asourcebook of new methods. Beverly Hills:
‘Use of Case-Control Studies in Outbreak Investigations.’
Sage Publications 1984. Epidemiology Reviews 1994; 16(1):109-23.
The Open University. Research methods in Education and Fonseca M GP, Armenian H K. ‘Use of Case-Control Method
the Social Sciences. Milton Keynes: Open University Press in Outbreak Investigations.’ American Journal of
1979. Epidemiology 1991; 133:748-52.
Patton M Q. How to use qualitative methods in evaluation Galbraith N S. ‘The application of epidemiological methods
(chapter 5). Newbury Park, California: Sage Publications in the investigation and control of an acute episode of
1987. infection.’ In: Holland WW, Detels R, Knox G (eds).
Strauss A L. Qualitative analysis for social scientists. Oxford textbook of public health vol. 4. New York:
Cambridge: Cambridge University Press 1987. Oxford University Press 1985; 3-21.
Gregg MB. ‘The principles of an epidemic field investiga-
Section E: Epidemiology applied to content areas tion.’ In Holland WW, Detels R, Knox G (eds). Oxford
textbook ofpublic health vol 3. New York:
Oxford University Press 1985; 284-99.
20 Mortality studies KelseyJ L, Thompson WD, Evans AS. ‘Epidemic investiga-
tion.’ In Methods in observational epidemiology.
References New York: Oxford University Press 1986; 212-53.
BothaJ L, Irwig LM, Strebel P M. ‘Excess mortality from Orenstein W A, Bernier R H, Dondero T J, Hinman A R, Marks
stomach cancer, lung cancer, and asbestosis and/or JS, Bart KJ, Sirotkin B. ‘Field evaluation of vaccine efficacy.’
mesothelioma in crocidolite mining districts in South WHO Bulletin 1985; 63(6):1 055-68.
Africa.’ American Journal of Epidemiology 1986; 123(1):30- Palmer S R. ‘Epidemiology in search of infectious diseases:
40. methods in outbreak investigation.’ Journal of
Bailie RS, Selvey CE, Bourne D, Bradshaw D.‘Trends in Epidemiology and Community Health 1989; 43:311-14.
cervical cancer mortality in South Africa.’ Submitted to the Palmer S R. ‘Outbreak Investigation: The Need for “Quick
International Journal ofEpidemiology 1996; 25(3): 488-93. and Clean” Epidemiology.’ International Journal of
283
epidemiology: a manual for south africa
World Health Organization. Toxic oil syndrome. tion disorders in a rural area: the role of the community
Copenhagen: World Health Organization 1984. speech and hearing therapy workers.’ South African
World Health Organization. Environmentand health: Journal of Communication Disorders 1992; 39:55-61.
the European Charter. Geneva: World Health Schneider M, Rispel L, Doherty J. The development ofguide-
Organization 1990. lines for rehabilitation services in the Greater Soweto Area.
Paper No. 34, Centre for Health Policy, Department of
24 Occupational epidemiology Community Health. University of the Witwatersrand,
Johannesburg 1994.
References Zaman §S S, Kahn N Z, Islam S, Bann S, Dixit S, Shrout P,
Bradford Hill A. ‘The environment and disease: Association Durkin M. ‘Validity of the “Ten Questions” for screening
or causation?’ Proceedings of the Royal Society ofMedicine serious childhood disability: Results from urban
1965; 58:295-300.
Bangladesh.’ International Journal of Epidemiology 1990;
Checkoway H, Pearce N V, Crawford-Brown DJ. 19(3):613-20.
Research methods in occupational epidemiology.
New York: Oxford University Press 1989. 26 Psychiatric epidemiology
Myers J E, Cornell J E. ‘Respiratory health of brickworkers in
Cape Town, South AFrica: symptoms, signs and References
pulmonary function abnormalities.’ Scandinavian Journal American Psychiatric Association (APA). Diagnostic and
of
Work Environment and Health 1989; 15(3):188-94. statistical manual of mental disorders (4th edn) (DSM-IV).
Myers J E, Garisch D, Louw SJ. ‘Respiratory health of Washington DC: American Psychiatric Association 1994.
brickworkers in Cape Town, South Africa: radiographic Bech P, Malt U F, Dencker S J, Ahlfors U G, Elgen K,
abnormalities.’ Scandinavian Journal ofWork Lewander T, Lundell A, Simpson GM, Lingjaerde O (eds).
Environment and Health 1989; 15(3): 195-7. ‘Scales for assessment of diagnosis and severity of mental
Myers J E, Lewis P, Hofmeyr W. ‘Respiratory health of brick- disorders.’ Acta Psychiatrica Scandinavica Supplementum
workers in Cape Town, South Africa: background, aims No. 372. Vol 87. Copenhagen: Munksgaard 1993.
and dust exposure determinations.’ Scandinavian Journal Blackburn C. Poverty and health: Working with families.
of
Work Environment and Health 1989; 15(3):180-87. London: Open University Press 1991.
Myers J E, von Schirnding YE R. ‘Occupational and environ- Brown G, Harris T. Social origins of depression. London:
mental epidemiology — Similarities and contrasts.’ Tavistock 1978.
South African Medical Journal 1992; 81(11):546-9. Cooper B. ‘Psychotherapy, psychiatric epidemiology and
Steenland K(ed). Case studies in occupational epidemiology. health services.’ Social Psychiatry 1984, 19:93-5.
New York: Oxford University Press 1993. Dawes A, Donald D (eds). Childhood and adversity:
Psychological perspectives from South African research.
25 Disability studies Cape Town: David Philip 1994.
Dick B, Spencer 1 WF, Watermeyer GS, Bourne DE,
References Wolff E P M, Moyle GD. ‘Chronic illness in non-institut-
Coetzer T. ‘General development, visual perception and ionalized persons. Part I. Prevalence and epidemiology.’
motor function of children with birthweight under 1 250 g South African Medical Journal 1978; 53:892-904.
and their fullterm, normal birthweight peers at 5-6 years of DragunsJ G. ‘Assessing mental health and disorder across
age.’ Master of Philosophy (Maternal and Child Health) cultures.’ In Pedersen P B, Sartorius N and Marsella AJ
thesis, Department of Paediatrics and Child Health, (eds) Mental health services: The cross cultural context
University of Cape Town (in progress). 31-58. Beverly Hills: Sage Publications 1984.
World Health Organization. International classification of Gillis LS. ‘Research into community-based essential mental
Impairments, Disabilities and Handicaps —A manual of health care services.’ Paper presented at the Seminar on
classification relating to the consequences of disease. Geneva: community-based essential health care services.
World Health Organization 1980. Johannesburg, June 1987.
Gillis LS, Elk R, Ben-Arie O, Teggin A. ‘The present State
Further reading Examination: Experiences with Xhosa-speaking psychiat-
Chamie M. ‘Survey design strategies for the study of disability.’ ric patients.’ British Journal of Psychiatry 1982; 141:143-7.
World Health Status Quarterly 1989; 42(3):122-46. Goldberg D, Huxley P. Mental illness in the community.
Coetzee LT, Katzenellenbogen J, Rendall K. ‘Coping in London: Tavistock 1980.
Mitchell’s Plain: Qualitative analysis of a group of handi- Goldberg D, Huxley P. Common mental disorders:
capped elderly in the community.’ South African Family A bio-social model. London: Routledge 1992.
Practice 1993; 14(8):341-6. Holmes TH, Rahe RH. ‘The Social Readjustment Rating
Badley E M. ‘The ICIDH: Format, application in different Scale.’ Journal of Psychosomatic Research 1967; 11:213-18.
settings, and distinction between disability and handicap.’ Links P S. ‘Community surveys of the prevalence of child-
International Disability Studies 1989; 9:122-5. hood psychiatric disorders: A review.’ Child Development
Concha M &Lorenzo T. ‘The prevalence of disability in a 1983; 54:531-48.
rural area of South Africa-Gazankulu.’ South African Littlewood R, Lipsedge M. Aliens and alienists:
Journal of Occupational Therapy 1993; 23(2):6-15. Ethnic minorities and psychiatry (2nd edn).
Cornielje H, Ferrinho P, Coetzee D, Reinach SG. London: Unwin Hyman 1989.
‘Development of a community based rehabilitation Marsella AJ. ‘Urbanization and mental disorders:
programme for a poor urban area in South Africa. An overview of theory and research, and recommendat-
A Disability prevalence study — Part I.’ ions for interventions and research’ (Unpublished mono-
Journal of Comprehensive Health 1993; 4(1):26-32. graph). Geneva: Mental Health Division, World Health
Schneider M. ‘The nature and management of communica- Organization 1992.
285
epidemiology: a manual for south africa
Miller T, Swartz L. ‘Access to psychiatric resources: The case method with extent of “overweight” and various weight-
of Mamre.’ Presented at the Association for Sociology in height relationships in the assessment of obesity.’
Southern Africa Conference, Stellenbosch, July 1990. British Journal of Nutrition 1977; 38:271-84.
Miller T, Swartz L. ‘Psychology and epidemiology: World Health Organization. Measuring change in nutritional
An uncomfortable alliance?’ South African Journal of status. Geneva: WHO 1983.
Psychology 1992; 22:52-8.
Newton J. Preventing mental illness in practice. London: 28 Assessment of dietary intake
Routledge 1992.
Orley J, Wing) K. ‘Psychiatric disorders in two African References
villages.’ Archives of General Psychiatry 1979; 36:513-20.
Parry CDH. ‘Areview of psychiatric epidemiology in Africa: Department of Health. Report on Health and Social Subjects
Strategies for improving validity when using instruments 41. Dietary Reference Values for Food Energy and Nutrients
transculturally.’ Transcultural Psychiatric Research Review for the United Kingdom. Report of the Panel on Dietary
1996; 33:173-8. Reference Values of the Committee on Medical Aspects of
Reeler A P. ‘Psychological disorders in Africa II: Clinical Food Policy. London: HMSO 1991.
issues.’ Central African Journal of Medicine 1987; 33:15-19. Food and Nutrition Board, National Academy of Sciences —
Reynolds J, Swartz L. ‘Professional constructions ofa “lay” National Research Council. ‘USA 1989 Revised
illness: “nerves” in a rural “coloured” community in South Recommended Dietary Allowances.’ Journal of the
Africa.’ Social Science and Medicine 1993; 36:657-63. American Dietetic Association 1989; 89:1 748-52.
Robins LN. ‘Psychiatric epidemiology— ahistoric review.’
Social Psychiatry & Psychiatric Epidemiology 1990; 25:16-26. Further reading
‘Rumble S, Swartz L, Parry C D H, Zwarenstein M. Bingham SA. ‘The dietary assessment of individuals;
‘Prevalence of psychiatric morbidity in the adult populat- methods, accuracy, new techniques and recommendat-
ion ofa rural South African village.’ Psychological Medicine ions.’ Nutrition Abstracts and Reviews 1987; 57:705-42.
(in press). Cameron M E, Van Staveren W A (eds). Manual on method-
Swanson-Fisher R W, Martin CJ. ‘Standardized interviews in ology for food consumption studies. Oxford:
psychiatry: Issues of reliability.’ British Journal of Oxford University Press 1988.
Psychiatry 1981; 139:138-43. Department of Health. Guide to Healthy Eating.
Swartz L, Miller T. “Minor psychiatric disability and primary Pretoria 1990.
health care utilisation in Mamre.’ Presented at the Langenhoven ML, Kruger M, Gouws E, Faber M. MRC Food
Epidemiological Society of South Africa Conference, Composition Tables (3rd edn). Parow: Medical Research
University of the Western Cape, July 1991. Council 1991.
Swartz L. ‘Transcultural psychiatry in South Africa. Part II.’ Langenhoven M L, Conradie PJ, Wolmarans P, Faber M.
Transcultural Psychiatric Research Review 1987; 24:5-30. MRC Food Quantities Manual (2nd edn). Parow:
Swartz L, Ben-Arie O, Teggin AF. ‘Subcultural delusions and Medical Research Council 1991.
hallucinations: Comments on the present State Langenhoven M L, Gouws E, Wolmarans P, Van Eck M.
Examination in a multi-cultural context.’ British Journal of The analysis of dietary data at RIND: Nutrients and food
Psychiatry 1985; 146:391-4. groups. Parow: Medical Research Council 1989.
Tafari S, Aboud F E, Larson C P. ‘Determinants of mental Passmore R, Nicol B M, Rao MN, Beaton GH, DeMayer EM.
illness in a rural Ethiopian adult population.’ Handbook on Human Nutritional Requirements. Rome:
Social Science and Medicine 1991; 32:197-201. Food and Agricultural Organization of the United Nations
Tansella M, de Girolamo G, Sartorius N. Annotated bibliog- 1974.
raphy of psychiatric epidemiology. London: Gaskell 1992.
World Health Organization. The ICD-10 classification of 29 Measurements in dental epidemiology
mental and behavioural disorders: Clinical descriptions
and diagnostic guidelines 1992. References
Al-Alous W, Jackson D, Crompton G, Jenkins O C. ‘Enamel
27 Anthropometric studies mottling in a fluoride and non-fluoride community.’
British Dental Journal 1975; 138:9.
Further reading Angle E H. ‘Classification of malocclusion.’ Dental Cosmos
Bray GA. ‘Definition, measurement and classification of the 1989; 41:248.
syndromes of obesity.’ International Journal of Obesity Bollmer B W, Sturzenberger O P, Lenoff R W, Bosma
1978; 2:99-112. ML, Lang NP, Mallet ME, MeckelA H. ‘A comparison of
Frisancho AR. ‘Newnorms of upper limb fat and muscle three clinical indices for measuring gingivitis.’ Journal of
areas for assessment of nutritional status.’ American Clinical Periodontology 1986; 13:392-395.
Journal of Clinical Nutrition 1981; 34:2 540-5. Chamberlain, J. ‘Evaluation of screening for cancer.’
Keys A. ‘Overweight, obesity, coronary heart disease and Community Dental Health 1993; 10. Supplement 1:5-11.
mortality.’ Nutrition Review 1980; 38:297-307.
Cohen, LK, Jago J D. ‘Socio-dental indicators.’ In Eloresian
J, Siegmann AE (eds.) Socio-Medical Health Indicators
Kibel MA, Wagstaff A (eds). Child health for all: Amanual
1983.
for Southern Africa. Cape Town: Oxford University Press
Cushing AN, Sheiham A, Maizels J. ‘Developing socio-
199e ‘i
dental indicators — the social impact of disease.’
Steyn K, Fourie J, RossouwJ E, Langenhoven M L, Joubert G,
Community Dental Health 1986; 3:3-17.
Chalton D O. ‘Anthropometric profile of the coloured
Dean, HT. ‘Classification of mottled enamel diagnosis.’
population of the Cape Peninsula.’ South African Medical Journal of the American Dental Association 1934;
Journal 1990; 78:68-72. : 21:1421-1426.
Womersley J, Durnin J VGA. ‘A comparison of the Skinfold Dean, HT, Elvove, E. ‘Some epidemiological aspects of
286
bibliography
287
epidemiology: a manual for south africa
288
index
289
epidemiology: a manual for South Africa
CPITN see Community Periodontal Index of Treatment methods of data collection in 248-51
Needs national level 248
criteria-related validity 92 validity and reproducibility of 251
cross-sectional analytic studies 67-8, 216, 219 disability, classification of 224-5
example of 222-3 Disability Adjusted Life Years (DALY) 23, 190, 269
crude rates 18 disability prevalence (studies to determine) 228-9
birth 20 disability studies 224-9
death (mortality) 20, 187 examples of 228-9
culture 237 variables included in 227
cumulative incidence 16-17 disclosure 31
discounting 268
D disease surveillance 140-6
definition of 140
for detecting outbreaks 197
DALY see Disability Adjusted Life Years
" history of 140
data analysis 87-9, 95-7, 101-23
main purpose of 141
by computer 87-9
methods of 141-4
by hand 87, 88
problems in 144-5
checking data before 102-3
to determine immunization coverage 205
commonly used 116-22
uses of 140-1
exploratory 103-8
DMFS/dmfs index 255
for estimating immunization coverage 209
DMFT/dmft index 255
in KAPB surveys 172
dummy tables 95
summarizing 108-111
what to consider when choosing appropriate 116
data capture sheet 87
data checking procedure 102
data collection 82-94 ecologic studies 71-2, 193, 216
for estimating immunization coverage 208-9 economic evaluation 264-9
for qualitative research 177-9 examples of studies 271-2
in community studies 165 measurement of costs and consequences 271
in dietary studies 48-51 methods of 269-71
in outbreak investigation 202-3 economic objectives 262-4
issues related to 93-4 effectiveness 262-4
methods of 84, 177-9 efficiency 262
quality of data 89-93 ENHR see essential national health research
using questionnaires for 82-9 environmental diseases
data distribution (types of) 104-6 causes of 213
data interpretation 101-23 nature of 212-13
data management 95-7 environmental epidemiology 211-17
data presentation 101-23 environmental health
guidelines for 122
assessment of exposure 213-15
data quality 89-93
definition of 211
data types 89-90
environmental health outcomes 212-13
death rates 16
detection of 212-13
demographic indicators (for South Africa) 22
types of 212
demographic processes 20-2
environmental investigations 199
demographic transition 22-3
environmental monitoring 214
demography 19-23
dental caries (measurement of) 255
EPI see Expanded Programme on Immunization
EPI30x7 sample 206, 275
dental epidemiology (measurements in) 254-60
deontological theories 27 epidemic (definition of) 196
dependency ratio 19, 22 epidemic curve 199, 200
depth interviews 177 epidemiological concepts, integrating 124-30
examples of use of 181 epidemiological triangle 11
descriptive statistics 108 epidemiologist, role of 6
descriptive studies 64, 66 epidemiology 5-6
objectives of 61 analytic 5
diabetes, late-onset (prevalence of) 167-8 definition of 5
diagnostic interviews 232, 233-4 descriptive 5
dietary data discipline of 5-6
calculation of 251 future of 8-9
collection of 248-51 in South Africa (history of) 6-8
interpretation of 251-2 ofa disease 5
dietary histories 250-1 Epidemiology Society of Southern Africa (ESSA) 8
dietary studies 248-53 equity 264
examples of 252-3 ESSA see Epidemiology Society of Southern Africa
individual level 248, 249 essential national health research (ENHR) 5
290
index
ethical principles (in research) 27-34 contrasted with perfect competition situation 263
ethical reasoning health care system 148
ethics and 26-7 economic evaluation of 264-9
examples of ethical dilemmas to stimulate 42 ways of looking at 151
process of 28 health economics 261-72
ethical review (of research) 33-4 health indicators 23-4
ethical theory 27, 28 key mortality rates used as 189
ethics 25-6, 96 proposed by WHO 24
and data collection 93 selection of for monitoring population health 24
and ethical reasoning 26-7 sources of data for 23
and occupational epidemiology 221 uses of 23-4
examples of ethically unacceptable research 26 health insurance 262
guidelines for action 96 health services data
in clinical trials 70 sources and limitations of 135, 137
issues of in epidemiological research 314 study of in provinces 138-9
ethics review committees 33 health status indicators 269
evaluation 147 health system 147-8
Expanded Programme on Immunization (EPI) 158-60 Health Systems Research (HSR) 147-57
sampling method used in 206-8, 209-10 approach of 152
experimental studies 64, 66, 69-71, 217, 219 characteristics of 152
exploratory data analysis 103-8 examples of questions for 153
exposure, more than two levels of 118-19 examples of studies in 155-7
exposure estimation 213-15 integrated view of 151-3
methodology of 152-3
F tips for conducting 153-5
health-related behaviour 169 ’
face validity 92 height for age 243-4
family planning services (REA of) 162-3 hepatitis B reporting system 146
feedback 33 histogram 106
fertility 20 hospital care (observations of levels of) 184
fertility rate 16 house-to-house sentinel population investigation 208
field supervision 94 HSR see Health Systems Research
field trials 70 hypothesis 62
fieldworkers (in community studies) 165-6 generating a 200
fluorosis (measurement of) 259 testing a 200-2
focus group interviews 177-8 hypothesis (significance) testing 114-15
follow-up studies see cohort studies
food consumption studies see dietary studies
291
epidemiology: a manual for South Africa
292
index
etcee
293
epidemiology: a manual for South Africa
analysing 116-18
R measures of 15-19
road to health card 245
random number table 276
routine surveillance 142-3
random sampling 75-8
problems in 145
randomized controlled clinical trials 70
routinely available data
range 111
examples of use of 137-9
rapid environmental health assessment 160-1
for estimating immunization coverage 205-6
rapid epidemiological assessment (REA) 158-63
mortality data 135-6
broad types of 158-62
types, sources, and limitations of 134-5
family planning services study 162-3
using 133-5
measles immunization study 163
uses of 158
rapid ethnographic assessment 160 Ss
rates 15
calculating 15, 16 sample 74
comparison of 18-19 sample size 80-1
key 16 estimates yielded by increasing 125
REA see rapid epidemiological assessment formula for 275
recall bias 127 statistical significance, strength of association and 118
relative mortality rates 188-9 study design, precision of estimates and 124-5
relative risk 116-17 sampling 74-81
reliability (measurement) 90, 91 bias in 79-8, 127
improving and evaluating 92-3 EPI 206-8
of KAPB surveys 172, 173 example of strategy to eliminate bias in 81
report writing 98, 203 for disability studies 225-6
research 3-4 in community studies 164-5
applied 3 in KAPB surveys 172
basic (fundamental) 3 in occupational epidemiology 219
country-specific 5 in qualitative studies 180
global health 5 LQAS 208
public health 5 non-random (haphazard) 78-9
reasons for conducting 4 random 75-8
research diaries 179 techniques for REA 158-62
research hypothesis 62 sampling frame 75
research objectives, setting 56-63 for community study 164
examples of 62-3 sampling units 75
specific 61-2 for community study 164
research problem, identifying and analysing 56-8, 59, 60, 61 scales of measurement 86
research process 49 scatter plot 121
bias in 51-3 screening techniques 226-7
planning phase of 49-53 secondary prevention 14
researchers’ patterns of emotions during 53
selection bias 127-8
research project, probable stakeholders in 166
self-administered questionnaire 82-3
research protocol see study protocol
sensitivity analysis 270-1
research questions 64
sentinel surveillance 143-4, 161
developing 83-6
problems in 145
for health systems research 153-4
formulating 56-63 shadow pricing 268
in disability studies 226-7 significance level 114
in KAPB surveys 171 simple linear regression 120
methods of questioning 82-3 simple random sampling 75, 77
research report Simplified Oral Hygiene Index (OHI-S) 256
qualities of a good 98 smoking (KAPB survey regarding) 175
structure of 98 SMR see standardized mortality ratio
research team 49-50 snowballing (networking) 79
- for outbreak investigation 198 social dental indicators 259
researcher control 64-5 Social Learning Theory 169
researchers, emotional state of 53 social psychological theories 169
(see also fieldworkers; research team) socio-cultural context 174
resources 96 special surveillance programmes 144
for occupational epidemiology 222 problems in 145
respiratory disease specific rates 18
health survey 44 specificity 221
occupational study 222-3 specimens (use for other purposes) 32-3
Rifkin’s framework 38 spot map 199, 201
risk difference 117 standard deviation 111
risk factors 10-11 standard error 112-13, 124
294
index
T
table shells see dummy tables
295
ec gorge
aw no Tee Aa)
oe (> WAN Bemsur lapis marly
4? o , We ta atrapt Msriad
J =- i" "a
i£ cout b te ai plana} 7 M~ p< ane er gy
Lavine
7
» SPO ‘
Somehan me r
=
= “Wy 2 Rest eye as - a
eT! Wagir bes ey a *e
ees F Uf as ~eyaatle
(ine THVT: Sly : re «+ * SP ope, erreantsof ena, sam
ss hi gD
ages = ‘a ayigen: fea mLab A Bram govt So leap peidagitet Sarge
aiat “ derBx iy 7 Greeti ai
( ne ' ; 18 87to
; Inia wonay req iniaistebe
f >
tahonitneltig
Sptigi att 4
i tise fh) a ee ; ou lesen
+,
- FB yatveclie Retsil a me : :
Sémrthiesascller Al SO indy 9 eh “ograyeti «5
r + Papal 6 Pale: petals. aac.g obo knceu. dopenevehans
> ye ny ee
j : : ere b 4 7 wtp -. <4 - Mizo l esp cones bine
; ' a { nl DS vem vibettebray
oases
Wate ae Y en aon iso
ieee RR
ae (g}o apres
eee
SE
HMA OL $6 Wee Hei fe = / 2 7S! enotasrtenee
Moa gES ig Biles Eis” ane _ 5S wel
albayiitietealD
204. “yee chien > 7 » aR: ishige cmmseiee | |
+ ey | sg Bee der apbooisqunaant:
ta) yuu - a a a" Et qe temaiiqe sluelinyegal
fil: |ets4 Tuy . “alr 3 - GST aronjor damesintgecietam
e - ee eis? mi ~ nee widest bake levotonglate |
- ; ei dabtiny aan ae E SResisteliioora
= ae a See - 5-02 6 Bosteerer
Tee? et 2 PMT oh aetna whe
sed aoe nea dtp Seeing Aiea Si S eee ccm Crepes)ea
eo et ‘ ae «(eee oe meerer ae
ee eT ren scoreear gD
ou
tr ets ta a es Heine n
tywow
EB aot
Sens Lae neat eh
ea aeeeeeienaineeanedinnacent ee me
segannoemn
peor tae : a ;
: el + ae : } ¢
nt See gt eee J
ma aa Wearguacaale eile Si tm ae
6. LA ays @ ~@, %
ee ee | i E . a Sp
7 ar 4 sgoultng A =
ae Dba ial mr een VS y +
ee an, Se
: — 5 .]
rts pee oa Abe oe yy + - q
or eee) “ ? ‘ . ¥
rity Sri
i ee rn - rey:
re, ” 2G . 7 “i _
ah iatta avietog 145 7 Sy
yelp ie tsi fee, * ros
alte, Y Rog 1) he
= e
i Pe ‘ :
7 4 : er
4 «eee - ——— Tt awe i —_— -i
wu
Health care workers and medical students often find epidemiology
dauntingly abstract, even though it provides a vital background to
the work they do. This comprehensive manual provides an acces-
sible introduction to epidemiology and its application in primary
health care. The basic principles and tools are clearly explained and
applied to a wide range of specific topics — from psychiatric to
environmental epidemiology.
This clear and accessible book will be invaluable not only to medical
students and professionals, but also to all others interested in
epidemiology, or in current and future health trends in South Africa.
ISBN 0-19-571308-7
9 "780195 il