Stratified Cluster Sampling: BMJ (Online) November 2013

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 3

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/259685017

Stratified cluster sampling

Article  in  BMJ (online) · November 2013


DOI: 10.1136/bmj.f7016

CITATIONS READS

7 649

1 author:

Philip Sedgwick
St George's, University of London
391 PUBLICATIONS   4,097 CITATIONS   

SEE PROFILE

Some of the authors of this publication are also working on these related projects:

BMJ Statistics Endgames View project

All content following this page was uploaded by Philip Sedgwick on 15 March 2016.

The user has requested enhancement of the downloaded file.


BMJ 2013;347:f7016 doi: 10.1136/bmj.f7016 (Published 22 November 2013) Page 1 of 2

Endgames

ENDGAMES

STATISTICAL QUESTION

Stratified cluster sampling


Philip Sedgwick reader in medical statistics and medical education
Centre for Medical and Healthcare Education, St George’s, University of London, London, UK

Researchers investigated the suitability of a newly developed therefore imperative that any sample was representative of the
famine scale as an international definition of famine to guide population of Niger. Simple random sampling across the country
humanitarian response, funding, and accountability.1 The scale could have been used to recruit households. However, simple
had been proposed by Howe and Devereux, and it defined random sampling would have produced a representative sample
famine on the basis of intensity and magnitude.2 The scale was only if enough households were recruited. The population of
applied retrospectively to the humanitarian crisis during 2005 Niger was geographically diverse. Therefore, random sampling
in Niger, west Africa, to determine whether famine had occurred. of households across the country would have been impractical
A cross sectional study design was used. Households were and too expensive. A stratified two stage cluster sampling
recruited using a stratified two stage cluster sampling method. approach was therefore used to ensure the resulting sample was
Niger was stratified into its eight regions. Within each region, representative of the country, while concentrating resources in
26 villages were randomly selected, with the probability of fewer areas (a is true).
selection proportional to the size of the village. Within each The stratified cluster sampling approach incorporated a
village, 20 households were systematically randomly selected. combination of stratified and cluster sampling methods. Firstly,
A census of the entire household was undertaken by Niger was stratified by region. The country consists of eight
administering a questionnaire to the head of each selected regions—seven rural ones plus the capital, Niamey. Within each
household. region a simple random sample could have been taken to ensure
The researchers concluded that on the basis of the famine scale that each region was adequately represented. However, the
developed by Howe and Devereux, most regions in Niger in population of each region was geographically diverse. Therefore,
2005 experienced food crisis conditions, and some areas simple random sampling within each region or stratum would
approached famine. Furthermore, it was suggested that the scale have been impractical and expensive. To concentrate resources
afforded more objective criteria than did previous approaches in fewer places, a two stage cluster sampling process was
while providing early warning systems that might help guide performed within each stratum.
the level of response in future situations. A cluster is a natural grouping of people—for example, towns,
Which of the following statements, if any, are true? villages, schools, streets, and households. The sampling of
a) Cluster sampling meant that resources could be clusters in the above study was a two stage process. The first
concentrated in a limited number of areas of the country stage of cluster sampling involved a random sample of 26
villages within each stratum or region. The probability of
b) The stratified two stage cluster sampling approach
selection was proportional to the population size of the
constituted a multistage sampling method with three stages
region—that is, larger villages had a greater probability of being
c) Systematic random sampling of households in each village selected than smaller ones. Within each chosen village, a fixed
required the construction of a sampling frame number of 20 households were selected using systematic random
Answers sampling. The household was the unit of analysis, with a census
of each household achieved through a questionnaire.
Statement a is true, whereas b and c are false. The two stage cluster sampling process described above is
The aim of the study was to assess whether the famine scale referred to as a multistage cluster sampling approach, or simply
proposed by Howe and Devereux provided a suitable definition multistage sampling. In multistage sampling, the resulting
of famine to guide future humanitarian response, funding, and sample is obtained in two or more stages, with the nested or
accountability. A cross sectional study design was applied. hierarchical structure of the members within the population
The scale and severity of the humanitarian crisis in Niger in being taken into account. Population members are arranged in
2005 would probably have varied across the country. It was clusters. The method is based on the random sampling of clusters

p.sedgwick@sgul.ac.uk

For personal use only: See rights and reprints http://www.bmj.com/permissions Subscribe: http://www.bmj.com/subscribe
BMJ 2013;347:f7016 doi: 10.1136/bmj.f7016 (Published 22 November 2013) Page 2 of 2

ENDGAMES

at each stage, with the sampled clusters nested within the clusters were chosen at random. It was possible to count the number of
sampled at the previous stage. In the example above, a two stage households in each chosen village. However, the number of
multistage sampling approach was used. The first stage involved households in each of the clusters that were not selected was
random sampling of 26 villages within each region. The second not known. Hence the probability of selection of a household
stage involved the systematic random sampling of 20 households in the population could not be determined. Samples resulting
in each chosen village. The division of the country into regions from non-probability sampling methods are generally considered
was seen as stratification and not the first stage of a multistage not to be representative of the population. However, there is no
sampling process (b is false). This is because all regions in the reason to think that the sample in the study above was not
country were included and no random sampling of the regions representative of the population—the sampling approach ensured
took place. that the resulting sample was representative of each region.
The cluster sampling of villages within each stratum involved If a sampling approach involves only a single stage of sampling
the construction of the sampling frame—that is, a list of all of clusters it is referred to as cluster sampling. A random sample
villages within each region. However, presumably it was not of clusters from the population is obtained and all members of
possible to list all the households in each chosen village. the selected clusters are included in the resulting sample. After
Therefore, households in a village were selected using the selection of clusters, no further sampling takes place. Cluster
systematic random sampling, which does not depend on a sampling is often used to select participants for a trial—so
sampling frame (c is false). This involved selecting a single called, cluster trials. Cluster trials have been described in a
household in a village at random, with households then chosen previous question.3 However, in such trials the clusters are
at regular intervals thereafter—for example, every fifth typically not selected at random from the population but by
household. Systematic sampling is typically considered to be a using convenience sampling—that is, by selecting conveniently
random sampling method, as long as the starting point is random located clusters. Convenience sampling has been described in
and the periodic interval of selecting participants is determined a previous question.4
before sampling takes place.
There are two types of sampling methods—probability sampling Competing interests: None declared.
(also known as random sampling) and non-probability sampling
(also known as non-random sampling). By definition, probability 1 Reza A, Tomczyk B, Aguayo VM, Zagré NM, Goumbi K, Blanton C, et al. Retrospective
determination of whether famine existed in Niger, 2005: two stage cluster survey. BMJ
sampling methods involve some form of random selection of 2008;337:a1622.
the population members, with each population member having 2 Howe P, Devereux S. Famine intensity and magnitude scales: a proposal for an
instrumental definition of famine. Disasters 2004;28:353-72.
a known and typically equal probability of being selected. For 3 Sedgwick P. Cluster randomised controlled trials. BMJ 2012;345:e4654.
a non-probability sampling method, the probability of selection 4 Sedgwick P. Convenience sampling. BMJ 2013;347:f6304.

for each population member is not known. Although it is


debatable, the method of stratified cluster sampling used above Cite this as: BMJ 2013;347:f7016
is probably best described as a non-probability sampling method. © BMJ Publishing Group Ltd 2013
The villages in each region, and the households in each village,

For personal use only: See rights and reprints http://www.bmj.com/permissions Subscribe: http://www.bmj.com/subscribe
View publication stats

You might also like