Professional Documents
Culture Documents
Chandrasekar and Deming - 1949 - On A Method of Estimating Birth and Death Rates and The Extent of Registration
Chandrasekar and Deming - 1949 - On A Method of Estimating Birth and Death Rates and The Extent of Registration
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at .
http://www.jstor.org/page/info/about/policies/terms.jsp
.
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of
content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms
of scholarship. For more information about JSTOR, please contact support@jstor.org.
American Statistical Association is collaborating with JSTOR to digitize, preserve and extend access to Journal
of the American Statistical Association.
http://www.jstor.org
tionedearlier,
theomissionoftheunclassified entriesfromthecalcula-
tionsdoesnotaffecttheestimation ofN, thetotalnumberofevents.
The estimateof the extentof registration will be too low if the un-
classified
entriescontain,as is likely,correctentriesclassifiable
as C.
If the unclassified
entriesare all countedas correct,contrary to fact,
the calculations
willlead to an overestimate oftheextentofregistra-
tion.
Effectof incompleteness
of coverageof the population. As in every
populationenquiry,therewill be somefailuresto elicitinformation
fromall the households.This willhappenwhensomehouseholdsin
whichan event took place have moved away temporarily or per-
manently, orwhenno responsiblepersoncan be foundat hometo give
theinformation.So longas theeventsin theuninterviewed portionof
householdsare includedin the R-listto the same extentas thosein
the interviewedhouseholds,the estimationof N is unaffected. The
calculationofN maytherefore be littleaffected
by incompleteness of
coverageofthepopulation.
The effectof institutionalevents.In rural areas the bulk of the births
are domiciliary, but thereare somesmallscatteredhospitalsdrawing
patientsfroma widearea,and a highproportion oftheeventsthattake
place in themare fornon-residents. The R-listmay containsomeor
even all of the entriesfor these institutional events because the
registraris ableto ascertain
thisinformation easilyandaccuratelyfrom
the institutions.The interviewer,on the otherhand,will,by the na-
tureofa house-to-house canvass,failto discoveran institutional
event
concerning peoplewho had no familyconnections in the area. In-
stitutionalevents,as they are accuratelyascertainable,are best
handledas a separateblockand notas a problemofestimation.
The effectof correlationbetweeneventsmissed on bothlists. The first
step is to definethis correlation. The registrar and his co-workers
willdetectsomeeventsand missothers.The probability thatthe in-
terviewer [I] willdetectan eventthatwas missedby R maybe differ-
entfromtheprobability thathe willdetectan eventthatwas recorded
byR. Ifthesetwoprobabilities areequalthereis complete
independence,
but otherwise thereis not,in whichcase theformulagivenabovefor
the estimationof the total numberof eventswill be incorrect. The
extentoftheerrorcanbe investigated. If as before,
pi= theprobability oftheregistrardetecting an event
q =the probability oftheregistrarmissingit
Group Probability
betweentheeventsmissedbyboth
independence
Ifthereis complete
R and I, then P21=P22=P2, introduced previously,and q21 = 22= q2.
Whenthereis dependencethe expectedvalue of the estimateof the
numberofeventsY missedbybothR and I willbe closeto
Nplq2lqlp22
PlP21
is
The difference
whereasthecorrectvalueis Nqiq22.
Nplq2lqlp22 (P22
-Nqlq22 = Nql -1j.
PIP21 P21 /
estimateofthetotalnumberofeventsmadeundertheassumption
of
zero correlation.
Methodto reducetheeffect ofcorrelation.
It is important to notethat
correlation signifies
heterogeneityin thepopulationforit impliesthat
eventsthat fail to be detecteddo not forma randomsamplefrom
the wholepopulationof events.This heterogeneity may ariseonlyif
thereare differencesin thereporting ratesfordifferent
segments ofthe
population, in thegroupoffailures
resulting beingweighted dispropor-
tionatelyby thedifferent segments.
It therefore thatthecorrelation
follows canbe minimized bydividing
the populationinto homogeneous groupsand calculatingthe total
numberofeventsseparately foreach group;thenby additiongetting
the grandtotal. In orderto put thissuggestion intopractice,let us
considerthe difference betweentwo estimatesofthe totalnumberof
events:i. by dividingthe populationinto homogeneous groupsand
estimating theeventsin each groupseparately, thenforming a grand
total;ii. bytreatingtheentirepopulation as a unit.Let thepopulation
be comprisedof k homogeneous groups,withN, eventsin the i-th
group (i= 1, 2, * * *, k). Then let p1(i)be the probabilityof the regis-
trardetecting an eventin the i-thgroup,and p2(i) the corresponding
probability
forthe interviewer. The expectedvalue ofthe numberof
eventsmissedby bothin thei-thgroupis Niql0(ti and fortheentire
populationthetotalmissedby bothwillbe 2N,qj(i)q2(i). As by defini-
tionthereare onlyk homogeneous groups,thisvaluewillbe estimated
withoutbias whenthegroupsare treatedseparately.But iftheentire
populationofeventswerepooled,theexpectedvaluefortheestimate
ofthenumberofeventsmissedby bothwouldbe closeto
[ENip, (i)q2'(i][ Niql (')P2 (i)]
E2Nip,(i)P2(i)
in thetwovalueswillbe
The difference
[E Nip,(i)q2(i)] [ZENiqi(i)p2(i)] -
N2SlS2r
SNipi(i)P2(i) ,. - -Nqinq<)-S'p,(i)P2(i)-
where
A[pi (i) - pi1j2
S12 = -Ni -
Ni
ENi[P2( i)- P2]2
S2 2=
N= EZNs Pi N2 N -
and
S12 Ni
E [pIi(i) -1] [p2(i) -P2
r=~~ =
SAs2 Ss2 EN
0) co 00)
-~~~~~~~~~~~~~~~~~~~~~~~t
' _O~~~~4
$ bN u04 l
N 0
C~~~~~
00
t 0)t E8
H _ _ _ _ _ _
z O0 b -p '0 04 04 '
H'-4
0_ O 00 0
0 __ _ _ _ _ _ _ _ _ __ . _ _ _
Bo~~~~~~~~
0
00
z~~~~~'
Z ~~~~~~~~~~~~~~~~~~~~~~~~~~~C
4.0 "-' ~~~~~~~~~~~~~~~
~ ~ I C
49 *~ ~ '-4 0 0 4- 0
TABLE II
BIRTH AND DEATH RATES IN 1945 AND 1946, SINGUR HEALTH CENTRE
1945 1946
R Standard Standard
Rate err
error Rate ero
error
TABLE III
PERCENTAGE OF BIRTH AND DEATH REGISTRATION DURING 1945 AND 1946
Onecomment maybemadeinregardtothebirthratefor1946,which
appearsto be veryhigh.Possibleexplanationmay be the improved
economicsituationafterthe famineof 1943,and demobilization. An-
otherpossibleexplanation is failureoftheinvestigatorto establishthe
identity ofentriesin theR and I lists,butifthiswereso, it shouldbe
moreapparentfor1945,whichit is not,as thebirthratesfor1945are
muchlower.An improbable explanationis thateach UnionBoard is
composedof extremelyheterogeneous sectionsdisplayingnegative
correlation betweenthe probabilities of detectionof eventsby the
Registrar andtheinterviewers.
Another comment shouldbe made.The completeness ofregistration,
recorded in Table III, is based on thenumberofcorrect entriesin the
R-listjudged againstthe estimatedtotal numberof events.Official
publishedratesin all countries arebased on thetotalnumberofregis-
trars'entries, correctplusincorrect, and theusual practiceofinflating
official
ratesto correct forincompleteness ofregistration
yieldsspurious
results:theratesare alreadypartlyinflated owingto incorrect entries.
Properinflation (correction ofrates)is possibleonlyby comparing the
registration listswiththe resultsof a populationsurveyand making
estimatesofthetotalnumberofeventsand theproportion ofincorrect
entriesin the registration lists.
The precisionof estimatednumberof events.From the fact that the
coefficientof variationof a total estimatednumberofevents is
it willbe seenthatthelowertheefficiency
V/q1q2/N.p1p2, ofdetection of
an eventon eitherthe I or R-lists(pi or p2), thegreaterthestandard
errorofthetotal.In thisenquiry, in spiteofthefactthatlocal people
werehiredand trainedespeciallyforthiswork,the efficiency of the
interviewingwas not ofhighorder:only67.2%0ofthe birthsin 1946
and 52.8% in 1946weredetectedbytheinterviewers. The correspond-
ingpercentages fordeathswere50.7 and 32.3. Methodsofimproving
the performance of the interviewers mustbe sought,and it appears
thattheintervaloftimeto be coveredby thesurveymustnotextend
toofarback.
It is highlyimportant to bearin mindthatregardless ofthe inter-
viewers'performance, the methodproposedhereforestimating the
totalnumberof,N, eventsis notsubjectto bias,3butpoorperformance
doesincreasetheerroroftheestimateofN. It also increases thestand-
arderroroftheestimateoftheregistrars' performance.
The coefficientofvariationis also influenced by N. It is important
to notethatN in theformula refersto anytotal-not just a totalover
a In maldngthisstatementthe case of p2 (or pi) =0 is consideredtrivialand is excluded.
V fq N-C-N2
(C+N2)pl N-1
valueforthestandarderrorofR.
Anapproximate
(C + N1)(C + N2)
N =
C
4 It is presumedin this statementthat the physicalfacilitiesforsampling(maps,lists,personnel,
payment,etc.) are about the samneover all partsof the area to be covered.
5 As a matterof fact,the surveysreportedhave providedmostof these additionalitems,and the
cost mentionedincludesthem.
Underthesameconditions,
thevarianceV(X) ofA is
whichbytheapplication
oftheformula
givenabovereducesto
Nqlq2
P1P2
The standarderrorofN is therefore
aprxia/Nqel2
V P1P2
approximately.