Professional Documents
Culture Documents
Collection and Presentation of Data
Collection and Presentation of Data
Collection and Presentation of Data
Presentation of Data
2.1 Primary data and secondary data
Statistics being a body of methods meant for the study of
numerical data, it is obvious that the first step in any statistical enquiry
must be the collection of the relevant numerical data. To study the
growth of steel production in India since 1947, it is necessary to obtain
the actual production figures for all years from 1947 to date. If our
aim is to study the efficacy of a given cure for bronchial asthma,
we must collect data on people suffering from bronchial asthma and
see how many got cured (or had a remarkable degree of relief), and
how many did not, after a course of treatment with the drug.
Now, the data may be of two broad types: primary and secondary
The ordinary user of economic and social statistics will find that the
data have been already collected by some other agency, government
or private; these may exist either in a published or in an unpublished
form. His job will then be simply to have access to the source and
data. Govern-
secondary
get hold of the data. Such data will be called
ment departments collect data on diverse topics that touch the life
of the people as a matter of routine and as an essential basis of
administration. Private agencies like banks and industrial concerns
15
FUNDAMENTALS OF STATISTICS
16
Fundamentals (I)--2
FUNDAMENTALS O F STATISTICS
18
are not all nterate or, even if literate have
informants
in case the educational level. For instance, if one is
attained the requisite item.
not
income and expenditure on different one
interested in
family
the head family
t cach and collect thne
arrange to interview
nav
from him. The data collected by this method are
information sought
to be more accurate,
Since a tacttul investigator may persads
likely
the informmant to required information and the meaning of
supply the
be properly explained to him so that the answer.
each question may
may be correct and to the point.
Whichever of the two methods may be used, the questions and
the accompanying instructions to enumerators and respondents have
to be very carefully designed. It 1s necessary that each question be
clearly phrased and capable of an unambiguous answer. The instructions
must take into account all possibilities, even remote ones. The way
a question is put may well influence the answer, as those who have
conducted public opinion polls will bear out. A device often to
advantage is to insert a question meant primarily to produce answers
to other questions. For instance, the relationship of the members of
the household to one another may be used to check the stated age
figures. It is for this reason that forms often include apparently
unnecessary or irrelevant questions. The report of a statistical enquiry
should include the layout of the form used.
In the method of direct observation, the enquirer or his assistants
get the data directly from the field of enquiry without having to depend
on the co-operation of informants. When data are needed on the height
and weight of, say, 200 college students, they will be approached
individually and the height (say in cm.) of each measured with a tape
and the weight (say in kg.) measured with weighing balance. If data
are needed on the sentence-length of a novel by, say, Bankimchandra,
each
the enquirer himself will go through the book and note for
therein. On
sentence the length, i.e. the number of words contained
the other hand, if data are required on the
incidence of blindness
the
will just observe each member of
among a group of people, one
he or she is or is not blind.
The direct method
group and note whether
of data collection may, therefore, involve
either measurement or
holds for each family. When statistics of rice production are given
three series of figures may be available, viz. a series for total area
sown (or harvested) in acres, one for total yield in quintals and one
for yield-rate in quintal per acre, the different figures in a series
corresponding, maybe, to different villages or districts. The compatibility
or otherwise of the three series may then be judged by verifying the
relation
total yield
yield rate = total area
Again, if figures for the price of a commodity during two different
periods are given, together with the percentage increase in price în
the later period over the earlier, the data may be checked by seeing
if the relation
is satisfied.
It should, however, be obvious that no hard and fast rules may
be laid down for the scrutiny of data. The enquirer must use his
common sense, judgment and whatever knowledge he may have about
the field of enquiry to assess the
reliability of the data. In this context,
Mahalanobis's paper [5] will be found illuminating.
It will be found, as the reader
makes progress with the study
of statistical
methods, that certain statistical tools may be used to check
accuracy of figures-not the raw data of statistics, but
from thein figures derivea
according to some statistical concepts.
COLLECTION AND PRESENTATION OF DATA 21
The
description of the various columns is the caption of the table.
the
caption may include a mention of the units of measurement for
data of each column and also column numbers, like (1), (2), etc., that
may be quoted in any future reference. The title, stub and caption,
taken together, are said to form the box head of the table.
Civ) Body The body is the principal part of the table, where
all the relevant figures are exhibited.
TABLE 2.1
LAND UTILISATION PATTERN IN INDIA, 1987-88
for
Source Statistics Relating to Indian Economy, August, 1992, Centre
Monitoring Indian Economy.
COLLECTION AND PRESENTATION OF DATA- 25