Professional Documents
Culture Documents
01-Statistics - What and Why
01-Statistics - What and Why
1 W ha t an d Why
♦ INTROD UCTION
• ♦ ORJCIN AND CROW'Tri OF STATISTICS
♦ STATISTICS DEF'lNED
• STA·n sncs. SCIENCE OR ART
♦ FUNC:I ION~ OF S"fA11STICS
♦ APPU CA'l70N S Or ~7'A11S'nCS
♦ LIM11A'1 10NS Di' !)tATIST ICS
• l)I S'n-<llST (W ~iATIS 'nc"
♦ STA'l1S1 IC.Al. M~ll lOIJ~ V, l-..Xl1 l:.l{IMt-:N'Ti-\ l Mt-:'n IODS
♦ ~,A1,snc ~ A1'lJ < <,-.u·t n-.to
STATISTICAL METBoDs
2
INTRODUCTION
The word .statis ti~s. conv eys a van.e ty of mea ning s hto peo ple. To son-.
. fi of mat hem atic s, w erea s t o others .,,e it
'stat istic s' is an imp osm g J~g ure s whi
ch one com mon ly fmds in
sugg ests tabl es, cha rts ban k vari o~s repo rts. spee che s, classroom
newspapers. joum als , h doo s~e are exp
osed to a wid e asso rtm ent of
lect ures . 'IV. etc. Eac whi :6 ofte n has a
prof oun d imp act on our lives .
num eric al info rma tion ss stat eme nts like 'The re .a re 932 females
For example, wel cdiome a~;~eas in the Ru~ sia per
1 000 mal es in n a w ther e are 1, 1 70 fem ales per
1'000 males": "On an aver age for 400 stu d t th · nl b
wrttten, prin ted and pub lish ed in Indi a everen s .. er.:y~s ope y one ·ta oak
y y_ear : e r cap1 net
national prod uct has gon e up from Rs. 769th
in 1998-99"· 'The Public Sec tor Out lay of 8.2 N11;1 thl99F~-9 4yto Rps . 14_682R.3
e m 1ve- ear 1an 1s s.
8 59 200 cr~re whi ch is mor e than the com
bine d outl ays of all the other
Five:Year Plan s take n toge ther": 'Th e den sity
~f
from 216 in 1981 to 267 pers ons in 1991 ~?p ulat ion . has gone up
: Mum bai 1s the most
populated city of the cou ntry with urb an
pop ul~t ion of 12.5 7 milli~n.
Calc utta taking the seco nd plac e hav ing 10.9
2 million. Del hi r~nk~ ~hird
with a pop ulat ion of 8.38 million followed
by Che nna i 5 .36 milh on : The
num ber of univ ersi ties has witn esse d a con
side rabl e incr ease from 110 to
146 betw een 1981 and 1991 ".
Num bers play an esse ntia l role in stat istic
mat eria l of stat istic s. The se mat eria ls mus s. The y prov ide the raw
t be proc esse d to be useful.
just as crud e oil mus t be refined into petr ol
before it can be used by an
automobile engine. The stud y o, stat istic s
involves met hod s of refining
num eric al (and non -num eric al) info rma tion
into
The se stat eme nts cont ain figures and as suchusef ul form s .
stat eme nts of facts. They are highly conv they are call ed numerical
enie nt form s of com mun icat ion
and are, at the sam e time. quit e clear,
prec ise and mea ning ful. An
anal ysis of such stat eme nts can help in fram
people stati stics generally refers to info rma ing suit able policies. To most
proc ess whe ther it be prod ucti on, pop ulat ion. tion abo ut an activity or a
nati ona l inco me, etc .. that is
expr esse d in num bers . Num bers repr esen
commodities prod uced and sold. pric es of prodt qua ntiti es and value of
liabilities, raw materials, cust ome rs, inco mes ucts , inve ntor ies, asse ts and
birth s, deat hs, num ber of pass eng ers travelled and exp ense s. Records of
ship and othe r modes from one plac e to duri ng a yea r by road , rail.
ano ther , qua ntit y and valu e of
imp orts and expo rts of different commod 1
coun tries . num ber of stud ents in vari ous cou ities to and from different
provident fund, leave, cost s and profit, indurses in a univ ersi ty, reco rds of
production, etc., also lead to num eric al expr stria l prod ucti on, agri cult ural
essi ons. Whe neve r num bers are
collected and compiled, rega rdle ss of wha
t they repr esen t. they become
stat istic s. In othe r words, the term stati stics
with ways and mea ns of pres enti ng and han is con side red syno nym ous
logically and drawing relevant conc lusi ons. dlin g data , mak ing inferences
!n addi tio~ to mea ning data , 'stat istic s·
·ma them atic s refers to a subj ect as well also refers to a subj ect, just as
as to symbols, form ulae and
theo rem s and ·accounting' refers to principl
es and met hod s as . weli as to
acco unts . bala nce shee ts and income stat eme
is a bod y of met hod s of obtaining and anal nts. In this sen se stat istic s
deci sion s on them . It is a bran ch of scientifi ysin g data in orde r to base
c met hod used in deal ing with
STATISTICS-WHAT AND WHY 3
• Data are collection of any number of related observations. We can collect the
number of telephones installed in a given day by several workers or the
number of telephones installed per day over a period of several days by one
1 worker and call the results our data. A collection of data is called a data set
and a single observation a data point. -Levin
j
J
STATISTICAL METHoos
4
tence of life on earth, though its use was very much limite~. In the good
old days statistics was regarded as the 'Science of Statecraft and was the
by-product of the administrative activity of the State. It has been the tradi-
tional function of the governments to keep records of population, births.
deaths. taxes, crop yields and many other types of activitie~. Counting and
measuring these events may generate many kinds of numerical data.
Though. in its present usage, · the word "statistics" is barely a century
old, it has been in use for a much longer period. Census of population
and wealth were taken even in the ancient times. According to a Greek
historian, in 1400 B.C. a census of all the lands in Egypt was taken.
Similar reports on the ancient Chinese. Greeks and Romans are also
available. People and land •were the earliest objects of statistical enquiry.
The word 'statistics' comes from - the Italian word 'statista' (meaning
"Statesman") or the German word 'statistik' which means a Political
State. It was first used by Professor Gottfried Achenwall (1719-1772), a
professor in Marlborough in 1749 to refer to the subject-matter as a
whole. Achenwall defined statistics as ''the political science of the several
countries". The word ·statistics' appeared for the first time in the famous
book, Elements of Universal Erudition by Baran J .F. Von Bielfeld.
~ansl~ted ~y W: ~o~per M.D. (3 Vols., London, 1770). One of its chapter
1s. entitled Stati:5tics and contains a definition of the subject as "the
science that teaches us what is the political arrangement of all the
modern States of the known world".
The science of statistics is said to have originated from two main
sources:
• Government Records. and
• Mathematics.
Government Records
.
Exten sive resea rch work is now ~emg
than was done a centu ry ago. Smee s a
1r
derta ken by many more perso ns
tistlcs is a tool of resea rch, the
dema nd for statis tics has greatl y increa sed.
.
Decreasing Cost of Statistics The time anfd t cost
ti
of collec ting data are
ti·cs Howe ver with the
, · the use o s a s
very impo rtant limiting 1.actors m · '
ulato rs · comp uters etc
devel opme nt of electr onic mach ines, ~uch as ca1c do~ . This
the cost of analy sing data has consi derab ly gone has ied t~
the incre asing use of statis tics in solving vari?u ~ alpro
Moreover with the devel opme nt of statis tic
· ' •
th::
bl
&J
the cost of
•
collecting and proce ssmg data has gone d Own • For exam ple, h
consi derab le
bl
advan ce has been made in the samp ling techn iques whic
ena e us to
know the chara cteris tics of the popul ation by stu~y ing only
a_ part of it.
Since 1935, a branc h of statis tics, know n as design of expen
made rapid . progr ess and it is now possib le to collec t
ment s, has
and analy se
statis tics more promp tly and economically.
Of great intere st, even to the non-s pecia list in statis tics, is
the fact
that much of the basic progr ess in statis tical the<?IY of the
past few
decad es is attrib uted directly to a single individual, Sir Rona
ld Fishe r
(1890 -1962 ). Altho ugh many schol ars have contr ibuted
to the scien ce of
statis tics, Fishe r must be credit ed with at least half of the essen
tial and
impo rtant devel opme nts in the theoiy as it now stand s. The
outsta nding
contr ibutio n of Fishe r has put the subje ct of statis tics on
a very firm
footing and earne d for it the status of full-fledged scienc e.
However, an impre ssion shoul d not be formed that the
theor y of
statis tics is complete and final. In spite of the devel opme nts,
unsol ved statis tical problems is long and statis tical resea rchthe list of
more vigorous than ever before. today is
STATISTICS DEFINED
There have been many definitions of the term 'Stati stics' -inde ed
article s have carefully collected together hundr eds of defini tions.schol arly
have defined Statis tics as statistical data (plural sense) where as Some
statis tical metho ds (singular sens·e). A few definitions are other s as
exam ined below. analy tically \
·
Stati stical Data
(i) Statistics are Aggregates of Facts. Single and isolated figures are not
statistics for the simple reason that such figures are unrelated and cannot
be compared. To illustrate, if it is stated that the income of Mr. X is
Rs. 90,000 per annum, this would not constitute statistics although it is a
numerical statement of fact. Similarly. a single figure relating to production,
sale, birth, employment, purchase, accident, etc., cannot be regarded
statistics although aggregates of such figures would be statistics because of
their comparability and relationship as parts of a common phenomenon.
) narrow because it covers only one aspect of tcience, namely, the collection
of data. Other aspects like analysis. presentation. interpretation. etc .. are
completely ignored. ·:
-:'1-t another place ..Bowley says. "Statistics may rightly be called · the
science of averages. This defmition also is not satisfactory because
ave:ages. are ?nly ~ne of the devices used in statistical analysis. The other
devices hke dispersion. skewness, correlation. etc. , are not at all covered by
this defmition.
Still another definition given by' the same author is "Statistics is the
science of the measurements of social organism, regarded as a · whole in
all its manifestations." Thfs definition again is inadequate because it
confines the application of statistics only to sociology, i.e., man and his
I activities. Bowley himself realised this when he remarked. "Statistics
cannot be confined to any one science."
Boddington defines Statistics as "the science of estimates and
probabilities." This definition is also incomplete because estimates and
probabilities are only a part of statistical methods.
According to Berenson and Levin, "The science of statistics can b e
viewed as the application of the scientific method in the analysis of
numerical data for the purpose of making rational decisions."
Croxton and Cowden have given a very simple and concise definition of
Statistics. In their View "Statistics may be defined as the collection.
presentation, analysis and interpretation of numerical data. " This definition
clearly points out four stages in a statistical investigation, namely: (i)
collection of data, (ii) presentation of data, (iii) analysis of data. and (iv)
interpretation of data.
However. to the above stages one more stage may be added and that
is the organisation of data. Thus, Statistics may be defined as the science
of collection, organisation. presentation, analysis and interpretation of
numerical data.
of efforts.
.. . tion. Data collected from published sources a re genera lly
. (ri) Or~ants~ However. a large mass of figures that are collected
in organised orm. tl needs organisation. The first step in organising
from a surv~y f~equJ~rig The collected data must b e edited very carefully
a group of data i~ e_ · inconsistencies, irrelevant answers a nd wrong
s o that the omisswn~~rns from a survey may be corrected or a djusted.
computations in the re
10 STATISTICAL METHO DS
After the data have been edited the next step is to classify th em.
The
purpo se of classif ication is to arrang e the data accord ing to some comro
charac teristi cs posses sed by the items consti tuting the data. The ~
step in organi sation is tabula tion. The as
purpos
th e data in colum ns and rows so that theree of tabula tion is ~o arra~g :
is absolu te clarity in
data presen ted.
(iii) Presen tation. After the data have been collected and
organised ~ey
are ready for presentation. Data presented in an orderly manne r facilita
statistical analysis. There are two different modes in te
which the collected data
may be presented:
(a) Diagra ms. and
(b) Graph s.
. (iv) Analys is After collection, organisation and presen tation the
next
1s that of analysis. The purpose of analysing data ls to dig out inform step
ation
useful for decision-making. A major part of this text is devoted to
the
metho ds used in analysing the presented data mostly in a tabula r
form.
Metho ds used in analysing the presen ted data are numer ous. rangin g
from
simple observation of the data to complicated, sophisticated and highly
mathe matica l techniques. However, in this text only the most commo
nly
used metho ds of statistical analysis are included, such as measu res
of
centra l tendency. measu res of variation, correlation, regression. etc.
(v) Interpr etation . The last stage in statisti cal investi
gation is
interp retatio n, i.e., drawin g conclu sions from the data collected
analys ed. The interp retatio n of data is a difficult task and necess itatesand
a
high degree of skill and experience. If the data that have been analys
ed
are not proper ly interpr eted, the whole object of the investigation may
be
defeat ed and fallacious cpnclu sion be drawn. Correc t interp retatio n
will
lead to a valid conclu sion of the study and thus can aid one in taking
suitab le decisio ns.
Since statisti cal metho ds help in taking decisions. statisti cs may lightly
be regard ed as "a body of methods for making wise decisions in the face
of
uricert ainty. *" A modified form of this definition is given by Prof.
Ya-Luri-Cho u in whose words, "Statistics is a method of decision-making
in
the face of uricertainty on the basis of numerical data and calculated Iisks".
It may be noted that over the past few decade s the prima iy empha
sis
on statist ics has been to develop proced ures that can be used to
with uncert ainty. After all, most decisions, regard ing resour ce allocatdeal
ion
in profit and non-pr ofit organi sation s are made under uncert
ainty
becau se the decisio n-mak er never knows exactly what is going to happe
n.
This moder n concep tion of the subjec t is a far cxy from the one usuall
y
held by layme n. Indeed even the pionee rs in statisti cal resear ch
have
adopte d it only within the past two decade s.
FUNCTIONS OF STATISTICS
was 69.1 million tonnes which is 9.5 per cent higher ~hat: in 19? 7-9 8"-
lt conveys a definite information. Similarly, statements hke 'There is a lot
of unemployment in India"; ''The population of India is growing at a very
fast rate" and ''The prices of various commodities are rising", etc. , ha~dly
convey any worthwhile information as they do not specify the numencal
dimensions involved.
Comparison Unless figures are compared with the others of the same
kind, the.y are often devoid of any meaning. For example, examine tbe
statement that "the. production of rice in _1999-2000 is likely to be more
as compared to 1998-99.". The reader would not get a clear picture frQIIl
the statement. But if we make a statement that the production of rice in
1999-2000 is likely to be 87.5 million tonnes as compared to 86.0 million
tonnes. in 1998-99, it conveys· a definite meaning.
'
anomic, sOcial or
political issue in the absence of adequate,
nd timely data it would be
difficult, rather impossiKle, to frame accurate
nolicies. AISo It suitable
may be pointed out that it is not only in the formulation
of appropriate policies but also in
seeing that they
implemented that statistics play a vital role. are properly
Robert W.
Burgess
has beautifully summed
up the functions of
Statistics as "The
fundamental gospel of statistics is to push back
domain of 1gnorance, rule of thumb, arbitrary or the
premature aeCIsio
traditions and dogmatism and to increase the domain in which decisiors
are made and principles are formulated on the basis of analysea
quantitative facts."
APPLICATIONS OF STATISTICS
The applications of statistics are so vast and ever-expanding that not
only is it difficult to define it but is also unwise to do so. Statistics
pervades all subject matter-its use has permeated almost every facet of
our lives. It is a tool of all sciences indispensable to search an intelligent
judgment and has become a recognised discipline in its own right. There
is hardly any field whether it be trade. industry or commerce, economics,
biology. botany, astronomy, physics, chemistry, education, medicine
sociology. psychology, or meteorology where statistical tools are not
applicable. In fact. greatest victory of mankind of the 20th century. that
of landing of Apollo II on the moon, would not have been a success in
of statistics are so
the absence of statistical help. The applications
numerous that it is often remarked "Statistics is what statisticians do."
It may be pointed out that in the last three decades there has been a
methods. This has been
virtual explosion in the use of statistical
particularly true in recent years with the advent and accessibility of high
speed digital computers which have the capacity to process large
amounts of information.
Let us examine in detail a few fields in which statistics is applied.
Since ancient times the ruling kings and chiefs have relied heavily on
statistics in framing suitable military and fiscal policies. Most of the
statistics such as that of crimes, military strength, population, taxes, etc..
that were collected by them were a by-product of administrative activity. In
recent years the functions of the State have increased tremendously. The
concept of a State has changed from that of simply maintaining law and
order to that of welfare State. Statistical data and statistical methods are
a
biggest mOSt
data is of immense help collector
to many
and user of statistical
data. Such
Who lurther process it and institutions and research scholars
arrive at useful
decision-making. conclusions which help
Statistics and Business
With the
growing size and
ever-increasing competition, the problems o
business enterprises are
becoming
more statistics in decision-making. complex and they are using more and
methods in the solution of business problems However, the employment of
statüstical
20th century. In
earlier days when business belongs almost exclusívely to the
firms were firms were small. owners of the
dirertly engaged in almost all the areas of business
OWner of a small firm then
might act as the stores
activity. An
salesman. purchaser. etc. It was manager, accountant,
with the customers possible for him to make
and know exactly what personal contacts
growth in the size of business they wanted from him. With the
Owners firms it has often become
to maintain
personal contact with the thousands impossible for the
customers. Management has and lakhs of
become a specialised
upon to plan, organise, job and a manager is called
house. Since very little supervise and control the operations of the
perso'ial contact is possible with business
a moderm
business firm faces a much customers these days,
future operations than it greater degree of uncertainty
business was small. concerning
did when the size of
most of the
production
unless a very careful
these days is in
anticipation of demand and,
Moreover,
to make study of the market is made, the' firm therefore,
profits. Thus, a businessman who has may not be able
uncertainty can no longer adopt the method ofto deal in an atmosphere of
decisions. If he is to be trial and erTor in
deal successful in his
decision-making, taking
systematically with the
uncertainty itself
he must be able
by careful evaluation and
to
application ofstatistical
indeed runs on estiinatesmethods, concerning the business activities. Business
and probabilities. The
of a businessmnan's higher the degree of
estimates,
business. In recent years it has the greater is the success attendingaccuracy
become increasingly evident that on his
statistical methods have
valuable tools for provided the businessman with one statistics and
of his most
decision-making.
Business activities can
broadly be grouped under the
Production, following heads:
Sale
Purchase.
Finance,
Personnel
Accounting.
Market and Product Research,
and
Quality Control.
STATISTICS-WHAT AND WHY 15
In the year 1890 Prof. Alfred Marshall, the renowned economist, observed
that "Statistics are the straw out of which I, like every other economist,
have to make bricks." This proves the significance of statistics in
economics. Economics is concerned with the production and distribution
of wealth as well as with the complex institutional set-up connected with
the consumption, saving and investment of income. Statistical data and
statistical methods are of immense help in the proper understanding of
the economic problems and in the formnulation of economic policies. In
fact, these are the tools and appliances of an economist's laboratory. For
example, what to produce. how to produce and for whom to
produce-these are the questions that need a lot of statistical data in the
absence of which it is not possible to arrive at correct decisions.
Statistics of production help in adjusting the supply to demand. Statistics
16 STATISTICAL METHODS
LIMITATIONS OF STATISTICS
* For an excellent account of how statistics can be used and misused please
refer to Reichmern, W.J. : The Use and Abuse of Statistics, Oxford University
Press. London.
20 STATISTICAL METHODS
igures are convincing and, therefore, people are easily led to be-
lieve them.
They can be manipulated in such a manner as to establish fore-
gone conclusions.
(ii) Even if correct figures are used they may be presented in such a
manner that the reader is misled. For example, note the following
statement: ""The profits of firm A are Rs. 40,000 for 1999-2000 and
that of B Rs. 50,000 for the same period." On the basis of this
intormation only one would form the opinion that firm B is decid-
edly better than A. However, if we examine the amount of capital
invested in both the firms, the quality of work done, ete.. we
might reach a different conclusion. Hence, while making use of
statistics one should not only avoid outright falsehood but also be
alert to detect possible distortion of truth.
The various ways in which statistics are often misused shall be
discussed in detail in a subsequent chapter. Suffice here to know that
Statistics neither proves anything nor disproves anything. It as only a
tool, ie.. a method of approach. Tools, if properly used, do wonders and.
if misused, prove disastrous. The same is true of statistical tools. If used
properly, they help in taking wise decisions and if misused they can do
more harm than good. But the fault does not lie with the science of
Statistics as such. A few interesting examples can be cited to illustrate
the point. Medicines are meant for curing people, but if a wrong
medicine is take or an excessive dose of a medicine is taken a person
may die. We cannot blame the medicine for such a result. Similarly, if a
child cuts his finger with a sharp knife, it is not the knife that is to
blame, but the person who kept the knife at a place that the child could
get it. These examples help us in emphasising that if statisuual facts are
misused by some people it would be wrong to blame the science as
such. It is the people who are to be blamed. In fact, statistics are like
clay of which one can make a God or Devil as he pleases.
number
the best the carpenter has a
problems. Just
nor
approach to as
also the
of tools each appropriate for a different sort of operation, so of
researcher can avail techniques
himself of various which are the tools
his trade and each of which is appropriate to a specific type of situation.
Which technique or techniques should be applied in a particular
of investigation. Just as
situation would primarily depend upon the object to spoil the work,
the choice of a wrong tool by the carpenter is likely
the choice of a wrong method by the statistician would similarly lead to
wrong conclusions.
GLOSSARY
guantitative Data: Data that possess numerical properties are known ass
quantitative data.
Statistics: Statistics is the use of data to help the decision-maker reach better
decisions.
Variable: A variable is a characteristic that may take on different values at
different times, places or situations.
*It may be pointed out that students want to use calculators in the
examination and the matter is also taken up with the university authorities.
The universities find it difficult to supply calculators to all examinees and
poor students find it difficult to buy and hence even to those who can afford.
calculators are not allowed. If this is done, those who cannot afford would be
put to a disadvantage and injustice would be done to them. Now with the
price of calculators going low more and more universities are allowing the use
of calculators to the students in the examination. However, even while using
the calculators, students should not forget to recheck the caleulations.