Professional Documents
Culture Documents
Kan - Metrics and Models in Software Quality Engineering - Addison-Wesley, 1995. Cap. 3
Kan - Metrics and Models in Software Quality Engineering - Addison-Wesley, 1995. Cap. 3
Kan - Metrics and Models in Software Quality Engineering - Addison-Wesley, 1995. Cap. 3
Chaoter3: Fundamentals
in Meas' rtrl; ,¡ ^ tion,Onerationaf4q andMeasuremenl
.. l1ition,
and Measurernent
Definition,
S.1 Definition,Gperat¡onal goodreviewsandinspectionscould be the scoringof eachinspectionby the inspec-
tors at the end of the inspection,basedon a set of criteria.We may want to opera-
It is an undisputedstaternent that measurenentis crucial to the progressof all.sci- (for
tionally use a five-pointLikert scaleto denotethe degreeof effectiveness
ences.Scientificprogressis madethroughobservations andgeneralizations basedon
example,5=verYeffective,4=effective,3=somewhateffective,2=noteffective,
dataandmeasurements, the derivaüonof theoriesas a result,andin tum the confi¡-
| = poof inspection).Therernay alsobe otherindicatorsin additionto thesetwo.
maüonor refutationof theoriesvia hypothesistestingbasedon furtherempiricaldata. In additionto design,designreviews,codeimplementation, and codeinspec-
As anexampletaketheproposition"the morerigorouslyttrefront endof thesoftware testingis alsopart of our defrnitionof the front endof the devel-
tions,development
development processis executed,thebetterthequalityat ttrebackend."To confrm or
opmentprocess.We also needto operationallydefine"rigorousexecution"of this
refute this proposition, we fust need to deftnethe key concepts.For example,we.
test.Two indicatorsthat could be useda¡ethe percentcoveragein termsof instruC-
definethe softwaredevelopment processanddistinguishthe ftont-endprocesssteps tools) and the
tions executed(as measuredby some test coveragemeasUrement
and activities f¡om thosefor the back end. Assumethat after the requirements- in tern'Nof numberof defectsremovedper thousandlines of
defectrateexpressed
gatheringprocess,our development processconsistsof the followingphases:
sourcecode(KLOC).
fl Likewise,we needto operationallydefinewhatis meantby qualityat theback
Design
end andwhich measurement indicatorsale to be used.For the sakeof simplicitylet
n Designreviewsandinspections
us use defectsfound per KLOC during formal machinetestingas the indicatorof
u Code back-endquality.From thesemetrics,we can then formulateseveraltestable
o Codeinspection suchasthefollowing:
hypotheses
n Debugand development test
ü For softrva¡eppjects, the higher the percentage of the designsand code
o and modulesto form theproduct)
Integration(of components ¿t the
that areinspeóted,the lower the defectratethatwill be encountered
ü Formal machinetesting laterphaseof formal machinetesting.
tr Early customerprograms. n The moreeffectivethe designreviewsandthe codeinspectionsas scored
by the inspectionteam,thelower thedefectratethatwill be encountefed at
the laterphaseof formai machinetesting.
Integrationis thedevelopment phasein which variouspartsandcomponentsare
o The more thoroughthe developmenttest (in termsof testcoverage)done
integratedt9 folr-nthe enti¡esoftwareproduct,qndusgallyafterintegrationthe prod- at the
that will be.encountered
beforeintegration,the lower the defect.rate
uct is underformal changecontrol.Specifically,afterintegrationeverychangeof the formal machinetestingPhase.
softwa¡emust have a specificreason(for example,to fix a bug uncoveredduring
it
testing)and must be documented and tracked.Therefore,we may want to useinte-
grationas the cutoff point:The designto the debugphasesareclassifiedasthe front with the hypothesesformulated,we can set out to gatherdata and test the
hypotheses. We alsoneedto determinettreunit of analysisfor our measufement and
ir end of the developmentprocessandafter integrationit is labeledüe backend.
data.In this case, it could be at the projectlevel oI at the component level of a large
Wethendefinewhatis rigorousimplementationboth in thegeneralsensearidin
specifictermsas they relateto the front end of the developmentprocess.Assuming project.supposewe are able to collect a numberof datapointsthat form a feason-
ables'amplesize(for example,45projectsor components), we canthenperformsta-
the developmentprocesshasbeenformally documented,we may define rigorous
tistical analysisto test the hypotheses. We can classify projectsor components into
implementationas total adherenceto the process:Whateveris describedin the
severalgroupsaccordingto the independent variable of eachhypothesis, then
processdocumentationthatneedsto be executed,we plan to execute.However,this
for our purpose,which is to gatherdatato testour
generaldefinitionis not suffi.cient comparethe outcomeof the dependentvariable(defectrateduringformal machine
proposition.We needto furtherspecifyüe indicator(s)of the definitionandto make testing)acrossthe groups.We can conductsirnplecorrelationanalysis.Or we can
it (üem) ope¡ational.For example,supposethe processdocumentationsays all perform more sophisticatedstatisticalaralyses,If thehypothesesare substantiated
designsandcodeshouldbe inspected. One of our operationaldefinitionsof rigorous by the data,we con{irmthe proposition.If they arerejected,we refutethe proposi-
implementationmay be inspectioncoverageexpressed in termsof thepercentage of tion. If we havedoubtsor unansweredquestionsduring the process(for example:
the estimatedlines of code(LOC) that are actuallyinspected.Anotherindicatorof Are our indicatorsvalid?Are our datareliable?Are thereothervariableswe needto
57
li I : Measurement
chapter3: Fundant*t{t""';1*"t**n ' Theory
primitive
would be definedin tennsof the
they aretheprimitives'All otherconcepts undefined andthe
per- pint andli¡e may be used as
for hypothesistesting?andso forrh),then concepts.For example,tht tont"pt' prirnitives''
control when we conductthe analysis the propositionis can then be defined based on these
Flowev"r, ii the hypothesis(ses) or conceptsof triangleo'
hapsrnore researchis needed. 'ectangl* aredefinitionsthatactuallyspellout
themet-
takeactionsaccordinglyto Operationaldefin¡'¡'u'' in "ont'ast'
:.";;;;;,'r;;n utilize üe knowledgethusgainedand An operational definition
quality' rics to be used*0,t pto"tJures to beusedto obtaindata'
improveour softwaredevelopment a¡d data'It is mea-
"
indicatehow the rveightof a person
is to be measured' what
d"n'on'u^tes the importanceof measurement of "body weight" would
The example unit is used to record the results'An
theprogress of science and engineering' Witltout instrumentis to be used'and what measurement
sürementand datathatreally drive propositionswill defectrate.wouldindicatethe formula
for
and meisurement' theories and operationaldehnitionof softwa¡eproduct
the empirical verificationby data (nuülerator), what denominator (for exam-
defectrate,what defectis to be measured
renrainattheabsfu.actlevel.Theexamp}ea.lsoillusbatesthatfromtheorytotestable andso forth'
to msasufement'.there are**t1-:Y::::n 'ple, linesof codecount,functionpoint) to use'how to measure'
tun"Ot"rtt, and likewisefro* toot"po
simply put, a theory consistsof one or morepfoposr-
iíñ.r.nt t"r"r. of abstraction. expressed
rellrtionships among concepts-usually
;;;;ü;;;""ts lhat describethe
eachproposition'oneor more empirical hypothe-
in termsofcause a¡d effect'From The
3,2 Levelof Measurement
arethenformalll.defined and operationalized. hypothesisand from theoretically
sescan be derived.The concepts whiih data can be We have seenthat from theory to empirical
rnetrics and indicators for processis by no meansdirect'As the
operationalizationprocessñ;;;t from theory deftnedconceptsto operationaiátnnition''
the
empirically' The hierarchy
collected,The hypotheses thus b' tested a definitionand derivemeasurement
"un in .***pt" iliustrates,when we operationalize
measurementindicatorscan be illustrated For instance'to mea-
to hypothesisand from tont"p'to indicators,üe scaieof m"asurement
nqedsto be considered'
Figure3.1. we may use a.five-pointscaleto scorethe
sure the ciualityof softwure'insp"ction
o' *t áoy usepercentage to.indicatethe inspecúoncover-
inspectioneffectiveness
measurement scaleis applicable;for others'the
THEORY CONCEPT age.For somecases,to" tttuno*
I natureoftheconceptun¿tttt'"'uttuntoperationaldefinitioncanonlybemeasured
briefly discussttrefour leversof measurement:
I
I
I
I in a certainscare.In this section,we
I andratio scale'
v V scale,ordinalscale,intervalscale'
""*i"tf
PROPOSITION DEFINITION
AbStfACt Nominal Scale
World t is classi-
I andthe lowestleveiof measurement
¡ - - - - - - - - - l- - - - - - - -
----l ---------- The mostsimpleoperationin science with respect to a
¡! I
sort elementsinto categories
iYv
I
fication. In classifyingwe attemptto we may classify-
of is
interest rerigion,
Emoirical certainatfibute. For example,iithe attribute
HYPOTHESIS DEFTNITION
OPERATIONAL Protestants' Jews'Buddhists'andso on' If
i worto the subjectsof the study tio'átnofOt'
'o processmodelsthroughwhichthe
we classifysofware prorluctsby thedevelopment
categoriessuchas waterfalldevelop-
Ñ;;; í"te <leueláptd,then we may have
process' iterative developmentprocbss'object-
ment process,spUf o""tof*ent
others' In a nominal scale' the two key
DATAANALYSIS MEASUBEMENTSINTHEREALWORLO oriented programmingp'o"t"' ánd
exhaustiveandmutuallyexclusive'
requirements for the categ;;es aráÚratofjointly
canbe classifiedinto one andonly onecategory'
FIGURE3.1 Mutually exclusivemeansa subject all possiblecate-
together should cover
AbstractionHierarchY Jointiy exhaustivemeansthat ail categoriss areinterestedin'
has more categoriesthan we
goriesof the atüibute'If the attribute
anddefinitions'ln a theoreticaldef- theuseofthe..ot¡er,categoryisneededtomakethecategoriesjointlyexhaustive.
orderbearno
The building blocksof theoryareconcepts andtheirsequence
categories
other conceptsthat are alreadywell under- ln a nominalscalt, trt"tiá*t* or the place the
inition a conceptis deñnedin ttt'"' of betweencategories' For instance'we
certainconceptswould be takenasundefined; assumptionsu¡oot
stood.In the deductivelogic system' '"tutio-ns'hips
of Measuremendi-"
^ti
chapter ¡nMeasure. ,r¡4..ll
3: Fundamenlals ;il
3.3 $o¡r¡eBasic lVleasures Proportion also differs from ratio in that ratio is best usedfor two groups
Regardless of the measurementscale,whenthe dataarcgatheredwe needto analyze whereasproportionis usedfor multiplecategoriesin one group.In otherwords,the
them so rve can extÍactmealingful information.Variousmeasuresand statisticsare denominatorin the aboveformulacanbe morethaniust a + ü. If
availabiefor summarizingthe raw dataand for makingcomparisonsacrossgroups.
In this sectionwe discusssomebasicmeasures suchasratio,proportion,percentage, a +b +c+d +e = N
and rate,tvhich arefrequentlyusedin our daily lives aswell as in variousactivities
associatedwith softvare developmentand softwa¡equality.Thesebasicmeasures, thenwe have
while seeminglyeasy,areoften misused.The¡l a¡ealsonumeroussophisticated sta-
tistical techniquesand methodologiesthat can be employedin data anaiysis. a b cd e .
- +- +- +* +- = I
However',such topicsa¡enot within the scopeof this discussion. N N JV¡ fN
llad theresultsbeenstatedasfollows,it wouldhavebeenmuchmo¡einformative: computed.In a two-way table,the di¡ectionthe percentaSes ale computeddepends
on tire purposeof the comparison'Fo¡ instance, the percentages in Figure 3'2 are
The project consistsof 8 thousandli¡res of corie (KLOC). computedvertically (the total of each column is 100'070), and the purposeis to
During its developmenta total of 200 defectsweredetectedand pro-
comparethe defect-typeprofile acrossdifferentprojects(for example,projectB
removed,iiving a defectremovalrateof 25 defectsper KLOC. project A). In Figure 3.3' the
Of the 200 defects,requirementsbugsconstitutedl5%, design portionaliy has more requirementsdefectsthan
The purposehere is to compare the distribu-
btgsZ5Ta,codingbugs50%,andotherbugsmadeup 102o, ;srcentagesarecomputedhorizontally.
tion of defectsacrossprojectsfor eachtypeof defect.The intelpretationof the two
comparisonsdiffer'Therefore,rveshouldalwayscarefuliyexamineperce
A secondimportantrule of thumbis thatthe total numberof casesmustbe sufh- arecalculated'
tablesto detetmineexactlyhow the percentages
ciently largeenoughin orderto usepercentages. Percentages computedfiom a small
total are not stabie;they alsoconveyan impressionthat a largenumberof casesare
involved.some w¡itersrecommend th¿ittheminimumnumberof casesfor whichper- Projecl
centagescan be calculatedshouldbe50 or more.We recomrnend that, dependingon
Type of Defect A Tolal (N)
the numberof categories, theminimumnumbershouldbe no lessthan 30, the small-
est samplesizerequiredfor paranietric-statistics.
If the numbe¡of casesis too small,
Requirements(o/4 30.3 43.4 26.3 100.0 (e9)
the absolutenumbers,insteadof percentages, shouldbe used.For instance,
Design(7") 49.0 22.5 28.5 100.0 (102)
Of the total 20 defectsfor üe entireprojectof 2 KLOC, therewere
3 requirements bugs,5designbugs,10codingbugs,and2 others. Code (%) 16,9 26.6 100.0 (177)
Design 2 5 .0 2 1 .8 Rate
Defect rate=
Numberof defects
O FE
XK lL- s9.9S37%
99.999S43%
99.9999998%
In softrvare,defectrate is usuallydefinedas the numberof defectsper KLOC in a
giventime unit (for example,one.yearafterthegeneralavailabilityof theproductin
the marketplace,or for the entire life of the product).Note that this defectsper FIGURE3.4
KLOC merric is alsoa crudemeasure. Fi¡st, üe opportunityfor enor is.notknown. Areas Under the Normal Curve
Second,while any line of sourcdcodemay be subjectto effor, a defectmay involve
many sourcelines. The¡eforethemet¡icis only a proxy measureofdefect.rate even percentageof defect-free
If we take the areawithin the six sigmalimit asthe
assumingthere areno otherproblems.Suchlimitationsshouldl-¡etakeninto account percentage of defectiveparts'we find that
partsand the areaoutsidethe limit asthe
when analyzingresultsor interpretingdatapertainingto softwarequality. perbillion partsor 0'002 panper mil-
defectives
six sigmais equalto two defectives
lion(PPM)'Theinterpretationofdefectrateasitlelatestothenormaldistribution
Six Sigma
willbeclearerifweinclurlethespecificationlimitsinthediscussion,asshorvninthe
(which werederivedfrom cus-
The term six sigmarepresents a stringentlevel of quaiity.Il is a specificdefectrate: top panelof Figure3.5. Giventhespecificationlimits
3.4 defective parts per million (ppm). It was madeknown in the industry by tomers,requirements),ourpurposeistoproduc"partsofproductswithinthelimits.
Motorola, Inc., in the late 1980swhen Motorola won the fi¡st Malcolm Baldrige Partsorproductsoutsidethespecificationlimitsdonotconformtorequirements.If
National Quality Award (MBNQA) andhasbecomean industrystandardas an ulti- wecanreducethevariaüonsintheproductionprocesssothatthesixsigma(standard
matequaliiy goal. deviations)variationoftheproductionprocessiswithinthespecificationlimits'then
Sigma (o) is the Greeksymbolfor standarddeviation.As Figure3.4 indicates, we will havesix sigmaquality level'
nornal distribution'It
the areasunder the curve of normaldistributiondefinedby standa¡ddeviationsa¡e The six sigmavatueof ó.002ppm is from the statisrical
constantsin termsof percentages, regardlessof the distributionparameters. The area processwill produce the exactdistri-
assumesthat eachexecutionof thelroduction
under the curve as deflrnedby plus andminus one standarddeviation(sigma)from butionofpartsorproductscenteredwithfegardtothespecificationlimits.Inreality,
due to variationsin processexe-
the mean is 68.267o,The areadefinedby plus/minustwo standarddeviationsis however,ihere are alwaysprocessshifts and drifts
95.44To,and so forth. The a¡eadefinedby plqs/minussix sigmais 99.9999998V0. process shifts as indicated by research(Hany, 1989)is 1'5
cution. The maximum
The areaoutsidethe six sigmaareais thus 100%-99.9999998Vo = 0.0000002%. sigma.Ifweaccountforthisl.5-sigmashiftintheproductionprocess'weivillgel
the bottom two panelsin Figure
th-evalue of 3.4 ppm. Such shiftingls illust¡ateilin
il ""tll" ",1
n: ,,,1}
in Measursr" . Thér ...,' S ¡ BasicMeasr"t{,
Chaoter3: Fundamentals
,AÑ tency,predictability,
but¡ot valid
Reliable
w Validbutnotreliable andreliable
Val1d
3.5 lVleasurement Errors
wediscuss
Inthissection andreliabiütyflY:::'r:'r"ffi:T:ilT:il:;
validitv
itrra *a two types of measurement
measurement eno, ¡ u"otiu*i *itf'
bility. Let us revisitour t*u*pi"
l0 lb. Usingthe scale, each til"u"
errors: systennttc
validity; t*d9*:1*
uUoutti't úathroom weight
it associated
sci{é
*"igt'' we will get-ameasurement
wiih
with relia-
anbffset of
that is 10 lb'
amongmea-
in aJditionto the slight v¡riations
FIGURE3.6 more than our actualUoOy*"igttt' from the scaledoes
value of the measürements
:AnAnalogyto Validityand Reliability surements'Therefore,the expeJted of l0 lb' In simple formula:
deviadon
TheP&ctheolsocíatResea¡clt'Belmont'Calil:WadsworthPublishingCo'1986'
(Sou¡ca;Babble,E., not equalthe truevalued""
torepr¡nt')
co''wiihpermisslon
P'bl¡"ltins
óisas wuo"*o'th 'i;'ilyt;;;atic
;;ftñ;6i
from the scale= .
Measurement -.-.'^-
Notethatthereissómetensionbetweenvalidityand¡eliability.Forthedatato i;;Ñy weight + 10 lb' + Randomvanauons
endeavor'
.be reliable,the measurement mustbe specificallydefinedandin suchan
iepresent th; theoretical.concept validly may be high. on
ttrerisk of being not ableto In a generalcase:
be quite diffrcult to
, ¡ the other hand,for the definüon to havegood validit¡ it may
't For example' the measurement of churchatten-
áufrn. preciselythe measurements' it may
M =T+s+e
because it is specific and observable' However'
, á*.. *uy be quite reliable on the other hand' to derive score'I is the true score's is
systematrcerror'
r r not be valid to representthe conceptof religiousness. where M is the observed/measured
quite difficult' In the ¡eal world
vaüd measurements for the religioünessconceptis and e is randomeuor'
for a certain trade-off or balance
of measurements andmetrics,it is not uncommon *:11-*T"' the measurement invalid'
The presenceof s (systematic:"-).: theequation'We
to be madebetweenvalidity andreliability' i' nutidand thes term is not in
whenwe try to usemetricsandmea- Now let us assumethemeí'u"*tnt
Validity and reliabiliti issuescomeabout
to representabstacttheo¡eticalcons.tructs. In traditionalqualityengineer- have the following:
surements
arefrequently physical and usually do not involveabsract
ing wheremeesuremenrs
oi vatiaity and reliability a¡e termed accuracyandpreci- M =T+e
concepts,the counterparts
sunoundsthe terminologyfor accu- is not equalto the true
sion (iuran and Gryna,1g?0).Much confusion any particularobservedscore
distincriydifferentmeanings.lf we . The equationstill statesthat disturba¡cesmean
,u.v L¿ pr..irion despitethe two termshaving Ji'tuÁun"t-tht rundom enor e' These
measu¡ement (for example'accuracyup sco¡ebecauseof
runt u *L.h higherdegreeof precisionin 'un¿o*
tothreedigitsafterüedecimalpoi-rrtwhenmeasuringheight),thenourchanceof
chapter
s:.""0"*"",$.lj,t easureme,tr
rheory
lul"
),*,"*r,.*
of the X variable
pxr, is the estimatedreliability
of the Yvariable
pyy,is the esrimatedreliability
P(x,),)= o'2l'/l'sxrtl
- n?¿
if both rvere
betü'eenX and Y rvouldbe 0'34
This meansthat the conelation
enor'
measuredperfectly without
References
l. Babbie,E.,The Practiceof SocialResearch,4thed.,Belmont,Calif.:Wadsworth
PublishingCo., 1986.
2, Card, D. N., and W W. Agresti, "Resolvingthe SoftwareScienceAnomaly,"
Jounnl of Systems and Soflware,Yo7. 7, March 1987,pp, 29-35.
3, Carmines,E. G., and R. A. Zeller, Reliability andValidity Assessment, Beverly
Hills,Calif.: SagePublications, 1979.
4. Fitzsimmons,A., and T. Love, 'A Review and Evaluationof Softlva¡eScience,"
ACM ComputíngSrrvays,Vol. 10,No. l, March1978,pp.3-18.
5, Halstead,M.fl., Elentents of SofiwareScience, NewYork:Elsevier,1977.
6. Hany, M. J., "The Nature of Six Sigma Quality," GovemmentElectronicsGroup,
Moto¡ola,Inc.,Schaunrberg, Il., 1989.
7. Harr¡ M. J., and J, R. Lawson, Six Slg;na Producibilíty Analysisand Process
Clnracterization, Reading,Mass.:Addison-Wesley,1992.
8, Information obtainedfrom Motorola when the authorwas a memberof a software
benchmarkingteam at Motorola,Schaumberg.Il., on February9, 1990.
9, Jones,C., Applied Sofware Meásurement:AssuringProductivityand eualiry,New
York:McGraw-Hill. 1986.
10. Jones,C., "Critical Problemsin Softwa¡eMeasuremerrt,', Version 1.0, Burlington,
Mass.:Softwa¡eP¡oducriviryResearch(SPR),August 1992.
11. Juran,J. M., and F. M. Gryna, Jr., QuatíryPlanning and Analysis:F¡om product
DevelopmentThrough Ute, NewYork: McGraw-Hill, 1970.
12, Schneidewind,N. F., "Reporr on IEEE Standardfor a Softwa¡e euality Metrics
Methodology (Draft) P1061,with Discussionof Metrics Validationl' proceedings
IEEE Fourth Software Engineering Standards Applicatiott Workshop, 1991,
pp. 155-157.
13. Smith, W 8,, "Six Sigma:TQC, American Style," presenredat rhe National
TechnologicalUniversity televisionserieson October3i, 1989.
"'!fi I
t,,'i
d,r"'",i ',;it
X.''r',r/
Critedafor Causality
in MeasurementTheory
Chapter3: Fundamentals
Finally,whileasignificantcorrelationdemonstratesthatanassociationexists
noise rela-
only ordinaldata'If thereis too much betweenth-etwo variables,it doesnot automaticallyimply
a cause-and-effect
whereasrank-ordercorelationrequires calculatedwill be greatly
p"^'*on coefhcient thus tionship.Althoughitisanelementofcausality,correlationaloneisinadequateto
in the intervaldot^,tft" "orr"elation if we know the reliability of thevari- thecriteriafor estab-
As we tliscussed in the last section' showtheexisrence Inthe nextseciion,we discuss
of causality.
attenuated. we have no
resultantcorrelation'However' if
ables invotved, we can adjust the rvill be more
lishing causalitY.
of theva¡iables' correlation
knowledgeaboutihe reliability :ltk::td.tt the noisesof thedatadid
Specifrcally' if
likely to detectthe underlyiniretationst'ip' will be
affectthe originat orOerin"g of thedatapoints'thenrank-ordercorrelation 3.7 Oriteriafor CausalitY
nol conela-
t*" relationship'Sinceboth Pearson's
more successfutin r"p'='"ntñg *tt The isolationof causeand effect in controlle{ experiments
is relativelyeasy' For
coffelation arecoveredin basicstatisticstextbooks
tion and Spearman's thecalcu- was administered to a sample of subjectswho are
'^*-Jti softwarepackages'rvedo not get into example,a headachemedicine
and areavailablein moststatistical A placebo rvas administerecl to another group with headaches
having headaches'
iation detailshere. statisticallynot different from the first group)'If after a certaio
time of
(least-squares method)is very vulner- i;t"iuot
Thirdly, the methodof linearcorrelation the headaches of the ñrst group were
a iew extremeoutliers the sample'the
in correla- iaking the headachemedicineand the placebo,
able to extremevalues'fi me'" are shows a mode¡ate ,eduJedor disappeared while they still persisted among the second group' then the
For exampre, Figure 3.9
tion coefficientmaybe seriouslyaffected. medicine is clear'
Y' Horvever'dueto a coupleof extremeoutliers curingeffectof the headache
negativerelationshipbet'u'enX aod ñor analysiswith observationaldata,the taskis much
morediffrcult.Research-
coeftrcientwill becomepositive.This
ar the northeast.oor¿lnu*r,'ih* conelation th-reecriteria:
earlierpoint that when correlationis used'
one ers (for example,Babbie,1986)haveidentified
outlier susceptibilityreinforcesour
of the data'
shouldalsolook at the scatterdiagram tr..Thefirstrequi¡ementinacausalrelationshipbetweentwova¡iablesisthat
in logic'
the causepie.edesthe effect in time or asshownclearly
,, The secondrequi¡ementin a causálrelationshipis thatthe two variablesbe
.orpi¡.uUy conelatedwith one another'
empirical
The thirclrequirementfor a causalrelationshipis thattheobserved
i",,Joii"n ti"t*een rwo variablesis not becauseof a s¡uriousrelationship'
Thefirstandsecondrequirementssimplystatettratinadditiontoempiricalcorrela-
sequenceof occurrenceor
tion, the relationshiphas to be examinedin terms of
deductivelogic.Correlationisastatisticaltoolandcouldbemisusedwithoütthe
the outcomeof a
guidanceof i logic system.For instance,it is possibleto correlate
"superBowllt.lationalFootballLeagueversusAmerica¡FootballLeague)tosome
popularcolor' and so forth) and
interestingartifactssuchas fashion(lengthof skirt'
spuriousassociationcannotsub.
weathel.However,logic tells us that coincidenceor
staotiatecausation.
Thethirdrequirementisadifncultone.Thereareseveraltypesofspuriousrela-
a formidabletask to
tionships,as shownin Figure 3.10, and sometimesit may be
relationship.For this rea-
show that the observedconelationis not dueto a spurious
in observationaldatathañ in experi-
son,it is much mofe diffrcult to provecausality
examining for spurious relationships is a must for
FIGURE 3.9 mental data.Nonetheless,
:.# scientificreasoningandasaresult,ourconclusionorfindingsfromthedatawillbe
Etfect of Outliers on Correlation
of higherqualitY.