Professional Documents
Culture Documents
JSM2010 Yung PDF
JSM2010 Yung PDF
Yiu-Fai Yung
Senior Research Statistician
SAS Institute Inc.
Cary, NC 27513 USA
SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS
Institute Inc. in the USA and other countries. indicates USA registration.
Other brand and product names are registered trademarks or trademarks of their respective companies.
Abstract
TheCALISprocedureinSAS/STATisageneralstructuralequation modeling(SEM)tool.
ThisworkshopintroducesthegeneralmethodologyofSEMandtheapplicationsofthe
CALISprocedure.Historicaltopicssuchascasualmodels,pathdiagram,confirmatory
factoranalysis,measurementerrormodel,andlinearstructuralrelations(LISREL)are
reviewed.ApplicationsoftheCALISproceduretoSEMaredemonstratedwithexamples
insocial,educational,behavioral,andmarketingresearch.Specifically,thefollowing
howtotechniquesoftheCALISprocedure(SAS/STAT9.22)arecovered:(1)Specifying
structuralequationmodelswithlatentvariablesbyusingthePATHmodelinglanguage;
(2)Interpretingthemodelfitstatisticsandestimationresults;(3)Testingmodelswith
multiplegroupsandmultiplemodels;(4)Analyzingdirectandindirecteffects;(5)
Modifyingstructuralequationmodels.
Thisworkshopisdesignedforstatisticiansanddataanalystswhowanttooverviewthe
applicationsoftheSEMbytheCALISprocedure.Attendeesshould haveabasic
understandingofregressionanalysisandexperienceusingtheSASlanguage.Previous
exposuretoSEMisuseful,butnotrequired.
Citationofthisworkshop:
Yung,Y.F.(2010).IntroductiontoStructuralEquationModelingUsingtheCALIS
ProcedureinSAS/STAT Software.Computertechnologyworkshoppresentedatthe
JointStatisticalMeetingonAugust4,2010,Vancouver,Canada.
1
SAS/STAT 9.22 or later
is assumed
for this workshop
Inthisworkshop,SAS/STAT9.22(TS2M3)orlaterisassumedfortheCALISprocedure.
SomeofthecodemightworkwithPROCTCALIS(anexperimentalprocedure)in
SAS/STAT9.2(TS2M2).However,thereisamajorsyntacticaldifferencebetweenPROC
TCALISandPROCCALIS.InPROCTCALIS,theparameterspecificationforeachpathin
thePATHstatementmustnot beprecededbyanequalsign.Butthisequalsignis
requiredinPROCCALISwhenyouspecifyparameters.Also,PROCTCALISdoesnot
supportthegeneralizedpathspecifications(forvariances,covariances,means,and
intercepts)andmultiplepathspecificationswhenyouusethePATHmodelinglanguage,
whichisthemainfocusoftodaystalk.
2
Causal Model, Prediction, and Path Diagram
X causes Y
X predicts Y
Linear regression equation
Y = b X + ey
Path diagram
b
ey Y X
Thecentralideaofstructuralequationmodelingisthestudyof causalrelationship
betweenvariables.Forexample,youhaveanXandanYvariable. XisthecauseofY,or
doingXresultsinY.Togiveamorerealisticexample:eatingmorevegetables(X)brings
downyourcholesterollevel(Y).However,thiscausalstructureisonlyanidealized
framework.Inmakingcausalinferences,youmusthaveisolatedallotherbackground
variablesandestablishtemporalsequenceofthevariables.Becauseofthecomplicated
philosophicalissuesinvolvedinmakingcausalinferences,ingeneralSEMwouldavoid
claimingcausalinferences.
Apredictoroutcomeframeworkmightbemoreappropriatephilosophically.The
semanticisnow XpredictsY.Mathematicallyandstatistically,thisideaisrepresented
inthesimplelinearregressionanalysis,asshowninthelinear regressionequation:
Y=b*X+e.
Thepathdiagramforthisrepresentationisshownintheslide,wherebiscalledthe
effect,regressioncoefficient,orpathcoefficient.Noticethatanerrortermisaddedto
showthatthepredictionofYfromXisnotperfect.Butessentially,thepredictor
outcomeframeworkisrepresentedbytheYXpathinthepathdiagram.
3
Structural Equation Modeling versus
Regression Analysis
Group 1
More variables a
Y X
More equations LV
Correlated errors Z a W
WhatarethedifferencesbetweenSEMandregressionanalysis?WhatmorecanSEM
offerthanthelinearregressionanalysis?
YoucanviewSEMasamuchmorecomplicatedsystemformultiplepredictoroutcome
relationships.SEMcanhandlethefollowingsituationswherelinearregressionanalysis
isoflimitedusefulness:
1.Morevariables(notjustXandY,butyoucanalsoaddWandZintothepathdiagram).
2.Moreequationsorfunctionalrelationships(notjustXY,butyoucanalsoanalyze
WZsimultaneously).
3.Correlatederrors,systemofequationscanhavecorrelatederrors.Forexample,the
doubleheadedarrowbetweenYandZ.
4.Directandindirecteffects:XhasadirecteffectonZandanindirecteffectonZviaits
effectonW.Thatis,XZandXWZaredirectandindirecteffects,respectively.
5.Latentvariables.Forexample,LVinthepathdiagramhaseffectsonXandW.
6.Parametricconstraints.Forexample,theconstraintsonthepath coefficientsor
effectslabeledasa intheupperpathdiagrams.
7.Multiplegroupanalysis.Fordifferentgroupsofpopulations,theoverall structureof
themodelarethesame,butthepathconstraintscouldbedifferentwhilethe
constrainedeffectinGroup1isdenotedasa, theconstrainedeffectinGroup2is
denotedasb, whichwillhaveadifferentestimatethanthatfora inGroup1.
4
Other Names for Structural Equation Modeling (SEM)
Path analysis
LISREL model (Jreskog 1973, Keesling 1972, Wiley
1973)
Covariance structures analysis
Analysis of moment structures
Confirmatory factor analysis
Causal modeling
CALIS: Covariance Analysis of Linear Structural
Equations
SEMhasalotofsynonymsinthefield:Pathanalysis(attributedtoSewallWright),
LISRELmodel(JKWmodel),covariancestructuresanalysis,analysisofmoment
structures,confirmatoryfactoranalysis,causalmodeling,andetc.Intermsofthe
statisticalmethodologyinvolved,allthesenamesaremoreorlessthesame.
PROCCALIS,whichstandsforcovarianceanalysisoflinearstructuralequations,isa
softwarethatwasdesignedtohandlealltheseanalysesundertheumbrellatermSEM.
Hopefully,onedayPROCCALISwouldalsoberememberedasasynonymofSEM.
5
SEM Software
ThereareseveralwellknownsoftwareinthefieldfordoingSEM:AMOS,EQS,LISREL,
andMPLUS,andmaybemore.WhyPROCCALIS?Whichisbest?Althoughtheseare
veryinterestingquestions,asthecurrentdeveloperofPROCCALISprocedureIamnot
atlibertytojudgeotherSEMsoftware.Thisworkshopgivesyouanintroductorytourof
SEMwiththeuseofPROCCALIS.Therefore,youmightcomparePROCCALISwithother
softwareonyourownafterlearningsomeofthefeaturesofPROC CALIS.
6
A Very Brief History of PROC CALIS
LetusstartwithabriefhistoryofPROCCALIS.
Ineighties,WolfgangHartmanndesignedanddevelopedthefirstversionofPROCCALIS.
Thestatisticalandmathematicalmodelwasgreatlyinfluencedby theCOSANmodel
proposedbyR.P.McDonald.Infact,therewasevidencethatCosan,insteadofCalis,
mighthavebeenproposedasthenameoftheprocedure.Themostpopularsyntaxin
PROCCALIS,however,wasundertheinfluenceoftheEQSprogrambyPeterBentler.
TheLINEQSsyntaxinPROCCALISformodelspecificationisbasicallyatwinbrotherof
thesyntaxoftheEQSprogram.
Ipickedupthedevelopmentofthesoftwarearound2000.Iactuallyrewrotethe
mathematicalfoundationsofthesoftware.Ikepttheoptimizationtechniquesandinitial
estimationtechniquessothatthenewCALISiscompatiblewiththeoldCALIS.
In2008,anexperimentalversioncalledTCALISwasreleased.Sincethen,Ihavemodified
thesyntaxalittlemoreandfixedsomemajorbugs.
ThenewCALIS(SAS9.22)hasbeenreleasedthisyear.IfyouhaveusedPROCCALIS
before,youwillnoticeonemajorchange:theemphasisonthePATHmodelinglanguage.
YoucanseeexamplesusingthePATHstatementeverywhereinthePROCCALIS
documentation.Othernoteworthynewfeaturesare:multiplegroupmodeling,
redesignedmeanstructureanalysis,andthenamefreeapproachtoparameter
specifications.Certainly,therearemanymorenewfeaturesthan these,asyouwilllearn
fromthisworkshopandelsewhere.
7
Structure of the Workshop
ThefirstpartoftheworkshopisaboutthebasicSEMmodelingusingPROCCALIS.Iwill
describetheresearchprocessofSEMbriefly.ThenIwillintroducethePATHmodeling
languageinPROCCALISbyusingasimplelinearregressionexample.Next,Iwillmove
ontomorecomplicatedexamplesthatanalyzeconfirmatoryfactormodels.Iwilluse
PROCCALISintheseexamplestoshowhowyoucanspecifySEMmodelsbythePATH
modelinglanguage,inrelationtothepathdiagramrepresentations.Iwillshowyouhow
tointerprettheresultsgeneratedbyPROCCALIS.Iwillendthe firstpartbyshowingyou
howaLISRELmodelcanbespecifiedbytheLISMODstatementinPROCCALIS.
Thesecondpartoftheworkshopisaboutadvanced modelingrelativelyspeaking.I
willshowhowmultiplegroupanalysiscanbedoneinPROCCALIS.Otherimportant
topicssuchasdirectandindirecteffectanalysis,testingspecifichypotheses,andmodel
modificationsarediscussed.
8
Emphases of the Workshop
Therearetwoemphasesofthistalk.
One,IwanttoshowyouanoverallpictureofSEM.ThisaddressesthewhatisSEM?
question.Iwillnotgiveyouatechnicaldefinition,butIwill showyouSEMexamplesso
thatyouwillhaveareal feelingabouttheapplicationsofSEM.
Two,IwanttoshowyouhowtousePROCCALIS.ThisaddressestheHowtodoSEM?
question.Ihopethatintheendoftheworkshop,youwillfindthatPROCCALISisvery
usefulforSEM.
9
Illustrating the Process of
Structural Equation Modeling
10
10
A Structural Equation Model of Web-Surfing
Behavior (Novak, Hoffman, & Yung 2000)
-.32 .31
Interactive .19 Future Use
.42 Speed Playfulness
.19
.47 .45
.25
.33 Exploratory
Importance Challenge Arousal Behavior
.70
.51 .24
.51 .14 .13
.24
Focused .67 Time
Distortion Stimulation
Attention Level
11
Thisisastructuralequationmodelaboutwebsurfingbehavior.Theresearchers
hypothesizethatthePlayfulness ofawebsitewouldenhancethefutureuse(Future
Use)ofthesamewebsite.However,thetheorydoesnotendthere.Theresearchers
thenhypothesizewhatwouldmakeawebsitetobeperceivedasplayful.Three
additionalconstructsarehypothesizedinthepathdiagram:Control (ofthewebpage),
Arousal (ofinterest),andFocusedAttention aredeterminantsofPlayfulness. In
fact,theresearchershypothesizedevenfurther.Forexample,theyuseStartUse
(whentheusersstartedtousecomputers)andTimeUse (howoftentheyuse
computers)asremotecauses ofalotoflatentconstructsinthepathdiagram.Insum,
thisisarelativelylargeSEMthattheorizescomplicatedrelationshipsamongconstructs.
Inthispathdiagram,theovalshapesrepresentlatentvariables,whicharenotmeasured
butserveasusefulconstructsinthemodel(e.g.,Playfulness).Therectanglesrepresent
measuredorobservedvariables(e.g.,StartUse,TimeUse,FutureUse).Inorderto
analyzethelatentconstructs,somemeasuredvariables(orindicators)areneeded.Inthe
pathdiagram,thosesmallunlabeledrectanglesaremeasuredindicatorsforthelatent
constructs.Inthisresearch,thesemeasuredindicatorsareratingresponsesona
questionnaire.Forexample,Ilosttrackofthetimewhenusingthiswebsite (thisis
notanexactitemfromtheactualresearch)couldbeanitemfor theTimeDistortion
construct.
Giventhispathdiagramforthetheoryaboutwebsurfingbehavior,anSEMsoftwarefits
themodelbasedontheobserveddataandinformsyouthemodelfitandtheestimates
oftheeffects(pathcoefficients)inthepathdiagram.TheSEMsoftwarealsotellsyouthe
significanceoftheseestimates.Ifthemodeldoesnotfitthedatawell,theSEMsoftware
wouldsuggestwaystoimprovethemodel.
11
Key Features of SEM
12
HereisalistofthekeyfeaturesofSEM:
Analyzingcomplicatedrelationshipsamongvariables
Pathdiagramrepresentationsformodels
Abilitytohandlelatentandobservedvariablessimultaneously
Testingthemodelfitandsignificanceoftheparameters
Suggestingwaystoimprovethemodel
12
Basics: A Simple Regression
Model and the PATH
Modeling Language
13
13
A Simple Linear Regression Model
y = b x + ey
y: outcome variable
x: predictor variable
ey: error term
b: effect or regression coefficient
14
TointroducethePATHmodelinglanguageinPROCCALIS,asimple linearregression
modelisused.Intheregressionequation,yistheoutcomevariable,xisthepredictor
variable,e_y istheerrortermandbistheeffectorregressioncoefficient.The
regressionmodelwritteninthisformassumesthatxandyarecenteredwithmeans
zero.Butthisassumptionwillnotaffectthegeneralitytouncenteredvariables.
14
Measures of the Number of Hen Pheasants
15
Onp.34ofFullersbookMeasurementErrorModels,hedescribesadatasetabout
thecountingofhenpheasantsinAprilandAugust.Fifteentrainedobserverscounted
thenumberofbirdsinthetwooccasions.YisthenumberofbirdsinAugustandXisthe
numberofbirdsinApril.Thegoalofthelinearregressionistopredictthenumberof
birdsinAugust(Fall)bythenumberofbirdsinApril(Spring).
15
Regression Analysis by PROC REG
data hens;
input y x @@;
datalines;
8 9 6 6.6 9.8 12.3 10.8 11.9 9.7 11.9 9.3 12
9.2 9.6 6.9 7.5 8.1 10.9 8.7 10.4 8.7 10.2 7.4 7.4
10.1 11 10 11.8 7.3 8.2
;
proc reg data=hens;
model y = x;
run;
16
Toconductalinearregressionanalysis,youcanuseaSASprocedurecalledPROCREG.
Thesyntaxisquitesimple.First,defineyourdataset.Second, callPROCREGwiththe
interesteddatasetspecifiedinthePROCREGstatement.Then,themodelstatement
specifiesthaty=x,whichmeansyispredictedbyx.Noerrortermneedstobespecified,
althoughPROCREGdoesassumethatpredictionisnotperfectsothattheerrordoes
existwithnonzerovarianceintheregression.
16
Results Obtained from PROC REG
Parameter Estimates
Parameter Standard
Variable DF Estimate Error t Value
Given a base survival of 2.14 birds, every additional bird in Spring predicts a
0.65 bird surviving in August.
17
ThistableshowstheessentialresultsfromPROCREG.Theoutput showsanestimateof
0.65fortheregressioncoefficientb.Theinterceptestimateis2.14.PROCREGalso
showsthestandarderrorestimatesandthetvaluesforjudgingstatisticalsignificance.
Bothestimatesarestatisticallysignificant.
Aninterpretationabouttheseregressionestimatesisthis:Givenabasesurvivalof2.14
birds,everyadditionalbirdinSpringpredictsa0.65birdsurvivinginAugust(Fall).
17
Regression Equation, Path Diagram, and the PATH
Modeling Language
y = b x + ey
Outcome Parameter Predictor
b
y x
PATH y <--- x = b;
Path Path Relation Parameter (optional)
Statement
18
Asshownpreviously,youcanrepresentthelinearregressionmodelbythepathdiagram,
whichisalsoarepresentationschemeforSEM.Hence,regression modelscouldbe
specifiedasSEMbyusingthepathdiagram.
HereiswhatyoudotospecifyasimplelinearregressionmodelinPROCCALIS.Youuse
thePATHstatementtospecifythepathintheregressionmodel.Inthiscase,itisjustY<
XinthePATHstatement.Optionally,youcandenotethecorrespondingpath
coefficientparameter.Forexample,youcanput=b atthebackofthepathtodenote
theparameterlabelorname.
18
Regression Model Specified by PROC CALIS
19
ThisistheentirePROCCALISsyntaxforthesimplelinearregressionmodel.Isntthat
easyandsimple?
19
Results from PROC CALIS for the Pheasant Data
PATH List
Standard
--------Path-------- Parameter Estimate Error t Value
Variance Parameters
Default variance
parameters
Variance Standard
Type Variable Parameter Estimate Error t Value
20
ThisslideshowstheresultsfromPROCCALIS.
Theestimatedeffectofxony,denotedasy< xintheoutput,is0.65,whichisthe
sameasthatinthePROCREGresults.Becauseyoudidnotnamethisregression
coefficientparameter(butyouspecifythepathnonetheless),PROCCALISgeneratesa
uniqueparameternamecalled_Parm1forit.Thestandarderrorestimateandthet
valuearealittlebitdifferentfromthatofthePROCREGresults.Thisisbecause
differentdegreesoffreedomforcomputingthestandarderrorsareusedinthetwo
approaches.
InPROCCALIS,italsoincludesresultsfortwomoreparametersintheSEM.Thevariance
ofxandtheerrorvarianceofyaretreatedasmodelparameters.Theirestimatesare
alsoshowninthePROCCALISresults.NotethatPROCCALIScreatesdefaultparameter
namesforthesedefaultvarianceseventhoughyoudidnotspecifythem.Inthis
example,thesevarianceparametersarenamed_Add1 and_Add2,respectively.
20
Specification with Parameter Names
errv_y ey
With an explicit error
term
21
YoucouldnamealltheparametersinPROCCALISbyputtingyourpreferrednames.
Inthepathdiagramatthetoprightcorner,theparametersareshowninred.Inthe
regressionmodel,bistheregressioncoefficient,var_x isthevarianceforthepredictor
variablex,anderrv_y istheerrorvarianceofy.Thispathdiagramrepresentationis
equivalenttotheoneshownatthebottomrightcorner,wherean expliciterrortermis
attachedtoY.Theerrortermisrepresentedbyanovalshapebecauseitistreatedasa
latentvariable.Thisrepresentationhasthesamesetofparameters,onlythaterrv_y is
nowattachedtotheerrorvariabledirectly.
YoucanspecifytheseparametersexplicitlyinPROCCALIS.Intheleftpaneloftheslide,
theparameterbisspecifiedafterthey< xpath,separatedbyanequalsign.To
specifythevariancesorerrorvariancesinthemodel,youcanusethePVARstatement.
Forexample,x=var_x meansthatthevarianceofxisaparametercalledvar_x.
Noticethatnamingparametersisentirelyoptional.Forthisexample,naming
parametersappearstoserveonlyasanillustration.Laterinthistalk,youwillfind
situationswheretheuseofparameternamesisnotonlyuseful,butalsonecessary.
21
Hen Pheasants Results with Parameter
Names Specified
PATH List
Standard
--------Path-------- Parameter Estimate Error t Value
Variance Parameters
Variance Standard
Type Variable Parameter Estimate Error t Value
22
Asshowninthisslide,thenumericalresultsfromPROCCALISwithexplicitparameter
namesspecifiedarethesameasthosewithoutusingparameternames.Theonly
differenceisthatnowyoucanusetheseparameternamestolocatethecorresponding
resultsdirectly.
22
Keys to the PATH Modeling Language
23
Sofar,Ihaveshownyouthat:
1.ThePATHmodelinglanguageisaseasyasdrawingapathdiagram.
2.YoucanusethePATHstatementtospecifythepathsinpathdiagram,withorwithout
specifytheparameternamesforthepathcoefficients.
3.Youcanalsospecifythevarianceorerrorvarianceparametersexplicitly.Inmost
practicalapplications,variancesanderrorvarianceshavealreadybeensetbydefault
andyoudonotneedtoworryaboutspecifyingthem.TheessentialpartofSEMis
specifiedinthePATHstatement.
4.NamingparametersisoptionalinPROCCALIS.
23
Measurement Errors in Predictors
24
LetusmakealittlestepforwardtoshowaspecialSEMfeaturethatlinearregression
cannothandleeasily.
Inthebirdcountingexample,wedidnottakeintoaccountthatbirdcountingcould
involvemeasurementerrors.Inthecurrentcontext,themeasurementerrorinbird
countingcouldbeduetotheenvironmentfactorsintheforest:obstructionfromthe
treebranches,biased anglesfromthebirdobservers,andetc.
Whatyougotfromthedataisx,theobservedfalliblescore.However,ideally,you
wouldwanttousefx,thetruescoreinyourregressionanalysis.
24
A Measurement Error Model for the Pheasant Data
Structural Equation
y = b fx + ey
Measurement equation
x = fx + ex
Can you estimate b?
Problem: The measurement equation introduces an
additional parameter: Var(ex) (variance of ex or error
variance of x)
25
TheprecedingideaisformalizedasthefollowingSEMwithalatentvariablefx.
Inthesocalledstructuralmodel,yispredictedfromfx,thetruescore,inthelinear
regressionmodel.Thissocalledstructuralequationtakestheroleoftheoriginallinear
regressionequationonlynowyouaresupposedtohaveabettermodelbyusingthe
measurementerrorfreefx asthepredictor.
Inthesocalledmeasurementmodel,youhypothesizethattheobservedvariablexis
obtainedasthesumoffx andanmeasurementerrorterme_x.
Canyouestimatebwiththelatentvariablefx inthestructuralequation?
Thisanswerisyes.
Butthetechnicalproblemencounteredhereisthatthemeasurementequation
introducesoneadditionalparametersinvar(ex)errorvarianceofx.Thisproblemwill
maketheSEMunidentified.Inaveryloosesense,thismeansthatyourmodelestimates
moreparametersthanwouldbeallowedbythegiveninformationofthedataset.
Consequently,theparametersinthemodelarenotestimable.Iwilldescribeamethod
todealwiththisidentificationproblemlater.
25
Path Diagram Representations
b b
y x 1
y fx x
One more
parameter
26
Thisslidecomparesthelinearregressionmodelwiththemeasurementerrormodelby
theuseofpathdiagram.Itdemonstrateswhythemeasurementerrormodelhasone
moreparametertoestimate.
Intheleftpanel,thepathdiagramforthesimplelinearregressionanalysisisshown.
Intherightpanelforthemeasurementerrormodel,westillhavexandyasthe
observedvariables.Butnowwehavealatentvariablefx thattakestheroleofthe
predictorofy.Var_x inthismodelnowrepresentsthetruevarianceofthepredictor fx.
Thenewparameterinthemeasurementerrormodeliserrv_x (errorvarianceofx).With
thisadditionalparameter,weneedtomakeadditionalassumption toestimatethe
modelparameters.
26
Constraining the Error Variances
errv_x = 6* errv_y
27
Fortunately,wehaveareasonableassumptionabouttherelativesizeoftheerror
variancesinthemodel.
ThisassumptionisbasedonthefactthatbirdcountinginFallismoreaccuratethanthat
inspring.ThereasonisthatthefallenleavesinFallmakesthecountingofbirdsless
obstructive.
Theassumptionwearegoingtomakeisbasedonanindependentstudyaboutthe
relativeerrorvariancesinxandiny.InFullersbook,theratioofthesevariancesis
about6.Mathematically,Var(ex)=6*Var(ey).Thatis,errorvarianceforxissixtimesas
muchastheerrorvarianceofy.Or,inthePROCCALISspecification,youwanttostate
thefollowingparametricconstraintinthemodeling:errv_x=6*errv_y.
27
A Measurement Error Model with a Constraint
for the Pheasant Data
28
ItturnsoutthatitisprettystraightforwardtospecifythisparametricconstraintinPROC
CALIS.Youjustsimplyaddonemorelineofcodetorepresentthisrelationship,as
shownintheSAScodeoftheslide.IntheSASliterature,thislineofcodeiscalledaSAS
programmingstatement,whichisusedextensivelyintheDATAstepofSAS.Youcanuse
asmanySASprogrammingstatementsasyouwanttodescribetherelationshipsofthe
parametersinthemodel.
28
Measurement Error Model Results for the Pheasant Data
PATH List
Standard
--------Path-------- Parameter Estimate Error t Value
Variance Standard
Type Variable Parameter Estimate Error t Value
Afteryoutakethemeasurementerrorintoaccount,theregressioncoefficientbisnow
0.75,whichisalargereffectthan0.649,whichyouobtainedfromthelinearregression
modelwithouttakingthemeasurementerrorinxintoaccount.Therefore,theprevious
regressionanalysisunderestimatedthiseffectbecauseitfailed toincorporatethe
measurementerrorintothemodel.However,withSEM,youcaneasilyincorporatethe
measurementerrorsintotheanalysis.
Estimatesofthevariancesanderrorvariancesareshowninthenexttable.Youcansee
thattheconstraintspecifiedinthePROCCALISishonoredintheestimation.Theerror
varianceestimateofxis0.49,whichissixtimesasmuchastheerrorvarianceestimate
ofy,whichis0.08.
29
Some Features of the PATH Modeling Language
30
ThisslidesummarizessomefeaturesofthePATHmodelinglanguage.
1.Itisasstraightforwardasdrawingthepaths.
2.Itcandealwithlatentvariableseasily.
3.YoucanusethePVARstatementtospecifyvariancesorerrorvariances(double
headedarrowsattachedtoindividualvariablesinthepathdiagram).
4.YoucanusethePCOVstatementtospecifycovariances orerrorcovariances (double
headedarrowsattachedtopairsofvariablesinthepathdiagram).
5.YoucanspecifyparameterdependencybyusingtheSASprogrammingstatements
directly.Indeed,evenverystrangeandcomplicated(continuous) parametricfunctions
aresupportedinPROCCALIS.
30
A Confirmatory Factor Model
31
Wenowmoveontoamorecomplicatedtypeofstructuralequation modelscalled
confirmatoryfactormodels.
31
Political Democracy Data
32
ThisexampleisbasedonanexampleinChapter7ofBollens classictextbook:Structural
EquationModeling.
Inthisexample,twolatentfactorsformeasuringpoliticaldemocracyin75developing
countriesin1960and1965werehypothesized.
Thesetwolatentfactorsarenotobserved,buttheyhavesomerelatedobserved
variablesthatserveasindicators.Ineachyear,youmeasurefourvariablestogaugethe
politicaldemocracy:freedomofpress,freedomofgroupoppositions,fairnessof
elections,andelectivenatureofthelegislativebody.
Thepurposeoftheconfirmatoryfactoranalysisistovalidatethesemeasurement
indicatorsstatistically.
32
A Confirmatory Factor Model for the Political
Democracy Data (Implicit Error Representation)
1. Press60
Freop60
Dem60
Fair60
Legis60
Press65
Freop65
Dem65
Fair65
1. Legis65
33
Thispathdiagramshowsthehypothesizedconfirmatoryfactormodel.
Inthepathdiagram,twolatentfactorsarerepresentedbytwoovals.Dem60isthe
politicaldemocracyin1960andDem65isthepoliticaldemocracy in1965.Theyare
linkedtotherespectivemeasuredvariables,asshowninthepathdiagram.Thesesingle
headedpathsrepresentthetypicalfactorobservedvariablerelationships.
ThedoubleheadedarrowthatconnectsDem60andDem65representsthecovariance
parameterbetweenthetwofactors.Itmeansthatthetwofactorsarecorrelated.
DoubleheadedarrowsthatareattachedtoDem60andDem65individuallyrepresent
thevarianceparametersofthetwofactors.Inthemodel,youfixthesevariancesto1so
thatthescalesofthefactorsareidentified.Thisisconventionallydonebecausethe
scaleoflatentfactorsisarbitrary(youdonotmeasurelatentvariablesdirectlysothat
theycouldbedefinedonanyunitofmeasurement).
Thedoubleheadedarrowsthatareattachedtotheobservedvariablesrepresentthe
errorvariances.Theysignifythefactthatthefactorsinthemodeldonotaccountfor
100%ofthevariancesoftheobservedvariables.Theerrorvariancesaretheunique
partofthevariancesinthevariablesthatarenotduetotheir relationshipswiththe
factorsinthemodel.
33
A Confirmatory Factor Model for the Political
Democracy Data (with Error Terms)
1.
1. Press60 e1
1.
Freop60 e2
Dem60 1.
Fair60 e3
1.
Legis60 e4
1.
Press65 e5
1.
Freop65 e6
Dem65 1.
Fair65 e7
1.
1. Legis65 e8
34
Thisisanalternativepathdiagramrepresentationwiththeuseofexpliciterrorterms.
Noticethatthedoubleheadedarrowsfortheobservedvariablesnowshifttotheerror
terms.Thispathdiagramrepresentationisshownhereonlyforillustrationpurposes.In
thisworkshop,Irelyonthepathdiagramrepresentationthatdoesnotuseexpliciterror
terms.
34
Basic Confirmatory Factor Model for the Political
Democracy Data
proc calis data=polidem; 1. Press60
path
Dem60 ---> Press60, Freop60
Dem60 ---> Freop60,
Dem60
Dem60 ---> Fair60,
Dem60 ---> Legis60,
Fair60
Dem65 ---> Press65,
Dem65 ---> Freop65, Legis60
Dem65 ---> Fair65,
Dem65 ---> Legis65;
pvar Press65
Dem60 = 1, Dem65 = 1,
Press60 Freop60 Fair60
Freop65
Legis60 Press65 Freop60 Dem65
Fair65 Legis65;
pcov Fair65
Dem60 Dem65;
run; 1. Legis65
35
SpecifyingtheCFAmodelisnotmuchharderthanthepreviousmeasurementerror
model.Basically,youonlyneedtospecifymorepathsfortheCFAmodel.
InthePATHstatement,youspecifyallthesingleheadedpaths(arrows)inthepath
diagram.
InthePVARstatement,youspecifyalldoubleheadedarrowsthatareattachedto
individualvariables.PVARactuallystandsforpartialvarianceyoucanspecifythe
variancesanderrorvariancesinthisstatement.Dem60=1 meansthatthevarianceof
Dem60isfixedtoone.SimilarlyforDem65=1.Theeightobservedvariableare
specifiedinthePVARstatementtosignifythattheirerrorvariancesarefreeparameters
inthemodel.
InthePCOVstatement,youspecifypairsofvariablesthathavecovariances orerror
covariances asparametersinthemodel.Inthecurrentpathdiagram,Dem60and
Dem65arecorrelated.
35
Error Variances, Variances, and Exogenous
Covariances Are Free Parameters by Default
Specifying All Variance and Using Default Variance and Covariance
Covariance Parameters Parameters
proc calis data=polidem; proc calis data=polidem;
path path
Dem60 ---> Press60, Dem60 ---> Press60,
Dem60 ---> Freop60, Dem60 ---> Freop60,
Dem60 ---> Fair60, Dem60 ---> Fair60,
Dem60 ---> Legis60, Dem60 ---> Legis60,
Dem65 ---> Press65, Dem65 ---> Press65,
Dem65 ---> Freop65, Dem65 ---> Freop65,
Dem65 ---> Fair65, Dem65 ---> Fair65,
Dem65 ---> Legis65; Dem65 ---> Legis65;
pvar pvar
Dem60 = 1, Dem65 = 1, Dem60 = 1, Dem65 = 1;
Press60 Freop60 Fair60 run;
Legis60 Press65 Freop65
Fair65 Legis65;
pcov
Dem60 Dem65;
run;
These could have been set 36
automatically by default.
Copyright 2010, SAS Institute Inc. All rights reserved.
Tomakemodelspecificationmoreefficientanderrorfree,PROCCALISemploysdefault
freeparametersinthemodel.Thesedefaultfreeparametersaresetbecausetheyare
commonlyemployedinpractice.
Forexample,becausepredictionsofoutcomevariablesareusuallynotperfect,theerror
variancesarefreeparametersbydefault.ThismeansthatallthePVARspecificationsfor
theobservedvariablesarenotnecessarybecausePROCCALISwouldhavetreatedthem
asfreeparametersbydefault.
Similarly,thevariancesofDem60andDem65andtheircovariance aredefaultfree
parametersbecausetheyareassumedinmostpracticalapplications.Inthecurrent
example,thismeansthatthePCOVstatementspecificationforthecovariancebetween
Dem60andDem65isnotnecessary.
However,becausethevariancesofDem60andDem65arefixedto1 (foridentification
ofthelatentvariablescales),theymustbespecifiedexplicitlyinthePVARstatement.
Otherwise,thesevarianceswouldhavebeenfreeparametersbydefault.
36
Estimates of Path Coefficients (Loadings) for the
Political Democracy Data
PATH List
Standard
----------Path---------- Parameter Estimate Error t Value
ThistableshowstheestimatesofpathcoefficientsfromPROCCALIS.
Inthefactoranalysisliterature,thesepathcoefficientsarealsocalledloadings.To
validatetherelationshipsbetweenthedemocracyfactorsandthe observedvariables,
thetvaluesmustbeexaminedforstatisticalsignificance.Usingnormalapproximation,
tvalueswiththeirabsolutevaluesbiggerthan1.96aresignificantlydifferentfromzero.
Inatypicalfactoranalysisstudy,youwouldwantallthesetvaluestobesignificantin
ordertoclaimnonzerofactorvariablerelationships.Aninsignificanttvaluemeansthat
thecorrespondingvariableisnotanindicatorforthepurported factor.Insignificantt
valuesforpathcoefficientswouldchallengethevalidityofyourfactormodel.
37
Estimates of Variances for the Political
Democracy Data
Variance Parameters
Variance Standard
Type Variable Parameter Estimate Error t Value
Estimatesofvariancesanderrorvariancesareshowninthistable.Thevariancesof
Dem60andDem65arefixedto1andthereforetherearenosignificancetestsforthese
variances.Allothererrorvarianceestimatesaresignificantlylargerthanzeros.Thisalso
meansthatthefactorsdonotaccountforallthevariancesoftheobservedvariables.
Thisisnaturalbecausedeterministicrelationshipsbetweenfactorsandobserved
variablesarerare.
38
Estimate of Covariance for the Political
Democracy Data
Standard
Var1 Var2 Parameter Estimate Error t Value
High and significant correlation between the Democracy factors in 1960 and 1965.
39
ThistableshowsthecovariancebetweenDem60andDem65.Thisestimateisalsothe
estimatedcorrelationbetweenthetwolatentfactorsbecausethe variancesofthe
factorsarefixedtoone.Thiscorrelationisextremelyhigh,possiblybecausethepolitical
democracystatusdonotchangemuchduringthose5years.
39
How Is the Model Fit?
Fit Summary
40
Wehavelookedattheestimatesandconcludedthattherelationshipsbetweenthe
factorsandthevariablesarestrongandsignificant.Thoseresultsvalidatedthe
individualfactorvariablerelationships.
Togainsupportfortheoverallconfirmatoryfactormodel,youwouldwanttoexamine
themodelfitstatistics.ThistableshowsvariousfitindicescomputedbyPROCCALIS.In
theSEMfield,alargenumberoffitindiceshavebeenproposed. Thereisnoconsensus
astowhichindicesarebesttoreportintheresearch.Butresearcherstendtoreport
someofthemostpopularonesintheirrespectivefields.
Becausealargenumberofindicesmightbeconfusing,PROCCALIS providesawayto
customizethisfitsummarytable.
40
Using the FITINDEX Statement to Customize the
Fit Summary Output
41
YoucanusetheFITINDEXstatementtocustomizeyourfitsummary table.
UsetheON(ONLY)=optiontoselectyourfavorite fitindices.
UsetheNOINDEXTYPEoptiontosuppresstheprintingofthefitindextypes.
41
Customized Fit Summary Table
Fit Summary
Chi-Square 44.4686
Chi-Square DF 19
Pr > Chi-Square 0.0008
Hoelter Critical N 51
Standardized RMSR (SRMSR) 0.0494
Adjusted GFI (AGFI) 0.7457
RMSEA Estimate 0.1346
Bentler Comparative Fit Index 0.9403
42
ThisisthecustomizedfitsummaryoutputbyusingthepreviousFITINDEXstatement
specifications.
Thistablecontainsthemorepopularfitindicesreportedinresearch(asrecognizedby
theauthor).
Inpractice,themodelfitchisquaremodelstatistic,itsdf,andthecorrespondingp
valueareroutinelyreportedeventhoughveryfewresearchersin thefieldwoulduse
themodelfitchisquarealonetojudgemodelfit.Asshowninthistable,thepvalueis
verysmallsothatstatisticallyitmeansthatthehypothesizedmodelshouldberejected.
However,itisaknownissueinSEMthatevenveryusefulSEMmodelswithminimum
departuresfromthedatawouldberejectedstatistically.Therefore,researchersinthe
SEMfieldtendtofocusmoreonotherfitindicestojudgemodel fit.
TheSRMSR,AGFI,RMSEA,andCFIarefourofthemostpopularfit indicesintheSEM
field.Seetheglossarypageforthedescriptionsofthesefitindices.FortheSRMSRand
RMSEA,thesmallerthevaluesthebetterthefit.Usually,valuesunder0.05indicate
goodmodelfit.Therefore,theSRMSRsaysthatthecurrentmodel isgood,butthe
RMSEAsaysthatthecurrentmodelisbad.FortheAGFIandBentlers CFI,thelargerthe
valuesthebetterthemodelfit.Therefore,theAGFIsaysthatthecurrentmodelisbad,
buttheCFIsaysthatitisgood.Becausetheseindicesdonotconsistentlyindicatea
goodmodelfit,itissafetosaythatthecurrentCFAmodelispromising,butitneeds
furtherconfirmation.
42
A Confirmatory Factor Model
with Loading Constraints
43
43
Constraining the Path Coefficients (Loadings)
1. Press60
lam1 Freop60
Dem60 lam2
lam3 Fair60
lam4 Legis60
1. lam4 Legis65
44
Inadditiontofittingabasicconfirmatoryfactormodel,PROCCALISenablesyoutoset
upparameterconstraintseasily.Themaintoolistouseparameternamesinthe
specification.
Forthepoliticaldemocracyexample,theresearcherwantstoconstraintthefactor
loadings(pathcoefficients)acrosstime.Thetheoreticalreason isthatbasicallythe
measuredvariablesarethesameinthetwoyears.Inthepathdiagram,youcan
representequalityconstraintsbyputtingthesameparameternamesorlabelstothe
pairsoftherelatedpaths.Forexample,lam1istheloadingofPress60onDem60.Itis
alsotheloadingofPress65onDem65.Similarly,youcansetthe other3setsof
constraintsinthepathdiagram.
44
Fitting a CFA Model with Constraints on the Loadings
45
InthePATHmodelinglanguage,theconstraintscouldbehandledsimilarly.Thecode
showninthisslideismodifiedfromthepreviouscodebyadding theparameternames
inthepaths.Thesyntaxistoaddanequalsignandthentheparameternamesafterthe
pathspecificationsinthePATHstatement.Withthesameparameternamesforthe
pairsoftherelatedpaths,theestimateswouldbeexactlythesame.
45
Fit Summary Table for the Political Democracy Data
with Loading Constraints
Fit Summary
Chi-Square 46.8893
Chi-Square DF 23
Pr > Chi-Square 0.0023
Hoelter Critical N 56
Standardized RMSR (SRMSR) 0.0714
Adjusted GFI (AGFI) 0.7844
RMSEA Estimate 0.1185
Bentler Comparative Fit Index 0.9440
46
Thistableshowsthefitsummaryofthemodelwiththeloadingconstraints.Becauseof
theconstraints,thismodeldoesnotfitaswellastheprevious model.TheSRMSRis
largerthan0.05.TheAGFIismuchsmallerthan0.9.TheRMSEAismuchlargerthan0.05.
Alltheseshowabadmodelfit.However,Bentlers CFI(0.94)showsagoodmodelfit.
46
Estimates of the Constrained Loadings for the
Political Democracy Data
PATH List
Standard
----------Path---------- Parameter Estimate Error t Value
47
Asrequiredfromthemodel,pathswiththesameloadingparameterhavethesame
estimates.Forexample,bothDem60>Press60andDem65>Press65havealoading
estimateof2.14(lam1).Allloadingestimates,again,arestatisticallysignificant.This
showsthatallthepurportedfactorvariablerelationshipsaresupported.
47
Estimates of Variances and Covariances for the Political
Democracy Data with Loading Constraints
Variance Parameters
Variance Standard
Type Variable Parameter Estimate Error t Value
Standard
Var1 Var2 Parameter Estimate Error t Value
Alltheerrorvarianceestimatesarealsosignificant.ThecorrelationbetweenDem60and
Dem60isveryhighandsignificant.
48
A Confirmatory Factor Model
with Correlated Errors
49
49
Adding Error Covariances
1. Press60
lam1 Freop60
Dem60 lam2
lam3 Fair60
lam4 Legis60
Press65
lam1 Freop65
Dem65 lam2
lam3 Fair65
1. lam4 Legis65
50
Withtheloadingconstraints,youobservedaworsemodelfit.
Thefourequalityconstraintsontheloadingsyoubasicallyreducethenumberofmodel
parametersby4.Thisnaturallyleadstoaworsemodelfitthanifyouwouldallowallthe
loadingstobefreelyestimated.
Nowyouconsideranoppositedirection.Insteadofreducingparametersbyputtingequality
constraints,youwanttoaddmoreparameterstothemodel.Addingmoreparameterstoyour
modelwouldimprovethemodelfit.Butthedrawbackofaddingmoreparametersisthatit
makesyourmodelmorecomplicated,whichisusuallyjudgedasan undesirablepropertyfora
scientifictheory.Itdoesnotmeanthatyoucannotaddparameters.Itonlymeansthatyou
shouldaddonlythoseparametersthatcouldbejustifiedbytheoreticalorsubstantivereasons.
Inthisexample,ithasbeenarguedthatfreedomofgroupoppositionandtheelectivenatureof
thelegislativebodyhaveapartoftheircorrelationthatisbeyondtheircommonlatentfactors
couldexplain(seeBollen).InSEM,thisextra correlationisconceptualizedasacorrelation(or
covariance)betweentheerrorsofthetwovariables.Inthepath diagram,thiserrorcovarianceis
representedbyadoubleheadedarrowconnectingthetwovariables.Thatis,Freop60and
Legis60areconnectedbyadoubleheadedarrowin1960.Bythesameargument,Freop65and
Legis65arealsoconnectedbyadoubleheadedarrowtorepresenterrorcovariance.
Inaddition,itisarguedinBollen thateachofthevariablepairsthatwereofthesamenaturebut
weremeasuredatdifferenttimeshaveapartofcorrelationthat isbeyondtheircommonlatent
factorscouldexplain.Forexample,Press60andPress65areconnectedbyadoubleheaded
arrowtorepresenttheirerrorcovariance,whichexplainsthepartofthecovariancebetween
thetwovariablesthatisbeyondtheexplanationbythecovariancebetweenDem60andDem65.
Similarly,theFreop,Fair,andLegis pairsareallconnectedbydoubleheadedarrowsto
representerrorcovariances.
50
Adding Error Covariances (with Error Terms
Displayed)
1.
1. Press60 e1
1.
lam1 Freop60 e2
Dem60 lam2 1.
Fair60 e3
lam3
1.
lam4 Legis60 e4
1.
Press65 e5
1.
lam1 Freop65 e6
Dem65 lam2
1.
lam3 Fair65 e7
1.
1. lam4 Legis65 e8
51
Thepathdiagraminthisslideisequivalenttothepreviousrepresentationthatdoesnot
useexpliciterrorvariables.
Inthispathdiagram,errortermsforthemeasuredvariablesare shown.Thedouble
headedarrowsareshiftedtotheerrorterms.Thismakesitobviousthatthosedouble
headedarrowsarecovariances betweentheerrorvariables(butnotaspartial
covariances betweentheobservedvariables,asshowninthepreviousslide).
Therefore,thispathdiagramrepresentationisconceptuallycleareraboutwhatare
reallybeingcorrelatedinthemodel.However,theadditionoftheerrortermsmakes
thepathdiagrammorecluttered.Inthisworkshop,Iwouldstick withthepathdiagram
representationthatdoesnotuseexpliciterrorterms.
51
Fitting a CFA Model with Loading Constraints
and Correlated Errors
proc calis data=polidem;
path
Dem60 ---> Press60 = lam1,
Dem60 ---> Freop60 = lam2,
Dem60 ---> Fair60 = lam3,
Dem60 ---> Legis60 = lam4,
Dem65 ---> Press65 = lam1,
Dem65 ---> Freop65 = lam2,
Dem65 ---> Fair65 = lam3,
Dem65 ---> Legis65 = lam4; Use the PCOV statement to
pvar specify error covariances.
Dem60 = 1, Dem65 = 1;
pcov
Freop60 Legis60, Freop65 Legis65,
Press60 Press65, Freop60 Freop65,
Fair60 Fair65, Legis60 Legis65;
fitindex on(only) = [chisq df probchi rmsea cn srmsr
bentlercfi agfi] noindextype;
run;
52
Withthesixadditionalpairsofcorrelatederrors,youhavesix moreerrorcovariance
parametersinthemodel.
InthePATHmodelinglanguage,youcanspecifythesecovarianceparametersinthe
PCOVstatement.Inthisexample,thismeansthatyouenumeratethesixpairsof
measuredvariablesinthePCOVstatement.Forexample,thefirst pairisFreop60and
Legis60,whichrepresentacovarianceparameterbetweentheirerrorterms.
52
Fit Summary Table for the CFA Model with Loading
Constraints and Correlated Errors
Fit Summary
Chi-Square 15.1946
Chi-Square DF 17
Pr > Chi-Square 0.5815
Hoelter Critical N 135
Standardized RMSR (SRMSR) 0.0590
Adjusted GFI (AGFI) 0.9043
RMSEA Estimate 0.0000
Bentler Comparative Fit Index 1.0000
53
Thismodelissupposedtofitbetterbecauseoftheaddedparametersfortheerror
covariances.
Infact,themodelfitchisquareisnotstatisticallysignificant.Thissupportsthe
hypothesizedmodelinthepopulation.
Allotherfitindicesshowgoodorexcellentfit.TheSRMSRis0.059,whichisonlyslightly
largerthanthe0.05criterion.TheAGFIis0.90,whichisanindicationofgoodmodelfit
byconvention.TheRMSEAisessentiallyzero,whichisthesmallestRMSEAyoucould
everget.TheCFIis1,whichisalsothelargestCFIyoucouldeverget.
53
Estimates of the Loadings for the CFA Model with
Constrained Loadings and Correlated Errors
PATH List
Standard
----------Path---------- Parameter Estimate Error t Value
54
Allloading(pathcoefficients)estimatesarestatisticallysignificant,supportingthe
relationshipsbetweenthelatentfactorsandthemeasuredvariables.
54
Estimates of the Variances for the CFA Model with
Constrained Loadings and Correlated Errors
Variance Parameters
Variance Standard
Type Variable Parameter Estimate Error t Value
55
Allerrorvarianceestimatesaresignificant.
55
Estimates of the Covariances for the CFA Model with
Constrained Loadings and Correlated Errors
Covariances Among Exogenous Variables
Standard
Var1 Var2 Parameter Estimate Error t Value
Thefirsttableshowsthecorrelationbetweenthetwolatentfactors.Again,the
correlationisveryhighandsignificant.
Thesecondtableshowstheestimatesforthenewlyaddedcovariances betweenerrors.
Threeofthesecovariances aresignificant,whiletheothersarenot.Forexample,
Freop60andLegis60,Freop65andLegis65,andFreop60andFreop65arethreeerror
covariances thathavetvalueslargerthan1.96.Theotherthreepairshave insignificant
tvalues.Thismeansthataddingthesethreecovariances mightbesomewhat
undesirablebecausetheirestimatesareactuallynotsignificantlydifferentfromzero,
castingdoubtsabouttheirpresenceinthemodel.
Thelessonhereisthateventhoughaddingerrorcorrelations(orcovariances)might
improvethemodelfit,youshouldnotroutinelyadderrorcovariances onlytoboostthe
modelfit.Addingunjustifiederrorcovariances makesyourmodelmorecomplicatedand
hardertointerpret,especiallywhensomeerrorvarianceestimatesturnouttobe
insignificant.
56
Political Democracy and
Industrialization:
A Full Structural Equation Model
57
57
Political Democracy and Industrialization
58
Wecontinuewiththepreviousmodelandaddonemorelatentfactoranditsindicators
intothemodel.
Thisexampleillustratesafullstructuralequationmodel(orafullLISREL)model.
Essentially,thismeansthatourfocusisnotonlyonvalidating therelationshipsbetween
thelatentfactorsandthemeasuredvariables(thatis,themeasurementmodel),but
alsoonvalidatingthefunctionalrelationshipsamonglatentvariables(thatis,the
structuralmodel).
Forexample,youhavealatentfactorcalledindustrialization(Induct)thatissupposedto
bereflectedbythreeobservedvariables:grossnationalproduct percapita(Gnppc60),
energyconsumptionpercapita(enpc60),andpercentoflaborforceinindustrial
occupations(Indlf60).Allthesevariablesweremeasuredin1960.
Theindustrialization(Induct)latentvariableservesasapredictorofthetwodemocracy
factors(Dem60andDem65).Thiskindoffunctionalrelationships betweenlatent
variableshasnotbeenexploredintheconfirmatoryfactormodelsdiscussedpreviously.
58
Political Democracy and Industrialization:
The Path Diagram
Press60
1. Freop60
Dem60 lam2
lam3 Fair60
Gnppc60 1. lam4 Legis60
Enpc60 Indust
Press65
Indlf60
1. Freop65
Dem65 lam2
lam3 Fair65
lam4 Legis65
59
TheentireSEMmodelisdepictedinthepathdiagramofthecurrentslide.Themost
notableadditionisthepathsfromIndust toDem60andDem65 industrializationin
1960servesasapredictorofdemocracyinboth1960and1965.Threeobserved
variablesserveasindicatorsoftheindustrialization:Gnppc60, Enpc60,andIndlf60.
Therearetwomainmodificationsfromtheprecedingconfirmatory factormodel.
First,insteadofallowingDem60andDem65tofreelycovary intheCFA,thecurrent
modeltreatsDem60asapredictorofDem65.
Second,adifferentmethodforidentifyingthelatentfactorscalesisusedinthecurrent
model.IntheprecedingCFAmodel,variancesofDem60andDem65arefixedtoone.
Butbecausetheybecomeendogenousinthecurrentmodel,youcan nolongerusethis
typeofscaleidentificationmethod.Instead,oneoftheirobservedindicatorvariables
(thatis,Press60andPress65)nowhasafixedpathcoefficientatone.Similarly,thepath
coefficientfromIndust toGnppc60isfixedtooneforscaleidentification.
59
Fitting the Structural Equation Model for the Political
Democracy and Industrialization Data
60
YoucanusePROCCALIStospecifythisstructuralequationmodel easily.
InthePATHstatement,Iuseamultiplepathspecificationsyntax.Inthefirst
specification,Dem60isapredictorof4outcomevariables:Press60,Freop60,Fair60,
andLegis60.Thisspecifiesfourpathsinasinglepathspecification.Afterusinganequal
sign,Ispecifyfourparametersforthefourpaths.Thefirstoneisafixedconstant1,
whichisappliedtotheDem60>Press60path.Thesecondoneisafreeparameter
lam2,whichisappliedtotheDem60>Freop60path,andsoon.
Inthenext3pathspecifications,Ialsousethemultiplepathspecificationsyntax.The
secondmultiplepathsyntaxspecifiesthatDem65isafactorwithfourindicators.The
pathcoefficients(loadings)arealsospecifiedexplicitly.Thethirdmultiplepathsyntax
specifiesthatIndust isafactorofthreeobservedindicators,withafixedoneforthe
effectofIndust onGnppc60.ThepathcoefficientsforthepathsIndust>Enpc60and
Indust>Indlf60areunnamedfreeparameters(withtheemptyspecifications).The
fourthmultiplepathsyntaxspecifiesthatIndust isapredictorofbothDem60and
Dem65.Thecorrespondingpathcoefficientsare(unnamed)freeparametersinthe
model.
ThelastpathinthePATHstatementspecifiesDem60asapredictorofDem65.Notice
thatnoPVARstatementisusedbecausefixingtheDem60andDem65variancestoone
isnotusedinthecurrentmodel.Thescalesofthelatentfactorsareidentifiedbyfixing
somepathcoefficientsto1.
60
Political Democracy and Industrialization:
Fit Summary Table
Fit Summary
Chi-Square 39.6438
Chi-Square DF 38
Pr > Chi-Square 0.3966
Hoelter Critical N 100
Standardized RMSR (SRMSR) 0.0558
Adjusted GFI (AGFI) 0.8606
RMSEA Estimate 0.0242
Bentler Comparative Fit Index 0.9975
61
Thefitofthestructuralmodelisacceptable,ifnotexceptionallygood.
Themodelfitchisquareisnotsignificant,supportingthehypothesizedmodel.The
SRMSRiscloseto0.05.TheAGFIis0.86,whichshowsareasonablefit.TheRMSEA
indicatesaverygoodmodelfit,asthevalue(.0242)ismuchlowerthan0.05.TheCFIis
almost1,whichshowsaperfectmodelfit.
61
Political Democracy and Industrialization:
Estimates of Path Coefficients
PATH List
Standard
----------Path----------- Parameter Estimate Error t Value
62
Allpathcoefficientsaresignificantaprettygoodsign.
62
Political Democracy and Industrialization:
Estimates of Variances
Variance Parameters
Variance Standard
Type Variable Parameter Estimate Error t Value
63
Someerrorvariancesarenotsignificant:Enpc60andDem65.Enpc60isanindicatorof
theIndustrializationin1960.ThisinsignificanterrorvariancemeansthattheIndust
factorpredictEnpc60perfectly.However,thecorrespondingtvalueis1.71,whichcould
bejudgedasmarginallysignificant.
TheerrorvarianceforDem65isalsonotsignificant,asevidentbythenonsignificantt
valueof0.72.ThismeansthatgivenIndust andDem60,Dem65canbepredictedalmost
perfectly.
63
Political Democracy and Industrialization:
Estimates of Covariances
64
Again,therearesomeinsignificanterrorcovariances.Thisresultchallengestheir
presenceinthemodel.
64
Political Democracy and Industrialization: Squared
Multiple Correlations
Error Total
Variable Variance Variance R-Square
65
Thesquaremultiplecorrelationsareusuallyusedtomeasurethe percentageof
overlappingvariancebetweenthepredictorsandtheoutcomevariables.Inthecurrent
example,Rsquaresrangefrom0.2toextremehighvaluessuchas0.95and0.96.
ThesmallestRsquareistheoneforpredictingDem60,whichis0.2.Thisactuallyisnot
thatsmallanRsquarevalueforsocialsciencedata.
ButtheRsquare(0.96)forDem65isextremelyhigh.ThismeansthatDem65 isalmost
perfectlypredictedfromdemocracyandindustrializationin1960.
65
A LISREL Model for the Political Democracy and
Industrialization Data
Structural Measurement
model model Press60
Measurement 1. Freop60
model
Dem60 lam2
lam3 Fair60
Gnppc60 1. lam4 Legis60
Enpc60 Indust
Press65
Indlf60
1. Freop65
Dem65 lam2
lam3 Fair65
Measurement
lam4 Legis65
model 66
ThisfullSEMmodelisalsoagoodillustrationoftheLISRELmodel.
Thepathdiagramfortheprecedingmodelremainsunchangedhere. Inordertocallthis
pathdiagramaLISRELmodel,youhavetoidentifytheLISRELcomponentsinthispath
diagram.ThetwomaincomponentsinLISRELarethemeasurementmodelsandthe
structuralmodel.
First,themeasurementmodelsareidentified.Ameasurementmodelisabouthow
observedvariablesarerelatedtothelatentvariablesorconstructsinthemodel.
Specifically,themeasurementmodelinvolvingindustrializationisthemeasurement
modelofxbecauseIndust servesasanexogenous(independent)variableinthepath
diagram.ThemeasurementmodelinvolvingDem60andDem65isthemeasurement
modelofybecauseDem60andDem65areendogenous(dependent)variablesinthe
pathdiagram.
Second,thestructuralmodelisidentifiedandhighlightedinthecenterofthepath
diagram.Thestructuralmodeldescribesthefunctionalrelationshipsamongthelatent
variables(constructs)inthepathdiagram.
Therefore,alltheessentialcomponentoftheLISRELmodelisidentifiedinthecurrent
pathdiagram.
66
Fitting the Structural Equation Model for the Political
Democracy and Industrialization Data
Measurement model: Relationships between
proc calis data=polidem; latent and observed indicators
path
Dem60 ---> Press60 Freop60 Fair60 Legis60 = 1. lam2 lam3 lam4,
Dem65 ---> Press65 Freop65 Fair65 Legis65 = 1. lam2 lam3 lam4,
Indust ---> Gnppc60 Enpc60 Indlf60 = 1.,
Indust ---> Dem60 Dem65, Structural model: Relationships
among latent constructs
Dem60 ---> Dem65;
pcov
Freop60 Legis60, Freop65 Legis65,
Press60 Press65, Freop60 Freop65,
Fair60 Fair65, Legis60 Legis65;
fitindex on(only) = [chisq df probchi rmsea cn srmsr
bentlercfi agfi] noindextype; Measurement model
run;
67
InthePATHmodelinglanguage,youcanalsoidentifythecodeforthemeasurement
modelsandthestructuralmodel.Theprecedingcodeisrecitedhereforillustrations.
InthePATHstatement,thefirstthreemultiplepathspecificationsareconcernedwith
themeasurementofthelatentconstructs.Inaddition,allspecificationsinthePCOV
statementareforthecovariances ofthemeasurementerrors.
ThelasttwospecificationsinthePATHstatementareforthestructuralmodel.They
describethefunctionalrelationshipsbetweenIndust,Dem60,andDem65.
AfteridentifyingtheLISRELcomponentsinthepathdiagramandintheSAScode,you
nowhaveacluetospecifytheLISRELmodelinPROCCALIS.Essentially,itshouldbe
clearthatthesamepathdiagramisbeingusedbythePATHmodelinglanguageandthe
LISRELmodel.TheonlytaskistotranscribethecodeinthePATHmodelinglanguageto
thelanguagefortheLISRELmodel.
67
A LISREL Model Specified by the LISMOD
Modeling Language of PROC CALIS
proc calis data=polidem nose noparmname; NOSE: No standard errors
lismod NOPARMNAME: No parameter names in the
output
xvar = Gnppc60 Enpc60 Indlf60,
yvar = Press60 Freop60 Fair60 Legis60 Press65 Freop65 Fair65 Legis65,
xi = Indust, Measurement model for
eta = Dem60 Dem65; y
matrix
_LambdaY_ [ 1, @1] = 1. lam2 lam3 lam4, /* Paths from Dem60 and Dem65 to yvar */
[ 5, @2] = 1. lam2 lam3 lam4;
matrix
_ThetaY_ [ 4, 2], [ 8, 6], /* pcov statement in the path model */
[ 5, 1], [ 6, 2], [ 7, 3], [ 8, 4];
matrix
_LambdaX_ [ 1, 1] = 1., /* Path from Indust to xvar*/
[ 2 to 3, 1]; Measurement model for
x
matrix
_Gamma_ [ 1 to 2, 1]; Structural
matrix model
_Beta_ [2,1];
run;
68
PROCCALISsupportsthesocalledLISMODmodelinglanguage.Inordertofullyunderstandthe
PROCCALIScodefortheLISRELmodel,knowledgeaboutmatrixalgebraisneeded.ButIwillonly
describethecodeinaconceptualway.
IntheLISMODstatement,youfirstclassifyyourvariablesintooneofthefourcategories:
1.xvariables:observedindicatorsfortheexogenous(independent)latentfactorsinthemodel.
2.yvariables:observedindicatorsfortheendogenous(dependent)latentfactorsinthemodel.
3.xivariables:exogenous(independent)latentfactorsinthemodel.
4.etavariables:endogenous(dependent)latentfactorsinthemodel.
TheLISRELmodelorLISRELprogramhadbeendevelopedasamatrixbasedlanguage.
Parametersinthemodelsarespecifiedasmatrixelementsinsomespecificmodelmatriceswith
Greeknames.PROCCALISsupportsthematrixinputoftheseLISRELmodelmatrices.For
example,inthemeasurementmodelfory,_LambdaY_isthematrixthatrelatestheyvariables
totheetavariables.InsteadofspecifyingthepathsasinthePATHstatement,theMATRIX
statementfor_LambdaY_servesthesamepurposeintheLISMODmodelinglanguage.The
MATRIXstatementfor_ThetaY_specifiestheerrorvariancesandcovariances oftheyvariables,
muchlikethespecificationsofthePCOVstatementinthePATHmodelinglanguage.Inother
words,thePATHmodelspecificationsaretranscribedintotheLISMODmodelspecificationsfor
theyvariables.
Similarly,theMATRIXstatementfor_LAMBDAX_specifiestheparametersinthemeasurement
modelforthexvariables.
Finally,thestructuralrelationshipsorthepathrelationshipsamongthelatentfactorsare
specifiedintheMATRIXstatementsforthe_GAMMA_and_BETA_matrices.
Tosimplifytheoutput,IusedtwooptionsinPROCCALISstatement.TheNOSEoption
suppressestheprintingofstandarderrorsandtheNOPARMNAMEoptionsuppressesthe
printingoftheparameternames.
68
LISMOD Output for the Political Democracy and
Industrialization Data (Measurement for y)
_LAMBDAY_ Matrix
Dem60 Dem65
Press60 1.0000 0
Freop60 1.1908 0
Fair60 1.1745 0
Legis60 1.2510 0
Press65 0 1.0000
Freop65 0 1.1908
Fair65 0 1.1745
Legis65 0 1.2510
69
ThefollowingfewslidesshowtheoutputfromPROCCALISforthe LISRELmodel.Allthe
resultsarematrixoriented.DetailsfortheseresultshavebeendiscussedforthePATH
modeloutputandwillnotberepeatedhere.Ingeneral,youcanfindcorrespondence
betweentheLISMODandthePATHresults.
Thisslideshowsthemeasurementmodelfortheyvariables.
69
LISMOD Output for the Political Democracy and
Industrialization Data (Measurement for y)
_THETAY_ Matrix
70
Thisslideshowsthemeasurementerrorvariancesandcovariances fortheyvariables.
70
LISMOD Output for the Political Democracy and
Industrialization Data (Measurement for x)
_LAMBDAX_ Matrix
Indust
Gnppc60 1.0000
Enpc60 2.1797
Indlf60 1.8182
_THETAX_ Matrix
Gnppc60 0.0825 0 0
Enpc60 0 0.1221 0
Indlf60 0 0 0.4730
Thisslideshowstheresultsofthemeasurementmodelforthexvariables,includingthe
pathcoefficientsandtheerrorvariances.
71
LISMOD Output for the Political Democracy and
Industrialization Data (Structural Model)
_BETA_ Matrix
Dem60 Dem65
Dem60 0 0
Dem65 0.8650 0
_GAMMA_ Matrix
Indust
Dem60 1.4713
Dem65 0.6005
72
Thisslideshowsthefunctionalrelationshipsbetweenlatentconstructs.
72
LISMOD Output for the Political Democracy and
Industrialization Data (Structural Covariances)
_PSI_ Matrix
Dem60 Dem65
Dem60 3.9277 0
Dem65 0 0.1667
_PHI_ Matrix
Indust
Indust 0.4547
Note: All variances (diagonal elements in _PSI_ and _PHI_) were set by default.
73
Thisslideshowstheerrorvariancesoftheetavariableandthevarianceofthexi
variable.
73
Features of the LISMOD Modeling Language
74
Insum,theLISMODmodelinglanguageinPROCCALISsupportstheLISRELmodelbyproviding
syntaxtospecifytheessentialcomponentsoftheLISRELmodel.However,LISMODitselfdoesnot
interpretaLISRELprogram.
TheLISMODmodelinglanguageinPROCCALISalsosupportsthemeanstructureanalysis.Thisis
donebyprovidingadditionalMATRIXstatementinputforthemean modelmatricesintheLISREL
model.
IfyouunderstandtheLISRELmodel,herearethreethingsyouinputbyusingtheLISMODlanguage:
1.Theorderedlistsofx,y,,and variables
2.MATRIXstatementstodefinefreeandfixedparameters
3.Namesforparameters(notrequiredforfreeparameters)
(Note:Thisslidewasupdatedaftertheprintingofthehandout)
TheDefaultcovariancestructureparametersintheLISMODlanguageare:
1.Diagonalelementsofallcovariancematrices(allvariances)
2.Lowertriangularelementsofthe_PHI_matrix(covariances ofthe variables)
Specifyingthedefaultparametersexplicitlyiscertainlyallowed,especiallywhenyouneedtoset
constraintsontheseparameters.
74
The Generalized
PATH Modeling Language
75
75
Political Democracy and Industrialization: Variances
and Covariances Are Paths
Press60
1. Freop60
Dem60 lam2
lam3 Fair60
Gnppc60 1. lam4 Legis60
Enpc60 Indust
Press65
Indlf60
1. Freop65
Dem65 lam2
lam3 Fair65
lam4 Legis65
Covariances are represented as double-headed
arrows.
76
TogeneralizethePATHmodelinglanguage,errorcovariances inthepathdiagramcould
alsobespecifiedaspaths inPROCCALIS.Infact,covariances inthepathdiagramare
alreadyrepresentedasdoubleheadedarrows,asshowninthepoliticaldemocracyand
industrializationexample.
76
An Example of the Generalized
PATH Modeling Language
PROC CALIS DATA=polidem;
PATH
Dem60 ---> Press60 Freop60 Fair60 Legis60 = 1. lam2 lam3 lam4,
Dem65 ---> Press65 Freop65 Fair65 Legis65 = 1. lam2 lam3 lam4,
Indust ---> Gnppc60 Enpc60 Indlf60 = 1.,
Indust ---> Dem60 Dem65,
Dem60 ---> Dem65; Use the PCOV statement to specify
PCOV the covariances
Freop60 Legis60, Freop65 Legis65,
Press60 Press65, Freop60 Freop65,
Fair60 Fair65, Legis60 Legis65;
ThetoppanelshowstheuseofPCOVstatementtospecifythecovariances.Thebottom
panelshowsthatthesecovariances arespecifiedasdoubleheadedpaths,which
resembletheirrepresentationsinthepathdiagram.
ThetwoPROCCALISspecificationsshownaboveareequivalent.
77
Political Democracy and Industrialization: Output with
Generalized Paths
PATH List
Standard
-----------Path----------- Parameter Estimate Error t Value
78
TheresultsobtainedfromPROCCALISnowshowsthecovarianceestimatesaspaths
inthePATHlist.Thistablecouldbeuseddirectlyinresearchpaperforreportingthe
SEMestimationresults.
78
Features of the Generalized PATH Modeling
Language
79
Insum,thegeneralizedpathmodelinglanguageenablesyoutospecifyalltypesof
arrowsinthepathdiagramaspaths, includingthevariance,covariance,intercept,and
meanparameters.
VarianceofYisapathlikeY<>Y.
CovariancebetweenXandYisapathlikeX<>Y
MeanorinterceptforYisonepathlike1>Y.
79
Default Free and Fixed Parameters in PROC CALIS
The main purpose of setting default parameters is to enable you to specify only the
functional relationships among variables in most practical applications.
80
KnowingthedefaultfreeandfixedparametersinPROCCALISareusefulbecauseit
enhancesthecodingefficiencyandaccuracy.Hereisalistofdefaultfreeparameters
andfixedzerosusedinPROCCALIS:
(Thisslidehasbeenchangedslightlyaftertheprintingofthehandout)
Defaultfreeparameters
o Variancesofandcovariances amongallexogenous (independent)variables
(observedorlatent,exceptforerrorterms)
o Errorvariancesofallendogenous (dependent)variables
o Meansorinterceptsofallobserved variables
Defaultfixedzeros
o Unspecifiedpathsanderrorcovariances
o Meansorinterceptsofalllatent variables
Atthefirstglance,itmightseemtobetediousanddemandingthatmodelersmust
rememberallthesedefaultparameterrulestospecifyanSEMaccurately.However,the
defaultparameterizationusedinPROCCALISmatchesthatofregressionanalysisandit
isdesignedwiththefollowingmainpurposeinmind:Inmostpracticalapplications,you
wouldonlyneedtospecifythefunctionalrelationshipsamongvariables(thatis,the
singleheadedpathsinthepathdiagram)andthefixedvariancesofthe latentvariables.
80
Multiple-Group Analysis
81
MultiplegroupanalysisrepresentsanimportantclassofSEMapplications.
81
PROC CALIS Syntax for Specifying a
Multiple-Group Analysis
proc calis;
group 1 / data=ds1; /* ds1: data set for Group 1 */
group 2 / data=ds2; /* ds2: data set for Group 2 */
model 1 / group=1; /* Group 1 is fitted by Model 1 */
run;
82
TheskeletonofthemultiplegroupsyntaxofPROCCALISisshowninthisslide.
Inthisspecification,youhavetwoindependentgroups,whicharestoredinseparate
datasetsds1 andds2. Youdefinetwomodels:Model1andModel2,respectively,for
fittingtothetwogroups.
UndereachMODELstatement,youspecifythemodelbyoneofthemodelinglanguages
supportedbyPROCCALIS.Forexample,aPATHstatementoraLISMODstatement.
82
Group-Model Mapping and Model Referencing
proc calis;
group 1 / data=ds1;
group 2 / data=ds2; Both Groups 1 and 2 are fitted by
group 3 / data=ds3; Model 1.
model 1 / group = 1,2;
path
x1 ---> x2-x5,
x2 ---> x3-x5,
x3 ---> x5 = effect1;
Model 2 makes reference to Model 1,
model 2 / group = 3; but with a unique parameter for the
refmodel 1; path from x3 to x5. All other paths
are constrained between the two
path models.
x3 ---> x5 = effect2;
run;
Note: REFMODEL does not constrain default free parameters in the two models.
It constrains only those parameters that are specified explicitly.
83
Thisexampleshowsthreegroupsofdata.Model1isfittedtoGroups1and2bythe
PATHmodelspecifiedimmediatelyafter.Model2isfittedtoGroup3.TodefineModel
2,aREFMODELstatementisused.REFMODEL1meansthatthespecificationsinModel
1isreferencedhere.Thismeansthatallexplicit specificationsinModel1arecopied
intothecurrentmodel.Forexample,allthePATHstatementspecificationsinModel1
arecopiedintoModel2.However,REFMODELallowsmodificationsfromthereference
model.Inthisexample,thex3>x5pathisrespecifiedwithanewpathcoefficient
calledeffect2. Thiseffect2 inModel2isadifferentparameterthaneffect1 inModel
1,eventhoughbothmodelhavethisx3>x5path.Exceptforthisx3>x5path,Models
1and2sharealltheremainingspecifiedpathswiththesamesetofpathcoefficients.
NoticethatbyusingtheREMODELstatementtoconnectmodels,all(andonly)the
explicit specificationsinthemodelsareconstrained(unlessbeingmodifiedorre
specified).Theconstraints,however,willnotapplytodefaultparameters(assumingthat
theyarenotexplicitlyspecified).Toconstrainthedefaultparameters,theywillhaveto
bespecifiedexplicitlyintheMODELdefinitions.
83
Completely Constrained CFA Models for Male
and Female Groups
Males Females
v1 v1
Visual ev1 ev1
1. 1. Visual
a2 ev2 a2
Spatial Cubes Spatial Cubes ev2
a3 a3
Lozenges ev3 ev3
Lozenges
cv cv
a5 ev5 a5
Verbal Sentence Verbal Sentence ev5
a6 a6
WordMean ev6 ev6
WordMean
v2 v2
Data source: Holzinger and Swineford1939 (an example used in Arbuckle 2008) 84
Letususeanexampletoillustratethemultiplegroupanalysis.TheHolzinger and
Swineford dataaresued.ThisdatasetisalsousedinArbucklesAMOSmanual.
Inthisresearch,visualandverbaltestscoreswereobserved.Visual,Cubes,and
Lozengesarespatialtests.Paragraph,Sentence,andWordMean areverbaltests.CFA
modelswerehypothesizedforthetwogroups:onegroupisformalesandtheotherfor
females.Thebasicfactorstructuresforthegroupsarethesame.Twolatentfactors
Spatial andVerbal areassumedforthemeasuredvariables.Allparameters(including
thosedefaultparametersinPROCCALIS)inthetwomodelsforthegroupsarelabeledin
red.
Thisisacompletelyconstrainedmultiplegroupmodelbecausethetwopathdiagrams
formalesandfemalesareexactlythesame(thatis,bothhavethesamepathstructures
andthesamesetofparameters).
84
Different Methods to Specify the Completely
Constrained Models
1. Male and female groups fit by two models, but with all
parameters (including all default parameters) being
constrained in the models
2. Male and female groups fit by a single model
definition
3. Male and female groups fit by two models that are
constrained through the REFMODEL specification
85
InPROCCALIS,youcanuseoneofthefollowingthreewaystospecifythepreceding
completelyconstrainedmodel:
1.Maleandfemalegroupsfittedbytwomodels,butwithallparameters(includingall
defaultparameters)beingconstrainedinthemodels
2.Maleandfemalegroupsfittedbyasinglemodeldefinition
3.Maleandfemalegroupsfittedbytwomodelsthatareconstrained throughthe
REFMODELspecification
Iwilldescribeeachofthesemethods.
85
Method 1: Two Groups Fitted by the Two Completely
Constrained Models
proc calis;
group 1 / label='Males' data=males; Models are
constrained through
group 2 / label='Females' data=females; the use of the same
model 1 / group = 1; parameter names.
path
Spatial ---> Visual Cubes Lozenges = 1. a2 a3,
Verbal ---> Paragraph Sentence Wordmean = 1. a5 a6;
pvar
Visual Cubes Lozenges Paragraph Sentence Wordmean = ev1-ev6,
Spatial = v1, Verbal = v2;
pcov
Spatial Verbal = cv;
model 2 / group = 2;
path
Spatial ---> Visual Cubes Lozenges = 1. a2 a3,
Verbal ---> Paragraph Sentence Wordmean = 1. a5 a6;
pvar
Visual Cubes Lozenges Paragraph Sentence Wordmean = ev1-ev6,
Spatial = v1, Verbal = v2;
pcov
Spatial Verbal = cv;
fitindex on(only) = [chisq df probchi rmsea srmsr aic
bentlercfi agfi] noindextype;
run;
86
Thefirstmethodistodefinetwomodelsforthetwogroups.The modelspecifications
underthetwoMODELstatementsmustbeexactlythesame.
Thismethodisintuitive,butalittleclumsybecauseyouneedtospecifyallparameters
withmatchingnamesinthemodels(althoughyoucancutandpastethemodel
specificationstoensureanexactcopy).Youalsoneedtospecifyeachparameterinthe
model,includingthedefaultparameters,whichyoumightsometimesmiss.
86
Method 2: Two Groups Fitted by the Same Model
proc calis;
group 1 / label='Males' data=males;
group 2 / label='Females' data=females;
Groups 1 and 2 are fitted
model 1 / group = 1, 2; by the same model.
path
Spatial ---> Visual Cubes Lozenges = 1. ,
Verbal ---> Paragraph Sentence Wordmean = 1. ;
fitindex on(only) = [chisq df probchi rmsea srmsr aic
bentlercfi agfi] noindextype;
run;
87
Thesecondmethodisverysimpleandintuitive.Youspecifyonemodelandfitthis
modeltothetwogendergroups.Thisensuresthegroupsarefittedexactlybythesame
model.
Theadvantageofthismethodisthatitissimple,intuitive,andnoparameternamesare
necessaryforconstrainingmodels.Also,youdonotneedtospecifyanyofthedefault
parametersexplicitlyforsettingupconstraints.
Thisisanidealspecificationmethodifthismodelisallwhatyouwanttofit.However,if
youaregoingtofitasequenceofmultiplegroupmodelstothegroups,youmightwant
toconsiderthenextmethod.
87
Method 3: Two Groups Fitted by the Two Models
That Are Constrained Through the REFMODEL
proc calis;
group 1 / label='Males' data=males;
group 2 / label='Females' data=females;
model 1 / group = 1;
path
Spatial ---> Visual Cubes Lozenges = 1. ,
Verbal ---> Paragraph Sentence Wordmean = 1. ;
pvar
Visual Cubes Lozenges Paragraph Sentence Wordmean
Spatial Verbal;
pcov
Spatial Verbal;
The REFMODEL statement makes reference
model 2 / group = 2;
to all explicit specifications in Model 1.
refmodel 1;
fitindex on(only) = [chisq df probchi rmsea srmsr aic
bentlercfi agfi] noindextype;
run;
ThethirdmethodconstrainsthemodelsbytheREFMODELstatement.Asdiscussed
previously,theREFMODELcopiesalltheexplicit specificationsfromthereference
modeltothetargetmodel.Inthisslide,allpathcoefficients, varianceparameters,and
covarianceparametersarespecifiedinModel1,whichisfittedtoGroup1(Males).
Model2,whichisfittedtoGroup2(Females),makesreferencetoModel1withoutany
modificationsorrespecifications.
Noticethatinordertocompletelyconstrainthetwomodelsforthetwogroups,all
parameters,includingthosecouldhavebeensetbydefaultbyPROCCALIS(e.g.,
specificationsinthePVARstatementandPCOVstatement),mustbespecifiedexplicitly
inModel1.Thisway,Model2willcopyalltheseparameterspecificationsviathe
REFMODELstatementspecification.
UnlikeMethod1,thismethoddoesnotrequiretheuseofparameternamesfor
constraintsacrossmodels.ConstraintsaredoneviatheREFMODEL statement.
AlthoughnotasintuitiveasMethod2,thismethodwouldbeusefulifyouneedto
considerfittingasequenceofmultiplegroupmodels,whichwillbeillustratedlater.
88
Fit Summary of the Completely Constrained
Multiple-Group Model
Fit Summary
Chi-Square 26.0154
Chi-Square DF 29
Pr > Chi-Square 0.6247
Standardized RMSR (SRMSR) 0.0968
Adjusted GFI (AGFI) 0.9235
RMSEA Estimate 0.0000
Akaike Information Criterion 52.0154
Bozdogan CAIC 103.7130
Schwarz Bayesian Criterion 90.7130
Bentler Comparative Fit Index 1.0000
89
Thecompletelyconstrainedmodelprovideagoodfitofthedata. Themodelfitchi
squareisnotsignificant.TheRMSEAisperfect,althoughtheSRMSRisnotverygood.
TheAGFIandtheCFIarealsogood.TheAIC,theCAIC,andtheSBCarealsoprinted.
Theseindicescannotbeinterpretedbytheirabsolutevalues,butwillbeusefulwhen
youcomparethefitsofdifferentmultiplegroupmodels.Youwillusetheseindicesto
selectthebest multiplegroupmodelforthedatalater.
89
Fitting Less Restrictive Multiple-Group Models
90
WehavefittedthecompletelyconstrainedmultiplegroupmodelbytheREFMODEL
method.Wecanfurtherfitlessconstrainedmultiplegroupmodelbymodifyingour
PROCCALIScode.
Wecanreleasetheconstraintsonerrorvariances.
Thenwecanreleasetheconstraintsonthestructuralcovariances (amonglatent
variables).
Finally,wecanreleasetheconstraintsonthepathcoefficients (orloadings).
Iamgoingtoshowthesestepbystep.
90
Release the Constraints on Error Variances
Males Females
v1 v1
Visual
1. 1. Visual
a2 a2
Spatial Cubes Spatial Cubes
a3 a3
Lozenges Lozenges
cv cv
1. Paragraph Paragraph
1.
a5 a5
Verbal Sentence Verbal Sentence
a6 a6
WordMean WordMean
v2 v2
Thepathdiagramsforthemultiplegroupmodelthatreleasestheconstraintsonthe
errorvariancesareshownabove.
Exceptfortheerrorvarianceparameters,alltheremainingparametersarelabeled.This
meansthatonlytheerrorvarianceparametersarenotinvariantacrossthemodelsfor
thegroups.
91
Releasing the Constraints on the Error Variances
proc calis;
group 1 / label='Males' data=males;
group 2 / label='Females' data=females;
model 1 / group = 1;
path
Spatial ---> Visual Cubes Lozenges = 1. ,
Verbal ---> Paragraph Sentence Wordmean = 1. ;
pvar
/* Visual Cubes Lozenges Paragraph Sentence Wordmean */
Spatial Verbal;
pcov
Spatial Verbal;
model 2 / group = 2;
refmodel 1;
fitindex on(only) = [chisq df probchi rmsea srmsr aic
bentlercfi agfi] noindextype;
run;
Comment out the error variance specifications in the PVAR statement, and let PROC CALIS
set two distinct sets of default error variances for the two models.
92
IntermsofPROCCALISspecification,thismeansthatthemodelforfemalesmakes
referencetothemodelformaleswithregardtothoseconstrainedparametersonly.
Thiscouldbedoneveryeasilyfrommodifyingthecompletelyconstrainedmultiple
groupmodel.AllyouneedtodoistocommentoutthePVARstatementspecifications
fortheobservedvariables.
WhenModel2makesreferencetoModel1,onlythoseexplicitspecificationswouldbe
constrainedbetweenthetwomodels.Becausetheerrorvariancesarenotspecifiedin
bothmodels(thatis,commentedoutfromthepreviouscode),PROCCALISwould
generatedifferentsetsofdefaulterrorvarianceparametersfor thetwomodels.In
otherwords,theerrorvarianceconstraintsarereleasedinthis PROCCALISspecification.
92
Fit Summary of the Multiple-Group Model with
Constraints on Loadings and Structural Covariances
Fit Summary
Chi-Square 22.0334
Chi-Square DF 23
Pr > Chi-Square 0.5182
Standardized RMSR (SRMSR) 0.0903
Adjusted GFI (AGFI) 0.9163
RMSEA Estimate 0.0000
Akaike Information Criterion 60.0334
Bozdogan CAIC 135.5913
Schwarz Bayesian Criterion 116.5913
Bentler Comparative Fit Index 1.0000
93
Themodelfitchisquarestillisnotsignificant,indicatingagoodmodelfit.TheRMSEA,
theAGFI,andtheCFIareallgood.However,theSRMSRdoesnotindicateagoodmodel
fit.
93
Release the Constraints on the Structural
Variances and Covariances
Males Females
Visual
1. 1. Visual
a2 a2
Spatial Cubes Spatial Cubes
a3 a3
Lozenges Lozenges
1. Paragraph Paragraph
1.
a5 a5
Verbal Sentence Verbal Sentence
a6 a6
WordMean WordMean
Common parameter names for the structural variances and covariance are removed. 94
Howaboutreleasingtheconstraintsonthestructuralcovariances?
Inthepathdiagram,onlythepathcoefficientsareconstrainednow(byusingthesame
setofparameternames).Thismeansthatonlythepatheffectsareinvariantacrossthe
modelsforthegroups.
94
Releasing the Constraints on the Error Variances
and Structural Covariances
proc calis;
group 1 / label='Males' data=males;
group 2 / label='Females' data=females;
model 1 / group = 1;
path
Spatial ---> Visual Cubes Lozenges = 1. ,
Verbal ---> Paragraph Sentence Wordmean = 1. ;
/*
pvar
Visual Cubes Lozenges Paragraph Sentence Wordmean
Spatial Verbal;
pcov
Spatial Verbal;
*/
model 2 / group = 2;
refmodel 1;
fitindex on(only) = [chisq df probchi rmsea srmsr aic
bentlercfi agfi] noindextype;
run;
Comment out the PVAR and PCOV statements, and let the PROC CALIS set
two distinct sets of default variances and covariances for the two models.
95
Thisnewmultiplegroupmodelcanbespecifyingbycommentedouttheexplicit
specificationsofthestructuralcovariances (variancesandcovariances amonglatent
variables)inModel1.
WhenModel2makesreferencetoModel1,itcopiestheexplicitspecificationsinthe
PATHstatementofModel1.Errorvariances,structuralvariances andcovariances inthe
twomodelsarenowsetbydefaultandareunconstrainedbetweenthetwomodels.
95
Fit Summary of the Multiple-Group Model with
Loading Constraints
Fit Summary
Chi-Square 18.2915
Chi-Square DF 20
Pr > Chi-Square 0.5682
Standardized RMSR (SRMSR) 0.0539
Adjusted GFI (AGFI) 0.9179
RMSEA Estimate 0.0000
Akaike Information Criterion 62.2915
Bozdogan CAIC 149.7796
Schwarz Bayesian Criterion 127.7796
Bentler Comparative Fit Index 1.0000
96
Themodelfitchisquareisnotsignificant.Now,theSRMSRisacceptable.TheAGFI,the
RMSEA,andtheCFIcontinuetobeverygood.
96
Release the Constraints on the Loadings
Males Females
Visual
1. 1. Visual
Lozenges Lozenges
1. Paragraph Paragraph
1.
Verbal Sentence Verbal Sentence
WordMean WordMean
Finally,forthecompletelyunconstrainedmultiplegroupthepathdiagramsforthetwo
groupsarethesame,butnoparameternames(exceptforfixedvaluesof1)areusedto
denoteconstraints.
97
Completely Unconstrained Multiple-Group Model
proc calis;
group 1 / label='Males' data=males;
group 2 / label='Females' data=females;
model 1 / group = 1;
path
Spatial ---> Visual Cubes Lozenges = 1. ,
Verbal ---> Paragraph Sentence Wordmean = 1. ;
model 2 / group = 2;
path
Spatial ---> Visual Cubes Lozenges = 1. ,
Verbal ---> Paragraph Sentence Wordmean = 1. ;
fitindex on(only) = [chisq df probchi rmsea srmsr aic
bentlercfi agfi] noindextype;
run;
The REFMODEL statement is not used here because the two models are
not constrained with each other.
98
Becausethetwomodelsforthegroupsaretotallyunrelated,you shouldnotneedto
usetheREFMODELstatementanymore.Instead,thetwomodelsare definedexactlyby
thesamePATHstatementspecifications.However,becausenocommonparameter
namesareusedforthepathcoefficients,thetwomodelsarenot constrained.
98
Fit Summary of the Completely Unconstrained
Multiple-Group Model
Fit Summary
Chi-Square 16.4795
Chi-Square DF 16
Pr > Chi-Square 0.4200
Standardized RMSR (SRMSR) 0.0449
Adjusted GFI (AGFI) 0.9077
RMSEA Estimate 0.0205
Akaike Information Criterion 68.4795
Bozdogan CAIC 171.8746
Schwarz Bayesian Criterion 145.8746
Bentler Comparative Fit Index 0.9984
The unconstrained SEM model for the groups gives you the best fit,
but it is also the least interesting multiple-group model.
99
Allfitindicesindicateverygoodfitofthecompletelyunconstrainedmultiplegroup
model.
99
Chi-Square Difference Tests for the Nested
Multiple-Group Models
Constrained
Completely Loadings and Constrained
Constrained Struct. Cov. Loadings
100
Whichmodelisthebestforthedata?
Chisquaredifferencetestsprovideastatisticalmethodtoseeifmodelsaresignificantly
differentfromeachother.Thisslidesshowsthechisquaredifferencetestsfor
comparingthefourmultiplegroupmodels.
Asallpvaluesarebiggerthan0.05,itmeansallthesemultiplegroupmodelsarenot
significantlydifferentfromeachother.
100
Comparing Model Fits by Using Various Fit Indices
Constrained
Completely Loadings and Constrained Completely
Constrained Struct. Cov. Loadings Unconstrained
Wecanalsocomparethefourmodelsbymeansofthefitindexvalues.
Themodelfitchisquarevaluealwaysfavorsthemodelwiththelargestnumberof
parameters.So,accordingtothemodelfitchisquare,thecompletelyunconstrained
modelisthebestmodel.TheSRMSRalsofavorsthecompletelyunconstrainedmodel
simplybecauseitcanbeviewedasamonotonetransformationofthechisquarevalue.
However,youshouldnotselectyourbestmodelbasedontheabsoluteindicessuchas
modelfitchisquarevalueortheSRMSRbecausetheseindicesdonottakemodel
parsimonyintoaccount.Complicatedmodelsmighthaveperfectmodelfitchisquare
andSRMSRvalues(i.e.,0).Butthesecomplexmodelsshouldnotbeselectedasthebest
modelsbecausetheyhaveverylittlescientificvalue.
TheAGFI,theRMSEA,theAIC,theCAIC,andtheSBCalltakesmodelparsimonyinto
account.FortheAGFI,thelargerthebetter.Forotherindices, thesmallerthebetter.All
theseparsimoniousindicespointtothecompletelyconstrainedmodelasthebest
multiplegroupmodelforthedata.
Lastly,theincrementalfitindexBentler CFIfavorsthecompletelyconstrainedmodeltoo.
However,virtuallyallmultiplegroupmodelinthiscomparisonareequallygood
accordingtotheCFI.NoticethatincrementalindicessuchastheCFImeasureshowa
targetmodelmeasuresbetterthanasocalledbaselinemodel.Theydonottakemodel
parsimonyintoaccount.Inaddition,theydependonhowgoodthe baselinemodelis
usedinthecomputingformula.Ifthebaselinemodelisverybad (suchasthecommonly
useduncorrelatedness model),allcompetingmodelswouldhavegoodincrementalfit
onlybecausethebaselinemodelismuchworse.Forthisreason, incrementalfitindices
mightnotserveasgoodcriteriaformodelselection.
101
Analyzing Direct and
Indirect Effects
102
102
Democracy and Industrialization Data: Direct and
Indirect Effects of Industrialization
Press60
1. Freop60
Dem60 lam2
lam3 Fair60
Gnppc60 1. lam4 Legis60
Enpc60 Indust
Press65
Indlf60
1. Freop65
Dem65 lam2
lam3 Fair65
lam4 Legis65
103
AnalyzingdirectandindirecteffectsissomethinguniquetoSEM.
Letuslookatthemodelforthedemocracyandindustrialization data.Onlythe
structuralpartoftheSEMisshowntoillustratetheidea.
103
Industrialization Effects on Democracy in 1960
and 1965
On democracy in 1960
A direct effect:
Indust --->Dem60 Dem60
On democracy in 1965
A direct effect: Indust
Indust --->Dem65
An indirect effect:
Indust --->Dem60 --->Dem65 Dem65
104
First,letuslookattheeffectofindustrializationonthedemocracymeasurein1960.
ThedirecteffectofIndust onDem60referstothepathIndust >Dem60.Thiseffect
canbeestimateddirectlyfromanySEMsoftware.
Onthedemocracymeasurein1965,industrializationhasadirect andindirecteffect.
ThedirecteffectreferstothepathIndust >Dem65.Theindirecteffectisindicatedby
thetrackIndust > Dem60>Dem65.
Whenyouaddupthedirectandindirecteffect,itgivesyouthe totaleffect.
InSEM,thedirecteffectsareestimatedasthepathcoefficients.Indirecteffectsand
totaleffectsarefunctionsoftheparameterestimates.Fortunately,PROCCALIScan
computethesefunctionsefficientlyanditcanalsoprovidestandarderrorestimatesfor
theseeffects.
104
Factors Affecting Mental Abilities: A Model Inspired by
Marjoribanks (1974)
S1 S2 S3 P1 P2 P3 M1 M2 M3
1. 1. 1.
Parental
Social Status Mental Ability
Encouragement
Achievement
Family Size
Motivation
1.
A1 A2 A3
105
Thisslideshowsamoreinterestingexampleaboutanalyzingdirect,indirect,andtotaleffects.
TheexampleisinspiredbyamodelofMarjoribanks (1974).Thecurrentmodelisasimplification
andthedataaregenerated.Theresultsheredonotrepresenttheoriginalstudy,butwould
servewellforillustrationpurposes.
Themainideaofthestudyistomodelthementalabilityofstudents.Thementalabilityisa
latentconstruct,whichissupposedtobedetermined(predicted) byparentalencouragement
andachievementmotivation,bothofwhichareformulatedaslatentconstructinthemodel.
Tworemotecauses(predictors),socialstatusandfamilysize,havedirecteffectsonparental
encouragementandachievementmotivation.However,thesetworemotecausesaffectthe
mentalabilityonlyindirectly.Socialstatusisalsoformulated asalatentvariable,whilefamily
sizeisanobservedvariable.Forallthelatentvariables,observedindicatorsareusedandthey
arerepresentedbysmallrectanglesinthepathdiagram.
Therearesomemotivatingquestionsaboutthispathdiagramregardingthedirectandindirect
effects.Forexample,
1.Eventhoughsocialstatusdoesnotaffectthementalability,it doeshaveanindirecteffecton
thementalabilityviaparentalencouragementandachievementmotivation.Onewouldlikethe
SEMsoftwaretocomputethisthisindirecteffectanditssignificance.
2.Parentalencouragementhasadirectandanindirecteffectsonthementalability.Whatisthe
overalltotaleffectofparentalencouragementonthementalability.Onewouldalsolikethe
SEMsoftwaretocomputealltheseeffectsandtheirsignificance.
105
Factors Affecting Mental Abilities: PROC CALIS Code
proc calis data=mental nobs=115 effpart;
The EFFPART option analyzes
path the effect partitioning in the
/* Structural Model */ model.
/* Measurement Model */
SocialStatus ---> S1 S2 S3 = 1.,
ParentalEncouragement ---> P1 P2 P3 = 1.,
AchievementMotivation ---> A1 A2 A3 = 1.,
MentalAbility ---> M1 M2 M3 = 1.;
fitindex on(only)=[agfi srmsr rmsea bentlercfi] noindextype;
run;
106
Now,thePATHspecificationforthetargetmodelshouldbeeasyforyou.Youcan
specifythemeasurementmodelandthestructuralmodelbythemultiplepathsyntax.
Youcanlookatthepathdiagramandwritedownthepathsinthe PATHstatement.
Noticethateachpathinthepathdiagramrepresentsadirecteffectofonevariableon
anothervariable.
TheonlynewoptionintroducedhereistheEFFPARToptioninthe PROCCALIS
statement.EFFPARTstandsforeffectpartitioning.Inotherwords,itpartitionsthetotal
effectsofanyvariableonanyothervariableintodirectandindirecteffects.PROCCALIS
willcomputetheseeffectsandthestandardizedversionallwithstandarderror
estimatesprovided.
106
Fit Summary
Fit Summary
107
Themodelfitactuallydoesnotlooktoogoodforthissimulated data.Butthisisnotthe
concernhere.Wewanttostudytheeffectpartitioning,assuming thatwearesatisfied
withthemodelfit.
107
Factors Affecting Mental Abilities: Estimation Results
S1 S2 S3 P1 P2 P3 M1 M2 M3
1. 1.20** .75** 1. .58** 1.47** 1. 2.11**
2.07**
1. 1.18** .60**
* Marginally significant
A1 A2 A3 ** Significant
~ Fixed
108
Beforedivingintotheresultsfortheeffectpartitioning,Iwanttolookattheestimates
showninthepathdiagram.Iwanttothrowinonemoremotivationtostudydirectand
indirecteffectsinastructuralequationmodel.
Inthispathdiagram,estimatesareshownwiththeirsignificancemarked.Twoasterisks
afteranestimatemeanstheestimateisstatisticallysignificant.Oneasteriskafteran
estimatemeansthattheestimateismarginallysignificant.
Iwanttofocusontheeffectsofparentalencouragementonmentalability.Thedirect
effectis1.73.Thismeansthatparentalencouragementhasanegativeeffectonmental
ability.Thissoundsalittlestrangeatthefirstglance.Butifwelookatthebiggerpicture
inthepathdiagram,wecanunderstandwhythatisso.Noticethatparental
encouragementhasapositiveeffectonachievementmotivation,whichinturnshasa
positiveeffectonmentalability.Thewholepicturesuggeststhatpurelyparental
encouragementdonotnecessarilyaffectmentalabilityinapositiveway.Sometimes,
themoreencouragementwouldonlyaddmorepressuretotheindividualsmental
performancehencethenegativedirecteffectonmentalabilityobservedinthepath
diagramresult.However,whentheparentalencouragementcanaffectsomethingmore
internaloftheindividualsnamely,theindividualsachievementmotivation,thenitwill
resultinahighermentalabilityscore.Hence,thereisapositiveindirecteffectof
parentalencouragementonthementalability.
Insum,aninterestingquestioninthispathdiagramresultisthatwhatistheoverall
totaleffectofparentalencouragementonmentalability,giventhatithasanegative
directeffectandapositiveindirecteffect?
108
Partitioning of the Effects: A Prerequisite
109
Beforeyoucananalyzedirectandindirecteffects,youshouldcheckwhethera
prerequisiteissatisfied.Inordertostudytheeffectpartitioninglegitimately,theso
calledstabilitycoefficientmustbelessthen1.PROCCALISprovidessuchacheck.When
youseethesemessagesfromthePROCCALISoutput,youcouldproceedtoexamine
youreffectpartitioningresults.
109
Partitioning of the Effects: Total Effects
Total Effects
Effect / Std Error / t Value / p Value
ParentalEncouragement 0 0 0 0 0.2516
0.0692
3.6356
0.000277
110
WiththeEFFPARToption,PROCCALISproducestablesfortotal,direct,andindirect
effectsseparately.Thesetablescouldbelarge.Ijustannotate theseresultshere.Some
resultsarenotshown.
Thistableisabouttheestimatesofthetotaleffects,theirstandarderrors,tvalues,and
significancelevels.
110
Partitioning of the Effects: Direct Effects
Direct Effects
Effect / Std Error / t Value / p Value
A1 0 1.0000 0 0 0
A2 0 1.1821 0 0 0
0.1084
10.9012
<.0001
. . . . . .
. . . . . .
.
.
.
.
.
.
.
.
.
.
.
.
Details
. . . . . . omitted.
. . . . . .
. . . . . .
. . . . . .
MentalAbility 0 1.3196 0 -1.7343 0
0.4124 0.9047
3.1995 -1.9169
0.001377 0.0553
ParentalEncouragement 0 0 0 0 0.2516
0.0692
3.6356
0.000277
111
Thistableisaboutthedirecteffects,theirstandarderrors,tvalues,andsignificance
levels.
111
Partitioning of the Effects: Indirect Effects
Indirect Effects
Effect / Std Error / t Value / p Value
ParentalEncouragement 0 0 0 0 0
112
Thistableisabouttheindirecteffects,theirstandarderrors,tvalues,andsignificance
levels.
112
Customized Effect Analysis
The EFFPART option displays all logical possible effects
of the variables
Columns: Five variables, each of which serves as a
predictor at least once:
o FamilySize
o AchivementMotivation
o MentalAbility
o ParentalEncouragement
o SocialStatus
Rows: Sixteen variables, each of which serves as an
outcome variable at least once (all variables except for
SocialStatus)
113
Whenyouhavelargetableslikethoseshowninpreviousslides,youarelikelytobe
doingexploratoryanalysiswithoutspecificquestionsinyourmind.Theeffecttables
couldgetverylargeandyoumighthaveadifficulttimetolook fortheparticularresults
thatyouareinterestedin.Forexample,thecolumnsoftheeffecttablesconsistoffive
variables,eachofwhichservesasapredictoratleastonceinthepathdiagram.These
fivevariableshavedirectorindirecteffectsontherowvariables.Therowsconsistof
sixteenvariables,eachofwhichservesasanoutcomevariableatleastonce.Inthe
currentpathdiagram,itincludesallvariablesexceptfortheSocialStatusvariable.
However,ifyouhavespecificresearchquestionsinyourmind,youarerecommendedto
docustomizedeffectanalysisthatissupportedbyPROCCALIS.
113
Customized Effect Analysis on the Structural Model
.25** -1.73*
Parental
Social Status Mental Ability
Encouragement
-.13** Achievement
Family Size
Motivation
114
Inthebeginning,wealreadyhavethesemotivatingquestions.
1.Forthesocialstatusvariable,Whatarethetotal,direct,andindirecteffectsofthis
remotecauseonotherlatentconstructs,especiallyonthementalability?
2.Whatisthetotaleffectofparentalencouragementonthemental ability?
114
Factors Affecting Mental Abilities: Customized
Effect Analysis
proc calis data=mental nobs=115;
path
/* Structural Model */
SocialStatus --> ParentalEncouragement FamilySize AchievementMotivation,
FamilySize ---> AchievementMotivation,
ParentalEncouragement ---> AchievementMotivation MentalAbility,
AchievementMotivation ---> MentalAbility,
/* Measurement Model */
SocialStatus ---> S1 S2 S3 = 1., The EFFPART
ParentalEncouragement ---> P1 P2 P3 = 1., statement defines the
customized effects of
AchievementMotivation ---> A1 A2 A3 = 1., interest.
MentalAbility ---> M1 M2 M3 = 1.;
effpart
SocialStatus -> ParentalEncouragement AchievementMotivation
MentalAbility,
ParentalEncouragement -> MentalAbility;
run;
115
PROCCALISsupportsthecustomizedeffectanalysis.ThiscanbedonebytheEFFPART
statement,asshowninthePROCCALIScodeinthisslide.
First,youwanttostudytheeffectpartitioningofsocialstatusonthesethreevariables:
parentalencouragement,achievementmotivation,andmentalability.Hence,youusethe
followingcodeintheEFFPARTstatement:
Second,youwanttostudytheeffectpartitioningofparentalencouragementonmental
ability.Hence,youusethefollowingcodeintheEFFPARTstatement:
115
Effects of Social Status
Effects of SocialStatus
Effect / Std Error / t Value / p Value
116
TheeffectpartitioningresultsfromPROCCALISareshowninthisslideandthenextone.
Theeffectsofsocialstatusonthethreespecificlatentvariablesareshowninthistable.
Ontheparentalencouragement,socialstatushasadirecteffect only,whichispositive
andsignificant.
Ontheachievementmotivation,socialstatushasbothadirectandanindirecteffects.
Bothoftheseeffectsaresignificant.Thetotaleffectisthesumofthedirectandindirect
effect.Thetotaleffectisalsosignificant.
Onthementalability,socialstatushasonlyanindirecteffect,whichisalsosignificant.
116
Effects of Parental Encouragement on Mental Ability
Effects of ParentalEncouragement
Effect / Std Error / t Value / p Value
117
Thisslideshowstheeffectpartitioningofparentalencouragementonmentalability.
Thedirecteffectisnegative,asshownpreviouslyinthepathdiagram.Thisdirecteffect
ismarginallysignificant.
Theindirecteffectispositiveandisstatisticalsignificant.Thisisapieceofcomforting
informationparentalencouragementdoesaffectthementalabilitypositively,butonly
throughitseffectonachievementmotivation.
Thetotaleffect,whichisthesumofdirectandindirecteffect,however,isnotsignificant.
ThisexampleshowsthatSEMeffectanalysiscanshowsomeeffect patternsthatsimply
cannotbeanalyzedbylinearregressionanalysisadequately.The SEMeffectanalysis
providessomethingmoredetailedandrefinedregardingthetotalityofthetheory.In
thisregard,thecustomizedeffectanalysissupportedbyPROCCALISisveryuseful.
117
Standardized Effects of Social Status
PROCCALISalsoprovidesthestandardizedresultsforeffectanalysis.Standarderrors,t
values,andpvaluesarealsocomputedforthestandardizedeffectestimates.
118
Standardized Effects of Parental Encouragement
on Mental Ability
119
Thisslideshowsthestandardizedeffectsofparentalencouragementonmentalability.
Thepatternisquitesimilartotheunstandardizedversion.
119
Testing Specific
Hypotheses
120
120
Testing Hypotheses about the Measurement Model
v1 v2 v3
S1 S2 S3 P1 P2 P3 M1 M2 M3
1. g2 g3 1. 1. r2 r3
Parental
Social Status Mental Ability
Encouragement
Achievement
Family Size
Motivation
1.
A1 A2 A3
121
Testingspecifichypothesesisaninterestingtopic.Herewelookatsomeexamples.
Inthementalabilitymodel,youhavesomeindicatorvariablesforthelatentvariables.
Twolatentvariablesareselectedtoillustratethetestingofspecifichypotheses.
Forthementalabilityfactor,onemightwanttotestthehypothesisthattheloadings
(pathcoefficients)arethesamefortheM2andM3indicators.Inthepathdiagram,r2
andr3arelabeledasthepathcoefficients.Youwanttotestwhetherr2andr3areequal
withinthemodel.
Forthesocialstatusfactor,younotonlywanttotestthehypothesisthattheloadings
(pathcoefficients)arethesameforthethreeindicators,butyoualsowanttoseeiftheir
correspondingerrorvariancesarethesameinthepopulation.In thepathdiagram,g2,
g3,v1,v2,andv3areparametersofinterest.Youwanttotestsimultaneouslywhether
g2,g3areequalto1andv1,v2,andv3arethesameinthepopulation.
121
Specific Hypotheses
122
Thetestofequalloadingsandequalerrorvariancesforthesocialstatusitemsisatest
ofparallelitems.Thiscouldbestatedmoreformallyasthefollowingfourcomponent
hypothesesH1,H2,H3,andH4,asshownintheslide.Thesefour hypothesesneedtobe
testedsimultaneously.Rejectionofthesimultaneoustestmeanstheitemsarenot
parallel.
Thetestofequalloadingsforthemeasurementindicatorsofthe mentalabilityfactoris
simpler.ItisstatedinH5.RejectionofH5meansthatr2andr3arenotequalinthe
population.
Finally,youcaninventanystrangehypothesisthatcanbeexpressedasacontinuous
functionofthemodelparameters.Forexample,H6statesthattheratioofthesumofr2
andr3tothesumofg2andg3is2.Thishypothesismayormaynotmakesense.Butitis
includedheretodemonstratetheflexibilityofPROCCALIS.
122
PROC CALIS Hypotheses Testing: h() = 0
123
BeforeIshowyouthePROCCALIScode,itisusefultoreformulatethehypothesesinto
theformsthatmatchthePROCCALISinput.
PROCCALIStestshypothesesoftheformh()=0,whereh()isanycontinuousfunction
ofthemodelparameters(forexample,theerrorvariancesandthepathcoefficientsin
themodel).
Thehypothesesinthepreviousslidecouldallberewritteninthisrequiredform,as
showninthisslide.Withtheseforms,youarereadytospecifythosehypothesesin
PROCCALIS.
123
Testing Specific Hypotheses about the Measurement
Model Using PROC CALIS
proc calis data=mental nobs=115;
path
SocialStatus ---> ParentalEncouragement FamilySize AchievementMotivation,
FamilySize ---> AchievementMotivation,
ParentalEncouragement ---> AchievementMotivation MentalAbility,
AchievementMotivation ---> MentalAbility,
SocialStatus ---> S1 S2 S3 = 1. g2 g3, Specify g2, g3, r2, r3, v1, v2,
ParentalEncouragement ---> P1 P2 P3 = 1., and v3 explicitly.
AchievementMotivation ---> A1 A2 A3 = 1.,
MentalAbility ---> M1 M2 M3 = 1. r2 r3;
pvar S1-S3 = v1-v3; Use the SIMTEST statement
simtest parallel_social_items=[h1 h2 h3 h4]; to test simultaneously
hypotheses. Use the
testfunc h5_equal_load_m2_m3 h6_proportional_sum;
TESTFUNC statement to test
h1 = g2 - 1; individual hypotheses.
h2 = g3 - 1;
Use the SAS programming
h3 = v1 - v2; statements to define the
h4 = v2 - v3; parametric functions in the
h5_equal_load_m2_m3 = r2 - r3; tests.
h6_proportional_sum = 2*(g2 + g3) - (r2 + r3);
run;
124
First,youhavetolabelornametheparametersinthecorrectlocationsofthemodel
specification.Forexample,g2andg3arethepathcoefficientsforS2andS3,respectively;andr2
andr3arethepathcoefficientsforM2andM3,respectively.Noticethatyoudidnotnamethese
parametersintheprecedingmodelspecifications.Namingtheseparameterswereoptional
becauseyoudidnotreferencethem.However,becauseyouaregoingtorefertothese
parametersinthehypothesistesting,youmustnameorlabelthemintherespectivelocations
now.Similarly,theerrorvariancesforS1S3arenamedasv1v3,asshowninthePVAR
statement.
ThemaintoolsfortestingspecifichypothesisinPROCCALISare theSIMTESTSandtheTESTFUNC
statements.
TheSIMTESTstatementenablesyoutotestsimultaneoushypothesesliketheparallelhypothesis
withfourcomponenthypotheses.Herewehaveh1,h2,h3,andh4, allofwhicharetreatedjust
asthenamesofthehypothesesthataredefinedlater.
TheTESTFUNCstatementenablesyoutotestindividualhypotheses liketheequalityofloadings
andtheproportionalityhypothesesdescribedpreviously.HereIuselongnamessuchas
h5_equal_load_m2_m3andh6_proportional_sumtoremindmeofthenatureofthetarget
hypotheses.
NowIusethesocalledSASprogrammingstatementstodefinethehypotheses:h1h4,
h5_equal_load_m2_m3,andh6_proportional_sum.TheSASprogrammingstatementsarejust
likecommonmathematicalequations.ThesesixSASprogrammingstatementsdefinethe
parametricfunctionsinthetargethypotheses.PROCCALIStestsallparametricfunctions
equalingzero.
124
Individual Tests of Parametric Functions:
TESTFUNC Results
Parametric Standard
Function Value Error t Value p Value
125
TheTESTFUNCspecificationproducestheresultsshowninthistable.
YoufailtorejecttheequalityofloadingsforM2andM3becausethepvalueisbigger
than0.05.So,theequalityoftheloadingsissupported.
Youalsofailtorejecttheproportionalsumhypothesis(pvalue=0.79).
125
Tests for Parallel Social Status Items:
SIMTESTS Results
Simultaneous Tests
Overall parallelism hypothesis is not supported for the SocialStatus items, although
the equality of error variances is supported.
126
TheSIMTESTSstatementspecificationproducestheoutputshowninthistable.
Fortheparallelhypothesis,thesimultaneoustestisrejected(p<.0001).Theparallel
itemhypothesisisnotsupported.
PROCCALISalsoprovidesindividualtestsforthecomponenthypotheses.Thiswouldbe
usefulfordoinganadhocanalysistoprobewhatfailsthesimultaneoushypothesis.For
example,bothh1andh2areatleastmarginallysignificant.But h3andh4arenot
significant.Recallthath1andh2areabouttheequalityofthe loadings(pathcoefficients)
whileh3andh4areabouttheequalityoferrorvariances.Thecurrentresultsshowthat
theitemsmighthavethesameerrorvariancesbutnotthesameloadingsinthe
population.
126
Model Modifications
127
127
When the Model Does Not Fit Well
Fit Summary
Chi-Square 196.7455
Chi-Square DF 59
Pr > Chi-Square <.0001
Standardized RMSR (SRMSR) 0.0936
Adjusted GFI (AGFI) 0.7341
RMSEA Estimate 0.1431
Bentler Comparative Fit Index 0.8087
128
Thementalabilitymodeldidnotfitwell.TheSRMSRandtheRMSEAarelarge,whilethe
AGFIandtheCFIaresmall.Whenyouencounterabadmodelfit, itwouldjeopardize
yourinterpretationsofthemodelparameters,effectanalysis,hypothesistesting,and
etc.
Modelmodificationisastatisticaltechniquethatsuggestsways toimproveyourmodel
fit.Themostcommonmodelmodificationtechniqueisdonethroughthesocalled
Lagrangemultiplier(LM)tests.Essentially,theLMtestssuggestwhichparametersyou
couldaddtothemodeltosignificantlylowerthemodelfitchisquarevalue.Whenthe
modelfitchisquareislowered,mostotherfitindices(butnotall,especiallythose
parsimoniousindicesthattakemodelcomplexityintoaccount)mightalsoimprove.
128
Using the MODIFICATION Option
129
TheoptionyoucanusetodomodelmodificationinPROCCALISis theMODIFICATION
optioninthePROCCALISstatement.YoucansimplyaddthisoptiontothePROCCALIS
statementwhenyourunyourmodel.ThisexampleshowsthattheLMtestsformodel
modificationisrequestedfortheoriginalmentalabilitymodel, whichdoesnothavea
verygoodmodelfit.
129
LM Tests for Paths
Parm
To From LM Stat Pr > ChiSq Change
Adding the P2 <--- P1 (or P1 <--- P2) path reduces your model fit chi-square by 56 approximately.
Adding the A2 <--- M2 path reduces your model fit chi-square by 19 approximately.
130
PROCCALISoutputseveraltablesfortheLMtests.Theresultsareshownindifferent
tables,accordingthetypeoftheparameters.ThistableshowstherankingofLM
statisticsforaddingthe(singleheaded)pathsintothementalabilitymodel.Itgivesyou
thetenpathsthatcanimprovethemodelfitchisquarestatisticthemost.
Thetoponeisthep1>p2path.TheLMstatistic56.19meansthatifyouincludethis
pathintothemodel,youcanexpecttoreducethemodelfitchisquarebyabout56.This
isasubstantialimprovementbecauseyoucangetthisbigimprovementbyjustlosing
onedegreeoffreedom.Thesecondoneisthep2>p1path.Essentially,thiswillgive
thesameamountofmodelimprovementasthefirstpath.Thethirdoneisnotthat
dramatic,butstillgiveyouasubstantialimprovement.AddingtheM2>A2path
reducesthemodelfitchisquareby19.
Doyouwanttoaddthesepathsintoyourmodel?Letusdiscussthisafterweexamine
moreresultsabouttheLMtests.
130
LM Tests for Error Variances and Covariances
Rank Order of the 10 Largest LM Stat for Error Variances and Covariances
Adding the error covariance (P2 <---> P1) reduces your model fit chi-square by
56 approximately.
131
ThistableshowstheLMtests(statistics)fortheerrorvariancesandcovariances.Onthe
topofthelististhecovariancebetweentheerrorsofP2andP1.Addingthecovariance
betweentheerrorsofthesetwovariablesreducesthemodelfitchisquarestatisticby
56.ThisisactuallythesameimprovementthatwehaveseenforaddingeithertheP2
>P1or
P1>P2path.Thenextoneinthelisthasamuchlessimprovement. TheLMstatisticis
only12.26.
Forthisparticularlymodel,thesetwotablesareallthatPROCCALISproducesfortheLM
statistics.Thequestionnowiswhichparameterorparametersyouwanttoaddtothe
model.ThiscouldnotbeansweredbyjustlookingattheLMstatistics.Butitmightalso
involvesomejudgmentabouthowreasonabletheaddedparametersare.Dothese
addedparametersrenderyourmodeluninterpretable,orevencontradictorytoyour
theoreticalclaims,despitethefactthattheyimproveyourmodelfitsubstantially?
131
Notes on the LM Statistics
132
Beforegivingananswertothecurrentmodelmodificationanalysis,someimportant
generalpointsabouttheLMstatisticsarediscussed.
First,themodelfitchisquarereductionsasindicatedbytheLMteststatisticsareonly
linearapproximations.Thismeansthatifyouactuallyrefitthe modelbyaddingthe
suggestedparameter,theactualchisquarereductionmightbemoreorless.
Second,thechisquarereductionsassuggestedbytheLMteststatisticsarenot additive.
Thatmeansthatyoucannotaddtwoormoreparametersintothemodelandexpectthe
actualreductioninthenewmodelisexactlythesumofthecorrespondingLMstatistics.
Usually,theactualreductionwouldbesmaller(althoughitcouldbelarger).
Lastbutnotleast,modificationsuggestedbytheLMstatisticsmightnotbe
substantivelymeaningful.
Allthesethreepointsareimportantindecidingwhichparameter youwanttoaddtothe
currentmentalabilitymodelforimprovingthemodelfit.
132
Adding Error Covariance between P1 and P2
S1 S2 S3 P1 P2 P3 M1 M2 M3
1. 1. 1.
Parental
Social Status Mental Ability
Encouragement
Achievement
Family Size
Motivation
1.
A1 A2 A3
133
ConsideringthetopsuggestionsfromtheresultsoftheLMteststatistics,Iwouldadd
thecovariancebetweenP1andP2.Theaddedparameterisshowninthepathdiagram.
Basically,allthetopLMsuggestions theP2>P1andP1>P2paths,andtheerror
covariancebetweenP1andP2arejustdifferentmanifestationsofthesamelackoffit
aboutacovarianceelementintheoriginalmodel.Thatis,thecovariancebetweenP1
andP2wasnotwellexplainedbytheoriginalmodel.Addingeitherofthesewillleadto
abetterfittingofthecovariancebetweenP1andP2.Inaddition,addingeitherofthese
willgiveyouanapproximatemodelfitchisquareimprovementof56.But,youwould
notgetthreetimesofthisamountbyaddingallthesethree.In fact,ifyouweretoadd
allthesethreeparameters,itisverylikelythatyourmodelis notidentified,meaning
thatyouwouldnotgetuniqueestimates.
Amongthetopthreechoices,theerrorcovarianceischosenbecausetheinterpretation
ofaddederrorcovarianceisalittlecleaner. P1andP2aremeasurementindicatorsof
thesamefactor(ParentalEncouragement).Theerrorcovarianceinterpretationisthat
thesetwoindicatorshavesomesortofcorrelationthatisunexplainedbytheircommon
factor.Theaddederrorcovariancerepresentsthecovarianceexplainedbysome
unknownsources.However,ifIweretoaddeithertheP1>P2orP2>P1paths,it
wouldcreatesomeconflictswithpurportedcommonfactorstructureforthetwo
indicatorvariables.
Notethatthecurrentconclusionisbasedonaverygeneralargumentthataimsat
preservingtheoriginalfactorvariablestructure.Itisnotauniversalprinciple.Inpractice,
youhavetoalsoconsiderthesubstantivegroundsoftheaddedparameters.
133
Adding Covariance between the Errors of P1 and P2
134
NowthatIhavedecidedtoaddthecovariancebetweenP1andP2, Irefitthemodelby
addingthePCOVstatementspecificationforthetwovariables,asshownintheSAS
codeinthisslide.IalsousetheMODIFICATIONoptiononemoretimetoseeifthere
couldbeanyfurthersuggestedimprovements.
134
Before and After Adding the Error Covariance
between P1 and P2
Before After
Fit Summary Fit Summary
135
BeforeyouaddtheerrorcovariancebetweenP1andP2,yourmodelfitchisquarewas
about197.Afteraddingthecovariance,themodelfitchisquareisabout111.This
improvementisactuallylargerthanwhattheLMstatisticsuggested,whichwas56.
Otherfitindicesalsoimprove.TheSRMSRandtheRMSEAarenowclosetobe
acceptable.TheAGFIandtheCFIareboostedtohigherlevels.
135
A New Set of LM Tests for Paths
Parm
To From LM Stat Pr > ChiSq Change
Adding the A2 <--- M2 path now reduces your model fit chi-square
by 24 (was 19 before adding the error covariance).
136
ThenewsetofLMtestsforpathssuggeststheadditionoftheM2>A2path.TheLM
statisticisabout24.Ifyoucomparethisresultwiththefirst LMresultsregardingthe
samepath,younoticethattheLMstatisticschangesasthefittedmodelchanges.
Previously,thesamepathhadanLMstatisticof19.Thisillustratesthenonlinearityand
nonadditivityoftheLMstatistics.
136
A New Set of LM Tests for Error Variances
and Covariances
Rank Order of the 10 Largest LM Stat for Error Variances and Covariances
137
ThereisalsoanewsetofLMtestsforaddingerrorcovariances.
Youmightwanttoimproveyourmodelfurtherbyaddingsomeparametersfromthese
twoLMtables,althoughIwillnotattempttodomorehere.
137
Customized LM Tests
138
ModelmodificationbyusingtheMODIFICATIONoptioniskindofblindsearch
procedurethatyoutrytoimproveyourmodelwithoutanydefinitedirections.As
discussedbefore,theLMteststatisticsmightnotgiveyousuggestionsthatare
substantivelymeaningful.
However,insomeoccasionsyoumightwanttorestrictyourattentiontocertainsetof
potentialpathsorparametersinyourmodel,ratherthanallpossibleparameterspace
searchedbytheMODIFICATIONoption.
Ifyouwanttodosuchaprincipledmodificationprocess,youcanusethecustomizedLM
testssupportedinPROCCALIS.
138
Customized LM Tests by Using the LMTESTS
Statement
proc calis data=mental nobs=115;
path
SocialStatus ---> ParentalEncouragement FamilySize AchievementMotivation,
FamilySize ---> AchievementMotivation,
ParentalEncouragement ---> AchievementMotivation MentalAbility,
AchievementMotivation ---> MentalAbility,
SocialStatus ---> S1 S2 S3 = 1. ,
ParentalEncouragement ---> P1 P2 P3 = 1.,
AchievementMotivation ---> A1 A2 A3 = 1.,
MentalAbility ---> M1 M2 M3 = 1.;
lmtests corr_err=[coverr] path=[LV->LV LV->MV];
run;
Explore the set of LM tests called Explore the set of LM tests called path, which
corr_err, which contains all the contains all potential latent variable paths (LV->
potential error covariance LV) and measurement paths (LV -> MV) to be
parameters (COVERR) to be freed. freed.
139
ThecustomizedLMtestsdefinesetsofparametersofinterestso thatyourmodel
modificationprocess(orLMstatisticsoutput)wouldbelimitedtothosesetsof
parameters.PROCCALISprovidestheLMTESTSstatementsyntaxtoachievethe
customizedLMtests.
Thementalabilitymodelisusedagain.ThistimeIdefinetwosetsofparametersof
interest.ThefirstsetofLMtestsiscalledcorr_err (itisjustanameyouassign).This
setofparameterscontainstheparameterregionCOVERR,whichis akeywordthat
denotesallerrorcovariancesinthemodel.ThesecondsetofLM testsiscalledpath
anameyouassign.Thissetofparametersdonotexhaustallpathsinthemodel.It
containstheparameterregionsLV>LVandLV>MV,whicharekeywordsthatdenotes
thelatentvariable(LV)tolatentvariable(LV)pathsandthelatentvariable(LV)to
manifestvariable(MV)paths,respectively.Therefore,thiscustomizedsetpath
excludespathsfromobservedvariablestoobservedvariables,or fromobserved
variablestolatentvariablessothatthefactorstructuresofthemodelcouldnotbe
potentiallydestroyedbyaddingthesepaths.TheLMtestsforthesepathsaresimplynot
includedintheresultsforthepath set.
139
Customized LM Tests for Error Covariances
Parm
Type Var1 Var2 LM Stat Pr > ChiSq Change
140
ThistableshowsthecustomizedLMtestsoftheCORR_ERR set.Essentially,thistable
isthesameasoneofthestandardtablesproducedwiththeMODIFICATIONoption
becausebothtableshavethesameparameterregionCOVERR.
140
Customized LM Tests for Paths
Parm
Type Var1 Var2 LM Stat Pr > ChiSq Change
141
ThesecondcustomizedsetofLMtestssuggeststhataddingthedependentvariable(DV)
todependentvariable(DV)pathA2< ParentalEncouragementimprovesthemodelfit
themostamongstallpathsinthepath set.Thechisquareimprovementisabout19,
whichisstatisticallysignificant.
141
Adding a Path from Parental Achievement to A2
S1 S2 S3 P1 P2 P3 M1 M2 M3
1. 1. 1.
Parental
Social Status Mental Ability
Encouragement
Achievement
Family Size
Motivation
1.
A1 A2 A3
142
Addingthefirstpathsuggestedbythesecondsetofcustomizedsetisrepresentedby
thepathdiagramshownabove.ThepathinredshowsthatA2,whichisanindicatorof
AchievementMotivation,isnowalsoanindicatorofParentalEncouragement.Although
thefactorvariablefunctionalrelationshipispreservedinthissuggestedpathdiagram,
A2becomesfactoriallycomplex.ThealsoimpliesthatA2mightnothavebeenagood
(unique)measureofachievementmotivation.
142
Adding the ParentalEncouragement ---> A2 path
143
Nonetheless,youaddthisnewpathforA2,asshownintheabove PROCCALIScode.All
youneedtodoistoaddA2asoneoftheobservedindicatorsof the
ParentalEncouragementfactor.
143
Before and After Adding the
ParentalEncouragement ---> A2 Path
Before After
Fit Summary Fit Summary
144
Thesetwotablescomparethefitindicesbeforeandafteradding the
ParentalEncouragement>A2path.Themodelfitchisquareactuallydropsmorethan
19,whichwassuggestedbytheLMstatisticintheprecedingresults.Allotherfitindices
improvequiteabittoo.
Finally,acautionaboutallmodelmodification:youshouldvalidateyournewly
establishedmodelbynewdata.Thereasonisthatthemodelmodificationprocessis
subjecttothecapitalizationonchance.Usingaprincipledmodificationprocessbythe
customizedLMtestsmightnotavoidthechanceproblemcompletely.Confirmation
fromnewdataisalwaysrecommended.
144
More About PROC CALIS .
145
Inthisworkshop,ImostlyusethePATHmodelinglanguagetofit SEM.Ialsobriefly
mentionedtheLISMODasaninterfacefortheLISRELmodel.There areactuallyquitea
fewmoremodelinglanguageinPROCCALIS:COSAN,FACTOR,LINEQS, MSTRUCT,and
RAM.Alloftheselanguagessupportmultiplegroupanalysisandmeanstructure
analysis.
IhavealsousedthedefaultML(maximumlikelihood)estimationmethodinthis
workshop,butPROCCALISsupportsmanyotherestimationmethodsaswell:GLS
(generalizeleastsquares),WLS(weightedleastsquares),ULS(unweightedleastsquares),
DWLS(diagonallyweightedleastsquares),andFIML(fullinformationmaximum
likelihood).
Ihaveonlyusedunstandardizedresultsinmostexamples,butPROCCALISalsoprovide
standardizedsolutionswithstandarderrorestimates.
Finally,analysisofmissingpatternswillbeavailablewiththe FIMLmethodinSAS/STAT
9.3.IhopetoaddmorefunctionalitiestoPROCCALISinthefuture.
145
Glossary
Manifest Observed variables (measured variables) in the data set.
Latent Unobserved variables.
Endogenous Dependent /mediating variables; at least one single-headed arrow points to it;
used as an outcome variable in an equation; can also be a predictor variable in other
equations.
Exogenous Independent variables; no single-headed arrows point to it; never used as an
outcome variable in the model; used only as a predictor in the model.
Factor A latent (unmeasured) variable that is treated as a hypothetical construct
(systematic source) in the model.
Error An exogenous term for uncertainty (unsystematic source) associated with an
endogenous manifest variable (or any endogenous variable, in a more general definition).
Disturbance An exogenous term for uncertainty (unsystematic source) associated with an
endogenous latent variable.
Path diagram representation
Rectangles: Observed / manifest variables.
Ovals / circles : Latent variables (factors, errors, and disturbances). Errors and
disturbances are not necessarily put into ovals/circles.
146
146
Glossary
Single-headed arrows: Directed paths, direct effects, path coefficients; specified in the
PATH statement.
Double-headed arrows that point to individual variables: Variance parameters of
exogenous variables or error variance parameters of endogenous variables; specified in
the PVAR statement.
Double-headed arrows that point to two distinct variables: Covariance parameters
between exogenous variables or error covariance parameters between endogenous
variables; specified in the PCOV statement.
Fit assessment
model fit chi-square statistic: Nonsignificance means that the theoretical model is
supported; not a very practical index because it almost always rejects all approximating
models that are practically useful.
AGFI (adjusted goodness-of-fit index) and Bentlers CFI (comparative fit index): Two
popular fit indices that indicate good model fit when their values are above 0.9.
SRMR (standardized root mean square residual) and RMSEA (root mean squared error
approximation): Two popular fit indices that indicate good model fit when their values
are below 0.05.
AIC, CAIC, and SBC: Information criteria for comparing competing models. The smaller
the better.
147
147
References
Arbuckle, J. (2008), AMOS 17.0 Users Guide, Crawfordville, FL: Amos Development
Corporation.
Bentler, P. M. (1985), Theory and Implementation of EQS: A Structural Equations
Program, Manual for Program Version 2.0, Los Angeles: BMDP Statistical Software.
Bentler, P. M. (1995), EQS, Structural Equations Program Manual, Program Version 5.0,
Encino, CA: Multivariate Software.
Bollen, K. A. (1989), Structural Equations with Latent Variables, New York: John Wiley
& Sons.
Fuller, W. A. (1987), Measurement Error Models, New York: John Wiley and Sons.
Holzinger, K. J. and Swineford, F. A. (1939), A Study in Factor Analysis: The Stability
of a Bi-Factor Solution, Supplementary Educational Monographs, No. 48. Chicago:
University of Chicago,Dept. of Education.
Jreskog, K. G. (1973), A General Method for Estimating a Linear Structural Equation
System, in A. S. Goldberger and O. D. Duncan, eds., Structural Equation Models in
the Social Sciences, New York: Academic Press.
148
148
References
Keesling, J. W. (1972), Maximum Likelihood Approaches to Causal Analysis, Ph.D. thesis,
University of Chicago.
Marjoribanks, K., ed. (1974), Environments for Learning, London: National Foundation
for Educational Research Publications.
McDonald, R. P. (1978), A Simple Comprehensive Model for the Analysis of Covariance
Structures, British Journal of Mathematical and Statistical Psychology, 31, 5972.
McDonald, R. P. (1980), A Simple Comprehensive Model for the Analysis of Covariance
Structures: Some Remarks on Applications, British Journal of Mathematical and
Statistical Psychology, 33, 161183.
Nowak, T. P., Hoffman, D. L., and Yung, Y. F. (2000), Measuring the Customer
Experience in Online Environments: A Structural Modeling Approach, Marketing
Science, 19(1), 2242.
Wiley, D. E. (1973), The Identification Problem for Structural Equation Models with
Unmeasured Variables, in A. S. Goldberger and O. D. Duncan, eds., Structural
Equation Models in the Social Sciences, New York: Academic Press.
149
149