Professional Documents
Culture Documents
Ijoprs MS Id 555703
Ijoprs MS Id 555703
Abstract
Structural Equation Model (SEM) is an advanced multivariate statistical tool for modeling latent variables and to control measurement errors.
Bayesian SEM (BSEM) gives better estimates of latent variable compared to classical SEM. To our knowledge the use of BSEM approach to
identify the significant latent constructs influencing quit smoking in tuberculosis (TB) and human immunodeficiency virus (HIV) patients was
not studied so far. The aim of this study is to identify latent variables influencing quitting smoking in TB and HIV patients using BSEM. The data
used for the study consist of 160 patients (80 TB and 80 HIV) randomised to receive smoking cessation intervention under clinical trial. The
smoking status was measured after one month of intervention. The latent variables ‘reasons for smoking’ (measured by the variables work
tension, family tension and pleasure while smoking) and ‘intensity of smoking’ (measured by the variables Fagerstrom score, smoking type,
number of times smoking per day and smoking duration) and the information on socio-economic characteristics were considered for analyses.
This study elucidates the importance of applying BSEM to assess smoking cessation in TB and HIV patients. BSEM gave the estimates indicating
the latent variable ‘intensity of smoking’ had negative effect on quit smoking.
Keywords: Randomised Clinical Trial; Smoking Cessation; TB; HIV; Bayesian Structural Equation Model
Abbreviations: SEM: Structural Equation Model; BSEM: Bayesian SEM; TB: Tuberculosis; HIV: Human Immunodeficiency Virus; RCT: Randomised
Controlled Clinical Trial; MDR-TB: Multidrug Resistance TB
Introduction
robust for small sample problems [8-11]. In BSEM, the computation
Structural equation models (SEMs) are advanced statistical
algorithm is based on raw observation rather than covariance
models that represent complex relationships between latent and
matrix as in classical SEM and solved using powerful computing
observed variables [1]. Latent variables are random variables
techniques such as Gibbs sampler and Metropolis-Hasting’s
which are measured indirectly through observed variable. SEM
algorithms [12]. Numerous articles highlighted superiority of
is a superior modeling tool than other techniques because of
BSEM compared to other traditional techniques [8,9,12-16]. To
reducing measurement errors by having multiple indicators
our knowledge the use of BSEM to identify the significant latent
of latent variables that is free from random error. Only a few
constructs influencing quit smoking in tuberculosis (TB) and
studies attempted SEM to evaluate many factors which are inter-
human immunodeficiency virus (HIV) patients was not studied so
related and cannot be easily disentangled by traditional statistical
far. In the current study, we hypothesised a latent variables model
techniques [2-5].
using BSEM to identify the influencing factors for quit smoking in
Simulations studies have compared Bayesian estimation and TB and HIV patients after one month of the interventions.
frequentist estimation for SEM with small samples and advantage
There is ample evidence for the association between smoking
of Bayesian SEM (BSEM) is well documented in literature [6,7].
and TB, with literature showing smoking among TB patients
The BSEM is a potential tool to overcome assumption issues and
causes morbidity and mortality. It was reported that smoking Ψδ = diagonal matrix and ξi and δi are independent [12].
has been also associated with multidrug resistance TB (MDR-TB)
Suppose v = {x, y} , where x = {x1 ,..., xr } is a subset of
[17-22]. Smokers who are infected with HIV will face more risks
observable continuous variables, while y = { y1 ,..., ys } be the
besides the impact of smoking. Compared to non-smoker, smokers
remaining subset of unobservable continuous variables where
were having poor viral and immunological response, more threat
p ≥ s = p-r ≥ 0. The information related with y be given as an
of virological rebound [23].
ordered categorical vector z = ( z1 ,..., zs )T . The latent variable are
ICMR-National Institute for Research in Tuberculosis either continuous or ordered categorical manifest variables as
conducted a randomised controlled clinical trial (RCT) for its indicators. The relationship of y and z is defined by a set of
quit smoking among TB and HIV patients. This trial used two thresholds as given bellow:
interventions, one arm being standard counselling and other arm z1 α1, z < y1 ≤ α1, z +1
being physicians’ advice along with standard counselling to see 1 1
the efficacy [24]. This was a unique opportunity to use this data . .
to identify the latent factors influencing the quit smoking. We
z= . if . , (3)
made an attempt to use BSEM for latent variables influencing quit
smoking for TB and HIV patients and presented in this paper. . .
The paper is structured as follows. In section 2, we propose zs α s , z < ys ≤ α s , z
s s +1
BSEM model for continuous and ordered categorical variables. In
section 3, we provide application of BSEM for clinical trial data
to identify latent variables influencing for quitting smoking in TB where, α k ,0 < α k , 1 < ... < α k bk +1 and for k = 1,...s , zk is an integral
and HIV patients. Section 4 describes results of the BSEM model. value in {0,1, …bk}. In general, we set α k ,b = ∞ , α k ,0 =-∞.
k +1
−∞ There are
Final section provides discussions and conclusions. bk +1 categories, for kth variable that are defined by the unknown
threshold α k , j . The integral values {0,1, …bk} of zk are used for
Methodology identifying the categories which contain the corresponding
SEM elements in yk . These integral values are neither directly used
in the posterior simulation nor in any actual computation of
SEM consists of the measurement equation and structural Bayesian analysis [12].
equation, the measurement equation of the SEM is defined by
Markov Chain Monte Carlo Algorithms
yi = Λωi + ∈i , i = 1,…n (1) In Frequentist analysis, parameters are used as constant,
whereas in Bayesian analysis, parameters are taken as variables
where [25]. In Bayesian analysis, the prior distributions of a parameter
yi = p X 1 random vector, is combined with likelihood obtained from the data to form
posterior distribution of the estimates of the parameter. The
Λ = p X q factor loading matrix, Bayesian estimates of the latent variables and the parameters
ωi = q X 1 factor scores vector and are estimated using Markov Chain Monte Carlo (MCMC) methods
called the Gibbs sampler [26] and the Metropolis-Hastings (MH)
∈ = (p x 1) is a random vector of error measurement
i algorithm [27,28]. MCMC algorithms creates approximations
∈ ~ N (0, Ψ ) where Ψ∈ is a diagonal matrix.
∈
to the posterior distributions by iteratively selecting random
draws in the MCMC chains. The initial draws of the estimates are
Let ωi = (η , ξi ) is a partition of ωi into an q1 x 1 dependent
T T T
i mentioned as the burnin phase of the MCMC algorithms. The
latent vector ηi and an q2 x 1 independent latent vector ξi, where ξ ~ trace plot of the posterior draws can be utilised to monitor the
N (0, I) and uncorrelated with ∈
. The structural equation which convergence. The maximum number of iteration for convergence
provides the relationship between dependent latent variables and can be assessed graphically by plotting the simulated sequences of
independent latent variables can be written as individual parameters. If converged, the parallel sequence created
η i = Π η i + Γξ i + δ i (2) with diverse initial values which overlap well. The convergence
can also be supervised using the Gelman-Rubin potential scaling
where, reduction by the parallel computing in multiple MCMC chains.
П(q1 x q1) & 1 Γ (q x q 2 ) = unknown parameter matrices of The parallel sequence of observations and the estimated potential
regression coefficients; scale reduction (EPSR) values of the parameters are estimated
sequentially based on the starting values of the structural
δi = q1 x 1 random vector of error measurements.
parameters and latent variables as the iterations proceed [29].
ξi ~N[0, Ф], and δi ~N[0, Ψδ], The ESPR values are less than 1.2 implies the convergence of the
model [15].
How to cite this article: Vasantha M, Ramesh S, Ponnuraja C, Adhin B, Muniyandi M, et al. Latent Factors Affecting Smoking Cessation Among TB and
002
HIV Patients: Bayesian Structural Equation Modeling Approach. Int J Pul & Res Sci. 2024; 7(1): 555703. DOI: 10.19080/IJOPRS.2024.07.555703
International Journal of Pulmonary & Respiratory Sciences
Evaluation Criteria score, smoking type, number of times smoking in a day and
smoking duration, income, intervention to quit smoking, drinking
The model fit of BSEM is evaluated using a posterior predictive
habits, past history of quit smoking, presence of HIV infection
(PP) p-value. The BSEM model is considered to have a good fit if
and the reasons for smoking such as pleasure, work tension and
PP p- value near to 0.5. If the PP p-value is small (approaching
family tension. There were seven independent variables framed
zero), the model is a poor fit for the data [12].
as two independent latent variables. The three variables pleasure,
Application of BSEM to Clinical Traill Data work tension and family tension were used as indicator variables
for the latent variable ‘reason’ ( ξ1 ) which was intended to
Data address reason for smoking. The four variables Fagerstrom score,
We used secondary data from randomised controlled trial smoking type, number of times smoking and smoking duration
piloted at ICMR-National Institute for Research in Tuberculosis were selected as indicator variables to address the latent variable
(ICMR-NIRT), Madurai Unit during March 2010 to May 2012. A ‘intensity’ ( ξ 2 ). The variables educations, income, intervention,
total of 160 patients (80 TB patients and 80 HIV patients) aged drinking habits, past history of quit smoking, presence of HIV or
≥ 18 years, with either HIV or TB and with a history of current TB infection were considered as covariates. The hypothesis of the
smoking (at least one cigarette in the past 1 week) were enrolled. study was that the latent variables ‘intensity’ and ‘reason’ were
This study was registered in the Clinical Registry of India related to quit smoking after one month of the interventions. The
(CTRI/2009/091/000962, 14.12.2009), approved by Scientific hypothesis and the path diagram of SEM is given in (Figure 1).
Advisory Committee and Institutional Ethics Committee of ICMR- In the model of the current study, yi is 7 X 1 vector of manifest
NIRT, Chennai to study the efficacy of physician’s advice in quitting variable defining the 2 X 1 random vector of latent variables ωi.
smoking. The factor loading matrix Λ is given below as
How to cite this article: Vasantha M, Ramesh S, Ponnuraja C, Adhin B, Muniyandi M, et al. Latent Factors Affecting Smoking Cessation Among TB and
003
HIV Patients: Bayesian Structural Equation Modeling Approach. Int J Pul & Res Sci. 2024; 7(1): 555703. DOI: 10.19080/IJOPRS.2024.07.555703
International Journal of Pulmonary & Respiratory Sciences
How to cite this article: Vasantha M, Ramesh S, Ponnuraja C, Adhin B, Muniyandi M, et al. Latent Factors Affecting Smoking Cessation Among TB and
004
HIV Patients: Bayesian Structural Equation Modeling Approach. Int J Pul & Res Sci. 2024; 7(1): 555703. DOI: 10.19080/IJOPRS.2024.07.555703
International Journal of Pulmonary & Respiratory Sciences
Quit rate were generated after discarding the first 3000 burn-in iterations.
The convergence of the sampler was verified by assessing the
Overall combined analysis including both HIV and TB, the
trace plots. The plots for a selection of the parameters and
quit rate was 38% (52/136). The quit rate among the patients
posterior distribution of kernel density are shown in (Figure 2
received physician’s advice was 41% (28/68) and in the standard
and 3). (Figure 2) shows convergence plots, which demonstrate
counselling was 35% (24/68) though the difference in quit rate
tight overlapping of parameter estimates across iterations. The
was not significant (p=0.473).
overlapping indicates that the parameters are converged. The
Convergence of BSEM posterior probability density plots in (Figure 3) which depicts
that the posterior distribution of the parameters is approximating
The SEM model for this dataset did not converge; hence,
normal. ESPR values are also found to be less than 1.2 after 2,000
we conducted a BSEM model for the study. To get the Bayesian
iterations.
estimates of latent variable and the parameters, 3000 observations
How to cite this article: Vasantha M, Ramesh S, Ponnuraja C, Adhin B, Muniyandi M, et al. Latent Factors Affecting Smoking Cessation Among TB and
005
HIV Patients: Bayesian Structural Equation Modeling Approach. Int J Pul & Res Sci. 2024; 7(1): 555703. DOI: 10.19080/IJOPRS.2024.07.555703
International Journal of Pulmonary & Respiratory Sciences
Figure 2(d): Trace plots of value of parameter for different iterations. Outcome on reason ( γ 2 ).
How to cite this article: Vasantha M, Ramesh S, Ponnuraja C, Adhin B, Muniyandi M, et al. Latent Factors Affecting Smoking Cessation Among TB and
006
HIV Patients: Bayesian Structural Equation Modeling Approach. Int J Pul & Res Sci. 2024; 7(1): 555703. DOI: 10.19080/IJOPRS.2024.07.555703
International Journal of Pulmonary & Respiratory Sciences
Figure 3(d): Posterior distribution of value of parameter for different iterations. 3d. Outcome on reason ( γ 2 ).
Model fit of BSEM observations after convergence are presented in (Table 2 and
3). The posterior estimates of the parameters and their 95%
In the current study, 42 parameters were used to estimate the
posterior credible intervals are presented in (Table 2 and 3). The
influencing factors for quit smoking. The PP p-value of 0.3 for the
standardised estimate of structural equation for quit smoking is
fitted BSEM indicated the good fit of the model for the data.
Unstandardized Standardized
Factors p-value
Estimate Posterior SD 95% Credible Interval Estimate Posterior SD 95% Credible Interval
Intensity By
Fagerstrom Score 1 - - 0.951 0.04 (0.848, 0.984) <0.001
Smoking type 0.141 0.074 (0.054, 0.332) 0.404 0.104 (0.184, 0.593) <0.005
Number of times
3.104 1.395 (1.609, 6.447) 0.881 0.059 (0.756, 0.994) <0.001
smoking
Smoking duration 0.443 0.389 (-0.130, 1.423) 0.141 0.092 (-0.038, 0.325) 0.064
Reason By
Family tension 1 - - 0.803 0.065 (0.656, 0.921) <0.001
Work tension 3.157 0.979 (1.040, 4.739) 0.973 0.047 (0.836, 0.990) <0.001
Pleasure 0.344 0.206 (0.066, 0.872) 0.429 0.157 (0.097, 0.699) <0.01
Intensity on
Drinking 0.756 0.933 (-0.478, 3.125) 0.107 0.093 (-0.069, 0.289) 0.122
Past quit of smoking -0.197 0.681 (-1.789, 1.065) -0.035 0.096 (-0.225, 0.156) 0.357
HIV -0.7 0.621 (-2.032, 0.496) -0.127 0.095 (-0.294, 0.075) 0.115
Reason on
Drinking 0.339 0.355 (-0.323, 1.077) 0.101 0.097 (-0.093, 0.286) 0.153
Past quit of smoking 0.329 0.334 (-0.323, 1.016) 0.125 0.116 (-0.106, 0.346) 0.141
HIV 0.099 0.281 (-0.548, 0.561) 0.037 0.101 (-0.173, 0.220) 0.366
How to cite this article: Vasantha M, Ramesh S, Ponnuraja C, Adhin B, Muniyandi M, et al. Latent Factors Affecting Smoking Cessation Among TB and
007
HIV Patients: Bayesian Structural Equation Modeling Approach. Int J Pul & Res Sci. 2024; 7(1): 555703. DOI: 10.19080/IJOPRS.2024.07.555703
International Journal of Pulmonary & Respiratory Sciences
How to cite this article: Vasantha M, Ramesh S, Ponnuraja C, Adhin B, Muniyandi M, et al. Latent Factors Affecting Smoking Cessation Among TB and
008
HIV Patients: Bayesian Structural Equation Modeling Approach. Int J Pul & Res Sci. 2024; 7(1): 555703. DOI: 10.19080/IJOPRS.2024.07.555703
International Journal of Pulmonary & Respiratory Sciences
and including younger age at initiation increase risk for death and 10. Ansari A, Jedidi K, Dube LL (2002) Heterogeneous factor analysis
reduce the chance to quit. models: A Bayesian approach. Psychometrika 67: 49-78.
11. Lee S, Song X (2004) Bayesian model comparison of nonlinear
BSEM was used in other settings and showed better estimates. structuralequation models with missing continuous and ordinal
The cross-cultural consistency of the factor structure of the categorical data. British Journal of Mathematical and Statistical
Hedonic and Eudaimonic Motives for Activities (HEMA) scale was Psychology 57: 131-150.
examined by BSEM showed that BSEM is more informative than 12. Lee SY (2007) Structural Equation Modeling: A Bayesian Approach.
the traditional SEM [34]. Similarly other studies applied BSEM John Wiley & Sons Ltd. England.
to identify the factors for non-adherence to medication among 13. Lee SY, Song XY (2003) Bayesian analysis of structural equation models
hypertension patients [13], and type 2 diabetic patients with with dichotomous variables. Statistics in Medicines 22: 3073-3088.
nephropathy [35]. In the current study showed that PP p- value 14. Kaplan D, Depaoli S (2012) Handbook of Structural Equation Modeling.
for the model fit was 0.3 indicating that the BSEM model was Hoyle RH, editor. The Guilford Press.
found to be adequately fitting the data. 15. Muthèn LK, Muthèn BO (2017) MPlus user’s guide. Los Angeles, CA:
Muthèn and Muthèn. Los Angeles, CA.
Conclusion 16. Van De Schoot R, Depaoli S (2014) Bayesian analyses: Where to start
and what to report. European Health Psychologist 16: 75-84.
The Novel approach of the study was that from our knowledge
we used first time BSEM to identify the latent constructs which 17. Altet-Gomez MN, Alcaide J, Godoy P, Romero MA (2005) Hernandez del
Rey I. Clinical and epidemiological aspects of smoking and tuberculosis:
influence quit smoking in TB and HIV patients. BSEM is powerful
a study of 13,038 cases. International Journal of Tuberculosis Lung and
statistical computing tool for more accurate analysis of more Disease 9(4): 430-436.
complex variables.
18. Ariyothai N, Podhipak A, Akarasewi P, Tornee S, Smithtikarn S, et al.
(2004) Cigarette smoking and its relation to pulmonary tuberculosis in
Acknowledgement adults. Southeast Asian Journal of Tropical Medicine and Public Health
35(1): 219-227.
The authors acknowledge the authorities of ICMR-National
Institute for Research in Tuberculosis, Indian Council of Medical 19. Barroso EC, Mota R, Santos RO, Sousa A, Barroso JB, et al. (2003)
Risk factors for acquired multidrug resistant tuberculosis. Journal of
Research, Chennai for permitting us to use the data for modelling.. Pneumologia 29: 89-97.
References 20. Chiang CY, Slama K, Enarson DA (2007) Associations between tobacco
and tuberculosis. International Journal of Tuberculosis and Lung
1. Beran TN, Violato C (2010) Structural equation modeling in medical Disease 11(3): 258-262.
research: a primer. BMC Research Notes 3: 267.
21. Pai M (2009) Tobacco and TB: what clinicians can and must do? J
2. Businelle MS, Kendzor DE, Reitzel LR, Costello TJ, Cofta Woerpel L, MGIMS 14: 1-6.
et al. (2010) Mechanisms linking socioeconomic status to smoking
cessation: a structural equation modeling approach. Health Psychology 22. Thomas A, Gopi PG, Santha T, Chandrasekaran V, Subramani R, et al.
29(3): 262-273. (2005) Predictors of relapse among pulmonary tuberculosis patients
treated in a DOTS programme in south India. International Journal
3. Huang C, Guo C, Yu S, Feng Y, Song J, et al. (2013) Smoking behaviours Tuberculosis Lung Disease 9(5): 556-561.
and cessation services among male physicians in China: evidence from
a structural equation model. Tobacco Control 2(2): 27-33. 23. Feldman JG, Minkoff H, Schneider MF (2006) The association of
cigarette smoking with HIV prognosis among women in the HAART
4. Martinez SA, Beebe LA, Thompson DM, Terrell DR, Campbell JE (2018) era-a report from the Women’s Interagency HIV Study. American
A structural equation modeling approach to understanding pathways Journal of Public Health 96(6): 1060-1065.
that connect socioeconomic status and smoking. PLoS ONE 2: 1-17.
24. Kumar SR, Pooranangangadevi N, Rajendran M, Mayer KH, Flanigan
5. Sauer A, Fedewa SA, Kim J, Jemal A, Westmaas JL (2018) Educational T, et al. (2017) Physician’s advice on quitting smoking in HIV and
attainment & quitting smoking: A structural equation model approach. TB patients in south India: a randomised clinical trial. Public Health
Preventive Medicine 116: 32-39. Action 7: 39-45.
6. Smid SC, Mcneish D, Miočević M, Schoot RV (2020) Bayesian Versus 25. Gelman A, Stern HS, Carlin JB, Rubin DB (2004) Bayesian Data Analysis.
Frequentist Estimation for Structural Equation Models in Small London, England: Routledge.
Sample Contexts: A Systematic Review. Structural Equation Modeling:
A Multidisciplinary Journal 27(1): 131-161. 26. Geman S, Geman D (1984) Stochastic relaxation, Gibbs distribution
and the Bayesian restoration of images. IEEE Transactions on Pattern
7. De Schoot R, Kaplan D, Denissen J, Asendorpf JB, Neyer FJ, et al. Analysis and Machine Intelligence 6: 721-741.
(2014) A Gentle Introduction to Bayesian Analysis: Applications to
Developmental Research. Child Development 85(3): 842-60. 27. Metropolis N, Rosenbluth AW, Rosenbluth MN, Teller AH, Teller E
(1953) Equations of state calculations by a fast-computing machine.
8. Scheines R, Hoijtink H, Boomsma A (1999) Bayesian estimation and Journal of Chemical Physics 21: 1087-1091.
testing of structural equation models. Psychometrika 64: 37-52.
28. Hastings WK (1970) Monte Carlo sampling methods using Markov
9. Lee SY, Shi JQ (2000) Bayesian analysis of structural equation model chains and their application. Biometrika 57: 97-109.
with fixed covariates. Structural Equation Modeling 7(3): 411-430.
How to cite this article: Vasantha M, Ramesh S, Ponnuraja C, Adhin B, Muniyandi M, et al. Latent Factors Affecting Smoking Cessation Among TB and
009
HIV Patients: Bayesian Structural Equation Modeling Approach. Int J Pul & Res Sci. 2024; 7(1): 555703. DOI: 10.19080/IJOPRS.2024.07.555703
International Journal of Pulmonary & Respiratory Sciences
29. Gilks WR, Richardson S, Spiegelhalter DJ, (1996) Gelman A Inference 33. Thun MJ, Carter BD, Feskanich D, Freedman ND, Prentice R, et al. (2013)
and monitoring convergence. London: Chapman and Hall. 50-year trend in smoking-related mortality in the United States. New
England Journal of Medicine 368: 351-364.
30. Inoue-Choi M, Hartge P, Park Y, Abnet CC, Freedman ND (2019)
Association Between Reductions of Number of Cigarettes Smoked per 34. Bujacz A, Vittersø J, Huta V, Kaczmarek LD (2014) Measuring hedonia
Day and Mortality Among Older Adults in the United States. American and eudaimonia as motives for activities: cross-national investigation
Journal of Epidemiology 188(2): 363-371. through traditional and Bayesian structural equation modelling.
Frontiers Psychology 5: 2-9.
31. Jha P, Ramasundarahettige C, Landsman V, Rostron B, Thun M, et al.
(2013) 21st-century hazards of smoking and benefits of cessation in 35. Song XY, Lee SY, Ng M, So WY, Chan J (2007) Bayesian analysis of
the United States. New England Journal of Medicine 368(4): 341-350. structural equation models with multinomial variables and an
application to type 2 diabetic nephropathy. Statistics in Medicine
32. Pirie K, Peto R, Reeves GK, Green J, Beral V (2013) Million Women 26(11): 2348-2369.
Study Collaborators. The 21st century hazards of smoking and benefits
of stopping: a prospective study of one million women in the UK.
Lancet 381: 133-141.
How to cite this article: Vasantha M, Ramesh S, Ponnuraja C, Adhin B, Muniyandi M, et al. Latent Factors Affecting Smoking Cessation Among TB and
0010
HIV Patients: Bayesian Structural Equation Modeling Approach. Int J Pul & Res Sci. 2024; 7(1): 555703. DOI: 10.19080/IJOPRS.2024.07.555703