Professional Documents
Culture Documents
How To Guide For SmartPLS
How To Guide For SmartPLS
Introduction
Partial Least Squares Structural Equation Modeling (PLS-SEM) is useful when
the research needs to predict a set of dependent variables from a large set
of independent variables (Abdi, 2007). This example shows in which situations
researchers should use this technique with respect to other predictive multivariate
techniques. We illustrate PLS-SEM using a subset of 2017 Customer Behavior in
Electronic Commerce Study in Ecuador.
Learn About Structural Equation Modeling in SmartPLS With Data From the
Page 2 of 19
Customer Behavior in Electronic Commerce Study in Ecuador (2017)
SAGE SAGE Research Methods Datasets Part
2019 SAGE Publications, Ltd. All Rights Reserved. 2
Contents
Learn About Structural Equation Modeling in SmartPLS With Data From the
Page 3 of 19
Customer Behavior in Electronic Commerce Study in Ecuador (2017)
SAGE SAGE Research Methods Datasets Part
2019 SAGE Publications, Ltd. All Rights Reserved. 2
In turn, these latent variables, depending on their role in the model, can be
exogenous and endogenous. The exogenous affect others and do not receive
influence, while the endogenous ones receive influence from other variables
but also can affect each other. Conventionally, graphically, these models are
represented with the latent variables in Greek letters within circles, and the
observed variables are represented in Latin letters within rectangles. The
relationship between the observed variables or indicators and the latent variables
is known as the measurement model or outer model, while the relationship
structure between the latent variables or concepts of the model is called the
structural model or inner model. Both models are represented graphically by
arrows in trajectory diagrams (Henlein & Kaplan, 2004). The direction of the
arrows between the observed and latent variables, theoretically, indicates whether
the observed measurements are reflective indicators (each indicator is a reflection
or direct observation of the latent variable or construct) or formative indicators
(where some set of indicators together jointly determine the latent variable). The
entire process to establish or represent the structural model is known as the
specification or identification of the model.
In this example, the relationships of all the indicators with their respective latent
variables are presented as reflective indicators, since each survey question
Learn About Structural Equation Modeling in SmartPLS With Data From the
Page 4 of 19
Customer Behavior in Electronic Commerce Study in Ecuador (2017)
SAGE SAGE Research Methods Datasets Part
2019 SAGE Publications, Ltd. All Rights Reserved. 2
Learn About Structural Equation Modeling in SmartPLS With Data From the
Page 5 of 19
Customer Behavior in Electronic Commerce Study in Ecuador (2017)
SAGE SAGE Research Methods Datasets Part
2019 SAGE Publications, Ltd. All Rights Reserved. 2
5,000 iterations are recommended); and set the Stop Criterion to 10−7;
this setting is generally recommended as adequately fine tolerance (if
Learn About Structural Equation Modeling in SmartPLS With Data From the
Page 6 of 19
Customer Behavior in Electronic Commerce Study in Ecuador (2017)
SAGE SAGE Research Methods Datasets Part
2019 SAGE Publications, Ltd. All Rights Reserved. 2
created (e.g., 5,000). The variation across these many (e.g., 5,000) estimations
from the bootstrap subsamples is used to derive standard errors for the PLS-
SEM results. With this information, standard errors, Beta coefficients, t-values,
p-values, and confidence intervals can be calculated to assess the PLS-SEM
estimation results.
The (fit) quality criteria are the most relevant output values for assessing both
phases of the PLS-SEM estimation. Specifically, the criteria and generally
suggested interpretation are summarized in Table 1.
Coefficient of determination. Indicates the percentage of the variance in the endogenous variable that the
R2 exogenous variables explain collectively. Can take values between 0 and 1, where values closer to 0 represent
poor fit and values closer to 1 represent a better fit.
Measures effect size and the strength of the relationship between the variables on a numeric scale related to
total, explained, and error variances. Higher is generally considered better, although “sufficiently high” values
will vary considerably across contexts. Nonetheless, these general guidelines have been offered:
Cronbach’s Alpha is a measure of internal consistency or reliability of a construct’s measure, that is, how
closely related the set of items comprising the construct are as a group. The result is usually a number from 0
to 1, but a negative Cronbach’s Alpha can also occur, suggesting something seriously wrong with the operation
(e.g., if some score items have polarity reversed relative to some others, the mean of all the inter-item
Construct
correlations can be negative: items’ polarity should always be aligned). General guidelines on Cronbach’s
Reliability
Alpha for Construct Reliability and Validity are:
and
Validity
Below 0.60 unacceptable
0.60–0.70 minimally acceptable
0.70–0.80 respectable
0.80–0.90 very good
Learn About Structural Equation Modeling in SmartPLS With Data From the
Page 7 of 19
Customer Behavior in Electronic Commerce Study in Ecuador (2017)
SAGE SAGE Research Methods Datasets Part
2019 SAGE Publications, Ltd. All Rights Reserved. 2
Composite reliability (CR), also referred to as McDonald’s coefficient, is obtained by combining all the true
score variances and covariances in the composite of indicator variables related to constructs and by dividing
this sum by the total variance in the composite. Like Cronbach’s Alpha, CR is a reliability indicator, but
Cronbach’s Alpha assumes factor loadings to be the same for all items, whereas CR takes into consideration
the varying factor loadings of the items. Acceptable values of CR are generally considered 0.7 and above.
Average of variance extracted (AVE) is an indicator of convergent validity that measures the amount of
variance that is captured by a construct in relation to the amount of variance due to measurement error.
Generally, AVE of at least 0.5 or higher is demanded, otherwise variance of error is more than variance
explained, which is considered unacceptable.
Discriminant validity, finally, determines whether the constructs in the model are highly correlated among
themselves or not. It compares the Square Root of AVE of a particular construct with the correlation between
that construct with other constructs. It is generally suggested that the Square Root of AVE should be higher
than the correlation of the construct with others (If not, the individual construct does not provide much
discrimination, i.e., unique explanatory power).
Collinearity
Variance inflation factor is a measure of the amount of multicollinearity among a set of multiple regression
Statistics
variables. Values greater than 4 are generally considered to indicate problematically high multicollinearity.
(VIF)
Standardized Root Mean Square Residual is an index of the average of standardized residual between the
observed and the hypothesized covariance matrices. Although recommendations vary across sources, a good
adjustment is generally considered to be less than 0.10 or 0.08.
Model fit
Chi Square (χ2), the model is generally considered to have an acceptable fit if the Chi-square/df values are
from 2 to 3 and with limits of up to 5.
For reflective models, these are the key indicators showing the trajectory of the Latent variable towards the
Outer
observed variables. Therefore, they show how much each observable variable or item contributes absolutely to
loadings
the definition of the construct or latent variable. Loadings are generally expected to be greater than .6.
They are indicators typical of the formative models, since they illustrate the trajectory from the Observed
Outer
variable to the Latent variables. These indicate the relative contribution of an indicator to the definition of its
weights
corresponding variable. It is also expected to be greater than .7.
Analogous to regression residuals, the residue indicates the variance that does not ultimately go into
Residue
explaining the factor. As this “waste” is lower, a better fit of the model is indicated.
Upon conclusion of the project creation, the command to import the data will
appear in the upper right panel. Double-click as indicated and look in the library of
your computer for the data file, select it and press open (see Figure 4).
Learn About Structural Equation Modeling in SmartPLS With Data From the
Page 10 of 19
Customer Behavior in Electronic Commerce Study in Ecuador (2017)
SAGE SAGE Research Methods Datasets Part
2019 SAGE Publications, Ltd. All Rights Reserved. 2
Once the data have been imported, you can view the descriptive statistics of the
variables under study in the lower right panel (e.g., Mean, Medium, Min, Max,
Standard deviation, Excess Kurtosis, and Skewness) (see Figure 4, Step 5).
To specify the model, use the superior command “new path model.” Clicking this
will bring up a dialog box to name your path diagram. Now you can visualize the
graphics resources in the upper part that will allow you to draw it. First, place
the latent variables and then the arrows as theoretically established. Initially, they
appear as red circles (see Figure 5).
Learn About Structural Equation Modeling in SmartPLS With Data From the
Page 11 of 19
Customer Behavior in Electronic Commerce Study in Ecuador (2017)
SAGE SAGE Research Methods Datasets Part
2019 SAGE Publications, Ltd. All Rights Reserved. 2
With the right button on each variable latent, you can label the variables. With the
“connect” command, you can identify with an arrow and establish the relationships
between the variables (see Figure 5, Step 7). Then, load the observed variables
or indicators to the model. To do so, select the lower right panel identified with the
indicator name and drag the corresponding items to each variable and drop it over
the corresponding latent circle or variable. Progressively, as the indicators of each
variable are loaded, originally in red circles, they will turn blue, and the indicators
or items will be reflected in yellow rectangles. Now you can see the model in the
following way, as it was seen in Figure 1 previously described.
Learn About Structural Equation Modeling in SmartPLS With Data From the
Page 12 of 19
Customer Behavior in Electronic Commerce Study in Ecuador (2017)
SAGE SAGE Research Methods Datasets Part
2019 SAGE Publications, Ltd. All Rights Reserved. 2
At the conclusion of the calculation, you can see all the indicators associated
with the measurement model in the lower right panel. In the upper tabs of this
quadrant, you can review this information numerically and graphically (see Figure
6).
Similarly, in the lower right panel, you can see the indicators associated with the
Learn About Structural Equation Modeling in SmartPLS With Data From the
Page 13 of 19
Customer Behavior in Electronic Commerce Study in Ecuador (2017)
SAGE SAGE Research Methods Datasets Part
2019 SAGE Publications, Ltd. All Rights Reserved. 2
PE1 0.807
PE2 0.772
Performance Expectancy
PE3 0.815 0.92 0.89 0.64
(PE)
PE4 0.818
PE5 0.770
Learn About Structural Equation Modeling in SmartPLS With Data From the
Page 14 of 19
Customer Behavior in Electronic Commerce Study in Ecuador (2017)
SAGE SAGE Research Methods Datasets Part
2019 SAGE Publications, Ltd. All Rights Reserved. 2
PE6 0.831
EE1 0.837
EE2 0.890
Effort Expectancy (EE) 0.91 0.86 0.71
EE3 0.763
EE4 0.865
SI1 0.821
SI2 0.865
Social Influence (SI) 0.91 0.88 0.72
SIC3 0.888
SI4 0.822
FC1 0.775
FC2 0.897
Facilitating Conditions
0.93 0.91 0.73
(FC)
FC3 0.873
FC4 0.875
INT1 0.904
INT3 0.904
Learn About Structural Equation Modeling in SmartPLS With Data From the
Page 15 of 19
Customer Behavior in Electronic Commerce Study in Ecuador (2017)
SAGE SAGE Research Methods Datasets Part
2019 SAGE Publications, Ltd. All Rights Reserved. 2
Learn About Structural Equation Modeling in SmartPLS With Data From the
Page 16 of 19
Customer Behavior in Electronic Commerce Study in Ecuador (2017)
SAGE SAGE Research Methods Datasets Part
2019 SAGE Publications, Ltd. All Rights Reserved. 2
PE 0.80
EE 0.75 0.84
model: R2 adjusted value, Beta Coefficient, and f2 effect size. Before testing the
structural model, fit adjustment with Standardized Root Mean Square Residual
value was evaluated. The result was 0.06, which indicated a good fit adjustment.
In respect to the predictive power of the model provided for Behavioral Intention,
R2 adjusted value indicates that the model explains 45% of the variance in
BI. Bootstrapping was performed to provide a significance level for each
hypothesized relationship, parameter settings for bootstrapping included no sign
changes, and the 500 samples. According to the results, PE, EE, and FC predict
Learn About Structural Equation Modeling in SmartPLS With Data From the
Page 17 of 19
Customer Behavior in Electronic Commerce Study in Ecuador (2017)
SAGE SAGE Research Methods Datasets Part
2019 SAGE Publications, Ltd. All Rights Reserved. 2
showed to be the best predictor, followed by the PE. According to f2, values of
0.02, 0.15, and 0.35 represent small, medium, and large effect sizes, respectively.
The f2 effect size on the BI was small for all the variables, which implies a small
but significant contribution of the variables whose hypotheses were confirmed. All
these results are summarized in Table 5.
H1: Performance Expectancy → Behavioral Intention 0.133 1.847 .066 Not confirm
H2: Social Influence → Behavioral Intention 0.020 0.394 .694 Not confirm
3 Your Turn
You can download this sample dataset along with a guide showing how to conduct
a PLS-SEM. The sample dataset also includes demographic variables such as
gender and age. See whether you can reproduce the results presented here, and
try to conduct your own PLS-SEM: (1) for men and women separately and (2) for
young people (18–25 years old) and young adults (26–35 years old) separately.
References
Abdi, H. (2007). Partial least squares regression. Encyclopaedia of measurement
and statistics, 2, 740–744.
Chin, W. W. (1998). The partial least squares approach for structural equation
Learn About Structural Equation Modeling in SmartPLS With Data From the
Page 18 of 19
Customer Behavior in Electronic Commerce Study in Ecuador (2017)
SAGE SAGE Research Methods Datasets Part
2019 SAGE Publications, Ltd. All Rights Reserved. 2
Hair, J., Jr., Sarstedt, M., Hopkins, L., & Kuppelwieser, V. G. (2014). Partial
least squares structural equation modeling (PLS-SEM): An emerging tool in
business research. European Business Review, 26(2), 106–121.
Henlein, M., & Kaplan, A. M. (2004). A beginner’s guide to partial least squares
analysis. Understanding Statistics, 3(4), 283–297.
Ringle, C. M., Wende, S., & Becker, J.-M. (2015). “SmartPLS 3.” Boenningstedt,
Germany: SmartPLS GmbH. Retrieved from http://www.smartpls.com
Venkatesh, V., Morris, M. G., Davis, G. B., & Davis, F. D. (2003). User
acceptance of information technology: Toward a unified view. MIS Quarterly,
27(3), 425–478.
Learn About Structural Equation Modeling in SmartPLS With Data From the
Page 19 of 19
Customer Behavior in Electronic Commerce Study in Ecuador (2017)