Professional Documents
Culture Documents
1 s2.0 S0959652624007522 Main
1 s2.0 S0959652624007522 Main
A R T I C L E I N F O A B S T R A C T
Handling editor: Zhen Leng The construction industry significantly contributes to global greenhouse gas emissions, highlighting the imper
ative for developing environmentally friendly construction materials. Geopolymers, particularly those utilizing
Keywords: metakaolin (MK), have emerged as a promising green alternative to conventional concrete. However, the
Geopolymer concrete acquisition of MK-based geopolymer concrete with optimal mechanical properties poses challenges due to
Metakaolin
numerous influential factors, disagreement over various findings, and the lack of a reliable predictive model. This
Compressive strength
study aimed to address this gap by employing a wide range of machine learning methods, namely gradient
Machine learning
SHAP boosting machine, random forest, decision tree, artificial neural network, and support vector machine. Different
optimization and regularization techniques were used to comprehensively understand the factors affecting the
compressive strength of MK-based geopolymer concrete, including mixture design, chemical characteristics of
the initial binder and activators, and different curing regimes. The results demonstrated the exceptional per
formance of the gradient boosting machine in predicting the compressive strength of MK-based geopolymer
concrete, achieving a coefficient of determination of 0.983 and a mean absolute error of 1.615 MPa. Addi
tionally, the study employed partial dependence plots, feature importance analysis, and SHapley Additive ex
Planations (SHAP) to elucidate the proposed models. The coarse-to-fine aggregate ratio, H2O/Na2O molar ratio,
extra water content, and sodium hydroxide concentration were identified as the most critical parameters
affecting the compressive strength of MK-based geopolymer concrete. This research contributes to advancing the
development of sustainable construction materials, streamlining experimental tasks, minimizing the need for
labor and materials, improving time efficiency, and providing valuable insights for optimizing the design of MK-
based geopolymer concrete.
* Corresponding author.
** Corresponding author.
E-mail addresses: shayanfar@iust.ac.ir (M.A. Shayanfar), emad.mohammadigolafshani@unimelb.edu.au (E. Golafshani).
https://doi.org/10.1016/j.jclepro.2024.141305
Received 14 September 2023; Received in revised form 4 February 2024; Accepted 15 February 2024
Available online 20 February 2024
0959-6526/© 2024 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
S.A. Eftekhar Afzali et al. Journal of Cleaner Production 446 (2024) 141305
natural sources, including calcined clays. While fly ash and GGBFS Table 1
availability vary by region due to their by-product nature, clay reserves, Application of different ML algorithms to predict the compressive strength of
such as Kaolin clay, are generally abundant worldwide and accessible geopolymers and alkali-activated materials.
globally, surpassing the quantity of cement produced (Ayeni et al., 2021; Utilized Size of Input variables Binder Year Refs
Bature et al., 2021; Environment et al., 2018). Therefore, calcined clays algorithms database
are the sole materials possessing significant potential to prolong the Logistic 616 Si/Al, Na/Al, Ca/ Fly 2023 Zhang
availability of suitable minerals for geopolymers and alkali-activated regression, Si, curing ash, et al.
materials (Environment et al., 2018; Heath et al., 2014). Metakaolin Artificial temperature, GGBFS (2023)
neural curing time,
(MK) is an ideal thermally activated calcined clay derived from the
network, humidity, water
calcination of natural kaolin clay at temperatures between 600 and Support vector
800 ◦ C (Moradikhou et al., 2020). Its incorporation as a partial machine,
replacement for OPC enhances various properties of OPC-based con Random forest
crete, including compressive strength and resistance to chemical Linear 220 Activated Fly 2022 (Ahmed
regression, alkaline solution ash, et al.,
degradation (Ahmed et al., 2022a, b, c, d; Asteris et al., 2022). More Multi-logistic to binder ratio, GGBFS 2022a,
over, MK serves as a suitable base material for alkali-activated binder regression, fly ash, SiO2/ b, c, d)
preparation due to its high reactivity in geopolymerization reactions Artificial Al2O3 of fly ash,
(Júnior et al., 2021). neural GGBS, SiO2/CaO
network, and of GGBS, fine
Consuming MK as the precursor results in a geopolymer with
M5P-tree aggregate, coarse
consistent and predictable properties, which is more easily character aggregate,
ized than those made from industrial wastes like fly ash and GGBFS (da sodium
Silva Rocha et al., 2018; Singh et al., 2015). MK can react rapidly, hydroxide,
exhibiting a faster reaction rate than siliceous fly ashes and even GGBFS sodium silicate,
molarity
(Environment et al., 2018). In some cases, MK-based geopolymers can
Decision tree, 154 Fly ash, coarse Fly ash 2022 Ahmed
achieve high early strengths at ambient temperatures without requiring Bagging, aggregate, fine et al.,
high-temperature pre-curing, primarily when activators are used in AdaBoost aggregate, 2022a,
higher concentrations (Provis et al., 2019). This challenges the idea that sodium b, c, d
hydroxide,
all alkali-activated materials necessitate such treatment for early
sodium
strength (Provis et al., 2019). Furthermore, MK-based geopolymers offer hydroxide
approximately 50% lower CO2 emissions compared to conventional concentration,
OPC-based mixtures with equivalent strength (Abbas et al., 2020). sodium silicate,
Additionally, they contribute to a reduced global warming potential by SiO2, Na2O, age
Ridge regression, 173 Fly ash, GGBFS, Fly 2022 Shah
about 61% (Perez-Cortes and Escalante-Garcia, 2020).
Random forest, Na2O dosage, ash, et al.
Several studies have been carried out to investigate the fresh and Extreme water/binder GGBFS (2022)
hardened properties of MK-based geopolymer concrete (GPC). Alghan gradient ratio, curing
nam et al. (2021) provided insights into the characteristics of MK-based boosting temperature,
retarder content,
GPC, focusing on the effects of curing conditions. Their findings revealed
superplasticizer
that variations in curing temperature (24–40 ◦ C) and relative humidity Support vector 676 Fly ash, GGBFS, Fly 2022 Zhang
(20–70%) did not significantly affect the compressive strength. Mor machine, reactivity ash, et al.
adikhou et al. (2020) aimed to determine the optimal mix design for Random forest, modulus, GGBFS (2022)
MK-based GPC and concluded that the compressive strength increased Extra trees, hydraulic
Gradient modulus, silica
between the sodium hydroxide concentrations of 10 and 14 M. However,
boosting modulus,
no further improvement was observed when the sodium hydroxide alumina
concentration increased from 14 M to 16. Pouhet and Cyr (2016) modulus, lime
highlighted the notable dependence of MK-based GPC’s compressive modulus, sodium
hydroxide,
strength on the mixture’s water content, in contrast to conventional
sodium silicate,
OPC-based concrete. Rovnaník (2010) explored the mechanical char Na2O in sodium
acteristics of MK-based geopolymers under various pre-curing condi silicate, silica in
tions, including temperatures ranging from 40 to 80 ◦ C. Their results sodium silicate,
indicated that higher pre-curing temperatures led to less compact water in sodium
silicate, water,
microstructure, attributed to the quick setting of the mixture hindering
fine aggregate,
the formation of a denser structure. coarse aggregate,
Considerable efforts have been exerted to identify the optimal temperature,
chemical characteristics of MK-based geopolymer systems, specifically relative humidity,
focusing on SiO2/Al2O3, Na2O/Al2O3, and H2O/Na2O molar ratios. age
Artificial neural 110 Fly ash, SiO2, Fly ash 2022 Peng
These ratios play a significant role in governing the system’s chemistry, network, Al2O3, coarse and
geopolymerization kinetics, microstructural evolution, and, conse Support vector aggregate, fine Unluer
quently, the compressive strength of MK-based geopolymers. However, machine, aggregate, (2022)
reported optimal values for these ratios diverge significantly across Extreme sodium
learning hydroxide,
various studies, exhibiting a wide range of variations. Specifically, the
machine sodium
recommended optimal values for SiO2/Al2O3, Na2O/Al2O3, and H2O/ hydroxide
Na2O ratios span from 2.75 to 5.5, 0.7 to 1.3, and 7 to 13, respectively concentration,
(De Silva et al., 2007; Duxson et al., 2007; Lahoti et al., 2017; Rowles sodium silicate,
and O’connor, 2003; Yunsheng et al., 2010; Zhu et al., 2021). The sodium silicate/
sodium
inconsistency in findings across different studies can be ascribed to the hydroxide, alkali
influence of other factors on compressive strength and the complex and
(continued on next page)
nonlinear interactions among them, which have not been
2
S.A. Eftekhar Afzali et al. Journal of Cleaner Production 446 (2024) 141305
3
S.A. Eftekhar Afzali et al. Journal of Cleaner Production 446 (2024) 141305
Fig. 1. The methodology used for a) development and evaluation, and b) ranking, selection, and interpretation of different ML models.
testing sets, each serving distinct purposes. The training set, comprising ranked based on their testing root mean squared error obtained in the
80% of the whole database, then undergoes further divisions in a k-fold previous step, and the best-synthesized models are selected. Finally,
cross-validation integrated Bayesian optimization process to adjust the various parametric studies are conducted utilizing the best-synthesized
hyperparameters of ANN, SVM, DT, GBM, and RF algorithms. Following models to interpret their predictions.
this, 20 prediction models are developed based on different initial
random seeds for each ML algorithm. Subsequently, the performance of 3. Data collection and preparation
these models is evaluated using a testing set containing 20% of the
whole database withheld from the training process. The models are then Accurate and reliable prediction of the compressive strength of MK-
4
S.A. Eftekhar Afzali et al. Journal of Cleaner Production 446 (2024) 141305
Table 3
Statistical description of input and output features.
Input features Type Unit Minimum Mean Maximum Standard deviation
5
S.A. Eftekhar Afzali et al. Journal of Cleaner Production 446 (2024) 141305
Fig. 2. Distribution of the input and output features, including a) MK, b) SHS, c) SHSM, d) SSS, e) W, f) W/S, g) Na2O/Al2O3, h) SiO2/Al2O3, i) H2O/Na2O, j) CA/FA,
k) SP, l) CT, m) Age, and n) CS.
∑
K LRidge (w) = L(w) + λ wi 2 (3)
γ ji = wj−ki 1 j−i 1
ρ + bji (2) i=1
k=1
where LRidge (w) is the Ridge regularized loss function, L(w) represents
j
where ρi represents the output of the i th neuron in the j th layer, Ψ(.) is the standard loss function for the FNN model (commonly used as the
the activation function, K is the number of neurons in the (j − 1)th layer mean squared error), wi denotes the network weights, and λ is the reg
j− 1
that are connected to the i th neuron in the j th layer, wki is the cor ularization parameter that controls the impact of the regularization
6
S.A. Eftekhar Afzali et al. Journal of Cleaner Production 446 (2024) 141305
term. In the present study, λ is determined during the hyperparameter training set of data {(x1 , y1 ), (x2 , y2 ), ․ ․ ., (xl , yl )}, where l denotes the
∑
tuning process discussed in Section 5.1. The additional term λ ni=1 wi 2 number of training data instances, xi represents the input vector, and yi
penalizes large weight values during the training phase, effectively denotes the corresponding actual target value. In regression problems,
constraining the model’s complexity and mitigating overfitting. SVM aims to establish a linear function that minimizes the difference
To further facilitate the learning process of FNN, data normalization between predictions and targets to be within a set threshold (ε), while
is a crucial preprocessing step (Adel et al., 2022; Peng and Unluer, maximizing flatness. The desired linear regression function can be
2022). Normalizing the data prevents certain features from dominating written as follows:
the learning process due to their larger scales, which can affect the
f (x) = 〈ω, x〉 + b (5)
convergence and performance of FNN models. This study employs data
normalization using the Min-Max scaling technique with the following where ω denotes the weight vector, b represents the scalar threshold
equation to map input and output values between − 1 and 1: vector, and 〈,〉 denotes the dot product operation between ω and x
2(X − Xmin ) vectors. The goal of maximizing flatness corresponds to minimizing the
Xn = − 1 (4) norm of the weight vector, which can be formulated as follows:
Xmax − Xmin
1
where Xn represents the normalized value of the feature X (input or minimize ‖ω‖2
2
output features), with Xmin and Xmax denoting the minimum and the ⎧
⎨ yi − 〈ω, xi 〉 − b ≤ ε + ξ
maximum values of X, respectively. ∑
l
( )
+C ξ + ξ∗i subjected to 〈ω, xi 〉 + b − yi ≤ ε + ξ∗ (6)
It should be emphasized that the predictive performance of FNN i=1
⎩
ξ, ξ∗ ≥ 0
models is significantly influenced by the selection of hyperparameters,
including the number of hidden layers, the number of neurons in each where C represents the regularization parameter that governs the trade-
hidden layer, and the activation function type. In the current research, off between the flatness of the function f(x) and the tolerance for de
only FNN models with a single hidden layer and a maximum of ten viations exceeding ε. To find a feasible solution to the above convex
neurons are utilized due to the limited size of the collected database. optimization problem, slack variables ξ and ξ∗ are introduced, allowing
for the neglect of deviations greater than ε utilizing an ε-insensitive loss
function, expressed as follows (Smola and Schölkopf, 2004):
4.2. Support vector machine (SVM)
{
0 if |y − f (x)| ≤ ε
|ξ|ε = (7)
SVM is a robust supervised ML technique that has demonstrated |y − f (x)| − ε otherwise
notable effectiveness in enhancing the generalization capability for
regression and classification tasks (Cortes and Vapnik, 1995). Consider a To tackle the non-linearity issue, SVM seeks a linear solution in a
7
S.A. Eftekhar Afzali et al. Journal of Cleaner Production 446 (2024) 141305
Fig. 4. An illustration of an unregularized ensemble and a regularized ensemble with weighted base learners.
high-dimensional feature space using Lagrange multipliers αi and α∗i ̃ is a measure of tree complexity and α is the complexity
where |T|
(Fig. 3):
parameter.
( ) ∑nsv
( )
f x, αi , α∗i = αi − α∗i 〈φ(xi ) , φ(x) 〉 + b (8)
4.4. Random forest (RF)
i=1
where φ is a nonlinear map from input space to the high-dimensional Bootstrap aggregating (Bagging) is a robust methodology for
feature space, 〈φ(xi ) , φ(x) 〉 represents the dot product between training generating independent base learners in a parallel process, and it has
patterns in high-dimensional feature space, and nsv is the number of shown success in supervised learning tasks (Breiman, 2001). Bagging
support vectors. However, mapping training patterns to a higher- builds a set of base learners by randomly selecting different subsets of
dimensional space introduces high computational costs. This computa data, known as bootstrap. Since sampling is done with replacement
tional complexity can be circumvented using a kernel function K(xi , xj ), (bootstrap sampling), approximately 63.2% of the total training data are
such as radial basis or polynomial functions, without explicitly included in each bootstrap, allowing the bagging model to calculate the
computing the form of φ(xi ), as shown below (Smola and Schölkopf, generalization error using the out-of-bag error.
2004): RF is an enhanced version of bagging that incorporates both bagging
and a random subset of input features to reduce correlations among
) ∑ grown trees while maintaining reasonable predictive strength (Breiman,
nsv
( ( )
f x, αi ., α∗i = αi − α∗i K(xi , x) + b (9)
i=1 2001). Given a training set of data {xi , yi }N1 , where N represents the
number of training instances, RF aggregates the predictions of the in
As a geometric algorithm, SVM is influenced by the magnitude of the
dividual trees Tj (xi ; θj ) by averaging them using the following equation:
input features. If certain features have larger scales than others, the SVM
may give them more weight, leading to a skewed interpretation of their 1 ∑M
( )
importance. Hence, in the current research, a data normalization step is f (X) = TJ X; θj (12)
M J=1
conducted before applying SVM, using the Min-Max Scaling technique
described by Eq. (4). where M is the total number of base learners and each θj contains
essential information about resampling, splitting features, and splitting
4.3. Decision tree (DT) locations. Irrelevant and non-informative base learners could be
removed from the ensemble using a Lasso regularization process. This
DTs are widely employed as a popular ML method, as they are involves assigning weights to the base learners based on their perfor
piecewise constant models, offering straightforward interpretability. mance, relevance, and contribution to the overall predictive power. If
Growing DTs involves iteratively partitioning the data space using bi the weight αj of a specific base learner equals 0, that learner may be
nary splits to create multiple subsets (Breiman et al., 2017). This process
excluded. The resulting ensemble is a Compact-RF model, comprising
entails addressing three key aspects: 1) Identifying the optimal splits, 2)
fewer base learners, and has a better or comparable generalization
Defining a stopping criterion for the splitting process, and 3) Deter
capability than the original ensemble model (Fig. 4). Accordingly, the
mining the output assigned to each terminal node. The terminal nodes,
optimal weights αj are determined to minimize the penalized ensemble
representing the non-partitioned subsets of X, are assigned specific
loss, as follows:
values or classes. Given a training set of data *** {(xi .yi )}N1 , where N ( )
denotes the number of training instances, the quality of each split is ∑N ∑
M
( ) ∑
M ⃒ ⃒
minimize φ yi , αj T X; θj + λ ⃒αj ⃒ (13)
assessed based on its ability to minimize the error measure R(T), as n=1 j=1 1
follows (Breiman et al., 2017):
1 ∑∑ where φ(, ) is the standard loss function for the ensemble model
R(T) = (yi − yt )2 (10) (commonly used as the mean squared error), λ is the Lasso parameter,
N
and T(X; θj ) represents a simple base learner characterized by θj . This
xi ∈t
t∈ ̃
T
8
S.A. Eftekhar Afzali et al. Journal of Cleaner Production 446 (2024) 141305
training set of data {xi , yi }N1 , where N represents the number of training Fig. 1. Once the hyperparameters have been properly tuned, the
instances, the key objective of GBM is to find an approximation of F∗ (X), generalization ability of the optimized models is assessed using the
which maps x to y, such that the expected value of an arbitrary differ remaining 20% of the data that was withheld during the training phase.
entiable loss function φ(y, F(x)) is minimized (Friedman, 2002):
F ∗ (X) = arg minEx.y φ(y, F(X)) (14) 5.2. Performance assessment
F(X)
GBM employs an additive expansion to estimate F∗ (X) using the The present work employs a set of five indicators, namely, the root-
equation below (Friedman, 2001, 2002): mean-square error (RMSE), mean absolute error (MAE), mean absolute
percentage error (MAPE), coefficient of determination (R2), and α 20
∑
M
index to quantitatively describe the predictive performance of the pro
FM (X) = h(X; am ) (15)
m=1
posed ML models. The mathematical definitions of these performance
indicators are given below:
where h(x; am ) represents a simple base learner, specifically a decision √̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅
tree characterized by am , and M refers to the total number of tree 1∑ n
( exp )2
RMSE = Y − Y prd (18)
learners or iterations (Friedman, 2001, 2002). GBM starts with an initial n I=1 i i
constant value. In each iteration, the algorithm fits a new tree to the
aggregated residuals resulting from predictions made by all previously 1∑ n ⃒ ⃒
MAE = ⃒Y exp − Yiprd ⃒ (19)
constructed trees, while the existing trees remain unchanged (Friedman, n i=1 i
2001). Subsequently, the algorithm updates predictions in each iteration
in a greedy manner using the equation below: n ⃒ ⃒
100 ∑ ⃒Yiexp − Yiprd ⃒
MAPE = ⃒
⃒ exp
⃒
⃒ (20)
Fm (X) = Fm− 1 (X) + ν ․ γ lm |(X ∈ Slm ) (16) n i=1 Yi
5.1. Hyperparameter tuning Despite ML’s acknowledged proficiency in tackling complex prob
lems, the lack of transparency associated with these models and their
Optimizing ML model performance requires essential hyper black-box nature poses a substantial risk to the reliability of predictions,
parameter tuning, directly impacting the model’s complexity and the impeding their effective utilization in critical decision-making scenarios.
risk of overfitting. To prevent overfitting, the Bayesian optimization Feature importance analysis is one way to elucidate the rationale behind
algorithm in combination with the k-fold cross-validation technique (k the predictions generated by the models. This analysis aims to quanti
was set to 5 in this study) was employed for the hyperparameter tuning tatively assess the relative contribution of each input feature to the
process of ANN, SVM, DT, RF, and GBM. The database was partitioned model’s predictions. Specifically, in tree-based methods, the Gini
into training and testing sets. The training set was divided into five non- importance is determined by evaluating the average decrease in the
overlapping folds, with four folds used for training and the fifth for error measure caused by the feature’s dominance in the splits. In the
validation. In each iteration, an ML model was built using four training case of RF models, additional information on permuted feature impor
folds and validated against the remaining fold. This process was tance can be obtained through out-of-bag analysis. This analysis in
repeated five times, with each fold serving as the validation set once. volves randomly permuting the values of a single feature in the out-of-
The average of the validation errors from these iterations (k-fold loss) bag samples and re-evaluating the out-of-bag error based on the
was then reported as the error of the ML model. manipulated data. The average difference between the model’s perfor
The optimal hyperparameters were determined by minimizing the mance on the original and permuted data across all trees provides in
fobj = ln (1 +k − fold loss) using Bayesian optimization, a method rec sights into the importance of each feature. Although feature importance
ommended for efficient optimization (Zhang et al., 2022). Bayesian analysis quantifies the relevance of input features based on their impact
optimization is characterized by a systematic searching technique that on the model’s output, it fails to capture interactions or dependencies
relies on expected improvement in the objective function. This unique between these input features. In many real-world scenarios, the signif
characteristic makes Bayesian optimization more effective than other icance of a feature might be conditional on the presence or absence of
methods, such as grid or random search. The primary goal of Bayesian other features, which feature importance analysis alone might not
optimization is to iteratively identify the globally optimal point by reveal. Hence, it is essential to complement feature importance analysis
updating a Gaussian process model of the objective function using the with other techniques. Methods such as partial dependence plots can
Bayes rule. Additional details about the steps the Bayesian optimization provide a more comprehensive understanding of the model’s behavior,
algorithm follows to find optimal hyperparameters are provided in explained in the next subsection.
9
S.A. Eftekhar Afzali et al. Journal of Cleaner Production 446 (2024) 141305
10
S.A. Eftekhar Afzali et al. Journal of Cleaner Production 446 (2024) 141305
Fig. 5. Mean and the best testing a) R2, b) RMSE, c) MAE, d) MAPE, and e) α20 achieved by GBM, Compact-GBM, RF, Compact-RF, DT, ANN, and SVM after 20
simulations.
11
S.A. Eftekhar Afzali et al. Journal of Cleaner Production 446 (2024) 141305
Fig. 6. Experimental vs. predicted compressive strengths by the a) GBM, b) Compact-GBM, c) RF, d) Compact-RF, e) DT, f) ANN, and g) SVM models.
12
S.A. Eftekhar Afzali et al. Journal of Cleaner Production 446 (2024) 141305
Fig. 7. Performance of a) GBM, b) Compact-GBM, c) RF, and d) Compact-RF on training and testing datasets (Continued). Fig. 7. Performance of e) DT, f) ANN, and
g) SVM on training and testing datasets.
13
S.A. Eftekhar Afzali et al. Journal of Cleaner Production 446 (2024) 141305
Fig. 7. (continued).
compared to individual models such as DT. In the case of individual results in terms of mean performance of (R2 = 0.936, RMSE = 3.843
models, the ANN and SVM demonstrate almost similar performances MPa, MAE = 2.505 MPa, MAPE = 5.580%, and α20 = 0.903) and the
with relatively lower standard deviations compared to ensemble models best model of (R2 = 0.983, RMSE = 2.218 MPa, MAE = 1.615 MPa,
considering R2, RMSE, and MAE indicators. However, initial observa MAPE = 6.976%, and α20 = 0.936).
tions indicate the outstanding average performance of the GBM models A graphical representation of the goodness of fit between predicted
in predicting the compressive strength of MK-based GPC, exhibiting the and actual compressive strength values for the best-synthesized models
highest level of accuracy, followed by the RF models. All proposed is provided in Fig. 6. A lower scatter of data points around the baseline
models predict the compressive strength with the mean testing R2 and x = y indicates higher prediction accuracy. In this regard, the GBM and
α20 values exceeding 0.8 and 0.79, respectively, while maintaining the Compact-GBM models exhibit the best fit for both the training and
mean RMSE, MAE, and MAPE values below 5.8 MPa, 4.22 MPa, and testing data, with over 93% of the data points for the GBM model and
15.77%, respectively. The GBM model yielded exceptionally superior over 91% for the Compact-GBM model falling within the range of 20%
14
S.A. Eftekhar Afzali et al. Journal of Cleaner Production 446 (2024) 141305
Table 5
The developed ML model’s comparison in different ranges of compressive strength.
Models Compressive strength ranges (MPa) Number of data points R2 RMSE (MPa) MAE (MPa) MAPE (%) α20 Comment
GBM 5–30 129 0.974 1.210 0.865 5.116 0.977 Very good
30–55 81 0.957 1.670 1.224 2.979 1 Very good
55–80 25 0.834 2.169 1.627 2.577 1 Very good
Compact-GBM 5–30 129 0.951 1.646 1.207 6.871 0.938 Very good
30–55 81 0.918 2.324 1.754 4.409 1 Very good
55–80 25 0.653 3.137 2.655 4.211 1 Poor
RF 5–30 129 0.847 2.913 2.062 13.895 0.829 Very good
30–55 81 0.823 3.409 2.543 6.339 0.926 Very good
55–80 25 0.25 4.612 3.981 6.296 1 Poor
Compact-RF 5–30 129 0.935 1.904 1.413 8.851 0.907 Very good
30–55 81 0.892 2.655 2.118 5.416 1 Very good
55–80 25 0.725 2.788 2.375 3.808 1 Good
DT 5–30 129 0.841 2.979 2.078 11.660 0.837 Very good
30–55 81 0.783 3.771 2.767 6.667 0.951 Good
55–80 25 0.483 3.826 2.992 4.693 1 Poor
ANN 5–30 129 0.878 2.603 1.858 10.584 0.884 Very good
30–55 81 0.799 3.630 2.546 6.415 0.951 Good
55–80 25 0.485 6.485 5.189 8.022 0.880 Poor
SVM 5–30 129 0.831 3.066 2.703 15.947 0.798 Very good
30–55 81 0.839 3.251 2.654 6.587 0.988 Very good
55–80 25 0.249 5.947 4.666 7.246 0.920 Poor
15
S.A. Eftekhar Afzali et al. Journal of Cleaner Production 446 (2024) 141305
Fig. 9. Feature importance analysis using different methods: a) Gini and GBM, b) Gini and Compact-RF, c) Gini and DT, and d) Out-of-Bag permuted and
Compact-RF.
16
S.A. Eftekhar Afzali et al. Journal of Cleaner Production 446 (2024) 141305
Fig. 11. SHAP analysis summary plots of the GBM model regarding the effects of a) CA/FA, b) H2O/Na2O, c) W, d) SHSM, e) SHS, f) SiO2/Al2O3, g) SSS, h) Age, and
i) W/S (continued). Fig. 11. SHAP analysis summary plots of the GBM model regarding the effects of j) MK, k) CT, l) Na2O/Al2O3, m) PCC, and n) SP.
Fig. 11 illustrates the SHAP global explanation of the GBM model. have a negative impact on compressive strength (Fig. 11 (c)). The extra
This comprehensive plot consolidates the information and aids in water contents close to zero (black points) are more likely to produce
identifying potential underlying trends in the data. In these plots, the higher compressive strengths (Fig. 11 (c)). Additionally, it is observed
horizontal axis represents Shapley values corresponding to individual that higher CTs have an adverse effect on compressive strength (Fig. 11
input features for each prediction. Furthermore, the color of each point (k)). This could be attributed to the accelerated geopolymerization re
indicates the value of the corresponding input parameter in each pre action at higher temperatures, resulting in a less compact gel structure
diction. In summary, certain factors positively impact compressive with lower quality and mechanical properties than gels formed over a
strength, including higher values of CA/FA, SHSM, SHS, SSS, and curing longer period at lower temperatures. This observation is in line with the
age, as shown in Fig. 11 (a), (d), (e), (g), and (h). Accordingly, the SHAP findings of Rovnaník (2010).
values for the CA/FA ratio vary in a wide range between − 10 and 15, Regarding the PCC (Fig. 11(m)), 0 signifies no pre-curing (black
indicating the significant effect of the CA/FA ratio on compressive points), 1 indicates pre-curing at 60 ◦ C (red points), and 2 corresponds to
strength. In this regard, it can be concluded that ratios greater than 1 are pre-curing at 80 ◦ C (white points). It can be observed that the distri
preferred to achieve higher compressive strengths. Regarding the impact bution of data points does not exhibit a clear pattern. The same holds for
of SHSM, it can be observed that the SHAP values for molarities greater the amount of SP, which shows a minimal effect on compressive strength
than 12 have a high density between 0 and 2. However, with a decrease (Fig. 11(n)).
in molarity below 12, the SHAP values drastically expand in the negative Fig. 12 reveals a strong interaction between the mixture design pa
direction of the x-axis. Conversely, increased amounts of extra water (W) rameters and their effect on the predicted compressive strength. The
17
S.A. Eftekhar Afzali et al. Journal of Cleaner Production 446 (2024) 141305
distinctive characteristic of PDPs lies in their capability to rapidly esti hydroxide concentration (Ayeni et al., 2021; Görhan and Kürklü, 2014).
mate the output based on various inputs and reveal underlying trends Overall, the compressive strength demonstrates a higher dependence on
and extremum points. The compressive strength demonstrates a signif the SHS contents lower than 150 kg/m3 and SHSM higher than 10
icant stepwise increase with an increasing CA/FA ratio, regardless of the (Fig. 12(g)). As mentioned earlier, the SP dosage, compared to other
MK precursor amount. The maximum compressive strength is achieved mixture components, does not significantly influence compressive
at CA/FA ratios higher than 1 (Fig. 12(a)). As depicted in Fig. 12(b), the strength, although an increase in its content may slightly reduce
compressive strength experiences a notable reduction as W increases, compressive strength (Fig. 12(i)).
aligning with earlier research on the compressive strength of MK-based Finally, considering the impact of the interactions between the
GPC (Pouhet and Cyr, 2016). When additional water is introduced into governing molar ratios on compressive strength development, the crit
the GPC mixture, it produces excess free water. After the geopolymer ical role of H2O/Na2O becomes apparent. The compressive strength
setting, this surplus free water increases total pore volume, resulting in progressively decreases with H2O/Na2O values exceeding 10 (Fig. 12(j)
decreased compressive strength. The compressive strength also de and (k)). This can be attributed to the reduced system pH resulting from
creases due to an increase in the W/S ratio from 0.2 to around 0.6 an increased H2O/Na2O ratio (i.e., dilution effect), negatively impacting
(Fig. 12(c)). Regarding the amount of added activators, it is evident that, the reaction rate and compressive strength (Juengsuwattananon et al.,
as the quantity of SSS increases up to approximately 225 kg/m3, there is 2019). Examining the compressive strength as a function of Na2O/Al2O3
a notable surge in compressive strength (Fig. 12(d) and (h)). However, and SiO2/Al2O3 reveals a notably greater dependence on SiO2/Al2O3
beyond that threshold, additional increments in the amount of SSS lead when the H2O/Na2O molar ratio is below 10 (Fig. 12(j)). Accordingly,
to a gradual rise in compressive strength. This observation holds for the the higher compressive strengths are associated with SiO2/Al2O3 ratios
augmentation of SHS content and its corresponding concentration around 2.4. However, a further increase in SiO2/Al2O3 to ratios beyond
(SHSM) as well (Fig. 12(e) and (f)). Various studies have reported that 2.4 reduces the compressive strength (Fig. 12(j)). This reduction is
higher availability of alkalinity sources (specifically Na2O) leads to mainly attributed to the high amount of unreacted MK remaining, a
higher dissolution of Si and Al components in the alkaline solution finding that was also observed by Juengsuwattananon et al. (2019).
(Aliabdo et al., 2019; Kaur et al., 2018). This produces more gels, According to Fig. 12(k) and (l), the highest compressive strength is ob
thereby densifying the microstructure of the geopolymer and enhancing tained when the Na2O/Al2O3 ratio is around 1, which aligns well with
its strength. Nevertheless, an excessive increase in the quantity or con previous experimental findings (Juengsuwattananon et al., 2019; Lahoti
centration of activators may not result in a further increase in et al., 2017). Adopting a theoretical value of 1 for Na2O/Al2O3 results in
compressive strength, likely due to the lack of sufficient precursor for higher compressive strength, mainly attributed to the charge balance
further dissolution. In contrast, concentrations lower than 10 can between the alkali cation and tetrahedrally coordinated aluminum in
severely diminish the compressive strength (Fig. 12(e)). Similar obser the formed N-A-S-H gel. This phenomenon was also observed by Lahoti
vations have been reported in several relevant experimental studies, et al. (2017) and Zhang et al. (2023).
where a decrease in sample compressive strength has been attributed to Finally, different performance indicators were compared between
the lower rate of the Si-O and Al-O bond breaking and subsequent the best GBM model proposed in the present study and various ML
polymerization of silicon and aluminum due to a decrease in the sodium models proposed by other studies for other geopolymer and alkali-
18
S.A. Eftekhar Afzali et al. Journal of Cleaner Production 446 (2024) 141305
activated concrete types. It is worth highlighting that preceding in and Unluer (2022), respectively. In general, it can be seen that the
vestigations in the literature did not incorporate MK within their proposed GBM model in this research offers comparable or even better
models. As indicated in Table 6, the highest R2 obtained corresponds to performance than the models presented in other studies.
the deep learning model proposed by Nguyen et al. (2020) to predict the
compressive strength of fly ash-based alkali-activated concrete. 7. Conclusions
Furthermore, this model has achieved the lowest values of RMSE and
MAE among all the models presented. It can be observed that the best This study aimed to comprehensively understand the factors
GBM model in the present study has reached the lowest MAPE value affecting the compressive strength of metakaolin-based geopolymer
among all the models that have reported MAPE. Considering R2, the concrete. This was achieved by leveraging a wide range of machine
GBM model proposed in this study ranks second after the deep learning learning techniques, including gradient boosting machine (GBM),
model presented by Nguyen et al. (2020). However, regarding MAE and random forest (RF), decision tree (DT), artificial neural network (ANN),
RMSE, the best GBM model in the present study ranks after the deep and support vector machine (SVM). Accordingly, a database comprising
learning and ANN models proposed by Nguyen et al. (2020) and Peng 235 experimental results was gathered through an extensive literature
Fig. 12. Partial dependence plots of the GBM model illustrating the interactions between input features, including a) CA/FA-MK, b) W-MK, c) MK-W/S, d) MK-SSS, e)
MK-SHSM, and f) SHS-MK (Continued). Fig. 12. Partial dependence plots of the GBM model illustrating the interactions between input features, including g) SHSM-
SHS, h) SSS-SHS, i) SP-MK, j) SiO2/Al2O3-H2O/Na2O, k) Na2O/Al2O3-H2O/Na2O, and l) Na2O/Al2O3-SiO2/Al2O3.
19
S.A. Eftekhar Afzali et al. Journal of Cleaner Production 446 (2024) 141305
survey. Following the development and comparison of the ML models, 2. The SHAP and feature importance analyses identified the coarse-to-
feature importance analysis, SHapley Additive exPlanations (SHAP), fine aggregate ratio, H2O/Na2O molar ratio, sodium hydroxide so
and partial dependence plot (PDP) analysis were employed to capture lution content, and extra water content as the most influential factors
the underlying complex and nuanced patterns regarding the impact of affecting the compressive strength of metakaolin-based geopolymer
each parameter on the strength development in metakaolin-based geo concrete.
polymer concrete. The following conclusions were drawn from this 3. According to the PDP analysis, the proportion of each mixture
study: component could be easily determined to achieve the desired
compressive strength. Specifically, coarse-to-fine aggregate ratios
1. Generally, the ensemble models, specifically GBM and RF, exhibited higher than 1, H2O/Na2O molar ratios lower than 10, extra water
higher prediction accuracy compared to individual models such as contents close to zero, and sodium hydroxide concentrations higher
ANN, SVM, and DT. Among the various ML models appraised, GBM than 10 were more likely to produce higher compressive strength.
demonstrated exceptionally higher capability in predicting the 4. The PDP analysis highlighted the synergistic effect of input features
compressive strength of metakaolin-based geopolymer concrete in on the predicted compressive strength. Specifically, compressive
different strength grades, achieving R2 = 0.983, RMSE = 2.218 MPa, strength showed a notably increased dependence on SiO2/Al2O3
MAE = 1.615 MPa, MAPE = 6.976 %, and α20 = 0.936 in the testing when the H2O/Na2O molar ratio was below 10. Accordingly, higher
phase. In the case of RF, the regularized Compact-RF outperformed strengths were observed at SiO2/Al2O3 ratios around 2.4.
the unregularized RF significantly, showing about a 26% reduction
in RMSE and a 42% reduction in MAPE while utilizing a much In conclusion, this study contributes to both the literature and
smaller number of base learners. practical applications related to the study of geopolymer concrete
20
S.A. Eftekhar Afzali et al. Journal of Cleaner Production 446 (2024) 141305
Artificial neural 0.968 NR NR 4.69 Fly ash, Ahmed Appendix A. Supplementary data
network GGBFS et al.
(2023a) Supplementary data to this article can be found online at https://doi.
Support vector 0.943 NR 3.399 4.472 GGBFS Ahmed
org/10.1016/j.jclepro.2024.141305.
regression and et al.
Mantra Rays (2023b)
Foraging References
Optimization
Random forest 0.932 29.090 4.526 6.336 Fly ash, Zhang Abbas, R., Khereby, M.A., Ghorab, H.Y., Elkhoshkhany, N., 2020. Preparation of
GGBFS et al. geopolymer concrete using Egyptian kaolin clay and the study of its environmental
(2023) effects and economic cost. Clean Technol. Environ. Policy 22, 669–687.
M5P-tree 0.77 NR NR 6.67 OPC, Ahmed Adel, H., Palizban, S.M.M., Sharifi, S.S., Ghazaan, M.I., Korayem, A.H., 2022. Predicting
MK et al. mechanical properties of carbon nanotube-reinforced cementitious nanocomposites
2022a, b, using interpretable ensemble learning models. Construct. Build. Mater. 354, 129209.
Ahmad, A., Ahmad, W., Aslam, F., Joyklad, P., 2022a. Compressive strength prediction of
c, d
fly ash-based geopolymer concrete via advanced machine learning techniques. Case
Artificial neural 0.98 NR 2.970 3.990 Fly ash, Ahmed
Stud. Constr. Mater. 16, e00840.
network GGBFS et al.
Ahmed, H.U., Abdalla, A.A., Mohammed, A.S., Mohammed, A.A., 2022b. Mathematical
(2022b) modeling techniques to predict the compressive strength of high-strength concrete
Artificial neural 0.916 NR 1.59 1.950 Fly ash Peng and incorporated metakaolin with multiple mix proportions. Cleaner Materials 5,
network Unluer 100132.
(2022) Ahmed, H.U., Mohammed, A.A., Mohammed, A., 2022c. Soft computing models to
Extreme 0.9 NRa 4.47 7.9 Fly ash, Shah predict the compressive strength of GGBS/FA-geopolymer concrete. PLoS One 17
gradient GGBFS et al. (5), e0265846.
boosting (2022) Ahmed, H.U., Mohammed, A.S., Faraj, R.H., Abdalla, A.A., Qaidi, S.M., Sor, N.H.,
Gradient 0.943 10.325 3.253 4.894 Fly ash, Zhang Mohammed, A.A., 2023a. Innovative modeling techniques including MEP, ANN and
boosting GGBFS et al. FQ to forecast the compressive strength of geopolymer concrete modified with
(2022) nanoparticles. Neural Comput. Appl. 35 (17), 12453–12479.
Ahmed, H.U., Mohammed, A.S., Mohammed, A.A., 2022d. Multivariable models
Random forest 0.944 13.644 2.013 2.650 Fly ash Gomaa
including artificial neural network and M5P-tree to forecast the stress at the failure
et al.
of alkali-activated concrete at ambient curing condition and various mixture
(2021)
proportions. Neural Comput. Appl. 34 (20), 17853–17876.
Deep neural 0.992 NR 0.554 1.269 Fly ash Nguyen Ahmed, H.U., Mostafa, R.R., Mohammed, A., Sihag, P., Qadir, A., 2023b. Support vector
network et al. regression (SVR) and grey wolf optimization (GWO) to predict the compressive
(2020) strength of GGBFS-based geopolymer concrete. Neural Comput. Appl. 35 (3),
This study 0.983 6.976 1.615 2.218 MK – 2909–2926.
a Albidah, A., Alghannam, M., Abbas, H., Almusallam, T., Al-Salloum, Y., 2021.
NR: Not reported. Characteristics of metakaolin-based geopolymer concrete for different mix design
parameters. J. Mater. Res. Technol. 10, 84–98.
Albidah, A., Alqarni, A.S., Abbas, H., Almusallam, T., Al-Salloum, Y., 2022a. Behavior of
compressive strength. The methodology utilized in this study can be
Metakaolin-Based geopolymer concrete at ambient and elevated temperatures.
served by researchers and engineers, offering an efficient tool for sys Construct. Build. Mater. 317, 125910.
tematic mixture optimization and proportioning. These initiatives help Albidah, A., Alsaif, A., Abadel, A., Abbas, H., Al-Salloum, Y., 2022b. Role of recycled
mitigate the environmental impact of the concrete industry by pro vehicle tires quantity and size on the properties of metakaolin-based geopolymer
rubberized concrete. J. Mater. Res. Technol. 18, 2593–2607.
moting the use of sustainable materials and expediting the development Albidah, A., Altheeb, A., Alrshoudi, F., Abadel, A., Abbas, H., Al-Salloum, Y., 2020. Bond
of alternative binders with minimal operational costs. The limited na Performance of GFRP and Steel Rebars Embedded in Metakaolin Based Geopolymer
ture of the database collected in the current research necessitates the Concrete, Structures. Elsevier, pp. 1582–1593.
Alghannam, M., Albidah, A., Abbas, H., Al-Salloum, Y., 2021. Influence of critical
expansion of this database specifically for high-strength metakaolin- parameters of mix proportions on properties of MK-based geopolymer concrete.
based geopolymer concrete through experimental efforts or using nu Arabian J. Sci. Eng. 46 (5), 4399–4408.
merical approaches such as generative adversarial networks (GANs). Aliabdo, A.A., Abd Elmoaty, M., Emam, M.A., 2019. Factors affecting the mechanical
properties of alkali activated ground granulated blast furnace slag concrete.
Future studies should consider additional parameters in modeling, such Construct. Build. Mater. 197, 339–355.
as the metakaolin-specific surface area, the maximum diameter of the Amin, M., Elsakhawy, Y., Abu el-hassan, K., Abdelsalam, B.A., 2022. Behavior evaluation
aggregate, and the substitution rate of nanomaterials like nano-silica. of sustainable high strength geopolymer concrete based on fly ash, metakaolin, and
slag. Case Stud. Constr. Mater. 16, e00976.
Asteris, P.G., Lourenço, P.B., Roussis, P.C., Adami, C.E., Armaghani, D.J., Cavaleri, L.,
CRediT authorship contribution statement Chalioris, C.E., Hajihassani, M., Lemonis, M.E., Mohammed, A.S., 2022. Revealing
the nature of metakaolin-based concrete materials using artificial intelligence
techniques. Construct. Build. Mater. 322, 126500.
Seyed Ali Eftekhar Afzali: Writing – original draft, Visualization,
Ayeni, O., Onwualu, A.P., Boakye, E., 2021. Characterization and mechanical
Validation, Software, Methodology, Formal analysis, Data curation, performance of metakaolin-based geopolymer for sustainable building applications.
Conceptualization. Mohsen Ali Shayanfar: Writing – review & editing, Construct. Build. Mater. 272, 121938.
Validation, Supervision, Investigation. Mohammad Ghanooni-Bagha: Bakharev, T., 2005. Geopolymeric materials prepared using Class F fly ash and elevated
temperature curing. Cement Concr. Res. 35 (6), 1224–1232.
Writing – review & editing, Supervision, Investigation. Emad Golaf Bature, A., Khorami, M., Ganjian, E., Tyrer, M., 2021. Influence of alkali activator type
shani: Writing – review & editing, Supervision, Project administration, and proportion on strength performance of calcined clay geopolymer mortar.
Methodology, Investigation, Conceptualization. Tuan Ngo: Writing – Construct. Build. Mater. 267, 120446.
Beltrame, N.A.M., Dias, R.L., Witzke, F.B., Medeiros-Junior, R.A., 2023. Effect of
review & editing, Investigation. carbonation curing on the physical, mechanical, and microstructural properties of
metakaolin-based geopolymer concrete. Construct. Build. Mater. 406, 133403.
Declaration of competing interest Breiman, L., 2001. Random forests. Mach. Learn. 45 (1), 5–32.
Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J., 2017. Classification and
Regression Trees. Routledge.
The authors declare that they have no conflict of interest. Chaabene, W.B., Flah, M., Nehdi, M.L., 2020. Machine learning prediction of mechanical
properties of concrete: critical review. Construct. Build. Mater. 260, 119889.
Cortes, C., Vapnik, V., 1995. Support-vector networks. Mach. Learn. 20 (3), 273–297.
21
S.A. Eftekhar Afzali et al. Journal of Cleaner Production 446 (2024) 141305
da Silva Rocha, T., Dias, D.P., França, F.C.C., de Salles Guerra, R.R., de Oliveira, L.R.d.C., Moradikhou, A.B., Esparham, A., Avanaki, M.J., 2020. Physical & mechanical properties
2018. Metakaolin-based geopolymer mortars with different alkaline activators (Na+ of fiber reinforced metakaolin-based geopolymer concrete. Construct. Build. Mater.
and K+). Construct. Build. Mater. 178, 453–461. 251, 118965.
De Silva, P., Sagoe-Crenstil, K., Sirivivatnanon, V., 2007. Kinetics of geopolymerization: Nguyen, K.T., Nguyen, Q.D., Le, T.A., Shin, J., Lee, K., 2020. Analyzing the compressive
role of Al2O3 and SiO2. Cement Concr. Res. 37 (4), 512–518. strength of green fly ash based geopolymer concrete using experiment and machine
De Vargas, A.S., Dal Molin, D.C., Vilela, A.C., Da Silva, F.J., Pavao, B., Veit, H., 2011. The learning approaches. Construct. Build. Mater. 247, 118581.
effects of Na2O/SiO2 molar ratio, curing temperature and age on compressive Pacheco-Torgal, F., Moura, D., Ding, Y., Jalali, S., 2011. Composition, strength and
strength, morphology and microstructure of alkali-activated fly ash-based workability of alkali-activated metakaolin based mortars. Construct. Build. Mater.
geopolymers. Cement Concr. Compos. 33 (6), 653–660. 25 (9), 3732–3745.
Degefu, D.M., Liao, Z., Berardi, U., Labbé, G., 2022. The dependence of thermophysical Peng, Y., Unluer, C., 2022. Analyzing the mechanical performance of fly ash-based
and hygroscopic properties of macro-porous geopolymers on Si/Al. J. Non-Cryst. geopolymer concrete with different machine learning techniques. Construct. Build.
Solids 582, 121432. Mater. 316, 125785.
Duxson, P., Mallicoat, S.W., Lukey, G.C., Kriven, W.M., Van Deventer, J.S., 2007. The Perez-Cortes, P., Escalante-Garcia, J.I., 2020. Alkali activated metakaolin with high
effect of alkali and Si/Al ratio on the development of mechanical properties of limestone contents–Statistical modeling of strength and environmental and cost
metakaolin-based geopolymers. Colloids Surf. A Physicochem. Eng. Asp. 292 (1), analyses. Cement Concr. Compos. 106, 103450.
8–20. Pouhet, R., Cyr, M., 2016. Formulation and performance of flash metakaolin geopolymer
Eisa, M., Basiouny, M., Fahmy, E., 2022. Drying shrinkage and thermal expansion of concretes. Construct. Build. Mater. 120, 150–160.
metakaolin-based geopolymer concrete pavement reinforced with biaxial geogrid. Provis, J.L., Arbi, K., Bernal, S.A., Bondar, D., Buchwald, A., Castel, A.,
Case Stud. Constr. Mater. 17, e01415. Chithiraputhiran, S., Cyr, M., Dehghan, A., Dombrowski-Daube, K., 2019. RILEM TC
Environment, U., Scrivener, K.L., John, V.M., Gartner, E.M., 2018. Eco-efficient cements: 247-DTA round robin test: mix design and reproducibility of compressive strength of
potential economically viable solutions for a low-CO2 cement-based materials alkali-activated concretes. Mater. Struct. 52, 1–13.
industry. Cement Concr. Res. 114, 2–26. Ribeiro, M.G.S., Ribeiro, M.G.S., Keane, P.F., Sardela, M.R., Kriven, W.M., Ribeiro, R.A.
Friedman, J.H., 2001. Greedy function approximation: a gradient boosting machine. S., 2021. Acid resistance of metakaolin-based, bamboo fiber geopolymer composites.
Ann. Stat. 1189–1232. Construct. Build. Mater. 302, 124194.
Friedman, J.H., 2002. Stochastic gradient boosting. Comput. Stat. Data Anal. 38 (4), Rovnaník, P., 2010. Effect of curing temperature on the development of hard structure of
367–378. metakaolin-based geopolymer. Construct. Build. Mater. 24 (7), 1176–1183.
Gomaa, E., Han, T., ElGawady, M., Huang, J., Kumar, A., 2021. Machine learning to Rowles, M., O’connor, B., 2003. Chemical optimisation of the compressive strength of
predict properties of fresh and hardened alkali-activated concrete. Cement Concr. aluminosilicate geopolymers synthesised by sodium silicate activation of
Compos. 115, 103863. metakaolinite. J. Mater. Chem. 13 (5), 1161–1165.
Görhan, G., Kürklü, G., 2014. The influence of the NaOH solution on the properties of the Shah, S.F.A., Chen, B., Zahid, M., Ahmad, M.R., 2022. Compressive strength prediction of
fly ash-based geopolymer mortar cured at different temperatures. Composites part b: one-part alkali activated material enabled by interpretable machine learning.
engineering 58, 371–377. Construct. Build. Mater. 360, 129534.
Heath, A., Paine, K., McManus, M., 2014. Minimising the global warming potential of Shamsabadi, E.A., Roshan, N., Hadigheh, S.A., Nehdi, M.L., Khodabakhshian, A.,
clay based geopolymers. J. Clean. Prod. 78, 75–83. Ghalehnovi, M., 2022. Machine learning-based compressive strength modelling of
Jiang, X., Xiao, R., Bai, Y., Huang, B., Ma, Y., 2022. Influence of waste glass powder as a concrete incorporating waste marble powder. Construct. Build. Mater. 324, 126592.
supplementary cementitious material (SCM) on physical and mechanical properties Shehata, N., Mohamed, O., Sayed, E.T., Abdelkareem, M.A., Olabi, A., 2022. Geopolymer
of cement paste under high temperatures. J. Clean. Prod. 340, 130778. concrete as green building materials: recent applications, sustainable development
Jiang, X., Xiao, R., Ma, Y., Zhang, M., Bai, Y., Huang, B., 2020. Influence of waste glass and circular economy potentials. Sci. Total Environ. 836, 155577.
powder on the physico-mechanical properties and microstructures of fly ash-based Singh, B., Ishwarya, G., Gupta, M., Bhattacharyya, S., 2015. Geopolymer concrete: a
geopolymer paste after exposure to high temperatures. Construct. Build. Mater. 262, review of some recent developments. Construct. Build. Mater. 85, 78–90.
120579. Smola, A.J., Schölkopf, B., 2004. A tutorial on support vector regression. Stat. Comput.
Jiang, X., Zhang, Y., Zhang, Y., Ma, J., Xiao, R., Guo, F., Bai, Y., Huang, B., 2023. 14 (3), 199–222.
Influence of size effect on the properties of slag and waste glass-based geopolymer Valencia-Saavedra, W.G., de Gutiérrez, R.M., Puertas, F., 2020. Performance of FA-based
paste. J. Clean. Prod. 383, 135428. geopolymer concretes exposed to acetic and sulfuric acids. Construct. Build. Mater.
Juengsuwattananon, K., Winnefeld, F., Chindaprasirt, P., Pimraksa, K., 2019. Correlation 257, 119503.
between initial SiO2/Al2O3, Na2O/Al2O3, Na2O/SiO2 and H2O/Na2O ratios on Yunsheng, Z., Wei, S., Zongjin, L., 2010. Composition design and microstructural
phase and microstructure of reaction products of metakaolin-rice husk ash characterization of calcined kaolin-based geopolymer cement. Appl. Clay Sci. 47
geopolymer. Construct. Build. Mater. 226, 406–417. (3–4), 271–275.
Júnior, N.S.A., Neto, J.S.A., Santana, H.A., Cilla, M.S., Ribeiro, D.V., 2021. Durability Zhang, L.V., Marani, A., Nehdi, M.L., 2022. Chemistry-informed machine learning
and service life analysis of metakaolin-based geopolymer concretes with respect to prediction of compressive strength for alkali-activated materials. Construct. Build.
chloride penetration using chloride migration test and corrosion potential. Mater. 316, 126103.
Construct. Build. Mater. 287, 122970. Zhang, M., Zhang, C., Zhang, J., Wang, L., Wang, F., 2023. Effect of composition and
Kaur, K., Singh, J., Kaur, M., 2018. Compressive strength of rice husk ash based curing on alkali activated fly ash-slag binders: machine learning prediction with a
geopolymer: the effect of alkaline activator. Construct. Build. Mater. 169, 188–192. random forest-genetic algorithm hybrid model. Construct. Build. Mater. 366,
Lahoti, M., Narang, P., Tan, K.H., Yang, E.-H., 2017. Mix design factors and strength 129940.
prediction of metakaolin-based geopolymer. Ceram. Int. 43 (14), 11433–11441. Zhu, X., Li, W., Du, Z., Zhou, S., Zhang, Y., Li, F., 2021. Recycling and utilization
Lundberg, S.M., Lee, S.-I., 2017. A unified approach to interpreting model predictions. assessment of steel slag in metakaolin based geopolymer from steel slag by-product
Adv. Neural Inf. Process. Syst. 30. to green geopolymer. Construct. Build. Mater. 305, 124654.
22