Professional Documents
Culture Documents
1 s2.0 S1359645424000569 Main
1 s2.0 S1359645424000569 Main
Acta Materialia
journal homepage: www.elsevier.com/locate/actamat
A R T I C L E I N F O A B S T R A C T
Keywords: The near-surface degradation of γ’-strengthening phase caused by oxidation has a significant effect on the me
Nickel-based single crystal superalloys chanical properties of thin-walled Ni-based single crystal (SX) superalloys for turbine blade applications. It is
Oxidation dynamics essential yet still challenging to link alloy composition, oxidation behavior and near-surface microstructure to
Microstructure
guide the design of high-performance SX superalloy. In the present work, two key issues are breached from a
Machine learning
data-driven perspective, which are the prediction of the oxidation rate constant (Kp) and the near-surface γ’
evolution features respectively. Two modeling strategies are proposed to predict Kp from compositions. One is
the machine learning (ML) model based on detailed feature engineering, the other is a novel model that describes
the lnKp -U (alumina formation energy) relationship by several linear functions with different intercepts ac
cording to ML classified sub-regions. The generalization ability for predicting Kp is significantly boosted by an
ensemble model obtained from the two strategies. Insights related to oxidation control mechanisms and the
contributions of key descriptors are obtained from interpretable information. Further, the ML models of the
relationship between the near-surface composition and the fraction, shape factor, and size of γ’ are constructed
based on the near-surface microstructure datasets. Dynamic simulations using DICTRA are utilized to link the Kp
with the near-surface composition and to refine the Composition-Kp-Degradation Microstructure (CKM) pre
diction flow. This flow allows for the direct acquisition of diverse near-surface microstructural evolution features
from simple alloy compositions with a verified remarkable generalization performance, which can be further
applied to the design of Ni-based SX superalloys for thin-walled components.
1. Introduction initial oxidation and becomes the rate-controlling step for the later
stages of oxidation. However, this causes the depletion of Al in the alloy
As the core material for turbine blades of aero-engine, Ni-based matrix with the degradation of the γ’ strengthening phase. It is a
single crystal (SX) superalloys are operated in the service under ther generally accepted fact that the size and fraction of γ’ as well as the
modynamic coupling conditions and exposed to high temperatures with morphology of γ/γ’ interfaces in near-surface undergo drastic degrada
complex chemical environments including the harshly corrosive and tion during oxidation [5,6], which weakens the inherent strengthening
oxidation [1]. In the current design criteria for Ni-based SX superalloys, effect of superalloy. Moreover, thin-walled blades are widely employed
it is necessary to add sufficient Al (~6 wt.%) to promote the formation of in advanced aero-engines to tackle the increasing inlet temperature with
the protective Al2O3 on the surface [2–4]. The formation of continuous super air-cooling technology. Unfortunately, reduced wall thickness has
Al2O3 slows down the oxidation rate compared to the NiO-controlled been verified to decrease the creep rupture life of superalloys, which is
* Corresponding author at: Key Laboratory of Aerospace Materials and Performance (Ministry of Education), Research Institute of Frontier Science, Beihang
University, Beijing, 100191, China.
E-mail address: wendyzhao56@buaa.edu.cn (W. Zhao).
https://doi.org/10.1016/j.actamat.2024.119703
Received 8 July 2023; Received in revised form 16 January 2024; Accepted 22 January 2024
Available online 25 January 2024
1359-6454/© 2024 Acta Materialia Inc. Published by Elsevier Ltd. All rights reserved.
F. Yang et al. Acta Materialia 266 (2024) 119703
known as the thickness-debit effect and brings a new challenge for the γ’ fraction, while the morphological characteristics of the γ’ and γ/γ’
design of superalloys [7–9]. The oxidation induced γ’ degradation in the interfaces are ignored. The microstructural features need to be further
near-surface is suspected to emerge as a primary contributor to this ef extended in detail, especially in the case that the microscopic size and
fect, owing to the significantly augmented proportion of the affected coarsening of γ’ could affect the transformation of the interaction be
region [10–12]. Therefore, the quantitative description and prediction tween dislocation and γ’ [22,23]. The complex nonlinear relationship
of the degradation of the γ’ phase caused by oxidation can help designers between composition and Kp, the diverse description of near-surface
assess the damage of these degraded microstructures to the original microstructural features, and the prediction of near-surface micro
creep resistance, which is of great significance for designing alloys with structure remain challenges to be further addressed.
thinner wall thicknesses. With the concept of fourth paradigm science being proposed, data
However, consider a physical process of oxidation-induced near- science is widely applied in materials science [24–26]. As the core
surface γ’ degradation: the formation of continuous Al2O3 induces the method of data science, machine learning (ML) is used in research on the
change of composition distribution in the near-surface, which further prediction of material structure and property [27–30], and even the
induces a change in the morphology, number and size of γ’. It is noted reverse design of new material structures [31–33], it can be attributed to
that although dispersed alumina could be formed during the initial NiO- the advantage of ML in solving nonlinear problems in high-dimensional
controlled rapid oxidation stage, this does not cause a significantly spaces, this advantage is highly enlightening for the solution of the
larger range of γ’ phase degradation in the near-surface, thus, its effects Composition-Kp-Degradation Microstructure (CKM) relationship. ML
can be approximately ignored. It is challenging to model this physical has been applied to predict the dynamical behavior of alloy oxidation.
relationship, the prediction of the kinetic behavior of alumina and the Bhattacharya et al. develop an ML model to predict the oxidation rate
near-surface alloying element distribution is the prerequisite for constants for Ti alloys based on the GBDT model [34]. Wei et al. [35] fit
obtaining the γ’ degradation features, while how to infer the specific the parameters Q and m extracted from the oxidation weight gain curves
relationship between alloying elements distribution and the near- based on the Arrhenius equation using the Lasso algorithm to estimate
surface γ’ degradation features is another key issue to be addressed. A the oxidation weight gain behavior of FeCrAlCoNi high entropy alloy. In
core point is the estimation of the oxidation rate controlled by contin addition, for Ni-based superalloy or SX superalloy, research on directly
uous Al2O3, as it mainly determines the degree of the dissipation of γ’ predicting oxidation weight gain based on artificial neural networks is
forming element - Al in the process of near-surface microstructural carried out, such as a series of works by Kim et al. [36] and Yun et al.
degradation. The classical Wagner theory proposes that the alumina [37]. The model of the relationship between composition and Kp can be
growth process follows a parabolic law [13], the kinetic behavior constructed assisted by ML, but more efforts are required in feature
controlled by alumina can be described by the parabolic rate constant engineering, the fusion of prior information and the mining of inter
(Kp). The relationship between the composition and Kp needs to be pretable information to achieve breakthroughs in the generalization of
constructed to ensure that the alloy composition can be designed. For Kp prediction models and obtaining material insights into the effect on
the prediction of near-surface alloying elements distribution, it can be oxidation dynamics.
obtained by combining Kp with thermodynamic simulation [14,15]. To Although ML has not been applied to the establishment of oxidation-
sum up, the prediction of Kp from alloy composition and the construc microstructure relationships, some of the present studies are inspiring.
tion of the relationship between the near-surface alloying element dis In the works employing diffusion couples, the linkage between compo
tribution and γ’ degradation features are the two bottlenecks that need sition and microstructural features within the alloy is constructed
to be urgently broken through. [38–40]. The relationship between the composition and the γ’ fraction,
In the studies associated with the prediction of the oxidation rate and γ’ size is established based on diffusion-coupling experiments in
the evaluation of the effect of oxidation on the microstructural degra Co-based superalloys by Yu et al. [41]. Similar work is studied for
dation and creep resistance of Ni-based SX superalloys, Bensch and Sato Ni-based superalloys by Qin et al. [42], the precipitation phase is pre
have performed explorative and pioneering work [16,17]. In the pre dicted as the function of chemical composition, and Nb-W phase dia
diction of oxidation rate, Sato extends the Wagner-Hauffe theory by grams that far exceed Thermo-Calc accuracy are determined.
introducing the kinetic parameter - the effective valence electron num Development of the datasets consisting of near-surface elemental dis
eff
ber (Valt ) of the Al2O3, a concise empirical equation incorporating tributions and microstructural features, and modeling by ML could be a
thermodynamic and dynamics effects is obtained [16]. Bensch applied viable scheme to reach the goal of predicting the degradation of
this empirical equation, by combining thermodynamic calculations with microstructure for designed superalloys.
a creep constitutive equation to predict the internal microstructural In this work, the CKM linkages are modeled based on ML. Firstly, we
degradation and creep rupture life of RenéN5 and SCA425+ thin-walled collect the data of alumina-forming Ni-based SX superalloys from the
SX superalloys [18]. These works provide a quantitative assessment of published literature, and the Kp is extracted. An ML model that can
the damage caused by oxidation on the strengthening phases as well as predict the Kp of alumina-forming Ni-based SX superalloys is developed
the mechanical properties. However, further in-depth studies are still after the model selection and feature engineering. For the continuous
necessary. For the prediction of Kp, the generalizability of the alumina-forming superalloys, we explore the effectiveness of using
above-mentioned empirical formulas still needs to be deeply examined. discontinuous alumina-forming data as a data enhancement for predic
One hand arises from the large uncertainty of the fitted empirical pa tion performance improvement. Besides, we find that the lnKp -U rela
rameters caused by the very small amount of data used in past studies. tionship is sub-regional linear, that the defective and diffuse properties
On the other hand, the effect of multi-component alloying elements on of the alloys are the physical factors leading to this, and the key features
oxidation kinetics is complex. The rate of alumina formation is strongly determining the sub-regional properties are given and explained. ML
influenced by the diffusion kinetics within the alloy matrix (cross- classification models for predicting sub-regional properties are con
diffusion [19]), competition with other oxides for growth, and structed. Further, we propose a new empirical equation for predicting
third-element effects [20,21]. The complexity of the chemical compo the Kp. Secondly, we develop the datasets of near-surface composition
sition of the superalloy needs to be further integrated into the rela distribution and microstructural features based on the isothermal
tionship between composition and Kp. oxidation experiments of four Ni-based SX superalloys at 1100 ◦ C. The
In terms of the prediction of γ’ degradation features in near-surface, ML models for predicting γ’ fraction, γ’ size, and γ’ shape factor are
the establishment of the relationship between oxidation and micro established. To complete the entire CKM prediction process without
structure is more dependent on thermodynamic calculations [14,15], costly experiments, the near-surface alloying elemental distributions are
resulting in the predictable microstructural features being limited to the obtained by using the kinetic simulations - DICTRA and are utilized as
input to ML prediction models for the near-surface degradation γ’
2
F. Yang et al. Acta Materialia 266 (2024) 119703
features. A newly developed alumina-forming Ni-based SX superalloy is expansion and filtering is used to boost the prediction accuracy of the
utilized to demonstrate and validate the computational flow of CKM. ML model. Secondly, we extract the elemental distribution, γ’ features
Finally, we discuss the potential application of the CKM relationships from the near-surface microstructure at different isothermal oxidation
constructed in this work. times and construct the datasets. The ML substitute models are con
structed between these microstructural degradation features and near-
2. Methods for constructing the CKM prediction flow surface elemental composition distribution. With the aid of thermody
namic simulations, we obtain the distribution of near-surface elements
2.1. Overview of the overall modeling strategy from the predicted Kp and as the input of ML models to infer the γ’
degradation features in the near-surface. The key dataset components,
Fig. 1 shows the overall workflow diagram for constructing the CKM feature engineering methods and construction details of the ML model
relationships. It consists of two main parts: the construction of ML are described in the following sections.
models for predicting the "C-K" and "K-M" linkages, respectively. Firstly,
the Kp datasets are constructed from the previous study after necessary
pre-preprocessing. Two strategies are applied to model the composition- 2.2. Datasets of parabolic rate constants and near-surface degraded
Kp linkage, including the direct use of several classical ML models versus microstructure
the construction of ML models by the traction of the prior physical
formula, and the detailed feature engineering including feature In this study, we construct the dataset of Kp and near-surface
microstructural degradation features for alumina-forming Ni-based SX
Fig. 1. Computational flow of the constructed CKM relationship model. Consists of two core components, where the upper side of the flow chart shows the
construction of the prediction model for the Kp by using two strategies, including data collection, model selection, feature engineering, and the integration of physical
formula-driven data mining and ML. The lower part outlines the construction of the near-surface degraded γ’ features datasets and prediction ML model of γ’ features,
DICTRA in Thermo-Calc is used to connect Kp and the prediction of the near-surface degraded microstructure.
3
F. Yang et al. Acta Materialia 266 (2024) 119703
superalloys. We obtain 82 Al2O3-controlled Kp from previously reported the consumption of the film-forming elements. We use the Ni-based
literature. Specifically, the guidelines for Kp data collection are as fol alloy database and the diffusion database of Thermo-Calc to calculate
lows, 1. All data are derived from the results of isothermal oxidation the activity of alloying elements. As for the kinetic aspect, Sato [16]
experiments. Considering that the peeling of oxide film caused by proposes the total effective number of valence electrons as a kinetic term
thermal stress in the oxide scale in the cyclic oxidation could strongly affecting the oxidation parabolic rate constant based on the effect of
impact the oxidation kinetic process [43], the data in the cyclic oxida impurity cations on dominant defect concentration in alumina lattice, as
tion experiment is abandoned. 2. It is necessary to determine the for shown in Eq. (1).
mation of aluminum oxide during the oxidation process, which can be ∑n
determined from the XRD analysis or the morphology of the oxide layer Valeff
t = i=1
(Zi − ZAl )xi (1)
provided in the literature. Most of the alloys covered in the Kp dataset
are those that form continuous alumina. In some studies, the Where Zi is the effective valence of the element i’s ion, ZAl is the valence
alumina-controlled Kp is given directly, and for those not given directly, of the Al ion, and xi can be the atomic fraction or atomic fraction in γ of
the Kp is extracted from the kinetic curves of oxidation weight gain element i. The continuous depletion of Al in the alloy matrix during the
provided in the literature. It is noted that there are usually turning points formation of alumina causes a γ’-γ (γ’-free) phase transition immediately
in these oxidative weight gain curves, presenting two distinct kinetic below the alumina, therefore, the contribution of the γ’-free area to the
eff
regimes. In the initial stage of oxidation, NiO is the main oxidation impurity elements has to be taken into account. Here we use the Valt as
product (NiAl2O4, NiCr2O4, and NiTa2O4 could also be formed) [44], the a prior feature in terms of dynamics. In addition, we prefer that the
initial rate of oxidation is controlled by the faster kinetic formation rates elemental content in γ’ also needs to be introduced independently in the
of NiO caused by the large contact area of O2− with surface Ni, and the above equation, because the γ’ with high Al content could act as the
rate of oxidation gradually change to alumina controlled in the late nucleation site for Al2O3 [57,58], and the γ’ phase could contribute
stages as the formation of more thermodynamically stable continuous more impurity elements. Finally, CBFVs are constructed based on
alumina, it will lead to oxidation weight gain kinetic curves typically fundamental atomic electronic structure properties. The average
presenting two distinguishing stages, which is confirmed by many weighted and standard deviation algorithms are employed to calculate
studies [45–48]. Therefore, the Kp is extracted from the later the CBFVs, which gives a measure of the average level and differences
alumina-controlled stage of oxidation kinetic curves to avoid the among the elements in the alloy concerning specific elemental proper
possible confusion caused by the rapid NiO-controlled oxidation stage. It ties. The above-mentioned prior features in terms of thermodynamics
is noteworthy that all the following Kp in this paper refer to the oxidation and kinetics and CBFVs are summarized in Table 1. Feature screening is
rate at the alumina-controlled stage of oxidation kinetics. A more used to eliminate redundant features and features that are not relevant
detailed description of the Kp dataset and Kp extraction methods is to the target properties. As listed in Fig. 1, feature correlation calculated
provided in Supplemental 1.1. Table S.1 shows the statistical infor by Pearson Correlation Coefficient (PCC) is used to filter redundant
mation of the Kp dataset, and the original tabular data can be further features. The backward feature screening methods [59] and feature
referred to the Table S.2. importance from the tree-based ML model are used to narrow down the
For the construction of the degradation of near-surface microstruc range of possible optimal feature combinations, and then, the traversal
ture datasets, four Ni-based SX superalloys including DD32 [49], DD91 method [60] is employed to determine the optimal feature
[50], and two SX superalloys named ICSX1 and ICSX2 which are
developed by our group are selected as the experimental superalloys.
Table 1
The experimental details regarding isothermal oxidation, and
The extended thermodynamic activity features, prior knowledge features, and
near-surface composition measurements are described in Supplemental
CBFVs in the feature space. The physical description of the prior knowledge
1.2. The γ’ fraction, γ’ size, and γ’ shape factor in the γ’ reducing layer
features, the elemental properties involved in the CBFVs, and the calculation
are quantified using Image-j software [51], the method of measuring the formulas used are provided. The abbreviations in [] give simple descriptions of
γ’ shape factor is referred to that used in Lu et al. [52]. The constructed features and elemental properties.
near-surface microstructure dataset consists of 356 data including the
Features Feature pool [representation Calculation methods
alloying element distribution and microstructural degradation features. symbol]
The nominal compositions of the four Ni-based SX superalloys and the
Thermodynamic The activity of Ni, Al, Co, Cr, Thermo-Calc software
statistical information of the γ’ features are shown in Table S.3. activity features Mo, Re, Ru, Ti, Ta, W, Hf and TCNi10 database
∑
The effective valence is based Valct , ni=1 (zi − zAl )xi , xi
2.3. Feature engineering on C, Cγ , Cγ′ . [Valceff ,Valγeff ,Valγeff
′
] can be C, Cγ , Cγ′ .
∑
Effective diffusion coefficients D0 = xi Di0
Prior knowledge
In the modeling of the Kp, we perform detailed feature engineering to of Ni and Ni3Al. [D_Ni, D_Ni3Al]. i
features ∑
extend the available feature space and reduce the learning difficulty of For D_Ni: xi = Ciγ ; For D_Ni3Al: Qm = xi Qim
the ML model in the small dataset. As shown in Fig. 1, feature engi xi = Ciγ′ i
(− Qm)
neering is inspired by the following two aspects. 1. Constructing features D = D0 exp
∑n RT
related to the oxidation rate based on prior knowledge. 2. The statistical CBFVs Cohesion energy [CE], Pauling p = i=1 ci pi
electronegativity [PE],
Composition-Based Feature Vector [53] (CBFV) is constructed, which δ(p) =
Ionization energy [IE], Vacancy
has the potential to provide information on the microscopic level. CBFVs formation energy [VE], Atomic
√̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅
∑n 2
i=1 ci (pi − pNi )
consist of weighted statistics on the atomic properties and contents of radius [ar], ionic radius [ir],
the selected elements in the material, it has been widely employed in Ionic valence [iv], Free electron
ML-related works for superalloys [42,54-56] as it can represent the concentration [e/a]
physical information from the aspects of size, bond, and electrons. The Note: In the Calculation methods of Effective diffusion coefficients, xi is the
growth of oxides on metal surfaces is controlled by a combination of mole fraction of alloying element i, D0 and Qm are the pre-exponential factor and
thermodynamic and kinetic mechanisms. First, from the thermodynamic the activation energy. In the Calculation methods of CBFVs, pi is the property of
aspect, the activity of the alloying element determines the driving force alloying element i. p and δ(p) based on the elemental property p are named p_avg
of oxide formation, which is directly related to the oxidation rate based and p_var. Ci in the calculation equation is the nominal composition of the
on Wagner’s theory, and it also indirectly introduces information alloying element i in the alloy, and Ciγ and Ciγ′ are the content of the alloying
regarding the change of the composition of the alloy substrate by element in the γ and γ’, which are calculated using Thermo-Calc.
oxidation, since the Gibbs free energy of the oxide greatly determines
4
F. Yang et al. Acta Materialia 266 (2024) 119703
combinations. respectively, and compare the effects of these two types of features and
determine the baseline model. The prediction RMSE and MAE errors of
2.4. Training and evaluation metrics for ML models these ML models are shown in Fig. 2. The prediction errors of the ML
models with activity features are lower than the ML models with
For the Kp properties, we use Cross-Validation (CV) to evaluate the composition features, it is clear that the activity feature is more bene
predictive performance of the ML model since the small dataset of Kp. To ficial in improving the generalization performance. The RF model ex
quantify the generalization performance of the model to the unknown hibits the lowest generalization error, so the RF model based on the
compositions, the CV strategy similar to Leave-one-cluster-out cross- temperature and activity features is used as the baseline model for the
validation (LOCO CV) [61] is utilized. Instead of the clustering process subsequent feature engineering.
in LOCO CV, we validate each independent composition system as a fold. We perform further filtering of the thermodynamic activity features.
In this strategy, all alloy systems have been taken as the test set to make RF model can provide quantified importance about the features influ
the model evaluation results more robust, and also to ensure that suf encing the model decisions. The quantified importance can be taken as
ficient data are used as the training set with low training cost ascribed to the filtering sequence of backward feature filtering to refine key fea
small datasets. It is called Compositions-Based Cross Validation (CBCV) tures. Fig. 3a1 shows the internal ranking of feature importance obtained
in this paper. from the RF model. The activity of Ta, Ni, Al, and Cr shows high feature
In the modeling of near-surface microstructural degradation fea importance. A noteworthy point is that the activity of Al and Ta shows
tures, the dataset is divided randomly into a training set and a test set in the highest correlation among all elements, with a correlation coeffi
the ratio of 8: 2, and the 10-fold CV is performed on the training set to cient of 0.87 in the correlation matrix shown in Fig. 3a2. Considering the
obtain the optimal parameters. The average of the predictions of the 10 important influence of Al on the oxidation process of superalloys
models obtained from 10-CV on the test set is used as the prediction compared to Ta, here Ta activity could be a redundant feature that in
result on the test set, the prediction error on the test set is utilized to terferes with the learning of the ML model on the Al effects. So, we
evaluate the generalization performance of the models. All ML models, remove the Ta activity to ensure the reliability of the internal logic of the
implementation tools and hyperparameter optimization methods used ML model. Based on the remaining activity features and feature
in this work can be found in Supplemental 2.1. importance ranking, the feature screening is performed, and the changes
The values of Kp in this dataset range from 0.000468 g2/m4/h to in the RMSE error, MAE error and R value on CBCV of the RF model in
215.9 g2/m4/h but deviate from the normal distribution, the distribu the process of backward feature selection are shown in Fig. 3a3. The
tion of Kp is provided in Fig. S.1. Thus, logarithmically transformation is lowest generalization error of the RF model is achieved when the
used to normalize the distribution of the Kp data, as shown in Eq. (2): remaining activity features are the activities of Ni, Al, and Cr. The result
of predicted and experimental Kp is presented in Fig. 3c. The R value on
Y′ = ln(Y) (2) CBCV is 0.724, it can be found that some data points are clustered along
In this work, the metrics used to evaluate the generalization per the diagonal line, but there are some outliers with large prediction er
formance of the ML model include RMSE, MAE, R2, and R, as calculated rors, it needs to be further optimized.
in Eqs. (3-6). Next, we add prior knowledge features and CBFVs to the RF model
√̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅ and still perform the feature filtering, the specific constructed features
∑n pre 2 can be found in Table 1, the results are illustrated in Fig. 3(b1-b3).
i=1 (Yi − Y i )
RMSE = (3) Backward feature filtering is used to initially filter features, CBFVs -
n
∑n
i=1 |Yi − Yipre |
MAE = (4)
n
∑n 2
(Yi − Yipre )
R2 = 1 − ∑i=1
n 2
(5)
i=1 (Yi − Y)
∑n ( pre )
i=1 (Yi − Y) Yi − Y pre
R = √̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅ (6)
∑n 2 ∑n ( pre )2
i=1 (Yi − Y) i=1 Y i − Y pre
5
F. Yang et al. Acta Materialia 266 (2024) 119703
Fig. 3. Kp prediction ML model constructed based on feature engineering. a1-a3 Screening of thermodynamic activity features. c Prediction results of the RF
model on CBCV based on the filtered thermodynamic activity features and temperature. b1-b3 Backward selection and feature traversal of CBFVs and prior features,
where Fig. b2 and b3 are traversal results using MAE and RMSE as error evaluation, respectively. d Prediction results of the RF model on CBCV after whole feature
engineering. e the feature’s SHAP values as the representation of the global interpretability of the model.
CE_var, ir_avg, PE_avg, e/a_avg, e/a_var and the prior features - Valγeff ,
′
set. When the feature subsets are e/a_avg and e/a_var, the RF model has
the lowest MAE error, when the feature subset is CE_var, ir_avg, PE_avg,
D_Ni are identified as the features subset to be further filtered, notice
e/a_avg and Valγeff , the RF model has the lowest RMSE error. These two
′
that the R value on CBCV has been improved over 0.8 as shown in
Fig. 3b1. Moreover, benefiting from the small number of this feature subsets of features are not exactly aligned because the effect degree of
subset, the traversal method can be utilized to determine the optimal these two error functions by outliers is dissimilar, RMSE is more sensi
solution avoiding an unacceptably large computational cost. The results tive to outliers compared to MAE. Considering that the previous results
in Fig. 3(b2-b3) show the traversal results with MAE and RMSE as the in Fig. 3c show the larger outliers in the prediction of Kp, we choose the
error evaluation metrics, respectively, and indicate that the optimal subset of features with the lowest RMSE loss as the solution, and the
generalization performance could be obtained with the smaller feature MAE calculated with this feature set is 1.369, which is close to the
6
F. Yang et al. Acta Materialia 266 (2024) 119703
optimal MAE value in Fig. 3b2. The Kp prediction results based on the the Kp prediction model as a data enhancement method and evaluates
filtered feature set are shown in Fig. 3d, it can be seen that the prediction the prediction performance of the model on the alumina-forming data.
outliers are significantly reduced on CBCV, and the R value is improved The results of these two methods are illustrated in Fig. 4(a-b), the R
to 0.813 after feature engineering. values for the model of methods 1 and 2 are 0.810 and 0.832, respec
tively. The discontinuous alumina-forming data used as data augmen
3.1.2. Understanding the inherent decision logic of the model: revealing key tation could enhance the robustness of the model and promote the
features and their role generalization performance of the model. Method 2 is selected as the
It is of interest to understand the intrinsic material mechanisms re feature engineering-based ML model for predicting the Kp of continuous-
flected by the features. To clarify the role and contribution of features in alumina forming Ni-based SX superalloys. However, a non-negligible
the RF model, we use the SHapley Additive exPlanations (SHAP) [62] concern is that the Kp prediction model applicable to the continuous
tool to analyze the interpretability of features, which is illustrated in alumina-forming superalloys still has outliers with large prediction er
Fig. 3e. NiO, Cr2O3, and Al2O3 are the typical oxidation products of rors, as shown in the red region marked in the lower right corner of
Ni-based SX superalloys, and the activities of Ni, Al, and Cr are screened Fig. 4(a-b). As pointed out at the end of Section3.1, other forms of prior
out as important thermodynamic features precisely. We discuss in detail knowledge need to be introduced into the model. In the next section, we
the influence of Ni, Al, and Cr activity features on the predicted Kp and will try to combine the physical prior formula with data mining to solve
demonstrate the consistency of this interpretable information with ma this problem.
terial perceptions, which can be found in Supplemental 2.2. The ML
model does capture the key material information in the feature engi 3.2.2. A sub-regional conquer linear model for predicting Kp inspired by the
neering, it supports the exploration of the information embedded in the physical formula
prior knowledge features and CBFVs. Here a physical prior formula-driven data mining is proposed. In
For the prior knowledge features, a new effective descriptor is ob previous studies, classical Wagner’s theory proposes a mechanism con
tained, the Valγeff is only retained in feature filtering and shows the
′
trolling oxidation kinetics through the effect of defect concentration
within the alumina lattice on anion diffusion and provides a quantitative
highest feature importance, it implies that this feature could have a
description of this, which motivates us to check the impact of this
higher priority compared with the Valeff based on elemental contents in
mechanism, as well as the influence of other potential factors in more
γ and nominal composition. From the perspective of dynamics, Al2O3
data. Considering the classical Wagner-Hauffe theory [63] and the
tends to preferentially nucleate in the γ’ [58], and the following phase
following work carried out by Sato [16], the energy for the formation of
transition occurs,
Al2O3 can be estimated by Eq. (6)
∑
2Ni3 Al(γ′) + 3O→Al2 O3 + 6Ni(γ) U = ni=1 (Zi − ZAl )xi ΔGf (6)
ΔGf is the Gibbs free energy of formation of alumina, the meaning of
γ’ phase could be the main source of impurity elements in the Al2O3
Zi and ZAl is the same as in Eq. (1), and xi is the atomic fraction in γ of
lattice. The importance of the Valγeff feature in the model could also
′
element i. Kp is determined by U, specifically, according to the empirical
reflect that the initial alumina formation may have a prominent formula provided by Bensch [17], as shown in Eq. (7)
contribution to Kp, this needs to be verified by further research. Besides,
Kp = c1 exp(c2 U) (7)
the CBFVs based on cohesion energy, electronegativity, and free electron
concentration also have a high proportion of SHAP feature importance.
Where c1, and c2 are the fitting parameters that need to be obtained from
The addition of CBFVs also greatly reduces the number of predicted
the data. Taking the logarithm of both sides of Eq. (7), we have
outliers, which can be found in the comparison between Fig. 3c and 3d.
The CBFVs may compensate for missing physical information on the lnKp = c2 U + lnc1 (8)
input, the discussion of potential physical insights corresponding to
It can be seen that ln(Kp) is proportional to U, and where ln(c1) can
CBFVs can be found in Section3.2.3. However, some outliers still exist.
be considered as a new term, c3,
After effective feature engineering, the learning ability of the ML model
itself could be the main factor limiting the generalization performance, lnKp = c2 U + c3 (9)
which could be attributed to the inconsistency of inductive bias within
As shown in Eq. (9), combining the classical theoretical and empir
the model and material knowledge. Integrating more prior knowledge
ical equations, lnKp can be obtained by the linear equation fitted ac
may be the key to further improving the generalization performance in
cording to U. Since the empirical formulation of Eq. (9) is proposed from
terms of the composition-performance law.
a very small amount of data in previous work, we validate the lnKp - U
relationship and update the specific parameters in Eq. (9) based on more
3.2. Integrating physical formula-driven data mining and ML: enhancing data in this study. We visualize lnKp and U relationship in Fig. 5a and fit
Kp prediction accuracy for continuous-alumina forming Ni-based SX it using Eq. (9). The equation is determined to be lnKp = 0.0499U −
superalloys 6.83, it is clear that this equation does not fit most data points. Although
lnKp and U present a proportional relationship in Eq. (9), a newly
3.2.1. Prediction performance of feature engineering-based Kp prediction discovered point is that these data points appear to be approximately
ML model for continuous-alumina forming Ni-based SX superalloys sub-regional linearly distributed, this implies that c3 is not a fixed con
The formation of continuous alumina significantly causes the stant and could exhibit compositional sensitivities except for U. As
degradation of the near-surface γ’ strengthening phase, therefore the illustrated in Fig. 5b, with the above obtained linear equation as the
prediction of Kp for continuous alumina-forming superalloys needs to be baseline (the full red line in Fig. 5b), we calculate the difference between
further investigated systematically before the prediction of near-surface each data point and baseline and plot the distribution of the difference,
γ’ microstructural degradation. With the aid of the ML model obtained the subplot at the bottom right in Fig. 5b shows that there are four
in the previous section, we can quickly predict the Kp for the continuous different distributions, which indicates that these data points can be
alumina-forming superalloys. In terms of the dataset used, the Kp pre regarded as sub-regional distribution. The whole region can be divided
diction model can be constructed by the following two methods: 1. The into N (N = 4) subgroups, and are marked as Area 1–4 in the plot. In the
Kp prediction model is constructed using the dataset of continuous following part, we call the data points distributed in these four regions
alumina-forming alloys alone with the optimal features. 2. The data of Group 1–4. According to Eq. (9), we perform the linear fit for the rela
discontinuous alumina-forming data remains in the training process of tionship between U and lnKp in each group respectively, and the fitting
7
F. Yang et al. Acta Materialia 266 (2024) 119703
Fig. 4. Generalization performance of the Kp prediction ML model for continuous alumina-forming data. a Prediction results for the model trained on
continuous alumina-forming data on CBCV. b Prediction results for the model trained on continuous alumina-forming data and discontinuous alumina-forming data
on CBCV. Note that the results of the CBCV evaluation shown above are performed on continuous alumina-forming data.
results are shown in Table 2. misclassified points are also marked in Fig. 5f. It confirms the validity of
The correlation coefficients R values fitted for Groups 1, 2, 3, and 4 the approach of using prior formula as traction for ML modeling. The
are 0.742, 0.891, 0.890, and 0.863, respectively, the fitting results are physical prior formula derived from Wagner’s theory provides key in
acceptable and sufficiently accurate, this result further confirms that the formation, which reduces the difficulty of modeling the Kp. However, it
relationship between U and lnKp is sub-regional linear. More critically, still needs to be clarified that lnKp and U may exhibit a linear relation
another determining parameter - c3, is revealed. This parameter not only ship in a new region with further expansion of the data volume, thus
be a simple fitting parameter but also could correspond to some physical requiring continuous expansion of the data to validate and increase the
mechanism and lead to its contribution to the compositional sensitivity. generality of this empirical formula-based model.
Thus, two questions are worth exploring: Besides, by combining the above four fitted equations with the
1. Construction of a model for the relationship between alloy classification LGB model, we can obtain a simple empirical equation for
composition and c3. From a modeling perspective, it can assist in pre predicting the Kp of continuous alumina-forming superalloys. Consid
dicting Kp from simple linear models listed in Table 2 for the continuous ering that the coefficients c2 of the four subgroups in Table 2 can be
alumina-forming superalloys. In this case, the Kp prediction task is approximated as 0.05, while c3 can be approximated as − 1, − 4, − 7, and
converted from a regression prediction to a classification prediction. − 10, respectively, if we take the results of the classification ML model as
2. Potential physical mechanisms corresponding to c3, and physical an indicator function, as follows:
properties determine the sub-regional properties need to be revealed. ⎧
0 if ML(Inputs) ∈ 0
In the next paragraph, we focus on the exploration of Question 1. ML ⎪
⎪
⎨
1 if ML(Inputs) ∈ 1
models with detailed feature engineering are utilized to construct sub- C = ML(Inputs) =
⎪ 2 if ML(Inputs) ∈ 2
regional properties prediction models. Meanwhile, the key features ⎪
⎩
3 if ML(Inputs) ∈ 3
captured in feature engineering could be beneficial in solving Question
2. Then the c3 terms of the four subgroups can be expressed as the
We choose the baseline model by comparing the model with the following Eq. (10):
compositions, activity, CBFVs and prior knowledge features, respec
c3 = − 3C − 1 (10)
tively. Then perform the feature selection combining the backward
feature selection with the traversal method, and the CBCV approach is In summary, the empirical formula is determined as Eq. (11):
still utilized to evaluate the classification results of the model. The
lnKp = 0.05U − 3C − 1 (11)
detailed results of this part are shown in Supplemental 2.3. The tree-
based LGB model is identified as the final classification model, the pa It is necessary to emphasize that the approximate rounding operation
rameters for the model are shown in Table S.5. The features subset with is acceptable, and R-value achieved for prediction Kp on CBCV using the
optimal classification performance is determined to consist of e/a_avg, empirical equation compiled above (Eq. (11)) is 0.842, which is
′
Valγeff , CE_avg, ir_var, D_Ni, D_Ni3Al, VE_avg, PE_var. The LGB model consistent with the results in Fig. 5e. Because of the sub-regional char
acteristics reflected by the indicator function C, we call Eq. (11) as a sub-
achieves a classification accuracy of 0.817 on CBCV. Fig. 5d visualizes
regional conquer linear model. In the next section, to explore Question
the confusion matrix of the classification results, only 11 data points are
2, a more detailed discussion is carried out around the possible corre
misclassified and both are predicted to be the neighbors of the true
sponding physical mechanisms of sub-regional properties and the key
target group. The accuracy of the Kp prediction model based on strategy
features in determining the sub-regional properties.
2 is upgraded, with the R value being improved from 0.832 (strategy 1)
to 0.843 as shown in Fig. 5e. Compared with previous results, the data
3.2.3. Physical insights in oxidation dynamics implied by model
points with high prediction accuracy concentrated on the diagonal line
interpretable information
are increased, while the data points with high errors in the region of
In the previous section, the existence of another key parameter c3
previous outliers are reduced in the pair plots. Note that the
that determines Kp has been found through data mining. Feature
8
F. Yang et al. Acta Materialia 266 (2024) 119703
Fig. 5. Kp prediction model construction process based on the strategy of the traction of physical priori formula. a Fitting result based on the previously
proposed empirical formulation; the fitted equation is marked in the figure. b-c Analysis results of the Kp-U relationship, the four segmented regions according to the
data distribution characteristics are marked in Fig. b, and the fitted R values of four sub-regional linear function are marked in Fig. c. d Generalization performance
of the classification ML model on CBCV given in the form of the confusion matrix. e-f Prediction results of the novel empirical formulation on CBCV. The predicted
subgroups of each data point are marked in Fig. f. The true subgroup of the wrong predicted data points is marked using “£”.
9
F. Yang et al. Acta Materialia 266 (2024) 119703
the feature’s contributions to the model for each of the four subgroups Group 1 e/a_avg Average concentration of free Positive
and the joint distribution plots of the top two ranking features in terms electrons
effective valence electron Positive
of SHAP feature importance. The distribution of subgroup data on the Valγeff
′
number based on γ’
top two ranking features is relatively concentrated and can be discrim Group 2 CE_avg Average Cohesive Energy Positive
inated to some extent, so the top two features can be considered as D_Ni3Al Effective diffusion coefficient Mixed effects
primary controlling features. The top two key features and the contri of Ni3Al
butions of these features to the SHAP values for Groups 1 to 4 are Group 3 PE_var Variance of electronegativity Positive, but
among Ni-X (alloying slightly mixed
described in Table 3. elements) effects
Among them, PE_var is worth noting which is most important in the – Negative
Valγeff
′
Fig. 6. The SHAP values of features obtained from the LGB model used for classification and the joint distribution of the top two features. a-d The SHAP
values of features and top two features (ranking in SHAP importance) distribution for the predictions of Groups 1–4. In violin plots, the color bars indicate the
variation of feature values from small to large, and the larger SHAP values mean the larger contribution to the prediction, so it can be used to show the effects of
features on model predictions, whether positively or negatively.
10
F. Yang et al. Acta Materialia 266 (2024) 119703
alloying elements, which are also related to the formations or stabilities of the predictions of these two models as the predicted Kp. The R value of
of defects. In Ni-based SX superalloys, these defects such as interfaces this ensemble model on CBCV is greatly improved to 0.901, as shown in
and dislocations tend to act as fast diffusion pathways, thereby affecting Fig. 7. The predicted outlier problem is greatly alleviated especially by
the diffusion rate of ions during oxidation. However, the e/a_avg and comparing the predicted outlier regions in the previous two models. The
CE_avg are calculated for the whole alloy with a weighting average ensemble learning enables the generalization performance of the Kp
method in this work, and it is still unfeasible to traverse the bonding prediction model to be greatly improved. This ensemble Kp model is
behavior and cohesive energies of γ-Ni and γ’-Ni3Al phases with used as the final Kp prediction model and applied in subsequent
different defective configurations and compositions for superalloys. near-surface microstructural degradation prediction tasks.
Therefore, the impacts of these two properties could be clarified further
by studying specific chemical configurations using theoretical calcula 3.3. Modeling oxidation-induced degradation of near-surface
tions (first-principles calculation and/or Monte Carlo simulation) in microstructure
future works. These results further support the physical meaning infer
ence on the sub-regional properties and also imply that different sub- 3.3.1. Construction of near-surface degradation γ’ feature ML prediction
region properties may correspond to different ways of ionic diffusion. model based on ML
The revelation of key features also provides favorable insights into In this section, we concentrate on the prediction of near-surface
alloy design. To reduce the damage of oxidation to the γ’ phase, the microstructural degradation features. It is an inevitable trend for aero
design of alloys with smaller Kp could delay the dissipation of γ’ engines to serve at higher temperatures, and the strong degradation of γ’
strengthening element - Al. On the other hand, higher U could induce the caused by high temperatures affects the service performance of the Ni-
formation of continuous thicker Al2O3 to enhance the environmental based SX superalloy. Here we aim to construct a prediction model for
resistance of the alloy. With a sufficiently large U value, the selection of the degradation γ’ features of near-surface applicable to high tempera
alloys belonging to Group 3 or 4 may satisfy both the requirements for tures by orienting the study temperature at 1100 ◦ C. Modeling alloy and
sufficient environmental resistance and low damage to the near-surface microstructural degradation features using directly designed objects -
′
microstructure, and the features including PE_var, CE_avg, and Valγeff can nominal compositions may be more desirable in alloy design work,
unfortunately, the datasets containing a sufficient amount of nominal
be used as specific designed target parameters.
compositions and microstructural degradation features are missing, and
cannot support this goal. Even if these data could be added by ongoing
3.2.4. The ensemble of feature engineering-based ML models and a sub-
experiments, the process would undoubtedly be extremely cost-
regional conquer linear model
intensive. Modeling the physical process of oxidation-induced near-
Finally, we integrate the RF prediction model obtained in Section3.1
surface microstructural degradation is a feasible scheme, as shown in
with this empirical formulation based on ensemble learning [69]. From
Fig. 1. In Sections 3.1 and 3.2, the relationship between composition and
the perspective of ML, for the construction of the model of the rela
Kp has been constructed for the continuous alumina-forming Ni-based
tionship between composition and Kp, it is obvious that the RF model
SX superalloys. Kp can be approximated as the growth rate of the
and the sub-regional conquer linear model inspired by a physical for
continuous Al2O3 when it is a rate control step, which means that we can
mula have different hypothesis spaces, which can be reflected in the
obtain the diffusion flux of Al during the degradation of the near-surface
difference in the distribution of pair plots in Figs. 4b and 5e, however,
γ’. Naturally, the problem is focused on the connection between the
the two models achieve similar generalization performance. Therefore,
near-surface composition and microstructural features. Sufficient near-
with the ensemble of these two models, the corresponding hypothesis
surface compositional and microstructural characterization data can
space could be expanded, and the gap to the true composition - Kp
be obtained using modern microscopic characterization tools to support
relationship could be further narrowed. Specifically, we use the average
the construction of a near-surface feature prediction ML model. This
solution trades off the simplicity of input but solves the barrier of data
volume.
Four continuous alumina-forming alloys are utilized to construct the
near-surface microstructural degradation datasets (See Section2.2).
DD32 alloy is shown to be a continuous alumina-forming alloy [47], For
ICSX1, ICSX2, and DD91, we confirm that these alloys are also contin
uous alumina-forming alloys and the composite of the oxide layers and
the oxide composition analysis using EDS are illustrated in the Sup
plemental 2.5. In the following work, to validate the accuracy of the
CKM prediction process, ICSX2 is applied as the model alloy for vali
dation. The data from other alloys are used to train the ML models. It is
worth mentioning that the near-surface composition distributions of the
unknown newly designed alloys can be obtained by thermodynamic
calculations and utilized as inputs to the ML model of near-surface
feature prediction, enabling us to complete the entire CKM prediction
process without relying on experimental measurements. Additionally,
ICSX1, ICSX2, and DD91 can also be used to further test the general
ization performance of the Kp prediction model in Section3.2.4 for un
known alloys, this part is demonstrated in Supplemental 2.6. The
results further support the good generalization performance of the Kp
prediction model.
The ML models including ENet, KRR, SVR, RF, LGB, and MLP are
employed to construct the prediction models for microstructural
degradation features in the near-surface. The generalization perfor
Fig. 7. Prediction results of the ensemble model integrating the Kp prediction mance of the six ML models on the three datasets is shown in Fig. 8 a1,
model based on feature engineering (Section3.1.1) with the novel empirical b1, c1. RF, SVR, and MLP show the optimal performance in the prediction
formula (Section3.2.4) on CBCV. of γ’ fraction, γ’ size and γ’ shape factor. The MAE error of the RF model
11
F. Yang et al. Acta Materialia 266 (2024) 119703
Fig. 8. ML model for predicting near-surface γ’ degradation features, including model selection and the prediction results of optimal and suboptimal
models. a1-a3 Model for predicting γ’ fraction. b1-b3 Model for predicting γ’ shape factors. c1-c3 Model for predicting γ’ size.
for predicting the γ’ fraction is 0.101 and the MAE of the SVR model for importance, the γ’-γ phase transition due to the local Ni-Al composition
predicting the γ’ shape factor is 0.095, which is accurate enough for the change caused by Al dissipation in γ’-Ni3Al is one of the main causes of
prediction targets distributed between 0 and 1. The MAE and R2 of the microstructural degradation, this is indirectly reflected in the features
MLP model for predicting γ’ size are 0.431 and 0.856 respectively, this importance. Regarding the influence of alloying elements on the γ’
model has also achieved sufficient accuracy. In addition, we find that the shape factor, it can be found that Re and Mo have high importance in
suboptimal model exhibits similar generalization performance as the addition to Ni and Al. The strong influence of Re and Mo on the γ/γ’
optimal model. Considering that the prediction of a single ML model lattice mismatch indirectly determines the interfacial energy of the γ/γ’,
could be biased, so we ensemble the optimal and suboptimal ML models which in turn changes the shape of the γ/γ’ interface. And considering
as the final near-surface γ’ degradation prediction model to further the effect of γ’ directional coarsening on γ’ shape, the decrease of alloy
reduce the possibility of introducing the prediction bias. The prediction γ/γ’ lattice mismatch by Re and Mo could promote the γ’ directional
results of the optimal and suboptimal models are all shown in Fig. 8. coarsening [70–72]. Interestingly, a small concentration of the
Besides, we extract the internal feature importance of alloying ele γ’-forming element Nb also has a significant effect on the prediction of
ments from the constructed RF models for the prediction of γ’ fraction, the γ’ shape factor, but the specific effect mechanism is unclear and
shape factor and size to investigate the effect of alloying elements on needs further investigation.
near-surface microstructural features. The results are shown in Fig. 9.
For these three microstructural features, Ni, Al shows high feature
12
F. Yang et al. Acta Materialia 266 (2024) 119703
also captured. The dissolution and coarsening of the γ’ phase are typical important role in the prediction of Kp. In strategy 2, an enhanced Kp
features of the evolution of the microstructure of both under high- prediction model - a sub-regional conquer linear model is proposed for
temperature conditions. As shown in Fig. 11, we provide the sensi continuous alumina-forming SX superalloys. We find that the alumina
tivity of the predicted γ’ fraction and γ’ size evolution with time. In the formation energy U is sub-regional linear with lnKp . The bias terms - c3
region near the interior of the alloy away from the γ’ degradation layer,
that determine the sub-regional linear properties are quantified and a
the γ’ fraction predicted by the model gradually decreases with time as
classification model is constructed to predict the sub-regional linear
13
F. Yang et al. Acta Materialia 266 (2024) 119703
Fig. 10. The near-surface distribution of γ’ fraction, shape factor and size in 3 h, 6 h, 10 h, 20 h, 50 h and 100 h of ICSX2 alloy predicted by CKM prediction
flow. a-c the predicted results of γ’ fraction, shape factor and size, respectively. Note that the distribution of the calculated γ’ features using the experimentally
measured alloy composition distribution is marked with orange dots, and the scaled value of the Al content calculated from the Thermo-Calc DICTRA module is
marked as the green dotted line. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
14
F. Yang et al. Acta Materialia 266 (2024) 119703
Fig. 11. The sensitivity of γ’ fraction and γ’ size of ICSX2 on the time scale calculated by CKM prediction process. a The sensitivity of γ’ fraction. b The
sensitivity of γ’ size.
properties from alloy compositions. Further, combining the analysis of supported by Science Center for Gas Turbine Project (NO. P2021-A-IV-
the optimal feature subset obtained from feature engineering and 001–003) and Science Center for Gas Turbine Project (NO. P2022-B-IV-
interpretable information, the influence of the intrinsic defects and 005–001). Most of the calculations in this study were done on Tianhe-II
diffusion properties of the alloy on oxidation is considered to be a po high performance computer system in the National Supercomputer
tential physical mechanism corresponding to c3. Specific key features Center in Guangzhou, China. This research was also supported by the
and their roles influencing each sub-regional property are revealed. high-performance computing (IIPC) resources at Beihang University and
Besides, PE_var and CE_avg are also alloy design-friendly features that Chengdu Supercomputing Center. Wenyue Zhao designed the research,
could be utilized to design alloys with both environmental resistance and reviewed and edited the manuscripts. Fan Yang designed and
and weak near-surface microstructural damage rates. With the ensemble performed the research, constructed the ML model, experimental veri
of the two strategies above, the final Kp prediction model has achieved fication, analyzed data, and wrote initial manuscripts. Yi Ru supervision
an outstanding generalization performance and its R-value is signifi and funding acquisition. Yanling Pei supervision and funding acquisi
cantly increased to 0.901 on CBCV. tion. Shusuo Li supervision and funding acquisition. Shengkai Gong
The relationship model between the near-surface elemental distri supervision and funding acquisition. Huibin Xu supervision and fund
bution and the γ’ features including volume fraction, shape factor, and ing acquisition. All authors reviewed the paper.
size has been established based on ML. Dynamic simulation-DICTRA is
used to calculate the near-surface elemental distribution based on the Supplementary materials
predicted Kp and the entire CKM prediction process is completed finally.
The proposed CKM prediction flow is proved to be reliable, and the near- Supplementary material associated with this article can be found, in
surface γ’ features of the validation alloy are accurately predicted on the online version, at doi:10.1016/j.actamat.2024.119703.
both temporal and spatial scales. Moreover, the typical microstructural
evolution of Ni-based superalloys at high temperatures including γ’
References
resolution and coarsening are well captured. With the further expansion
of the near-surface microstructural evolution dataset in the future, the [1] Reed, C.Roger, The superalloys (fundamentals and applications) || The physical
accuracy of CKM prediction could be continuously improved, and its metallurgy of nickel and its alloys, 10.1017/CBO9780511541285(2) (2006)
assistance in the prediction of key mechanical properties is worth 33–120.
[2] A.P. Gordon, M.D. Trexler, R.W. Neu, T.J. Sanders, D.L. McDowell, Corrosion
expecting to guide the design of Ni-based SX superalloys for thin-walled kinetics of a directionally solidified Ni-base superalloy, Acta Mater. 55 (10) (2007)
blades. 3375–3385.
[3] H. Long, S. Mao, Y. Liu, Z. Zhang, X. Han, Microstructural and compositional
design of Ni-based single crystalline superalloys – A review, J. Alloys Compd. 743
5. Data and code availability (2018) 203–220.
[4] N. Birks, G.H. Meier, F.S. Pettit, Forming continuous alumina scales to protect
The data supporting the findings of this work is available in the main superalloys, JOM 46 (12) (1994) 42–46.
[5] J. Sun, C. Chen, J. Liu, J. Li, Combined effect of Ru on the oxidation behavior and
text and supplementary material. The raw data and code are available microstructure stability of a single crystal superalloy, J. Mater. Res. Technol.
on https://github.com/fanYang-X/Modeling-kp-for-oxidation-of- (2023).
superalloy, except for data from part of the ongoing study. [6] J.J. Wu, J. Meng, M.K. Zou, L.N. Yao, X.G. Wang, Y.H. Yang, Y.Z. Zhou, X.F. Sun,
Effect of wall thickness on micropores and stress-rupture properties of a single-
crystal nickel-based superalloy, Mater. Sci. Eng. A 872 (2023).
[7] L. Mataveli Suave, A.S. Muñoz, A. Gaubert, G. Benoit, L. Marcin, P. Kontis,
Declaration of competing interest P. Villechaise, J. Cormier, Thin-wall debit in creep of DS200 + Hf alloy, Metall.
Mater. Trans. A 49 (9) (2018) 4012–4028.
The authors declare that they have no known competing financial [8] Z. Wen, H. Pei, D. Li, Z. Yue, J. Gao, Thickness influence on the creep response of
DD6 Ni-based single-crystal superalloy, High Temperat. Mater. Process. 35 (9)
interests or personal relationships that could have appeared to influence (2016) 871–880.
the work reported in this paper. [9] M. Brunner, M. Bensch, R. Völkl, E. Affeldt, U. Glatzel, Thickness influence on
creep properties for Ni-based superalloy M247LC SX, Mater. Sci. Eng. A 550 (2012)
254–262.
Acknowledgment [10] S. Dryepondt, D. Monceau, F. Crabos, E. Andrieu, Static and dynamic aspects of
coupling between creep behavior and oxidation on MC2 single crystal superalloy at
The financial support provided by National Natural Science Foun 1150◦ C, Acta Mater. 53 (15) (2005) 4199–4209.
[11] H. Pei, Y. Yang, S. Gu, Y. Zhao, X. Yao, Z. Wen, Z. Yue, Study on oxidation-creep
dation of China (NSFC) under grant No. 92360302 for carrying out the behavior of a Ni-based single crystal superalloy based on crystal plasticity theory,
present research work is gratefully acknowledged. This work was Mater. Sci. Eng. A 839 (2022).
15
F. Yang et al. Acta Materialia 266 (2024) 119703
[12] M. Magdy, Y.B. Hu, J. Zhao, A study of the morphological effect of an α-Al2O3 [42] Z. Qin, Z. Wang, Y. Wang, L. Zhang, W. Li, J. Liu, Z. Wang, Z. Li, J. Pan, L. Zhao,
layer on the creep life for nickel-based superalloys using microstructure-based F. Liu, L. Tan, J. Wang, H. Han, L. Jiang, Y. Liu, Phase prediction of Ni-base
geometrical models, Vacuum.Vacuum. 202 (2022). superalloys via high-throughput experiments and machine learning, Mater. Res.
[13] J. Dewit, High temperature oxidation of metals, (1981). Lett. 9 (1) (2020) 32–40.
[14] T. Gheno, G. Lindwall, On the simulation of composition profiles in nicocral alloys [43] S.K. Srivastava, M.J. Newburn, J.P. Cotner, M.A. Richeson, Long-Term Oxidation
during Al2O3 scale growth in oxidation and oxidation–dissolution regimes, Oxidat. Behavior of Selected High Temperature Alloys, Asme Turbo Expo, Power for Land,
Metals 91 (3–4) (2018) 243–257. Sea, 1997. Air.
[15] X. Sun, L. Zhang, Y. Pan, X. Wang, Z. Huang, L. Jiang, Microstructural evolution [44] B.R. Barnard, P.K. Liaw, R.A. Buchanan, D.L. Klarstrom, Affects of applied stresses
during cyclic oxidation of a Ni-based singe crystal superalloy at 1100 ◦ C, Corros. on the isothermal and cyclic high-temperature oxidation behavior of superalloys,
Sci. 162 (2020). Mater. Sci. Eng. A 527 (16–17) (2010) 3813–3821.
[16] A. Sato, Y.L. Chiu, R.C. Reed, Oxidation of nickel-based single-crystal superalloys [45] C.Q. Cheng, Y.B. Hu, T.S. Cao, L. Zhang, Y.W. Zhu, J. Zhao, Two typical oxidation
for industrial gas turbine applications, Acta Mater. 59 (1) (2011) 225–240. models on nickel-based superalloys under different initial surface roughness,
[17] M. Bensch, A. Sato, N. Warnken, E. Affeldt, R.C. Reed, U. Glatzel, Modelling of high Corros. Sci. 176 (2020).
temperature oxidation of alumina-forming single-crystal nickel-base superalloys, [46] Y. Hu, C. Cheng, L. Zhang, T. Cao, G. Guo, X. Meng, J. Zhao, Microstructural
Acta Mater. 60 (15) (2012) 5468–5480. evolution of oxidation film on a single crystal nickel-based superalloy at 980◦ C,
[18] M. Bensch, J. Preußner, R. Hüttner, G. Obigodi, S. Virtanen, J. Gabel, U. Glatzel, Oxidat. Metals 89 (3–4) (2017) 303–317.
Modelling and analysis of the oxidation influence on creep behaviour of thin- [47] C.T. Liu, X.F. Sun, H.R. Guan, Z.Q. Hu, Oxidation of the single-crystal Ni-base
walled structures of the single-crystal nickel-base superalloy René N5 at 980◦ C, superalloy DD32 containing rhenium in air at 900 and 1000◦ C, Surf. Coat. Technol.
Acta Mater. 58 (5) (2010) 1607–1617. 197 (1) (2005) 39–44.
[19] V.K. Vanag, I.R. Epstein, Cross-diffusion and pattern formation in reaction- [48] C.T. Liu, J. Ma, X.F. Sun, Oxidation behavior of a single-crystal Ni-base superalloy
diffusion systems, Phys. Chem. Chem. Phys. 11 (6) (2009) 897–912. between 900 and 1000◦ C in air, J. Alloys Compd. 491 (1–2) (2010) 522–526.
[20] T. Jozaghi, C. Wang, R. Arroyave, I. Karaman, Design of alumina-forming [49] J. Yu, X. Sun, N. Zhao, T. Jin, H. Guan, Z. Hu, Effect of heat treatment on
austenitic stainless steel using genetic algorithms, Mater. Des. 186 (2020). microstructure and stress rupture life of DD32 single crystal Ni-base superalloy,
[21] Y. Niu, S. Wang, F. Gao, Z.G. Zhang, F. Gesmundo, The nature of the third-element Mater. Sci. Eng. A 460-461 (2007) 420–427.
effect in the oxidation of Fe–xCr–3at.% Al alloys in 1atm O2 at 1000◦ C, Corros. Sci. [50] L.Y. TAN Zihao, W.A.N.G. Xinguang, In-phase thermal-mechanical fatigue
50 (2) (2008) 345–356. behavior and damage mechanism of a fourth-generation single crystal superalloy,
[22] L. Wu, T. Osada, T. Yokokawa, Y. Chang, K. Kawagishi, The temperature Acta Metall. Sinica.
dependence of strengthening mechanisms in Ni-based superalloys: a newly re- [51] C.A. Schneider, W.S. Rasband, K.W. Eliceiri, NIH Image to ImageJ: 25 years of
defined cuboidal model and its implications for strength design, J. Alloys. Compd. image analysis, Nat. Methods 9 (7) (2012) 671–675.
931 (2023). [52] F. Lu, S. Antonov, S. Lu, J. Zhang, L. Li, D. Wang, J. Zhang, Q. Feng, Unveiling the
[23] Y. Ru, B. Hu, W. Zhao, H. Zhang, Y. Pei, S. Li, S. Gong, H. Xu, Topologically inverse Re effect on long-term coarsening behaviors of γ′ precipitates in Ni-based single
microstructure in single-crystal superalloys: microstructural stability and crystal superalloys, Acta Mater. 233 (2022).
properties at ultrahigh temperature, Mater. Res. Lett. 9 (12) (2021) 497–506. [53] R.J. Murdock, S.K. Kauwe, A.Y.T. Wang, T.D. Sparks, Is Domain Knowledge
[24] J. Behler, Perspective: machine learning potentials for atomistic simulations, Necessary for Machine Learning Materials Properties? Integr. Mater. Manuf. Innov.
J. Chem. Phys. 145 (17) (2016) 170901. 9 (3) (2020) 221–227.
[25] C.H. Chan, M. Sun, B. Huang, Application of machine learning for advanced [54] Y. Liu, J. Wu, Z. Wang, X.G. Lu, M. Avdeev, S. Shi, C. Wang, T. Yu, Predicting creep
material prediction and design, EcoMat. 4 (4) (2022). rupture life of Ni-based single crystal superalloys using divide-and-conquer
[26] J.E. Saal, A.O. Oliynyk, B. Meredig, Machine learning in materials discovery: approach based machine learning, Acta Mater. 195 (2020) 454–467.
confirmed predictions and their underlying approaches, Annu Rev. Mater. Res. 50 [55] P.L. Taylor, G.J.C.M.S. Conduit, Machine learning superalloy microchemistry and
(1) (2020) 49–69. creep strength from physical descriptors, 227 (2023) 112265.
[27] R.E.A. Goodall, A.A. Lee, Predicting materials properties without crystal structure: [56] C. Wen, C. Wang, Y. Zhang, S. Antonov, D. Xue, T. Lookman, Y. Su, Modeling solid
deep representation learning from stoichiometry, Nat. Commun. 11 (1) (2020) solution strengthening in high entropy alloys using machine learning, Acta Mater.
6280. 212 (2021).
[28] T. Xie, J.C. Grossman, Crystal graph convolutional neural networks for an accurate [57] Q. Ding, Z. Shen, S. Xiang, H. Tian, J. Li, Z. Zhang, In-situ environmental TEM
and interpretable prediction of material properties, Phys. Rev. Lett. 120 (14) study of γ′-γ phase transformation induced by oxidation in a nickel-based single
(2018) 145301. crystal superalloy, J. Alloys Compd. 651 (2015) 255–258.
[29] C. Wen, Y. Zhang, C. Wang, D. Xue, Y. Bai, S. Antonov, L. Dai, T. Lookman, Y. Su, [58] D. Kubacka, M. Weiser, E. Spiecker, Early stages of high-temperature oxidation of
Machine learning assisted design of high entropy alloys with desired property, Acta Ni- and Co-base model superalloys: a comparative study using rapid thermal
Mater. 170 (2019) 109–117. annealing and advanced electron microscopy, Corros. Sci. 191 (2021).
[30] Z. Rao, P.Y. Tung, R. Xie, Y. Wei, H. Zhang, A. Ferrari, T. Klaver, F. Krmann, P.T. [59] R.J. Urbanowicz, M. Meeker, W.La Cava, R.S. Olson, J.H. Moore, Relief-based
Sukumar, A. Silva, Machine learning-enabled high-entropy alloy discovery, (2022). feature selection: introduction and review, J. Biomed. Inform. 85 (2018) 189–203.
[31] X. Hu, J. Zhao, Y. Chen, J. Li, Z. Wang, J. Wang, Continually reactivating iterative- [60] F. Sohil, M.U. Sohali, J. Shabbir, An introduction to statistical learning with
projection method for instantiating microstructure from two-point statistics, Acta applications in R, Stat. Theory. Relat. Fields. 6 (1) (2021), 87-87.
Mater. 238 (2022). [61] Z. Xiong, Y. Cui, Z. Liu, Y. Zhao, M. Hu, J. Hu, Evaluating explorative prediction
[32] Z. Pei, K.A. Rozman, O.N. Dogan, Y. Wen, N. Gao, E.A. Holm, J.A. Hawk, D. power of machine learning algorithms for materials discovery usingk-fold forward
E. Alman, M.C. Gao, Machine-learning microstructure for inverse material design, cross-validation, Comput. Mater. Sci. 171 (2020).
Adv. Sci. (Weinh) 8 (23) (2021) e2101207. [62] S. Lundberg, S.I. Lee, A unified approach to interpreting model predictions, (2017).
[33] Y. Zhao, M. Al-Fahdi, M. Hu, E.M.D. Siriwardane, Y. Song, A. Nasiri, J. Hu, High- [63] K. Hauffe, Oxydation Yon Metallen und Metallegierungen, Sprlnger, Berlin, 1956.
throughput discovery of novel cubic crystal materials using deep generative neural [64] Wagner, C.J.J.o.t.E. Society, theoretical analysis of the diffusion processes
networks, Adv. Sci. (Weinh) 8 (20) (2021) e2100566. determining the oxidation rate of alloys, 99(10) (1952) 369.
[34] S.K. Bhattacharya, R. Sahara, T. Narushima, Predicting the parabolic rate constants [65] M.T. Lapington, D.J. Crudden, R.C. Reed, M.P. Moody, P.A.J. Bagot,
of high-temperature oxidation of Ti alloys using machine learning, Oxidat. Metals Characterization of oxidation mechanisms in a family of polycrystalline chromia-
94 (3–4) (2020) 205–218. forming nickel-base superalloys, Acta Mater. 206 (2021).
[35] Q. Wei, B. Cao, L. Deng, A. Sun, Z. Dong, T.Y. Zhang, Discovering a formula for the [66] J.P. Parras, R.A. De Souza, Grain-boundary diffusion of cations in fluorite-type
high temperature oxidation behavior of FeCrAlCoNi based high entropy alloys by oxides is faster but not always easier, Acta Mater. 195 (2020) 383–391.
domain knowledge-guided machine learning, J. Mater. Sci. Technol. 149 (2023) [67] F. Xue, E.A. Marquis, Role of diffusion-induced grain boundary migration in the
237–246. oxidation response of a Ni-30 Cr alloy, Acta Mater. 240 (2022).
[36] H.S. Kim, S.J. Park, S.M. Seo, Y.S. Yoo, H.W. Jeong, H. Jang, Regression analysis of [68] Z. Zhu, H. Basoalto, N. Warnken, R.C. Reed, A model for the creep deformation
high-temperature oxidation of Ni-based superalloys using artificial neural network, behaviour of nickel-based single crystal superalloys, Acta Mater. 60 (12) (2012)
Corros. Sci. 180 (2021). 4888–4900.
[37] D.W. Yun, S.M. Seo, H.W. Jeong, Y.S. Yoo, Effect of refractory elements and Al on [69] T.G.J.p.i.w.o.m.c.s. Dietterich, Ensemble methods in machine learning, (2000).
the high temperature oxidation of Ni-base superalloys and modelling of their [70] Y. Cheng, X. Zhao, W. Xia, Q. Yue, Y. Gu, X. Wei, H. Bei, Y. Dang, Z. Zhang, Effect
oxidation resistance, J. Alloys. Compd. 710 (2017) 8–19. of Mo on microstructural stability of a 4th generation Ni-based single crystal
[38] Y. Wang, M. Lu, Z. Wang, J. Liu, L. Xu, Z. Qin, Z. Wang, B. Wang, F. Liu, J. Wang, superalloy, J. Mater. Res. Technol. 21 (2022) 2672–2681.
The learning of the precipitates morphological parameters from the composition of [71] J.X. Zhang, J.C. Wang, H. Harada, Y. Koizumi, The effect of lattice misfit on the
nickel-based superalloys, Mater. Des. 206 (2021). dislocation motion in superalloys during high-temperature low-stress creep, Acta
[39] Z. Wang, L. Zhang, W. Li, Z. Qin, Z. Wang, Z. Li, L. Tan, L. Zhu, F. Liu, H. Han, Mater. 53 (17) (2005) 4623–4633.
L. Jiang, High throughput experiment assisted discovery of new Ni-base [72] T. Sugui, W. Minggang, Y. Huichen, Y. Xingfu, L. Tang, Q. Benjiang, Influence of
superalloys, Scr. Mater. 178 (2020) 134–138. element Re on lattice misfits and stress rupture properties of single crystal nickel-
[40] F. Liu, Z. Wang, Z. Wang, J. Zhong, L. Zhao, L. Jiang, R. Zhou, Y. Liu, L. Huang, based superalloys, Mater. Sci. Eng A 527 (16–17) (2010) 4458–4465.
L. Tan, Y. Tian, H. Zheng, Q. Fang, L. Zhang, L. Zhang, H. Wu, L. Bai, K. Zhou, [73] B. Gleeson, High-temperature corrosion of metallic alloys and coatings, Mater. Sci.
High-throughput method–accelerated design of Ni-based superalloys, Adv. Funct. Technol..
Mater. 32 (28) (2022). [74] Y. Ru, C. Ai, S.S. Li, S.K. Gong, Y.L. Pei, Two-phase microstructural evolution at
[41] J. Yu, S. Guo, Y. Chen, J. Han, Y. Lu, Q. Jiang, C. Wang, X. Liu, A two-stage high temperatures for γ′-richen single crystal superalloys, Mater. Research Innovat.
predicting model for γ′ solvus temperature of L12-strengthened Co-base superalloys 19 (sup4) (2015) S214–S219.
based on machine learning, Intermetallics. 110 (2019) (Barking).
16
F. Yang et al. Acta Materialia 266 (2024) 119703
[75] T. Murakumo, T. Kobayashi, Y. Koizumi, H. Harada, Creep behaviour of Ni-base [78] M. Mirza, S.J.C.S. Osindero, Condit. Generat. Advers. Nets (2014) 2672–2680.
single-crystal superalloys with various γ′ volume fraction, Acta Mater. 52 (12) [79] G. Lambard, K. Yamazaki, M. Demura, Generation of highly realistic
(2004) 3737–3744. microstructural images of alloys from limited data with a style-based generative
[76] J.S. Van Sluytman, T.M. Pollock, Optimal precipitate shapes in nickel-base γ–γ′ adversarial network, Sci. Rep. 13 (1) (2023) 566.
alloys, Acta Mater. 60 (4) (2012) 1771–1783. [80] J. Tang, X. Geng, D. Li, Y. Shi, J. Tong, H. Xiao, F. Peng, Machine learning-based
[77] I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, microstructure prediction during laser sintering of alumina, Sci. Rep. 11 (1) (2021)
A. Courville, Y. Bengio, Generative adversarial networks, 11%, J. Commun. ACM 10724.
63 (2020) 139–144.
17