Professional Documents
Culture Documents
Wei Guo and Tianhong Pan : A Review On Data-Driven Approaches For Industrial Process Modelling
Wei Guo and Tianhong Pan : A Review On Data-Driven Approaches For Industrial Process Modelling
2, 2020 75
Zhengming Li
School of Electrical Information and Engineering,
Jiangsu University,
Zhenjiang, Jiangsu 212013, China
Email: lzming@ujs.edu.cn
Guoquan Li
Jiangsu Hengshun Vinegar Industry Co., Ltd.,
Zhenjiang, Jiangsu 212043, China
Email: 13775556788@139.com
Abstract: Data-driven techniques in industrial processes have been continually attended during
the past decades. However, there are many challenging issues in this field when the collected
data presents different characteristics. In order to sketch the principle of different modelling
methods under various working conditions, data-driven modelling methods from perspectives
of data structures and model structures are reviewed in this paper. Firstly, the data collection
and preprocessing procedure are inspected. Then, popular methods from linear (including the
multivariate linear regression (MLR), to latent variable projection (LVP), etc.) to nonlinear
methods (including artificial intelligence, Gaussian process regression (GPR), local model, etc.)
are discussed. Finally, the model calibration strategies (including offset-based method, recursive
method, moving window method) are also reviewed. The major purpose is to support the industrial
process modelling for technical users by providing a set of data-driven methods.
Keywords: data-drive modelling; industrial process; machine learning; data analytics; model
structure.
Reference to this paper should be made as follows: Guo, W., Pan, T., Li, Z. and Li, G. (2020)
‘A review on data-driven approaches for industrial process modelling’, Int. J. Modelling,
Identification and Control, Vol. 34, No. 2, pp.75–89.
Biographical notes: Wei Guo received his BSc in Electronic Information Engineering from
Jingjiang College of Jiangsu University, Zhenjiang, China; and the MS in School of Electrical
Information and Engineering of Jiangsu University, Zhenjiang, China, in 2012, and 2016,
respectively. He is currently pursuing the PhD in Control Science and Engineering in Jiangsu
University. His research interests include process modelling, clustering algorithm and artificial
intelligence.
Tianhong Pan received his BSc in Mechatronics Technology and Applications from Anhui
Agriculture University, Hefei, China; the MS in Power Electronics and Electric Power Drives from
the Gansu University of Technology, Lanzhou, China; and the PhD in Control Theory and Control
Engineering from Shanghai Jiao Tong University, Shanghai, China, in 1997, 2000, and 2007,
respectively. He has been a Professor with the School of Electrical Engineering and Automation,
Anhui University, Hefei, Anhui, China. His research interests include multiple model approach and
its application, machine learning, virtual metrology, predictive control and run-to-run control theory
and practice, etc.
Zhengming Li received his BSc in Automation in 1982 from Jiangsu University, Zhenjiang,
China and the MS in Control Theory and Engineering in 1988 from Xi’an Jiao Tong University,
Xian, China. He has been a Professor in the School of Electrical and Information Engineering,
Jiangsu University. His research interests include R2R control theory and practice, complex process
modeling and control, system identification.
Guoquan Li has been a Director in the Jiangsu Hengshun Vinegar Industry Co., Ltd.
model structures. Under each specific topic, the principle from a historical database. For the purpose of constructing
of different methods or structures is discussed, with several an effective training model, the operating region of processes
corresponding industrial applications. should be noticed. Thus, the impractical samples recorded
by any invalid sensors will not be selected. Moreover,
Figure 1 The relationship of three modelling categories the category of selected dataset is not same for different
(see online version for colours) modelling purposes. For example, in soft sensing or fault
detecting, the dataset from a stable operating condition should
Unlabeled Labeled be selected. But in fault diagnosis models, abnormal data
Data Data
collection should be carried out as well. Another important
issue is to carry out the variable selection. Typically, the
variables measured in the industrial process may have co-
linear and redundant problems. There are two ways to
handle the collinearity problem. One is to use the statistic
Semi-supervised analysis method, such as PCA (Jiang et al., 2016), PLS
Learning
(Rinnan et al., 2014), and CCA (Ogura, 2010). Another is
to use the computational learning research for the feature
Unlabeled Unlabeled Labeled selection, such as support vector regression (SVR) (Ben
Data Data Data
Ishak, 2016), genetic algorithms (GA) (Lu et al., 2016), and
RF (Hapfelmeier and Ulm, 2013).
Figure 2 Overview of methodology for industrial processes modelling (see online version for colours)
Variable Outlier
Data Collection
Selection Detection
and
Historical
Sample Missing Pre-processing
Database Selection Data
Structure
Structures Selection Semi-supervised
Learning
Training
Supervised Unsupervised
Model
Learning Learning
Mathematical Selection
Methods
there is no unified guidance for selecting a structure or a until the termination of feedbacks (Qin, 1998). It should
mathematical method. However, several useful rules such be noticed that the feedback sample needs to satisfy the
as Bayesian information criterion and Akaike information requirement of the criteria in data collection and data pre-
criterion could promote the model selection (Bishop, 2006). processing.
(Lu et al., 2016), etc. On the other hand, the ridge regression all. Different objective functions can be derived through the
and latent variable can also deal with the collinear problem inherent concept of the maximum non-Gaussianity, and such
(Piepho, 2009). Most of applications using MLR focus on optimised objective functions enable the estimation of the
the key variable prediction. For example, Wang et al. (2019b) ICA model (Hyvärinen and Oja, 2000). Originally, ICA was
applied the MLR to influent chemical oxygen demand and proposed to deal with the problem of blind source separation
total phosphorus prediction in wastewater treatment plants. (Back and Weigend, 1997). Then, ICA was introduced into
Liu et al. (2018a) created a multivariate phase space by data-driven modelling methods (Ding et al., 2017; Lee et al.,
referencing the phase state and predicted the very-short-term 2007; Liu and Zhang, 2015; Liu et al., 2019a; Zhang and Qin,
wind power. Besides the predictive purpose, the MLR can 2007).
also be applied for process monitoring (Eyvazian et al., 2011).
Figure 3 The relationship among PCA, PCR, PLS, and PLSR
(2) Latent variable projection (LVP) model (see online version for colours)
Figure 5 The principle of FA (two factors) (see online version Grueiro et al., 2019). With the volume minimum principle
for colours) of a hypersphere in feature spaces, the spherically shaped
X boundary of a dataset can be estimated for detecting the novel
Noise data or outliers. The most application related to SVDD is the
F2 model development for process monitoring (Ge et al., 2017).
fault diagnosis algorithm for the Tennessee Eastman (TE) includes model division (Khediri et al., 2012; Zhai et al.,
process, which uses the convolutional neural network (CNN) 2018), soft sensor (Yuan et al., 2014a; Zhang et al., 2017),
for feature extraction. and process monitoring (Fan et al., 2014; Fezai et al., 2016,
The MLP is an extension of traditional artificial neural 2018), etc.
network (ANN) with several hidden layers. Liukkonen et al.
(2012) built a dynamic MLP model used for the nitrogen Figure 9 The principle of three nonlinear LVP models (see online
oxide content estimation of the flue gas. Souza et al. (2013) version for colours)
utilised MLP for variable selection, which is assumed that
the mean square errors of two identified results with different
inputs are similar. X
RNN is a time-correlating neural work model, which is
reflected in the memory networks. Such mnemonic ability
in RNN is incorporated with the state of neurons, which is
stored and used for the prediction of the next state sequence
(Smith and Jin, 2014). Sun and Ge (2019) developed a soft
sensor model using Gaussian-Bernoulli restricted Boltzmann
machine and RNN on carbon dioxide absorption column. Xe T X Tp Xk T
Duchanoy et al. (2017) used RNN successfully predict the
area of car tyres contacted with the ground.
In addition, the relationship of above techniques is shown
PT PT PT
in Figure 8.
Input extension Polynomial Kernel LVP
model projection model model
Figure 8 The relationship between FEN, MLP, and RNN
(see online version for colours)
Features
(4) Gaussian process regression (GPR)
MLP
Y
FEN The GPR is inspired by the concept of Gaussian process
RNN (GP), which is used for representing a distribution over
functions from the viewpoint of the normal distribution.
Statistically, a sample in original spaces is associated with a
Previous neurons Current neurons
random variable that obeys a Gaussian distribution. The GP
can be seen as a generalised infinite-dimensions functional
X
space of multivariate normal distributions (Rasmussen and
Williams, 2006). The inference of continuous variables with
a GP prior in Bayesian framework is known as GPR.
(3) Nonlinear latent variable projection (NLVP) model By referencing the distribution over functional space and
Bayesian inference, the posterior distribution of requested
In order to extend the linear LVP model to the nonlinear estimation is provided (shown in Figure 10). Meanwhile,
applications, there are three main methods, the input such distribution estimation is a crucial difference between
extension model, the polynomial projection model, and the GPR and other methods. In recent years, GPR has
the kernel LVP model (shown in Figure 9). The input been mostly used for applications on the predictive purpose,
extension is the simplest model of the nonlinear latent such as soft sensing modelling, output or quality related
variable projection (NLVP). It extends the variable space monitoring; Zhou et al. (2015) developed an adaptive quality
with associated nonlinear combinations (such as square monitoring model with the use of predictive errors from the
values, cross-products, logarithms) of the original input. GPR in the fed-batch process; Liu et al. (2017) made a
Then the linear LVP analysis on the extended data space is multi-step prediction using GPR for removing pollutants in
performed (Baffi et al., 1999). Wold et al. (1989) presented wastewater treatment plants; DuyTrinh et al. (2019) presented
the polynomial projection algorithm, in which, inner relations a fuzzy neural network combined GPR model for predicting
between LVs were modified as a nonlinear (polynomial) the grinding wheel wear and surface roughness of alloy.
relationship. Due to the lack of prior knowledge related to Besides, Wang and Mao (2019) also extended the GPR for
underlying nonlinear relationships, these two methods are not outlier detection.
getting as much attention as the kernel LVP.
The kernel LVP (KLVP) retains the linear framework of
the traditional LVP. It firstly maps the original inputs onto a
3.1.3 Adaptive models
feature space, namely, the reproducing kernel Hilbert space (1) Offset-based calibration
(RKHS). The RKHS can be treated as a generalised space
form in SVM and GPR (Rosipal and Trejo, 2001). Then the The output offset comes from suspecting data reliability
LV can be obtained on such functional space. The kernel or grade-drifting caused by aging sensors. The offset-based
based method is widely used in industrial processes, which calibration technique uses the deviation between the output
82 W. Guo et al.
feedback and the online prediction to fix the upcoming relevance under the current working condition. Commonly,
predictive output (Ahmed et al., 2009). Therefore, this a window with a fixed-length containing the most recent
method is mainly used for applications under supervised measurements is employed, which ensures the maximum
learning frameworks. For example, Ni et al. (2011) used relevance to the current condition (Kadlec et al., 2011). In the
the output basis robustly update the GPR model by adding MW, the newest samples are received and the same number
an offset smoother; Ahmed et al. (2015) used the offset- of oldest measurements is excluded as the window slides
based model into the least squares support vector machine along the record sequence. Then the identified model needs
(LSSVM) to predict the nitrogen oxide discharged from the to be recalculated as well (Fezai et al., 2016; Jiang and Yan,
coal-fired power plant. 2013). Sometimes, it is unnecessary to build a new model
immediately when the window has been slid. Therefore, the
Figure 10 Illustration of GPR for the posterior distributions hypothesis-testing scheme could make a decision whether the
(see online version for colours) model needs to be modified (Shao and Tian, 2015; Yao and
Ge, 2017).
5
True function Confidence
interval 3.2 Different model structures
4 Training data
space (Bemporad et al., 2003; Breschhi et al., 2016). Both k- one time or zero, which leads to the diversity of samples in
means and PWA require the specified cluster number at the each sub-model (shown in Figure 13).
initialisation step. Alternatively, the cluster number also can
be determined during the clustering step by using the affinity Figure 13 The framework of ensemble learning model
propagation (AP) clustering algorithm (Liang et al., 2019; (see online version for colours)
Zhang et al., 2018), or the subtractive clustering algorithm
(Pan et al., 2010; Norhayati and Rashid, 2018). Nonduplicate
data extraction
Base ω1
3.2.2 Mixture model model 1
Base ω2
Different from the switch-based division strategy, the mixture model 2
model converts those weighted sub-outputs into the final Training set
Base ω3
output with a linear superposition (shown in Figure 12). model 3
high nonlinearity in batch processes; Yuan et al. (2017) 4.1 Practical applications
constructed a local weighted log-likelihood function to
approximate the nonlinear relationship of data; Zheng et al. Challenges of the development from theoretical researches to
(2018) developed a semi-supervised soft sensing model by practical applications are still urgently needed to be addressed
extracting the information from the local model including (Quiñones-Grueiro et al., 2019). This is mainly because
both labelled and unlabelled data. the practical application is more like a stochastic discrete
event system on aspects of multi-rate sampling, missing
Figure 14 The framework of instance-based local model data, dynamic character, and maintenance frequency. Such
(see online version for colours) uncertain elements should be put in place for the development
of process monitoring, soft sensing, fault diagnosis, etc. From
the practical viewpoint, the proposal of process modelling
should handle this issue currently rather than waiting for
Training set
offline analysis.
Instance-based model
4.2 Data mining and analysing
Local
Query Relevant model Output
data With the increasing requirement of the product quality,
modern plant-wide processes are becoming more and more
complex. Most of modelling techniques are limited to product
quality prediction, normal working condition monitoring,
fault classification, and model division. However, tracing
back to the root cause of process changes is still difficult.
Therefore, a cause-to-effect relationship among the dataset
3.2.5 Transfer learning model should be considered for further analysis, which reveals
As a sub-field of machine learning techniques, transfer the root cause of process changes and directs the process
learning is used to deal with the target domain without or little operation. Although several methods have been used, e.g.,
amount of labelled data, which is disabled for training any Bayesian network (Gonzalez et al., 2015) and relative
model. A transferred model can be constructed by permitting importance analysis (Yan et al., 2017), investigations for the
the difference in distributions, domains, and tasks in training application in plant-wide processes are still needed to be
and testing phase (Pan and Yang, 2010). Of cause, it should executed (Ge, 2017).
be on the assumption that enough samples are contained in
at least one related source domain, which provides a primary 4.3 Diversified data
basis for modelling and associated information extraction.
As the coming era of big data, massive volumes of data are
Additionally, the proper identification of the source domain is
collected from industrial processes. Besides the depth of data
essential as the sample obtained from dramatically different
mining, the diversification of collected data is also needed
source domains may adversely affect the training process,
to be paid attention, such as spectral data, image data, audio
which is termed as negative transfer (Salaken et al., 2017).
data, video data, and so on. Basically, the diversified data
Transfer learning is mostly used in image recognition and
still contains information related to the nature of industrial
text classification, etc. (Pan and Yang, 2010). Until recent
processes. Meanwhile, such data is much easier and faster
years, it has been introduced to the process modelling (Facco
to collect. However, how to combine the traditional data and
et al., 2012, 2014; Kang, 2018; Liu et al., 2019b; Shao et al.,
the diversified data, or how to utilise the diversified data
2019; Tomba et al., 2014). Actually, the transfer learning
independently for process modelling is a great challenge for
is a kind of domain-based adaptation method with a model
future works.
trained on the source domain and calibrated on the target
domain (Kang, 2018; Salaken et al., 2017).
4.4 Economic model
Due to the factor of sustainability, recently, not only
4 Perspectives for future research for researches of energy efficiency, but also for matters
related to environmental protection have been taken into
consideration for process modelling (Hanes and Bakshi,
Over the past decades, data-driven approaches used for
2015). As mentioned in early section, modelling methods
industrial processes have continually caught widespread
actually can be performed as optimisation problems. By
attention from both academics and engineers, which takes
introducing new constraints inspired by the economic model
the benefit from the distributed control systems (DCS) and
predictive control (EMPC) (Ellis et al., 2014), both energy
the power of modern computers. Meanwhile, more and more
efficiency and environmental sustainability of the process
new developed data-driven algorithms have been introduced
should be monitored as well. To this end, how to effectively
to cooperate new industrial characteristics. However, some
incorporate the process knowledge into industrial processes
fundamental issues related to data-driven approaches should
needs more consideration in future works.
be considered for process modelling.
A review on data-driven approaches for industrial process modelling 85
5 Conclusion Bao, L., Yuan, X.F. and Ge, Z.Q. (2015) ‘Co-training partial least
squares model for semi-supervised soft sensor development’,
In this paper, several most widely used machine learning Chemometrics and Intelligent Laboratory Systems, Vol. 147,
pp.75–85.
algorithms for industrial process modelling are reviewed,
Bemporad, A., Garulli, A., Paoletti, S. and A, V. (2003) ‘A greedy
with the methodology, the mathematical method, and the
approach to identification of piecewise affine models’, Hybrid
structure selection. Discussions related to extensions and
Systems: Computation and Control, Vol. 2623, pp.97–112.
challenges are also carried out for future researches. Actually,
Ben Ishak, A. (2016) ‘Variable selection using support vector
data-driven modelling is a field relying on information regression and random forests: a comparative study’, Intelligent
fusion. From the perspective of algorithms, none of them Data Analysis, Vol. 20, pp.83–104.
can be treated as a universal method for addressing any Bishop, C.M. (2006) Pattern Recognition and Machine Learning,
industrial situation. From the perspective of applications, Springer-Verlag, New York.
similarly, one modelling task may need several algorithms to Breiman, L. (1996) ‘Bagging predictors’, Machine Learning,
be implemented. Therefore, the data-driven method has an Vol. 24, No. 2, pp.123–140.
important potential for practical applications and academic Breschhi, V., Piga, D. and Bemporad, A. (2016) ‘Piecewise
researches, which plays an important role in industrial affine regression via recursive multiple least squares and
processes. multicategory discrimination’, Automatica, Vol. 70, pp.155–
162.
Cao, D.S., Xu, Q.S., Liang, Y.Z., Zhang, L.X. and Li, H.D. (2010)
Acknowledgement ‘The boosting: A new idea of building models’, Chemometrics
and Intelligent Laboratory Systems, Vol. 100, pp.1–11.
Cheng, C. and Chiu, M.S. (2004) ‘A new data-based methodology
The project is funded in part by the National Natural Science
for nonlinear process modeling’, Chemical Engineering
Foundation under Grant 61873113, Key R&D Program of
Science, Vol. 59, pp.2701–2810.
Jiangsu Province, China under Grant BE2018370, and the
Cheng, C. and Chiu, M.S. (2005) ‘Nonlinear process monitoring
Postgraduate Research and Practice Innovation Program of
using JITL-PCA’, Chemometrics and Intelligent Laboratory
Jiangsu Province (KYCX17 1785). Systems, Vol. 76, pp.1–13.
Chiplunkar, R. and Huang, B. (2019) ‘Output relevant slow feature
extraction using partial least squares’, Chemometrics and
References Intelligent Laboratory Systems, Vol. 191, pp.148–157.
Ding, Z.Y., Zhang, J. and Liu, Y. (2017) ‘Ensemble non-gaussian
Abyaneh, H.Z. (2014) ‘Evaluation of multivariate linear regression local regression for industrial silicon content prediction’, ISIJ
and artificial neural networks in prediction of water quality International, Vol. 57, No. 11, pp.2022–2027.
parameters’, Journal of Environmental Health Science and Du, W.Y., Fan, Y.P. and Zhang, Y.W. (2017) ‘Multimode process
Engineering, Vol. 12, p.40. monitoring based on data-driven method’, Journal of the
Franklin Institute-Engineering and Applied Mathematics,
Ahmad, I., Kano, M., Hasebe, S., Kitada, H. and Murata, N. (2014)
Vol. 345, No. 6, pp.2613–2627.
‘Gray-box modeling for prediction and control of molten steel
temperature in Tundish’, Journal of Process Control, Vol. 24, Duchanoy, C.A., Moreno-Armendariz, M.A., Urbina, L., Cruz-
Villar, C.A., Calvo, H. and Rubio, J.D. (2017) ‘A novel
No. 4, pp.375–382.
recurrent neural network soft sensor via a differential evolution
Ahmed, F., Cho, H.J., Kim, J.K., Seong, N.U. and Yeo, Y.K. (2015) training algorithm for the tire contact patch’, Neurocomputing,
‘A recursive pls-based soft sensor for prediction of the melt Vol. 235, pp.71–82.
index during grade change operations in hdpe plant’, Korean DuyTrinh, N., Yin, S.H., Tan, N.N., Son, P.X. and Duc, L.A. (2019) ‘
Journal of Chemical Engineering, Vol. 32, No. 6, pp.1029– ‘A new method for online monitoring when grinding Ti-6Al-4V
1036. alloy’, Materials and Manufacturing Processes, Vol. 34, No. 1,
Ahmed, F., Nazir, S. and Yeo, Y.K. (2009) ‘A recursive pls-based pp.39–53.
soft sensor for prediction of the melt index during grade Ellis, M., Durand, H. and Christofides, P.D. (2014) ‘A tutorial review
change operations in HDPE plant’, Korean Journal of Chemical of economic model predictive control methods’, Journal of
Engineering, Vol. 26, No. 1, pp.14–20. Process Control, Vol. 24, No. 8, pp.1156–1178.
Askari, S., Montazerin, N., Fazel Zarandi, M.H. and Hakimi, Eyvazian, M., Noorossana, R., Saghaei, A. and Amiri, A. (2011) ‘
E. (2017) ‘Generalized entropy based possibilistic fuzzy c- ‘Phase II monitoring of multivariate multiple linear regression
means for clustering noisy data and its convergence proof’, profiles’, Quality and Reliability Engineering International,
Vol. 27, No. 3, pp.281–296.
Neurocomputing, Vol. 219, pp.186–202.
Facco, P., Largoni, M., Tomba, E., Bezzo, F. and Barolo, M. (2014)
Back, A.D. and Weigend, A.S. (1997) ‘A first application of
‘Transfer of process monitoring models between plants: Batch
independent component analysis to extracting structure from
systems’, Chemical Engineering Research and Design, Vol. 92,
stock returns’, International Journal of Neural Systems, Vol. 8, No. 2, pp.273–284.
No. 4, pp.473–484.
Facco, P., Tomba, E., Bezzo, F. and Garcı́a-Muñoz, S. (2012)
Baffi, G., Martin, E.B. and Morris, A.J. (1999) ‘Non-linear ‘Transfer of process monitoring models between different
projection to latent structures revisited: the quadratic pls plants using latent variable techniques’, Industrial and
algorithm’, Computers and Chemical Engineering, Vol. 23, Engineering Chemistry Research, Vol. 51, No. 21, pp.7327–
pp.395–411. 7339.
86 W. Guo et al.
Fan, J.C., Qin, S.J. and Wang, Y.Q. (2014) ‘Online monitoring of Huang, J.Y. and Zhao, J. (2018) ‘Identification of multi-model lpv
nonlinear multivariate industrial processes using filtering kica- model with two scheduling variables using transition test’,
pca’, Control Engineering Practice, Vol. 22, pp.205–216. International Journal of Modelling, Identification and Control,
Fezai, R., Mansouri, M., Taouali, O., Harkat, M.F. and Bouguila, Vol. 29, No. 1, pp.31–43.
N. (2018) ‘Online reduced kernel principal component analysis Hyvärinen, A. and Oja, E. (2000) ‘Independent component analysis:
for process monitoring’, Journal of Process Control, Vol. 61, algorithms and applications’, Neural networks, Vol. 13,
pp.1–11. Nos. 4–5, pp.411–430.
Fezai, R., Taouali, O., Harkat, M.F. and Bouguila, N. (2016) ‘A Jadhav, S., Nalbalwar, S. and Ghatol, A. (2014) ‘Feature elimination
new fault detection method for nonlinear process monitoring’, based random subspace ensembles learning for ecg arrhythmia
International Journal of Advanced Manufacturing Technology, diagnosis’, Soft Computing, Vol. 18, No. 3, pp.579–587.
Vol. 87, No. 9-12, pp.3425–3436. Jiang, Q.C. and Yan, X.F. (2013) ‘Weighted kernel principal
Frigieri, E.P., Campos, P.H.S., Paiva, A.P. and Balestrassi, P.P. component analysis based on probability density estimation
(2016) ‘A mel-frequency cepstral coefficient-based approach and moving window and its application in nonlinear chemical
process monitoring’, Chemometrics and Intelligent Laboratory
for surface roughness diagnosis in hard turning using acoustic
Systems, Vol. 127, pp.212–131.
signals and gaussian mixture models’, Applied Acoustics,
Vol. 113, pp.230–237. Jiang, Q.C., Yan, X.F. and Huang, B. (2016) ‘Performance-driven
distributed PCA process monitoring based on fault-relevant
Ge, Z.Q. (2016) ‘Supervised latent factor analysis for process
variable selection and bayesian inference’, IEEE Transactions
data regression modeling and soft sensor application’, IEEE
on Industrial Electronics, Vol. 63, No. 1, pp.377–386.
Transactions on Control Systems Technology, Vol. 24, No. 3,
Jiang, Q.C., Yan, X.F. and Huang, B. (2019) ‘Review and
pp.1004–1011.
perspectives of data-driven distributed monitoring for industrial
Ge, Z.Q. (2017) ‘Review on data-driven modeling and monitoring plant-wide processes’, Industrial and Engineering Chemistry
for plant-wide industrial processes’, Chemometrics and Research, Vol. 58, pp.12899–12912.
Intelligent Laboratory Systems, Vol. 171, pp.16–25.
Jing, S.X., Pan, T.H. and Li, Z.M. (2017) ‘Variable knot-based
Ge, Z.Q. and Song, Z.H. (2012) ‘Multivariate statistical process spline approximation recursive bayesian algorithm for the
monitoring using modified factor analysis and its application’, identification of wiener systems with process noise’, Nonlinear
Journal of Chemical Engineering of Japan, Vol. 45, No. 10, Dynamics, Vol. 90, No. 4, pp.2293–2303.
pp.829–839. Kadlec, P., Gabrys, B. and Strandt, S. (2009) ‘Data-driven soft
Ge, Z.Q. and Song, Z.H. (2014) ‘Ensemble independent component sensors in the process industry’, Computers and Chemical
regression models and soft sensing application’, Chemometrics Engineering, Vol. 33, pp.795–814.
and Intelligent Laboratory Systems, Vol. 130, pp.115–122. Kadlec, P., Grbic, R. and Gabrys, B. (2011) ‘Data-driven soft sensors
Ge, Z.Q., Song, Z.H., Ding, S.X. and Huang, B. (2017) ‘Data mining in the process industry’, Computers and Chemical Engineering,
and analytics in the process industry: The role of machine Vol. 35, No. 1, pp.1–24.
learning’, IEEE Access, Vol. 5, pp.20590–20616. Kaneko, H. and Funatsu, K. (2014) ‘ ‘Adaptive soft sensor based
Geladi, P. and Kowalski, B.R. (1986) ‘ ‘Partial least-squares on online support vector regression and bayesian ensemble
regression: a tutorial’, Analytica Chimica Acta, Vol. 185, learning for various states in chemical plants’, Chemometrics
pp.1–17. and Intelligent Laboratory Systems, Vol. 137, pp.57–66.
Gonzalez, R., Huang, B. and Lau, E. (2015) ‘Process monitoring Kaneko, H. and Funatsu, K. (2016) ‘ ‘Ensemble locally weighted
using kernel density estimation and bayesian networking with partial least squares as a just-in-time modeling method’,
an industrial case study’, ISA Transactions, Vol. 58, pp.330– AICHE Journal, Vol. 62, No. 3, pp.717–725.
347. Kang, S. (2018) ‘On effectiveness of transfer learning approach
Gopakumar, V., Tiwari, S. and Rahman, I. (2018) ‘A deep learning for neural network-based virtual metrology modeling’, IEEE
based data driven soft sensor for bioprocesses’, Biochemical Transactions on Semiconductor Manufacturing, Vol. 31, No. 1,
pp.149–155.
Engineering Journal, Vol. 136, pp.28–39.
Khatibisepehr, S., Huang, B. and Khare, S. (2013) ‘Design of
Guo, W., Pan, T.H. and Li, Z.M. (2016) ‘Development of a soft
inferential sensors in the process industry: A review of bayesian
sensor for processes with multiple operating regimes using
methods’, Journal of Process Control, Vol. 23, pp.1575–1596.
adaptive multi-state partial least squares regression’, Journal of
the Taiwan Institute of Chemical Engineers, Vol. 67, pp.20–28. Khediri, I.B., Weihs, C. and Liman, M. (2012) ‘Kernel k-means
clustering based local support vector domain description
Hanes, R.J. and Bakshi, B.R. (2015) ‘ ‘Sustainable process design fault detection of multimodal processes’, Expert Systems with
by the process to planet framework’, AICHE Journal, Vol. 61, Applications, Vol. 39, No. 2, pp.2166–2171.
No. 10, pp.3320–3331.
Kim, D. and Lee, I.B. (2003) ‘Process monitoring based on
Hapfelmeier, A. and Ulm, K. (2013) ‘ ‘A new variable selection probabilistic PCA’, Chemometrics and Intelligent Laboratory
approach using random forests’, Computational Statistics and Systems, Vol. 67, pp.109–123.
Data Analysis, Vol. 60, pp.50–69.
Kim, Y.I., Kim, D.W., Lee, D. and Lee, K.H. (2004) ‘A cluster
Hu, Y., Ma, H.H. and Shi, H.B. (2013) ‘Enhanced batch process validation index for GK cluster analysis based on relative
monitoring using just-in-time-learning based kernel partial degree of sharing’, Information Sciences, Vol. 168, Nos. 1–4,
least squares’, Chemometrics and Intelligent Laboratory pp.225–242.
Systems, Vol. 123, pp.15–27. LeBreux, M., Desilets, M. and Lacroix, M. (2012) ‘Control of
Hu, Z.K., snd W.H.Gui, Z.W.C. and Jiang, B. (2014) ‘Adaptive pca the ledge thickness in high-temperature metallurgical reactors
based fault diagnosis scheme in imperial smelting process’, ISA using a virtual sensor’, Inverse Problems in Science and
Transactions, Vol. 53, No. 5, pp.1446–1455. Engineering, Vol. 20, No. 8, pp.1215–1238.
A review on data-driven approaches for industrial process modelling 87
Lee, J.M., Qin, S.J. and Lee, I.B. (2007) ‘Fault detection of non- Norhayati, I. and Rashid, M. (2018) ‘Adaptive neuro-fuzzy
linear processes using kernel independent component analysis’, prediction of carbon monoxide emission from a clinical
Canadian Journal of Chemical Engineering, Vol. 85, No. 4, waste incineration plant’, Neural Computing and Applications,
pp.526–536. Vol. 30, No. 10, pp.3049–3061.
Liang, H.B., J, L.Z., Khan, M.J. and Han, J.X. (2019) ‘An sand Ogura, T. (2010) ‘ ‘A variable selection method in principal
plug of fracturing intelligent early warning model embedded canonical correlation analysis’, Computational Statistics and
in remote monitoring system’, IEEE Access, Vol. 7, pp.47944– Data Analysis, Vol. 54, No. 4, pp.1117–11237.
47954. Pan, S.J. and Yang, Q.A. (2010) ‘A survey on transfer learning’,
Liu, Y. and Zhang, G. (2015) ‘Scale-sifting multiscale nonlinear IEEE Transactions on Knowledge and Data Engineering,
process quality monitoring and fault detection’, Canadian Vol. 22, No. 10, pp.1345–1359.
Journal of Chemical Engineering, Vol. 93, No. 8, pp.1416– Pan, T.H., Wong, D.S.H. and Jang, S.S. (2010) ‘Development of a
1425. novel soft sensor using a local model network with an adaptive
Liu, Y., Gao, Z.L., Li, P. and Wang, H.Q. (2012) ‘Just-in-time kernel subtractive clustering approach’, Industrial and Engineering
learning with adaptive parameter selection for soft sensor Chemistry Research, Vol. 49, No. 10, pp.4738–4747.
modeling of batch processes’, Industrial and Engineering Peng, J.T., Peng, S.L. and Hu, Y. (2012) ‘Partial least squares
Chemistry Research, Vol. 51, No. 11, pp.4313–4327. and random sample consensus in outlier detection’, Analytica
Liu, Q., Qin, S.J. and Chai, T.Y. (2014) ‘Multiblock concurrent Chimica Acta, Vol. 719, pp.24–29.
pls for decentralized monitoring of continuous annealing Peng, K.X., Zhang, K., You, B. and Dong, J. (2015) ‘Quality-
processes’, IEEE Transactions on Industrial Electronics, related prediction and monitoring of multi-mode processes
Vol. 61, No. 11, pp.6429–6437. using multiple pls with application to an industrial hot strip
Liu, Y.Q., Pan, Y.P., Huang, D.P. and Wang, Q.L. (2017) mill’, Neurocomputing, Vol. 168, pp.1094–1103.
‘Fault prognosis of filamentous sludge bulking using an Piepho, H.P. (2009) ‘Ridge regression and extensions for
enhanced multi-output Gaussian processes regression’, Control genomewide selection in maize’, Crop Science, Vol. 49, No. 4,
Engineering Practice, Vol. 62, pp.46–54. pp.1165–1176.
Liu, R.S., Peng, M.F. and Xiao, X.H. (2018a) ‘Ultra-short-term Pilario, K.E.S. and Cao, Y. (2018) ‘Canonical variate dissimilarity
wind power prediction based on multivariate phase space analysis for process incipient fault detection’, IEEE
reconstruction and multivariate linear regression’, Energies, Transactions on Industrial Informatics, Vol. 14, No. 12,
Vol. 11, No. 10, pp.2763. pp.5308–5315.
Liu, Y.Q., Liu, B., Zhao, X.J. and Xie, M. (2018b) ‘A mixture Poerio, D.V. and Brown, S.D. (2018) ‘A frequency-localized
of variational canonical correlation analysis for nonlinear and recursive partial least squares ensemble for soft sensing’,
quality-relevant process monitoring’, IEEE Transactions on Journal of Chemometrics, Vol. 32, No. 5, pp.e2999.
Industrial Electronics, Vol. 65, No. 8, pp.6478–6486. Pooyan, N., Shahbazian, M., Salahshoor, K. and Hadian, M. (2015)
Liu, S.T., Gao, X.W., Qi, W.H. and Zhang, S.M. (2019a) ‘Soft sensor ‘Simultaneous fault diagnosis using multi class support vector
modelling of propylene conversion based on a takagi-sugeno machine in a dew point process’, Journal of Natural Gas
fuzzy neural network optimized with independent component Science And Engineering, Vol. 23, pp.373–379.
analysis and mutual information’, Transactions of the Institute Qi, C.K., Li., H.X., Li, S.Y., Zhao, X.C. and Gao, F. (2014) ‘A fuzzy-
of Measurement and Control, Vol. 41, No. 3, pp.737–748. based spatio-temporal multi-modeling for nonlinear distributed
Liu, Y., Yang, C., Liu, K.X., Chen, B.C. and Yao, Y. parameter processes’, Applied Soft Computing, Vol. 25,
(2019b) ‘Domain adaptation transfer learning soft sensor pp.309–321.
for productquality prediction’, Chemometrics and Intelligent Qin, S.J. (1998) ‘ ‘Recursive pls algorithms for adaptive data
Laboratory Systems, Vol. 192, p.103813. modeling’, Control Engineering Practice, Vol. 22, pp.503–514.
Liukkonen, M., Hälikkä, E., Hiltunen, T. and Hiltunen, Y. (2012) Qin, S.J. (2012) ‘Survey on data-driven industrial process
‘Dynamic soft sensors for NOx emissions in a circulating monitoring and diagnosis’, Annual Reviews in Control, Vol. 36,
fluidized bed boiler’, Environmental Modelling and Software, pp.220–234.
Vol. 97, pp.1483–490. Quiñones-Grueiro, M., Prieto-Moreno, A., Verde, C. and Llanes-
Liukkonen, M., Laakso, I. and Hiltunen, Y. (2013) ‘Advanced Santiago, O. (2019) ‘Data-driven monitoring of multimode
monitoring platform for industrial wastewater treatment: continuous processes: a review’, Chemometrics and Intelligent
Multivariable approach using the self-organizing map’, Laboratory Systems, Vol. 189, pp.56–71.
Environmental Modelling and Software, Vol. 48, pp.193–2016. Rasmussen, C.E. and Williams, C.K. (2006) Gaussian Processes for
Lu, L., Yan, J.H. and de Silva, C.W. (2016) ‘Feature selection Machine Learning, MIT Press, Cambridge, Mass.
for ecg signal processing using improved genetic algorithm Raveendran, R., Kodamana, H. and Huang, B. (2018) ‘Process
and empirical mode decomposition’, Measurement, Vol. 94, monitoring using a generalized probabilistic linear latent
pp.372–381. variable model’, Automatica, Vol. 96, pp.73–83.
Ma, M.D., Ko, J.W., Wang, S.J., Wu, M.F., Jang, S.S., Shieh, S.S. Rinnan, A., Andersson, M., Ridder, C. and Engelsen, S.B. (2014)
and Wong, D.S.H. (2009) ‘Development of adaptive soft sensor ‘Recursive weighted partial least squares (RPLS): an efficient
based on statistical identification of key variables’, Control variable selection method using pls’, Journal of Chemometrics,
Engineering Practice, Vol. 17, No. 9, pp.1026–1034. Vol. 28, No. 5, pp.439–447.
Ni, W.D., Tan, S.K. and Ng, W.J. (2011) ‘Recursive gpr for nonlinear Rosipal, R. and Trejo, L.J. (2001) ‘Kernel partial least squares
dynamic process modeling’, Chemical Engineering Journal, regression in reproducing kernel Hilbert space’, Journal of
Vol. 173, pp.636–643. Machine Learning Research, Vol. 2, No. 2, pp.97–123.
88 W. Guo et al.
Rothman, M.J., Rothman, S.I. and Beals, J. (2013) ‘Development Wang, R., Wang, X.Y., Sun, H. and Huang, Y.T. (2019a) ‘Analysis
and validation of a continuous measure of patient condition of estimator and energy consumption with multiple faults
using the electronic medical record’, Journal of Biomedical over the distributed integrated WSN’, International Journal
Informatics, Vol. 46, No. 5, pp.837–848. of Modelling, Identification and Control, Vol. 32, No. 2,
Salaken, S.M., Khosravi, A., Thanh, T. and Nahavandi, S. (2017) ‘ pp.154–168.
‘Extreme learning machine based transfer learning algorithms: Wang, X.D., Kvaal, K. and Ratnaweera, H. (2019b) ‘Explicit
a survey’, Neurocomputing, Vol. 267, pp.516–524. and interpretable nonlinear soft sensor models for influent
surveillance at a full-scale wastewater treatment plant’, Journal
Shang, C., Yang, F., Huang, D.X. and Lyu, W.X. (2014) ‘Data-driven
of Process Control, Vol. 77, pp.1–6.
soft sensor development based on deep learning technique’,
Journal of Process Control, Vol. 24, pp.223–233. Wold, S., Kettaneh-Wold, N. and Skagerberg, B. (1989) ‘Non-linear
PLS modelling’, Chemometrics and Intelligent Laboratory
Shao, S.Y., McAleer, S., Yan, R.Q. and Baldi, P. (2019) ‘Highly
Systems, Vol. 7, pp.53–65.
accurate machine fault diagnosis using deep transfer learning’,
IEEE Transactions on Industrial Informatics, Vol. 15, No. 4, Wu, H. and Zhao, J.S. (2018) ‘Deep convolutional neural network
model based chemical process fault diagnosis’, Computers and
pp.2446–2455.
Chemical Engineering, Vol. 115, pp.185–197.
Shao, W.M. and Tian, X.M. (2015) ‘Adaptive soft sensor for quality
Yan, Z.B., Kuang, T.H. and Yao, Y. (2017) ‘Multivariate fault
prediction of chemical processes based on selective ensemble
isolation of batch processes via variable selection in partial
of local partial least squares models’, Chemical Engineering
least squares discriminant analysis’, ISA Transactions, Vol. 70,
Research and Design, Vol. 95, pp.113–132.
pp.389–399.
Smith, C. and Jin, Y.C. (2014) ‘Evolutionary multi-objective Yao, L. and Ge, Z.Q. (2017) ‘Online updating soft sensor
generation of recurrent neural network ensembles for time modeling and industrial application based on selectively
series prediction’, Neurocomputing, Vol. 143, pp.302–311. integrated moving window approach’, IEEE Transactions
Souza, F. A.A., Araújo, R. and Mendes, J. (2016) ‘Review of soft on Instrumentation and Measurement, Vol. 66, No. 8,
sensor methods for regression applications’, Chemometrics and pp.1985–1993.
Intelligent Laboratory Systems, Vol. 152, pp.69–79. Yuan, X.F., Ge, Z.Q. and Song, Z.H. (2014a) ‘Locally weighted
Souza, F. A.A., Araújo, R., Matias, T. and Mendes, J. (2013) ‘ kernel principal component regression model for soft sensing of
‘A multilayer-perceptron based method for variable selection nonlinear time-variant processes’, Industrial and Engineering
in soft sensor design’, Journal of Process Control, Vol. 23, Chemistry Research, Vol. 53, No. 35, pp.13736–13749.
pp.1371–1378. Yuan, X.F., Ge, Z.Q. and Song, Z.H. (2014b) ‘Soft sensor model
Stoica, P., Selen, Y. and Jian, L. (2004) ‘Multi-model approach to development in multiphase/multimode processes based on
model selection’, Digital Signal Processing, Vol. 14, No. 5, gaussian mixture regression’, Chemometrics and Intelligent
pp.399–412. Laboratory Systems, Vol. 138, pp.97–109.
Su, Q.L., Hermanto, M.W., Braatz, R.D. and Yuan, X.F., Ge, Z.Q., Song, Z.H., Wang, Y.L., Yang, C.H. and
Chiu, M.S. (2016) ‘Just-in-time-learning based Zhang, H.W. (2017) ‘Soft sensor modeling of nonlinear
extended prediction self-adaptive control for batch industrial processes based on weighted probabilistic projection
processes’, Journal of Process Control, Vol. 43, regression’, IEEE Transactions on Instrumentation and
pp.1–9. Measurement, Vol. 66, No. 4, pp.837–845.
Sun, Q.Q. and Ge, Z.Q. (2019) ‘Probabilistic sequential network Yuan, X.F., Huang, B. and Wang, Y.L. (2018) ‘Deep learning-
based feature representation and its application for soft sensor
for deep learning of complex process data and soft sensor
modeling with variable-wise weighted sae’, IEEE Transactions
application’, IEEE Transactions on Industrial Informatics,
on Industrial Informatics, Vol. 14, No. 7, pp.3236–3243.
Vol. 15, No. 5, pp.2700–2709.
Zhai, L.R., Zhang, Y.W., Guan, S.P., Fu, Y.J. and Feng, L. (2018)
Tamboli, D. and Chile, R. (2018) ‘ ‘Multi-model approach for 2-
‘Nonlinear process monitoring using kernel nonnegative matrix
dof control of nonlinear cstr process’, International Journal
factorization’, Canadian Journal of Chemical Engineering,
of Modelling, Identification and Control, Vol. 30, No. 2, Vol. 96, No. 2, pp.554–563.
pp.143–161.
Zhang, S., Li, L.J., Yao, L.J., Yang, S.P. and Zou, T. (2018) ‘Data-
Teslić, L., Hartmann, B., Nelles, O. and Škrjanc, I. (2011) driven process decomposition and robust online distributed
‘Nonlinear system identification by gustafson- kessel fuzzy modelling for large-scale processes’, International Journal of
clustering and supervised local model network learning for the Systems Science, Vol. 49, No. 3, pp.449–463.
drug absorption spectra process’, IEEE Transactions on Neural Zhang, S.M., Zhao, C.H. and Gao, F.R. (2019) ‘Incipient fault
Networks, Vol. 22, No. 12, pp.1941–1951. detection for multiphase batch processes with limited batches’,
Tomba, E., Meneghetti, N., Facco, P. and Zelenkova, T. (2014) IEEE Transactions on Control Systems Technology, Vol. 27,
‘Transfer of a nanoparticle product between different mixers No. 1, pp.103–117.
using latent variable model inversion’, AICHE Journal, Zhang, X.M., Kano, M. and Li, Y. (2017) ‘Locally weighted
Vol. 2602, No. 1, pp.123–135. kernel partial least squares regression based on sparse
Wang, B. and Mao, Z.Z. (2019) ‘Outlier detection based on gaussian nonlinear features for virtual sensing of nonlinear time-varying
process with application to industrial processes’, Applied Soft processes’, Computers and Chemical Engineering, Vol. 104,
Computing, Vol. 76, pp.505–516. pp.164–171.
Wang, F.L., Tan, S., Peng, J. and Chang, Y.Q. (2012) ‘Process Zhang, Y. and Qin, S. (2007) ‘Fault detection of nonlinear processes
monitoring based on mode identification for multi-mode using multiway kernel independent component analysis’,
process with transitions’, Chemometrics and Intelligent Industrial and Engineering Chemistry Research, Vol. 46,
Laboratory Systems, Vol. 110, No. 1, pp.144–155. No. 23, pp.7780–7787.
A review on data-driven approaches for industrial process modelling 89
Zhao, L.P., Zhao, C.H. and Gao, F.R. (2014) ‘Regression Zhou, L., Chen, J.H. and Song, Z.H. (2015) ‘Recursive gaussian
modeling and quality prediction for multiphase batch processes process regression model for adaptive quality monitoring
with inner-phase analysis’, Chemometrics and Intelligent in batch processes’, Mathematical Problems in Engineering,
Laboratory Systems, Vol. 135, pp.1–16. p.761280.
Zhao, L., Peng, T., Xie, Y.F., Yang, C.H. and Gui, W.H. (2017) Zhu, J.L., Ge, Z.Q., Song, Z.H. and Gao, F.R. (2018) ‘Review and
‘Recognition of flooding and sinking conditions in flotation big data perspectives on robust data mining approaches for
process using soft measurement of froth surface level and qta’, industrial process modeling with outliers and missing data’,
Chemometrics and Intelligent Laboratory Systems, Vol. 169, Annual Reviews in Control, Vol. 46, pp.107–133.
pp.45–52. Zhu, Q.Q., Liu, Q. and Qin, S.J. (2017) ‘Concurrent quality
Zhao, Z.G., Li, Q.H., Huang, B., Liu, F. and Ge, Z.Q. (2015) and process monitoring with canonical correlation analysis’,
‘Process monitoring based on factor analysis: Probabilistic Journal of Process Control, Vol. 60, pp.95–103.
analysis of monitoring statistics in presence of both complete Zhu, Z.B., Huan, Z.H. and Palazoglu, A. (2011) ‘Transition
and incomplete measurements’, Chemometrics and Intelligent process modeling and monitoring based on dynamic ensemble
Laboratory Systems, Vol. 142, pp.18–27. clustering and multiclass support vector data description’,
Zheng, J.H. and Song, Z.H. (2018) ‘Semisupervised learning for Industrial and Engineering Chemistry Research, Vol. 50,
probabilistic partial least squares regression model and soft No. 24, pp.13969–13983.
sensor application’, Journal of Process Control, Vol. 64,
pp.123–131.
Zheng, W.J., Liu, Y., Gao, Z.L. and Yang, J.G. (2018) ‘Just-
in-time semi-supervised soft sensor for quality prediction
in industrial rubber mixers’, Chemometrics and Intelligent
Laboratory Systems, Vol. 180, pp.36–41.