Professional Documents
Culture Documents
Outlier Generation and Anomaly Detection Based On Intelligent One-Class Techniques Over A Bicomponent Mixing System
Outlier Generation and Anomaly Detection Based On Intelligent One-Class Techniques Over A Bicomponent Mixing System
Outlier Generation and Anomaly Detection Based On Intelligent One-Class Techniques Over A Bicomponent Mixing System
1 Introduction
In general terms, a high percentage of enterprises presents complex and expensive
processes whose operation can be optimized [1,18]. The technological advances
achieved in many different industrial fields, such as instrumentation or automa-
tion has led to the optimisation and development of industrial processes [15].
In addition, factors like of the promotion of energy efficiency policies or the
increased competitiveness must be taken into consideration in any plant opti-
misation [20,40]. Then, a good operation in a specific process requires an early
detection of any deviation from its normal operation of actuators, sensors, and
so on [12,24]. This is specially important in high cost and safe-critical processes
[9,28]. From an economic point of view, avoiding a wrong performance of any
c Springer Nature Switzerland AG 2020
F. Martı́nez Álvarez et al. (Eds.): SOCO 2019, AISC 950, pp. 399–410, 2020.
https://doi.org/10.1007/978-3-030-20055-8_38
400 E. Jove et al.
2 Case of Study
2.1 Bicomponent Mixing System
The installation used to develop the present work aims at mixing two different
fluids to obtain a bicomponent material, whose features are suitable for wind
Outlier Generation and Anomaly Detection 401
generator blades. The primary fluids, the epoxy resin and the catalyst, are stored
in two different tanks. They are boosted by two independent pumps, supplied
by variable frequency drives, and mixed in a valve in charge of delivering the
proper amount of bicomponent at the output [17].
Figure 1 shows a simplified scheme of the above explanation with the actua-
tors, valves, pipes and sensors of the mixing system.
1. Find out the maximum and minimum of each attribute in the initial dataset
and save these values in two vectors Vmax , Vmin ∈ Rn .
2. Select randomly 2 dimensions u and v of the n-dimensional dataset.
3. Replace the value of a(u) randomly by Vmax (u) or Vmin (u).
4. Replace the value of a(v) randomly by Vmax (v) or Vmin (v).
The example of Fig. 3 shows the transformation of a point inside the target
class (green point) in R3 into an anomaly. The transformation T x replaces the
x coordinate by the maximum registered (yellow dot), and the transformation
404 E. Jove et al.
T y changes the y value by its minimum (red dot). Then, it can be noticed that
the data generated is out of the initial set.
From the initial dataset, corresponding to the right operation of the plant, 5
% of the samples were artificially modified to transform them into anomalies,
following the process described in Sect. 3.2. For each sample, two dimensions are
randomly selected and replaced by its maximum or minimum values. The one
class techniques, used to assess the anomaly generation, were validated using
a 3 times nested k − f old with k = 10. The performance of each classifier is
assessed using the Area Under Curve (AUC) parameter, that relates the true
positives and false positives [2]. In addition, the standard deviation (SD) of the
AUC obtained for each repetition and the training time is registered.
The performance of this technique was evaluated using 5, 10, 50, 100, 500 and
1000 2D projections with a λ value of 0.9, 1 and 1.1. Given the geometric nature
of this technique, the dataset was not normalised. The AUC, the SD and the
training time for each configuration can be seen in Table 1. The obtained results
with λ = 0.9 were not successful and were omitted due to the wrong performance
of this configuration.
Outlier Generation and Anomaly Detection 405
The MLP Autoencoder was obtained using the Matlab function train
Autoencoder [25]. As mentioned in Sect. 3.1, the number of neurons in the hidden
layer must be less than the number of inputs. Then, to check the importance of
this parameter, it is varied from 1 to 9 with a step of 2 neurons. Different config-
uration were tried regarding the normalisation, first with a 0 to 1 normalisation,
then with z-score [35] and the results without normalisation were also checked.
The criteria to select if a test data is an anomaly, is based on the reconstruction
error. If the reconstruction error is higher than the one obtained with 99% of
the training set, it is considered as an anomaly. The results in terms of AUC,
SD and training cost for each configuration is shown in Table 2.
The SVM one-class classifier was obtained using the Matlab function f itcsvm
[26]. The kernel function was set as Gaussian, the percentage of outlier fraction
of the training data was varied from 0 to 3. The influence of the normalisation
methods was assessed as for Autoencoder. As mentioned in Sect. 3.1, the criteria
to decide if a test data is an anomaly is based on its distance to the decision
plane. This is evaluated with the Matlab function predict [27]. The results are
presented in Table 3
406 E. Jove et al.
Normalisation Outlier frac. (%) AUC (%) STD (%) Time (min)
NoNorm 0 95,82 0,14 2,755
1 95,32 0,35 2,580
2 94,72 0,54 2,631
3 94,53 0,54 2,621
0 to 1 0 95,79 0,15 2,571
1 95,39 0,33 2,613
2 94,83 0,60 2,582
3 94,53 0,60 2,598
Zscore 0 95,83 0,13 2,598
1 95,42 0,36 2,616
2 94,90 0,53 2,586
3 94,52 0,56 2,662
Outlier Generation and Anomaly Detection 407
References
1. Alaiz Moretón, H., Calvo Rolle, J., Garcı́a, I., Alonso Alvarez, A.: Formalization
and practical implementation of a conceptual model for PID controller tuning.
Asian J. Control 13(6), 773–784 (2011)
2. Bradley, A.P.: The use of the area under the ROC curve in the evaluation of
machine learning algorithms. Pattern Recognit. 30(7), 1145–1159 (1997)
3. Calvo-Rolle, J.L., Quintian-Pardo, H., Corchado, E., del Carmen Meizoso-López,
M., Garcı́a, R.F.: Simplified method based on an intelligent model to obtain the
extinction angle of the current for a single-phase half wave controlled rectifier with
resistive and inductive load. J. Appl. Log. 13(1), 37–47 (2015)
408 E. Jove et al.
4. Casale, P., Pujol, O., Radeva, P.: Approximate convex hulls family for one-class
classification. In: International Workshop on Multiple Classifier Systems, pp. 106–
115. Springer (2011)
5. Casteleiro-Roca, J.L., Barragán, A.J., Segura, F., Calvo-Rolle, J.L., Andújar, J.M.:
Fuel cell output current prediction with a hybrid intelligent system. Complexity
(2019)
6. Casteleiro-Roca, J.L., Jove, E., Sánchez-Lasheras, F., Méndez-Pérez, J.A., Calvo-
Rolle, J.L., de Cos Juez, F.J.: Power cell soc modelling for intelligent virtual sensor
implementation. J. Sens. (2017)
7. Chandola, V., Banerjee, A., Kumar, V.: Anomaly detection: a survey. ACM Com-
put. Surv. (CSUR) 41(3), 15 (2009)
8. Chen, Y., Zhou, X.S., Huang, T.S.: One-class SVM for learning in image retrieval.
In: Proceedings of 2001 International Conference on Image Processing, vol. 1, pp.
34–37. IEEE (2001)
9. Chiang, L.H., Russell, E.L., Braatz, R.D.: Fault Detection and Diagnosis in Indus-
trial Systems. Springer, Heidelberg (2000)
10. Fan, H., Wong, C., Yuen, M.F.: Prediction of material properties of epoxy materials
using molecular dynamic simulation. In: 7th International Conference on Thermal,
Mechanical and Multiphysics Simulation and Experiments in Micro-Electronics
and Micro-Systems, EuroSime 2006, pp. 1–4, April 2006
11. Fernández-Francos, D., Fontenla-Romero, Ó., Alonso-Betanzos, A.: One-class con-
vex hull-based algorithm for classification in distributed environments. IEEE Trans.
Syst. Man Cybern.: Syst. 99, 1–11 (2018)
12. Garcia, R.F., Rolle, J.L.C., Castelo, J.P., Gomez, M.R.: On the monitoring task of
solar thermal fluid transfer systems using NN based models and rule based tech-
niques. Eng. Appl. Artif. Intell. 27, 129 – 136 (2014). http://www.sciencedirect.
com/science/article/pii/S0952197613001127
13. González, G., Angelo, C.D., Forchetti, D., Aligia, D.: Diagnóstico de fallas en el
convertidor del rotor en generadores de inducción con rotor bobinado. Revista
Iberoamericana de Automática e Informática industrial 15(3), 297–308 (2018).
https://polipapers.upv.es/index.php/RIAI/article/view/9042
14. Goodfellow, I., Bengio, Y., Courville, A., Bengio, Y.: Deep Learning, vol. 1. MIT
Press, Cambridge (2016)
15. Hobday, M.: Product complexity, innovation and industrial organisation. Res. Pol-
icy 26(6), 689–710 (1998)
16. Hodge, V., Austin, J.: A survey of outlier detection methodologies. Artif. Intell.
Rev. 22(2), 85–126 (2004)
17. Jove, E., Aláiz-Moretón, H., Casteleiro-Roca, J.L., Corchado, E., Calvo-Rolle, J.L.:
Modeling of bicomponent mixing system used in the manufacture of wind generator
blades. In: Corchado, E., Lozano, J.A., Quintián, H., Yin, H. (eds.) Intelligent Data
Engineering and Automated Learning - IDEAL 2014, pp. 275–285. Springer, Cham
(2014)
18. Jove, E., Alaiz-Moretón, H., Garcı́a-Rodrı́guez, I., Benavides-Cuellar, C.,
Casteleiro-Roca, J.L., Calvo-Rolle, J.L.: PID-ITS: an intelligent tutoring system
for PID tuning learning process. In: International Joint Conference SOCO 2017-
CISIS 2017-ICEUTE 2017, León, Spain, 6–8 September 2017, pp. 726–735. Springer
(2017)
19. Jove, E., Antonio Lopez-Vazquez, J., Isabel Fernandez-Ibanez, M., Casteleiro-
Roca, J.L., Luis Calvo-Rolle, J.: Hybrid intelligent system to predict the individual
academic performance of engineering students. Int. J. Eng. Educ. 34(3), 895–904
(2018)
Outlier Generation and Anomaly Detection 409
20. Jove, E., Casteleiro-Roca, J.L., Quintián, H., Méndez-Pérez, J.A., Calvo-Rolle,
J.L.: A new approach for system malfunctioning over an industrial system control
loop based on unsupervised techniques. In: Graña, M., López-Guede, J.M., Etxaniz,
O., Herrero, Á., Sáez, J.A., Quintián, H., Corchado, E. (eds.) International Joint
Conference SOCO’18-CISIS’18-ICEUTE’18, pp. 415–425. Springer, Cham (2018)
21. Jove, E., Gonzalez-Cava, J.M., Casteleiro-Roca, J.L., Méndez-Pérez, J.A., Antonio
Reboso-Morales, J., Javier Pérez-Castelo, F., Javier de Cos Juez, F., Luis Calvo-
Rolle, J.: Modelling the hypnotic patient response in general anaesthesia using
intelligent models. Log. J. IGPL 27, 189–201 (2018)
22. Moreno-Fernandez-de Leceta, A., Lopez-Guede, J.M., Ezquerro Insagurbe, L.,
Ruiz de Arbulo, N., Graña, M.: A novel methodology for clinical semantic anno-
tations assessment. Log. J. IGPL 26(6), 569–580 (2018). https://doi.org/10.1093/
jigpal/jzy021
23. Li, K.L., Huang, H.K., Tian, S.F., Xu, W.: Improving one-class SVM for anomaly
detection. In: 2003 International Conference on Machine Learning and Cybernetics,
vol. 5, pp. 3077–3081. IEEE (2003)
24. Manuel Vilar-Martinez, X., Aurelio Montero-Sousa, J., Luis Calvo-Rolle, J., Luis
Casteleiro-Roca, J.: Expert system development to assist on the verification of
”TACAN” system performance. Dyna 89(1), 112–121 (2014)
25. MathWorks: Autoencoder. https://es.mathworks.com/help/deeplearning/ref/
trainautoencoder.html. Accessed 29 Jan 2019
26. MathWorks: fitcsvm. https://es.mathworks.com/help/stats/fitcsvm.html.
Accessed 29 Jan 2019
27. MathWorks: predict. https://es.mathworks.com/help/stats/classreg.learning.
classif.compactclassificationsvm.predict.html. Accessed 29 Jan 2019
28. Miljković, D.: Fault detection methods: a literature survey. In: 2011 Proceedings
of the 34th International Convention on MIPRO, pp. 750–755. IEEE (2011)
29. Pei, Y., Zaı̈ane, O.: A synthetic data generator for clustering and outlier analysis.
University of Alberta, edmonton, AB, Canada, Department of Computing science
(2006)
30. Quintián, H., Casteleiro-Roca, J.L., Perez-Castelo, F.J., Calvo-Rolle, J.L., Cor-
chado, E.: Hybrid intelligent model for fault detection of a lithium iron phosphate
power cell used in electric vehicles. In: International Conference on Hybrid Artifi-
cial Intelligence Systems, pp. 751–762. Springer (2016)
31. Quintián, H., Corchado, E.: Beta scale invariant map. Eng. Appl. Artif. Intell. 59,
218–235 (2017)
32. Rebentrost, P., Mohseni, M., Lloyd, S.: Quantum support vector machine for big
data classification. Phys. Rev. Lett. 113, 130503 (2014). https://doi.org/10.1103/
PhysRevLett.113.130503
33. Sakurada, M., Yairi, T.: Anomaly detection using autoencoders with nonlinear
dimensionality reduction. In: Proceedings of the MLSDA 2014 2nd Workshop on
Machine Learning for Sensory Data Analysis, p. 4. ACM (2014)
34. Segovia, F., Górriz, J.M., Ramı́rez, J., Martinez-Murcia, F.J., Garcı́a-Pérez, M.:
Using deep neural networks along with dimensionality reduction techniques to
assist the diagnosis of neurodegenerative disorders. Log. J. IGPL 26(6), 618–628
(2018). https://doi.org/10.1093/jigpal/jzy026
35. Shalabi, L.A., Shaaban, Z.: Normalization as a preprocessing engine for data min-
ing and the approach of preference matrix. In: 2006 International Conference on
Dependability of Computer Systems, pp. 207–214, May 2006
410 E. Jove et al.
36. Vincent, P., Larochelle, H., Lajoie, I., Bengio, Y., Manzagol, P.A.: Stacked denois-
ing autoencoders: learning useful representations in a deep network with a local
denoising criterion. J. Mach. Learn. Res. 11(Dec), 3371–3408 (2010)
37. Wang, C.K., Ting, Y., Liu, Y.H., Hariyanto, G.: A novel approach to generate
artificial outliers for support vector data description. In: IEEE International Sym-
posium on Industrial Electronics, ISIE 2009, pp. 2202–2207. IEEE (2009)
38. Wojciechowski, S.: A comparison of classification strategies in rule-based classifiers.
Log. J. IGPL 26(1), 29–46 (2018). https://doi.org/10.1093/jigpal/jzx05
39. Zeng, Z., Wang, J.: Advances in Neural Network Research and Applications, 1st
edn. Springer Publishing Company, Heidelberg (2010). Incorporated
40. Zotes, F.A., Peñas, M.S.: Heuristic optimization of interplanetary trajectories in
aerospace missions. Revista Iberoamericana de Automática e Informática Indus-
trial RIAI 14(1), 1–15 (2017). http://www.sciencedirect.com/science/article/pii/
S1697791216300486