Professional Documents
Culture Documents
Am Ruth Nath 2018
Am Ruth Nath 2018
6
machines where other physics-based data such as pressure and Eigenvalues. The Eigenvalues and components are plotted in a
temperature are significant. scree plot as shown in Figure 3.
B. Understanding the Current System F. Fault Prediction
A furnace fan was used as a test subject for performing this Fault prediction is a process of detecting an abnormal
predictive maintenance activity as shown in Figure 2. This fan behavior of the system. Some of the most commonly used
operates in a non-fluctuating environment, and due to moving algorithms are Hotelling T2 statistic [28] [20], Q statistic [28]
parts and no process data availability, vibration data was [20], K-means algorithm, and hierarchical clustering analysis
chosen to monitor. Also, due to the sensitivity of the vibration [32] in unsupervised learning. In this research study, Hotelling
spectrum of equipment failures, the vibration signal is T2 statistic, a multivariate statistic is used for the analysis. Test
commonly used as the data source [15]. It can also be noted data or calibration data is captured during a healthy state of the
that about 99% of mechanical faults have some noticeable machine and T2 statistics, and prediction limits are obtained at
vibration and acoustic signals. 99.99% confidence interval using the formula 1 and 2. Here,
the significant components obtained using PCA are used for
C. Data Collection calculating the statistic [19].
Vibration monitors were mounted both on x-axis and
y-axis. Vibration data was collected in the time domain at T2 statistic
2048 Hz and for 0.8 seconds every 5 minutes. This raw data is
used for feature extraction, feature selection, fault prediction = ∑ (1)
and fault class prediction. The methodology for this research
study is as shown in Table 1. Upper confidence limit
( )
D. Feature Extraction , ,∝ = , ,∝ (2)
Every instance of raw data collected consists of
approximately 1600 data point in the time domain. It is Figure 4 is the fault detection system for monitoring the
important to capture different features both in the time domain state of the fan. This system was calibrated for July 31 and
and frequency domain [11]. Here, the time domain features August 1. The obtained statistics were used for monitoring the
such as min, max, median, mean, standard deviation, kurtosis, system. Upper limit obtained was 15.83. It can be noticed that
skewness, range, and RMS [25] for both x-axis and y-axis are on August 26 there was a sharp trend in the graph which
collected. The raw data in time domain was later transformed indicated that there was a fault detected. Upon investigation, it
to frequency domain using Fourier transforms [16] to capture was found that the fan housing had displaced by 2 inches and
the same features in x-axis and y-axis at 25 Hz to minimize the the fan was replaced. Upon replacement of the fan, we were
noise at a higher frequency. able to confirm that the model had to be recalibrated for the
new fan. But, our research goal was to detect these changes in
E. Feature Selection our new fault class detection model. Hence, the fan was not
recalibrated and was left to continue to use the same statistics
as the previous fan. This behavior can be observed in Figure 4.
TABLE I. UNSUPERVISED PREDICTIVE MAINTENANCE
METHODOLOGY FOR FAULT CLASS PREDICTION
7
Mixture model clustering and K-means algorithm were used. identified. Hence, it was concluded to use non-parametric
The main objective of these to detect the following density estimation.
1. Healthy state of the machine
2. Faulty state of the machine
3. New fan replacement
Fan 1: PCA-T2-Health
T2-Health
T2-Limit
70
60
50
T2-Health
40
30
20
10
Date
( )=∑ () ( |) (3)
H. Fault Class Prediction
In this section, we will recall the density functions, where P(j) is the mixture proportion and is non-negative. Its
log-likelihood function and E and M steps for Gaussian sum must be equal to one. The Gaussian centers can be
Mixture Modeling [13]. Here, we have considered a defined by their centers cj and their covariance matrix ∑j.
D-dimensional continuous random vector X Rd. From the [13]
reference (equation 3 to 9), the probability density function for
a mixture model which is a linear combination of M Gaussian
component densities is defined as ( | ) = (2 ) |∑ |
) ∑ ( − )
. exp − ( −
[13]
(4)
8
Based on our above mixture model (equation 4), we can
define the log-likelihood function as (equation 5) ∑ ( )( ( )
)(
( )
)
[13] ∑( )
= ( )( | (8)
∑ )
( ) = log ∏ ( ) (5) ( )
()= ∑ ( )
(| ) (9)
where, is the model parameter of P(j), cj and ∑j . Using
Expectation and Maximization approach the maximum Raw PCA
likelihood estimate of can be obtained iteratively. GMM
Features Analysis
Expectation or E-step involves computing the expected value
of some unobserved data using current parameter estimates
and observed data. Maximization or M-step involves using the
expected values from E-step to compute the maximum
likelihood estimates. Upon achieving this model parameters Classification
are updated. At a given iteration step t, Data
Figure 6 represents the Gaussian Mixture Model have class labels from historical data we can only
Clustering fitted by Expectation-maximization (GMM-EM hypothesize this as a failed state. Upon investigation we
model) results grouped by dates on the x-axis and cluster by the maintenance crew, it was found that it was a shaft
number on the y-axis. The clusters are color-coded, and size of housing had displaced by two inches
the cluster represents some data points clustered to a certain 3. From August 31 to September 19, cluster 5 had
cluster on that day. Bigger the size of the cluster, the certain significant occurrence. As mentioned in the above
state was majorly observed and smaller the size fewer times section, this was the period when the old fan was
that state was observed. In our model, we found a total of 6 replaced with a new fan. We can confirm this change
states. Without any previous knowledge of the data, it is
here.
difficult to interpret the results. But, when we compare
4. Quite interestingly, there was yet another cluster formed.
1. From July 7 to August 25, cluster 4, 3 and 2 had major
Cluster 6 occurred majorly on September 3 and
state occurrence. From T2 statistic we can hypothesize
September 10. The machine was completely shut down
that this could be normal state
on September 3 and for a shift on September 10. These
2. From August 26 to August 30, cluster 1 has a major state
states could represent the machine shutdown state.
occurrence. We can observe the deviation from the
normal state. Since in unsupervised learning, we do not
9
I. K-means Algorithm silhouette method. K-means clustering is an unsupervised
K-Means Algorithm is one of the most commonly used in learning procedure; the method can be directly implemented to
fault detection and fault class prediction in unsupervised measured vibration data, and thus the need for training process
learning. This algorithm uses Euclidian distance to form the for identification for the faulty process is eliminated [11].
clusters, and the number of clusters is determined using
10
+ [10] B. H. Chowdhury and K. Wang, "Fault Classification Using Kohonen
Accuracy of T = Feature Mapping," in Proceedings of the International Conference on
+ + + Intelligent Systems Applications to Power Systems, 1996.
934 + 10340 [11] C. T. Yiakopoulos, K. C. Gryllias, and I. A. Antoniadis, "Rolling
= = 90.01%
934 + 142 + 1051 + 10340 element bearing fault detection in industrial environments based on a
(11) K-means clustering approach," Expert Systems with Applications, vol.
38, no. 3, pp. 2888-2911, 2011.
[12] Ç. Ari and S. Aksoy, "Unsupervised classification of remotely sensed
V. CONCLUSION images using Gaussian mixture models and particle swarm
In the recent years, the wide availability of secure wireless optimization," in Geoscience and Remote Sensing Symposium
(IGARSS), 2010.
sensor has made it more viable option to monitor the machine [13] C. Archambeau and M. Verleysen, "Fully Nonparametric Probability
regularly and continuously [33]. The core objective of this Density Function Estimation with Finite Gaussian Mixture Models,"
paper was to propose, develop and implement the predictive in ICAPR'2003 proceedings - 5th International Conference on
maintenance methodology rapidly using unsupervised Advances in Pattern Recognition, Calcutta, 2003.
learning for a furnace fan as a case. With the following [14] C. Biernacki, G. Celeux, and Gérard Govaert, "Choosing starting
values for the EM algorithm for getting the highest likelihood in
implementation, we were able to predict the faults with 82.96% multivariate Gaussian mixture models," Computational Statistics &
accuracy and determine the different states of the machine Data Analysis, vol. 41, no. 3-4, pp. 561-575, 2003
based on domain knowledge. In order predict the state of the [15] F. Zhou, Y. Gao and C. Wen, "A Novel Multimode Fault
machine, GMM, and K-means algorithms were used. Based Classification Method Based on Deep Learning," Journal of Control
on the analysis and results we found that GMM methodology Science and Engineering, vol. 2017, 2017
works better in predicting the states of the fault accurately [16] G. G. Yen and K.-C. Lin, "Wavelet packet feature extraction for
vibration monitoring," IEEE Transactions on Industrial Electronics,
compared to K-means algorithm. In conclusion, in this vol. 47, no. 3, pp. 650-667, 2000.
research, we were able to rapidly implement a predictive [17] H. Benitez-Perez, F. Garcia-Nocetti, and H. Thompson, "Fault
maintenance activity using unsupervised learning classification SOM and PCA for inertial sensor drift," in IEEE
methodology with no historical data for fault class prediction International Workshop on Intelligent Signal Processing, 2005.
algorithms and minimal historical data for fault detection [18] H. Li and D. Y. Xiao, "Survey on data-driven fault classification
methods," Control and Decision, vol. 26, no. 1, pp. 1-16, 2011.
algorithms. [19] H. Hotelling, "Analysis of a complex of statistical variables into
principal components," Journal of Educational Psychology, vol. 24,
VI. FUTURE SCOPE OF WORK pp. 417-441, 1933.
[20] J.-H. Cho, J.-M. Lee, S. Wook, D.-w. Choi and I.-B. L. Lee, "Fault
The current work possibly will be extended to different identification for process monitoring using kernel principal
physics-based data such as temperature, pressure, humidity, component analysis," Chemical Engineering Science, vol. 60, no. 1, pp.
acoustics, voltage, and current across different domains. This 279-288, 2005.
would also help in determining the correlation between the [21] J.-H. Shin and H.-B. Jun, "On condition based maintenance policy,"
Journal of Computational Design and Engineering, vol. 2, no. 2, pp.
need for dimensionality reduction algorithms such as PCA and
119-127, 2015.
the accuracy of clustering algorithms. [22] M. Fujimoto and Y. Riki, "Robust speech recognition in additive and
channel noise environments using GMM and EM algorithm," in IEEE
REFERENCES International Conference on Acoustics, Speech, and Signal Processing,
[1] R. K. Mobley, An Introduction to predictive maintenance, 2 ed., 2002. 2004.
[2] I. Guyon, "Design of experiments of the NIPS 2003 variable selection [23] D. Reynolds, "Gaussian Mixture Models," In Encyclopedia of
benchmark," 2003 Biometrics, pp. 659-663, 2009.
[3] Y. Peng, M. Dong, and M. J. Zuo, "Current status of machine [24] S. M. Zhang, F. L. Wang, S. Tan and S. Wang, "A fully automatic
prognostics in condition-based maintenance: a review," The online mode identification method for multi-mode processes”," Acta
International Journal of Advanced Manufacturing Technology, vol. 50, Automatica Sinica, vol. 42, no. 1, pp. 60-80, 2016.
no. 1-4, pp. 297-313, 2010. [25] S. Wegerich, A. Wilks, and R. Pipke, "Nonparametric modeling of
[4] K. Javed, R. Gouriveau, N. Zerhouni and P. Nectoux, "Enabling vibration signal feature for equipment health monitoring," in
Health Monitoring Approach Based on Vibration Data for Accurate Aerospace Conference, 2003.
Prognostics," IEEE Transactions on Industrial Electronics, vol. 62, no. [26] S. Memon, M. Lech, and N. Maddage, "Speaker Verification Based on
1, pp. 647-656, 2015. Different Vector Quantization Techniques with Gaussian Mixture
[5] E. Gundogar, A. Yilmaz and B. Erkayman, "A solution approach to a Models," in Third International Conference on Network and System
synchronization problem in a JIT production system," Production Security, 2009.
Planning and Control. , vol. 25, 2014. [27] S. M, C. P. and B. T., "Supervised and unsupervised learning process
[6] R. C. Walleigh, "What’s Your Excuse for Not Using JIT?”," Harvard in damage classification of rolling element bearings," Diagnostyka,
Business Review, 1983. vol. 17, no. 2, pp. 71-80, 2016.
[7] N. Amruthnath and T. Gupta, "Modified Rank Order Clustering [28] W. Ku, R. H. Storer, and C. Georgakis, "Disturbance detection and
Algorithm Approach by Including Manufacturing Data," in 4th IFAC isolation by dynamic principal component analysis," Chemometrics
International Conference on Intelligent Control and Automation and Intelligent Laboratory Systems, , vol. 30, no. 1, pp. 179-196, 1995.
Sciences, Reims, 2016. [29] C. Fraley and A. E. Raftery, "Model-based Clustering, Discriminant
[8] A. Prajapati, J. Bechtel, and S. Ganesan, "Condition based Analysis and Density Estimation," Journal of the American Statistical
maintenance: a survey," Journal of Quality in Maintenance Association, vol. 97, pp. 611-631, 2002.
Engineering, vol. 18, no. 4, pp. 384-400, 2012. [30] C. Fraley, A. E. Raftery, T. B. Murphy, and L. Scrucca, "mclust
[9] B. Das and J. Reddy, "Fuzzy-logic-based fault classification scheme Version 4 for R: Normal Mixture Modeling for Model-Based
for digital distance protection," IEEE Transactions on Power Delivery, Clustering," Classification, and Density Estimation Technical Report
vol. 20, no. 2, 2005 No. 597, Department of Statistics, University of Washington, 2012.
11
[31] M. Charrad, N. Ghazzali, V. Boiteau and A. Niknafs, "NbClust: An R Maintenance," in 5th International Conference on Industrial
Package for Determining the Relevant Number of Clusters in a Engineering and Applications, Singapore, 2018.
DataSet.," Journal of Statistical Software, vol. 61, no. 6, pp. 1-36, [33] N. Amruthnath and P. M. Prathibhavani, "Data Security in Wireless
2014. Sensor Network using Multipath Randomized Dispersive Routes.,"
[32] N. Amruthnath and T. Gupta, "A Research Study on Unsupervised International Journal of Scientific & Engineering Research, vol. 5, no.
Machine Learning Algorithms for Early Fault Detection in Predictive 1, 2014.
12