Professional Documents
Culture Documents
Vibration Spectrum Imaging a Novel Bearing Fault Classification Approach
Vibration Spectrum Imaging a Novel Bearing Fault Classification Approach
1, JANUARY 2015
Fig. 4. Enhanced VSI of an exemplary motor vibration signal. Fig. 5. VSI after binary conversion.
III. F EATURED I MAGE E NHANCEMENT This binary conversion of any pixel value index by ind for
the f dı image works as
The grayscale spectral image obtained in the previous section
contains the information of characteristic frequencies of the 1 f dı (ind) ≥ thr
f bı (ind) = . (6)
bearing health, as shown in Fig. 3. There are a few bright or 0 f dı (ind) < thr
high amplitude vertical continuous spectral lines and a few
with lesser amplitude or brightness in the image. This image Both the Avg_F ilter and the binary thr help in retaining
is treated using the 2-D averaging filter (3) for feature enhance- featured patterns and removing the noise patterns. For this
ment and noise mitigation purpose, the dimension of the filtration matrix Z and the thr
value are very important for spectral enhancement purpose.
f dı = Avg_F ilter(f ı , Z). (3) The optimal size of the filter and the thr value are explained
in detail in Section V. The binary-converted-spectral image of
In (3), Z is a matrix of (rows × columns) dimensions, with the exemplary signal, using an appropriate threshold value, is
each element of the matrix having a unit value. A new value shown in Fig. 5. It can be seen that insignificant values are
of any pixel p of the image is calculated by averaging rows filtered out from the image. This enhanced image is called
(number of vertical neighboring pixel amplitudes) and columns featured image and can be fed to the neural network to learn
(number of horizontal neighboring pixel amplitudes) by keep- the important features in this image for fault classification.
ing p in the center. The size of the matrix has both vertical and
horizontal averaging effects on the image, and its dimension
IV. ANN
selection plays an important role in image enhancement under
low SNR. Using larger row and column values causes too much ANN classifiers are a biologically inspired nonlinear empir-
augmentation of the features because of averaging; in contrast, ical model [6]. ANN design includes the input layer, hidden
using smaller values achieves insignificant augmentation. The layers, and output layer. The number of input layer neurons is
selection of dimensions of Z and its effects on feature aug- equal to the input features and the number of output neurons is
mentation will be discussed in Section V. Depending upon equal to the number of classes. There can be one or more than
the dimension of the Z matrix, the Avg_Filter does the 2-D one hidden layers with different numbers of neurons. Most of
averaging, horizontal and vertical in the image. Thus, the filter the times, increasing the number of hidden layer neurons guar-
makes the continuous lines or coherent features prominent by antees good learning, but it results in increased computational
filling the missing values with the average and mitigating the cost. Therefore, the minimum number of the hidden layers
random dots and insignificant lines in the image by averaging and neurons for certain classification accuracy is preferred.
out the contents over the filter size. As a result, averaging Training of ANN can be supervised or unsupervised, depending
will enhance coherent or featured patterns and will depreci- upon the nature of the problem [2], [9]. In this paper, we are
ate incoherent or noise spectral contents of Fig. 3 as shown using supervised learning as, for each input feature pattern, we
in Fig. 4. have an associated target output. In this paper, input feature
These depreciated patterns or noise can be eliminated, and patterns are spectral images, and targets are fault classes. Before
augmented features can be retained by converting the grayscale starting the training, the available data are divided into three
image into a binary one, using an appropriate threshold thr sets: training, validation, and testing. Mean square error (MSE)
is used to measure the accuracy of the training. The steepest
f bı = Binary(f dı , thr). (4) descent method is used to adjust the weights and biases. The
next section will discuss the experimental setup and ANN
The optimum binary thr is calculated using the thr cost
classification accuracy under different SNR conditions.
function
0
n
V. E XPERIMENTAL R ESULTS AND D ISCUSSION
Cost(thr) = (BIim − T Iim,db,thr ). (5)
db=−d im=1 A. Experimental Setup
Here, d is the minimum decibel value for a particular Fig. 6 depicts the experimental setup for recording the
setup/experiment, n is the number of classes, BI is a baseline actual vibration data sets [28]. Experiments have been
image, and T I is a test image. Considering the BI images for conducted with four different bearings, including one normal
each class, (5) is used to calculate the cost of a particular thr and three having faults in their inner race, ball, and outer
value. The thr value that minimizes the cost function over the race. Bearing specifications are as follows (specified both
range of decibel values for a given number of classes is selected in inches and millimeters): inside diameter = 0.9843 in
as the optimum thr for binary conversion in (4). (25.001 mm), outside diameter = 2.0472 in (51.999 mm),
Authorized licensed use limited to: National Engineering College. Downloaded on July 05,2024 at 09:48:08 UTC from IEEE Xplore. Restrictions apply.
AMAR et al.: VIBRATION SPECTRUM IMAGING: NOVEL BEARING FAULT CLASSIFICATION APPROACH 497
Fig. 15. Enhanced spectral images with averaging filter of (8 × 4) at Fig. 18. thr = 0.7, SNR = −10 dB, and Z dimensions = (2 × 2).
SNR = −10 dB.
Fig. 19. thr = 0.7, SNR = −10 dB, and Z dimensions = (4 × 4).
Fig. 16. Enhanced image of −10-dB SNR with averaging filter of
(8 × 4) converted to binary image with thr = 0.7.
Fig. 20. thr = 0.7, SNR = −10 dB, and Z dimensions = (8 × 8).
Fig. 17. Binary image with thr = 0.7 without averaging filter.
2) The row value has effects on filling the missing values
Then, Fig. 16 shows the binary converted spectral images of among the stacked segments and mitigating the noise.
Fig. 15 using thr = 0.7. Fig. 17 shows the binary converted With a larger row value, the missing value can be better
spectral images of Fig. 13 with the same threshold thr = estimated, and larger reduction in noise can be achieved.
0.7 without applying the averaging filter. Now, by looking at However, a larger row value truncates the featured image
Figs. 16 and 17, it is clear that the averaging filter enhances from top and bottom as shown in Fig. 21. Smaller row
the image by brightening the fault signature features and by values do not estimate the missing values accurately as
mitigating the noise of the spectral images. shown in Fig. 18. Therefore, a compromised row value
Let us discuss the dimensions of the filter size. Figs. 18–22 is required which gives us optimum results, and that
show the binary converted images processed with different value is 8.
averaging filter sizes. The averaging filter is a 2-D filter with Thus, comparing Figs. 18–22 with Fig. 16, it is evident that,
both horizontal and vertical effects on the image. with the averaging filter of (8 × 4), we get better enhancement
1) With a larger column value, a larger number of adjacent of features and noise mitigation.
frequencies in the spectrum will be affected by the av- 2) Binary Threshold Value: After determining the size of
eraging as shown in Fig. 22. Thus, a maximum column the averaging filter, now, we will discuss about the threshold
value that avoids the merging of the adjacent frequencies value for binary conversion. Figs. 23 and 24 show the effects of
is preferred, and that value is 4 in this case. the threshold values of thr = 0.5 and thr = 0.9 with the use
Authorized licensed use limited to: National Engineering College. Downloaded on July 05,2024 at 09:48:08 UTC from IEEE Xplore. Restrictions apply.
500 IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, VOL. 62, NO. 1, JANUARY 2015
Fig. 21. thr = 0.7, SNR −10 dB, and Z dimensions = (16 × 16). Fig. 24. thr = 0.5, SNR = −10 dB, and Z dimensions = (8 × 4).
Fig. 22. thr = 0.7, SNR = −10 dB, and Z dimensions = (16 × 4).
optimum choice to remove the noise under poor SNR and still
ensures good classification by retaining the spectral features.
3) Costs Associated With Average Filtering and Binary
Threshold: There are two noise sources in the spectral images:
intentionally added AWGN and processing noise. Processing
noise is the error caused by different enhancement procedures
in the spectral images. The total noise or error cost associated
with the test image relative to the baseline image can be studied
Fig. 23. thr = 0.9, SNR = −10 dB, and Z dimensions = (8 × 4). by modifying (5) into (7)
( nim=1 abc(BIim − T Iim ))
of the averaging filter of size (8 × 4). Comparing these figures Cost(db) = . (7)
N orm
with Fig. 16, it is clear that using a higher thr value results in
the loss of features and a lesser thr value results in inefficient N orm is the normalization factor and is equal to the product
noise removal. After a number of studies using different thr of the number of image pixels and the number of classes. If
values, it was found that thr = 0.7 is a better choice for binary we have a white test image and a black baseline image or
conversion. vice versa, then (7) will give us the cost of 1, which is the
A visually inspected optimized thr = 0.7 value can be con- maximum possible error or cost. As similarity between baseline
firmed by (5). The plot of the cost function in (5) over the thr and test images increases, the cost value decreases. Therefore,
range of 0–1 with n = 4 and d = 12 is shown in Fig. 25 using the cost of 1 corresponds to maximum dissimilarity, and that of
baseline images of four classes with the averaging filter of (8 × 0 corresponds to maximum similarity, indicating the absence of
4). It is evident from the plot that the cost function minimizes AWGN and processing noise. If we know the cost of AWGN,
at the thr = 0.7 value. then the scale of 0 to 1 can be used to estimate the processing
Fig. 25 indicates that 0.7 minimizes the cost function. The noise caused by image enhancement procedures for a spectral
cost of the function is high before and after thr = 0.7. Costs image at any given SNR. Fig. 26 shows the error curves of
at thr values lesser than 0.7 point toward excessive noise in different image enhancement procedures in comparison to no-
the images, and costs at values greater than 0.7 indicate poor enhancement reference curve, i.e., AWGN only at any SNR
fault features. Thus, binary conversion with thr = 0.7 is the level.
Authorized licensed use limited to: National Engineering College. Downloaded on July 05,2024 at 09:48:08 UTC from IEEE Xplore. Restrictions apply.
AMAR et al.: VIBRATION SPECTRUM IMAGING: NOVEL BEARING FAULT CLASSIFICATION APPROACH 501
TABLE I
C LASSIFICATION ACCURACY W ITH SNR
To validate the robustness of the VSI features, ANN was [9] J. F. Martins, V. F. Pires, and A. J. Pires, “Unsupervised neural-network-
trained with WT features proposed in [7] for comparison. These based algorithm for an on-line diagnosis of three-phase induction motor
stator fault,” IEEE Trans. Ind. Electron., vol. 54, no. 1, pp. 259–264,
features gave 100% accuracy at 0 dB with high hit probability Feb. 2007.
and low false alarm probability, but at −10 dB, it had an overall [10] C. S. Tyagi, “A comparative study of SVM classifiers and artificial neu-
hit probability of 0.38 and a false alarm probability of 0.15. This ral networks application for rolling element bearing fault diagnosis us-
ing wavelet transform preprocessing,” in Proc. World Acad. Sci., Eng.
indicates that these features are not performing well and are Technol., Sep. 2008, vol. 45, pp. 319–327.
unable to discriminate among different classes with ANN under [11] F. Filippetti, G. Franceschini, C. Tassoni, and P. Vas, “Recent develop-
poor SNR and hints that VSI features are robust when used with ments of induction motor drives fault diagnosis using AI techniques,”
IEEE Trans. Ind. Electron., vol. 47, no. 5, pp. 994–1004, Oct. 2000.
ANN even under poor SNR in contrast to the features in [7]. [12] C. Chen, B. Zhang, and G. Vachtsevanos, “Prediction of machine health
The translation-invariant nature of FT spectral contents, in condition using neuro-fuzzy and Bayesian algorithms,” IEEE Trans.
contrast to WT, and image processing, with visual aid for Instrum. Meas., vol. 61, no. 2, pp. 297–306, Feb. 2012.
[13] Y. Jun-rong, Y. Min, C. Xia, and H. Yan, “Fault diagnosis of rolling
feature enhancement, has helped to increase the classification bearing based on rough set and neural network,” Appl. Mech. Mater.,
accuracy under low SNR. The averaging filter size and binary vol. 58–60, pp. 974–977, Jun. 2011.
threshold value are very crucial parameters for this reliable [14] M. Xia, F. Kong, and F. Hu, “An approach for bearing fault diagnosis
based on PCA and multiple classifier fusion,” in Proc. 6th IEEE Joint Int.
fault classification. The 2-D averaging filter simultaneously ITAIC, Aug. 2011, pp. 321–325.
mitigates the incoherent noise and augments the coherent fault [15] G. G. Yen and K.-C. Lin, “Wavelet packet feature extraction for vibration
features even under low SNR. Applying an appropriate binary monitoring,” IEEE Trans. Ind. Electron., vol. 47, no. 3, pp. 650–667,
Jun. 2000.
threshold removes the mitigated noise and keeps the augmented [16] S. K. Goumas, M. E. Zervakis, and G. S. Stavrakakis, “Classification of
fault signatures and thus provides feature enhancement that washing machines vibration signals using discrete wavelet analysis for
helps to achieve improved classification accuracy. feature extraction,” IEEE Trans. Instrum. Meas., vol. 51, no. 3, pp. 497–
508, Jun. 2002.
[17] X. Lou and K. a Loparo, “Bearing fault diagnosis based on wavelet
transform and fuzzy inference,” Mech. Syst. Signal Process., vol. 18, no. 5,
VI. C ONCLUSION pp. 1077–1095, Sep. 2004.
[18] S. Seker and E. Ayaz, “Feature extraction related to bearing damage in
In this paper, VSI and ANN-based bearing fault classification electric motors by wavelet analysis,” J. Franklin Inst., vol. 340, no. 2,
into the inner race, outer race, and ball faults against the pp. 125–134, Mar. 2003.
[19] F. Li, G. Meng, L. Ye, and P. Chen, “Wavelet transform-based higher-order
usual behavior has been presented. The spectral contents of statistics for fault diagnosis in rolling element bearings,” J. Vib. Control,
the translation-variant time-segmented vibration signal, trans- vol. 14, no. 11, pp. 1691–1709, Nov. 2008.
formed into a spectral image, have been processed for feature [20] B. Samanta and K. R. Al-Balushi, “Artificial neural network based fault
diagnostics of rolling element bearings using time-domain features,”
enhancement using the 2-D averaging filter and grayscale to Mech. Syst. Signal Process., vol. 17, no. 2, pp. 317–328, Mar. 2003.
binary image conversion. These enhanced featured images are [21] A. Malhi and R. X. Gao, “PCA-based feature selection scheme for ma-
then used to train the ANN classifier. Results have shown that chine defect classification,” IEEE Trans. Instrum. Meas., vol. 53, no. 6,
pp. 1517–1525, Dec. 2004.
ANN with VSI has learned the complex known and unknown [22] K. L. V. Iyer, X. Lu, Y. Usama, V. Ramakrishnan, and N. C. Kar, “A
patterns with 96.90% accuracy, outperforming the existing twofold Daubechies-wavelet-based module for fault detection and voltage
techniques by 5.67% at the worst case of SNR = −10 dB. regulation in SEIGs for distributed wind power generation,” IEEE Trans.
Ind. Electron., vol. 60, no. 4, pp. 1638–1651, Apr. 2013.
[23] M. Blodt, P. Granjon, B. Raison, and G. Rostaing, “Models for bearing
damage detection in induction motors using stator current monitoring,”
R EFERENCES IEEE Trans. Ind. Electron., vol. 55, no. 4, pp. 1813–1822, Apr. 2008.
[1] Motor Reliability Working Group, Power Systems Reliability Subcom- [24] Y. Zhang, C. Bingham, M. Gallimore, Z. Yang, and J. Chen, “Machine
mittee, Power Systems Engineering Committee, Industrial and Commer- fault detection during transient operation using measurement denoising,”
cial Power Systems Department, IEEE Industry Applications Society, in Proc. IEEE Int. Conf. CIVEMSA, Jul. 2013, pp. 110–115.
“Report of large motor reliability survey of industrial and commercial [25] C. T. Yiakopoulos, K. C. Gryllias, and I. A. Antoniadis, “Rolling element
installations, part I,” IEEE Trans. Ind. Appl., vol. IA-21, no. 4, pp. 853– bearing fault detection in industrial environments based on a K-means
864, Jul. 1985. clustering approach,” Expert Syst. Appl., vol. 38, no. 3, pp. 2888–2911,
[2] B. Li, M.-Y. Chow, Y. Tipsuwan, and J. C. Hung, “Neural-network-based Mar. 2011.
motor rolling bearing fault diagnosis,” IEEE Trans. Ind. Electron., vol. 47, [26] V. Sugumaran and K. I. Ramachandran, “Fault diagnosis of roller bearing
no. 5, pp. 1060–1069, Oct. 2000. using fuzzy classifier and histogram features with focus on automatic rule
[3] A. Bellini, F. Immovilli, R. Rubini, and C. Tassoni, “Diagnosis of bearing learning,” Expert Syst. Appl., vol. 38, no. 5, pp. 4901–4907, May 2011.
faults of induction machines by vibration or current signals: A critical [27] M. Amar, I. Gondal, and C. Wilson, “Multi-size-window spectral augmen-
comparison,” in Conf. Rec. IEEE IAS Annu. Meeting, Oct. 2008, pp. 1–8. tation: Neural network bearing fault classifier,” in Proc. IEEE 8th Conf.
[4] M. Amar, I. Gondal, and C. Willson, “Unitary anomaly detection for ICIEA, Jun. 2013, pp. 261–266.
ubiquitous safety in machine health monitoring,” in Proc. Neural Inf. [28] Bearing Data Center, Jan. 2009. [Online]. Available: http://csegroups.
Process., 2012, pp. 361–368. case.edu/bearingdatacenter/pages/download-data-file
[5] S. Hayashi, T. Asakura, and S. Zhang, “Study of machine fault diag- [29] F. Immovilli, C. Bianchini, M. Cocconcelli, A. Bellini, and R. Rubini,
nosis system using neural networks,” in Proc. IJCNN, 2002, vol. 1, “Bearing fault model for induction motor with externally induced vi-
pp. 956–961. bration,” IEEE Trans. Ind. Electron., vol. 60, no. 8, pp. 3408–3418,
[6] H. Su and K. T. Chong, “Induction machine condition monitoring using Aug. 2013.
neural network modeling,” IEEE Trans. Ind. Electron., vol. 54, no. 1, [30] S. Lessmann, B. Baesens, C. Mues, and S. Pietsch, “Benchmarking clas-
pp. 241–249, Feb. 2007. sification models for software defect prediction: A proposed framework
[7] M. F. Yaqub, I. Gondal, and J. Kamruzzaman, “Inchoate fault de- and novel findings,” IEEE Trans. Softw. Eng., vol. 34, no. 4, pp. 485–496,
tection framework: Adaptive selection of wavelet nodes and cumulant Jul./Aug. 2008.
orders,” IEEE Trans. Instrum. Meas., vol. 61, no. 3, pp. 685–695,
Mar. 2012.
[8] Y. Yang and W. Tang, “Study of remote bearing fault diagnosis based on
BP neural network combination,” in Proc. 7th ICNC, Jul. 2011, vol. 2, Authors’ photographs and biographies not available at the time of
pp. 618–621. publication.
Authorized licensed use limited to: National Engineering College. Downloaded on July 05,2024 at 09:48:08 UTC from IEEE Xplore. Restrictions apply.