Professional Documents
Culture Documents
Bearing Diagnostics Based On Pattern Recognition of Statistical Parameters
Bearing Diagnostics Based On Pattern Recognition of Statistical Parameters
Bearing Diagnostics Based On Pattern Recognition of Statistical Parameters
com/
Vibration and Control
Published by:
http://www.sagepublications.com
Additional services and information for Journal of Vibration and Control can be found at:
Subscriptions: http://jvc.sagepub.com/subscriptions
Reprints: http://www.sagepub.com/journalsReprints.nav
Permissions: http://www.sagepub.com/journalsPermissions.nav
Citations: http://jvc.sagepub.com/content/6/3/375.refs.html
What is This?
FENGFENG XI
Integrated Manufacturing Technologies Institute, National Research Council Canada, 800 Collip
Circle, London. Ontario N6G 4X8, Canada
QIAO SUN
Department of Mechanical and Manufacturing Engineering, University of Calgary, 2500 Univer-
sity Drive, N. W., Calgary, Alberta T2N 1N4, Canada
GOVINDAPPA KRISHNAPPA
Integrated Manufacturing Technologies Institute, Western Laboratory, National Research Council
Canada, 3250 East Mall, Vancouver, B. C. V6T 1 W5, Canada
(Received 31 August 1998; accepted19 January 1999)
Abstract: In this paper, a new bearing defect diagnostic and classification method is proposed based on pattern
recognition of statistical parameters. Such a pattern recognition problem can be described as transformation
from the pattern space to the feature space and then to the classification space. Based on trend analysis of
six commonly used statistical parameters, four parameters, namely, RMS, Kurtosis, Crest Factor, and Impulse
Factor, are selected to form a pattern space. A 2-D feature space is formulated by a nonlinear transformation.
An intraclass transformation is used to cluster the data of different bearing defects into different regions in the
feature space. The classification space is constructed by piecewise linear discriminant functions. Training the
classification space is performed, in this paper, by using data of bearings with seeded defects. Diagnosis of
the defected bearings in the classification space then becomes straightforward. Numerical experiments show
that the proposed method is effective in indicating both the location and the severity of bearing defects.
1. INTRODUCTION
The subject of rolling bearing diagnostics has been studied over the past 25 years because they
are used in all rotating machinery from small, medium, to large size (Rao, 1996). Common
failure of rolling bearings includes spalling, corrosion, brinelling, and so on. These defects
induce repetitive vibrations when bearing elements encounter them (McFadden and Smith,
1984a, 1984b). The objective of bearing diagnostics is to identify the type of defects and the
severity by means of the technologies involving measuring and processing of these defect-
induced vibration signals.
375-
multiple defects. Time domain analysis of the envelope signal may overcome the weakness
of spectrum analysis, as statistical parameters can provide information such as the shape of
the amplitude probability distribution and the energy level of the vibration signals. Much
research work has been done in using these parameters individually to detect the bearing
defects, and the results have shown that each parameter is only effective for certain defects.
For example, spikiness of the vibration signals indicated by CrestFactor and Kurtosis implies
incipient defects, while the high energy level given by the value of RMS and Peak indicates
severe defects.
discriminant functions that are further used to generate piecewise linear boundaries in the
classification space. The proposed method is tested with data provided by the Association of
American Railroads. Results show that it is an effective method for bearing diagnostics.
Note that this description is different from that given by Andrews (1972), in that we include
the measurement space. As will be discussed in the following, this allows us to construct the
pattern space by means of certain analysis methods, such as statistics, rather than simply by
the measurement data.
As the title may suggest, measurement space is constructed directly by the measurement data.
If a finite dimension N is considered, the following column data vector
represents a series of measurements with N digitized data samples. Hence, the measurement
space is formed by N axes, and vector s represents a point in the space. To establish the
classifiable patterns, learning techniques should be applied. We adopt a supervised learning
procedure that uses prototype data of each class to be identified to establish the classifiable
patterns. For each class, we may need a set of prototype data samples. Therefore, a vector in
the measurement space representing the mth prototype of class k is expressed as
where m =
1, ... , Mk. M~ denotes the number of prototype data vectors for the kth class.
For given classes, the total number of data sets, Nd, is given by
K
According to Andrews (1972), the pattern space was simply represented by the measurement
data. We extend it here, so that it could also be defmed by means of certain analysis methods
such as statistics. This means that after this process, the data vector s becomes the following
new vector
where subscript p indicates the pattern space. Dimension of the pattern space is denoted as
R, where R < N. A prototype in the pattern space is given by the following vector
The problem of pattern recognition is to correctly classify the known prototypes and
afford some degree of confidence based on certain criteria. The solution is based on the
assumption that the pattern space forms a metric space that satisfies the following conditions
(Andrews, 1972):
where d denotes a distance function and a, b and c represent three vectors in the pattern space.
In terms of the metric space, the similarity of a point xp to the k-th class can be measured by
This similarity measure is an average of the squared distance between the pointx and the set
of prototypes (k) .
To ensure that we compare vectors with the same unit, normalization of
measurement data is required, which is referred to as squaring up the pattern space. A simple
way is to divide measurement data by their variance, known as variance normalization.
Feature space is the intermediate domain between the pattern space and the classification
space, with its dimension denoted by L. The data vector x f and the prototype vector
the feature space are defined as -
in y
and
This space serves two purposes. The first is to reduce the dimensions of the pattern
space so that a classification algorithm can be efficiently computed and the results can be
readily presented. The second purpose is to combine the dimensions of the pattern space so
that classification characteristics can be extracted. The process of selecting a feature space
is to find a transformation either linear or nonlinear to reduce the dimension of the pattern
space, yet maintaining discriminatory characteristics for the classification purpose. There
are a number of transformation methods. The one adopted in this paper is the intraclass
The classification space is referred to as the separating surfaces in the feature space. For
given K classes, there will be K separating surfaces. The separating surfaces are hyperplanes
in the L - 1 dimension. A classification algorithm is to define the space partition in the L
dimensional feature space into disjoint regions, with each region associated with one class.
The separation is a point on a line for L 1, a line on a plane for L
=
2, and a plane in a =
volume for L =3. Since the data are usually not linearly separable, space partition generally
results in nonlinear surfaces. For example, the boundary of the classification space forL 2 =
The idea of pattern recognition of statistical parameters is first to construct the pattern space
based on statistics analysis and then perform pattern recognition analysis using statistical
parameters. In this section, we describe how to select the statistical parameters to form the
pattern space for bearing diagnostics. In the following two sections, we will describe how to
construct the feature space and classification space.
as a normalizing parameter. Based on our experiment, we have found that RMSo can improve
the robustness of the energy level parameters, Peak and RMS, against variation in the bearing
operating conditions. RMSo is determined by considering all the possible loads and the
rotating speeds for the bearing under study In terms of RMSo, we can rewrite equations
(3) through (8) and define the following normalized vector z
It has been shown in the literature (Howard, 1994) that Peak and RMS values directly
reflect the energy level of the vibration signals. Since the localized bearing defects result in
structural vibrations, these two parameters are generally used to indicate the presence and
severity of the bearing defects. CrestFactor and Kurtosis are seen less dependent on the
vibration level but sensitive to the spikiness of the vibration signals. As such, they can provide
early indication of significant changes in the vibration signals. As the damage increases, the
vibration signals become more random, and the values of CrestFactor and Kurtosis could
decrease to the undamaged level. ImpulseFactor and ShapeFactors have similar effects like
CrestFactor and Kurtosis.
*
+, #, and indicate similarity
Probably the most important aspect of pattern recognition is selection of feature space. Proper
and efficient feature extraction allows large dimension reduction, yet retaining as much as
possible the useful information. Constructing a feature space by a planar image is especially
advantageous owing to the fact that a planar image of data can be readily perceived and
analyzed by a human observer. For bearing diagnostics, we start by recording the vibration
signals using acoustic sensors or accelerometers. The raw data are a series of numbers
representing the amplitude of signals at discrete times. Although by using the aforementioned
statistical parameters and trend analysis, dimension is reduced to four, it is desirable that two
indices should be extracted, which gives a combination of the other parameters.
We consider two key factors that statistical parameters might give us about the bearing
vibration signals. One is the spikiness, and the other includes the shape of the amplitude
distribution density and the energy level of the vibration. The first one is straightforward
and can be provided by CrestFactor, Kurtosis, and ImpulseFactor. We choose Kurtosis in
view of its robustness in variation with the operating conditions, as indicated in Figures 2(a)
and (b). Based on the studies reported in the literature (Howard, 1994), the shape of the
amplitude density distribution can be reflected by the statistical parameters pertinent to the
impulsiveness of the signal, that is, CrestFactor, Kurtosis, and the ImpulseFactors. Therefore,
the second feature is selected as
By substituting equation (5) for CrestFactor into equation (11) and using a logarithmatic scale
for the second feature, the vector defming the feature space is given as
&dquo;I
The dimension of our feature space is two, that is, L = 2. Accordingly, the prototype vector
is defined as
Note that the actual statistical parameters used in the computation of our approach are Peak,
RMSo, Kurtosis, and ImpulseFactor.
Figure 3 shows the feature space constructed using equation (12) considering a range
of operating speed and load conditions for six different types of bearing defects. It can
be seen that samples representing different groups are scattered and overlapped. This will
impose difficulties in classification. For this reason, the intraclass transformation is applied
to introduce the clustering effects on the samples within the same class.
Variances Q and or are determined, respectively, by the first and second row of the following
data matrix y(k) formed by M~ prototypes
Through the intraclass transformation, the mean square intraset distance of the kth class
is minimized (Andrews, 1972). It can be seen from equation (15) that the two coordinate
dimensions are inversely proportional to the variance of their own dimension. This can
be interpreted as stating that a small weight is to be given to those coordinates with large
variances because these particular coordinates have little in common over the prototypes of
the kth class. For those dimensions with near constant values, the variance will be small,
which will imply large weighting. Figure 4 shows the effects of the intraclass transformation
on the same set of data used to obtain Figure 3. Clearly, the samples belonging to the same
where W is defined in the same way as equation (15); except in this case, the variance will
be determined based on x f spanning a number of samples.
Recall that the problem of forming the classification space is to partition the feature space.
For given K classes, Sl, ... , Sr ... , SK , mathematically, this problem is to find a function
that can measure each point in the feature space in terms of its degree of membership to a
given class. This function is called the discriminant function in pattern recognition and is
defined such that for all points xf in the feature space within the region describing S~ , there
exists a function gk (x) such that
In other words, within the region, the kth discriminant function will have the largest value.
The piecewise linear discriminant function is used, which can approximate the nonlinear
boundaries separating the different class regions. This function is defmed by the minimum
distance between a point x f and the prototype points in class Sk
Point x f belongs to class Sk if the distance is minimum. The classification then comes to
determine the smallest distance between all of the prototypes of Sk and the unknown xf .
Mathematically, this can be written as
The boundaries separating the different class regions are determined by the following
equation:
Figure 5 shows the boundaries determined by equation (22), providing a very distinct partition
of the different class regions for the different bearing defects.
6. CASE STUDY
different loads, 8K and 33K pounds for the first type and 8K and 27.5K pounds for the second
type. The wheel speed tested was from 25 to 80 mph. The data used were the vibration signals
measured from the bearing housing and sampled at 523 kHz. Table 2 summarizes the bearing
condition classes considered in our study, and Figure 9 shows the pictures of the defects.
Figure 9(a). Pictures of bearing defects (Courtesy of the Association of American Railroads): Broken
roller.
Figure 9(b). Pictures of bearing defects (Courtesy of the Association of American Railroads): Single
outer race spall.
Figure 9(c). Pictures of bearing defects (Courtesy of the Association of American Railroads): Multiple
innerrace spall.
Figure 9(d). Pictures of bearing defects (Courtesy of the Association of American Railroads): Multiple
outer race spall.
Bearing diagnostics was carried out by using our test program written in MATLAB, according
to the computation procedure summarized in Figure 1. First, the statistical parameters were
calculated using equations (3) through (8). Each sample was then located in the feature space
according to equation (12) and shown in Figure 3. Through the intraclass transformation
(equation (14)), the prototypes were clustered in the feature space as shown in Figure 4. We
then constructed the six linear discriminant functions by equation (21) and determined the
classification space, as shown in Figure 5, through learning from the prototype data of the
six classes. It is interesting to note, from Figure 5, that severity of the bearing damage can
be shown by the migrating trend of class 2 to class 3, and of class 4 to class 5. Referring to
Table 2, class 2 and class 3 belong to the same type of defects, except that the latter represents
more severely damaged bearings. This is also true with class 4 and class 5. As such, severity
of the bearing damage is indicated by associating testing samples into different regions in the
classification space.
To test the effectiveness of this method, we used a set of data with an unknown type of
defect. We used our test program and located the testing data in the region of broken rollers,
as shown in Figure 6 (the &dquo;+&dquo; symbol). The result was verified by checking the bearing,
which indeed had a broken roller defect.
7. CONCLUSIONS .
A bearing diagnostics method has been developed based on pattern recognition of statistical
parameters. This method is unique in that it combines the commonly used statistical
parameters to provide an effective tool for bearing diagnostics, and it is simple both
conceptually and computationally. The method has been tested and the results show that
it is an effective method for bearing diagnostics, particularly in providing a simple way
for a human observer to visualize the diagnostics results on a two-dimensional plane of the
computer.
Acknowledgments. The msearrhreported here was possible under the financial support fium the Transport
Canada and the experiment support from the Association of American Railroads.
REFERENCES
Andrews, H. C., 1972, Mathematical Techniques in Pattern Recognition, Wiley-Interscience, New York.
Braun, S., 1986, Mechanical Signature Analysis, Academic Press, London.
Howard, 1., 1994, A Review of Rolling Element Bearing Vibration Detection, Diagnosis and Prognosis, Defense
Science and Technology Organization, Australia.
McFadden, Smith, J. D., 1984a, "Model for the vibration produced by a single point defect in a rolling element
P. D. and
bearing," Journal of Sound and Vibration 966
(1), 69-82.
McFadden, P. D. and Smith, J. D., 1984b, "Model for the vibration produced by multiple point defects in a rolling
element bearing," Journal of Sound and Vibration 966 (2), 263-273.
Rao, B.K.N., 1996, Handbook of Condition Monitoring, Elsevier Advanced Technology, Oxford, UK.
Yoshioka, T, 1993, "Detection of rolling contact sub-surface fatigue cracks using acoustic emission technique,Journal
of the Society of Tribologists and Lubrication Engineers 4 (4), 303-308.