07005180

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 8

Distributed Neuro-Fuzzy Feature Forecasting

approach for Condition Monitoring


Daniel Zurita, Jess A. Carino, Miguel Delgado, Juan A. Ortega
MCIA Research Center, Department of Electronic Engineering
Technical University of Catalonia (UPC)
Rbla. San Nebridi n22, Gaia Research Building, 08222 Terrassa, Spain
{daniel.zurita, jesus.carino, miguel.delgado, juan.antonio.ortega}@mcia.upc.edu

AbstractThe industrial machinery reliability represents a of the prognosis model. The prognosis output represents directly
critical factor in order to assure the proper operation of the whole the remaining useful life of the system or the expected time to
productive process. In regard with this, diagnosis schemes based failure. Some significant studies use an Artificial Neural
on physical magnitudes acquisition, features calculation, features Network (ANN) based model and time domain features such as
reduction and classification are being applied. However, in this RMS or Kurtosis to estimate the Remaining Useful Life (RUL)
paper, in order to enhance the condition monitoring capabilities, a [6] , other schemes, as in [7], present a previous feature reduction
forecasting approach is proposed, in which not only the current stage to the RUL estimator.
status of the system under monitoring in identified, diagnosis, but
also the future condition is assessed, prognosis. The novelties of the The second approach can be called discrete forecasting, in
proposed methodology are based on a distributed features which the prognosis is related to the future condition of the
forecasting approach by means of adaptive neuro-fuzzy inference system from a predefined set of conditions. An initial health
system models. The proposed method is validated by means of an assessment block, that is the diagnosis, gives an outcome among
accelerated bearing degradation experimental platform. the defined discrete stages, corresponding to the current
condition, then the prognostic model provides the forecasting
Keywords Artificial intelligence, Condition monitoring, condition or the remaining time till the next state. Most of these
Feature extraction, Fuzzy neural networks, Machine learning, kinds of approaches are based on Hidden Markov Models, as in
Prognosis, Remaining Useful Life, Time domain analysis.
[8], where starting from time domain features from a bearing
I. INTRODUCTION experimental test bench, the methodology estimates prognostic
metrics as the next degradation stage probability.
Unexpected industrial machinery breakdowns cause a high
impact in the factory production parameters, which involves Finally, the third main approach corresponds conceptually to
economic losses, machine repair and replacement costs and a continuous forecasting, where the set of numerical features is
associated labor and down-times losses [1]. Due to this fact, the input to the forecasting block, in order to predict their
manufacturing industries operate more often in rich data expected evolution in time. That is, the forecasting model
environments, in which most of the critical parts of the projects the feature in the future and, then, the future condition
machinery, such as screws, bearings, gears, and others, are is analyzed. Recent studies, as the presented in [9, 10], propose
monitored. However, the acquired information is, mainly, machine prognostic schemes for future fault propagation
associated to thresholds for alarm and machinery stoppage assessment based on neural networks and support vector
purposes, and, in some cases exploited by one of the different machine.
automatic industrial diagnosis schemes proposed since the last Although direct forecasting represents the simplest approach
decades [2-4]. with proper results for systems with a reduced number of
However, one of the current challenges of industrial variables, it is not able to give further information of the
condition monitoring is finding useful data management degradation evolution. Discrete forecasting approaches,
schemes capable of given the future condition of the system however, offers higher accuracy capabilities in both health and
under monitoring, that is, prognosis approaches [5]. The prognosis assessment, but the prognosis results depend critically
information regarding the future condition of a system or the on the performance of the classifier, since the features are only
probability of reliable operation, represents a powerful used in the classifications stage. The continuous approach, based
knowledge able to change considerably the maintenance on feature evolution forecasting, presents a more complex
procedures and increase the manufacturing processes reliability structure due to the fact that each feature should be forecasted
and performance. Although there exists a wide range of individually, which means that the number of models is equal to
techniques and methodologies, the prognosis strategies can be the number of features. As a result, this approach is less used in
divided mainly in three approaches. the literature, because in complex scenarios, where the
interdependences within the different variables is high, the
The first approach is based on direct forecasting. In this relation between the features is not known. However, if there are
approach, the acquired physical magnitudes are characterized by not such dependences among the variables, the models are
a set of representative numerical features, which are the inputs simply, that is, the prognostics can be faced as a multi-regression

2014 IEEE Emerging Technology and Factory Automation (ETFA)


c 2014 IEEE
978-1-4799-4845-1/14/$31.00
problem. This particularity offers more versatility in the design the antecedent consequence statements of fuzzy logic. These
phase of the model, and increases the precision of the prediction. statements provide the condition of the fault being monitored
given the linguistic operating range of the inputs.
The contributions of this work are based on a distributed
forecasting methodology for degradation assessment based on Each node calculates the reason between the i-th activation
continuous forecasting approach. The proposed methodology is variable, and the sum of all the activation variables (4). The
based on a distributed prediction scheme of numerical features outputs of this layer are called normalized activation variables.
calculated from acquired physical magnitudes. Once the In the defuzzification layer, L4, each node computes the
prediction is obtained, the future system condition is assessed by contribution of the i-th fuzzy rule over all the outputs (5). The
a neural based classifier. outputs of each node are named consequents. The nodes of the
output layer, L5, calculate the output signals as the summation of
The paper is organized as follows: Section II introduces the all the input signals (6).
theoretical bases of the Adaptive Network-based Fuzzy
Inference System (ANFIS). In Section III the methodology to
Rule 1: if A1 y 1 1 = 1 + 1 + 1
design the degradation assessment and forecasting algorithm is (1)
Rule 2: if A2 y 2 2 = 2 + 2 + 2
shown. Finally, in Section IV, the experimental results and the
method capabilities are tested and the results are analyzed.
1 = () , = 1,2 1 = 2 () , = 3,4 (2)
II. ADAPTIVE NETWORK BASED FUZZY INFERENCE
SYSTEM 2 = = () (), = 1,2 (3)
There are, currently, different techniques to design
=
3 = , = 1,2 (4)
forecasting models. However, the prognosis schemes are usually 1 + 2
based in the concept of hybrid prognosis; which integrates
different techniques in order to take advantages of each method 4 =
=
( + + ), = 1,2 (5)
involved. In this topic, the most important hybrid system is Wi
5 = = (6)
ANFIS [11]. This method fuses the parametric adaptability of
ANN, and the generalization capabilities of the fuzzy logic.
ANFIS based prognostic systems offers a very reliable and 1
= ( )

2 (7)
robust condition predictor, since it can capture the system =1
dynamic behavior quickly and accurately [12, 13].
The methodology for the training process consists on the use
The neural-fuzzy architecture can be divided in five different of a hybrid algorithm that selects the inputs of the ANFIS system
layers, as shown in Fig. 1. For the concrete architecture shown and evaluates its result. The performance of the ANFIS models
in the figure, this method aims to find a relation between the is evaluated by means of the Root Mean Squared Error (RMSE)
output, z, and a set of inputs, x and y, which are fuzzified by of the overall signal, which is defined by equation (7). Where n
means of the membership functions, A1, A2, B1, and B2. The is the size of the dataset, corresponds to the predicted output,
output is computed regarding the pertinence of the inputs and and yi to the real one.
the two if-then rules as defined in (1).
III. ARCHITECTURE OF THE PROPOSED METHOD
The input layer, L1, is in charge of addressing the input signal
to a node. Each node, i, of each layer, j, is an adaptive node with The objective of the proposed Distributed Feature

an output function, , defined in (2), as the membership values, Forecasting Architecture, DFFA, is to assess the current and
future degradation level of a system by modeling time domain
where Ai and Bi are the degree of pertinence to the membership
based feature evolutions. The future condition will be obtained
functions.
from the assessment of such individual forecasted features. The
proposed condition monitoring approach can be divided in three
different steps, as shown in Fig.2.
1) The feature calculation and selection: aims to calculate
and select depending on selection criteria, the most
suitable subset of features to represent the predefined
degradation conditions.
2) The distributed forecasting step: deals with the
generation, training, and validation of the ANFIS based
forecasting models responsible of predict the evolution
of each selected feature based on a predefined prediction
Fig. 1 Adaptive Network-based Fuzzy Inference System scheme with two
inputs, four membership functions, two rules and one output. horizon, p.
3) The future health assessment: includes the integration of
In the fuzzification layer, L2, the weight of each rule is the distributed prognostic models to assess and predict
computed by means of a fuzzy AND operation that multiplies the current condition, S(t), and future condition of the
the input signals. The outputs represent the activation variables system, S(t+p),by combining the calculated features
of a fuzzy rule (3). The fuzzy rules definition layer, L3, provides before and after the forecasting model respectively.
Fig. 2 Distributed Distributed Feature Forecasting approach for current and forecasted Condition Monitoring assessment. The proposed scheme is formed by three
steps: step 1 - feature calculation and selection, step 2 - individual forecasting models generation and, step 3 - future health assessment.

A. Feature calculation and selection


Starting from the physical magnitudes acquired during the This selection should be made in base of some objective
considered operating conditions of the system, it is required to function; for prognosis and health assessment purposes, there
characterize the signal waveform by numerical features. are different criteria for determining the suitability of a feature.
Although frequency and time-frequency based features can be According to M. Lebold [14] criteria, a good feature must be
calculated, statistical time features show a good trade-off predictable, which means the feature should be monotonically
between signal dynamics and low computational requirements. increasing or decreasing.
Moreover, if it is considered that the system works under Then, the proposed criterion function proposed for selecting
stationary conditions, the statistical time features represents, the best subset of features is the fisher linear discriminant
definitively, proper condition characterization indexes [8]. coefficient. This coefficient is a relation of how separated are the
Some of the most common statistical time features are the considered classes (predefined system conditions), regarding its
mean value (1st moment), the maximum value, the Root Mean variance (how disperse are these classes) as it is mathematically
Square (RMS), the variance (2nd moment), the skewness (3rd represented in in (16), where Fc is the fisher coefficient, SB is the
moment), the kurtosis (4th moment), the crest factor and the between-class scatter matrix which is a measure of the
shape factor. All the mentioned time domain features are listed separation between the projected samples of different classes as
in Table 1, where n is the length of the signal, and xk is the k-th shown in (17).
sample of the signal. However, in order to optimize the set of Taking into account that c is the number of classes, nc-i is the
features used to characterize the signals, usually, feature number of samples in each class, mi is the mean of each class
reduction approaches are applied. This procedure allows to and mc represent the total mean vector, N is the total number of
reduce the amount of data needed for the analysis, reduce samples and xv is the matrix of the calculated features. SW is the
complexity and computational cost. The presented DFFA within-class scatter matrix, which is a measure of the samples
method aims at predicting the evolution of those features that are dispersion corresponding to each class (18), and Si represents the
more related to the degradation mechanism. scatter matrix for each class (19).

Table 1. Time domain features calculated. = (16)


1
Mean Value = (8) = ( )( ) (17)

=1 =1

Max Value = max() (9)
= 2 (18)
=1
Root Mean Square 1

(RMS) = ( )2 (10) = ( ) ( )
=1 (19)

1
Variance 2 = ( )2 (11)
=1

B. Distributed forecasting
Shape Factor =
(SF) 1 (12) The aim is to define prognosis model for each feature able to
| |
=1 forecast how such feature will evolve among time, taking into
Crest Factor account its previous and current values. As it has been aforesaid,
(CF) = (13)
this work proposes a distributed scheme of individual feature
models. This individual analysis is made in order to simplify the
[( )3 ]
Skewness 1 = (14) forecasting procedure, increase the generalization capabilities
3
towards new data and reduce the prediction errors of the whole
Kurtosis
[( )4 ]
(15) system by analyzing each feature individually; then addressing
=
4 the forecasting problem from a distributed point of view.
The ANFIS model for a feature F1, has a single output, which A. Description of the experimental platform
is the predicted feature at a time horizon p, F1 (t+p). This time The experimental datasets come from the bearing failure
horizon should be configured depending on the application, the diagnostic and prognostics platform [17], which provides access
dynamics of the features and the forecasting requirements. The to accelerated bearings degradation tests. The test bench, as it is
inputs of the model are of three types, i.e. current values of the shown in Fig.3, is composed by the speed variation, torque
feature to predict, F1 (t), past values of the feature, F1 (t-n), and transmission and the load profile generation. A cyclic radial load
auxiliary inputs. It should be noted that, generally, auxiliary is applied on the external bearing under test in order to simulate
inputs, are other information that helps the model to improve its its mechanical stress conditions. The experiment starts at a fixed
performance by detecting sudden changes in the input signal, speed condition, and stops when the measured vibration at the
such as data from other features, or more information bearing under test is higher than 20g. It should be noticed that,
characterizing the signal waveform of F1. The decision of in order to speed up the degradation, the applied radial load
including a new input to the model should be evaluated in regard exceeds the maximal load supported by the bearing. During the
with the temporal signature of the signal, the cross influences experiments any kind of failure (inner race, outer race, ball or
between features, and the affectation of this input to the final cage) could occur. This fact allow better representation of a real
RMSE. The output of the model is the same feature F1 projected industrial scenario [18].
over a time horizon p. As a result from this step, an ANFIS
model will be generated, trained and validated for each feature Regarding the test bench instrumentation, two high
selected. A dataset containing the predicted values of the frequency accelerometers (DYTRAN 3035B), are mounted on
features will be the input to the next step of the method. the bearing external race in order to measure the horizontal and
the vertical accelerations. In addition, the monitoring system
C. Future health assessment includes one PT100 to measure the bearings temperature, which
A data fusion strategy is applied in order to combine the is placed near the external ring of the bearing under test. The
information given by the forecasting model and generate a signals are acquired by means of a NI DAQ card; the
diagnosis outcome in regard with the current and future acceleration signals are acquired in successive windows with
condition. It should be noticed that, previously, the systems duration of 1/10 seconds, repeated every 10 seconds, with a
conditions must be defined in order to apply a common sampling frequency of 25.6 Hz. Similarly, the temperature
classifier. signal is acquired every minute with a sampling frequency of 10
Hz.
Although different classifier can be applied, the ANN is
considered as the most convenient option for most of the
problems. ANN are data-driven self-adaptive information
processing method inspired in biological systems, and represents
the most commonly found data-driven technique in the
prognosis literature [15]. It is composed by a number of
interconnected processing elements (neurons) working at the
same time to solve a specific problem. ANN represents a no-
lineal, multivariate and no-parametric algorithm for pattern
recognition and time series modelling applications [16].
The ANN will be trained with the original feature space in
order to detect the specific degradation pattern among the
features. Once the net has been trained, it will be applied with
the original and the predicted outputs of each forecasting model
in order to assess the current and the future degradation levels,
which are the outputs of the presented method.
Fig. 3 Overview of PRONOSTIA experimental platform [17].
IV. EXPERIMENTAL RESULTS
In this section, the proposed methodology is validated using Two different experiments from the PRONOSTIAs
a real experimental data related to bearing degradation database under the same operating conditions have been
evolution. These kinds of tests are known as run-to-failure selected. The first dataset will be used to train both forecasting
experiments. This kind of information, although highly model and the classification algorithm of the proposed method;
significant for electro-mechanical systems monitoring, is the second set will be used to test and validate the performance
difficult to be obtained from real industrial applications. Due to of the method against new data with a similar behavior. The
this fact, in order to obtain such information, long-time characteristics of the selected datasets for this experimental
acquisition (weeks, months or even years depending on the validation could be seen in Table 2.
application), are ideally required. However, in this work, an Three different discrete health states have been identified by
accelerated live test bench has been used, where the elements are analysing the evolution of the vibration amplitude, as shown in
forced to work beyond their nominal values, and then, the Fig.4. These degradation degrees are related to the RUL of the
degradation profiles can be extracted. bearing under test. Accordingly, the first health status (1st class)
corresponds to the normal functioning of the bearing, and goes
from the starting point 100% to the 51 % of the RUL. The second
status (2nd class) represent the degradation procedure of the Good discrimination capabilities between the different
bearing and goes from the 50 to the 6% of the RUL. Finally, the degradation classes can be observed for both data sets; in
critical failure of the bearing (3rd class) is represented the last 5% addition, the degradation samples distribution appears to be
of the RUL. monotonically increasing, which is a sign of suitability of the
Table 2. Main characteristics of the used datasets for the experimental
selected features for assessing the condition of the bearings. It
validation. can be observed that the training set, despite it takes less time
to reach the final degradation level, is more progressive than the
Set Experiment Duration Conditions
validation test. This differences in the dynamics of the sets will
help to assess the proposed methodology performance.
Training Bearing 1_4 14000 sec 1800 Rpm
Validation Bearing 1_1 28000 sec 4000 N

Fig. 4 Proposed classes represented over the RMS value of the X-axis
acceleration. The classes are based on different RUL estimation ranges.

B. Feature calculation and selection


From the three acquired magnitudes of the training set
(acceleration in axes X and Y, and temperature) the time domain
features specified in Table 1 are calculated. Therefore, a total
amount of 24 features is obtained for each measurement. With Fig. 5 Best features calculated over the training and the validation set. Note that
this set of feature, different subsets of features with different the evolution towards the final breakdown of the bearing can be appreciated in
all the features.
dimensions will be evaluated and sorted regarding their
discriminant capabilities in regard with the defined classes.
Then, the subset with the highest discriminant coefficient will be a)
selected, and each feature from this subset will be forecasted
with an individual prognosis model in the next stage. For this
reason, the maximum feature subset dimension has been limited
to three in this application, in order to have a maximum of three
prediction models, which has been validated as a proper
dimensionality to maintain more than the 90% of the data
variance.
As a result of the assessment of the different sets of features
by (16), the best feature subset is obtained corresponding, for this
application, to the variance of the acceleration in X, the variance
of the acceleration in Y, and the maximum value of the b)
temperature. Then, each measurement is characterized by three
features. The temporal signature of the selected features is
shown in Fig.5 for both training (a) and validation (b) datasets.
Analyzing these figures, it must be noticed that the sets
present different temporal evolution. That is, the training set
double the degradation speed of the test set. Moreover, the test
set presents a faster final degradation procedure. Additionally,
the failure occurred in the training set seems to be severe, hence
the final feature amplitude and the slope in the breaking point is
higher. These differences represent the required generalization
capabilities of the models. Accordingly to the results the feature
Fig. 6 Resulting feature space versus the different degradation stages in a) the
space with the selected best set of features is shown in Fig.6 for training set and b) the validation set.
both the training and validation sets.
C. Training procedure of the method The different models have been trained, using the training
The next step in the methodology is to generate and train, for dataset, by means of a hybrid learning algorithm; the epochs
each feature, the ANFIS models with the training set. First of all, during the training procedure have been limited to 20. The
the prediction horizon p of the models should be selected. For resulting performance of the trained models is validated trough
this concrete experiment, the horizon has been configured to the achieved RMSE; the training result for the maximum value
detect the critical failure (3rd class) before it occurs; which means of the temperature is shown, as an example, in Fig.7. Once the
that at least it should be able to predict a 5% of the RUL of the three ANFIS models have been trained, a predicted feature space
bearing. For this application, this consideration means that the can be obtained with the training dataset as can be seen in Fig.8.
prediction horizon is configured with a time of 710 seconds (71 The space present a similar geometry that the one presented in
samples). Fig.6.a with the initial set of features.

The three ANFIS models have been configured with the


same structure, which was defined in section III. For this
concrete application, the total amount of inputs considered is
five and consists on: (i) the current value of the feature, F1(t); (ii)
past values of the feature delayed n1 samples, F1(t-n1); (iii)
another past value of the feature delayed n2 samples, F1(t-n2);
(iv) auxiliary input, the accumulate derivative of the feature
1 () ; (v) auxiliary input, derivative of the most related
feature 2 () . Each input is fuzzyfied by means of two
generalized bell-shaped membership functions. The selection of
the inputs has been done based on experience, recommendations
and automatic searching procedures.
Fig. 8 Predicted feature space resulting from the training of the three ANFIS
In order to automate the model inputs related with the past models using the training set (p=71).
values of the feature, a Genetic Algorithm (GA) has been applied
to find the best past values of the feature, n1 and n2, which After the generation of the ANFIS models, it is time to
minimize the RMSE prediction error of the ANFIS. The found configure the health assessment stage. This health assessment is
values are n1= 51 and n2=94 for the training dataset, and will be based on a neural classifier that detects the degradation pattern
maintained for the validation set. within the features. For this application, a multi-layer ANN has
been configured with one hidden layer which is composed of 6
The derivative input corresponds to the derivative value of hidden neurons. The neurons have been configured with a
the corresponding feature in a specific time window, and sigmoid activation function, which is recommended for
represents a significant feature evolution characterization classification problems involving multiple independent
parameter. It compresses the information about the evolution of attributes. Once the network has been defined and configured, it
the signal, and including other features is interesting for the is trained by means of classical backpropagation algorithm. The
purpose to help the model with the sudden changes of the data used is the initial training set without the application of the
predicted signal. In regard with the influence of the derivative ANFIS models; finally, the ANN is tested by means of the
input, this can be appreciated in Fig.7. There is a loss of validation set.
performance in the model without derivative inputs when the
target signal presents sudden changes, so the utility of the The obtained confusion matrix for the validation set is shown
auxiliary inputs to improve the model performance is in Table 3, the performance of the classification is 89.8% for the
demonstrated. validation set. It can be observed that the critical class (3rd class)
is always well classified, hence its signature is the most
different, and thus its associated pattern is easier to be identified
by the ANN. Furthermore, the ANN presents some misclassified
samples (below 5%) between the classes 1 and 2. This is due to
the fact that run-to-failure experiments present a characteristic
continuous pattern that can cause overlapping between two
consecutive classes. These overlapping areas are known as
conflictive regions.
Table 3. Confusion Matrix extracted from the ANN with the validation set
before the prediction made by the ANFIS model. A total classification rate of
89.8% is achieved with this dataset.

Output\Target Class 1 Class 2 Class 3


Class 1 1166 0 0
Class 2 236 1212 0
Fig. 7 Performance after the training procedure in order to show the influence
of the derivative inputs (5 inputs model with differential, diff) versus a model
Class 3 0 49 140
which only considers current and previous values of the predicted feature (3
inputs).
a) b) 25 c)
30 Real Var x Real Var y 160
Real Max Temp.
Predicted Var x 20 Predicted Var y Predicted Max Temp.
25 150

Max temperature
Variance in y
Variance in x

20 15
140
15
10
130
10
5
5 120

500 1000 1500 2000 2500 500 1000 1500 2000 2500 500 1000 1500 2000 2500
Time in Acq.(1=10s) Time in Acq.(1=10s) Time in Acq.(1=10s)

Fig. 9 Predicted model value versus the real for the validation data set for the features a) Variance in X, b) Variance in Y c) Maximum temperature value.

performance of 85.8% which is good enough to assess the


D. Validation procedure of the method degradation of the bearings.
Once the different models have been configured and trained,
they are validated by means of the validation dataset. The
resulting predicted RMSE of the models can be seen in Table 4.
Table 4. Prediction RMSE of the generated models
Var. in X Var. in Y Max Temp
Training 0.50125 g 1.1868g 0.3005 C
Validation 0.67891 g 0.60178g 1.7102 C

A comparison between the real evolution of the feature and


the predicted one with the proposed forecasting strategy can be
seen in Fig.9.a for the variance of the acceleration in X, Fig.9.b
Fig. 10 Behavior of the generated model for the variance in X with sudden
the acceleration in Y, and Fig.9.c the maximum temperature. All changes of the signal to be predicted.
the models are capable of predicting the outputs of the validation
set with a relative low error with smooth dynamics; however, the
most difficult behaviors to be predicted are those related to
sudden changes of the signal (high derivative). In this sense, the
trained models show the highest error in these points, as shown
in Fig.10, with an achieved RMSE of 2.02 g which represents 3
times the maximum error found in the overall signal. The
predicted feature space from the ANFIS model is shown in
Fig.11 for the validation set. The evolution of the degradation is
modified if compared with the original form (Fig.6.b).
Despite this fact, the classifier is able to find the same
degradation pattern, and thus, it presents a good classification
performance of 86.1%, as shown in the confusion matrix (Table
Fig. 11 Predicted feature space of the validation set, with p=71.
5). In addition, the confusion matrix presents the same structure
that the defined in the training procedure (Table 3), the critical
class (3rd class) is well classified, while the misclassifications are
located in the conflictive regions between consecutive classes.
Table 5. Confusion Matrix extracted from the ANN with the validation set after
the prediction made by the ANFIS model. A total classification rate of 86.1%
is achieved with this dataset.
Output\Target Class 1 Class 2 Class 3
Class 1 1133 231 0
Class 2 3 911 0
Class 3 0 119 140

Additionally, if the prediction horizon is extended to an 8%


of the signal, the resulting feature space is shown in Fig.12. It Fig. 12 Predicted feature space of the validation set, with p=112.
can be seen how the feature space changes the data
representation due to the fact that the error of the model The proposed method shows promising results when its
increases. Furthermore, the neural classifier achieves a applied to systems in which the interdependences between the
different signals is not significant. However, further statistical [3] D. Zurita, M. Delgado, J. A. Ortega, and L. Romeral, "Intelligent
sensor based on acoustic emission analysis applied to gear fault
validation tests should be applied, such the Mariano-Debold
diagnosis," in Diagnostics for Electric Machines, Power Electronics
method proposed in [19] or the non-parametric Wilcoxon [20] and Drives (SDEMPED), 2013 9th IEEE International Symposium
for the further validation of the prediction made by the ANFIS on, 2013, pp. 169-176.
models, regarding the neural classifier, to ensure the [4] S. Hafeez, C. S. H. Zaidi, and A. Siddiqui, "Broken rotor bar
detection of single phase induction motor using Wigner-Ville
classification ratio achieved, ranked based non-parametric test
Distributions," in Emerging Technologies & Factory Automation
should be applied as proposed in [21]. (ETFA), 2013 IEEE 18th Conference on, 2013, pp. 1-7.
[5] P. Maier, M. Sachenbacher, T. Ruhr, and L. Kuhn, "Integrated plan
V. CONCLUSIONS tracking and prognosis for autonomous production processes," in
Emerging Technologies & Factory Automation, 2009. ETFA 2009.
In this work, a distributed feature forecasting condition IEEE Conference on, 2009, pp. 1-7.
monitoring approach has been presented and validated by means [6] A. Heng, S. Zhang, A. C. C. Tan, and J. Mathew, "Rotating
of an accelerated bearing degradation experimental platform. machinery prognostics: State of the art, challenges and
The results of the method show that the designed ANFIS models opportunities," Mechanical Systems and Signal Processing, vol. 23,
are able to deal with the feature evolution forecasting with a low pp. 724-739, 4// 2009.
error value. Also, the proposed neural classifier is able to assess [7] T. Benkedjouh, K. Medjaher, N. Zerhouni, and S. Rechak, "Fault
prognostic of bearings by using support vector data description," in
the current and future status of the bearings with a performance Prognostics and Health Management (PHM), 2012 IEEE
greater than 80% for all the studied cases. Conference on, 2012, pp. 1-7.
[8] A. Soualhi, H. Razik, G. Clerc, and D. D. Doan, "Prognosis of
The main benefit of the proposed method is the Bearing Failures Using Hidden Markov Models and the Adaptive
simplification of the modelling by analyzing the forecasting Neuro-Fuzzy Inference System," Industrial Electronics, IEEE
problem from a distributed point of view. Therefore, as has been Transactions on, vol. 61, pp. 2864-2874, 2014.
seen, it implies a direct reduction of the models complexity, and [9] A. Widodo and B.-S. Yang, "Machine health prognostics using
an increase of the precision while maintain the performance of survival probability and support vector machine," Expert Systems
with Applications, vol. 38, pp. 8430-8437, 7// 2011.
the posterior classifier. However, as it was expected, the error of [10] C. Chaochao, Z. Bin, G. Vachtsevanos, and M. Orchard, "Machine
the method is directly related with the prediction horizon Condition Prediction Based on Adaptive Neuro-Fuzzy and High-
selected, so this parameter should be carefully chosen for each Order Particle Filtering," Industrial Electronics, IEEE Transactions
application. Furthermore, in forecasting applications, time factor on, vol. 58, pp. 4353-4364, 2011.
has a strong influence. Accordingly, quick dynamics or [11] J. S. R. Jang, "ANFIS: adaptive-network-based fuzzy inference
system," Systems, Man and Cybernetics, IEEE Transactions on, vol.
responses that happen in a short period of time are not usually 23, pp. 665-685, 1993.
related to the normal degradation mechanism of the system [12] W. Q. Wang, M. F. Golnaraghi, and F. Ismail, "Prognosis of
being monitored. For this reason, although other approaches machine health condition using neuro-fuzzy systems," Mechanical
based in frequency and time-frequency domains are also Systems and Signal Processing, vol. 18, pp. 813-831, 7// 2004.
feasible, time domain features are an adequate choice since they [13] J. J. Cardenas, F. Giacometto, A. Garcia, and J. L. Romeral, "STLF
present a good tradeoff between computational cost, complexity in the user-side for an iEMS based on evolutionary training of
Adaptive Networks," in Emerging Technologies & Factory
and the characterization and the representation of the Automation (ETFA), 2012 IEEE 17th Conference on, 2012, pp. 1-8.
degradation evolution in run-to-failure experiments, as has been [14] M. Lebold, "Development of distributed wireless smart sensors for
demonstrated with the application of the method. gearbox prognostic application," in Proceedings of the 58th Meeting
of the Society for Machinery Failure Prevention Technology,
The presented work could be directly applied to monitor and Virginia Beach, Virginia, USA, 2004.
predict the degradation in other simply electromechanical [15] G. Vachtsevanos, F. L. Lewis, M. Roemer, A. Hess, and B. Wu,
actuators. Additionally, as a further work, the proposed Intelligent Fault Diagnosis and Prognosis for Engineering Systems:
approach will be improved with an additional layer, which Wiley, 2006.
[16] K. Michail and K. M. Deliparaschos, "Sensor fault detection with
consists on the addition of a second data fusion layer in order to low computational cost: A proposed neural network-based control
integrate the future health information from different sources. scheme," in Emerging Technologies & Factory Automation (ETFA),
2012 IEEE 17th Conference on, 2012, pp. 1-6.
VI. AKNOWLEDGMENT [17] P. Nectoux, R. Gouriveau, K. Medjaher, E. Ramasso, B. Morello,
N. Zerhouni, et al., "Pronostia: An experimental platform for
This work was supported in part by the Spanish Ministry of bearings accelerated life test," in IEEE International Conference on
Economy and Competitiveness, CICYT (TRA2013-46757-R), Prognostics and Health Management, Denver, CO, USA, 2012.
and by the Technical University of Catalonia under the grant [18] K. Medjaher, N. Zerhouni, and J. Baklouti, "Data-driven
FPI-158-710. prognostics based on health indicator construction: Application to
PRONOSTIA's data," in Control Conference (ECC), 2013
VII. REFERENCES European, 2013, pp. 1451-1456.
[19] F. X. Diebold and R. S. Mariano, "Comparing Predictive Accuracy,"
[1] F. Bastianini, S. Sedigh, G. Pascale, and G. Perri, "Cost-Effective Journal of Business & Economic Statistics, vol. 13, pp. 253-63,
Dynamic Structural Health Monitoring with a Compact and 1995.
Autonomous Wireless Sensor System," in Nondestructive Testing of [20] B. Trawiski, M. Smtek, Z. Telec, and T. Lasota, "Nonparametric
Materials and Structures. vol. 6, O. Gne and Y. Akkaya, Eds., ed: statistical analysis for multiple comparison of machine learning
Springer Netherlands, 2013, pp. 1065-1070. regression algorithms," in International Journal of Applied
[2] M. D. Prieto, G. Cirrincione, A. G. Espinosa, J. A. Ortega, and H. Mathematics and Computer Science vol. 22, ed, 2012, p. 867.
Henao, "Bearing Fault Detection by a Novel Condition-Monitoring [21] J. Luengo, S. Garca, and F. Herrera, "A study on the use of
Scheme Based on Statistical-Time Features and Neural Networks," statistical tests for experimentation with neural networks: Analysis
Industrial Electronics, IEEE Transactions on, vol. 60, pp. 3398- of parametric test conditions and non-parametric tests," Expert
3407, 2013. Systems with Applications, vol. 36, pp. 7798-7808, 5// 2009.

You might also like