Design of A Reconfigurable Prognostics P

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 13

Expert Systems with Applications 37 (2010) 240–252

Contents lists available at ScienceDirect

Expert Systems with Applications


journal homepage: www.elsevier.com/locate/eswa

Design of a reconfigurable prognostics platform for machine tools


Linxia Liao *, Jay Lee
NSF I/UCR Center for Intelligent Maintenance Systems, Department of Mechanical Engineering, Univ. of Cincinnati, Cincinnati, OH 45221, USA

a r t i c l e i n f o a b s t r a c t

Keywords: For decades, researchers and practitioners have been trying to develop and deploy prognostics technol-
Prognostics ogies with ad hoc and trial-and-error approaches. These efforts have resulted in limited success, due to
Machine performance assessment the fact that it lacks a systematic approach and platform in deploying the right prognostics tools for
Reconfigurable platform the right applications. This paper introduces a methodology for designing a reconfigurable prognostics
Machine tool
platform (RPP) which can be easily and effectively used to assess and predict the performance of machine
tools. RPP can be installed on the equipment and it has the prognostic capabilities to convert the data into
performance-related information. The equipment performance information can then be integrated into
the enterprise asset management system for maintenance decision making through the Internet. Two
industrial cases are used to validate the effectiveness of applying the RPP for different prognostic appli-
cations as well as the reconfigurable capabilities of the proposed RPP.
Ó 2009 Elsevier Ltd. All rights reserved.

1. Introduction implemented and maintained because of the lack of expert knowl-


edge for data analysis or well-designed system architecture.
Unexpected machine tool downtime is still one of the major is- In the literature, a large variety of prognostic approaches and
sues in impacting machining productivity in industry. For example, stand-alone platforms have been studied and implemented for dif-
1 min of downtime in an automotive manufacturing plant could ferent applications. Ge et al. (2004) presented a support vector ma-
cost as much as $20,000 (Spiewak et al., 2000). The breakdown of chine (SVM) based method for fault diagnosis in sheet metal
one machine may result in the halt of the whole production line stamping processes and a shop floor application was implemented
in a manufacturing facility. Purely reactive (reactive maintenance) by using a PC and additional data acquisition equipment. Li et al.
or blindly proactive (preventive maintenance) maintenance are (2005) proposed a hidden Markov model (HMM)-based fault diag-
existing widely in today’s industry, while both of these scenarios nosis in speed-up and speed-down process for rotary machinery. In
could be extremely wasteful. Predictive maintenance focuses on the implementation of the system, one PC was used for data sam-
failure prediction in order to prevent failure in advance and offer pling and another PC was used for data storage and analysis. A net-
sufficient information to improve planning, while it is hard to be work communication was also set up between the two computers
properly established and effectively implemented because of the for data transmission. Wu and Chow (2004) presented a self-orga-
lack of expert knowledge for data analysis or well-designed system nizing map (SOM) based radial-basis-function (RBF) neural net-
architecture. work method for induction machine fault detection. The system
Traditional Condition-Based Maintenance (CBM) systems have was implemented by utilizing a PC and additional data acquisition
been implemented in manufacturing plants to continuously deliver equipment. Those stand-alone platforms have been validated to be
data related to the machine’s status and performance. The problem effective for specific applications, while they may not be applicable
is that field engineers, practitioners and management staff are not when the application conditions change and it is generally difficult
able to make effective use of huge amount of data obtained by CBM to integrate them into other systems and keep them well
systems to accurately detect the degradation status of the equip- maintained.
ment, not mention to be aware of the degradation trend which will In recent years, some maintenance system architectures and
lead to catastrophic failure eventually. The decision making in CBM case studies have been proposed and studied by many researchers.
focuses on predictive maintenance (Lee et al., 2006). Predictive Wang et al. (2004) presented a web-enabled remote monitoring,
maintenance focuses on failure prediction in order to prevent fail- diagnosis and collaborative maintenance system. Data was col-
ure in advance and offer sufficient information to improve plan- lected from six monitoring points on an industrial machine
ning, while it is hard to be properly established and effectively through a DAQ server and sent to a web-based diagnosis system
for analysis. Chen et al. (2005) proposed a five-layer system archi-
* Corresponding author. tecture for remote monitoring and e-maintenance. Two perfor-
E-mail address: liaolinxia@gmail.com (L. Liao). mance monitoring systems for an elevator and a spindle were

0957-4174/$ - see front matter Ó 2009 Elsevier Ltd. All rights reserved.
doi:10.1016/j.eswa.2009.05.004
L. Liao, J. Lee / Expert Systems with Applications 37 (2010) 240–252 241

developed and presented, respectively. Qu et al. (2006) presented a work makes the RPP easily to be adapted to various prognostics
Common Object Request Broker Architecture (CORBA) – based re- applications. To validate the effectiveness of using RPP for machine
mote diagnostic system for large scale distributed rotary machines. tool prognostic applications, two industrial case studies are pre-
In the implementation of the diagnosis platform for milling and sented to illustrate how to apply RPP to applications of automatic
turbine machines, data was accumulated and preprocessed in the tool changer (ATC) and spindle bearing health monitoring and
factory level and was sent to remote diagnosis center for further prognostics, respectively.
analysis and decision making. Han and Yang (2006) proposed an
e-maintenance system integrated with several advances tech-
niques (i.e. wireless, web, etc). The system included two subsys- 2. Design of the reconfigurable prognostics platform (RPP)
tems: maintenance center and local maintenance. The former
one consisted of the maintenance strategies which are imple- The proposed approach is for designing prognostic systems for
mented and evaluated by the later one. The feasibility of the sys- machine tool applications. The RPP is a reconfigurable platform
tem was demonstrated by a case study of fault diagnosis of an which integrates both hardware and prognostic software. The plat-
introduction motor. Those proposed architectures can be applied form utilizes computing, communication and data analysis tech-
to large-scale monitoring enterprise applications and they are, to nologies to assess and predict the equipment performance. The
some extent, easier to be integrated into other systems than the RPP contains an embedded prognostic toolbox (Watchdog AgentÒ)
stand-alone systems. However, the local data analysis capability which converts multi-sensory data into machine performance-re-
was not well addressed. Most of the existing proposed mainte- lated information. Advanced data processing and autonomous
nance architectures rely on the remote data analysis systems for prognostics methods are developed to facilitate the plug-and-play
performance assessment and decision making. Large amount of approach in which the RPP can be set up and run with the mini-
data transmission over the network raises the problems of network mum need for expert knowledge or human intervention.
congestion and issues of data storage. Another issue is that the
reconfigurable framework of the data analysis software modules 2.1. A prognostic toolbox – Watchdog AgentÒ
was not well emphasized. The developed systems are usually very
hard to be maintained due to the difficulty to integrate the newly The Watchdog AgentÒ toolbox, developed by the NSF Center for
developed software modules into the existing systems or the lack Intelligent Maintenance Systems (IMS), is able to convert relevant
of development experience. machinery data into performance-related health features and
As for standardization, Kothamasu et al. (2006) discussed sev- quantitatively assess the machine’s performance/health index. Fur-
eral organizations who have been putting effort to develop stan- thermore, the toolbox also contains algorithms that can predict the
dard specifications in the maintenance and control arena. Lebold degradation trends in order to provide valuable information for the
et al. (2002) proposed a typical open architecture, which is called maintenance practitioners to take proactive actions before poten-
OSA-CBM (Open System Architecture for Condition-Based Mainte- tial failures occur. As illustrated in Fig. 1, data can be obtained from
nance), for machinery monitoring and diagnosis. The purpose was the embedded sensors on the machine tools and maintenance re-
to demonstrate a software structure which facilitates the interop- cords as well as manually input working conditions. These data
erability of CBM software modules. The OSA-CBM architecture is are transformed into multiple-regime (e.g. high-speed cutting
not exclusive to any specific software implementation technology. and low-speed cutting, etc.) features by selecting appropriate algo-
Yu et al. (2007) presented the implementation of Internet-based rithms for signal processing and feature extraction. In the feature
remote monitoring and diagnostic platform for key equipment space, performance/health indices are obtained by statistically
according to the OSA-CBM architecture. Several algorithms for detecting the deviation of the real-time features from baseline by
fault diagnosis and condition prediction were utilized to realize a selecting proper health assessment algorithms. The future perfor-
maintenance support system for distributed power-generation mance degradation tendency is predicted by applying appropriate
systems. However, due to the lack of local data processing capabil- performance prediction algorithms to the feature space or health
ities, the data was transferred to remote computers for further indices. A health feature radar chart, which shows the health con-
analysis, which may incur the network traffic problems and the dition of all monitored critical components of the machine tool, is
lag of maintenance decision making. Furthermore, how to reconfig- then presented for in a user-friendly interface for field engineers
ure the data analysis software modules for various applications and management staff to make maintenance decisions.
was not addressed in detail. As illustrated in Table 1, the Watchdog AgentÒ toolbox consists
Much progress has been made in the literature, while two major of various algorithms for machine prognostics, which mainly fo-
issues still remain: cuses on the following four areas: signal processing and feature
extraction, health assessment, performance prediction and diagno-
 Most existing approaches still focus on failure detection and sis. A brief introduction of these four sets of tools will be given as
fault isolation instead of prognostics. Also, many algorithms follows.
are application specific. A scalable prognostic toolbox, which is Signal processing and feature extraction tools are used for decom-
used for equipment performance degradation assessment and posing the sensory data into multi-dimensional feature space that
prediction, becomes necessary to automatically convert machin- is most related to the performance of components of the machine
ery data into performance-related information. tool. For example, time domain analysis directly compares the time
 There is no available prognostic platform that can easily adapt to waves of two different signals. Fast Fourier transform (FFT), which
new application environments and quickly be applied to various is a typical frequency domain analysis algorithm, decomposes a
machine tool applications. waveform into a sum of sinusoids with different frequencies. An
example can be found in Schoen and Habetler (1995) which de-
This paper proposes a systematic approach for designing a scribes using spectrum analysis to detect rotor fault in induction
reconfigurable prognostics platform (RPP) which can be quickly machines. Time frequency analysis and wavelet/wavelet packet
and effectively deployed for various machine tool prognostic appli- analysis can be used to capture the dynamic characteristics of
cations. A toolbox, which is called Watchdog AgentÒ (Djurdjanovic the non-stationary data (with frequency component changes over
et al., 2003), is applied to converts machinery data into perfor- time). Wavelet packet analysis was proposed in Yen and Lin
mance-related information. An agent-based reconfigurable frame- (2000) to extract time-frequency information form vibration signa-
242 L. Liao, J. Lee / Expert Systems with Applications 37 (2010) 240–252

Fig. 1. Flowchart of Watchdog AgentÒ to convert data into performance-related information (Liao & Lee, 2008).

Table 1
Prognostic algorithms in the Watchdog AgentÒ toolbox (Liao & Lee, 2008).

Signal processing and feature extraction tools Health assessment tools


Time domain analysis Logistic regression
Frequency domain analysis Statistical pattern recognition
Time-frequency analysis Feature map pattern matching (self-organizing maps)
Wavelet/wavelet packet analysis Neural network
Principal component analysis (PCA) Gaussian mixture model (GMM)
Performance prediction tools Diagnosis tools
Autoregressive moving average (ARMA) Support vector machine (SVM)
Elman recurrent neural network Feature map pattern matching (self-organizing maps)
Fuzzy logic Bayesian belief network (BBN)
Match matrix Hidden Markov model (HMM)

ture. Principal component analysis (PCA) is a widely used algo- requires the distribution of features be appropriately Gaussian and
rithm to reduce the dimension of the feature space or data by only description in the normal operating condition is required. A
transforming the original high-dimensional data into a new group statistical pattern recognition method, which calculates the statis-
of variables which are uncorrelated to each other. PCA was used in tical overlap of Gaussians describing the most recent and normal
Wang and Cui (2005) as a sensor fault detection and diagnosis operating behavior, was described in Djurdjanovic et al. (2002)
method for centrifugal chiller systems. for tool wear monitoring on a CNC lathe machine.
Health assessment tools are used for evaluating the health condi- Diagnosis tools are used for analyzing the patterns embedded in
tion of the machine tools by calculating the overlap between the the data/features to determine the root causes of previous ob-
real-time feature space and that during normal product operation. served faults or degradation. For example, Bayesian belief network
This overlap is continuously transformed into a confidence value (BBN) uses a directed graphical model to represent joint distribu-
(CV), following Lee (1996), ranging from 0 to 1 (which indicates tion of different events. The structural characteristics make BBN
unacceptable and normal machine performance, respectively) over a very powerful diagnostic algorithm that can handle large proba-
time. CV quantitatively evaluates the deviation of the recent behav- bility distributions, especially for a complex system with a large
ior from normal/baseline behavior. For example, logistic regression number of variables. An example can be found in Romessis and
algorithm can convert multi-sensory data into a probability-based Mathioudakis (2006) for jet engine gas path fault diagnosis. Sup-
output which is constrained between 0 and 1. It requires data to port vector machine (SVM) is a binary classification algorithm in
be available both in normal operating condition and unacceptable which a linearly inseparable problem can be solved by using an
condition. Logistic regression was proposed in Yan and Lee (2005) appropriate kernel function to project the data into a higher
for assessing machine performance using a case study of an elevator dimensional space where a hyper-linear boundary is possible.
door. Statistical pattern recognition calculates the overlap between SVM was proposed in Yuan et al. (2004) for fault detection and
the most recent behavior and the normal/baseline behavior, which diagnosis for electromechanical systems.
L. Liao, J. Lee / Expert Systems with Applications 37 (2010) 240–252 243

Performance prediction tools are used for predicting the trend of tion of SA is to manage both the system resources (e.g. memory
equipment signals or health indices over time and extrapolating and disk capacity) and device hardware (e.g. data acquisition board
the equipment behaviors in the future. For example, auto-regres- and wireless board). If a request is received to generate an EA at the
sive moving average (ARMA) algorithm is applicable to predict initial stage or to modify the EA at the runtime stage, SA creates a
the performance of linear time-invariant systems whose perfor- vacant agent first. With interaction with KA, SA assigns system re-
mance features display stationary behavior (without frequent sources to the agent and executes it to be autonomous. SA can also
components change over time). The details and applications of communicate with and receive requests from other SAs in the net-
ARMA can be found in Pandit and Wu (1993). Elman recurrent neu- work. KA interacts with the knowledge database to obtain decision
ral network (ERNN) with backward propagation has its own spe- making support. It provides component dependencies and model
cialty in non-linear curve fit. It can be used to forecast the health parameters to perform a specific machine tool monitoring and
indices of highly dynamic systems and it has the capability to yield prognostics task. KA also provides coded modules which can be
good long term prediction. An example can be found in Yu et al. downloaded from the knowledge database to create a functional
(2006) using recurrent neural network for feature signature predic- EA. The flowchart of the program is shown in Fig. 3.
tion of a boring process. The states executed by different agents are marked with differ-
Those algorithms will be integrated into the RPP software ent colors. After the system starts, SA and KA are automatically
framework as components, which will be further described in Sec- generated. At the initialization stage, KA interacts with the knowl-
tion 2.2. In order to automatically choose the most appropriate edge database to obtain knowledge rules, code modules and
algorithms for a predetermined application, Quality Function parameters. SA generates a vacant EA first and then initializes
Deployment (QFD)-based algorithm selection method is devel- the hardware resources and loads the necessary algorithm compo-
oped. QFD was developed by Yoji Akao in the 1960s and it provides nents according to the knowledge obtained from KA. EA will auton-
a structured framework for concurrent engineering that ensures omously and repeatedly perform the health monitoring and
that the ‘‘voice of the customer” is incorporated into all phases of prognostic tasks with the algorithms in the algorithm toolbox until
product development (Govers, 1996). The purpose of utilizing the a stop task or reconfiguration request is received. If an alarm is oc-
QFD method here is to construct a relationship between the user’s curred, EA will send the alarm before it sends the health informa-
requirements or application conditions and the appropriate tools. tion through the Internet. One thing needs to be noticed is that
In the implementation of QFD, each tool in the Watchdog AgentÒ when SA receives a reconfiguration request, it will terminate the
toolbox is assigned a weight representing the correlation of the existing runtime EA and generate a new EA by interacting with KA.
tool with the specific application conditions such as characteristics There are three main remote application servers which are
of the signal (e.g. stationary or non-stationary), system knowledge knowledge base and code library server, application and authenti-
(e.g. enough or limited), data dimension (e.g. scalar or multi- cation server, and information database. Those web-based applica-
dimensional) and so on. The QFD method is used to calculate a final tions are built based on a multi-tier architecture including
weight for each tool under the constraints of user-defined condi- presentation tier, application tier and data tier. This is an open, dis-
tion priorities. The tool with the highest final weight is then chosen tributed approach that separates the ‘fat’ client into two parts –
as the most appropriate tool. The implementation details can be user interface and logic processing (TonyLiu & William Xu, 2001).
referenced to Armacost et al. (1994). In this case, the user interface in the presentation tier consists of
applications in the enterprise layer as described in Section 2.1.
2.2. Reconfigurable prognostic software architecture of RPP Application and authentication server functions as middleware in
the application tier to provide services for data requests between
The RPP has an agent-based reconfigurable software framework the users and the databases. Information database and knowledge
which is shown in Fig. 2. There are three main agents in the recon- base and code library server is database server in the data tier. The
figurable software framework, system agent (SA), knowledge-data- three-tier architecture allows faster response and processing of
base agent (KA) and executive agent (EA), which play important user’s requests (TonyLiu & William Xu, 2001). The three-tier struc-
roles in the software reconfigurable processes. The primary func- ture and the request/response flowchart are shown in Fig. 4.

Fig. 2. Reconfigurable prognostic software architecture for RPP.


244 L. Liao, J. Lee / Expert Systems with Applications 37 (2010) 240–252

Fig. 3. The program flowchart of the RPP.

Fig. 4. Three-tier architecture for remote applications.

The web-based applications in the presentation tier include the by utilizing Microsoft ActiveX control technique which can be
software to display the equipment health information, look embedded in the Internet browser. Active Server Pages
through historical information, and input engineering tool selec- (ASP) + Microsoft SQL database techniques are also used to develop
tion rules and so on. The web-based application was implemented the web-based applications.
L. Liao, J. Lee / Expert Systems with Applications 37 (2010) 240–252 245

Knowledge base and code library server has two main tasks. The 3. Industrial case studies using RPP
first one is to accumulate the rules or experience to deal with spe-
cial data and specific applications. A selection tool is running on Two industrial cases (a. automatic tool changer health monitor-
this server and provides guidance service to select appropriate ing and prognostics; b. spindle bearing health monitoring and
tools for different prognostics tasks. It prioritizes all the algorithms prognostics) will be presented to validate the effectiveness of
according to the specific application situation and chooses the applying the RPP to different applications. The description of the
tools with the highest priority in signal processing and feature case studies will follow the sequence: (1) system setup; (2) data
extraction, health assessment and diagnosis and performance pre- processing which includes signal processing and feature extrac-
diction. The SA on RPP will generate EA based on the guidance of tion, health assessment and performance prediction. Different
the knowledge base. The rules to select tools can also be input algorithms and results are elaborated for each case study; (3) user
manually from the web-based user interface. The second task of interface. At the end of this section, a reconfigurable capability
the knowledge base and code library server is to save all the code experiment is carried out to show the efficiency of reconfiguring
modules of the algorithms in the DLL format. The code modules the RPP for various applications.
can be downloaded to the RPP when the environment or condition
of the applications changes. All the requests/responses associated 3.1. Automatic tool changer (ATC) health monitoring
with the knowledge base and code library server go through the
application and authentication server. Machining systems, such as Computer Numerical Control
Application and authentication server provides services for data (CNC) machine tools, are used for independent parts machining,
requests between the users, RPP and the databases on the network tooling fabrication and are therefore integrated into the manu-
and verifies the identities of the Internet users before they get ac- facture of products. The breakdown of one machine tool may re-
cess to the RPP or databases. Before an Internet user gets access to sult in the halt of the whole production line in a manufacturing
the embedded web server on RPP, the application and authentica- facility. From the manufacturer’s maintenance records, ATC is
tion server will verify the user’s identity on the information data- one of the components which fail the most. In this case, both
base to decide whether further access is allowed. Application and the health condition of the mechanical part and the electrical
authentication server also provides identity verification for users part are monitored and results are fused to assess the health
who tend to access the data saved on the information database. condition of the ATC. Multiple sensory data is obtained as the in-
After the user’s identity is verified, application and authentication put of the RPP which will convert data into health information
server will redirect the request to corresponding server or RPP if for maintenance decision making. The data is processed locally
the user has the appropriate access right. When the application on the RPP and the health information is then published to the
and authentication server receives the response from the server Internet.
or RPP, it will redirect the response to the corresponding user.
Information database includes a CBM system which incorpo- 3.1.1. System setup and data acquisition
rates all the health assessment and predicted information gener- Both mechanical and electrical degradation are taken into con-
ated by RPP for all the monitored machines in the plant. The sideration in this case study. For mechanical part, the behavior of
CBM system follows the MIMOSA standards, which makes it easy the vibration signals will change significantly if the ATC develops
to integrate the information converted by RPP. The information a mechanical defect such as cam wear, tool drop and tool change
database can be a certain Computerized Maintenance Management position error and so on. Hence vibration data is obtained to anal-
System (CMMS) or enterprise level database containing the asset ysis the mechanical degradation of the ATC. For electrical part, if
information. Information database also contains a database which the control logic or the Programming Logic Control (PLC) performs
saves all the user identification information such as user name, abnormally, the timing of the control steps will deviate from the
password, and department and so on. normal condition. Therefore, timing signals from the controller
are also monitored to detect the degradation of the ATC.
2.3. Hardware integration for RPP As shown in Fig. 5, a Kistler 8141A1 accelerometer is installed
on the spindle housing to obtain the vibration signals from the ma-
According to the roles described above, the RPP hardware plat- chine tool test-bed. The raw data goes through the Kistler 5127B01
form should have necessary data acquisition, computation and com- conditioner box to the RPP by using Sensory Data Acquisition com-
munication capabilities. Considering both the local data processing ponent. ATC timing widows from the Fanuc controller are also ob-
and the reconfigurable requirements, the hardware platform must tained via an Ethernet connection to the RPP through OPC Process
have high computational performance and good extension capabil- Data Acquisition component. The vibration sampling rate is 10 kHz
ity. A PC/104 main board is selected as the computational platform. and the controller signal sampling rate is 75 Hz. The RPP is con-
PC/104 architecture is a popular standardized embedded platform nected to the Internet through a switch and a router. After the data
for small computing modules typically used in industrial applica- is converted into health information, the information is published
tions. With the Windows XP embedded system running on it, all to Internet and can be accessed remotely by a web browser appli-
Win32 compatible software modules are supported. The operating cation from the workstation.
system and the programs are stored in a 1 GB compact flash card. Vibration signals from the accelerometer, in both a normal con-
A PCM-3718HG-B DAQ Card is chosen as the analog to digital data dition and a faulty condition, are obtained and shown in Fig. 6a. It
acquisition hardware which has 12-bit sampling accuracy and sup- is not easy to tell which one is in a normal condition and which one
ports various data acquisition methods such as software triggering, is in a faulty condition by just looking at the data.
interrupt, and direct memory access (DMA). The DAQ card is con- In this case, timing signals from the controller are also collected.
nected to the main board via a compatible PC/104 extension slot. In Fig. 6b, 15 signals, which are T-code finish keep, Magazine count,
A PC/104 wireless board and PC/104 GPRS (General Packet Radio ATC holder down 1, ATC holder up 1, ATC holder down 2, ATC
Service) module can also be selected to equip the integrated hard- holder up 2, M06 cycle run, M06 arm turn 65 command, M06
ware platform with wireless and GPRS communication capability arm turn command, M06 arm turn original command, Arm origin
if necessary. These communication boards are also connected to detect, Arm middle detect, Arm stop detect, Tool clamp and Tool
the main board with compatible PC/104 extension slots. unclamp, are utilized in this case study.
246 L. Liao, J. Lee / Expert Systems with Applications 37 (2010) 240–252

ture space to a lower dimensional space containing 95% of the


variation information of the original feature space. In this applica-
tion, the Karhunen-Loève Transform (KLT) takes the projection of
the 16-dimensional energy components to a 2-dimensional feature
space.
For the timing signals, status 5 (ATC holder down 2) and status
13 (Arm stop detect), which represent the time span of the tool
topple down action and the duration of the arm rotation action,
are selected as two independent features of the input for a logistic
regression health assessment algorithm. These two features are se-
lected because they are the most critical ones indicating the run-
ning status of the ATC. Together with the 2-dimensional
projected features by PCA, a 4-dimensional feature space is ex-
tracted from the raw data as input for health assessment
algorithms.

3.1.3. Health assessment by logistic regression


After the feature spaces for both normal and faulty condition
are obtained, the next step is to use the extracted feature spaces
as the input of the health assessment model to evaluate the health
condition of the ATC. In this case, both the normal and faulty data
are available and the health assessment task is to determine either
Fig. 5. System setup for ATC health monitoring. the ATC is in normal condition or faulty condition according to the
real-time data. Logistic regression is a robust tool that can easily
3.1.2. Signal processing and feature extraction by wavelet packet represent multiple variables as a dichotomous problem. Hence, it
analysis is chosen as the health assessment model. The goal of logistic
Usually, the vibration signals of defective components are regression is to find the best fitting model to describe the relation-
highly structured and can be grouped into two categories: sus- ship between the categorical characteristic of dependent variable
tained defects and intermittent defects (Paul, 1995). For sustained and a set of independent variables (Yan & Lee, 2005). The maxi-
defects, Fourier-based analysis, which uses sinusoidal functions as mum likelihood estimate method is used to estimate the parame-
base functions, provides an ideal candidate for extraction of these ter values that maximize the probability of the observed data by
narrow-band signals (Yen & Lin, 2000). For intermittent defects, iterative steps.
signals often demonstrate a non-stationary and transient nature The training datasets (40 datasets from normal condition and
(Yen & Lin, 2000). In this case, as we can see in Fig. 6a, the raw 35 from faulty condition) are used to train the logistic regression
vibration data is non-stationary as it contains two major impacts. model and 280 datasets are used for validation. In the test, feature
Wavelet Packet Transform (WPT), using a rich library of redundant space is extracted from the raw data by using the method de-
bases with arbitrary time-frequency resolution, enables the extrac- scribed in Section 3.1.2. Fig. 7 shows the confidence value curve
tion of features from signals that combine non-stationary and sta- of the ATC over time. The drop of the confidence value at cycle
tionary characteristics (Sun & Chang, 2002). Hence, wavelet packet 251 was caused by the drop of the air pressure at that time.
transform is used to decompose the raw data and the energy of
each node at the resolution level can be calculated. The decompo- 3.1.4. Performance prediction by neural network
sition level is chosen as 4 according to experience and the number Due to its own specialty like non-linear curve fitting, neural net-
of the energy nodes is 16. work (NN) is widely used in industrial fields. NN is an effective
The problem remains that how to decide how many features algorithm to predict complex system behaviors which involve
should be chosen for next step, which is health assessment. Princi- non-linear behaviors and unstable processes. It is able to take large
pal Component Analysis (PCA) method is used to projects the fea- amount of historical data and can achieve adequate accuracy for

Fig. 6. Raw data for ATC health monitoring.


L. Liao, J. Lee / Expert Systems with Applications 37 (2010) 240–252 247

0.8

0.6
CV

0.4

0.2

0
0 50 100 150 200 250 300
Cycle Fig. 8. Performance prediction results.

Fig. 7. Confidence value curve.

faults has been gaining importance in recent years because of its


long term prediction. In the case study, NN model is selected for detrimental effect on the reliability of rotating machines (Prabha-
performance prediction. kar et al., 2002). In this case, vibration data is acquired from the
Confidence value (CV) can be used to indicate the health condi- spindle bearing housing and transferred to the RPP. The vibration
tion of the ATC. The degradation of the ATC can be clearly seen signals are used for evaluating the health condition of the bearing.
from the curve of the confidence value over time. In the experi- Performance prediction method is then applied to predict the fu-
ment, the air pressure is adjusted to decrease gradually to cause ture feature space based on which health assessment method is
the ATC function abnormally. The historical 170 cycles’ confidence used to assess the future performance.
value is used for performance prediction. The performance of 40
steps (from cycle 190) ahead is predicted by using an NN model. 3.2.1. System setup and data acquisition
60 samples are selected from the historical health indices. A 3- In the case study, Rexnord ZA-2115 bearing is used for a run-to-
layer neural network, which has 105 neurons in the input layer failure test. As shown in Fig. 10, an accelerometer is installed on
and one neuron in the output layer, is trained with the historical the vertical direction of the bearing housing. Vibration data is col-
data. In order to decide the number of the neurons in the hidden lected every 30 minutes with sampling rate 20 kHz by using Sen-
layer, the number of the hidden layer increases from 3 to 12 in sory Data Acquisition component on the RPP. A current
the experiment. Five trials are carried out for each number of the transducer is installed to monitor one phase of the current of the
hidden layer. The NN model which achieves the minimum mean spindle motor as a time stamp. The current signal is also obtained
square error (MSE) is selected as the final model for performance by the Data Acquisition component and is used to synchronize the
prediction. Finally, the number of the neurons in the hidden layer vibration data with different running speeds of the shaft associat-
is selected as 7. ing with different machine processes.
Then the trained NN model is used to predict the confidence va- All the raw data obtained from the sensors goes through the ter-
lue of the next 40 cycles from cycle 191. The results are shown in minal into the RPP. The raw data will be processed locally on the
Fig. 8. The star points are the predicted results. The MSE is 0.0091. RPP and then the health assessment and prognostics results of
The results show that the NN model can predict the degradation the bearing will be published to the Internet through which main-
trend of the health indices of the ATC. The time consumed in online tenance engineers can access the information at their
testing using RPP is shown in Table 2. workstations.
In order to validate the prognostic results, a magnetic plug is in-
3.1.5. Application user interface stalled in the oil feedback to accumulate debris, which is used as
In this case, a real-time web application for remote monitoring evidence for the bearing degradation in the test. At the end of
and machine health assessment is built via using an ActiveX con- the failure stage, the debris accumulates to a certain level and it
trol which is used as an embedded component of Internet Explorer causes an electrical switch to close to stop the test. In the test,
browser. Fig. 9a shows the results of the signal processing and fea- the bearing finally develops a roller defect.
ture extraction. Wavelet energy feature of each decomposed node
is shown by a bar chart. Fig. 9b shows a demonstration of the web 3.2.2. Feature extraction by fast Fourier transform (FFT)
application. Machine health information can be accessed from the In this case, the vibrations can be treated as stationary signals as
Internet explorer browser. The curve is the confidence value which the machine is rotating in a constant speed with a constant load.
indicates the health condition of the ATC. The drop of the confi- FFT, as a typical frequency domain analysis tool, is chosen from
dence value at cycle 239 is because of the drop of the air pressure the Watchdog AgentÒ toolbox as an appropriate tool for feature
at that time. extraction.
By using FFT algorithm, the vibration signal is transformed from
3.2. Spindle bearing health monitoring and prognostics time-domain into its equivalent frequency domain representation.
The magnitude spectrum can be subdivided into a specific number
Bearings are critical components in rotating machines, as their of sub-bands. A sub-band is basically a group of adjacent frequen-
failures could lead to a sequence of events that could cause serious cies. The center frequencies of these sub-bands have already been
damage in machines on the production lines. Detection of bearing pre-defined at the bearing defect frequencies such as Ball Passing
248 L. Liao, J. Lee / Expert Systems with Applications 37 (2010) 240–252

Table 2
Time cost for ATC health monitoring.

Dataset Dataset type Vibration (text file) Timing widow (DB file)
description
Dataset Size 750 KB 2 KB
(each)
Offline training # of Training Time for reading per Total time for reading data Time for feature extraction per Total time for feature Time for
dataset dataset dataset extraction training
75 0.5084 s 38.1286 s 0.2973 s 22.2976 s 3.1607 s
Online Testing # of Testing Time for reading per Time for feature extraction per Time for health assessment per Total time for online health assessment
dataset dataset dataset dataset
60 0.5645 s 0.2407 s 0.017 ms 49.332 s
Online # of Prediction Average time for per prediction step Total prediction time
prediction steps
30 2.3151 s 69.4521 s

Fig. 9. Web-based HMI for ATC health monitoring.

Frequency Inner-race (BPFI), Ball Passing Frequency Outer-race


(BPFO), Ball Spin Frequency (BSF) and Foundation Train Frequency
(FTF). The equations for calculating those bearing defect frequen-
cies are described in Tse et al. (2001). The BPFI, BPFO and BSF in
this case are calculated as 131.73, 95.2 and 77.44 Hz, respectively.
Then, the energy in each of these sub-bands centered at BPFI, BPFO
and BSF is computed and passed on to the health assessment algo-
rithms in the next step.

3.2.3. Health assessment by self-organizing maps (SOM)


SOM provides a way of representing multidimensional feature
space in a one or two-dimensional space while preserving the
topological properties of the input feature space. SOM is an unsu-
pervised learning algorithm derived from neural network which
can organizes itself according to the nature of the input data.
A task for automatic health assessment is the detection of the
bearing degradation. Typically, only measurement at the normal
operating conditions is available. In rare cases there exists histori-
cal data of defects development in measurements of a complete set
of all possible defects. After a description of the normal machine
behavior is set up, anomalies are expected to show up as signifi-
cant deviations from this description. In the case, SOM can be
trained only with the normal operation data. For each input feature
vector, a BMU can be found in the SOM. The distance between the
input data feature vector and the weight vector of the BMU, which
can be defined as minimum quantization error (MQE) (Qiu & Lee,
2004), actually indicates how far away the input data feature vec-
tor deviates from the normal operation state. Hence, the degrada-
Fig. 10. System setup for spindle bearing health monitoring and prognostics. tion trend can be visualized by the trend of the MQE. As the MQE
L. Liao, J. Lee / Expert Systems with Applications 37 (2010) 240–252 249

increases, the extent of the degradation becomes severer. A thresh- ing degradation in the test, which validates the health assessment
old can be set as the maximum MQE that can be expected and the results. At the end of the failure stage, the debris accumulates to a
degradation extent can be normalized by converting the MQE into certain level and it causes an electrical switch to close in order to
confident value (CV) ranging from 0 to 1, in which MQE increases stop the test.
while the CV decreases.
Data of the first 1000 cycles when the bearing is in the normal 3.2.4. Performance prediction by autoregressive moving average
condition are used to train the SOM. After training, the whole life (ARMA) and Gaussian mixture model (GMM)
cycle data of the bearing, which develops roller element defect, is The performance prediction is based on the historical feature
used for testing and the corresponding CVs are calculated. From space, from which a baseline can be set up for the normal/baseline
the curve shown in Fig. 11, the degradation process of the bearing behavior. Performance prediction tools can be used to predict the
can be detected. In the first 1000 cycles, the bearing health is in future feature space based on the historical data. ARMA model
good condition, the CVs are near one. From around cycle 1250 to (Pandit & Wu, 1993) can be used to predict the future feature space
cycle 1500, the initial defects appear and the CV begins to decrease. in a short term. In this case, the performance in the near future (e.g.
The CV keeps decreasing until it reaches the cycle around 1700, 20 cycles ahead) for the bearing is of interest. Therefore, ARMA
which means the defects become more serious. After that and till model is chosen for modeling and predicting future feature space
cycle around 2000, the CV increases because the propagation of in a time series of historical data. Data of faulty condition is not
the roller counterbalances the vibration. The CV will decrease shar- available in this case. GMM (Lemm, 1999) is a tool to calculate
ply after this stage until the bearing fails. When the CV starts to de- the confidence value according to the overlap between the pre-
crease and becomes unstable after cycle 1300, the amount of dicted feature space distribution and the baseline feature space
debris adhered to the magnetic plug installed in the oil feedback distribution. In another word, faulty data is not necessary for
pipe starts to increase. The debris is used as evidence for the bear- GMM. Hence, GMM is selected to evaluate the predicted health
condition of the bearing based on the predicted feature space.

Fig. 11. CV of the degradation process of the bearing with roller defect.
Fig. 13. Predicted feature distributions in abnormal condition.

Fig. 12. Prediction results in the feature domain. Fig. 14. Predicted confidence value in abnormal condition.
250 L. Liao, J. Lee / Expert Systems with Applications 37 (2010) 240–252

In this case, vibration data of the first 1890 cycles is used as the a very low level (near zero). The 10 steps’ confidence values are
historical data. Features are extracted from the vibration data using shown in Fig. 14. The CV decreases to a low level, which means a
FFT. Principal component analysis (PCA) is then used to find the certain abnormal/unacceptable situation is likely to happen. The
first two principal components which contain more than 90% rep- time consumed in the online testing is shown in Table 3.
resentation information in this case.
Performance prediction is actually based on those two principal 3.2.5. Application user interface
components feature space. ARMA model is used to predict the fu- In this case, a real-time web application for remote monitoring
ture feature space of the first two components. To determine ade- and machine health assessment is also built via using an ActiveX
quate order for the ARMA model while achieve the smallest error control embedded in the Internet Explorer browser. Fig. 15a shows
at the same time; ARMA (2, 1) is the model to start with. Then the raw vibration data and frequency spectrum extracted by FFT.
the orders of the ARMA model are adjusted and judged by F-crite- Fig. 15b shows a demonstration of the web application. Spindle
rion which is a hypothesis test that measures the significance of bearing health information can be accessed from the Internet ex-
the improvement in modeling by using two different orders as de- plorer browser. The curve is the confidence value which indicates
scribed in Pandit and Wu (1993). This is an iterative step until an its health condition of the spindle bearing. The circles at the end
adequate order for the ARMA model is achieved. In this case, the of the curve are the predicted confidence values which successfully
prediction starts from step 1070 and the number of the prediction predict the trend of the health degradation.
step is 10. As Fig. 12 shows that the two dotted lines are the pre-
dicted results for 10 steps and the two solid lines are the real fea- 3.3. Reconfigurable capability experiment
tures. The result shows that the ARMA model successfully predicts
the trend of the first two components in the future 20 steps. The two case studies, which are described in Section 3.1 and 3.2,
A GMM with four centers is used to approximate the distribu- are used as examples to test the reconfigurable capability of the
tion of the normal/baseline feature space. RPP. In the test, the RPP was preset for ATC health monitoring task.
For each predicted step, Monte Carlo sampling method is uti- After the physical system setup and connection modification were
lized to generate 800 sampling points within the predicted confi- finished, the system started. SA created a vacant EA prototype. KA
dence intervals. A GMM with 2 centers is used to approximate communicated with the Knowledge base and code library server to
the predicted feature distribution and it is shown in Fig. 13. It obtain the information for the necessary algorithm components
shows that the predicted feature distribution rarely overlaps with and the model parameters. Then EA will load the selected algo-
the normal feature distribution, hence the confidence values are in rithm components (Wavelet Packet Analysis, Principal Component

Table 3
Time cost for spindle bearing health monitoring and prognostics.

Dataset Dataset type Vibration (text file)


description
Dataset size 1122 KB
(each)
Offline # of Training Time for reading per Total time for reading data Time for feature extraction per Total time for feature Time for
Training dataset dataset dataset extraction training
1000 0.7725 s 772.5461 s 0.0859 s 85.9672 s 13.2157 s
Online testing # of Testing Time for reading per Time for feature extraction per Time for health assessment per Total time for online health assessment
dataset dataset dataset dataset
2190 0.7688 s 0.7238 s 0.0796 s 3443.118 s
Online # of Prediction Time for per prediction step Total prediction time
prediction steps
10 2.5892 s 25.8925 s

Fig. 15. Web-based HMI for spindle bearing health monitoring and prognostics.
L. Liao, J. Lee / Expert Systems with Applications 37 (2010) 240–252 251

Table 4
Steps of Task Reconfiguration.

Step 1 Release all the components loaded for task1 and then release the hard ware resources Less than 1 ms
Step 2 Stop and delete the existing EA 4281 ms
Step 3 Create new EA prototype 3047 ms
Step 4 Load component of FFT for task 2 2390 ms
Step 5 Load component of SOM for task 2 3782 ms
Step 6 Load component of ARMA for task 2 2656 ms
Step 7 Load component of GMM for task 2 2641 ms

Fig. 16. Web-based HMI for task reconfiguration.

Analysis and Logistic Regression) to be ready for the health moni- in detail in the paper. Two industrial cases validated the effec-
toring task for ATC. About 20 minutes later, the physical connec- tiveness to apply the RPP to various machine health assessment
tion was changed and SA received a request from the and performance prediction applications. The prognostics infor-
administrator to change the system function from ATC health mon- mation generated by the RPP can be further integrated in the
itoring to spindle bearing health monitoring and prognostics. The enterprise asset management system, which will greatly improve
steps after the confirmation of the task reconfiguration command productivity and asset utilization. The reconfigurable capabilities
and the time consumed are listed in Table 4. of the RPP facilitate rapid deployment and the experiment re-
The test was completed on an Advantech Uno-2160 platform sults showed the efficiency to reconfigure the RPP for different
which has Celeron 400 MHz CPU, 512 MB SDRAM, and PCM- tasks.
3178-HG data acquisition board. The operating system is Window
Embedded XP with service pack 1. The algorithm components were References
written in Matlab 7.0.4.365 (R14) with Service Pack 2 and they are
converted in to DLL format running with the Matlab Runtime sup- Armacost, R. L., Componation, P. J., et al. (1994). AHP framework for prioritizing
customer requirements in QFD: An industrialized housing application. IIE
port. The task reconfiguration execution results are presented in Transactions (Institute of Industrial Engineers), 26(4), 72–79.
the human machine interface shown in Fig. 16. In totally 18.80 sec- Chen, Z., Lee, J., et al. (2005). Intelligent infotronics system platform for remote
onds, the reconfiguration for new application was finished. monitoring and E-maintenance. International Journal of Agile Manufacturing,
8(1), 3–11.
Djurdjanovic, D., Lee, J., et al. (2003). Watchdog agent – An infotronics-based
4. Conclusion prognostics approach for product performance degradation assessment and
prediction. Advanced Engineering Informatics, 17(3–4), 109–125.
Djurdjanovic, D., Ni, J., et al. (2002). Time-frequency based sensor fusion in the
This paper proposes a reconfigurable prognostics platform assessment and monitoring of machine performance degradation. American
(RPP) which is an integration of both the hardware and software society of mechanical engineers, dynamic systems and control division (Publication)
that can be easily and effectively used to assess and predict the DSC.
Ge, M., Du, R., et al. (2004). Fault diagnosis using support vector machine with an
performance for machine tools. The design of the system archi- application in sheet metal stamping operations. Mechanical Systems and Signal
tecture and the implementation of the platform are described Processing, 18(1), 143–159.
252 L. Liao, J. Lee / Expert Systems with Applications 37 (2010) 240–252

Govers, C. P. M. (1996). What and how about quality function deployment (QFD). Schoen, R. R., & Habetler, T. G. (1995). Effects of time-varying loads on rotor fault
International Journal of Production Economics, 575–585. detection in induction machines. IEEE Transactions on Industry Applications,
Han, T., & Yang, B. S. (2006). Development of an e-maintenance system integrating 31(4), 900–906.
advanced techniques. Computers in Industry, 57(6), 569–580. Spiewak, S. A., Duggirala, R., et al. (2000). Predictive monitoring and control of the
Kothamasu, R., Huang, S. H., et al. (2006). System health monitoring and prognostics cold extrusion process. CIRP Annals – Manufacturing Technology, 49(1), 383–386.
– A review of current paradigms and practices. International Journal of Advanced Sun, Z., & Chang, C. C. (2002). Structural damage assessment based on wavelet
Manufacturing Technology, 28(9), 1012–1024. packet transform. Journal of Structural Engineering, 128(10), 1354–1361.
Lebold, M., Reichard, K., et al. (2002). A framework for next generation machinery Tony Liu, D., & William Xu, X. (2001). A review of web-based product data
monitoring and diagnostics. In Failure prevention technology to improve the management systems. Computers in Industry, 44(3), 251–262.
bottom line, 56th meeting of the society for machinery failure prevention technology. Tse, P. W., Peng, Y. H., et al. (2001). Wavelet analysis and envelope detection for
Lee, J. (1996). Measurement of machine performance degradation using a neural rolling element bearing fault diagnosis – Their effectiveness and flexibilities.
network model. Computers in Industry, 30(3), 193–209. Journal of Vibration and Acoustics, 123(3), 303–310.
Lee, J., Ni, J., et al. (2006). Intelligent prognostics tools and e-maintenance. Wang, J. F., Tse, P. W., et al. (2004). Remote sensing, diagnosis and collaborative
Computers in Industry, 57(6), 476–489. maintenance with Web-enabled virtual instruments and mini-servers.
Lemm, J. C. (1999). Mixtures of Gaussian process priors. In IEEE conference publication. International Journal of Advanced Manufacturing Technology, 24(9–10), 764–772.
Li, Z. N., Wu, Z. T., et al. (2005). Hidden Markov model-based fault diagnostics Wang, S., & Cui, J. (2005). Sensor-fault detection, diagnosis and estimation for
method in speed-up and speed-down process for rotating machinery. centrifugal chiller systems using principal-component analysis method. Applied
Mechanical Systems and Signal Processing, 19(2), 329–339. Energy, 82(3), 197–213.
Liao, L., & Lee, J. (2008). A reconfigurable embedded prognostics platform for Wu, S. T., & Chow, T. W. S. (2004). Induction machine fault detection using SOM-
machinery performance management. In 62nd machinery failure prevention based RBF neural networks. IEEE Transactions on Industrial Electronics, 51(1),
technology. Virginia Beach, Virginia, US. 183–194.
Pandit, S. M., & Wu, S. M. (1993). Time series and system analysis with applications. Yan, J., & Lee, J. (2005). Degradation assessment and fault modes classification using
New York: John Wiley. logistic regression. Journal of Manufacturing Science and Engineering Transactions
Paul, D. (1995). Condition monitoring and defect diagnosis in manufacturing of the ASME, 127(4), 912–914.
process using DDA and wavelets. Dep. Elect. Eng. Houghton, MI, Michigan Yen, G. G., & Lin, K.-C. (2000). Wavelet packet feature extraction for vibration
Technol. Univ. Ph.D. monitoring. Industrial Electronics, IEEE Transactions, 47(3), 650–667.
Prabhakar, S., Mohanty, A. R., et al. (2002). Application of discrete wavelet transform Yu, G., Qiu, H., et al. (2006). Feature signature prediction of a boring process using
for detection of ball bearing race faults. Tribology International, 35(12), 793–800. neural network modeling with confidence bounds. International Journal of
Qiu, H., Lee, J. (2004). Feature fusion and degradation detection using self- Advanced Manufacturing Technology, 30(7–8), 614–621.
organizing map. In Proceedings of the 2004 international conference on machine Yu, H., Li, Y., et al. (2007). Development and application of internet-based remote
learning and applications, ICMLA’04. monitoring and diagnostic platform for key equipment. International Journal of
Qu, R., Xu, J., et al. (2006). An implementation of a remote diagnostic system on Computer Integrated Manufacturing, 20(2–3), 234–243.
rotational machines. Structural Health Monitoring, 5(2), 185–193. Yuan, S., Ge, M., et al. (2004). Intelligent diagnosis in electromechanical operation
Romessis, C., & Mathioudakis, K. (2006). Bayesian network approach for gas path systems. In Proceedings – IEEE international conference on robotics and
fault diagnosis. Journal of Engineering for Gas Turbines and Power, 128(1), 64–72. automation.

You might also like