Professional Documents
Culture Documents
2549-Document Upload-7312-1-10-20201129
2549-Document Upload-7312-1-10-20201129
2549-Document Upload-7312-1-10-20201129
1,2,3,4
United Technologies Research Center (UTRC), East Hartford, CT, USA
reddykk@utrc.utc.com, sarkars@utrc.utc.com, venugov@utrc.utc.com, GierinMJ@utrc.utc.com
1
A NNUAL C ONFERENCE OF THE P ROGNOSTICS AND H EALTH M ANAGEMENT S OCIETY 2016
2
A NNUAL C ONFERENCE OF THE P ROGNOSTICS AND H EALTH M ANAGEMENT S OCIETY 2016
into the DAE. The parameters of DAE are learned by back- (Bengio et al., 2007) based on the final reconstruction error
propagation and stochastic gradient descent (Bengio et al., for nominal training data.
2007) based on nominal (no fault) data. More detailed de-
scription of DAE is in the following subsection. While test- 3. DATA D ESCRIPTION FOR VALIDATION
ing, the whole tool chain is operated in a feed-forward fashion
The data for validating the proposed algorithm is downloaded
in real-time and provides the RMS reconstruction error for a
from the NASA DASHlink open database. The original de-
time window. The average reconstruction error of a time win-
sign of experiment and data collection were performed by
dow over all sensors denotes how far it is from the nominal
Balaban et. al. (Balaban et al., 2009, 2015). A brief descrip-
condition and is used as an anomaly indicator. The distri-
tion of the experimental setup, sensor suits, fault injection
butions of reconstruction error over multi-modal sensors are
methods and fault types are given below. For more details,
leveraged to differentiate specific fault signatures. The fault
refer to Balaban et. al. (Balaban et al., 2009, 2015).
disambiguation in an unsupervised manner (without exposing
the model to all fault types in the training phase) is possible A set of electromechanical actuators (EMA), constructed by
due to the advanced representational capability of a DAE. A Moog Corporation, were used by Balaban et. al. (Balaban
DAE can capture a large amount of multi-scale and non-linear et al., 2009, 2015) for running different experiments. To in-
features of multi-sensor data without significant over-fitting. crease the horizon of available operating conditions, flyable
These statements are supported by adequate visualization and electromechanical actuator (FLEA) testbed was also con-
analysis in the section 4. structed. This paper mainly deals with the fault scenarios
that are injected in laboratory setting under various operating
2.1. Deep auto-encoder (DAE) conditions. Coupling of test actuators to the load actuator is
accomplished via an electromagnetic system. Only one test
DAE are one of the significant variants of deep Learning and
actuator at a time is normally coupled to the load actuator.
put a strong emphasis on modeling multiple levels of data
The data acquisition system consists of two National Instru-
abstraction (from low-level features to higher-order represen-
ments 6259 cards and the Galil motor controller. In this paper,
tations, i.e., features of features) from data (Erhan, Courville,
the data contains 13 modalities sampled at 100Hz, which are
& Bengio, 2010). A DAE is constructed by a multi-layer neu-
Time, Actuator Z Position, Measured Load, Motor X Current,
ral network, where there is an input layer, single or multiple
Motor Y Current, Motor Z Current, Motor X Voltage, Motor
hidden layers and an output layer. Each layer can have differ-
Y Voltage, Motor Y Temperature, Motor Z Temperature, Nut
ent number of neural units as shown in the figure 1. A DAE
X Temperature, Nut Y Temperature, Ambient Temperature.
(Bengio et al., 2007) takes an input vector x ∈ Rd and first
0 The accelerometers are connected through custom-fabricated
maps it to the latent representation h ∈ Rd using a deter-
conditioner boards that supply them with excitation voltage
ministic sigmoid function of the type h = fθ = σ(W x + b)
and remove the dc portion of the return signal. The current
with parameters θ = {W, b}, where W is the weight and b
sensors are built in the Galil motor controller. Specific num-
is the bias. This ”code” is then used to reconstruct the input
ber of nominal scenarios and different fault scenarios are de-
by a reverse mapping of f : y = f θ0 (h) = σ(W 0 h + b0 )
scribed in section 4.
with θ0 = {(W 0 ), b0 }. The two parameter sets are con-
strained to be of the form W 0 = W T , using the same weights Two faults under various operating conditions were injected
for encoding the input and decoding the latent representa- into the test articles in the following manner: 1) A jam fault
tion. Each training pattern xi is then mapped onto its code was injected via a mechanism mounted on the return channel
hi and its reconstruction yi . The parameters are optimized of the ball screw that can stop circulation of the bearing balls
via stochastic gradient descent method (Bengio et al., 2007), through the circuit. 2) A spall fault was injected by intro-
minimizing an appropriate cost function over the training set ducing cuts of various geometries via a precise electrostatic
Dn = {(x0 , t0 ), ..., (xn , tn )}. In this paper, the cost function discharge process. The initial size and subsequent growth of
L(xy) is assumed to be the root mean square error between these cuts were confirmed by using an optical inspection and
the input vector and reconstructed vector. measurement system.
3
A NNUAL C ONFERENCE OF THE P ROGNOSTICS AND H EALTH M ANAGEMENT S OCIETY 2016
Normalized RMS error at different levels of compression (Average of all sensors of all training files) Baseline (Average of all sensors mean value)
0.18
0.16
0.14
0.12
0.1
0.08
0.06
0.04
0.02
0
2 10 18 26 34 42 50 58 66 74 82 90 98 106 114 122 130 138 146 154 162 170 178 186 194 202 210 218 226 234 242 250
(a)
0.25
0.2
0.15
0.1
Dimension 6
0.05 Dimension 14
Dimension 206
0
(b)
Figure 3. (a) Variation of normalized root mean square (RMS) error at the reconstructed output layer with increasing dimension
of the bottleneck layer, (b) Individual sensor-wise reconstruction errors at the output layer for 3 different bottleneck layer
dimensions
4
A NNUAL C ONFERENCE OF THE P ROGNOSTICS AND H EALTH M ANAGEMENT S OCIETY 2016
Receiver operating characteristic (ROC Curve) ples and those samples are fed through the nominal DAE to
1.0 obtain the final reconstruction (NRMS) errors averaged over
all sensors. A detection threshold λ is applied on the aver-
0.8 age NRMS error such that a test run with larger NRMS error
than λ would be diagnosed as a fault. Varying λ, multiple
receiver operating characteristics (ROC) curves are generated
True Positive Rate
0.6
and showed in figure 4(a) for different bottleneck layer di-
Baseline
0.4
Bottleneck Dimension: 206 mensions and few single hidden-layer DAE models. It is
Bottleneck Dimension: 98
Bottleneck Dimension: 26 observed that the ROC curve for 14-dimensional bottleneck
Bottleneck Dimension: 14 layer case performs the best in fault detection. For λ = 0.56,
Bottleneck Dimension: 6
0.2 Bottleneck Dimension: 2
Single Layer: 1950
the fault detection rate is 97.8% with 0.0% false alarm. As
Single Layer: 650 the number of fault scenarios is much larger than the number
Single Layer: 512
0.0 of nominal runs in the testing phase, i.e., the testing data set
0.0 0.2 0.4 0.6 0.8 1.0
False Positive Rate
is not balanced, a series of precision-recall curves (Duda et
(a)
al., 2000) are also shown on figure 4(b). By also consider-
ing the proximity to the upper-right corner of precision-recall
Precision-Recall curve
1.0 plots, it can be concluded that the 11-layer DAE model with
14-dimensional bottleneck layer performs best in fault diag-
nostics. According to the ROC curves, a single hidden-layer
0.8
DAE model with 512-dimensional bottleneck layer performs
similarly to the proposed 11-layer DAE with 14-dimensional
0.6
bottleneck layer. The reason, for which the proposed DAE
Precision
5
A NNUAL C ONFERENCE OF THE P ROGNOSTICS AND H EALTH M ANAGEMENT S OCIETY 2016
Motor X Current Nut Y Temperature Motor X Current Nut Y Temperature Motor X Current Nut Y Temperature
Motor Z Current Motor Z Temperature Motor Z Current Motor Z Temperature Motor Z Current Motor Z Temperature
Motor X Voltage Motor Y Temperature Motor X Voltage Motor Y Temperature Motor X Voltage Motor Y Temperature
Motor Y Voltage Motor X Temperature Motor Y Voltage Motor X Temperature Motor Y Voltage Motor X Temperature
Figure 5. Spider charts showing the NRMS error across different sensors during testing phase for (a) nominal scenarios, (b)
spall fault scenarios, and (c) ballscrew jam fault
TRAINING Nominal Scenario
TESTING Nominal Scenario
TESTING Spall Fault Scenario Actuator Z Position
TESTING Ballscrew Jam Fault Scenario
Measured Load Ambient Temperature
0.5
Motor X Current Nut Y Temperature 0.0
0.8
0.5
0.6
0.4 1.0
PCA Dimension 2
Motor Y Current 0.2 Nut X Temperature
1.5
2.0
Motor Z Current Motor Z Temperature
2.5
Nominal Scenario
3.0 Spall Fault Scenario
Motor X Voltage Motor Y Temperature Ballscrew Jam Fault Scenario
3.5
0.8 0.6 0.4 0.2 0.0 0.2 0.4 0.6 0.8 1.0
PCA Dimension 1
Motor Y Voltage Motor X Temperature
(a)
(a)
0.8
0.5
0.6
0.4
Motor Y Current 0.2
Nut X Temperature
1.0
1.5
Motor Z Current Motor Z Temperature
2.0
2.0 1.5 1.0 0.5 0.0 0.5 1.0 1.5
PCA Dimension 1
6
A NNUAL C ONFERENCE OF THE P ROGNOSTICS AND H EALTH M ANAGEMENT S OCIETY 2016
components using two models: (i) single hidden-layer DAE tions. IEEE Sensors Journal, 9(12), 1907-1917. doi:
model with 512-dimensional bottleneck layer and (ii) pro- 10.1109/JSEN.2009.2030284
posed 11-layer DAE with 14-dimensional bottleneck layer. Balaban, E., Saxena, A., Narasimhan, S., Roychoudhury, I.,
It is apparent from the figure 7 that, even in a reduced dimen- Koopmans, M., Ott, C., & Goebel, K. (2015). Prognos-
sion, the proposed model can identify different faults with tic health-management system development for elec-
high confidence in an unsupervised fashion. However, the tromechanical actuators. Journal of Aerospace Infor-
single hidden-layer DAE model can not distinguish between mation Systems, 12, 329-344.
the faults in lower dimension because it does not have enough Basir, O., & Yuan, X. (2007). Engine fault diagnosis based on
representational capacity for the multi-modal sensor data. Al- multi-sensor information fusion using dempstershafer
though it has comparable anomaly detection performance, it evidence theory. Information Fusion, 8(4), 379 - 386.
can not classify different faults as well as the proposed 11- Bengio, Y., Lamblin, P., Popovici, D., Larochelle, H., et
layer DAE model due to over-fitting. al. (2007). Greedy layer-wise training of deep net-
works. Advances in neural information processing sys-
5. C ONCLUSION AND F UTURE W ORK tems, 19, 153.
The authors propose a Deep Auto-encoder (DAE) based fault Chen, W., & Saif, M. (2007). A sliding mode observer-based
detection and disambiguation framework, which is built upon strategy for fault detection, isolation, and estimation in
the concepts of deep learning. This work is one of the few a class of Lipschitz nonlinear systems. International
recent efforts utilizing the large representational capability Journal of Systems Science, 38(12), 943–955.
of deep architectures for addressing complex PHM prob- Das, S., Sarkar, S., Ray, A., Srivastava, A., & Simon, D. L.
lems. The paper attempts to capture the nominal signature (2013). Anomaly detection in flight recorder data: A
under a variety of conditions by building and training an 11- dynamic data-driven approach. In American control
layer DAE on multi-modal sensor data. The proposed frame- conference (acc), 2013 (pp. 2668–2673).
work is trained directly on raw time series from heteroge- Duda, R. O., Hart, P. E., & Stork, D. G. (2000). Pattern
neous sensors without hand-crafting any features based on classification, second ed. (Vol. 654). New York: John
domain knowledge. The paper demonstrates the training ap- Wiley and Sons.
proach and performance of the proposed technique via test- Erhan, D., Courville, A., & Bengio, Y. (2010). Understand-
ing it on a large set of realistic data. A high fault detection ing representations learned in deep architectures. De-
rate (∼ 97.8%) along with zero false alarm are achieved by partment d’Informatique et Recherche Operationnelle,
the the 11-layer DAE model. This paper also successfully University of Montreal, QC, Canada, Tech. Rep, 1355.
disambiguates among different types of faults with high con- Gardner, W., Napolitano, A., & Paura, L. (2006). Cyclosta-
fidence in an unsupervised way, i.e., via clustering. A thor- tionarity: half a century of research. Signal processing,
ough comparison (via visualization of sensor-specific NRMS 86(4), 639–697.
errors) of the proposed DAE model with a single hidden-layer Gertler, J. J. (1988). Survey of model-based failue detection
DAE model demonstrates the necessity of deep architecture and isolation in complex plants. IEEE Control Systems
for building a robust algorithm for fault detection and diag- Magazine, 11.
nostics. Some future research tasks are: Gertler, J. J. (1997). Fault detection and isolation using parity
equations. Control Engineering Practice, 5(5), 653–
• further validation on a larger number of fault types, 661.
• introduction of supervised classification of faults, Gustafsson, F. (2002). Stochastic fault diagnosability in par-
• testing at simultaneous multi-fault scenarios. ity spaces. In 15th IFAC world congress, barcelona,
spain.
ACKNOWLEDGMENT Hagenblad, A., Gustafsson, F., & Klein, I. (2003). A com-
parison of two methods for stochastic fault detection
The data for validating the proposed algorithm is downloaded and principal component analysis. In 13th ifac sympo-
from the NASA DASHlink open database. A sincere ac- sium on system identification (sysid), rotterdam, nl (pp.
knowledgment goes to NASA for making this data available 27–29).
for research and Balaban et. al. (Balaban et al., 2009, 2015) Hinton, G., & Salakhutdinov, R. (2006). Reducing the di-
for performing the experiments and data collection. mensionality of data with neural networks. Science,
313.5786, 504-507.
Kim, S., Kim, Y., & Park, C. (2004). Hybrid fault detection
R EFERENCES
and isolation techniques for aircraft inertial measure-
Balaban, E., Saxena, A., Bansal, P., Goebel, K. F., & Cur- ment sensors. In Aiaa guidance, navigation and control
ran, S. (2009, Dec). Modeling, detection, and dis- conference and exhibit, providence, rh.
ambiguation of sensor faults for aerospace applica- Kim, S. J., & Lee, C. W. (1999). Diagnosis of sensor faults in
7
A NNUAL C ONFERENCE OF THE P ROGNOSTICS AND H EALTH M ANAGEMENT S OCIETY 2016
active magnetic bearing system equipped with built-in tion in aircraft gas turbine engines. Proceedings of the
force transducer. IEEE Transactions on Mechatronics, Institution of Mechanical Engineers, Part G: Journal
4(3), 180–186. of Aerospace Engineering, 227(12), 1988–2001.
Kiyak, E., Cetin, O., & Kahvecioglu, A. (2008). Air- Sarkar, S., Sarkar, S., & Ray, A. (2014). Data-enabled health
craft sensor fault detection based on unknown input management of complex industrial systems. Fault De-
observers. Aircraft Engineering and Aerospace Tech- tection: Classification, Techniques and Role in Indus-
nology, 80(5), 545–548. trial Systems, NOVA Science Publishers.
Litt, J., Kurtkaya, M., & Duyar, A. (1994). Sensor fault detec- Sarkar, S., Sarkar, S., Virani, N., Ray, A., & Yasar, M. (2014).
tion and diagnosis simulation of a helicopter. In Con- Sensor fusion for fault detection and classification in
ference of military, government, and aerospace simula- distributed physical processes. Frontiers in Robotics
tion, san diego, ca. and AI, 1, 16.
Ngiam, J., Khosla, A., Kim, M., Nam, J., Lee, H., & Ng, A. Y. Seda, P., Kadir, E., & Dogru, B. E. (2007). Intelligent sensor
(2011). Multimodal Deep Learning. In Proceedings of fault detection and identification for temperature con-
the 28th international conference on machine learning trol. In Proceedings of the 11th wseas international
(icml-11) (pp. 689–696). conference on computers, agios nikolaos, greece.
Rizzoni, G., & Min, P. S. (1991). Detection of sensor failures
Simani, S., Fantuzzi, C., & Beghelli, S. (2000). Diagno-
in automotive engines. IEEE Transactions on Vehicular
sis techniques for sensor faults of inustrial processes.
Technology, 40(2), 487–500.
IEEE Transactions on Control Systems Technology,
Sarkar, S., Lore, K. G., Sarkar, S., Ramanan, V.,
8(5), 848–855.
Chakravarthy, S. R., Phoha, S., & Ray, A. (2015).
Early detection of combustion instability from hi-speed Tan, C. P., & Edwards, C. (2002). Sliding mode observers
flame images via deep learning and symbolic time se- for detection and reconstruction of sensor faults. Auto-
ries analysis. In Annual conference of the prognostics matica, 38, 1815–1821.
and health management society 2015. Tran, V., AlThobiani, F., & Ball, A. (2014). An approach to
Sarkar, S., Mukherjee, K., Sarkar, S., & Ray, A. (2013). Sym- fault diagnosis of reciprocating compressor valves us-
bolic dynamic analysis of transient time series for fault ing teager-kaiser energy operator and deeep belief net-
detection in gas turbine engines. Journal of Dynamic works. Expert Systems with Applications.
Systems, Measurement, and Control, 135(1), 014506. Verma, K., Gupta, V., Sharma, M., & Sevakula, R. (2013).
Sarkar, S., Sarkar, S., Mukherjee, K., Ray, A., & Srivastav, A. Intelligent condition based monitoring of rotating ma-
(2013). Multi-sensor information fusion for fault detec- chines using sparse auto-encoders. IEEE.