Professional Documents
Culture Documents
An Unsupervised Learning Framework For Track Quality Index and Safety
An Unsupervised Learning Framework For Track Quality Index and Safety
An Unsupervised Learning Framework For Track Quality Index and Safety
https://doi.org/10.1007/s40515-019-00087-6
TECHNICAL PAPER
Abstract
Rail track geometry defect remains a primary cause of train accidents in the USA. With
over $1 billion lost annually to geometry-related accidents, there is an urgent need to re-
assess the analysis and treatment of geometry defects. The development of track quality
index (TQI) takes a contracted view of track assessment by focusing only on quality
without any safety consideration. Track geometry safety limits are set by Federal
Railroad Administration based on raw track geometry data. Since different variations
of track geometry parameters exist, there is a skepticism about the effectiveness of the
current bipartite analytical approach of track quality and safety. These results into two
maintenance regimes: regular and spot maintenance. This study aims to create a
framework through which a hybrid index that combines both safety and quality can
effectively eliminate costly spot maintenance practices. This index would be used to
create a data-driven maintenance scheme that maximizes the time between two main-
tenance cycles and minimizes disruptions. This technical note describes the proposed
framework for creating such an index using unsupervised machine learning.
Keywords Rail track engineering . Safety . Data science . Safety . Track quality
1 Introduction
There is a growing application of data analytical tools in the area of rail track safety and
novel data science applications (Sharma et al. 2018). A previous study focused on
methods of improving Weibull rail defect analysis (Lasisi and Attoh-Okine 2019a;
Lasisi and Attoh-Okine 2019b) while this article considers the role of track quality
indices in rail track safety.
Track quality index (TQI) is used to achieve average-based assessments of discrete rail
track segments (or sections) for the purpose of maintenance planning (Lee 2005; Esveld
* Nii Attoh-Okine
okine@udel.edu
1
Department of Civil & Environmental Engineering, University of Delaware, Newark, DE, USA
Transportation Infrastructure Geotechnology
2016). Federal railroad administration (FRA) data shows that over $1 billion is annually
lost to track geometry-related accidents in the USA (Appendix). These accidents repeat-
edly involve fatalities and casualties (FRA 2018). The role of TQIs in track safety needs a
close re-examination. For example, if TQIs are used to assess track quality (and determine
geometry maintenance regimes), why aren’t they sensitive to safety exceptions? In other
words, an index that measures the quality of track should inherently have the ability to
imply when rail track is not in the best shape for operation, hence, safety. The current state
of business suggests that geometry exceptions be handled by issuing slow-down orders on
a given track section that violates FRA safety thresholds until the exception is corrected. Is
there any way this “spot and fix” approach can be improved by developing TQIs that are
both sensitive to track quality and safety? The aim of this article is to re-evaluate the
purpose of TQIs and suggest techniques that can help modify them to integrate several
elements of track quality, ride comfort, and geometry exceptions (safety). The literature is
lavish with abundant TQIs that have been developed with different degrees of variability
in focus, approach, and effectiveness. Authors have highlighted a few of them in this note
to acquaint readers with. Figure 1a depicts a visual relationship between an aggregate track
quality measure (TQI) over a defined length, D and the occurrence of geometry exceptions
within the same length (El-Sibaie and Zhang 2004). Railroad professionals argue that it is
difficult to examine TQI per unit length of the track because of the hyper-sensitivities in
the track measurements. Therefore, maintenance decision-making due to these measure-
ments becomes a non-trivial task. However, by obtaining a collective measure of the per-
unit measurements (e.g., surface, gage, cross-level, alignment) in form of mean, standard
deviation, or power spectral density, the variabilities in the track parameter/measurement
over the chosen length, D can be summarized by a single scalar value. While this approach
simplifies the problem of hypersensitivity, it has a fallen short of picking up threshold
violations, exceptions, or defects.
Typical examples of the measurements shown in Fig. 1b are long- and short-wave
profile measurements for a 4-km section of in-service freight track. As can be observed,
the stochastic nature of the per-unit measurements is likely to be lost if measurements
are averaged or aggregated as suggested by the TQI approach.
Objective tract quality index is also known as single or individual TQIs because it only
aggregates one-parameter measurements (e.g., profile or surface) over a fixed length of
the track. Examples include FRA length-based TQI, Canadian National (CN) TQI, and
Amtrak roughness index summarized in Table 1 (Lasisi and Attoh-Okine 2018).
The above are considered amongst the most popular methods of track quality
evaluation until it became difficult to consider TQIs for different track geometry
parameters differently for the purpose of collective maintenance decision-making.
The need to combine TQIs then arose inadvertently.
With the CN polynomial-based TQI approach, it was easy to combine all TQIs by a
simple average of all parameters (gage, surface (long and short chords), alignment (long
and short chords), and cross-level) except warp. The China railway (CR) approach to
Transportation Infrastructure Geotechnology
(b) Long (20m) and Short (10m) Wave Profile measurements for a sample
4km stretch of track.
Fig. 1 A track quality index and the nature of track geometry measurements. a Track quality index and defects
(exceptions) (Ciobanu 2016); b long (20-m) and short (10-m) wave profile measurements for a sample 4-km
stretch of track
combining TQIs is to sum the standard deviations for each parameter (Xu et al. 2011).
Australian rail track corporation (ARTC) and Swedish National Railway have a slightly
different approach to combining different parameters as follows:
While some of the shortcomings of artificial TQIs are highlighted above, single/
objective TQIs are also not free of limitations. The very fact that they aggregate
parameter measurements and miss the sensitivities per unit length (or exceptions)
makes their use questionable. Although, because most objective TQIs are expressed
in the same units as the parent parameters (inch or mm), a dimensional representation
gives room for further analysis. However, combined TQIs are ambiguous, dimension-
less, and often arbitrary. While this is common of other engineering indices like
Transportation Infrastructure Geotechnology
pavement condition or international roughness index for highway pavements, the latter
has a universally acceptable scale with meaningful connotations and understanding.
Same cannot be said of many of the artificial/combined track quality indices in the
rolling stock world.
The subjective nature of the weight assignment for individual parameters found in
combined TQIs like the Swedish Q index, Polish J coefficient, Indian TGI, etc. (Lasisi
and Attoh-Okine 2018) further lowers the objectivity of any potentially acceptable
universal TQI (combined). In the following lines, authors have attempted to meet some
of the identified shortcomings with novel applications of data science techniques in rail
track engineering.
3 Potential Enhancements
A group of railroad professionals admit some of these shortcomings while others are
very resistant because change takes time. The dilemma of combining/aggregating
parameters without losing geometry exceptions can be reframed as a dimension
reduction problem that maximizes the variance in the data set, in other words,
projecting high-dimensional data onto a plane that exhibits most of the variabilities in
the data. This plane is defined by latent unique characteristics of track (or any matrix-
form) data known as Eigen-vectors. The process of transforming or decomposing
hitherto mutually correlated track geometry measurements into a set of linearly uncor-
related variables or components is called the principal component or factor analysis.
This approach can be used to decompose multi-dimensional, multi-observation track
geometry data into simple components that can describe the vertical, horizontal, and
transverse track irregularities. In order to verify the consistency of this approach,
several tests were conducted using different lengths of track and class. It was found
that three principal components were sufficient to summarize track geometry data.
Figure 2 shows that 3 components effectively summarize 95% of the variance in
track geometry. This observation was confirmed with three different track geometry
data as follows:
We have demonstrated this approach using the afore-stated datasets and found a way
to solve the problem of subjective weight assignment as well as combining multivariate
track geometry data in a way that correlates with sensitivities or exceptions in raw
geometry measurements (Lasisi and Attoh-Okine 2018). Figure 3 shows a correlation
plot of objective TQIs and combined artificial TQIs as well as the transformed or
decomposed principal components (PC1, PC2, and PC3). Combined TQIs like CNTQI,
J_Coeff, CHTQI, and TGI represents Canadian National, Polish, Chinese and Indian
TQIs respectively. While ALI_R_124 indicates the long chord (124 ft) right alignment
measurement for a sample mile track, copious details can be found in the cited paper
(Lasisi and Attoh-Okine 2018).
Transportation Infrastructure Geotechnology
Fig. 2 Cumulative variance plot versus number of principal components (Lasisi and Attoh-okine 2019)
It is immediately obvious from Fig. 3 that PC1 correlates more with surface
parameters and can be seen as a measure of vertical irregularities. PC3 however
combines elements of gage and cross-level indicating its intrinsic properties of trans-
verse irregularities. Due to the nature of principal components, PC2 has to be longitu-
dinal because all components (PC1, PC2, and PC3) are orthogonal transformations of
raw track geometry data. Hence, it can be deduced that CNTQI, J_Coeff, and TGI
actually measure longitudinal track properties contrary to assessing overall track
quality. Therefore, none of these combined TQIs assess vertical track irregularities that
often influence most track geometry maintenance work (Soleimanmeigouni et al.
2016).
The former is the more conventional method that most railways utilize. Per-
unit length observations of gage, cross-level, surface, and alignment are
collected for a given track location. Time-based observations of three-
dimensional acceleration (vertical, horizontal, and transverse) are also collect-
ed. The effect of different train speed is reflected in the number of time-based
observations per unit length. Figure 4 presents the structure of time-based data
per foot.
Depending on the speed of track geometry inspection vehicle, there can be
5 to 15 observations for every geometry measurement taken per unit length.
The higher the speed, the greater the frequency of time-based observations
taken and vice versa. Both data sources are connected by a common index
which can be used to extract the exact track acceleration tensor (vectors)
corresponding to a given track geometry measurement. Hence, the difference
in speeds is accounted for by the number of time-based data produced per-
unit length.
Due to the relatively high frequency of time-based data, it is important to
spatially synchronize the measurements with the distance-based readings. One
way to decompose the measurements and achieve a one-to-one mapping is by
computing moving averages or performing time series analysis on the high-
frequency data (time). Once both data sources are synchronized in Step 1,
objective TQIs are developed from distance-based data while an exploratory
data analysis is carried out on time-based measurements. Step 3 will enable the
elimination of redundant acceleration parameters and gyroscope readings based
Transportation Infrastructure Geotechnology
In this section, the rationale for safety is considered using dimension reduction (Fodor
2002) and machine learning.
In order to integrate safety into the proposed index, the original track
geometry data is reduced using specific unsupervised learning technique (linear
and nonlinear dimension reduction). The observed measurements are then
checked for safety threshold limits. Every observation (vector) is labeled as a
function of which track geometry limit is exceeded (e.g., “Surface Defect,”
“Profile Defect,” or “No defect”). A supervised learning technique (Hastie et al.
2009) or machine learning algorithm (Martey et al. 2017) is then used to
predict the labeled column using three predictors:
Fig. 5 A framework for developing Hybrid Indices that combine TQIs, safety, and ride quality
The performance of the predictors is evaluated for the linear and nonlinear reduced
dimensions using the original track geometry parameters as the baseline. Metrics such as
sensitivity and specificity (Marsland 2015) can be used to assess how observations with
safety violations are learned by the hybrid index(ces) (Zarembski et al. 2016; Fazio
1986). Once this is established, a soft-max layer can be used to export the probability
distribution of each parameter to examine the correlations with the reduced components
or original geometry measurements thresholds with the probability of defects. If a
correlation is established, a threshold can be determined based on the observations with
established geometry defects. Figure 6 shows a principal component threshold consid-
eration for profile defect. This limit already considers the different wavelengths of
profile since they have been learned during the supervised learning process. As a result,
the thresholds for each wavelength were learned from the labels. Through this approach,
Transportation Infrastructure Geotechnology
Potenal Threshold
several spot corrective maintenance activities can be eliminated because the hybrid TQI
presents an index that sets an artificial limit to measured geometry values (Sharma et al.
2018). This will result in cost savings and further buttresses the potential of machine
learning in rail infrastructure management (Wright et al. 2016).
4 Concluding Remarks
Track quality indices have been the industry standard for assessing track condition and
making maintenance decisions for a couple of decades. This trend continues to gain
currency amongst railroad practitioners across the world. Unfortunately, there has not
been rational, repeatable ways of verifying the efficacy of these TQIs except that
practitioners can only convince their guts that their decision-making is supported by
some numerical track analysis. Classical data science techniques have shown promising
potential throughout our railway engineering world and beyond (Lasisi et al. 2018;
Ghofrani et al. 2018) and can potentially unravel the latencies in the TQI methods that
have been developed in the past to address track quality. Hence, this article is one in a
series of initial steps into an exposition of data science potentials in track quality
assessment and rail track safety engineering.
Acknowledgments Authors would like to recognize the contributions of Dr. Joseph Palese and Dr.
Emmanuel Martey.
Appendix
References
Audley, M., Andrews, J.: The effects of tamping on railway track geometry degradation. Proc. Inst. Mech.
Eng. Part F J. Rail Rapid Transit. 227(4), 376–391 (2013)
Ciobanu, C.: Evaluation of the track quality. Pway Blog, Permanent Way Institution. (2016)
El-Sibaie, M., Zhang, Y.-J.: Objective track quality indices. Transp. Res. Rec., no. 1863, 81–87 (2004)
Esveld, C.: Modern Railway Track, 2nd edn. MRT-Productions (2016)
Fazio, J.L.C.: Track quality index for high speed tracks. Transp. Eng. 1(112), 46–61 (1986)
I. K. Fodor, “A survey of dimension reduction techniques,” 2002
FRA, FRA office of safety analysis, 3.10 - FRA accident causes, 2018. [Online]. Available: http://safetydata.
fra.dot.gov/officeofsafety/default.aspx
S. Galván-Núnez and N. Attoh-Okine, “A threshold-regression model for track geometry degradation,” 2018
Ghofrani, F., He, Q., Goverde, R.M.P., Liu, X.: Recent applications of big data analytics in railway
transportation systems: a survey. Transp. Res. Part C. 90, 226–246 (2018)
Hastie, T., Tibshirani, R., Friedman, J.: The elements of statistical learning, vol. 1. Springer (2009)
Lasisi, A., Attoh-Okine, N.: Principal components analysis and track quality index: a machine learning
approach. Transp. Res. Part C Emerg. Technol. 91, 230–248 (Jun. 2018)
Lasisi, A., Attoh-okine, N.: Nonlinear dimension reduction technique for hybrid rail track quality index.
Transp. Res. Part C Emerg. Technol. (2019)
Lasisi, A., Attoh-Okine, N.: Research: future of Weibull defects analysis in the railway industry - railway track
and structures. Railway Track & Structures. (2019a)
Lasisi, A., Attoh-Okine, N.: Machine learning ensembles and rail defects prediction : a multi-layer stacking
methodology. ASCE-ASME J. Risk Uncertain. Eng. Syst. Part A Civ. Eng. (2019b)
Lasisi, A., Martey, E.N., Guillot, D., Attoh-okine, N.: A three-step agglomerated machine learning : an
alternative to Weibull defect analysis of rail infrastructure. IEEE Trans. Intell. Transp. Syst. (2018)
S. Lee, “Development of objective track quality indices,” 2005
Marsland, S.: MACHINE LEARNING, An algorithmic perspective, Second Edi. Chapman & Hall/CRC CRC
Press, Boca Raton (2015)
E. N. Martey and N. Attoh-Okine, “Modeling tamping recovery of track geometry using the copula-based
approach”
Martey, E.N., Ahmed, L., Attoh-Okine, N.: Track geometry big data analysis : a machine learning approach.
In: 2017 IEEE Int. Conf. Big Data (Big Data), pp. 3718–3727 (2017)
Muinde, M.S.: Railway track geometry inspection optimization. Luleå University of Technology. (2018)
Sharma, S., Cui, Y., He, Q., Mohammadi, R., Li, Z.: Data-driven optimization of railway maintenance for
track geometry. Transp. Res. Board Part C. 90, 34–58 (2018)
Soleimanmeigouni, I., Ahmadi, A., Kumar, U.: Track geometry degradation and maintenance modelling: a
review. Proc. Inst. Mech. Eng. Part F J. Rail Rapid Transit. 0(0), 1–30 (2016)
Wright, N.P., Gan, R., McVae, C.: Software and machine learning tools for monitoring railway track switch
performance. In: 7th IET Conf. Railw. Cond. Monit. 2016 (RCM 2016), pp. 1–7 (2016)
Xu, P., Sun, Q., Liu, R., Wang, F.: A short-range prediction model for track quality index. Proc. Inst. Mech.
Eng. Part F J. Rail Rapid Transit. 225, 277–285 (2011)
Zarembski, A.M., Einbinder, D., Attoh-Okine, N.: Using multiple adaptive regression to address the impact of
track geometry on development of rail defects. Constr. Build. Mater. 127, 546–555 (2016)
Publisher’s Note Springer Nature remains neutral with regard to jurisdictional claims in published maps and
institutional affiliations.