Modified DTW JATIT

You might also like

Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 8

Journal of Theoretical and Applied Information Technology

XXst Month 201x. Vol. x No.x


© 2005 - 2012 JATIT & LLS. All rights reserved.

ISSN: 1992-8645 www.jatit.org E-ISSN: 1817-3195

MODIFIED DYNAMIC TIME WARPING FOR


HIERARCHICAL CLUSTERING
1
MAHMOUD SAMMOUR, 2 ZULAIHA OTHMAN
Center for Artificial Intelligence Technology, Faculty of Information Science and Technology, Universiti
Kebangsaan Malaysia 43600 Bangi, Selangor Darul Ehsan, Malaysia.
E-mail: 1mahmoud.samour@gmail.com

ABSTRACT

Time series clustering is the process of grouping sequential correspondences in similar clusters. The key
characteristic behind clustering time series data lies on the similarity/distance function used to identify the
sequential matches. Dynamic Time Warping (DTW) is one of the common distance measures that have
demonstrated competitive results compared to other functions. DTW aims to find the shortest path in the
process of identifying sequential matches. DTW relies on a dynamic programming to obtain the shortest path
where the smaller distance is being computed. However, in the case of equivalent distances, DTW is selecting
the path randomly. Hence, the selection could be misguided in such randomization process which
significantly affect the matching quality. This is due to randomization may lead to longer path which drift
from obtaining the optimum path. This paper proposes a modified DTW that aims to enhance the dynamic
selection of the shortest path when handling equivalent distances. Experiments were conducted using twenty
UCR benchmark datasets. Results shown that the modified DTW reveal superior results in compared to the
standard. This can demonstrate the effectiveness of the proposed modification in which optimizing the
shortest path has enhanced the performance of clustering.
Keywords: Hierarchical Clustering, Dynamic Time Warping, Distance Measures

1. INTRODUCTION proposed to integrate with clustering techniques such


as Euclidean distance, Minkowski distance and
Time series has been emerged as a response to the Dynamic Time Warping (DTW) measures. DTW has
data evolution of chronological representation where been widely used for time series data due to its ability
the data been made in time intervals (Fu, 2011). to identify sequential correspondences.
There are many kinds of time series data such as
Dynamic Time Warping (DTW) is one of the
financial, weather forecasting, pattern recognition,
common similarity functions that has demonstrated
etc. (Wismüller et al., 2002). The common task of
competitive results compared to other measures.
time series data mining is the process of identifying
DTW aims to find the shortest path in the process of
similar sequences. Such process is performed using
identifying sequential matches. DTW relies on a
clustering techniques.
dynamic programming to obtain the shortest path
Clustering time series data aims to identify where the smaller distance is being computed.
sequential correspondences among two time However, in the case of equivalent distances, DTW
sequences (Rani and Sikka, 2012). Several domains is selecting the path randomly. Hence, the selection
have been examined in the context of time series could be misguided in such randomization process
such as financial, weather forecasting, pattern which significantly affect the matching quality. This
recognition, etc. (Wismüller, et al., 2002). Clustering is due to randomization may lead to longer path
can be performed using two main approaches; which drift from obtaining the optimum path.
partitioning (e.g. k-means clustering) and
Several studies have addressed this problem by
hierarchical clustering (e.g. agglomerative
proposing specific modifications for the DTW such
clustering) (Badr et al., 2015). In fact, each clustering
as computing a global averaging (Petitjean et al.,
technique is integrated with a particular similarity
2016; Petitjean et al., 2011) or computing the shifting
(distance) measure that has the ability to identify
estimation (Yuan et al., 2011). However, these
similarity among the objects. In fact, integrating an
modifications may not fit multiple time series with
appropriate similarity measure with an appropriate
different lengths. Therefore, this paper proposes a
clustering technique is a challenging task (Kalpakis
modified DTW that aims to enhance the dynamic
et al., 2001). Several similarity functions have been
selection of the shortest path when handling

1
Journal of Theoretical and Applied Information Technology
XXst Month 201x. Vol. x No.x
© 2005 - 2012 JATIT & LLS. All rights reserved.

ISSN: 1992-8645 www.jatit.org E-ISSN: 1817-3195


equivalent distances. In order to demonstrate the overcome both time shifting estimation and
effectiveness of the proposed modification, multiple significant indicators detection.
time series datasets with different lengths, have been
Zhu et al. (2012) have addressed the problem of
used in the experiment. In addition, the proposed
time-consumption when using DTW for large-scale
modified DTW has been compared with three
time series clustering. The authors have proposed a
similarity measures including standard DTW,
novel approximation for DTW in which the DTW
Euclidean and Mikowski distance measures.
distances can be bound between LB-Keogh and
2. RELATED WORK Euclidian distance functions. The key characteristic
behind the proposed method lies on the accurate
In fact, many research efforts have proposed
approximation gained by the two bounds LB and ED
enhancement and modification for the Dynamic
together. This can be performed by analyzing the
Time Warping (DTW) in order to improve its
best ‘mixing weight’ of the upper and lower bounds
functionality regarding specific domains. Such
by sampling a tiny fraction of the true DTW
enhancements have intended to reduce the time
distances. Many UCR benchmark datasets have been
consumed which leads to improve the efficiency, as
used to evaluate the proposed approximation.
well as, acquiring better accuracy which leads to
Results shown a significant reduction regarding
improve the effectiveness. For instance, Efrat & Fan
consumption when using the proposed
(Efrat et al., 2007) have addressed the problem of
approximation of DTW.
curve matching which is considered as a drawback
of DTW in which the curves are being sampled as Begum et al. (2015) proposed a pruning strategy
sequences to identify the similarity among two time- in order to accelerate the performance of DTW. Such
series. The authors suggested that such sampling of strategy aims to utilize both upper and lower bounds
curve is directly affect the quality of the results. In to prune off a large fraction of the expensive distance
this manner, the authors have proposed a continuous calculations. Using different domains of time series,
DTW for calculating similarity among curves using the authors have demonstrated the efficiency of the
exact and approximate matching algorithms. The proposed pruning strategy.
proposed continuous DTW has been evaluated using
Petitjean et al. (2016) Have addressed the problem
signature verification dataset and compared with the
arises when combining nearest neighbor
traditional DTW. The proposed continuous DTW
classification with DTW for time series data. Such
has outperformed the traditional (i.e. discrete) DTW.
problem can be represented as the inaccurate
In addition, Petitjean et al. (2011) have proposed centroid resemble in terms of the actual instances. In
a global averaging mechanism for DTW regarding this vein, the authors have utilized the global
clustering time series sequences. The authors have averaging mechanism introduced by Petitjean et al.
concentrated on the drawback of using DTW with (2011) in order to generate efficient nearest centroid
pairwise sequence similarity in which a two classifiers.
individual sequences are being compared with each
Silva et al. (2016) have addressed the problem of
other. The main drawback behind the pairwise
DTW in terms of discarding the prefixes and suffixes
matching lies on the sensitivity of sequence ordering.
of two time series. Such problem can be represented
The influence on sequence ordering has a significant
by the warping invariance that is being considered to
impact on the equality of results. Twenty UCR
the body of time series but not for the head and tails.
benchmark datasets have been used to evaluate the
In this vein, the authors have considered more points
proposed averaging DTW. Results shown superior
in the head and tails to be involved in the
performance for the proposed method compared to
computation of the DTW in order to provide more
the pairwise matching.
accurate resemble of the time series.
In the context of gene expression, Yuan et al.
3. MATERIALS AND METHOD
(2011) have figured out that the traditional DTW
cannot be applied to genes that have significant Hierarchical clustering algorithms work by
differences in different time series. Such differences aggregating the objects into a tree of clusters
may indicate significant factor rather than a simple (Aghagolzadeh et al., 2007). Hierarchical clustering
time change. Therefore, time shift estimation can be categorized into two types; agglomerative and
appears to be insufficient in terms of identifying divisive. Such categorization is inspired from the
similarity among two genes expressions. Therefore, mechanism of grouping the objects whether bottom-
a modified version of DTW has been proposed to up or top-down approach. Agglomerative
hierarchical clustering (AHC) is considered as a

2
Journal of Theoretical and Applied Information Technology
XXst Month 201x. Vol. x No.x
© 2005 - 2012 JATIT & LLS. All rights reserved.

ISSN: 1992-8645 www.jatit.org E-ISSN: 1817-3195


bottom-up hierarchical approach where each object minimum distance/similarity between the two series
set in a separated cluster then AHC will merge such is of interest. Hence, the DTW can be computed as
clusters into larger clusters (Rani and Sikka, 2012). follows:
Such process is continuing until a specific
∑𝐾𝑘=1 𝑝𝐾
termination has been reached. Complete linkage 𝑑𝐷𝑇𝑊 = 𝑚𝑖𝑛 (3)
algorithm aims to identify the similarity between two 𝐾
clusters by measure two nearest data points that are
located in different clusters. Hence, the merge will 3.3. Modified Dynamic Time Warping (M-
be done between the clusters that have minimum DTW)
distance -most similar- between each other. In this
Let A and B are two time series that consist of data
paper, AHC has been applied with multiple distance
points as A = {𝑃𝐴1 , 𝑃𝐴2 , 𝑃𝐴3 , 𝑃𝐴4 , 𝑃𝐴5 , 𝑃𝐴6 }, and B
measures including Euclidean distance, Minkowski
distance, standard DTW and the modified DTW as = {𝑃𝐵1 , 𝑃𝐵2 , 𝑃𝐵3 , 𝑃𝐵4 , 𝑃𝐵5 , 𝑃𝐵6 , 𝑃𝐵7 }. DTW will
similarity measures. establish the similarity matrix between the two time
series by computing the Euclidean distance between
3.1. Euclidean and Minkowski Distance each corresponding pair of the data points as shown
Measures in Table 1.
Let 𝑥𝑖 and 𝑣𝑗 be a P-dimensional vector, then the Table 1. Similarity matrix
Euclidean distance can be measured as (Liao, 2005):
B PA1 PA2 PA3 PA4 PA5 PA6
A
𝑃
2
PB1 0 0 1 4 1 1
𝑑𝐸 = √∑(𝑥𝑖𝑘 − 𝑣𝑗𝑘 ) (1)
PB2 1 1 0 0 0 4
𝑘=1
PB3 4 4 1 0 0 9
Minkowski distance is a generalization of
Euclidean distance, which is computed as follows PB4 1 1 0 1 0 4
(Liao, 2005): PB5 1 0 1 4 1 1

𝑃 PB6 0 1 1 4 1 1
𝑞
𝑞
𝑑𝑀 = √∑(𝑥𝑖𝑘 − 𝑣𝑗𝑘 ) (2) PB7 1 1 4 9 4 0
𝑘=1

Where q is a positive integer. The objective of this matrix is to find the shortest
path by starting from the top most right corner
3.2. Dynamic Time Warping (DTW) 𝑃(𝑃𝐴6 , 𝑃𝐵1 ) and using dynamic programming to
DTW has been widely used to compare between reach the point of origin 𝑃(𝑃𝐴1 , 𝑃𝐵7 ) accumulating
discrete sequences and sequences of continuous the sum of the similarity between the went thru
points (accumulated path).
values (Liao, 2005). Let 𝑆 = {𝑠1 , 𝑠2 , … , 𝑠𝑖 , … , 𝑠𝑛 }
and 𝑇 = {𝑡1 , 𝑡2 , … , 𝑡𝑗 , … , 𝑡𝑛 } be a two time series On 𝑃(𝑃𝐴6 , 𝑃𝐵1 ), DTW will pick the minimum
sequences. DTW will minimize the differences between three points 𝑃(𝑖 − 1, 𝑗) = 𝑃(𝑃𝐴5 , 𝑃𝐵1 ),
among these series by representing a matrix of 𝑛 × 𝑃(𝑖, 𝑗 − 1) = 𝑃(𝑃𝐴6 , 𝑃𝐵2 ) or 𝑃(𝑖 − 1, 𝑗 − 1) =
𝑚. In such matrix, the distance/similarity between 𝑠𝑖 𝑃(𝑃𝐴5 , 𝑃𝐵2 ). Since 𝑃(𝑃𝐴5 , 𝑃𝐵2 ) = 0 which is the
and 𝑡𝑗 will be calculated using Euclidean distance. minimum value thus, it was selected and the
accumulated cost was increment by its value. And so
However, a warping path 𝑃= on until the point P(𝑃𝐴1 , 𝑃𝐵7 ) is reached.
{𝑝1 , 𝑝2 , … , 𝑝𝑘 , … , 𝑝𝐾 } where max(𝑚, 𝑛) ≤ 𝐾 ≤ 𝑚 +
𝑛 − 1 will be elements from the matrix that meet On point 𝑃(𝑃𝐴5 , 𝑃𝐵2 ), all the neighboring cells
three constraints including boundary condition, have equivalent values, here the randomization is
continuity and monotonicity. The boundary taking a place in which the movement will be
condition constraint requires the warping path to selected randomly as shown in Fig. 1 (a). Similarly,
start and finish in diagonally opposite corner cells of in the points 𝑃(𝑃𝐴3 , 𝑃𝐵2 ) the selection has been
the matrix. That is 𝑝1 = (1,1) and 𝑝𝐾 = (𝑚, 𝑛). The performed randomly.
continuity constraint restricts the allowable steps to Hence, the modification of DTW has been
adjacent cells. The monotonicity constraint forces conducted by setting the value of 𝑃(𝑖 − 1, 𝑗 − 1)
the points in the warping path to be monotonically when handling neighbors with similar values, this
spaced in time. The warping path that has the can be represented in Fig. 1 (b).

3
Journal of Theoretical and Applied Information Technology
XXst Month 201x. Vol. x No.x
© 2005 - 2012 JATIT & LLS. All rights reserved.

ISSN: 1992-8645 www.jatit.org E-ISSN: 1817-3195

(a) Standard DTW (b) Modified DTW

Figure 1. Obtaining the shortest path

As shown in Fig. 1, the summation of the path for 0 + 0 + 0 + 0 + 1} = 2. Apparently, the results of
the standard DTW can be expressed as 𝑆𝑢𝑚 = both applications shown variant values. Such
{1 + 0 + 0 + 0 + 1 + 0 + 0 + 0 + 1} = 3, whereas difference may have a potential impact on the final
for the modified DTW 𝑆𝑢𝑚 = {1 + 0 + 0 + 0 + outcome of the time series clustering. Fig. 2 depicts
the pseudo code of the modified DTW.
Algorithm 1. Modified Dynamic Time Warping
1 Input:
2 S // is a 2-dimensional matrix with 𝑚 × 𝑛
3 𝑉1 = {𝑠1 , 𝑠2 , … . , 𝑠𝑛 } //is the first vector with n time points
4 𝑉2 = {𝑠1 , 𝑠2 , … . , 𝑠𝑚 } // is the second vector with m time points
5 𝑖, 𝑗 // are loop index, cost is an integer
6
7 Steps:
8 Initialize the matrix
9 S[0,0] = 0;
10 FOR i = 1 to m DO LOOP
11 S[0 , i ] = ∞
12 END
13 FOR i = 1 to n DO LOOP
14 S[i , 0 ] = ∞
15 END
16 Populating the similarity matrix
17 FOR i = 1 to n DO LOOP
18 FOR j = 1 to m DO LOOP
19 Cost = d(V1[i], V2[j])//Euclidean distance
20 S[i , j] = cost + MINI
21 MINI = IF S[I-1, J-1] equals S[i, j-1]
22 MINI = S[i-1, j-1]
23 OR if S[i-1, j-1] equals S[i-1, j]
24 MINI = S[i-1, j-1]
25 ELSE MINI = Minimum ( S[i-1 , j] , //increment
26 S[i , j-1] , //decrement
27 S[i-1, j-1] ) //match
28 END
29 END
30 Return S[n , m]
31
32
33

Figure 2, Modified DTW pseudo code

4
Journal of Theoretical and Applied Information Technology
XXst Month 201x. Vol. x No.x
© 2005 - 2012 JATIT & LLS. All rights reserved.

ISSN: 1992-8645 www.jatit.org E-ISSN: 1817-3195


4. EXPERIMENTAL RESULTS is the instance that is not affiliated within a cluster
and it is not related to it.
Twenty time series datasets have been used in this
study from UCR time series data archive (Keogh and Based on the mentioned evaluation metrics, Table
Folias, 2002). Such archive contains several time 2 depicts the results of Minkowski, Euclidean, DTW
series datasets that are related to multiple domains and modified DTW for all datasets.
with different lengths.
As shown in Table 2, for Minkowski, the results
Basically, the evaluation has been conducted were relatively similar to Euclidean except for
based on the distribution of members within the datasets 18 and 19 where Minkowski has shown
cluster in which the majority will be considered as relatively higher performance. Vice versa, for
the class label. Consequentially, the common datasets 17 and 20 Minkowski shown poor
information retrieval metrics precision, recall and f- performance compared to Euclidean.
measure are being used to evaluate the clusters. Such
For DTW, the results were relatively higher than
metrics can be computed as follows:
Euclidean’s results especially for datasets 11, 16, 18
|𝑇𝑃| and 19 where the superiority of DTW was
𝑃𝑟𝑒𝑐𝑖𝑠𝑖𝑜𝑛 = (4)
|𝑇𝑃| + |𝐹𝑃| remarkable.
|𝑇𝑃| For the modified DTW, the results of all the
𝑅𝑒𝑐𝑎𝑙𝑙 = (5)
|𝑇𝑃| + |𝐹𝑁| datasets were better than Euclidean’s results
𝑃×𝑅 (6) especially for datasets 11, 13, 17, 18, 19 and 20
𝐹 − 𝑚𝑒𝑎𝑠𝑢𝑟𝑒 = 2 × where the performance has significantly increased.
𝑃+𝑅
Fig. 3 depicts the performances for each distance
where TP is the instance that has been affiliated measure respectively, while Fig. 4 shows the
within a cluster and related to this cluster, FP is the comparison among the distance measures based on
instance that has been affiliated within a cluster but f-measure.
not related, TN is the instance that is not affiliated
within a cluster but it is related to it, and finally FN
Table 2. Results of the four distance measures based on Precision, Recall and F-measure for all datasets

Dataset Euclidean Minkowski DTW Modified DTW


P R F P R F P R F P R F
1 TwoLeadECG 0 0 0 +1 +1 +1 -3 +6 +3 -5 +12 +6
2 OSU Leaf 0 0 0 -4 -2 -3 14 +7 +10 +4 +1 +4
3 Phoneme 0 0 0 0 0 0 -9 +3 +11 -3 +8 +11
4 ArrowHead 0 0 0 0 0 +2 -7 0 0 +16 +1 +8
5 DistalPhalanxTW 0 0 0 +9 +7 +8 10 +1 +3 +5 +12 +10
6 FordA 0 0 0 +2 0 +1 2 +1 +1 +1 +4 +2
7 FISH 0 0 0 +5 -3 0 20 +5 +10 +19 -1 +12
8 50Words 0 0 0 0 -1 -2 -1 +8 +7 +4 +14 +10
9 Beef 0 0 0 -19 +19 +1 2 +4 -1 +1 +19 +7
10 Swedish Leaf 0 0 0 -5 +1 0 6 +7 +8 +12 +10 +12
11 Face (all) 0 0 0 +4 +6 +5 30 +23 +25 +34 +10 +27
12 Computers 0 0 0 +21 0 +8 3 +1 +2 +11 +12 +12
13 Lightning-7 0 0 0 +11 -6 0 1 +9 +9 +10 +18 +17
14 Gun-Point 0 0 0 -4 -2 -3 +12 +8 +10 +14 +4 +6
15 MiddlePhalanxOutlineAgeGroup 0 0 0 +1 +3 -1 +7 17 +9 +6 +7 +7
16 ProximalPhalanxOutlineCorrect 0 0 0 +11 0 +3 +10 30 +20 +10 +21 +16
17 DistalPhalanxOutlineAgeGroup 0 0 0 -10 -20 -19 +20 0 +3 +9 +8 +6
18 BeetleFly 0 0 0 +35 +25 +30 +25 19 +21 +25 +21 +24
19 ProximalPhalanxOutlineAgeGroup 0 0 0 +30 +17 +23 +33 36 +34 +36 +39 +37
20 CBF 0 0 0 -30 -10 -19 -7 6 +1 +11 +31 +22

5
Journal of Theoretical and Applied Information Technology
XXst Month 201x. Vol. x No.x
© 2005 - 2012 JATIT & LLS. All rights reserved.

ISSN: 1992-8645 www.jatit.org E-ISSN: 1817-3195

Euclidean Minkowski
100 100

50 50

0 0
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

P R F P R F

DTW Modified DTW


100 100
80 80
60 60
40 40
20 20
0 0
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

P R F P R F

Figure 3. Results of precision, recall and f-measure for the four distance measures

F-measure
100

80

60

40

20

0
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

Euclidean Minkowski DTW Modified DTW

Figure 4. Comparison among the four distance measures based on f-measure

As shown in Fig 4, The result shows that DTW has outperformed the other distance functions in terms of
outperformed both of Minkowski and Euclidean f-measure for 6 datasets including '4, 5, 6, 9, 12, 18'.
distance measures in terms of f-measure for 14 out Apparently, Euclidean distance function has
of 20 data sets including ‘1, 2, 3, 7, 8, 10, 11, 13, 14, obtained the lowest f-measure values for all datasets.
15, 16, 17, 19, 20’. In contrast, Minkowski has

6
Journal of Theoretical and Applied Information Technology
XXst Month 201x. Vol. x No.x
© 2005 - 2012 JATIT & LLS. All rights reserved.

ISSN: 1992-8645 www.jatit.org E-ISSN: 1817-3195


In contrast between DTW and the modified DTW, series clustering applications. Twenty UCR
DTW has outperformed the modified DTW in terms benchmark datasets have been used in the
of F-measure for five datasets including ‘2, 10, 14, experiments. The clustering has been performed
15, 16’. This is due to the limitation of AHC in which using Agglomerative Hierarchical Clustering (AHC)
the clusters merging process cannot be undone (Gao using multiple distance measures including
et al., 2010). In this manner, the modification of Euclidean distance, Minkowski distance, standard
DTW will lead to merge irrelevant clusters. Similar DTW and the proposed modified DTW. Results
to incorrect rejection of true in which the error may shown that the proposed DTW has outperformed the
lead to better results, the randomization of acquiring other distance measures. This can demonstrate the
the shortest path using DTW will avoid such impact of acquiring the shortest path on the
limitation of AHC. clustering results. However, this study has not
addressed the impact of the modified DTW on the
On the other hand, the modified DTW has
efficiency thus; future researches could be examine
outperformed DTW for the remaining 15 datasets. In
the time and memory consuming using the proposed
this manner, the modification of DTW to acquire the
DTW.
shortest path has successfully enhanced the quality
of clustering results. Figure 3 shows both ACKNOWLEDGEMENT
performances of DTW and modified DTW.
This study is supported by the University
The superiority of DTW compared to Euclidean Kebangsaan Malaysia (UKM) and funded by
and Minkowski distance measures was expected research grant (AP2013-007).
from the literature where several clustering
REFERENCES
approaches have been proposed for time series using
DTW for various domains (Al-Naymat et al., 2009; M. Aghagolzadeh, et al. (2007) 'A Hierarchical
Niennattrakul and Ratanamahatana, 2007; Petitjean, Clustering Based on Mutual Information
et al., 2011; Yuan, et al., 2011; Zhu, et al., 2012). Maximization', Image Processing, 2007. ICIP
2007. IEEE International Conference on, pp. I -
Multiple comparative analysis studies have been
277-I - 280.
presented to compare different distance measures
G. Al-Naymat, et al. (2009) 'SparseDTW: a novel
with clustering technique. For instance, Ferreira &
approach to speed up dynamic time warping',
Zhao (Ferreira and Zhao, 2016) have proposed a
Proceedings of the Eighth Australasian Data
clustering approach for time series community
Mining Conference-Volume 101, Australian
detection in social network using multiple distance
Computer Society, Inc., pp. 117-127.
measures including ED, Infinite Norm, DTW,
H.S. Badr, et al. (2015) 'A tool for hierarchical
Person Correlation, Wavelet Transform, and
climate regionalization', Earth Science
Integrated Periodogram. Among these distance
Informatics, pp. 1-10 (Access 2015
functions, DTW has the superior performance
N. Begum, et al. (2015) 'Accelerating dynamic time
compared to other measures.
warping clustering with a novel admissible
However, several researchers attempted to modify pruning strategy', Proceedings of the 21th ACM
DTW for instance, Zhu et al. (2012) have modified SIGKDD International Conference on
DTW in terms of time-consuming. Whereas, Yuan Knowledge Discovery and Data Mining, ACM,
et al. (2011) have attempted to make DTW suitable pp. 49-58.
for specific domain specifically Gene expressions. A. Efrat, et al. (2007) 'Curve matching, time
Furthermore, Petitjean et al. (2011) have modified warping, and light fields: New algorithms for
DTW to be able to identify similarity via global computing similarity between curves', Journal of
averaging mechanism rather than the pair-wise Mathematical Imaging and Vision, Vol. 27 No.
sequence matching. Similar to the mentioned 3, pp. 203-216 (Access 2007
modifications, this study has proposed a modified L.N. Ferreira and L. Zhao (2016) 'Time series
DTW by optimizing the acquired shortest path. clustering via community detection in networks',
Results shown a superior performance compared to Information Sciences, Vol. 326, pp. 227-242
the standard DTW. (Access 2016
T.-c. Fu (2011) 'A review on time series data
5. CONCLUSION
mining', Engineering Applications of Artificial
This study has proposed a modification for Intelligence, Vol. 24 No. 1, pp. 164-181 (Access
Dynamic Time Warping (DTW) in terms of 2011
acquiring the shortest path for hierarchical time

7
Journal of Theoretical and Applied Information Technology
XXst Month 201x. Vol. x No.x
© 2005 - 2012 JATIT & LLS. All rights reserved.

ISSN: 1992-8645 www.jatit.org E-ISSN: 1817-3195


H. Gao, et al. (2010) 'A New Agglomerative
Hierarchical Clustering Algorithm
Implementation based on the Map Reduce
Framework', JDCTA, Vol. 4 No. 3, pp. 95-100
(Access 2010
K. Kalpakis, et al. (2001) 'Distance measures for
effective clustering of ARIMA time-series', Data
Mining, 2001. ICDM 2001, Proceedings IEEE
International Conference on, IEEE, pp. 273-280.
E. Keogh and T. Folias (2002) 'The UCR time series
data mining archive', Computer Science &
Engineering Department, University of
California, Riverside, CA. http://www. cs. ucr.
edu/eamonn/TSDMA/index. html, (Access 2002
T.W. Liao (2005) 'Clustering of time series data—a
survey', Pattern recognition, Vol. 38 No. 11, pp.
1857-1874 (Access 2005
V. Niennattrakul and C.A. Ratanamahatana (2007)
'On clustering multimedia time series data using
k-means and dynamic time warping', Multimedia
and Ubiquitous Engineering, 2007. MUE'07.
International Conference on, IEEE, pp. 733-738.
F. Petitjean, et al. (2016) 'Faster and more accurate
classification of time series by exploiting a novel
dynamic time warping averaging algorithm',
Knowledge and information Systems, Vol. 47
No. 1, pp. 1-26 (Access 2016
F. Petitjean, et al. (2011) 'A global averaging method
for dynamic time warping, with applications to
clustering', Pattern recognition, Vol. 44 No. 3,
pp. 678-693 (Access 2011
S. Rani and G. Sikka (2012) 'Recent techniques of
clustering of time series data: A Survey', Int. J.
Comput. Appl, Vol. 52 No. 15, pp. 1-9 (Access
2012
D.F. Silva, et al. (2016) 'On the Effect of Endpoints
on Dynamic Time Warping', (Access 2016
A. Wismüller, et al. (2002) 'Cluster analysis of
biomedical image time-series', International
Journal of Computer Vision, Vol. 46 No. 2, pp.
103-128 (Access 2002
Y. Yuan, et al. (2011) 'Development and application
of a modified dynamic time warping algorithm
(DTW-S) to analyses of primate brain expression
time series', BMC bioinformatics, Vol. 12 No. 1,
pp. 347 (Access 2011
Q. Zhu, et al. (2012) 'A Novel Approximation to
Dynamic Time Warping allows Anytime
Clustering of Massive Time Series Datasets',
SDM, SIAM, pp. 999-1010.

You might also like