Professional Documents
Culture Documents
Download pdf Algorithms And Architectures For Parallel Processing 13Th International Conference Ica3Pp 2013 Vietri Sul Mare Italy December 18 20 2013 Proceedings Part I 1St Edition Antonio Balzanella ebook full chapter
Download pdf Algorithms And Architectures For Parallel Processing 13Th International Conference Ica3Pp 2013 Vietri Sul Mare Italy December 18 20 2013 Proceedings Part I 1St Edition Antonio Balzanella ebook full chapter
https://textbookfull.com/product/computational-logistics-9th-
international-conference-iccl-2018-vietri-sul-mare-italy-
october-1-3-2018-proceedings-raffaele-cerulli/
Algorithms and
LNCS 8285
Architectures
for Parallel Processing
13th International Conference, ICA3PP 2013
Vietri sul Mare, Italy, December 2013
Proceedings, Part I
123
Lecture Notes in Computer Science 8285
Commenced Publication in 1973
Founding and Former Series Editors:
Gerhard Goos, Juris Hartmanis, and Jan van Leeuwen
Editorial Board
David Hutchison
Lancaster University, UK
Takeo Kanade
Carnegie Mellon University, Pittsburgh, PA, USA
Josef Kittler
University of Surrey, Guildford, UK
Jon M. Kleinberg
Cornell University, Ithaca, NY, USA
Alfred Kobsa
University of California, Irvine, CA, USA
Friedemann Mattern
ETH Zurich, Switzerland
John C. Mitchell
Stanford University, CA, USA
Moni Naor
Weizmann Institute of Science, Rehovot, Israel
Oscar Nierstrasz
University of Bern, Switzerland
C. Pandu Rangan
Indian Institute of Technology, Madras, India
Bernhard Steffen
TU Dortmund University, Germany
Madhu Sudan
Microsoft Research, Cambridge, MA, USA
Demetri Terzopoulos
University of California, Los Angeles, CA, USA
Doug Tygar
University of California, Berkeley, CA, USA
Gerhard Weikum
Max Planck Institute for Informatics, Saarbruecken, Germany
Joanna Kołodziej Beniamino Di Martino
Domenico Talia Kaiqi Xiong (Eds.)
Algorithms and
Architectures
for Parallel Processing
13th International Conference, ICA3PP 2013
Vietri sul Mare, Italy, December 18-20, 2013
Proceedings, Part I
13
Volume Editors
Joanna Kołodziej
Cracow University of Technology, Institute of Computer Science
Cracow, Poland
E-mail: jokolodziej@pk.edu.pl
Beniamino Di Martino
Seconda Università di Napoli, Dipartimento di Ingegneria
Industriale e dell’ Informazione
Aversa, Italy
E-mail: beniamino.dimartino@unina.it
Domenico Talia
DIMES and ICAR-CNR, c/o Università della Calabria
Rende, Italy
E-mail: talia@deis.unical.it
Kaiqi Xiong
Rochester Institute of Technology, B. Thomas College
of Computing and Information Sciences
Rochester, NY, USA
E-mail: kxxics@rit.edu
Last but not least, we heartily thank all authors for submitting and pre-
senting their high-quality papers to the ICA3PP 2013 main conference and
workshops/symposia.
We are very happy to welcome readers to the proceedings of the 13th Inter-
national Conference on Algorithms and Architectures for Parallel Processing
(ICA3PP 2013) held in Vietri sul Mare, Italy, in December 2013.
ICA3PP 2013 was the 13th event in this series of conferences that is de-
voted to algorithms and architectures for parallel processing starting from 1995.
ICA3PP is now recognized as a main regular scientific venue internationally,
covering the different aspects and issues of parallel algorithms and architec-
tures, encompassing fundamental theoretical approaches, practical experimental
results, software systems, and product components and applications.
As the use of computing systems has permeated in every aspect of daily life,
their scalability, adaptation, and distribution in human environments have be-
come increasingly critical and vital. Specifically, the main areas of the conference
include cluster, distributed and parallel systems, and middeware that cover a va-
riety of topics such as big data, multi-core programming and software tools, dis-
tributed scheduling and load balancing, high-performance scientific computing,
parallel algorithms, parallel architectures, scalable and distributed databases,
dependability in distributed and parallel systems, as well as wireless and mobile
computing.
ICA3PP 2013 provided a widely known forum for researchers and practi-
tioners from many countries around the world to exchange ideas on improving
the computational power and functionality of computing systems through the
exploitation of parallel and distributed computing techniques and models.
The ICA3PP 2013 call for papers received a great deal of attention from
the computer science community and 90 manuscripts were submitted from 33
countries. These papers were strictly evaluated on the basis of their original-
ity, significance, correctness, relevance, and technical quality. Each paper was
reviewed by at least three members of the Program Committee and external
reviewers. Based on these evaluations of the papers submitted, 10 distinguished
papers and 31 regular papers were selected for presentation at the conference,
representing 11% of acceptance for distinguished papers and 34% for regular
papers. This book consists of the two parts: distinguished papers and regular
papers. All of them are considered as the papers of the main conference tracks.
The success of any conference depends on its authors, reviewers, and orga-
nizers. ICA3PP 2013 was no exception. We are grateful to all the authors who
submitted their papers and to all the reviewers for their outstanding work in
refereeing the papers on a very tight schedule. We relied heavily on a team of
volunteers, especially those in Italy, to keep the ICA3PP 2013 wheel turning.
VIII Message from the ICA3PP 2013 Program Chairs
Last but not least, we would also like to take this opportunity to thank the
whole LNCS Springer editorial team for their support in the preparation of this
publication.
ICA3PP 2013 was organized by the Second University of Naples, Italy, Depart-
ment of Industrial and Information Engineering, and St. Francis Xavier Univer-
sity, Canada, Department of Computer Science. It was hosted by the Second
University of Naples in Vietri sul Mare (Italy) during December 18–20, 2013.
Steering Committee
Andrzej Gościński Deakin University, Australia
Yi Pan Georgia State University, USA
Yang Xiang Deakin University, Australia
Advisory Committee
Minyi Guo Shanghai Jiao Tong University, China
Ivan Stojmenovic University of Ottawa, Canada
Koji Nakano Hiroshima University, Japan
Conference Organizers
General Chairs
Beniamino Di Martino Second University of Naples, Italy
Albert Y. Zomaya The University of Sydney, Australia
Christian Engelmann Oak Ridge National Laboratory, USA
Program Chairs
Joanna Kolodziej Cracow University of Technology, Poland
Domenico Talia Università della Calabria, Italy
Kaiqi Xiong Rochester Institute of Technology, USA
Workshop Chairs
Rocco Aversa Second University of Naples, Italy
Jun Zhang Deakin University, Australia
Publicity Chairs
Xiaojun (Matt) Cao Georgia State University, USA
Shui Yu Deakin University, Australia
X Organization
Web Chair
Sazzad Hussain St. Francis Xavier University, Canada
Local Organization
Pasquale Cantiello Second University of Naples, Italy
Giuseppina Cretella Second University of Naples, Italy
Luca Tasquier Second University of Naples, Italy
Alba Amato Second University of Naples, Italy
Loredana Liccardo Second University of Naples, Italy
Serafina Di Biase Second University of Naples, Italy
Paolo Pariso Second University of Naples, Italy
Angela Brunitto Second University of Naples, Italy
Marco Scialdone Second University of Naples, Italy
Antonio Esposito Second University of Naples, Italy
Vincenzo Reccia Second University of Naples, Italy
Program Committee
Alba Amato Second University of Naples, Italy
Henrique Andrade JP Morgan, USA
Cosimo Anglano Università del Piemonte Orientale, Italy
Ladjel Bellatreche ENSMA, France
Jorge Bernal Bernabe University of Murcia, Spain
Ateet Bhalla Oriental Institute of Science and Technology,
Bhopal, India
George Bosilca University of Tennessee, USA
Surendra Byna Lawrence Berkeley National Lab, USA
Aleksander Byrski AGH University of Science and Technology,
Poland
Massimo Cafaro University of Salento, Italy
Pasquale Cantiello Second University of Naples, Italy
Eugenio Cesario ICAR-CNR, Italy
Ruay-Shiung Chang National Dong Hwa University, Taiwan
Organization XI
Distinguished Papers
Clustering and Change Detection in Multiple Streaming Time Series . . . 1
Antonio Balzanella and Rosanna Verde
Regular Papers
PastryGridCP: A Decentralized Rollback-Recovery Protocol
for Desktop Grid Systems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 143
Heithem Abbes and Thouraya Louati
Improving Continuation-Powered Method-Level Speculation for JVM
Applications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 153
Ivo Anjo and João Cachopo
Applicability of the (m,k)-firm Approach for the QoS Enhancement
in Distributed RTDBMS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 166
Malek Ben Salem, Fehima Achour, Emna Bouazizi,
Rafik Bouaziz, and Claude Duvallet
A Parallel Distributed System for Gene Expression Profiling
Based on Clustering Ensemble and Distributed Optimization . . . . . . . . . . 176
Zakaria Benmounah and Mohamed Batouche
Unimodular Loop Transformations with Source-to-Source Translation
for GPUs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 186
Pasquale Cantiello, Beniamino Di Martino, and Francesco Piccolo
HMHS: Hybrid Multistage Heuristic Scheduling Algorithm
for Heterogeneous MapReduce System . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 196
Heng Chen, Yao Shen, Quan Chen, and Minyi Guo
Dynamic Resource Management in a HPC and Cloud Hybrid
Environment . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 206
Miao Chen, Fang Dong, and Junzhou Luo
Candidate Set Parallelization Strategies for Ant Colony Optimization
on the GPU . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 216
Laurence Dawson and Iain A. Stewart
Synchronization-Reducing Variants of the Biconjugate Gradient and
the Quasi-Minimal Residual Methods . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 226
Stefan Feuerriegel and H. Martin Bücker
Memory Efficient Multi-Swarm PSO Algorithm in OpenCL
on an APU . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 236
Wayne Franz, Parimala Thulasiraman, and Ruppa K. Thulasiram
Multi-objective Parallel Machines Scheduling for Fault-Tolerant Cloud
Systems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 247
Jakub Ga̧sior and Franciszek Seredyński
Exploring Irregular Reduction Support in Transactional Memory . . . . . . . 257
Miguel A. Gonzalez-Mesa, Ricardo Quislant, Eladio Gutierrez, and
Oscar Plata
Table of Contents – Part I XVII
H-DB: Yet Another Big Data Hybrid System of Hadoop and DBMS . . . . 324
Tao Luo, Guoliang Chen, and Yunquan Zhang
IDES Project: A New Effective Tool for Safety and Security in the
Environment . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 201
Francesco Gargiulo, G. Persechino, M. Lega, and A. Errico
1 Introduction
A growing number of applicative fields is generating huge amounts of tempo-
ral data. We can take, for instance, data generated by real-time trade surveil-
lance systems for security fraud and money laundering, communication networks,
power consumption measurement, dynamic tracing of stock fluctuations, inter-
net traffic, industry production processes, scientific and engineering experiments,
remote sensors. In these contexts, data are sequences of values or events obtained
through repeated measurements over time.
Traditional learning techniques for temporal data have their reference in tem-
poral data mining and time series analysis literature [22], however new challenges
emerge when data are collected on-line, fast changing, massive, and potentially
infinite.
Data stream mining [25] is the recent research field which aims at developing
new methods for the analysis of such kind of data.
Algorithms for data stream mining should meet a set of stringent design
criteria[14]: i) time required for processing the incoming observations has to
be small and constant, ii) memory requirements have to be reduced with refer-
ence to the amount of data to be processed, iii) algorithms have to perform only
J. Kolodziej et al. (Eds.): ICA3PP 2013, Part I, LNCS 8285, pp. 1–14, 2013.
c Springer International Publishing Switzerland 2013
2 A. Balzanella and R. Verde
one scan of the data iv) knowledge about data should be available at any point
in time or on user demand.
In order to deal with these requirements, the analysis of stream data is, often,
performed using synopses, which are data structures storing summaries of the
incoming data. The general idea is to update and/or generate synopsis data
structures every time a new element or a batch of elements is collected and
then, to extract the knowledge starting from the summaries rather than directly
from the observations.
This mode of processing data enforces a trade-off between the accuracy of the
data mining techniques and the computational and storage constraints. Thus,
one of the challenges is to develop effective and space-time efficient synopses
which allow to provide a good approximation of the results of data analysis
techniques for stocked data.
Several kinds of synopses have been developed for data stream processing:
sampling based on the reservoir sampling algorithm [24][2], on line histogram
construction [13], wavelets as data stream synopsis [16], sketches, which are
small-space summaries for a distribution vector (e.g., histogram) based on using
randomized linear projections of the underlying data vectors [12][15], finally,
techniques aiming at representing streaming time series into reduced space[20][3].
Data stream mining methods use synopses for performing, on-line or off-line,
usual data mining tasks such as frequent pattern mining, association rules de-
tection, classification, clustering.
In this paper, we focus on a specific clustering problem. We propose a new
strategy able to detect a partition of a set of streams into clusters and a repre-
sentation of the clusters through suitable prototype functions.
In data stream framework, the clustering aims at dealing with two different
challenges. The first is to analyze a single univariate or multivariate data stream
with the objective to reveal a partition of the observations it is composed of.
The second is to analyze multiple data streams generated by several sources (for
example sensor networks) to find (and make available at any time), a partition
of streams which behave similarly through time.
The latter is the topic of this paper and is usually referred to as clustering of
time series data streams since it considers each data stream as a univariate time
series of potentially infinite size, which acquires, on-line, new observations.
Due to the constraints in data stream analysis, clustering of time series data
streams is strongly differentiated by the batch clustering of time series. In par-
ticular, it needs to satisfy several specific requirements: it has to be possible to
recover the clustering structure of a user chosen time interval without perform-
ing a further pass on data; summaries should be recorded on-line in order to
get an overview of the monitored phenomenon over time; algorithms should still
support the addition of new streams during the monitoring process and adapt
the clustering to the new available information.
A further important issue is to take into account the evolution of data and the
consequent, possible, change in the proximity relations among the streams. The
evolution in the proximities impacts on the clustering structure which should
Clustering and Change Detection of Multiple Data Streams 3
2 State of Art
The data stream clustering problem has been widely dealt with in recent years.
Several proposals ([17] [1]) address the challenge of the clustering of the obser-
vations of univariate or multivariate data streams (a wide review is available
in [18]). The clustering of multiple data streams, which is the challenge of this
paper, is a more recent topic. Interesting proposals have been introduced in
[5][6][23] The first one is an extension, to the data stream framework, of the
k-means algorithm for time series. The main drawback of this strategy is the
inability to deal with evolving data streams. This is because the final data parti-
tion only depends from the data of the most recent window. Moreover it does not
allow to interrogate for the clustering structure over user defined time intervals.
The second proposal, named Clustering On Demand (COD), is based on per-
forming an on-line scan of the data for dimensionality reduction using a multi
resolution approach based on wavelet transformation or on piecewise linear re-
gression. The reduced time series are indexed by suitable hierarchies which are
4 A. Balzanella and R. Verde
proximity matrix which is updated with the new proximities at each clustering
process performed on the set of subsequences in each window.
The on-line phase is then stopped at a certain time τ , which corresponds to
a number of processed windows set apriori or on demand by the user. A global
partitioning of the streams recorded until the stopping of the on-line clustering is
then performed, off-line, using the DCLUST algorithm on dissimilarity matrix
[11]. In particular, we consider in input the proximity matrix which contains
information about the proximities between the pairs of subsequences as updated
at each on-line clustering.
Repeating this off-line phase for different intervals of time in which the on-line
process is stopped, it is possible to discover changes in the structure of clustering
that corresponds to changes in the behavior of the subsequences along time.
According to the introduced clustering schema, the first step consists in splitting
the incoming parallel streams into non overlapping windows, where each window
w = 1, . . . , ∞, is an ordered subset of T having size s which frames a subset
Yiw = {yj , . . . , yj+s } of Yi called subsequence.
A Dynamic Cluster Algorithm (DCA) [10][9] is then performed on the sub-
sequences Y1w , . . . , Yiw , . . . , Ynw of the current window by looking for P ∗ among
all the partitions PC of Y w in C classes and L∗ among all the representations
spaces LC of the partition of Y w in C classes, such that is minimized a measure
Δ of fitting between L and P :
The first function concerns the representation structure L for the classes
{C1 , . . . , Ck } ∈ P . Our proposal is based on a description of the classes again in
terms of subsequences, so that we choose to represent the clusters by profiles of
the subsequences belonging to the clusters, here called prototypes, here denoted
as Lw = (g1w , . . . , gkw , . . . , gK
w
) for the cluster Ckw (with k = 1, . . . , C) of each
w w
partition P of the set Y (for each window w).
The definition of the allocationf unction f allows to assign a subsequence
Yiw to a cluster Ckw according to its similarity with the prototype gkw .
Given a suitable distance function d(·), the criterion Δ(P w , Lw ) optimized in
DCA is usually defined as the sum of the measures d(Yiw , gkw ) of fitting between
6 A. Balzanella and R. Verde
each subsequence Yiw in a cluster Ckw ∈ P w and the prototypes gkw , expressed
as:
K
Δ(P w , Lw ) = d(Yiw , gkw ) (1)
k=1 Yiw ∈Ckw
The DCA algorithm used for carrying out the partition P w and the related
prototypes in Lw can be synthesized, in our context, by the following steps:
to different clusters. Updating the matrix A with values 0 or 1 (for all the pair
i, m = 1, . . . , n) at each partition of the local window, at a certain time the
matrix A stores in each cell aw (i, m) the number of times each couple of streams
is allocated to the same cluster of a local partition P w . Even if the matrix A
for a large number of windows approximates a matrix of similarities, it does not
take into account any information about the internal cohesion of the clusters or
the heterogeneity among streams belonging to different clusters.
In order to introduce more information in this proximity matrix, always re-
specting the computational constraints, we introduce a new approach based on
updating the dissimilarities rather than the similarities among the streams.
According to this new strategy, the dissimilarities d(Yiw , Ymw ) between the
couples Yiw , Ymw (with i, m = 1, . . . , n) are computedas follows:
Y w ∈C w d(Yiw ;gk
w
)
If Yiw , Ymw ∈ Ckw , d(Yiw , Ymw ) is equal to Wkw = i k
that is the
| Ckw |
average distance of the subsequences Yiw in acluster Ckw to the correspondent
w w
i,m∈C w d(Yi ;Ym )
prototype gkw ; It is worth noting that Wkw = k
when the proto-
|Ckw |·|Ckw −1|
types gkw (for k = 1, . . . , C) of the C clusters of the local partition P w are the
average profiles of the subsequences belonging to the several clusters and d(.) is
the Euclidean distance.
If (Yiw ∈ Ckw ) ∧ (Ymw ∈ Clw ) (with k = l = 1, . . . , C), the dissimilarity value
is Di,l = d(Yiw ; glw ) that is the distance of a subsequence Yiw , belonging to the
cluster Ckw to the prototype glw of a cluster Clw at which Ymw belongs.
We note that for the subsequences in the same cluster, the cell aw (i, m) is
updated with the same value Wkw while, for couples of streams belonging to dif-
ferent clusters, the cell aw (i, m) is updated with a different value Di,k depending
on the distance of the stream Yiw to the prototype gkw of the cluster Ckw at which
Ymw belongs.
In synthesis, for each partition P w of the subsequences in a window w, the
cells of A are updated as follows:
aw (i, m) = aw−1 (i, m) + Wkw if Yiw , Ymw ∈ Ckw
aw (i, m) = aw−1 (i, m) + Di,l if Yiw ∈ Ckw and Ymw ∈ Clw
In order to obtain a global partition P from the stored results of the local parti-
tion P w we propose to use the Dynamic Clustering Algorithm on a Dissimilarity
Table (DCLU ST ) [11] on the proximity matrix Aw . In such a way the criterion
8 A. Balzanella and R. Verde
optimized for obtaining the partition P is consistent with the criterion optimized
in the clustering of each local batch of data.
The aim of the DCLU ST is to partition a set of elements into a fixed number
of homogeneous classes (that we can choose to be equal or not equal to the num-
ber of clusters of each local partition) on the basis of the proximities between
pairs of elements. The optimized criterion is based on the sum of the dissimilar-
ities between elements belonging to the same cluster. Because the dissimilarities
between pair of streams (Yi and Ym ) are the values in the cells aw (i, m) of Aw ,
the DCA criterion can be expressed as:
C
Δ(P, L) = aw (i, m) (2)
k=1 i,m∈Ck
1.D. The copyright laws of the place where you are located also
govern what you can do with this work. Copyright laws in most
countries are in a constant state of change. If you are outside
the United States, check the laws of your country in addition to
the terms of this agreement before downloading, copying,
displaying, performing, distributing or creating derivative works
based on this work or any other Project Gutenberg™ work. The
Foundation makes no representations concerning the copyright
status of any work in any country other than the United States.
1.E.6. You may convert to and distribute this work in any binary,
compressed, marked up, nonproprietary or proprietary form,
including any word processing or hypertext form. However, if
you provide access to or distribute copies of a Project
Gutenberg™ work in a format other than “Plain Vanilla ASCII” or
other format used in the official version posted on the official
Project Gutenberg™ website (www.gutenberg.org), you must, at
no additional cost, fee or expense to the user, provide a copy, a
means of exporting a copy, or a means of obtaining a copy upon
request, of the work in its original “Plain Vanilla ASCII” or other
form. Any alternate format must include the full Project
Gutenberg™ License as specified in paragraph 1.E.1.
• You pay a royalty fee of 20% of the gross profits you derive from
the use of Project Gutenberg™ works calculated using the
method you already use to calculate your applicable taxes. The
fee is owed to the owner of the Project Gutenberg™ trademark,
but he has agreed to donate royalties under this paragraph to
the Project Gutenberg Literary Archive Foundation. Royalty
payments must be paid within 60 days following each date on
which you prepare (or are legally required to prepare) your
periodic tax returns. Royalty payments should be clearly marked
as such and sent to the Project Gutenberg Literary Archive
Foundation at the address specified in Section 4, “Information
about donations to the Project Gutenberg Literary Archive
Foundation.”
• You comply with all other terms of this agreement for free
distribution of Project Gutenberg™ works.
1.F.
Most people start at our website which has the main PG search
facility: www.gutenberg.org.