Professional Documents
Culture Documents
15.book 1 Reinforcement and Non Reinforcement Machine Learning Classifiers For User Movement Prediction
15.book 1 Reinforcement and Non Reinforcement Machine Learning Classifiers For User Movement Prediction
15.book 1 Reinforcement and Non Reinforcement Machine Learning Classifiers For User Movement Prediction
net/publication/258305284
CITATIONS READS
4 94
1 author:
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
Simple-Link Sensor Network- Based Remote Monitoring of Multiple Patients View project
All content following this page was uploaded by Theodoros Thodoris Anagnostopoulos on 31 January 2020.
Chapter 12
Reinforcement and Non-
Reinforcement Machine
Learning Classifiers for User
Movement Prediction
Theodoros Anagnostopoulos
National and Kapodistrian University of Athens, Greece
ABSTRACT
Mobile context-aware applications are required to sense and react to changing environment conditions.
Such applications, usually, need to recognize, classify, and predict context in order to act efficiently,
beforehand, for the benefit of the user. In this chapter, the authors propose a mobility prediction model,
which deals with context representation and location prediction of moving users. Machine Learning
(ML) techniques are used for trajectory classification. Spatial and temporal on-line clustering is ad-
opted. They rely on Adaptive Resonance Theory (ART) for location prediction. Location prediction
is treated as a context classification problem. The authors introduce a novel classifier that applies a
Hausdorff-like distance over the extracted trajectories handling location prediction. Two learning methods
(non-reinforcement and reinforcement learning) are presented and evaluated. They compare ART with
Self-Organizing Maps (SOM), Offline kMeans, and Online kMeans algorithms. Their findings are very
promising for the use of the proposed model in mobile context aware applications.
DOI: 10.4018/978-1-4666-4038-2.ch012
Copyright © 2013, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
Reinforcement and Non-Reinforcement Machine Learning Classifiers for User Movement Prediction
a user and an application, including the user and appropriate solutions for location prediction are
the application themselves” (Dey, 2001). Context offline and online clustering and classification.
refers to the current values of specific ingredients Offline clustering is performed through the Of-
that represent the activity of an entity/situation fline kMeans algorithm while online clustering is
and environmental state (e.g., attendance of a accomplished through the Self-Organizing Maps
meeting, location, temperature). (SOM), Online kMeans and Adaptive Resonance
One of the more intuitive capabilities of the Theory (ART). Offline learners typically perform
mobile context-aware applications is their pro- complete model building, which can be very costly,
activity. Predicting user actions and contextual if the amount of samples rises. Online learning
ingredients enables a new class of applications algorithms are able to detect changes and adapt/
to be developed along with the improvement of update only parts of the model thus providing for
existing ones. One very important ingredient is fast adaptation of the model. Both forms of algo-
location. Estimating and predicting the future loca- rithms extract a subset of patterns/clusters (i.e.,
tion of a mobile user enables the development of a knowledge base) from an initial dataset (i.e., a
innovative, location-based services/applications database of user itineraries). Moreover, online
(Hightower & Borielo, 2001; Priggouris et al., learning is more suited for the task of classifica-
2006). For instance, location prediction can be tion/prediction of the user mobility behavior as in
used to improve resource reservation in wireless the real life user movement data often needs to be
networks and facilitate the provision of location- processed in an online manner, each time after a
based services by preparing and feeding them with new portion of the data arrives. This is caused by
the appropriate information well in advance. The the fact that such data is organized in the form of a
accurate determination of the context of users and data stream (e.g., a sequence of time-stamped vis-
devices is the basis for context-aware applications. ited locations) rather than a static data repository,
In order to adapt to changing demands, such ap- reflecting the natural flow of data. Classification
plications need to reason based on basic context involves the matching of an unseen pattern with
ingredients (e.g., time, location) to determine existing clusters in the knowledge base. We rely
knowledge of higher-level situation. on a Hausdorff-like distance (Belogay et al., 1997)
Prediction of context is quite similar to in- for matching unseen itineraries to clusters (such
formation classification/prediction (offline and metric applies to convex patterns and is consid-
online). In this paper, we adopt ML techniques ered ideal for user itineraries). Finally, location
for predicting location through an adaptive model. prediction boils down to location classification
ML is the study of algorithms that improve au- w.r.t. Hausdorff-like distance.
tomatically through experience. ML provides We assess two training methods for training
algorithms for learning a system to cluster pre- an algorithm: (1) the “nearly” zero-knowledge
existing knowledge, classify observations, predict method in which an algorithm is incrementally
unknown situations based on a history of patterns trained starting with a little knowledge on the user
and adapt to situation changes. Therefore, ML can mobility behavior and the (2) supervised method in
provide solutions that are suitable for the location which sets of known itineraries are fed to the clas-
prediction problem. Context-aware applications sifier. Moreover, we assess two learning methods
have a set of pivotal requirements (e.g., flexibility for the online algorithms regarding the success of
and adaptation), which would strongly benefit location prediction: (1) the non-Reinforcement
if the learning and prediction process could be Learning (nRL), in which a misclassified instance
performed in real time. We argue that the most is no further used in the model-training phase,
219
Reinforcement and Non-Reinforcement Machine Learning Classifiers for User Movement Prediction
thus, the classifier is no longer aware of unsuc- in Section Prediction Evaluation, where different
cessful predictions, and (2) the Reinforcement versions of that model are evaluated. Moreover,
Learning (RL), in which a misclassified instance in Section Comparison with Other Models, we
is introduced into the knowledge base updating compare the ART models with the Offline/On-
appropriately the model. line kMeans and SOM algorithms as long as we
We evaluate the performance of our models report the computational complexity of all the
against the movement of mobile users. Our objec- algorithms discussed. Related work is discussed
tive is to predict the users’ future location (their in Section Prior Work and we conclude the paper
next move) through an on-line adaptive classifier. in Section Conclusions.
We establish some important metrics for the per-
formance assessment process taking into account
low system-requirements (storage capacity) and MACHINE LEARNING MODELS
effort for model building (processing power).
Specifically, besides the prediction accuracy, i.e., In this section we briefly discuss the clustering
the precision of location predictions, we are also algorithms used throughout the paper. Specifi-
interested in the size of the derived knowledge cally, we distinguish between offline and online
base; that is the produced clusters out of the vol- clustering and elaborate on the Offline/Online
ume of the training patterns, and the capability kMeans and ART.
of the classifier to adapt the derived model to
unseen patterns. Surely, we need to keep storage Offline kMeans
capacity as low as possible while maintaining
good prediction accuracy. Lastly, our objective is In Offline kMeans (Alpaydin, 2004) we assume
to assess the adaptivity of the proposed schemes, that there are k > 1 initial clusters (groups) of data.
i.e., the capability of the predictor to detect and The objective of this algorithm is to minimize the
update appropriately the specific part of the trained reconstruction error, which is the total Euclidean
model. The classifier (through the location predic- distance between the instances (patterns), ui,
tion process) should rapidly detect changes in the and their representation, i.e., the cluster centers
behavior of the mobile user and adapt accordingly (clusters), ci. We define the reconstruction error
through model updates, however, often at the as follows:
expense of classification accuracy (note that an
ambient environment implies high dynamicity). 1
E {ci } | U = ∑ ∑ bi,t || ut − ci ||2
k
220
Reinforcement and Non-Reinforcement Machine Learning Classifiers for User Movement Prediction
ci =
∑b u
t i ,t t
(2)
terns U is not available during training. Instead,
patterns are received one by one and the model
∑b t i ,t
is updated progressively. The model has three
crucial properties:
Since the algorithm operates in offline mode,
the initial clusters can be set during the training 1. A normalization of the total network activity.
phase and cannot be changed (increased or relo- Biological systems are usually very adaptive
cated) during the testing phase. to large changes in their environment.
2. Contrast enhancement of input patterns.
Online kMeans The awareness of subtle differences in input
patterns can mean a lot in terms of survival.
In Online kMeans (Alpaydin, 2004) we assume Distinguishing a hiding panther from a rest-
that there are k > 1 initial clusters that split the ing one makes all the difference in the world.
data. Such algorithm processes unseen patterns 3. Short-term memory storage of the contrast-
one by one and performs small updates in the enhanced pattern. Before the input pattern
position of the appropriate cluster (ci) at each can be decoded, it must be stored in the
step. The algorithm does not require a training short-term memory. The long-term memory
phase. The update for each new (unseen) pattern implements an arousal mechanism (i.e., the
ut is the following: classification), whereas the STM is used to
cause gradual changes in the LTM.
ci = ci + η ⋅ bi,t ⋅ ut − ci ( )
The system consists of two layers which are
connected to each other via the long term memory.
This update moves the closest cluster (for which
The input pattern is received at the first layer and
bi,t = 1) toward the input pattern ut by a factor of
classification takes place in the second layer.
η. The other clusters (found at bigger distances
The term competitive learning is used for ART
from the considered pattern) are not updated. The
denoting that the (local) clusters compete among
semantics of bi,t, η and (ut – ci) are:
themselves to assume the “responsibility” for
representing an unseen pattern. The model is also
• bi,t ∈ {0, 1} denotes which cluster is being
called winner-takes-all because one cluster “wins
modified.
the competition” and gets updated, and the others
• η ∈ [0, 1] denotes how much is the cluster
are not updated at all.
shifted toward the new pattern.
The ART approach is incremental, meaning
• (ut – ci) denotes the distance to be learned.
that one starts with one cluster and adds a new
one, if needed. Given an input ut, the distance bt
Since the algorithm is online, the initial clusters
is calculated for all clusters ci, i = 1, .., k, and the
should be known beforehand1 and can only be
closest (e.g., minimum Euclidean distance) to
relocated during the testing phase. The number of
ut is updated. The closest cluster is called “win-
clusters remains constant. Therefore, the algorithm
ner”. Specifically, if the minimum distance bt is
exhibits limited flexibility.
smaller than a certain threshold value, named the
vigilance, ρ, the update is performed as in Online
Adaptive Resonance Theory kMeans (see Eq.(3)). Otherwise, a new center ck+1
representing the corresponding input ut is added
The ART approach (Duada et al., 2001) is an
in the model (see Eq.(3)). It is worth noting that
online learning scheme in which the set of pat-
the vigilance threshold refers to the criterion of
221
Reinforcement and Non-Reinforcement Machine Learning Classifiers for User Movement Prediction
considering two patterns equivalent or not dur- spatiotemporal history model in which the move-
ing the learning phase of a classifier. As it will ment history is represented as the sequence of 3-D
be shown, the value of vigilance is considered points (3DPs) visited by the moving user, i.e.,
essential in obtaining high values of corrected time-stamped trajectory points in a 2D surface.
classified patterns. The following equations are The spatial attributes in that model denote latitude
adopted in each update step of ART: and longitude.
Let e = (x, y, t) be a 3DP. The user trajectory
k
u consists of several time-ordered 3DPs, u = [ei]
bt =|| ci − ut ||= min || cl − ut ||
l =1 = [e1, …, eN], i = 1, …, N and is stored in the
c ← u
k +1 t
if bt > ρ (3) system’s database. It holds that t(e1) < t(e2) <
ci = ci + η (ut − ci ) otherwise … < t(eN), i.e., time-stamped coordinates. The
x and y dimensions denote the latitude and the
longitude while t denotes the time dimension
(and t(⋅) returns the time coordinate of e). Time
Self-Organizing Maps assumes values between 00:00 and 23:59. To avoid
state information explosion, trajectories contain
By adopting the ART algorithm, all clusters apart time-stamped points sampled at specific time
from the winner cluster are not updated once an instances. Specifically, we sample the movement
input ut is given. In the Self-Organizing Maps of each user at 1.66⋅10-3 Hertz (i.e., once every 10
(SOM) algorithm the winner cluster and some minutes). Sampling at very high rates (e.g., in the
of the other clusters are updated w.r.t. an input order of a Hertz) is meaningless, as the derived
ut to a certain extent. Specifically, SOM defines points will be highly correlated. In our model,
the neighborhood of a ci cluster as the set Vi(n) u is a finite sequence of N 3DPs, i.e., u is a 3∙N
of those n clusters cl closest to ci, i.e., Vi(n) = dimension vector. We have adopted a value of N
{cl: cl = argmink||ck - ci||, k = 1,…, n}. When ci = 6 for our experiments meaning that we estimate
is the winner cluster then, in addition to ci, its the future position of a mobile terminal from a
neighbor clusters are also updated. For instance, movement history of 50 minutes (i.e., 5 samples).
if the neighborhood is of size n = 2, then the two Specifically, we aim to query the system with a
closest clusters to ci are also updated but with N-1 3DP sequence so that our classifier/predictor
less weight as the neighborhood increases. The returns a 3DP, which is the predicted location of
clusters cl ∈ Vi(n) are updated as: the mobile terminal.
A cluster trajectory c consists of a finite
cl = cl + ηe(l, i )(ut − cl ) (3a) number of 3DPs, c = [ei], i = 1, …, N stored in
the knowledge base. Note that a cluster trajectory
−
(l −i )2
c and a user trajectory u are vectors of the same
e(l, i ) = 1
e 2 σ2
(3b)
2 πσ length N. This is because c, which is created
from ART based on unseen user trajectories, is a
representative itinerary of the user movements.
In addition, the query trajectory q consists of
CONTEXT REPRESENTATION
a number of 3DPs, q = [ej], j = 1, …, N-1. It is
worth noting that q is a sequence of N-1 3DPs.
Several approaches have been proposed in order
Given a q with a N-1 history of 3DPs we predict
to represent the movement history (or history) of
the eN of the closest c as the next user movement.
a mobile user (Cheng et al., 2003). We adopt a
222
Reinforcement and Non-Reinforcement Machine Learning Classifiers for User Movement Prediction
223
Reinforcement and Non-Reinforcement Machine Learning Classifiers for User Movement Prediction
224
Reinforcement and Non-Reinforcement Machine Learning Classifiers for User Movement Prediction
Figure 2. The generated GPS trace files for ex- In the following sub-sections, we evaluate the
perimentation diverse versions of the C classifier w.r.t. training
and learning methods by examining the classifier
convergence (speed of learning and adaptation)
and the derived precision on prediction future
locations.
225
Reinforcement and Non-Reinforcement Machine Learning Classifiers for User Movement Prediction
thus, it is gradually “reinforced” to learn new into account the prediction accuracy in order to
patterns up to 110 clusters. In the next 600 known reach safe conclusions about the efficiency and
patters, C-RLT-2 does not need to learn addi- effectiveness of the proposed models.
tional clusters thus it settles at 110 clusters.
We now examine the behavior of the C-RLnT Precision of C-RLT and C-RLnT
model corresponding to the zero-knowledge
training method. The asterisk (*) marked line In Figure 4 we examine the precision achieved by
depicts the training phase (with TrainB) followed the algorithms adopting reinforcement learning.
by the testing phase (with TestA) of C-RLnT. In The vertical axis depicts the precision value p
this case, we have an incremental ART that does achieved during the testing phase. The point (.)
not need to be trained. For technical consistency marked line depicts the precision of the C-RLT-1
reasons, it only requires a single pattern, which model trained with TrainA and tested with TestA.
is the unique cluster in the knowledge base at the During the test phase, for the first 600 known pat-
beginning. In the testing phase, for the first 600 terns C-RLT-1 achieves precision value ranging
unseen patterns of TestA we observe a progressive from 97% to 100%. In the next 600 unseen pat-
cluster creation (up to 45 clusters). For the next terns, we observe that for the first instances the
600 unseen patterns, we also observe a gradual precision drops smoothly to 95% and as C-RLT-1
cluster creation (up to 85 clusters) followed by is reinforced to learn, i.e., learn new clusters and
convergence. Comparing the C-RLT-1/-2 and C- optimize the old ones, the precision converges
RLnT models, the latter one achieves the minimum to 96%.
number of clusters (22.72% less storage cost). This The circle (o) marked line depicts the precision
is due to the fact that C-RLnT starts learning only behavior for the C-RLT-2 model tested with
from unsuccessful predictions in an incremental TestB and trained with TrainA. With the first 600
way by adapting pre-existing knowledge base to totally unseen patterns during the test phase, C-
new instances. Nevertheless, we also have to take RLT-2 achieves precision from 26% to 96%. This
226
Reinforcement and Non-Reinforcement Machine Learning Classifiers for User Movement Prediction
indicates that the model is still learning during cost. Specifically, we need to store 110 clusters,
the test phase adopting the reinforcement learning in the case of C-RLT, compared to 85 clusters in
method increasing the precision value. In the next the case of C-RLnT (22.72% less storage cost).
600 known patterns, the model has nothing to Furthermore, the user movement patterns can be
learn and the precision value converges to 96%. changed repeatedly over time. Hence, by adopt-
The asterisk (*) marked line depicts the preci- ing the training method, one has to regularly train
sion behavior of the C-RLnT model tested with and rebuild the model. Nevertheless, we have to
TestA and trained with TrainB. In this case, C- examine the behavior of the C-nRLT and C-nRLnT
RLnT is trained with only one pattern instance, models in order to decide on the appropriate model
i.e., the algorithm is fully incremental, thus, all the for the discussed domain.
instances are treated as unseen. In the test phase,
for the first 600 patterns, the model achieves Convergence of C-nRLT
precision, which ranges from 25% to 91% (due and C-nRLnT
to the reinforcement mechanism). In the next 600
patterns, we can notice that for the first instances In Figure 5, we see the number of clusters that
the precision drops smoothly to 88% and as the are generated by C-nRLT/-nRLnT assuming the
model is “reinforced” to learn, precision gradually non-reinforcement learning method. The verti-
converges to 93%. cal axis indicates the generated clusters during
Evidently, the adoption of the training method, the training and the testing phase. The point
i.e., the C-RLT-1/-2 models, yields better preci- (.) marked line depicts the training phase with
sion. However, if we correlate our findings with TrainA and the testing phase with TestA for the
the results shown in Figure 3, we infer that a small C-nRLT-1 model. As we can see in the training
improvement in precision has an obvious storage phase, the first 600 patterns (TrainA) have gener-
227
Reinforcement and Non-Reinforcement Machine Learning Classifiers for User Movement Prediction
ated 70 clusters. In the testing phase, the model unseen patterns we observe that there is no extra
knows the first 600 patterns thus there is no new cluster generation except the unique cluster (of the
cluster creation. Contrast to the C-RLT-1 model TrainB file). In addition, for the next 600 unseen
(Section Prediction Evaluation), the remaining patterns, no additional cluster generation is ob-
600 unseen patterns cannot be further learned by served. We only notice that the C-nRLT/-nRLnT
the C-nRLT-1, thus, the total of clusters remains models achieve a lower amount of clusters than
constant at 70 clusters. the C-RLT/-RLnT models.
The circle (o) marked line depicts the training
phase with TrainA and the testing phase with Precision of C-nRLT and C-nRLnT
TestB for the C-nRLT-2 model. Since the train
file is the same, (i.e., TrainA), the count of the In Figure 6, we see the precision of the models
generated clusters is the same (70 clusters) as adopting the non-reinforcement learning. The
before. In contrast to C-RLT-2, in the testing phase vertical axis shows the precision value achieved
we do not observe a change in the number of during the testing phase. The point (.) marked
clusters. The model does not know the second line refers to the C-nRLT-1 model (trained with
600 unseen patterns and is not reinforced to learn TrainA, tested with TestA). In the testing phase, the
new ones, thus, the total cluster count remains first 600 known patterns achieve precision levels
constant at 70. In the next 600 known patterns, ranging from 97% to 100%. In the next 600 unseen
the model retains the same cluster count (70 patterns, we notice that the precision value drops
clusters). gradually to 59%. This is attributed to the fact
Finally, the asterisk (*) marked line depicts that there are no new clusters to optimize the old
generated number of clusters for the C-nRLnT ones due to the lack of the reinforcement learning
model tested with TestA. In this case, we have mechanism. This indicates that C-nRLT-1 model
an incremental, non-reinforcement model that is does not adapt to new knowledge.
not trained. In the testing phase, for the first 600
228
Reinforcement and Non-Reinforcement Machine Learning Classifiers for User Movement Prediction
The circle (o) marked line refers to the C- cost as lightweight as possible, the C-RLnT and
nRLT-2 model (trained with TrainA, tested with C-nRLnT models should be chosen. If the mobile
TestB). In the testing phase, the first 600 unseen context-aware application aims at maximizing
patterns achieve precision levels ranging from the supported quality of service w.r.t. precision,
16% to 19% because the reinforcement mechanism while keeping the storage cost stable, the C-RLnT
is not used and, thus, new clusters cannot derive. model should be adopted. Table 3 summarizes
In the next 600 known patterns, we notice that our conclusions with respect to the comparison of
the precision value converges to 59%. the four alternative models discussed throughout
Finally, the asterisk (*) marked line depicts the the paper. Comparing the four versions of the C
behavior of the precision of the C-nRLnT model classifier, we can clearly distinguish the C-RLnT
(loaded with TrainB, tested with TestA). In this model as the most efficient classifier for location
scenario, the classifier is fully incremental and prediction.
all instances are treated as unseen. As we can see
in the testing phase the first 600 patterns achieve
precision levels ranging from 10% to 14% and, in COMPARISON WITH THE OTHER
the next 600 pattern instances, we notice that the MODELS
precision value drops smoothly to 7%. Evidently,
the C-nRLT/-nRLnT models achieve much lower We compare the C-RLnT model with other known
precision than their counterparts C-RLT/-RLnT models that can be used for location prediction.
since the former models do not support adaptation. Such models implement the Offline kMeans and
Therefore, the adoption of the reinforcement Online kMeans algorithms. Such models require
learning method for location prediction produces a predefined number of k > 1 initial clusters for
better results than a non-reinforcement classifier. constructing the corresponding knowledge base.
Each approach has certain advantages and limita- We should stress here that, the greater the k the
tions. If the major concern is to keep the storage greater the precision value achieved by Offline/
229
Reinforcement and Non-Reinforcement Machine Learning Classifiers for User Movement Prediction
230
Reinforcement and Non-Reinforcement Machine Learning Classifiers for User Movement Prediction
by comparing such three models, the most suitable for both spatial ρs and temporal ρt vigilance coef-
model for location prediction is the C-RLnT since ficients in order to obtain the highest precision
(i) it achieves greater precision through model with low memory requirements. We introduce the
adaptation and (ii) requires a smaller size of the weighted sum γ as follows:
underlying knowledge base (i.e., less clusters)
than the Offline/Online kMeans models. γ = w ⋅ p + (1 - w) ⋅ (1 – a)
Moreover, we compare the C-RLnT model
with the SOM model. We have experimented where a is the proportion of the generated clusters
with n = 5 and n = 10 neighboring clusters of by the classifier (i.e., the size of the knowledge
the winner cluster. In Figure 8 we can observe base in clusters) out of the total movement patterns
that the precision value for the SOM model (the (i.e., the size of the database in patterns), that is:
asterisk (*) and point (.) marked line) increases a = |C|/|U|; |C| is the cardinality of the set C. The
as the number of neighboring clusters decreases. weight value w ∈ [0, 1] indicates the importance
In case of n = 0, that is, no neighboring clusters of precision and memory requirements; a value
are updated, i.e., the C-RLnT model (the circle (o) of w = 0.5 assigns equal importance to a and p.
marked line), we can achieve the best precision. In our assessment, we set w = 0.7. We require
Up to this point we have concluded that the that a assumes low values minimizing the storage
C-RLnT model achieves good precision with cost of the classifier. A low value of a indicates
limited memory requirements, which are very that the applied classifier appropriately adopts
important parameters for mobile context-aware and learns the user movements without retaining
systems. However, we need to perform some tests redundant information. The value of γ indicates
with C-RLnT in order to determine the best value which values of ρs and ρt maximize the precision
for the spatiotemporal parameter vigilance ρ. In while, on the same time, minimize the memory
other words, we aim to determine the best values requirements. Hence, our aim is to achieve a high
Figure 8. Comparison of the C-RLnT with the SOM model for n = 5 and n = 10 neighboring clusters
231
Reinforcement and Non-Reinforcement Machine Learning Classifiers for User Movement Prediction
Figure 9. The behavior of the γ parameter vs. temporal and spatial coefficients of the vigilance threshold
232
Reinforcement and Non-Reinforcement Machine Learning Classifiers for User Movement Prediction
Figure 10. Adaptivity capability for C-RLnT, SOM-5, SOM-10 incremental models
233
Reinforcement and Non-Reinforcement Machine Learning Classifiers for User Movement Prediction
and assumes a normal density distribution of the prediction accuracy lower than ours (i.e., in the
underlying data. However, such assumptions are order of 70% for deterministic movement). In
not considered in our model as long as mobility (Nhan & Ryu, 2006), the authors apply movement
patterns refer to real human traces with unknown rules in mobility prediction given the user’s past
distribution. Moreover, the learning automaton in movement patterns. Prediction accuracy is still
(Hadjiefthymiades & Merakos, 2003) follows a lin- low (i.e., in the order of 65% for deterministic
ear reward-penalty reinforcement learning method movement). The authors in (Xiao et al., 2007)
for location prediction. However, such model does introduce a prediction model that uses grey theory
not assume satisfactory correct predictions, as (i.e., a theory used to study uncertainty). This ap-
reported in (Hadjiefthymiades & Merakos, 2003). proach achieves prediction accuracy lower than
The authors in (Karmouch & Samaan, 2005) apply ours (i.e., in the order of 82% for deterministic
evidential reasoning in mobility prediction when movement). The authors in (Samaan & Karmouch,
knowledge on the mobility patterns is not avail- 2005) apply evidential reasoning for location
able (i.e., like our case). Yet, such model assumes prediction when knowledge on the mobility pat-
large computational complexity (derived from the terns is not available. This model suffers from
adopted Dempster-Schafer algorithm) once the increased computational complexity attributed to
amount of possible locations (the user moves to) the Dempster-Shafer algorithm. It also requires
increases and requires detailed user information detailed user information (e.g., daily movement
(e.g., daily profile, preferences, favorite meeting profile, favorite meeting places).
places). Other methods for predicting trajectory The authors in (Jeung et al., 2008) apply a
have also been proposed in the literature (Viayan hybrid model for location prediction. The key
& Holtman, 1993) but these have generally been component of the model in (Jeung et al., 2008) is
limited in scope since they consider rectilinear a database of association rules. Such model uses
movement patterns only (e.g., highways) and not association rule mining for specific user patterns.
unknown patterns, as dealt with in our model. If no rules can be exploited (i.e., an unknown
A closely related work has been carried out in user pattern) the query engine uses a Recursive
(Ashrook & Starner, 2002), where a GPS sys- Motion Function (RMF) to predict the next user
tem is used to collect location information. The location. A major drawback of the hybrid model
proposed system then automatically clusters GPS is that it uses more detailed mobility information
data taken into meaningful locations at multiple (i.e., time and velocity) than our LPs. The per-
scales. These locations are then incorporated formance of the proposed scheme is not assessed
into a similar Markov model to predict the user’s w.r.t. movement randomness. In the prediction
future location. The authors in (Yavas et al., model proposed in (Anagnostopoulos et al., 2007)
2005) adopt a data mining approach (i.e., rule a single, quantized timestamp is attached to each
extraction) for predicting user locations in mobile training trajectory. This timestamp refers to the last
environments. This approach achieves prediction cell transition seen in the trajectory. The predic-
accuracy lower than ours (i.e., in the order of 80% tion performance is comparable to some of our
for deterministic movement). In (Katsaros et al., LPs. Also, in (Anagnostopoulos et al., 2009), we
2003), the authors adopt a clustering method compare our LPs with other state-of-art predictors
for the location prediction problem. Prediction for varying moving behaviors. The multi-expert
accuracy is still low (in the order of 66% for combination method in (Anagnostopoulos et al.,
deterministic movement). The authors in (Tao et 2009) achieves better prediction accuracy than the
al., 2004) introduce a framework where for each other LPs. However, that comes at the expense of
user an individual function is computed in order higher time complexity for reaching a prediction
to capture its movement. This approach achieves decision. The method proposed in (Akoush &
234
Reinforcement and Non-Reinforcement Machine Learning Classifiers for User Movement Prediction
235
Reinforcement and Non-Reinforcement Machine Learning Classifiers for User Movement Prediction
Karmouch, A., & Samaan, N. (2005). A Mobil- Yavas, G., Katsaros, D., Ulusoy, O., & Manolo-
ity Prediction Architecture Based on Contextual poulos, Y. (2005). A data mining approach for
Knowledge and Spatial Conceptual Maps. IEEE location prediction in mobile environments. Data
Transactions on Mobile Computing, 4(6). & Knowledge Engineering, 54(2). doi:10.1016/j.
datak.2004.09.004.
Katsaros, D., Nanopoulos, A., Karakaya, M., Ya-
vas, G., Ulusoy, O., & Manolopoulos, Y. (2003).
Clustering Mobile Trajectories for Resource Al-
location in Mobile Environments. In Proceedings ADDITIONAL READING
IDA, pp. 319-329.
Akyildiz, I. F. et al. (1999). Mobility manage-
Lensu, A., & Koikkalainen, P. (2002). Artificial ment for next generation wireless systems.
Neural Networks. ICANN. Proceedings of the IEEE, 87(8), 1347–1385.
Narendra, K., & Thathachar, M. A. L. (1989). doi:10.1109/5.775420.
Learning Automata – An Introduction. Prentice Anagnostopoulos, T., Anagnostopoulos, C., &
Hall. Hadjiefthymiades, S. (2009). An online adaptive
Nhan, V. T. H., & Ryu, K. H. (2006). Future model for location prediction. In Proceedings of
Location Prediction of Moving Objects Based on ICST Autonomics 2009. Limassol, Cyprus: ICST.
Movement Rules. Springer ICIC 2006. LNCIS, Anagnostopoulos, T., Anagnostopoulos, C., &
344, 875–881. Hadjiefthymiades, S. (2011). An adaptive machine
OpenStreetMap. Webite: http://www.openstreet- learning algorithm for location prediction. Interna-
map.org/traces/tag/Denmark tional Journal of Wireless Information Networks,
18, 88–99. doi:10.1007/s10776-011-0142-4.
Priggouris, I., Zervas, E., & Hadjiefthymiades,
S. (2006). Location Based Network Resource Bharghavan, V., & Jayanth, M. (1997). Profile-
Management. In Ibrahim, I. K. (Ed.), Handbook based next-cell prediction in indoor wireless LAN.
of Research on Mobile Multimedia. Idea Group In Proceedings of IEEE SICON. Singapore: IEEE.
Inc. doi:10.4018/978-1-59140-866-6.ch011. Biesterfeld, J., Ennigrou, E., & Jobmann, K.
Tao, Y., Faloutsos, C., Papadias, D., & Liu, B. (1997). Location prediction in mobile networks
(2004). Prediction and Indexing of Moving Objects with neural networks. In Proceedings of the Inter-
with Unknown Motion Patterns. ACM SIGMOD. national Workshop on Applied Neural Networks
doi:10.1145/1007568.1007637. to Telecommunications, (pp. 207-214). IEEE.
Viayan, R., & Holtman, J. (1993). A model for Chan, J., Zhou, S., & Seneviratne, A. (1998). A
analyzing handoff algorithms (Vol. 42, p. 3). IEEE QoS adaptive mobility prediction scheme for wire-
Trans. on Veh. Technol. less networks. In Proceedings of IEEE Globecom.
Sydney, Australia: IEEE.
Xiao, Y., Zhang, H., & Wang, H. (2007). Location
Prediction for Tracking Moving Objects Based on
Grey Theory. IEEE FSKD 2007.
236
Reinforcement and Non-Reinforcement Machine Learning Classifiers for User Movement Prediction
Das, S. K., & Sen, S. K. (1999). Adaptive location Zhang, T. et al. (2001). Local predictive reservation
prediction strategies based on a hierarchical net- for handoff in multimedia wireless IP networks.
work model in a cellular mobile environment. The IEEE Journal on Selected Areas in Communica-
Computer Journal, 42(6), 473–486. doi:10.1093/ tions, 19, 1931–1941. doi:10.1109/49.957308.
comjnl/42.6.473.
Erbas, F., et al. (2001). A regular path recogni-
tion method and prediction of user movements in KEY TERMS AND DEFINITIONS
wireless networks. In Proceedings of the IEEE
Vehicular Technology Conference (VTC). IEEE. Adaptive Resonance Theory: Theory that
involves a number of neural network models in
Jain, R., Lin, Y.-B., & Mohan, S. (1999). Location order to address problems such as pattern recogni-
strategies for personal communications services. tion or prediction.
In Gibson, J. (Ed.), Mobile Communications Classification: The identification of the cor-
Handbook (2nd ed.). Boca Raton, FL: CRC. rect category where a new observation belongs.
Jiang, S., He, D., & Rao, J. (2001). A prediction- Context Awareness: Sensing the environment
based link availability estimation algorithm for of some devices and adapting the behavior of a
mobile ad hoc networks. In Proceedings of IEEE system.
InfoCom. IEEE. Location Prediction: Predicting the location
of a mobile user or device.
Krishna, P., Vaidya, N., & Pradhan, D. (1996). Machine Learning: Branch of artificial intel-
Static and adaptive location management in ligence that is concerned by the development of
mobile wireless networks. Computer Commu- algorithms that take as input empirical data and
nications, 19(4), 321–334. doi:10.1016/0140- yield patterns or prediction of the underlying
3664(96)01070-5. mechanism.
Liu, T., Bahl, P., & Chlamtac, I. (1998). Mobility Online Clustering: Clustering technique that
modeling, location tracking, and trajectory pre- processes data points in serial, one at a time as
diction in wireless ATM networks. IEEE Journal they enter into the system, with the aim of sav-
on Selected Areas in Communications, 16(6), ing computational time when used in real time
922–936. doi:10.1109/49.709453. applications.
237