Professional Documents
Culture Documents
Visualization of Location-Referenced Web Textual Information Based On Map Mashups
Visualization of Location-Referenced Web Textual Information Based On Map Mashups
April 9, 2019.
Digital Object Identifier 10.1109/ACCESS.2019.2907570
ABSTRACT Space is the most fundamental organizing dimension for information that forms the basic
spatial understanding around which all other temporal and semantic details are situated. Various types
of web information are present in our daily lives, and spatial content can facilitate the understanding of
that information. Hence, many studies have been conducted to extract the implicit spatial location from
different web resources, and they have mainly investigated web textual information. However, the existing
studies mainly focus on the extraction or simple presentation of the location of the information, while
further exploration and visualization of the information through the comprehensive consideration of its
spatial, temporal, and semantic dimensions have rarely been performed. Thus, this paper proposes a novel
modeling and visualization framework for location-referenced web textual information. The framework
applies an information model to structurally extract and resolve spatial content along with the temporal and
semantic elements from the location-referenced information items. Based on the constructed information
model, the information items are hierarchically clustered and visualized by combining map mashups with
cartographic processes and methods. This framework enables users to interactively index and browses
individual information items. Furthermore, the association relationships involved in the information set are
explored in our work. With knowledge graphs that are automatically generated based on the information
model, a high-level understanding of the information set can be obtained by users.
Many related studies have been performed to achieve map- the multidimensional content of the information has not
based information visualization or map mashups, and some been explored, visualization is limited to roughly present the
of the studies are devoted to locating and tagging the geo- location of information; thus, the further level knowledge
graphic coordinates of the web content to extract the implicit of the information set has not been thoroughly extracted or
locations [15], as geographic locations are usually implicitly displayed.
contained in location-referenced information. These studies This paper proposes a novel modeling and visualization
are known as geotagging, geo-referencing or geocoding in framework for location-referenced web textual information.
the field of geographic information system [16]–[18], most In this framework, a location-referenced information model
of which consider textual data. Amitay et al. [19] proposed a is established to comprehensively resolve and extract basic
system named Web-a-Where to identify the geographic focus spatial, temporal, and semantic elements such as the loca-
of a document using a hierarchical gazetteer and a simple tion, time, and participant (person, organization) from the
scoring algorithm. Blessing et al. [20] used named entity information items. The information processed in the study
recognition (NER) and a knowledgebase gazetteer to recog- mainly exists as text because, for other types of information,
nize and locate the geographic names for German content. to enable automatic interpretation using a computer, the infor-
Kordopatis-Zilos et al. [21], [22] refined language models mation content is generally processed and extracted to the
for geotagging social media content based on text. Various form of text. Based on the constructed information model,
geotagging tasks for different types of digital resources have hierarchical clustering of the information is applied, and a
been conducted in an international benchmarking initiative visualization strategy that combines map mashups with car-
named MediaEval [23]. tographic processes so that users can interactively index and
Based on these geotagging methods, map mashup applica- browse the clustered information is developed. In addition,
tions that use geographic locations as the medium to present the association relationships between the entities (locations,
and index information have been explored by researchers. persons, etc.) in the information are explored according to
Ahern et al. [24] built an application system called World their co-occurrences in the information model to automati-
Explorer that analyzes the location-referenced textual tags cally generate the corresponding knowledge graphs, which
associated with Flickr images and aggregated and presented enable the users to obtain entity-level knowledge from the
these images with the map interface. Carmo et al. [25] information set.
completed MoViSys, a map visualization system for geo- This paper is organized as follows. Section II describes
referenced data, organized by categories with intuitive and modeling, information processing, and the visualization
adaptive icons for mobile devices. Teitler et al. [26] and framework in detail, while Section III provides the experi-
Sankaranarayanan et al. [27] proposed automated systems ments and the evaluation of the result. Finally, Section IV
to associate news articles or Twitter messages with geo- discusses the results, and a summary of the conclusions is
graphic locations and displayed location-referenced informa- presented.
tion based on the map interface. Gao et al. [28] proposed
the NewsViews system, which leverages text mining to II. METHODOLOGY
identify key concepts and locations discussed in articles To address the limitations mentioned previously, we propose
and to automatically create visually thematic maps based an information processing and map mashup framework to
on an extensive repository of databases. Troudi et al. [29] meet the needs of the users by providing a convenient way
detected events from social media by using descriptive to interactively browse and index location-referenced web
dimensions as the topic, time, and location and devel- textual information and obtain knowledge graphs of the infor-
oped a mashup-based framework to present detailed event mation set on a map interface. The detailed architecture is
information. presented in Figure 1.
The use of geotagging technologies and map mashup appli- Through this framework, we crawl and model the location-
cations offers a new perspective for browsing and perceiv- referenced information from web sources. By resolving the
ing location-referenced information. However, the existing web textual content, the basic elements of the information,
studies are insufficient in terms of several factors. First, including the time, location, activity, and participant, are ana-
the existing studies mainly focus on the acquisition and pre- lyzed and extracted using natural language processing (NLP)
sentation of location content, and the comprehensive reso- and geotagging techniques; the location-referenced informa-
lution and modeling of the information that the temporal tion model is constructed accordingly. Then, the informa-
and semantic content should be considered has not been tion items are hierarchically clustered, and the relationships
sufficiently explored. Second, regarding the visualization between the entities in the information are explored accord-
strategies of these studies, cartographic processing of map ing to the information model. Finally, based on the map
mashups is lacking, and the related cartographic principles mashup strategy, the integrated information items are effec-
and solutions have not been utilized. Because of the rich- tively presented on a map interface using different labeling
ness of location-referenced information, the quality of the modes. Meanwhile, the knowledge graphs of the relationships
visual representation and man-machine interaction of the between the entities involved in the information set can also
map mashups cannot be guaranteed. In addition, because be obtained.
FIGURE 1. General framework for the information processing and map mashup visualization.
TABLE 2. Summary of the PENN treebank part of speech tag sets in english.
dimensions to realize the comprehensive expression of infor- similarity function, which calculates the term feature vector
mation according to topic and content. Since web information similarity between the information and the cluster, is applied:
is constantly being generated in real time; thus, traditional
VEc · VEi
clustering methods for massive incremental information are D_sim (c, i) =
(5)
infeasible because, for every new information item inputted,
E
E
Vc
Vi
all information items need to be clustered again, which
would lead to unacceptable time and computing costs. Hence, where the c is the specific active cluster, i is the information
a modified leader-follower clustering [36] strategy is applied to be clustered, and VEc and VEi are the term feature vectors
in this study to incrementally aggregate the information items for information i and cluster c, respectively, which are gen-
into clusters according to their extracted location, time, and erated based on the terms from the Description tuple and
semantic elements. The specific clustering process is applied their corresponding TF-IDF scores. Based on this formula,
based on the active clusters (with location, time, and seman- the semantic similarity between the information item and the
tic centroid). For each new information item, the similarity cluster is calculated, and the result values range from 0 to 1.
between the information and the active clusters is measured. With the abovementioned location, time, and seman-
Then, the cluster is determined according to the similarity tic similarity measurement, the comprehensive similarity
score. The detailed location, time, and semantic similarity between an information item and a cluster is calculated using
measurement are described as follows. Formula 6, and the result values range from 0 to 1. In this
During the clustering process, location is an important formula, c and i are the cluster and the information item,
factor that must be considered, and information items related respectively, and k is the location hierarchical level. Based
to the same location need to be integrated. In our study, on this formula, the incremental real-time clustering process
Formula 3 is used to measure the location similarity between is performed for each new information item. Specifically,
the information item and the active clusters: the similarity scores between the new information item and
( the existing clusters are calculated. Then, the item is added to
1 If i and c have the same location the most similar cluster if the score is larger than the threshold
L_simk (c, i) =
0 else. value (since the value ranges of the time and semantic similar-
(3) ity score are both from 0 to 1, the comprehensive threshold
is set as 0.5 × 0.5 = 0.25 in this work), and the cluster is
where k stands for the hierarchical level (country, state, updated accordingly; otherwise, a new cluster is created using
county, etc.) of the location. Based on this parameter, infor- the information item. In this way, clustering is incrementally
mation clustering is conducted hierarchically by location performed for each new information item collected in real
level. c refers to the specific active cluster and i refers to the time.
information to be clustered. If i and c have the same location,
Total_simk (c, i) = L_simk (c, i) · T _sim (c, i) · D_sim (c, i)
then the similarity score is set to 1; otherwise, the value is set
to 0, which enables the information set to be clustered based (6)
on the location factor.
To account for temporal similarity, the Gaussian atten- B. MAP MASHUP VISUALIZATION
uation function is applied to measure the time difference Based on the abovementioned work, location-referenced
between the new information item and the active cluster information was processed, modeled, and hierarchically clus-
time centroid. The time similarity measurement formula is tered. Then the visualization of these structured information
as follows: items in the map mashup interface is handled in this stage.
There are two main concerns in our visualization imple-
(ti −tc )2
− mentation. First, a concise and intuitive map presentation
T _sim (c, i) = e 2σ 2 (4)
for the information items should be developed because the
where c and i refer to the specific active cluster and the fundamental function of a map mashup is for the users to
information to be clustered, respectively; ti and tc are the time index and browse the location-referenced information on the
element of information i and cluster c, respectively; and σ map interface. Second, the relationships between the enti-
is the parameter to handle the attenuate rate, which is set to ties (locations, persons, etc.) involved in the information set
3 in the formula. Based on this formula, temporal similarity is are visualized using knowledge graphs, according to their
quantified, and the result values range from 0 to 1, depending co-occurrences in the information model.
on the time difference between the information item and the
cluster. 1) VISUALIZATION OF INFORMATION ITEMS
The semantic similarity between the crawled information The essential function of the map mashup is to index or present
and the cluster is also measured based on the Description location-referenced information on a map interface. However,
tuple from the information model because the Description information visualization via map mashups is usually visually
tuple is the overall semantic expression, which involves chaotic [37], [38] due to the great amount of information
the participant, activity, and other content. Thus, the cosine items and the lack of cartographic processing. If the presented
FIGURE 3. Map labeling modes for information schematic diagram. (a) Traditional labeling mode. (b) Boundary labeling mode.
information is too chaotic, the visualization would be worth- involved are spatial elements (locations, countries, cities, etc.)
less. The problem of display massive amounts of information or nonspatial elements (organizations, persons, etc.), the gen-
on the map has been handled in cartography for decades, and erated knowledge graphs can be divided into two cate-
various labeling methods have provided the most effective for gories: spatial knowledge graphs and nonspatial knowledge
interactive information visualization [39], [40]. Nevertheless, graphs.
relevant methods have rarely been used to map mashup in
(1) Spatial knowledge graphs: For this type of knowledge
previous studies.
graph, the location tuple of information is mainly con-
Thus, we apply two types of labeling modes (i.e.,
sidered. Specifically, the spatial distribution of the ref-
the traditional labeling mode and the boundary labeling
erenced locations within the corresponding information
mode) [39], [41] in the interactive and real-time map mashup
items are presented as the nodes on the map interface,
framework to achieve an intuitive and effective visualization
and their association relationships are presented by
of information items The traditional labeling mode, which has
connecting links according to co-occurrences in the
been most widely studied in previous cartographic studies,
information set.
is often used to place labels on a map. This mode mainly
(2) Nonspatial knowledge graphs: For this visualization
considers the explicit correspondence between the placed
type, knowledge graphs are created for nonspatial
items and their corresponding locations on a map; thus,
entities such as organizations and persons related to
each item is maintained close to its location while avoiding
the information. These entities are compiled as the
item overlap. A modified traditional labeling method [42]
nodes of the knowledge graph network and connected
is applied here, and the visualization schematic diagram is
with lines based on co-occurrences in the information
shown in Figure 3(a). The boundary labeling mode places
set to present the association relationships between
the items outside the map area, with the leader lines show-
them.
ing the corresponding relationship between items and loca-
tions. In our study, boundary labeling is applied to combine Knowledge graphs are constructed based on the
the heuristic search strategy with a boundary labeling co-occurrences of entities in the information set; the detailed
method [41], [43]. The boundary labeling mode takes advan- process is shown in Figure 4. For convenience of illustration,
tage of free space to present information; a schematic diagram we take ‘‘Person’’ entities as an example to describe this
is shown in Figure 3(b). These two labeling modes have process, and the information items in the figure are assumed
their own advantages, and they offer users flexible options to be a subset that is filtered based on a user’s request. For
to conveniently index and browse the location-referenced each information item (I1, I2, I3, etc.) in this subset, some
information items under the map mashups framework. ‘‘Person’’ from the Participant tuple in the information model
(E1, E2, E3, etc.) may be referenced synchronously, which
are referred to as co-occurrences between ‘‘Person’’ entities.
2) INFORMATION KNOWLEDGE GRAPHS By integrating the co-occurrences of each information item,
In addition to map presentations of information items, peo- the overall co-occurrence matrix for the information subset
ple are also interested in the association knowledge of the can be generated; accordingly, the co-occurrence knowledge
information, which enables them to form a high-level under- graph between the ‘‘Person’’ entities can be automatically
standing of the information set. With the information model, constructed.
various entities are extracted and recorded for information With the proposed knowledge graphs, users can determine
items such as locations, organizations, and persons, based the relationships that exist for a specific information subset of
on which corresponding knowledge graphs [44], [45] can be interest, which can focus on a particular period of time or spe-
automatically constructed. According to whether the entities cific locations, organizations, or persons.
synchronously, such as the UK, Russia, France, etc. In addi- Based on these collected data, a large amount of location-
tion, Syria and Iran are involved in the results because of the referenced information is covered in our experiment, and the
high exposure and international attention they have recently interactive visualization for the real-time obtained data can
received. be achieved accordingly.
FIGURE 7. Co-occurrence knowledge graph between locations of ‘‘trade’’ activity. (a) Overview of the co-occurrence graph. (b) Co-occurrences for
New York, USA. (c) Co-occurrences for Beijing, China. (d) Co-occurrences for London, UK. (e) Co-occurrences for Tokyo, Japan.
B. INFORMATION ITEM VISUALIZATION structured information set, users can focus on location levels
The main aim of the designed visualization framework is to of interest, such as country, state, county, etc.
use the map interface as the spatial reference to index and Some visualizations for country-level locations are pre-
browse location-referenced information items. The interface sented in Figure 6, and the results for the two types of visu-
mainly employs a network map and corresponding visual alization modes, traditional labeling and boundary labeling,
containers to present the information items. Users can pan are shown in Figure 6(a) and Figure 6(b), respectively. For
and zoom through the map interface to interactively browse a specific location on the map, a visual panel is created to
and retrieve specific information items that have been hierar- provide an integrated display for the corresponding infor-
chically clustered and integrated according to their location, mation items, which are distributed in clusters, as shown
time, and semantic dimensions. Based on the hierarchically in Figure 6(c). Users can swipe up and down the panel
FIGURE 8. Co-occurrence knowledge graph between ‘‘cristiano ronaldo’’ and related persons.
to dynamically load and browse information items for this map space and may influence the expression of the base map.
location in an interactive way. For boundary labeling mode, because the information panels
For the traditional labeling mode, the information panels are placed outside the map space, there is little influence on
are placed close to the locations; in this way, the correspond- the base map; however, the correlation between location and
ing relationship between the location and the information can information in this mode is not as intuitive as that in the
be clearly presented. However, the information panels occupy former mode. The two modes have their own advantages, and
both have the ability to instantly refresh the map interface clustering methods. Based on the quintet information model,
according to pan and zoom operations. Thus, by applying the extracted spatial, temporal, and semantic elements pro-
these two labeling modes in the proposed map mashup frame- vide deep-level content that can intuitively and comprehen-
work, a more flexible and intuitive way to index and browse sively facilitate the understanding of the information. Then,
location-referenced information items is provided. the visualization application is implemented to combine map
mashups with cartographic methods to automatically gener-
C. KNOWLEDGE GRAPH VISUALIZATION ate knowledge graphs, which allows users to interactively
In addition to indexing and browsing individual information index and browse the individual information items and to fur-
items on a map interface, users are interested in the associa- ther view the overall distribution and correlations of various
tion relationship knowledge contained in the information set. entities in the information set.
With the proposed framework, the basic elements for each The proposed framework is an effective and feasible archi-
information item are extracted and modeled, which provides tecture for general location-referenced information. Cus-
the basis to further interpret the relationships between the tomized extensions can be developed for specific domains to
location, person, organization, etc., entities in the information obtain more targeted and particular information models along
set. With the method proposed in Section II, spatial and with the corresponding visualization patterns. In addition,
nonspatial knowledge graphs can be generated for various while the framework is currently limited to textual informa-
entities in which users are interested. tion, the semantic resolution of other forms of media, such
Knowledge graph visualization examples are shown in Fig- as video, audio, graphs, and images, could be explored to
ures 7 and 8 for location and person entities, respectively. The extend the ability of the system to parse various types of
spatial knowledge graph in Figure 7 considers the ‘‘trade’’ information. These considerations present several directions
Activity between various locations around the world. The for future work.
graph is constructed based on the subset retrieved from the
information set with the search criteria as follows: the Time REFERENCES
tuple is from 2017/1/1 to 2018/10/31, and the Activitytuple [1] S. Hahmann and D. Burghardt, ‘‘How much information is geospatially
is ‘‘trade’’. Thus, the network is constructed with the loca- referenced? Networks and cognition,’’ Int. J. Geograph. Inf. Sci., vol. 27,
no. 6, pp. 1171–1189, Jun. 2013.
tions as the nodes (with different symbol sizes to reflect [2] Q. Gan, J. Attenberg, A. Markowetz, and T. Suel, ‘‘Analysis of geographic
the occurrence frequencies) and the co-occurrences between queries in a search engine log,’’ in Proc. 1st Int. Workshop Location Web,
locations as the edges (with different colors and widths to Beijing, China, 2008, pp. 49–56.
[3] S. Aloteibi and M. Sanderson, ‘‘Analyzing geographic query reformula-
reflect the connection intensity between entities according tion: An exploratory study,’’ J. Assoc. Inf. Sci. Technol., vol. 65, no. 1,
to their co-occurrence frequencies), as shown in Figure 7(a). pp. 13–24, Jan. 2014.
The ‘‘trade’’ condition for some locations can be interactively [4] D. Butler, ‘‘Mashups mix data into global service,’’ Nature, vol. 439, p. 6,
Jan. 2006.
viewed by hovering the mouse cursor over the location, and [5] D. E. Simmen, M. Altinel, V. Markl, S. Padmanabhan, and A. Singh,
the ‘‘trade’’ condition views for New York, Beijing, London, ‘‘Damia: Data mashups for intranet applications,’’ in Proc. ACM SIGMOD
and Tokyo are presented in Figure 7(b)-(e), respectively. Int. Conf. Manage. Data, Vancouver, BC, Canada, 2008, pp. 1171–1182.
The nonspatial knowledge graph example is presented [6] B. Beemer and D. Gregg, ‘‘Mashups: A literature review and classification
framework,’’ Future Internet, vol. 1, no. 1, pp. 59–87, Dec. 2009.
in Figure 8. The information subset for this result is retrieved [7] F. Daniel, M. Matera, and M. Weiss, ‘‘Next in mashup development:
with the following search criteria: the Time tuple is from User-created apps on the Web,’’ IT Prof., vol. 13, no. 5, pp. 22–29,
2018/5/1 to 2018/10/31 and the Participant tuple is ‘‘Cris- Sep./Oct. 2011.
[8] S. Li and J. Gong, ‘‘Mashup: A new way of providing Web mapping/GIS
tiano Ronaldo’’. The co-occurrence relationships between services,’’ in Proc. ISPRS Congr. Commission, Beijing, China, 2008,
soccer player Cristiano Ronaldo and other related persons, pp. 639–649.
such as Lionel Messi, Florentino, and Putin, are illustrated [9] L. R. Johnston and K. L. Jensen, ‘‘MapHappy: A user-centered interface to
library map collections via a Google maps ‘mashup’,’’ J. Map Geography
in the figure. From this type of knowledge graph, users can Libraries, vol. 5, no. 2, pp. 114–130, Jun. 2009.
intuitively determine implicit relationships for specific per- [10] M. Batty, A. Hudson-Smith, R. Milton, and A. Crooks, ‘‘Map mashups,
sons, organizations, etc., using the visualization result. Thus, Web 2.0 and the GIS revolution,’’ Ann. GIS, vol. 16, no. 1, pp. 1–13,
Apr. 2010.
the visualization examples show that the knowledge graphs in [11] S. B. Liu and L. Palen, ‘‘The new cartographers: Crisis map mashups and
our framework provide an intuitive and effective perspective the emergence of neogeographic practice,’’ Cartography Geograph. Inf.
to help users gain insight into the entity-level knowledge in Sci., vol. 37, no. 1, pp. 69–90, Jan. 2010.
[12] M. Haklay, A. Singleton, and C. Parker, ‘‘Web mapping 2.0: The neogeog-
the information set. raphy of the geoWeb,’’ Geography Compass, vol. 2, no. 6, pp. 2011–2039,
Nov. 2008.
IV. CONCLUSIONS [13] M.-J. Kraak, ‘‘Trends in cartography,’’ in Web Cartography: Devel-
In this research, we propose a novel information pro- opments Prospects, M.-J. Kraak and A. Brown, Eds. London, U.K.:
Taylor & Francis, 2001, pp. 9–19.
cessing, modeling, and map mashup visualization frame- [14] M.-J. Kraak, ‘‘Setting and needs for web cartography,’’ in Web Cartogra-
work for location-referenced web resources. The framework phy: Developments Prospects, M.-J. Kraak and A. Brown, Eds. London,
attempts to resolve and model real-time collected web tex- U.K.: Taylor & Francis, 2001, pp. 1–7.
[15] J. Luo, D. Joshi, J. Yu, and A. Gallagher, ‘‘Geotagging in multimedia
tual information from spatial, temporal, and semantic dimen- and computer vision—A survey,’’ Multimedia Tools Appl., vol. 51, no. 1,
sions in real time using NLP, geotagging, and hierarchical pp. 187–211, Jan. 2011.
[16] J. A. McElroy, P. L. Remington, A. Trentham-Dietz, S. A. Robert, and [37] M.-J. Kraak, ‘‘Is there a need for neo-cartography?’’ Cartography Geo-
P. A. Newcomb, ‘‘Geocoding addresses from a large population-based graphic Inf. Sci., vol. 38, no. 2, pp. 73–78, Jan. 2011.
study: Lessons learned,’’ Epidemiology, vol. 14, no. 4, pp. 399–407, [38] J. Korpi and P. Ahonen-Rainio, ‘‘Clutter reduction methods for point
Jul. 2003. symbols in map mashups,’’ Cartographic J., vol. 50, no. 3, pp. 257–265,
[17] M. Larson et al., ‘‘Automatic tagging and geotagging in video collections Aug. 2013.
and communities,’’ in Proc. 1st ACM Int. Conf. Multimedia Retr., Trento, [39] F. Wagner and A. Wolff, ‘‘A combinatorial framework for map
Italy, 2011, p. 51. labeling,’’ in Graph Drawing (Lecture Notes in Computer Science),
[18] P. Nesi, G. Pantaleo, and M. Tenti, ‘‘Ge(o)Lo(cator): Geographic infor- S. H. Whitesides, Ed. Berlin, Germany: Springer, 1998, pp. 316–331.
mation extraction from unstructured text data and Web documents,’’ in [40] M. Luboschik, H. Schumann, and H. Cords, ‘‘Particle-based labeling:
Proc. 9th Int. Workshop Semantic Social Media Adaptation Person., Corfu, Fast point-feature labeling without obscuring other visual features,’’
Greece, Nov. 2014, pp. 60–65. IEEE Trans. Vis. Comput. Graphics, vol. 14, no. 6, pp. 1237–1244,
[19] E. Amitay, N. Har’El, R. Sivan, and A. Soffer, ‘‘Web-a-where: Geotagging Nov./Dec. 2008.
Web content,’’ in Proc. 27th Annu. Int. ACM SIGIR Conf. Res. Develop. [41] M. A. Bekos, M. Kaufmann, D. Papadopoulos, and A. Symvonis,
Inf. Retr., Sheffield, U.K., 2004, pp. 273–280. ‘‘Combining traditional map labeling with boundary labeling,’’ in
[20] A. Blessing, R. Kuntz, and H. Schütze, ‘‘Towards a context model driven Theory and Practice of Computer Science. Berlin, Germany, 2011,
German geo-tagging system,’’ in Proc. 4th ACM Workshop Geograph. Inf. pp. 111–122.
Retr., Lisbon, Portugal, 2007, pp. 25–30. [42] L. Li, H. Zhang, H. Zhu, X. Kuai, and W. Hu, ‘‘A labeling model based
[21] G. Kordopatis-Zilos, S. Papadopoulos, and Y. Kompatsiaris, ‘‘Geotagging on the region of movability for point-feature label placement,’’ ISPRS Int.
social media content with a refined language modelling approach,’’ in Proc. J. Geo-Inf., vol. 5, no. 9, p. 159, Sep. 2016.
Pacific–Asia Workshop Intell. Secur. Inform., Cham, Switzerland, 2015, [43] M. Benkert, H. Haverkort, M. Kroll, and M. Nöllenburg, ‘‘Algorithms for
pp. 21–40. multi-criteria one-sided boundary labeling,’’ in Graph Drawing (Lecture
[22] G. Kordopatis-Zilos, S. Papadopoulos, and I. Kompatsiaris, ‘‘Geotagging Notes in Computer Science), S.-H. Hong, T. Nishizeki, and W. Quan, Eds.
text content with language models and feature mining,’’ Proc. IEEE, Berlin, Germany: Springer, 2007, pp. 243–254.
vol. 105, no. 10, pp. 1971–1986, Oct. 2017. [44] A. Singhal, Introducing the Knowledge Graph: Things, Not Strings, Offi-
[23] M. Larson et al., ‘‘The benchmark as a research catalyst: Charting the cial Google Blog. 2012, p. 5.
progress of geo-prediction for social multimedia,’’ in Multimodal Location [45] J. Pujara, H. Miao, L. Getoor, and W. Cohen, ‘‘Knowledge
Estimation of Videos and Images, J. Choi and G. Friedland, Eds. Cham, graph identification,’’ in The Semantic Web (Lecture Notes in
Switzerland: Springer, 2015, pp. 5–40. Computer Science), H. Alani, Eds. Berlin, Germany: Springer, 2013,
[24] S. Ahern, M. Naaman, R. Nair, and J. H.-I. Yang, ‘‘World explorer: Visual- pp. 542–557.
izing aggregate data from unstructured text in geo-referenced collections,’’
in Proc. 7th ACM/IEEE-CS Joint Conf. Digit. Libraries, Vancouver, BC,
Canada, 2007, pp. 1–10.
[25] M. B. Carmo, A. P. Afonso, P. P. de Matos, and A. Vaz, ‘‘MoViSys—
A visualization system for geo-referenced information on mobile
devices,’’ in Proc. Int. Conf. Adv. Vis. Inf. Syst., Berlin, Germany, 2008, HANG ZHANG received the B.S. degree from the
pp. 167–178. Wuhan University of Technology, China, in 2012.
[26] B. E. Teitler, M. D. Lieberman, D. Panozzo, J. Sankaranarayanan, He is currently pursuing the Ph.D. degree in car-
H. Samet, and J. Sperling, ‘‘NewsStand: A new view on news,’’ in Proc. tography and geographical information engineer-
16th ACM SIGSPATIAL Int. Conf. Adv. Geographic Inf. Syst., Irvine, CA, ing with the School of Resource and Environ-
USA, 2008, p. 18. mental Science, Wuhan University. His research
[27] J. Sankaranarayanan, H. Samet, B. E. Teitler, M. D. Lieberman, and interests include geographic information mining,
J. Sperling, ‘‘Twitterstand: News in tweets,’’ in Proc. 17th ACM SIGSPA- cartography, and map visualization.
TIAL Int. Conf. Adv. Geographic Inf. Syst., Seattle, WA, USA, 2009,
pp. 42–51.
[28] T. Gao, J. R. Hullman, E. Adar, B. Hecht, and N. Diakopoulos,
‘‘NewsViews: An automated pipeline for creating custom geovisualiza-
tions for news,’’ in Proc. 32nd Annu. ACM Conf. Hum. Factors Comput.
Syst., Toronto, ON, Canada, 2014, pp. 3005–3014.
[29] A. Troudi, C. A. Zayani, S. Jamoussi, and I. A. B. Amor, ‘‘A new mashup LIN LI received the Ph.D. degree from Wuhan
based method for event detection from social media,’’ Inf. Syst. Frontiers, University, China, in 1997. He was a Professor
vol. 20, no. 5, pp. 981–992, Oct. 2018.
with the School of Resource and Environmental
[30] X. Kuai, L. Li, H. Luo, S. Hang, Z. Zhang, and Y. Liu, ‘‘Geospa-
Science, Wuhan University. He was with Joseph
tial information categories mapping in a cross-lingual environment:
A case study of ‘surface water’ categories in Chinese and Ameri- Fourier University, France, and The University
can topographic maps,’’ ISPRS Int. J. Geo-Inf., vol. 5, no. 6, p. 90, of Tokyo, Japan, for over four years. His cur-
Jun. 2016. rent research interests include 3D modeling and
[31] D. Jurafsky and J. H. Martin, Speech and Language Processing: An visualization, geographical ontology, 3D cadastre,
Introduction to Natural Language Processing, Computational Linguistics, the integration of ubiquitous location information,
and Speech Recognition. Upper Saddle River, NJ, USA: Prentice-Hall, and feature extraction from point cloud data.
2009.
[32] S. Bird, E. Klein, and E. Loper, Natural Language Processing with Python:
Analyzing Text with the Natural Language Toolkit. Sebastopol, CA, USA:
O’Reilly Media, 2009.
[33] C. D. Manning, M. Surdeanu, J. Bauer, J. Finkel, S. J. Bethard, and
D. McClosky, ‘‘The stanford coreNLP natural language processing WEI HU received the B.S. degree in geomatics
toolkit,’’ in Proc. 52nd Annu. Meeting Assoc. Comput. Linguistics, Syst. engineering from Northeastern University, China,
Demonstrations, Baltimore, MD, USA, 2014, pp. 55–60. in 2012. He is currently pursuing the Ph.D. degree
[34] G. Salton and C. Buckley, ‘‘Term-weighting approaches in automatic in cartography and geographical information sys-
text retrieval,’’ Inf. Process. Manage. vol. 24, no. 5, pp. 513–523, tem with the School of Resource and Environ-
1988. mental Science, Wuhan University. His research
[35] M. Wick and B. Vatant. (2012). The Geonames Geographical Database. interests include digital cartography models and
[Online]. Available: https://www.geonames.org/ technologies, web mapping, and spatial and tem-
[36] R. O. Duda, P. E. Hart, and D. G. Stork, Pattern Classification. New York, poral big data analysis.
NY, USA: Wiley, 2000.
WENJING YAO received the B.S. degree in geo- HAIHONG ZHU received the B.S. and M.S.
graphical information system the from Wuhan degrees in cartography from the Wuhan Technical
University of Technology, China, in 2012. She University of Surveying and Mapping, Wuhan,
is currently involved in research related to smart China, in 1986 and 1996, respectively, and the
cities and big data mining. Ph.D. degree in cartography from Wuhan Uni-
versity, China, in 2013, where she is currently
a Professor and the Ph.D. Advisor with the
School of Resource and Environmental Science.
Her research interests include navigation digital
map, map design, geographical ontology, and the
3D modeling and visualization of geographical information.