Download as pdf or txt
Download as pdf or txt
You are on page 1of 70

The Semantic Web 15th International

Conference ESWC 2018 Heraklion Crete


Greece June 3 7 2018 Proceedings
Lecture Notes in Computer Science
10843 Aldo Gangemi
Visit to download the full and correct content document:
https://ebookmeta.com/product/the-semantic-web-15th-international-conference-eswc
-2018-heraklion-crete-greece-june-3-7-2018-proceedings-lecture-notes-in-computer-s
cience-10843-aldo-gangemi/
More products digital (pdf, epub, mobi) instant
download maybe you interests ...

The Semantic Web ISWC 2021 20th International Semantic


Web Conference ISWC 2021 Virtual Event October 24 28
2021 Proceedings Lecture Notes in Computer Science
Andreas Hotho
https://ebookmeta.com/product/the-semantic-web-iswc-2021-20th-
international-semantic-web-conference-iswc-2021-virtual-event-
october-24-28-2021-proceedings-lecture-notes-in-computer-science-
andreas-hotho/

Learning and Intelligent Optimization 15th


International Conference LION 15 Athens Greece June 20
25 2021 Revised Selected Papers Lecture Notes in
Computer Science Dimitris E. Simos (Editor)
https://ebookmeta.com/product/learning-and-intelligent-
optimization-15th-international-conference-lion-15-athens-greece-
june-20-25-2021-revised-selected-papers-lecture-notes-in-
computer-science-dimitris-e-simos-editor/

Brain Function Assessment in Learning Second


International Conference BFAL 2020 Heraklion Crete
Greece October 9 11 2020 Proceedings Claude Frasson

https://ebookmeta.com/product/brain-function-assessment-in-
learning-second-international-conference-bfal-2020-heraklion-
crete-greece-october-9-11-2020-proceedings-claude-frasson/

Proceedings of the International Conference of


Mechatronics and Cyber MixMechatronics 2018 Lecture
Notes in Networks and Systems Book 48 Gheorghe I
Gheorghe
https://ebookmeta.com/product/proceedings-of-the-international-
conference-of-mechatronics-and-cyber-
mixmechatronics-2018-lecture-notes-in-networks-and-systems-
Web Engineering: 21st International Conference, ICWE
2021, Biarritz, France, May 18–21, 2021, Proceedings
(Lecture Notes in Computer Science, 12706) Marco
Brambilla (Editor)
https://ebookmeta.com/product/web-engineering-21st-international-
conference-icwe-2021-biarritz-france-may-18-21-2021-proceedings-
lecture-notes-in-computer-science-12706-marco-brambilla-editor/

Reversible Computation 13th International Conference RC


2021 Virtual Event July 7 8 2021 Proceedings Lecture
Notes in Computer Science 12805 Shigeru Yamashita
(Editor)
https://ebookmeta.com/product/reversible-computation-13th-
international-conference-rc-2021-virtual-event-
july-7-8-2021-proceedings-lecture-notes-in-computer-
science-12805-shigeru-yamashita-editor/

Web and Internet Economics: 17th International


Conference, WINE 2021, Potsdam, Germany, December
14–17, 2021, Proceedings (Lecture Notes in Computer
Science, 13112) Michal Feldman
https://ebookmeta.com/product/web-and-internet-economics-17th-
international-conference-wine-2021-potsdam-germany-
december-14-17-2021-proceedings-lecture-notes-in-computer-
science-13112-michal-feldman/

Integer Programming and Combinatorial Optimization 23rd


International Conference IPCO 2022 Eindhoven The
Netherlands June 27 29 2022 Proceedings Lecture Notes
in Computer Science 13265 Karen Aardal (Editor)
https://ebookmeta.com/product/integer-programming-and-
combinatorial-optimization-23rd-international-conference-
ipco-2022-eindhoven-the-netherlands-june-27-29-2022-proceedings-
lecture-notes-in-computer-science-13265-karen-aardal/

Introduction to Lipschitz Geometry of Singularities


Lecture Notes of the International School on
Singularity Theory and Lipschitz Geometry Cuernavaca
June 2018 Lecture Notes in Mathematics Walter Neumann
(Editor)
https://ebookmeta.com/product/introduction-to-lipschitz-geometry-
of-singularities-lecture-notes-of-the-international-school-on-
singularity-theory-and-lipschitz-geometry-cuernavaca-
Aldo Gangemi · Roberto Navigli
Maria-Esther Vidal · Pascal Hitzler
Raphaël Troncy · Laura Hollink
Anna Tordai · Mehwish Alam (Eds.)
LNCS 10843

The Semantic Web


15th International Conference, ESWC 2018
Heraklion, Crete, Greece, June 3–7, 2018
Proceedings

123
Lecture Notes in Computer Science 10843
Commenced Publication in 1973
Founding and Former Series Editors:
Gerhard Goos, Juris Hartmanis, and Jan van Leeuwen

Editorial Board
David Hutchison
Lancaster University, Lancaster, UK
Takeo Kanade
Carnegie Mellon University, Pittsburgh, PA, USA
Josef Kittler
University of Surrey, Guildford, UK
Jon M. Kleinberg
Cornell University, Ithaca, NY, USA
Friedemann Mattern
ETH Zurich, Zurich, Switzerland
John C. Mitchell
Stanford University, Stanford, CA, USA
Moni Naor
Weizmann Institute of Science, Rehovot, Israel
C. Pandu Rangan
Indian Institute of Technology Madras, Chennai, India
Bernhard Steffen
TU Dortmund University, Dortmund, Germany
Demetri Terzopoulos
University of California, Los Angeles, CA, USA
Doug Tygar
University of California, Berkeley, CA, USA
Gerhard Weikum
Max Planck Institute for Informatics, Saarbrücken, Germany
More information about this series at http://www.springer.com/series/7409
Aldo Gangemi Roberto Navigli

Maria-Esther Vidal Pascal Hitzler


Raphaël Troncy Laura Hollink


Anna Tordai Mehwish Alam (Eds.)


The Semantic Web


15th International Conference, ESWC 2018
Heraklion, Crete, Greece, June 3–7, 2018
Proceedings

123
Editors
Aldo Gangemi Raphaël Troncy
University of Bologna EURECOM
Bologna Biot
Italy France
Roberto Navigli Laura Hollink
Sapienza University of Rome CWI
Rome Amsterdam
Italy The Netherlands
Maria-Esther Vidal Anna Tordai
Universidad Simón Bolívar Elsevier B.V.
Caracas Amsterdam
Venezuela The Netherlands
Pascal Hitzler Mehwish Alam
Wright State University CNR-ISTC
Dayton, OH Rome
USA Italy

ISSN 0302-9743 ISSN 1611-3349 (electronic)


Lecture Notes in Computer Science
ISBN 978-3-319-93416-7 ISBN 978-3-319-93417-4 (eBook)
https://doi.org/10.1007/978-3-319-93417-4

Library of Congress Control Number: 2018946633

LNCS Sublibrary: SL3 – Information Systems and Applications, incl. Internet/Web, and HCI

© Springer International Publishing AG, part of Springer Nature 2018


This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or part of the
material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation,
broadcasting, reproduction on microfilms or in any other physical way, and transmission or information
storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now
known or hereafter developed.
The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication
does not imply, even in the absence of a specific statement, that such names are exempt from the relevant
protective laws and regulations and therefore free for general use.
The publisher, the authors, and the editors are safe to assume that the advice and information in this book are
believed to be true and accurate at the date of publication. Neither the publisher nor the authors or the editors
give a warranty, express or implied, with respect to the material contained herein or for any errors or
omissions that may have been made. The publisher remains neutral with regard to jurisdictional claims in
published maps and institutional affiliations.

Printed on acid-free paper

This Springer imprint is published by the registered company Springer International Publishing AG
part of Springer Nature
The registered company address is: Gewerbestrasse 11, 6330 Cham, Switzerland
Preface

This volume contains the main proceedings of the 2018 edition of the Extended
Semantic Web Conference (ESWC 2018). ESWC is established as a yearly major
venue for discussing the latest scientific results and technology innovations related to
the Semantic Web and linked data. At ESWC, international scientists, industry spe-
cialists, and practitioners meet to discuss the future of applicable, scalable,
user-friendly, as well as potentially game-changing solutions. The 15th edition took
place from June 3rd to June 7th, 2018, in Heraklion, Crete (Greece).
Building on its past success, ESWC is also a venue for broadening the focus of the
Semantic Web community to span other relevant research areas in which semantics and
Web technology play an important role, as well as for experimenting with innovative
practices and topics that deliver extra value to the community and beyond.
This year’s conference has introduced some novelties compared with the past: a new
open reviewing policy (see later); three main tracks instead of two: the usual research
and in-use tracks, plus a novel resource track; a new industry session dedicated to
experiences and results by semantic technology companies.
The research track is organized into subtracks, and the chairs of ESWC 2018
organized two special subtracks putting particular emphasis on usage areas where
Semantic Web technologies are facilitating a leap of progress, namely: “Benchmarking
and Empirical Evaluations” and “Semantic Web for Science.”
The other events keep the legacy of the past: a poster and demo session, workshops
and tutorials, a PhD symposium, a challenges track, a project networking session. The
regular subtracks include: vocabularies, schemas, ontologies, reasoning, linked data
(including knowledge graphs), social Web and Web science, semantic data manage-
ment, natural language processing and information retrieval, machine learning, mobile
Web, sensors and semantic streams, services, APIs, processes and cloud computing.
Concerning the open reviewing policy, ESWC 2018 adopted a double-open
reviewing schema, additionally placing all papers (either accepted or rejected) on the
conference website, jointly with their reviews (rejected papers will be removed after the
conference upon the authors’ request) and the names of the reviewers (except when
they decided to opt out). Two special chairs, Silvio Peroni and Sarven Capadisli,
produced the website branch hosting papers and reviews, and enabled visitors to
provide comments. The reviewing data have been processed in order to obtain insights
over the multiple facets involved in the process and its argumentation. The overall
policy is original for a conference, but we have inherited the experience of the suc-
cessful open reviewing applied by the Semantic Web Journal, championed by one of its
editors-in-chief, Pascal Hitzler, who was the resource chair at ESWC 2018. Data
analytics and user feedback reports from this innovative process were discussed at the
opening and closing sessions of ESWC 2018.
Emerging from its roots in AI and Web technology, the Semantic Web today is
mainly a Web of linked data and a reference model for all sorts of knowledge graphs
VI Preface

emerging in institutions, companies, and academia, upon which a plethora of services


and applications for all possible domains are being proposed. Some of the core chal-
lenges that the Semantic Web aims at addressing are heterogeneity of content, and its
volatile and rapidly changing nature, its uncertainty, provenance, and varying quality.
This, in combination with more traditional disciplines, such as logical modeling and
reasoning, natural language processing, databases and data storage and access, machine
learning, distributed systems, information retrieval and data mining, social networks,
Web science and Web engineering, shows the span of topics covered by this confer-
ence. The nine regular research subtracks, in combination with resources and in-use
tracks, and the two special research subtracks, constitute the main scientific program of
ESWC 2018.
The main scientific program of the conference comprised 48 papers: 31 research,
nine resources papers, eight in-use papers, selected out of 179 reviewed submissions,
which corresponds to an acceptance rate of 23.5% for the research papers submitted,
41% for the resource papers, and 32% for the in-use papers. This program was com-
pleted by a demonstration and poster session (41 accepted out of 53 submissions), in
which researchers had the chance to present their latest results and advances in the form
of live demos or face-to-face presentations. In addition, the PhD symposium program
included ten contributions, selected out of 17 submissions.
The conference program contained nine workshops, five tutorials, and an EU project
networking session. This year, an open call also allowed us to select and support four
challenges: Open Knowledge Extraction, Semantic Sentiment Analysis, Triple Store
Performance, and Scalable Question-Answering over Linked Data. These associated
events created an even more open, multidisciplinary, and cross-fertilizing environment
at the conference, allowing for work-in progress and practical results to be discussed.
Workshops ranged from domain-focused topics, including the biomedical, scientific
publishing, e-science, robotics, distributed ledgers, and scholarly fields, to more
technology-focused topics ranging from RDF stream processing, query processing,
data quality and data evolution, to sentiment analysis and semantic deep learning.
Tutorial topics spanned NLP, ontology engineering, and linked data, including specific
tutorials on knowledge graphs and rule-based processing of data. Proceedings from
these satellite events are available in a separate volume.
The program also included three exciting invited keynotes, with topics ranging from
the use of knowledge graphs in industrial applications, to traditional as well as novel
Semantic Web topics.
The general and Program Committee chairs would like to thank the many people
who were involved in making ESWC 2018 a success. First of all, our thanks go to the
24 co-chairs of the research track and the 360 reviewers for all tracks, including 49
external reviewers, for ensuring a rigorous and open review process that led to an
excellent scientific program, and an average of four reviews per article. The scientific
program was completed by an exciting selection of posters and demos chaired by Anna
Lisa Gentile and Andrea Giovanni Nuzzolese.
Special thanks go to the PhD symposium chairs, Sebastian Rudolph and Maria
Maleshkova, who managed one of the key events at ESWC. The brilliant PhD students
will become the future leaders of our field, and deserve both encouragement and
mentoring, which Maria and Sebastian made sure we could provide.
Preface VII

We also had a great selection of workshops and tutorials, as mentioned earlier,


thanks to the commitment of our workshop and tutorial chairs, Heiko Paulheim and Jeff
Pan.
Thanks to our EU project networking session chair, Maria Maleshkova, we had the
opportunity to facilitate meetings and exciting discussions between leading European
research projects. Networking and sharing ideas between projects is a crucial success
factor for such large research projects.
We are additionally grateful for the work and commitment of Davide Buscaldi and
Diego Reforgiato Recupero, and all the individual challenges chairs, who successfully
established a challenge track.
Thanks to Andrea Conte for the organization of the industry track, which included
six substantial presentations from companies that have started adopting Semantic Web
technologies, and to York Sure for playing the ever-challenging sponsor chair role.
We thank STI International for supporting the conference organization, and partic-
ularly Dieter Fensel as the conference treasurer. YouVivo GmbH and Venja Lehmann in
particular deserve special thanks for the professional support of the local conference
organization and for solving all practical matters.
Further, we are very grateful to Mehwish Alam, our publicity and proceedings chair,
who kept our community informed throughout the year and did an excellent job in
preparing this volume with the kind support of Springer. We also thank Venislav
Georgiev, who administered the website. We finally thank our sponsors, listed on the
next pages, for their vital support of this edition of ESWC.
The general chair would like to close the preface with his warm thanks to the six
program chairs for their relaxed but rigorous commitment in carrying out the novel
open reviewing experiment, facing the thousands of questions and the few complaints
from leading our peer community in an almost unknown territory, which will be a
milestone for future events fostering openness in science.

June 2018 Aldo Gangemi


Maria-Esther Vidal
Roberto Navigli
Pascal Hitzler
Raphaël Troncy
Anna Tordai
Laura Hollink
Organization

General Chair
Aldo Gangemi University of Bologna and CNR, Italy

Research Track Program Chairs


Roberto Navigli Sapienza University of Rome, Italy
Maria-Esther Vidal Leibniz Information Centre for Science and
Technology University Library, Germany, and
Universidad Simon Bolivar, Venezuela

Resource Track Program Chairs


Pascal Hitzler Wright State University, USA
Raphael Troncy Eurecom, France

In-Use Track Program Chairs


Laura Hollink Centrum Wiskunde and Informatica, The Netherlands
Anna Tordai Elsevier, The Netherlands

Workshop and Tutorial Chairs


Heiko Paulheim University of Mannheim, Germany
Jeff Pan The University of Aberdeen, UK

Poster and Demo Track Chairs


Anna Lisa Gentile IBM Research Almaden, USA
Andrea Giovanni Nuzzolese ISTC-CNR, Rome, Italy

Challenge Chairs
Davide Buscaldi Université Paris 13, France
Diego Reforgiato Recupero University of Cagliari, Italy

PhD Symposium Chairs


Sebastian Rudolph Technische Universität Dresden, Germany
Maria Maleshkova Karlsruhe Institute of Technology, Germany
X Organization

EU Project Networking Chair


Maria Maleshkova Karlsruhe Institute of Technology, Germany

Industry Session Chair


Andrea Conte Reply Spa, Italy

Sponsoring Chair
York Sure-Vetter Karlsruhe Institute of Technology, Germany

Open Conference Data Chairs


Sarven Capadisli University of Bonn, Germany
Silvio Peroni University of Bologna, Italy

Publicity Chairs
Mehwish Alam ISTC-CNR, Rome, Italy

Web Presence
Venislav Georgiev STI, Austria

Proceedings Chair
Mehwish Alam ISTC-CNR, Rome, Italy

Treasurer
Dieter Fensel STI, Austria

Local Conference Organization


Venja Lehmann YouVivo GmbH, Germany

Subtrack Chairs
Vocabularies, Schemas, Ontologies
Helena Sofia Pinto Universidade de Lisbon, Portugal
Christoph Lange University of Bonn Fraunhofer IAIS, Germany
Organization XI

Reasoning
Jeff Pan The University of Aberdeen, UK
Diego Calvanese Free University of Bozen-Bolzano, Italy

Linked Data
Hala Skaf-Molli University of Nantes, France
Jorge Gracia University of Zaragoza, Spain

Social Web and Web Science


Harald Sack Leibniz Institute for Information Infrastructure,
Karlsruhe Institute of Technology, Germany
Stefan Dietze L3S Research Center, Germany

Semantic Data Management, Big Data, Scalability


Maribel Acosta Karlsruhe Institute of Technology, Germany
Olaf Hartig Linköping University, Sweden

Natural Language Processing and Information Retrieval


John McCrae National University of Ireland, Ireland
Valentina Presutti Institute of Cognitive Sciences and Technologies, Italy

Machine Learning
Andreas Hotho University of Würzburg, Germany
Achim Rettinger Karlsruhe Institute of Technology, Germany

Mobile Web, Sensors, and Semantic Streams


Themis Palpanas Paris Descartes University, France
Intizar Ali National University of Ireland, Ireland

Services, APIs, Processes, and Cloud Computing


Amrapali Zaveri Maastricht University, The Netherlands
Ruben Verborgh Research Foundation Flanders Ghent University,
Belgium

Benchmarking and Empirical Evaluation


Oscar Corcho Universidad Politécnica de Madrid, Spain
Emanuele Della Valle Politecnico di Milano, Italy
XII Organization

Scientific Semantic Web


Catia Pesquita University of Lisbon, Portugal
Adrian Paschke Freie Universität Berlin, Germany

Program Committee
Maribel Acosta Karlsruhe Institute of Technology, Germany
Nitish Aggarwal IBM, USA
Asan Agibetov Medical University of Vienna, Austria
Henning Agt-Rickauer HPI, Germany
Mehwish Alam Consiglio Nazionale delle Ricerche, Italy
Céline Alec Université Caen-Normandie, France
Intizar Ali Insight Centre for Data Analytics, National University
of Ireland
Vito Walter Anelli Politecnico di Bari, Italy
Grigoris Antoniou University of Huddersfield, UK
Mihael Arcan Insight Centre for Data Analytics, National University
of Ireland
Luigi Asprino University of Bologna and STLab, Italy
Judie Attard Trinity College Dublin, Ireland
Martin Atzmueller Tilburg University, The Netherlands
Sören Auer TIB Leibniz Information Center of Science
and Technology and University of Hannover,
Germany
Michele Barbera SpazioDati SRL, Italy
Payam Barnaghi University of Surrey, UK
Caroline Barriere CRIM, Canada
Pierpaolo Basile University of Bari, Italy
Valerio Basile Inria, France
Hannah Bast University of Freiburg, Germany
Martin Becker University of Würzburg, Germany
Wouter Beek Vrije Universiteit Amsterdam, The Netherlands
Khalid Belhajjame Université Paris-Dauphine, France
Vito Bellini Politecnico di Bari, Italy
Eva Blomqvist Linköping University, Sweden
Carlos Bobed University of Rennes 1, France
Shawn Bowers Gonzaga University, USA
Loris Bozzato Fondazione Bruno Kessler, Italy
Christopher Brewster TNO, The Netherlands
Carlos Buil Aranda Universidad Técnica Federico Santa María, Chile
Paul Buitelaar Insight Centre for Data Analytics, National University
of Ireland
Davide Buscaldi LIPN, Université Paris 13, France
Elena Cabrio Université Côte d’Azur, CNRS, Inria, France
Organization XIII

Jean-Paul Calbimonte University of Applied Sciences and Arts Western


Switzerland, Switzerland
Diego Calvanese Free University of Bozen-Bolzano, Italy
Mario Cannataro University Magna Graecia of Catanzaro, Italy
Caterina Caracciolo Food and Agriculture Organization of the United
Nations, Italy
Irene Celino CEFRIEL, Italy
Davide Ceolin Vrije Universiteit Amsterdam, The Netherlands
Miguel Ceriani Queen Mary University of London, UK
Michelle Cheatham Wright State University, USA
Christian Chiarcos Universität Frankfurt am Main, Germany
Key-Sun Choi Korea Advanced Institute of Science and Technology,
South Korea
Diego Collarana Enterprise Information System
Pieter Colpaert Ghent University, Belgium
Andrea Conte Reply, Italy
Oscar Corcho Universidad Politécnica de Madrid, Spain
Francesco Corcoglioniti University of Trento, Italy
Melanie Courtot European Bioinformatics Institute, UK
Francisco Couto University of Lisbon, Portugal
Roberta Cuel University of Trento, Italy
Edward Curry Insight Centre for Data Analytics, National University
of Ireland
Claudia d’Amato University of Bari, Italy
Mathieu D’Aquin Insight Centre for Data Analytics, National University
of Ireland
Aba-Sah Dadzie The Open University
Laura M. Daniele TNO, Netherlands Organization for Applied Scientific
Research, The Netherlands
Brian Davis Insight Centre for Data Analytics, National University
of Ireland
Victor de Boer Vrije Universiteit Amsterdam, The Netherlands
Ernesto William De Luca Georg Eckert Institute – Leibniz Institute for
International Textbook Research, Germany
Jeremy Debattista Trinity College Dublin, Ireland
Daniele Dell’Aglio University of Zurich, Switzerland
Emanuele Dellavalle Politecnico di Milano, Italy
Gianluca Demartini University of Queensland, Australia
Tommaso Di Noia Politecnico di Bari, Italy
Dennis Diefenbach Jean Monet University, France
Stefan Dietze L3S Research Center, Germany
Anastasia Dimou Ghent University, Belgium
Christian Dirschl Wolters Kluwer, Germany
Dejing Dou University of Oregon, USA
Mauro Dragoni Fondazione Bruno Kessler, Italy
Alistair Duke BT, UK
XIV Organization

Anca Dumitrache Vrije Universiteit Amsterdam, The Netherlands


Michel Dumontier Maastricht University
Floriana Esposito Università degli Studi di Aldo Moro Bari, Italy
Ralph Ewerth Leibniz Information Centre for Science
and Technology, Germany
Nicola Fanizzi Università degli Studi di Bari Aldo Moro, Italy
Stefano Faralli University of Mannheim, Germany
Daniel Faria Instituto Gulbenkian de Ciência, Portugal
Catherine Faron Zucker Université Nice Sophia Antipolis, France
Anna Fensel Semantic Technology Institute Innsbruck,
University of Innsbruck, Austria
Miriam Fernandez Knowledge Media Institute
Javier D. Fernández Vienna University of Economics and Business, Austria
Agata Filipowska Poznan University of Economics, Poland
Valeria Fionda University of Calabria, Italy
George H. L. Fletcher Eindhoven University of Technology, The Netherlands
Giorgos Flouris FORTH-ICS, Greece
Flavius Frasincar Erasmus University of Rotterdam, The Netherlands
Irini Fundulaki FORTH-ICS, Greece
Luis Galárraga Aalborg University, Denmark
Aldo Gangemi Università di Bologna and CNR-ISTC, Italy
Daniel Garijo Information Sciences Institute, USA
Anna Lisa Gentile IBM, USA
Alain Giboin Inria, France
Martin Giese University of Oslo, Norway
Rafael S Gonçalves Stanford University, USA
Jorge Gracia University of Zaragoza, Spain
Irlan Grangel University of Bonn, Germany
Michael Granitzer University of Passau, Germany
Alasdair Gray Heriot-Watt University, UK
Dagmar Gromann TU Dresden, Germany
Paul Groth Elsevier Labs
Giovanna Guerrini University of Genoa, Italy
Alessio Gugliotta Innova, Italy
Giancarlo Guizzardi Ontology and Conceptual Modeling Research
Group/Federal University of Espirito Santo, Brazil
Christophe Guéret Accenture, Ireland
Amelie Gyrard Ohio Center of Excellence in Knowledge-enabled
Computing, USA
Peter Haase metaphacts
Lavdim Halilaj Fraunhofer, Germany
Siegfried Handschuh University of Passau, Germany
Andreas Harth Karlsruhe Institute of Technology, Germany
Olaf Hartig Linköping University, Sweden
Oktie Hassanzadeh IBM, USA
Jörn Hees TU Kaiserslautern and DFKI, Germany
Organization XV

Sven Hertling University of Mannheim, Germany


Daniel Hienert GESIS, Leibniz Institute for the Social Sciences,
Germany
Pascal Hitzler Wright State University, USA
Robert Hoehndorf King Abdullah University of Science and Technology,
Saudi Arabia
Rinke Hoekstra University of Amsterdam, The Netherlands
Aidan Hogan Universidad de Chile, Chile
Laura Hollink CWI, The Netherlands
Anett Hoppe TIB, Leibniz Information Centre for Science
and Technology, Germany
Matthew Horridge Stanford University, USA
Tomas Horvath Eötvös Loránd University, Hungary
Katja Hose Aalborg University, Denmark
Andreas Hotho University of Würzburg, Germany
Zhisheng Huang Vrije Universiteit Amsterdam, The Netherlands
Antoine Isaac Europeana and Vrije Universiteit Amsterdam,
The Netherlands
Ashutosh Jadhav IBM, USA
Valentina Janev The Mihajlo Pupin Institute, Serbia
Krzysztof Janowicz University of California, Santa Barbara, USA
Ernesto Jimenez-Ruiz University of Oslo, Norway
Anna Jordanous University of Kent, UK
Simon Jupp European Bioinformatics Institute
Martin Kaltenböck Semantic Web Company
Naouel Karam Freie Universität Berlin, Germany
Anna Kaspzik Technische Informationsbibliothek, Germany
Tomi Kauppinen Aalto University School of Science, Finland
Takahiro Kawamura Japan Science and Technology Agency, Japan
Ali Khalili Vrije Universiteit Amsterdam, The Netherlands
Evgeny Kharlamov University of Oxford, UK
Sabrina Kirrane Vienna University of Economics and Business, Austria
Tomas Kliegr University of Economics, Prague, Czech Republic
Roman Kontchakov Birkbeck, University of London
Maria Koutraki FIZ Karlsruhe, Germany
Kouji Kozaki Osaka University, Japan
Adila A. Krisnadhi Wright State University, USA and Universitas
Indonesia
Sebastian Köhler Institut für Medizinische Genetik,
Charite Universitätsmedizin Berlin, Germany
Christoph Lange University of Bonn and Fraunhofer IAIS, Germany
Mike Lauruhn Elsevier
Agnieszka Lawrynowicz Poznan University of Technology, Poland
Danh Le Phuoc TU Berlin, Germany
Maxime Lefrançois Mines Saint-Etienne, France
Jens Lehmann University of Bonn, Germany
XVI Organization

Steffen Lohmann Fraunhofer, Germany


Vanessa Lopez IBM, Ireland
Phillip Lord Newcastle University, UK
Ismini Lourentzou University of Illinois at Urbana - Champaign, USA
Bertram Ludaescher University of Illinois at Urbana-Champaign, USA
Ioanna Lytra University of Bonn, Germany
Valentina Maccatrozzo Vrije Universiteit Amsterdam, The Netherlands
Christian Mader Fraunhofer, Germany
Veronique Malaise Elsevier BV
Maria Maleshkova Karlsruhe Institute of Technology, Germany
Alessandro Margara Politecnico di Milano, Italy
Nicolas Matentzoglu EMBL-EBI, UK
Diana Maynard The University of Sheffield, UK
John P. McCrae National University of Ireland
Fiona McNeill Heriot Watt University, UK
Edgar Meij Bloomberg L.P., London
Albert Meroño-Peñuela Vrije Universiteit Amsterdam, The Netherlands
Aditya Mogadala Karlsruhe Institute of Technology, Germany
Pascal Molli University of Nantes, France
Stefano Montanelli University of Milan, Italy
Steven Moran University of Zurich, Switzerland
Till Mossakowski University of Magdeburg, Germany
Andreas Mueller Schaeffler Technologies AG, Germany
Varish Mulwad GE Global Research, USA
Raghava Mutharaju GE Global Research, USA
Roberto Navigli Sapienza University of Rome, Italy
Axel-Cyrille Ngonga Paderborn University, Germany
Ngomo
Vinh Nguyen National Library of Medicine, NIH, USA
Andriy Nikolov metaphacts GmbH, Germany
Lyndon Nixon MODUL Technology GmbH
Vit Novacek DERI, National University of Ireland, Ireland
Andrea Giovanni Nuzzolese University of Bologna, Italy
Leo Obrst MITRE, USA
Fabrizio Orlandi University of Bonn, Germany
Francesco Osborne The Open University
Petya Osenova Sofia University and IICT-BAS, Bulgaria
Guillermo Palma Universidad Simón Bolívar, Venezuela
Matteo Palmonari University of Milano-Bicocca, Italy
Themis Palpanas Paris Descartes University, France
Jeff Z. Pan University of Aberdeen, UK
Adrian Paschke Freie Universität Berlin, Germany
Michele Pasin Nature Publishing Group
Tommaso Pasini Sapienza University of Rome, Italy
Pankesh Patel Fraunhofer CESE, Germany
Heiko Paulheim University of Mannheim, Germany
Organization XVII

Tassilo Pellegrini University of Applied Sciences St. Pölten, Austria


Silvio Peroni University of Bologna, Italy
Catia Pesquita Universidade de Lisboa, Portugal
Rafael Peñaloza Free University of Bozen-Bolzano, Italy
Hsofia Pinto Algos, INESC-ID/IST, Portugal
Giuseppe Pirrò Institute for High Performance Computing
and Networking, ICAR-CNR, Italy
Antonella Poggi Sapienza University of Rome, Italy
Axel Polleres Vienna University of Economics and Business, Austria
Jedrzej Potoniec Poznan University of Technology, Poland
María Poveda-Villalón Universidad Politécnica de Madrid, Spain
Valentina Presutti CNR, Institute of Cognitive Sciences and Tecnologies,
Italy
Freddy Priyatna Universidad Politécnica de Madrid, Spain
Gustavo Publio Universität Leipzig, Germany
Dharmen Punjani National and Kapodistrian University of Athens,
Greece
Guilin Qi Southeast University
Steffen Remus University of Hamburg, Germany
Achim Rettinge Karlsruhe Institute of Technology, Germany
Martin Rezk Rakuten, Tokyo, Japan
Martin Riedl University of Stuttgart, Germany
Petar Ristoski University of Mannheim, Germany
Carlos R. Rivero Rochester Institute of Technology, USA
Giuseppe Rizzo ISMB, Italy
Víctor Rodríguez Doncel Universidad Politécnica de Madrid, Spain
Mariano Rodríguez Muro IBM, USA
Francesco Ronzano Universitat Pompeu Fabra, Spain
Ana Roxin University of Burgundy, France
Edna Ruckhaus Universidad Politécnica de Madrid, Spain
Sebastian Rudolph TU Dresden, Germany
Anisa Rula University of Milano-Bicocca, Italy
Alessandro Russo STLab, ISTC-CNR, Italy
Michele Ruta Politecnico di Bari, Italy
Harald Sack FIZ Karlsruhe, Leibniz Institute for Information
Infrastructure and KIT Karlsruhe, Germany
Satya Sahoo Case Western Reserve University, USA
Idafen Santana-Pérez Universidad Politécnica de Madrid, Spain
Cristina Sarasua University of Zurich, Switzerland
Felix Sasaki Lambdawerk
Kai-Uwe Sattler TU Ilmenau, Germany
Marco Luca Sbodio IBM, Ireland
Ralf Schenkel Trier University, Germany
Stefan Schlobach Vrije Universiteit Amsterdam, The Netherlands
Michael Schmidt Amazon, USA
Jodi Schneider University of Illinois Urbana Champaign, USA
XVIII Organization

Juan F. Sequeda Capsenta Labs


Bariş Sertkaya Frankfurt University of Applied Sciences, Germany
Nicolas Seydoux LAAS-CNRS/IRIT, France
Hamed Shariat Yazdi University of Bonn, Germany
Chaitanya Shivade IBM, USA
Pavel Shvaiko Informatica Trentina, Italy
Mantas Simkus Vienna University of Technology, Austria
Kuldeep Singh Fraunhofer, Germany
Hala Skaf Nantes University, France
Jennifer Sleeman University of Maryland Baltimore County, USA
Monika Solanki University of Oxford, UK
Steffen Staab University of Koblenz-Landau, Germany
and University of Southampton, UK
Timo Stegemann University of Duisburg-Essen, Germany
Thomas Steiner Google, Germany
Armando Stellato University of Rome Tor Vergata, Italy
Daria Stepanova Max Planck Institute for Informatics, Germany
Markus Stocker German National Library of Science and Technology,
TIB, Germany
Umberto Straccia ISTI-CNR, Italy
Olga Streibel Bayer, Germany
York Sure-Vetter Karlsruhe Institute of Technology, Germany
Mari Carmen Universidad Politécnica de Madrid, Spain
Suárez-Figueroa
Vojtěch Svátek University of Economics, Prague, Czech Republic
Danai Symeonidou INRA, France
Ruben Taelman Ghent University, Belgium
Hideaki Takeda National Institute of Informatics, Japan
Harsh Thakkar University of Bonn, Germany
Krishnaprasad Thirunarayan Wright State University, USA
Steffen Thoma Karlsruhe Institute of Technology, Germany
Ilaria Tiddi The Open University, UK
Tabea Tietz FIZ Karlsruhe, Germany
Riccardo Tommasini Politecnico di Milano, Italy
Anna Tordai Elsevier
Ignacio Traverso-Rebon Karlsruhe Institute of Technology, Germany
Cassia Trojahn UT2J and IRIT, France
Raphael Troncy EURECOM, France
Jürgen Umbrich Vienna University of Economy and Business, Austria
Ricardo Usbeck Paderborn University, Germany
Astrid van Aggelen CWI, The Netherlands
Marieke van Erp KNAW Humanities Cluster, The Netherlands
Jacco van Ossenbruggen CWI and VU University Amsterdam, The Netherlands
Ruben Verborgh Ghent University, Belgium
Maria Esther Vidal Universidad Simon Bolivar, Venezuela
Maria-Esther Vidal Leibniz Information Centre, Germany
Organization XIX

Serena Villata CNRS, Laboratoire d’Informatique,


Signaux et Systèmes de Sophia-Antipolis, France
Simon Walk Graz University of Technology, Austria
Sebastian Walter Semalytix GmbH, Germany
Haofen Wang Shenzhen Gowild Robotics Co. Ltd.
Kewen Wang Griffith University, Australia
Krzysztof Wecel Poznan University of Economics, Poland
Josiane Xavier Parreira Siemens AG Österreich, Austria
Guohui Xiao Free University of Bozen-Bolzano, Italy
Chenyan Xiong Carnegie Mellon University, USA
Takahira Yamaguchi Keio University, Japan
Ondřej Zamazal University of Economics, Prague, Czech Republic
Amrapali Zaveri Maastricht University, The Netherlands
Lei Zhang Karlsruhe Institute of Technology, Germany
Ziqi Zhang Sheffield University, UK
Jun Zhao University of Oxford, UK
Antoine Zimmermann École des Mines de Saint-Étienne, France
Daniel Zoller University of Würzburg, Germany

Additional Reviewers

Agapito, Giuseppe Ibrahim, Yusra


Arcan, Mihael Idrissou, Al
Bader, Sebastian Janke, Daniel
Bakkelund, Daniel Jurgovsky, Johannes
Barros, Márcia Karlsen, Leif Harald
Basile, Pierpaolo Klungre, Vidar
Bin, Simon Lamurias, Andre
Brosius, Dominik Lohr, Matthias
Calabrese, Barbara Malone, James
Chakravarthi, Bharathi Raja Matentzoglu, Nicolas
Corman, Julien Milano, Marianna
d’Amato, Claudia Mireles, Victor
de Graaf, Klaas Andries Monti, Diego
Ding, Linfang Montoya, Gabriela
Elias, Mirette Moodley, Kody
Fafalios, Pavlos Mousavi Nejad, Najmeh
Ferreira, João Müller-Budack, Eric
Fischl, Wolfgang Nayyeri, Mojtaba
Gaignard, Alban Neuhaus, Fabian
Galliani, Pietro Neumaier, Sebastian
Galárraga, Luis Nooralahzadeh, Farhad
Ghiasnezhadomran, Pouya Pandit, Harshvardhan Jitendra
Güzel, Elem Paudel, Bibek
XX Organization

Peñaloza, Rafael Spahiu, Blerina


Potoniec, Jedrzej Springstein, Matthias
Revenko, Artem Steinmetz, Nadine
Ringsquandl, Martin Sun, Chang
Rizzo, Giuseppe van Soest, Johan
Rula, Anisa Weller, Tobias
Sadeghi, Afshin Westphal, Patrick
Saha Roy, Rishiraj Wilke, Adrian
Savkovic, Ognjen Zayed, Omnia
Schlötterer, Jörg Zhou, Lu
Schmelzeisen, Lukas Zhou, Qianru
Skjæveland, Martin G. Zhu, Rui
Sousa, Diana Zucco, Chiara
Organization XXI

Sponsoring Institutions
Gold Sponsors

Silver Sponsors

Bronze Sponsors
Abstracts of Keynotes
How to Make, Grow, and Sell a Semantic Web
Start-Up

Milan Stankovic

Sépage, Paris, France

Abstract. The Semantic Web broke down the boundaries of academia years
ago, and started being used to solve more and more industry challenges.
However, inducing the change toward embracing Semantic Web technologies in
many domains remains a mystery. How to sell Semantic Web solutions, in
competition with aggressively marketed legacy technologies? How to get old
and stagnant industry players to invest in a Semantic Web future? What are the
key advantages that make clients buy Semantic Web solutions? In his talk,
Milan will address these questions from the perspective of his own experience in
creating and growing a Semantic Web start-up in the travel market. He will
share recipes for success and discuss the remaining potential for the Semantic
Web to yet greatly impact the travel industry.
Knowledge Representation and the Semantic
Web – An Ontologician’s View

Sebastian Rudolph

TU Dresden, Germany

Abstract. With the rise of the Semantic Web and in the course of the stan-
dardization of ontology languages, logic-based knowledge representation
(KR) has received great attention from academics and practitioners alike. This
talk will present a – necessarily subjective – view on the role of KR in the
context of the Semantic Web. It will make a case for rigid logical underpinnings
with principled analyses of expressivity and computational properties (like
decidability or complexity) of KR formalisms, but also discuss the challenges
that the KR community has to address in order to ensure the ongoing uptake of
modern KR technology by the wider Semantic Web public and IT business in
general.
Structural Summarization of Semantic Graphs

Ioana Manolescu

Inria, Paris-Saclay, Paris, France

Abstract. RDF graphs comprise highly complex data, both from a structural and
from a semantic perspective. This makes them hard to discover and learn, and
hinders their usability.
An elegant basis for summarizing graphs is provided by the graph quotient
formalism. In a nutshell, a graph quotient specifies a way to view some graph
nodes as equivalent to each other, and represents a graph through its equivalence
classes based on this equivalence. This talk presents work carried out over the
past few years on quotient summarization of semantic-rich RDF graph. In
particular, it introduces a set of summaries particularly suited to the heteroge-
neous structure of RDF graphs, and discusses novel results at the interplay of
summarization and saturation with RDF schema rules.
Contents

Analyzing the Evolution of Vocabulary Terms and Their Impact


on the LOD Cloud . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
Mohammad Abdel-Qader, Ansgar Scherp, and Iacopo Vagliano

GSP (Geo-Semantic-Parsing): Geoparsing and Geotagging with Machine


Learning on Top of Linked Data. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
Marco Avvenuti, Stefano Cresci, Leonardo Nizzoli,
and Maurizio Tesconi

LDP-DL: A Language to Define the Design of Linked Data Platforms . . . . . 33


Noorani Bakerally, Antoine Zimmermann, and Olivier Boissier

Empirical Analysis of Ranking Models for an Adaptable Dataset Search . . . . 50


Angelo B. Neves, Rodrigo G. G. de Oliveira, Luiz André P. Paes Leme,
Giseli Rabello Lopes, Bernardo P. Nunes, and Marco A. Casanova

sameAs.cc: The Closure of 500M owl:sameAs Statements . . . . . . . . . . . . 65


Wouter Beek, Joe Raad, Jan Wielemaker, and Frank van Harmelen

Modeling and Preserving Greek Government Decisions Using Semantic


Web Technologies and Permissionless Blockchains . . . . . . . . . . . . . . . . . . . 81
Themis Beris and Manolis Koubarakis

Towards a Binary Object Notation for RDF . . . . . . . . . . . . . . . . . . . . . . . . 97


Victor Charpenay, Sebastian Käbisch, and Harald Kosch

User-Centric Ontology Population . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 112


Kenneth Clarkson, Anna Lisa Gentile, Daniel Gruhl, Petar Ristoski,
Joseph Terdiman, and Steve Welch

Using Ontology-Based Data Summarization to Develop Semantics-Aware


Recommender Systems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 128
Tommaso Di Noia, Corrado Magarelli, Andrea Maurino,
Matteo Palmonari, and Anisa Rula

PageRank and Generic Entity Summarization for RDF Knowledge Bases. . . . 145
Dennis Diefenbach and Andreas Thalhammer

Answering Multiple-Choice Questions in Geographical Gaokao


with a Concept Graph . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 161
Jiwei Ding, Yuan Wang, Wei Hu, Linfeng Shi, and Yuzhong Qu
XXX Contents

TweetsKB: A Public and Large-Scale RDF Corpus of Annotated Tweets . . . . 177


Pavlos Fafalios, Vasileios Iosifidis, Eirini Ntoutsi, and Stefan Dietze

HDTQ: Managing RDF Datasets in Compressed Space . . . . . . . . . . . . . . . . 191


Javier D. Fernández, Miguel A. Martínez-Prieto, Axel Polleres,
and Julian Reindorf

Answers Partitioning and Lazy Joins for Efficient Query Relaxation


and Application to Similarity Search . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 209
Sébastien Ferré

Evaluation of Schema.org for Aggregation of Cultural Heritage Metadata . . . 225


Nuno Freire, Valentine Charles, and Antoine Isaac

Dynamic Planning for Link Discovery . . . . . . . . . . . . . . . . . . . . . . . . . . . . 240


Kleanthi Georgala, Daniel Obraczka, and Axel-Cyrille Ngonga Ngomo

A Dataset for Web-Scale Knowledge Base Population . . . . . . . . . . . . . . . . . 256


Michael Glass and Alfio Gliozzo

EventKG: A Multilingual Event-Centric Temporal Knowledge Graph . . . . . . 272


Simon Gottschalk and Elena Demidova

Semantic Concept Discovery over Event Databases . . . . . . . . . . . . . . . . . . . 288


Oktie Hassanzadeh, Shari Trewin, and Alfio Gliozzo

Smart Papers: Dynamic Publications on the Blockchain . . . . . . . . . . . . . . . . 304


Michał R. Hoffman, Luis-Daniel Ibáñez, Huw Fryer, and Elena Simperl

Mind the (Language) Gap: Generation of Multilingual Wikipedia


Summaries from Wikidata for ArticlePlaceholders . . . . . . . . . . . . . . . . . . . . 319
Lucie-Aimée Kaffee, Hady Elsahar, Pavlos Vougiouklis,
Christophe Gravier, Frédérique Laforest, Jonathon Hare,
and Elena Simperl

What Does an Ontology Engineering Community Look Like? A Systematic


Analysis of the schema.org Community . . . . . . . . . . . . . . . . . . . . . . . . . . . 335
Samantha Kanza, Alex Stolz, Martin Hepp, and Elena Simperl

FERASAT: A Serendipity-Fostering Faceted Browser for Linked Data . . . . . 351


Ali Khalili, Peter van den Besselaar, and Klaas Andries de Graaf

Classifying Crises-Information Relevancy with Semantics . . . . . . . . . . . . . . 367


Prashant Khare, Grégoire Burel, and Harith Alani

Efficient Temporal Reasoning on Streams of Events with DOTR . . . . . . . . . 384


Alessandro Margara, Gianpaolo Cugola, Dario Collavini,
and Daniele Dell’Aglio
Contents XXXI

Intelligent Clients for Replicated Triple Pattern Fragments . . . . . . . . . . . . . . 400


Thomas Minier, Hala Skaf-Molli, Pascal Molli, and Maria-Esther Vidal

Knowledge Guided Attention and Inference for Describing Images


Containing Unseen Objects . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 415
Aditya Mogadala, Umanga Bista, Lexing Xie, and Achim Rettinger

Benchmarking of a Novel POS Tagging Based Semantic Similarity


Approach for Job Description Similarity Computation . . . . . . . . . . . . . . . . . 430
Joydeep Mondal, Sarthak Ahuja, Kushal Mukherjee,
Sudhanshu Shekhar Singh, and Gyana Parija

A Tri-Partite Neural Document Language Model for Semantic


Information Retrieval. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 445
Gia-Hung Nguyen, Lynda Tamine, Laure Soulier, and Nathalie Souf

Multiple Models for Recommending Temporal Aspects of Entities . . . . . . . . 462


Tu Ngoc Nguyen, Nattiya Kanhabua, and Wolfgang Nejdl

GDPRtEXT - GDPR as a Linked Data Resource. . . . . . . . . . . . . . . . . . . . . 481


Harshvardhan J. Pandit, Kaniz Fatema, Declan O’Sullivan,
and Dave Lewis

Transfer Learning for Item Recommendations and Knowledge Graph


Completion in Item Related Domains via a Co-Factorization Model . . . . . . . 496
Guangyuan Piao and John G. Breslin

Modeling and Summarizing News Events Using Semantic Triples. . . . . . . . . 512


Radityo Eko Prasojo, Mouna Kacimi, and Werner Nutt

GNIS-LD: Serving and Visualizing the Geographic Names Information


System Gazetteer as Linked Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 528
Blake Regalia, Krzysztof Janowicz, Gengchen Mai, Dalia Varanka,
and E. Lynn Usery

Event-Enhanced Learning for KG Completion . . . . . . . . . . . . . . . . . . . . . . 541


Martin Ringsquandl, Evgeny Kharlamov, Daria Stepanova,
Marcel Hildebrandt, Steffen Lamparter, Raffaello Lepratti,
Ian Horrocks, and Peer Kröger

Exploring Enterprise Knowledge Graphs: A Use Case


in Software Engineering. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 560
Marta Sabou, Fajar J. Ekaputra, Tudor Ionescu, Juergen Musil,
Daniel Schall, Kevin Haller, Armin Friedl, and Stefan Biffl

Using Link Features for Entity Clustering in Knowledge Graphs. . . . . . . . . . 576


Alieh Saeedi, Eric Peukert, and Erhard Rahm
XXXII Contents

Modeling Relational Data with Graph Convolutional Networks. . . . . . . . . . . 593


Michael Schlichtkrull, Thomas N. Kipf,
Peter Bloem, Rianne van den Berg, Ivan Titov, and Max Welling

Ontology-Driven Sentiment Analysis of Product and Service Aspects . . . . . . 608


Kim Schouten and Flavius Frasincar

Frankenstein: A Platform Enabling Reuse of Question


Answering Components . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 624
Kuldeep Singh, Andreas Both, Arun Sethupat, and Saeedeh Shekarpour

Querying APIs with SPARQL: Language and Worst-Case


Optimal Algorithms. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 639
Matthieu Mosser, Fernando Pieressa, Juan Reutter, Adrián Soto,
and Domagoj Vrgoč

Task-Oriented Complex Ontology Alignment: Two Alignment


Evaluation Sets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 655
Élodie Thiéblin, Ollivier Haemmerlé, Nathalie Hernandez,
and Cassia Trojahn

Where is My URI? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 671


Andre Valdestilhas, Tommaso Soru, Markus Nentwig, Edgard Marx,
Muhammad Saleem, and Axel-Cyrille Ngonga Ngomo

Optimizing Semantic Reasoning on Memory-Constrained Platforms


Using the RETE Algorithm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 682
William Van Woensel and Syed Sibte Raza Abidi

Efficient Ontology-Based Data Integration with Canonical IRIs. . . . . . . . . . . 697


Guohui Xiao, Dag Hovland, Dimitris Bilidas, Martin Rezk,
Martin Giese, and Diego Calvanese

Formal Query Generation for Question Answering over Knowledge Bases . . . 714
Hamid Zafar, Giulio Napolitano, and Jens Lehmann

A LOD Backend Infrastructure for Scientific Search Portals . . . . . . . . . . . . . 729


Benjamin Zapilko, Katarina Boland, and Dagmar Kern

Detecting Hate Speech on Twitter Using a Convolution-GRU Based Deep


Neural Network . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 745
Ziqi Zhang, David Robinson, and Jonathan Tepper

Author Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 761


Analyzing the Evolution of Vocabulary
Terms and Their Impact on the LOD
Cloud

Mohammad Abdel-Qader1,2(B) , Ansgar Scherp1,2 , and Iacopo Vagliano2


1
Christian-Albrechts University, Kiel, Germany
{stu120798,asc}@informatik.uni-kiel.de
2
ZBW – Leibniz Information Centre for Economics, Kiel, Germany
{m.abdel-qader,a.scherp,I.Vagliano}@zbw.eu

Abstract. Vocabularies are used for modeling data in Knowledge


Graphs (KGs) like the Linked Open Data Cloud and Wikidata. Dur-
ing their lifetime, vocabularies are subject to changes. New terms are
coined, while existing terms are modified or deprecated. We first quan-
tify the amount and frequency of changes in vocabularies. Subsequently,
we investigate to which extend and when the changes are adopted in
the evolution of KGs. We conduct our experiments on three large-scale
KGs: the Billion Triples Challenge datasets, the Dynamic Linked Data
Observatory dataset, and Wikidata. Our results show that the change
frequency of terms is rather low, but can have high impact due to the
large amount of distributed graph data on the web. Furthermore, not
all coined terms are used and most of the deprecated terms are still
used by data publishers. The adoption time of terms coming from dif-
ferent vocabularies ranges from very fast (few days) to very slow (few
years). Surprisingly, we could observe some adoptions before the vocabu-
lary changes were published. Understanding the evolution of vocabulary
terms is important to avoid wrong assumptions about the modeling sta-
tus of data published on the web, which may result in difficulties when
querying the data from distributed sources.

1 Introduction

Vocabulary terms define the schema of Knowledge Graphs (KGs) such as the
Linked Open Data (LOD) cloud or Wikidata. After ontology engineers publish
the first version of a vocabulary, the terms are subject to changes to reflect
new requirements or shifts in the domains the vocabularies model. So far it is
unknown how such vocabulary changes are reflected by the KGs that are using
their terms. Data publishers may not be aware that changes in the vocabulary
terms happened since it occurs rather rarely [7]. Explicitly triggering data pub-
lishers to update their model is also challenging due to the distributed nature of
KGs such as the LOD cloud. Data publishers may be interested in being notified
when certain vocabulary term changes happen, but they lack proper tools and
c Springer International Publishing AG, part of Springer Nature 2018
A. Gangemi et al. (Eds.): ESWC 2018, LNCS 10843, pp. 1–16, 2018.
https://doi.org/10.1007/978-3-319-93417-4_1
2 M. Abdel-Qader et al.

services to track whether and what kind of changes on vocabulary terms hap-
pened. Likewise, ontology engineers are not aware of who uses their vocabularies
and lack a tool that reflects the adoption status of their ontologies and changes
on the defined terms. In this paper, we study the evolution of vocabulary terms
in KGs. We address three research questions: (1) When are the newly created
terms adopted in KGs? (2) What is the use rate of terms for a set of vocabularies
in each dataset? (3) Are the deprecated terms still used in KGs?
To address these questions, we analyzed various vocabularies to better under-
stand by whom and how they are used, and how these changes are adopted in
evolving KGs. Formally, we understand a vocabulary V as a set of terms T . A
term T is either a classC or a property P . A set of terms relates to a vocab-
ulary as T (V ) = C(V ) P (V ). Changes in a vocabulary V are changes on its
terms, i.e., classes and properties. Data that uses terms of a changed vocabulary
should be updated accordingly. In a previous work [2], we manually conducted a
qualitative analysis of vocabulary evolution on the LOD cloud. We analyzed the
changes for a set of vocabularies by clarifying which terms changed, the type of
change, and if those changes were done on terms defined in the vocabularies or
on the classes and properties that were imported from other vocabularies. In this
paper, we consider the two basic types of changes: addition and deletion. Any
other change, e.g., a modification, can be expressed by these two basic changes.
We use three well-known dataset: Dynamic Linked Data Observatory (DyLDO)
[8], Billion Triples Challenge (BTC)1 , and Wikidata2 .
Our experiments showed that even if the frequency of vocabularies terms
changes is rather low, they have a large impact on the real data. Most of the
newly coined terms are adopted in less than one week after their publishing date.
Our work may help ontology engineers to select classes and properties that
fit their needs when creating or updating ontologies. For example, we believe
it can make ontology engineers more aware of who uses their terms and how.
Furthermore, it may foster a better understanding what is the impact of their
changes and how they are adopted, to possibly learn from previous experience
what change is effective and what not. This study may also assists data publishers
in updating their models by providing information about vocabulary changes.
The remainder is structured as follows. In Sect. 2, we review related work.
We present our methodology in Sect. 3, and describe the datasets considered in
Sect. 4. We outline our results in Sect. 5, we discuss them in Sect. 6 and conclude
in Sect. 7.

2 Related Work

In terms of analyzing the use of structured data on the web, some works focused
on schema.org. Meusel et al. [9] analyzed its evolution and adoption. They made
a comparison of the use of schema.org terms over four years by extracting the

1
http://challenge.semanticweb.org/, last accessed: 29/11/2017.
2
https://www.wikidata.org, last accessed: 29/11/2017.
Analyzing the Evolution of Vocabulary Terms 3

structured data from the web pages that use this vocabulary from WebDataCom-
mons Microdata datasets3 . They discovered that not all terms in schema.org are
used and deprecated terms are still used, as it is also illustrated in this work.
Furthermore, they found that publishing new types and properties is preferred
over using schema.org’s extension mechanism. Guha et al. [6] investigated the
use of the schema.org in the structured data of a set of web pages. They analyzed
a sample of 10 billion web pages crawled from Google index and WebDataCom-
mons and found that about 31% of those pages had some schema.org elements
and estimated that around 12 million websites are using schema.org terms. In
contrast to this work, they did not consider the changes in vocabulary terms.
Additionally, we are not limited to one vocabulary only.
Mihindukulasooriya et al. [10] conducted a quantitative analysis for studying
the evolution of DBpedia, schema.org, PROV-O, and FOAF ontologies. They
proposed recommendations such as the need of dividing large ontologies into
modules to avoid duplicates when adding new terms and adding provenance
information beside the generic metadata when the change occurred. Papavas-
siliou et al. [11] proposed a framework that automatically identifies changes for
both schema and data. They provide a formal language for identifying ontology
changes and a change detection algorithm. Roussakis et al. [12] introduced a
framework for analyzing the evolution of LOD datasets. Their framework allows
users to identify changes in datasets versions and make a complex analysis on
the evolved data.
Other works exploited DyLDO to study the use of vocabularies. Dividino
et al. [4] analyzed how the use of RDF terms on the LOD cloud changed over
time. They studied the combination of terms that describe a resource but did
not investigate whether a vocabulary and its terms have changed. The authors
applied their analysis on a dataset of 53 weekly snapshots from the DyLDO
dataset, as it is also investigated in this work. Over six months, Käfer et al.
[7] observed the documents retrieved from DyLDO. They analyzed those doc-
uments using different factors, their lifespan, the availability of those docu-
ments and their change rate. Also, they analyzed the RDF content that is fre-
quently changed (triple added or removed). Additionally, they observed how
links between documents are evolved over time. While their study is important
for various areas such as smart caching, link maintenance, and versioning, it does
not include information about adopting new and deprecated terms.
Gottron et al. [5] provided an analysis of the LOD schema information by
analyzing the BTC 2012 dataset in three different levels. The first level concerns
unique subject URIs by studying the dependency relations between the classes
and their properties. They found a redundancy between classes and the attached
properties. The second level addresses Pay-Level Domains (PLDs) by dividing
the BTC 2012 dataset into individual PLDs. They found that 20% of the PLDs
can be ignored without losing the graph explanation. The third level focuses on
the vocabularies by analyzing how important a vocabulary is for describing the

3
http://webdatacommons.org/, last accessed: 10/10/2017.
4 M. Abdel-Qader et al.

data. They stated that data publishers either made a strong schematic design,
or apply a combination between a set of vocabularies to model their data.
Finally, some studies analyzed the use of vocabularies with other sources.
Vandenbussche et al. [15] published a report that describes Linked Open Vocab-
ularies (LOV). It provides statistics about LOV and its capabilities such as the
total number of terms and the top-10 searched terms, but does not include
information about adopting new terms and which PLD uses which vocabulary.
Chawuthai et al. [3] proposed a model that facilitates the understanding of organ-
isms. Their model presents the changes in taxonomic knowledge in RDF form.
The proposed model acts as a history tracking system for changing terms but
gives no information about how and when the terms are used, and which PLDs
adopted the changed terms. Schaible et al. [13] published a survey of the most
preferred strategies for reusing vocabulary terms. The participants, 79 Linked
Data experts and practitioners, were asked to rank several LOD modeling strate-
gies. The survey concluded that terms widely used are considered as a better
approach. Furthermore, the use rate of vocabularies is a more important argu-
ment for reuse than the frequency of a single vocabulary term. Their survey can
help to understand why there are some terms frequently used and why some of
them are not used at all.

3 Analysis Method

Our analysis method consists of two steps. First, we determined vocabularies that
have more than one published version on the web. Second, we investigated how
the changed terms of vocabularies are adopted and used in the evolving KGs. For
the first step, we relied on Schmachtenberg et al. [14] who published a report with
detailed statistics about a large-scale snapshot of the LOD cloud. The snapshot
comprises seed URIs from the datahub.io dataset4 , the BTC 2012 dataset5 , and
the public-lod@w3.org mailing list6 . We selected a set of vocabularies that satisfy
the following set of conditions and characteristics. (1) The vocabulary have at
least two versions published on the web to make a comparison between them. (2)
These two versions are covered by the dataset that we investigate. For example,
for the DyLDO dataset, there is to be one version of the vocabularies that have
been published after May 6th, 2012. This is needed since at this date the first
snapshot of the DyLDO dataset has been crawled. (3) The vocabulary terms are
directly used for modeling some data, i.e., the vocabulary terms occur in at least
one triple in the published dataset. In contrast, vocabularies could also be just
linked from a data publisher, where changes of external vocabularies may not
have any impact on the published data.
On the basis of these criteria, we examined 134 of the most used vocabularies
listed in the state of the LOD cloud 2014 report by Schmachtenberg et al. [14]. We

4
http://datahub.io/group/lodcloud, last accessed: 10/10/2017.
5
http://km.aifb.kit.edu/projects/btc-2012/, last accessed: 10/10/2017.
6
http://lists.w3.org/Archives/Public/public-lod/, last accessed: 10/10/2017.
Analyzing the Evolution of Vocabulary Terms 5

found 18 vocabularies that have more than one version. From them, 13 vocabu-
laries have changes (additions or deprecations) on terms created by the ontology
engineers of those vocabularies in the timeframe of the considered datasets. We
downloaded the different versions using the Linked Open Vocabularies (LOV)
observatory7 . Due to the low number of changes in the vocabulary, we did not
use data mining techniques. Instead, we exploited the PromptDiff Protégé 4.3.08
plugin to identify the vocabulary changes. This plugin identifies simple as well
as complex changes, and shows the difference between two versions. The vocab-
ularies selected are listed in Table 1, which also provides the number of versions
considered for each vocabulary and the total number of changes (additions and
deletions) occurred. Considering all the vocabularies and all their versions the
total number of terms studied is 936.

Table 1. Overview of the vocabularies and their changes.

Vocabulary Versions Changes


Asset Description Metadata Schema (ADMS)a 2 18
Citation Typing Ontology (CiTO)b 3 218
The data cube vocabulary (Cube)c 2 6
Data Catalog Vocabulary (DCAT)d 2 13
A vocabulary for jobs (emp)e 2 1
Ontology for geometry (geom)f 2 2
The Geonames ontology (GN)g 7 31
The music ontology (mo)h 2 46
Open Annotation Data Model (oa)i 2 31
Core organization ontology (org)j 2 8
W3C PROVenance Interchange (Prov)k 5 168
Vocabulary of a Friend (voaf)l 4 8
An extension of SKOS for representation of nomenclatures (xkos)m 2 1
a https://www.w3.org/TR/vocab-adms/, last accessed: 10/11/2017
b https://sparontologies.github.io/cito/current/cito.html, last accessed: 10/11/2017
c http://www.w3.org/TR/vocab-data-cube/, last accessed: 10/11/2017
d https://www.w3.org/TR/vocab-dcat/, last accessed: 10/11/2017
e http://lov.okfn.org/dataset/lov/vocabs/emp, last accessed: 10/11/2017
f http://data.ign.fr/def/geometrie/20160628.htm, last accessed: 10/11/2017
g http://www.geonames.org/ontology/documentation.html, last accessed: 10/11/2017
h http://www.geonames.org/ontology/documentation.html, last accessed: 10/11/2017
i http://www.openannotation.org/spec/core/, last accessed: 10/11/2017
j https://www.w3.org/TR/vocab-org/, last accessed: 10/11/2017
k https://www.w3.org/TR/prov-o/, last accessed: 10/11/2017
l http://lov.okfn.org/vocommons/voaf/v2.3/, last accessed: 10/11/2017
m http://rdf-vocabulary.ddialliance.org/xkos.html, last accessed: 10/11/2017

7
http://lov.okfn.org/dataset/lov, last accessed: 10/10/2017.
8
http://protege.stanford.edu, last accessed: 10/10/2017.
6 M. Abdel-Qader et al.

Subsequently, we investigated how the vocabulary terms changed are used in


the evolving KGs. We extracted all PLDs from the subject of the crawled triples
that use any of the terms from the 13 vocabularies above. Specifically, we relied
on the Guava library9 , which returns the PLD from any given URL. Besides the
date of the first appearance of a vocabulary term, we also recorded the number
of triples which contain that term. This information is then used to compute the
adoption time of terms over the dataset snapshots.

4 Datasets

We applied our analysis approach on three large-scale KGs. The first two are
DyLDO and BTC, which are obtained from the Linked Open Data cloud, and
the third is Wikidata. We analyzed the use, changes and adoption of vocabular-
ies within each individual dataset. We did not compare any results across the
datasets because the results cannot be meaningfully compared. Below, we briefly
characterize the datasets.
DyLDO is a repository to store weekly snapshots from a subset of web data
documents [8]. For our study, we parse 242 snapshots (from May 2012 until
March 2017). BTC (See footnote 1) is yearly crawled from the LOD cloud from
2009 to 2012, as well as in 2014. We used all available versions to analyze the
adoption of the extracted vocabularies in our study. Wikidata (See footnote
2) is a knowledge base to collaboratively store and edit structured data. To
analyze the Wikidata vocabulary, we first extracted the terms introduced by
this vocabulary. Specifically, through the RDF Exports from Wikidata page10 ,
we parsed the terms and properties from the RDF dump files that were generated
using the Wikidata toolkit11 . We assumed that the first snapshot of those files
is the first version of the Wikidata vocabulary, and based on this assumption
we parsed the next dump files to extract the changes to the first version, and
so on. Relying on the 25 RDF dump files (from April 2014 until August 2016),
we extracted the terms that are added or deprecated. Subsequently, we parsed
those files to extract the adoption of terms to analyze the adoption behavior for
the Wikidata vocabulary’s terms.

5 Results

In this section, we summarize our findings based on the experiments conducted.


Section 5.1 presents the results of vocabulary changes, use, and adoption in the
LOD Cloud, while Sect. 5.2 outline the same findings for Wikidata.

9
https://github.com/google/guava/, release 23.1.
10
https://tools.wmflabs.org/wikidata-exports/rdf/, last accessed: 29/11/2017.
11
https://github.com/Wikidata/Wikidata-Toolkit, last accessed: 29/11/2017.
Analyzing the Evolution of Vocabulary Terms 7

5.1 The LOD Cloud


Changes in LOD Vocabularies. We studied the changes of terms in the
vocabularies, focusing on creation and deprecation. Overall we observed 35% of
newly created terms and 11% of deprecated ones. 85% of the vocabularies in
this study have an increased number of terms. Two exceptions are ADMS and
CiTO: the number of terms decreased for the former, while the latter vastly
dropped in the number of classes.
During our analysis, we noticed that some of the deprecated properties were
reintroduced later. These reintroduced terms belongs to the CiTO and GN
vocabularies. The former deprecated 18 properties in May 2014 (introduced in
March 2010), which reappeared in the version that was published in March 2015,
i.e. after around ten months. The latter reintroduced three deprecated prop-
erties: alternateName (creation: October 2006, deprecation: September 2010,
recreation: February 2012), name (creation: October 2006, deprecation: Septem-
ber 2010, recreation: October 2010), and shortName (creation: September 2010,
deprecation: May 2010, recreation: February 2012). GN reintroduced two out of
three deprecated terms after about 17 months and one shortly after (13 days).

Use of LOD Vocabularies. We analyzed the use of the selected vocabularies


by considering triples which contains one of their terms in the predicate and/or
the object position and a PLD in the subject. Geonames.org is the PLD that
uses most terms of the selected vocabularies in the BTC 2009 and 2010 datasets
(Table 2). In BTC 2011 and 2012, zitgist.com and rdfize.com are the most fre-
quent PLDs, while in BTC 2014 and DyLDO, dbtune.org accounts for the highest
use. However, the number of triples in BTC 2009, 2011, and 2012 is significantly
lower than for the other datasets. The PLDs with the highest use of certain
terms vary over time. For example, geonames.org’s triples did not disappear in
BTC 2011. It still accounts for around 500,000 triples, i.e. much less than BTC
2009 and BTC 2010, but Table 2 only lists the PLD that has the highest amount
of triples for each dataset (zitgist.com for BTC 2011). Please note that there are
different crawling strategies for each BTC dataset and this may contribute to
the variations in the number of triples.

Table 2. PLDs with the highest use of terms from the selected vocabularies for each
of the datasets.

Dataset PLD Triples


BTC 2009 geonames.org 81M
BTC 2010 geonames.org 7M
BTC 2011 zitgist.com 2.6M
BTC 2012 rdfize.com 3.8M
BTC 2014 dbtune.org 81.5M
DyLDO dbtune.org 160M
8 M. Abdel-Qader et al.

In DyLDO, the use of most vocabularies is steady. Figure 1 shows the vocab-
ularies with a varying use. Notably, mo shows increasing and declining inter-
vals, Prov is increasing in popularity despite some slight negative picks, while
ADMS had a significant drop in 2015 after an initial increasing use, although
it seems again slightly increasing from 2015 to 2017. Furthermore, Cube had a
pick towards the end of 2015 to then come back to its initial use rate, while emp
seems no more used from 2015.

Fig. 1. The mean number of triples that use terms for a subset of the vocabularies
considered by DyLDO snapshots aggregated in quarters.

The great majority of the deprecated terms (87%) are still used after dep-
recation. We found that geonames.org is the PLD that most frequently uses
deprecated terms in the BTC and DyLDO datasets. For instance, Fig. 2 shows
the use of the gn:Country class in DyLDO, which was deprecated in Septem-
ber 2010. Despite various fluctuations, its use increased until August 2015, then
declined and increased again to reach a peak in August 2016.

Fig. 2. The use of gn:Country class in the DyLDO dataset.


Analyzing the Evolution of Vocabulary Terms 9

Adoption of LOD Vocabulary Changes. The majority of the newly created


terms were adopted in less than 10 days, as showed in Table 3. The triples column
represents the total number of triples in DyLDO that contains the adopted terms,
while μ and σ are the average number of days before adoption and the standard
deviation, respectively. Additionally, adopting geom and GN terms took long
time.

Table 3. The adoption of newly created terms for each of the vocabularies.

Vocabulary New terms Adopted terms Triples μ σ


ADMS 6 100% 31K 7 0
CiTO 80 100% 281K 7 0
Cube 5 100% 15K 7 0
DCAT 5 100% 104K 8.4 3.13
emp 1 100% 4K 7 0
geom 2 100% 16K 420 0
GN 21 100% 160M 127.76 255.33
mo 44 100% 45M 8.75 9.68
oa 21 0% - - -
org 8 100% 173K 7 0
Prov 106 85% 121M 30.15 37.49
voaf 10 100% 75K 43.33 68.58
xkos 1 0% - - -

After being adopted, 50% of the newly created terms decreased in use during
the considered period, 47% showed a steady use, while 3% increased. For exam-
ple, during its evolution, the voaf vocabulary created 10 new terms. All but one
of those have a decline in the use. Figure 3 shows only six terms as the remaining
are exploited in much fewer triples. In general, a similar trend holds for all the
vocabularies. More details about the adoption time of other vocabularies are
available in an extended technical report [1].
Not all terms are adopted. For example, the percentage of adoption for half
of the vocabularies is less than 50% of terms in the BTC dataset (in total, 50%
of all terms were not used). While in DyLDO, the percentage of unused terms of
all vocabularies was 23%, and only one vocabulary (CiTO) adopted 60% of the
terms, while the remaining vocabularies less than 40% (Table 4). Notably, the
21 new terms of the oa vocabulary and the only xkos term are never adopted.

5.2 Wikidata
After parsing the terms and properties from the RDF dump files for the period
from April 2014 until August 2016, we have extracted the added and deprecated
10 M. Abdel-Qader et al.

terms of the Wikidata vocabulary. Figure 4 presents the total number of classes
and properties in each Wikidata snapshot, which grows to reach 11 classes and 27
properties in August 2017. Ontology engineers added 3 classes and 9 properties
during the analyzed period. Notably, there are no terms that are deprecated
during the ontology evolution.

Fig. 3. The use (amount of triples in which a term occurs) of the voaf ’s newly cre-
ated terms by quarters of DyLDO snapshots. The vertical dashed lines represent the
publishing time of new versions of the vocabulary. Please note that two versions
of voaf have been published before the first snapshot of DyLDO (i.e. dataset and
hasDisjunctionsWith are newly created in versions released before the second quarter
of 2012).

Fig. 4. Total number of terms of the Wikidata vocabulary per RDF dump file.

Figure 5 illustrates the use of newly created terms in Wikidata. Only 5 out
of 12 terms are adopted. NormalRank and rank are much more used than the
other new terms. Furthermore, the actually adopted terms among all the newly
Analyzing the Evolution of Vocabulary Terms 11

Table 4. The percentage of unused terms in the BTC and DyLDO datasets.

Vocabulary Total terms BTC DyLDO


ADMS 31 68% 3%
CiTO 220 72% 60%
Cube 37 35% 0%
DCAT 23 48% 9%
emp 31 87% 6%
geom 34 100% 3%
GN 43 26% 9%
mo 208 36% 2%
oa 63 83% 35%
org 44 20% 11%
Prov 143 22% 24%
voaf 24 33% 8%
xkos 35 63% 14%

Fig. 5. The amount of triples that the adopted newly created classes and properties of
the Wikidata vocabulary after parsing Wikidata RDF dump files.

created ones are adopted directly after their creation date (i.e., on the same
day). One possible reason is that Wikidata is a more controlled and centralized
environment than a distributed KG, such as the LOD cloud, as discussed in
Sect. 6.2.

6 Discussion
We found that not all vocabulary changes are reflected in the data in KGs, and
there is a need for a service to track vocabulary changes. Such service helps
ontology engineers and data publishers in updating their ontologies and models.
In Sect. 6.1, we discuss the results related to the LOD Cloud, and in Sect. 6.2 we
discuss the results of changes and adoption of the Wikidata terms.
12 M. Abdel-Qader et al.

6.1 The LOD Cloud


Changes in LOD Vocabularies. The number of terms’ changes is small. This
is in line with existing studies [2,6,9]. However, those changes may have a large
impact on the data of KGs. For example, the new version of the oa vocabulary
caused a significant increased of its use: the triples containing its terms almost
triplicates (from roughly 400 hundred to over 1100). In general, the changes
impact on the use either in an increasing or decreasing way (6 and 5 out of
13 vocabularies, respectively), although with varying time. For DCAT there is
an increase so delayed in time (3 years) which is probably not due to the new
version. More details are available in our extended technical report [1].
Most of the vocabularies increased in the total number of terms. This suggests
that more knowledge is represented in the LOD cloud, requiring new terms.
One exception is CiTO, which consisted of 94 classes and 36 properties when
initially published. The second version counted only one class and 50 properties.
Specifically, all the 94 classes were replaced with the new class CitationAct and
most of the 36 properties of the first version were substituted. The third version
provided 91 properties, although 18 of the new properties were reintroduced
from the first version (deprecated in the second and reintroduced in the third).
In practice, almost a new ontology was built. This is particularly important since
CiTO has grown much in popularity (BTC 2014 contained over 300 thousand
triples compared to 40 thousand in BTC 2011).
New versions of vocabularies, together with the great variety of vocabularies
already existing, and the new vocabularies may overwhelm ontology engineers,
which need to choice among a vast amount of alternative terms when building
or updating their ontologies. Similar issues may occur to data publishers when
deciding which vocabularies to exploit in their datasets. Missing some changes
and consequently not updating an ontology or a dataset is likely (see the fol-
lowing discussion on the use of terms), notably in a distributed environment
as the LOD cloud. This holds particularly for deprecation. There is a lack of
tools to notify ontology engineers and data publishers when there are changes
in the vocabularies. Such tools may help ontology engineers to select classes and
properties that they want to use by knowing the latest updates of terms, and
help them in updating their vocabularies. They could also assist data publishers
in updating their models by providing a history of changes for the terms they
use. While these systems can ease the maintenance of vocabularies and datasets,
more advanced one could also recommend terms and vocabularies according to
the specific needs of their users. The insights provided in this study can be
beneficial to build such tools.

Use of LOD Vocabularies. Cross-domain (Prov, voaf, and ADMS ) and Geo-
graphic (Cube and GN ) vocabularies were the most popular among data publish-
ers. Some of them are exploited by few PLDs. For instance, w3.org widely used
ADMS terms at the beginning of the investigated time-frame, while later deri.de
accounted for the highest use of this vocabulary. On the other hand, some vocab-
ularies have been used by various PLDs. For example, Cube has been employed by
Analyzing the Evolution of Vocabulary Terms 13

ontologyCenter.com, esd.org.uk, linked-statistics.org, and linkedu.en. This may


suggest that some vocabularies are applicable in multiple domains, while others
are more application-specific, but it should be further investigated.
Overall, geonames.org and dbtune.org are the most frequent PLDs. In the
BTC 2009 and BTC 2010 datasets, geonames.org was the PLD that uses most
of the terms. This is caused by the wide use of the GN vocabulary in those years.
Later, dbtune.org accounted for the highest number of triples in the BTC 2014
and DyLDO snapshots from 2012 to 2014.
Although some terms are deprecated, 87% of them were still exploited. This is
in line with [9]. Geonames.org is the PLD with the highest number of deprecated
terms. For example, in the BTC 2011 dataset, geonames.org used six deprecated
terms in about 522 thousand triples. That number declined to three terms and
roughly 181 thousand triples in BTC 2012, but increased again to 49 terms in
BTC 2014 (5.5 thousand triples). It seems is that data publishers did not update
their data models. A possible reason of this is that they are not aware of changes
in the vocabularies exploited. Thus, as previously discussed, they could benefit
from tools to notify these changes.
In order to provide information about the status of a term, the Vocabu-
lary Status ontology12 can be used. This ontology consists of three properties:
vs:term status, vs:moreinfo, and vs:userdocs. Unfortunately, this ontology is not
widely used. Only 7 out of the 134 vocabularies investigated in our paper rely
on it.

Adoption of LOD Vocabulary Changes. Most of the newly coined terms


are adopted rather quickly (in less than one week). Surprisingly, we even found
some terms adopted before their official publishing date. We believe that some of
the new versions of vocabularies are already online and can be used before their
official announcement. In some cases, it may take time to finish the procedures to
publish the new version of the vocabulary. Thus, data publishers can access the
new terms before their formal release, simply because they are available online.
Although most of the terms have fast adoption time, some vocabularies, such
as GN, took more than 120 days, in average, to adopt new terms. However, this
average does not reflect the actual adoption behavior: the new version of GN
provides 21 new terms, 17 terms are adopted within 7 days, while the remaining
4 terms are adopted in over 600 days. Therefore, the average result was affected
by those few terms that have a vast adoption time.
Another interesting point is that some newly created terms are never adopted.
For example, ontology engineers published a new version of the oa vocabulary
in June 2016, with 21 new classes and properties. None of those terms have been
adopted (at least until April 2017, the last DyLDO snapshot considered), while
the first version of oa was published in February 2013 with 42 terms and all but
one were adopted in less than 3 months. However, the reasons why those terms
are unused likely depends on the specific application scenario. For instance,
12
https://www.w3.org/2003/06/sw-vocab-status/note.html, last accessed: 27/02/
2018.
14 M. Abdel-Qader et al.

not all terms need to be currently in use: some could be designed for future
applications. Furthermore, although some terms are not used in the LOD cloud,
they may be exploited in other forms. We do not mean that every term has to
be adopted: we aim to raise awareness to ontology engineers that there are some
of their terms that are never adopted. We also believe that raising awareness to
data publishers about the existence of other terms in an ontology in use may
further stimulate the reuse of ontologies.

6.2 Wikidata
We found that the Wikidata vocabulary showed no deprecated terms, although
some were never adopted during the investigated time-frame (e.g., the Article
class). Likewise most of the LOD vocabularies, the Wikidata vocabulary counts
a small number of additions (3 classes and 9 properties) and no deprecation.
Three classes (DeprecatedRank, NormalRank, and PreferredRank) suddenly
disappeared from Wikidata statements after the snapshot in December 2015,
after about 8 months (they were created in May 2015). There is a huge dif-
ference in the number of triples in which the terms occur. For instance, the
NormalRank and Statement classes have been used in about 106 and 81 million
triples, respectively. The other classes (except Item) are used in less than 2.4
million triples. The same observation can be made for the properties: all but
rank appeared in less than 2.7 million triples, while rank accounted for approx-
imately 62 million triples when introduced in May 2015, then reached about 106
million triples in August 2016. The wide exploitation of these terms suggests a
pressing necessity for adding them to the vocabulary.
Only 5 out of 12 of the newly created terms are adopted and their adoption
occurs directly after their creation date. This was expected in Wikidata which is
a more controlled and centralized environment than a distributed KG as the LOD
cloud. Surprisingly, the majority of new terms (2 classes and 9 properties) seems
not adopted in any statements of Wikidata. However a deeper analysis showed
that these are used to define properties and their types, except the Article
class, which needs further investigation.

7 Conclusion and Future Work


Even small changes of vocabulary terms can have a deep impact on the real
data that use those terms. Most of newly coined terms are adopted immedi-
ately afterwards, while 50%, and 23% of the terms studied are never adopted
in the BTC and DyLDO datasets, respectively. Unexpectedly, some deprecated
terms have been recreated after some time by their deprecation. Deprecation is
a critical operation, notably in a distributed KG as the LOD cloud. We are not
surprised that most of the deprecated terms are still used, because data pub-
lishers may not be aware of the changes to the exploited vocabularies. We think
that this study can help ontology engineers and data publishers in updating their
ontologies and datasets. Providing a service to notify changes on ontologies can
simplify the update of vocabulary and datasets, as well as foster the adoption of
Analyzing the Evolution of Vocabulary Terms 15

new terms. In order to reproduce our research and extend on them, we provide
all results and datasets to the public13 . As future work, we plan to study the
impact of vocabulary changes on the ontology network and provide a service
for tracking changes on vocabulary which incorporate the insights of this study.
Furthermore, we plan to investigate the impact of similarity measures on the
reuse of vocabulary terms on the LOD cloud.

Acknowledgment. This work was supported by the EU’s Horizon 2020 programme
under grant agreement H2020-693092 MOVING.

References
1. Abdel-Qader, M., Scherp, A.: Towards understanding the evolution of vocabulary
terms in knowledge graphs. ArXiv e-prints, September 2017. https://arxiv.org/
pdf/1710.00232.pdf
2. Abdel-Qader, M., Scherp, A.: Qualitative analysis of vocabulary evolution on the
linked open data cloud. In: PROFILES Workshop Co-located with ESWC, vol.
1598. CEUR-WS.org (2016)
3. Chawuthai, R., Takeda, H., Wuwongse, V., Jinbo, U.: Presenting and preserving
the change in taxonomic knowledge for linked data. Semant. Web 7(6), 589–616
(2016)
4. Dividino, R., Scherp, A., Gröner, G., Grotton, T.: Change-a-LOD: does the schema
on the linked data cloud change or not? In: Consuming Linked Data Workshop Co-
located with ISWC, vol. 1034. pp. 87–98. CEUR-WS.org (2013)
5. Gottron, T., Knauf, M., Scherp, A.: Analysis of schema structures in the linked
open data graph based on unique subject URIs, pay-level domains, and vocabulary
usage. Distrib. Parallel Databases 33(4), 515–553 (2015)
6. Guha, R.V., Brickley, D., Macbeth, S.: Schema.org: evolution of structured data
on the web. Commun. ACM 59(2), 44–51 (2016)
7. Käfer, T., Abdelrahman, A., Umbrich, J., O’Byrne, P., Hogan, A.: Observing linked
data dynamics. In: Cimiano, P., Corcho, O., Presutti, V., Hollink, L., Rudolph, S.
(eds.) ESWC 2013. LNCS, vol. 7882, pp. 213–227. Springer, Heidelberg (2013).
https://doi.org/10.1007/978-3-642-38288-8 15
8. Käfer, T., Umbrich, J., Hogan, A., Polleres, A.: Towards a dynamic linked data
observatory. In: LDOW Co-located with WWW (2012)
9. Meusel, R., Bizer, C., Paulheim, H.: A web-scale study of the adoption and evolu-
tion of the schema.org vocabulary over time. In: International Conference on Web
Intelligence, Mining and Semantics, p. 15. ACM (2015)
10. Mihindukulasooriya, N., Poveda-Villalón, M., Garcı́a-Castro, R., Gómez-Pérez, A.:
Collaborative ontology evolution and data quality - an empirical analysis. In: Drag-
oni, M., Poveda-Villalón, M., Jimenez-Ruiz, E. (eds.) OWLED/ORE -2016. LNCS,
vol. 10161, pp. 95–114. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-
54627-8 8
11. Papavassiliou, V., Flouris, G., Fundulaki, I., Kotzinos, D., Christophides, V.: On
detecting high-level changes in RDF/S KBs. In: Bernstein, A., Karger, D.R., Heath,
T., Feigenbaum, L., Maynard, D., Motta, E., Thirunarayan, K. (eds.) ISWC 2009.
LNCS, vol. 5823, pp. 473–488. Springer, Heidelberg (2009). https://doi.org/10.
1007/978-3-642-04930-9 30
13
https://figshare.com/s/d5487f88a2bdfab4c2ee.
16 M. Abdel-Qader et al.

12. Roussakis, Y., Chrysakis, I., Stefanidis, K., Flouris, G., Stavrakas, Y.: A flexible
framework for understanding the dynamics of evolving RDF datasets. In: Arenas,
M., et al. (eds.) ISWC 2015. LNCS, vol. 9366, pp. 495–512. Springer, Cham (2015).
https://doi.org/10.1007/978-3-319-25007-6 29
13. Schaible, J., Gottron, T., Scherp, A.: Survey on common strategies of vocabulary
reuse in linked open data modeling. In: Presutti, V., d’Amato, C., Gandon, F.,
d’Aquin, M., Staab, S., Tordai, A. (eds.) ESWC 2014. LNCS, vol. 8465, pp. 457–
472. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-07443-6 31
14. Schmachtenberg, M., Bizer, C., Paulheim, H.: Adoption of the linked data best
practices in different topical domains. In: Mika, P., et al. (eds.) ISWC 2014. LNCS,
vol. 8796, pp. 245–260. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-
11964-9 16
15. Vandenbussche, P.Y., Atemezing, G.A., Poveda-Villalón, M., Vatant, B.: Linked
open vocabularies (LOV): a gateway to reusable semantic vocabularies on the web.
Semant. Web 8(3), 437–452 (2017)
Another random document with
no related content on Scribd:
However, thanks to the doctor and to a Frenchman who remained
near us, things were better than they otherwise might have been.
We were waiting in great suspense to hear the result of the
negotiations. At last there came a man with the news that peace had
been declared on the 1st of June. It was wonderful. I had been so
anxious to hear if it would be peace, and now when the news came I
could not be joyful. I knew nothing for certain yet about the terms,
and I thought, “That is the reason why I do not feel happier, although
it is two years and six months now since we began this dreadful and
pitiless struggle from which we have so often longed to be
delivered.”
The children were very happy. The doctor and our Frenchman still
had their horses, and they rode off that same day. Other burghers
fired their guns into the air for joy. They did not know what sort of a
peace it was, but for the moment they could only rejoice.
I did not want to stay any longer where I was, but had still no oxen.
A short way off there was a man who had a span of Government
oxen. I sent my boy to this man to get them from him so that I should
be able to return to my own district.
He sent the oxen, and everything was soon ready for the start,
although, as I had always had two waggons with me before, it was
very difficult to get everything packed into one. Whatever I was not
able to load I left behind me. We had been in this place now for more
than a month, and the people were sorry to see us going away. But,
however hard it might be for me, on I went.
The waggon was heavy and the road very sandy, so that very
often the children had to get out and walk alongside the waggon.
The first place we came to belonged to Widow Lemmer. The poor old
woman was very unhappy, for that very week the “khakis” had taken
away her cows and everything else that she had remaining.
From there I went on again, but heard no talk of peace. I went past
the Zoutspannen to the place belonging to G. Stolz.
I stopped there that Sunday. In the afternoon it was peaceful, and
yet I felt so sorrowful. Saturday night I could not sleep, and that night
I said to my daughter Ada, “I cannot think why I feel like this about
the peace. If only it is not a surrender of arms, this peace that they
talk of! But no,” I said, “it cannot be anything like that, for then it
would not be peace.”
At ten o’clock next morning my children and I all met together for
worship.
I felt very much affected. There had been so many Sundays spent
in making war, and now to-day it was peace. Therefore I said to my
children, “We have been through so many hard and bitter days, and
the Almighty Father has brought us safely through our weary
pilgrimage. Let us now thank Him with all our hearts.” I felt that it was
only God’s goodness that had spared us from falling into the hands
of our adversary.
That afternoon I went to lie down for a little in my waggon. At four
o’clock in the afternoon Liebenberg arrived from Klerksdorp and
came to my waggon with the report of peace. And now I had to hear
that it was indeed a surrender of our arms.
I did not know how to pacify my children; they wept bitterly, and
could not find words for their indignation. And yet it was peace all the
same. I said to them, “Let us keep silence; later on we shall
understand it all.”
I stayed there till Monday morning. As Liebenberg had come to
take me to Klerksdorp, and as from there I should be able to go on to
Pretoria, I soon thought to myself, “What a joy it will be when I can
meet my children again, after having been separated from them for
nearly two years.”
But this peace was so distasteful that I could not get over the
thought of it.
When everything was packed we made ready to start. While I was
driving I took my day-book. The text for that morning was Gen. xxii.
7: “And Abraham said God will provide....”
Now we went on quickly. I met on that road none but sorrowing
women and children. I said sometimes, “Where can the poor
burghers be that we do not meet them?”
After having travelled for a couple of days we came to Mr D. van
der Merwe’s place. There I met several burghers. Van der Merwe
was a good and clever man and I was glad to be able to talk with
him. He told me that, however incomprehensible it might all seem,
he was sure that the officers after having struggled so long and so
bitterly would now also do their best.
As they had first gone to the Zwartruggens and Marico to see that
the arms were all given up, I had not seen any of them yet.
We were now in the Lichtenburg district. I waited at Mr van der
Merwe’s place. It was bitterly cold. It snowed for three days, and
during all my wanderings this was the worst cold I had experienced.
And there was no house to shelter us. There were plenty of buildings
there, but all were more or less in ruins. It was dreadful to see them.
Now came the time when the burghers in this neighbourhood also
had to give up their arms.
On the 12th of June the last gun had been given up in the
Lichtenburg district. That evening my people came for the first time
to my tent. I thought how bitter it was to meet them in this way. My
husband came to me and my son, little Coos. Little Coos cried,
“Mamma, I have still got my gun.”
It was very hard for him; he could stand the war better than the
peace. I did not want to speak about it with his father. The terrible
shedding of blood was at end. We had offered up our property and
our blood for Freedom and Justice.... Where was this freedom?
where was this justice?
Jesus was betrayed by Judas. He had to die the cruel death on
the cross. His death brought us everlasting life; yet Judas killed
himself.
We have been betrayed by many of our burghers. We have lost
our right for a time. Yet they who struggled to the end, and who
resolved that right must go first without thinking of might, have kept
their fortitude. However the end may have shaped itself, they are
glad not to have been Judases.
There was now a great longing on the part of all those around to
see the members of their families from whom they had been parted
for so long.
I, too, rejoiced that I could go to Pretoria to see my children, whom
I had not seen for so long a time. And I thought again of my beloved
son, whom I had had to give up.
And yet I had not had to sacrifice so much; many a woman had
given her husband and her children too.
We went on to Klerksdorp in my waggon. As we would have to
drive for two days before getting there, and as Sunday came in
between, on that day we made a halt. It was the last Sunday that I
spent out on the veldt. I thought earnestly of all that had taken place.
How many Sundays had I not spent in flying before the enemy?
Sunday evening we entered Klerksdorp; Monday morning I sent
back the waggons which had housed me and my children for twenty
months. I had grown so used to life in my waggon that I did not like
to see it go away.
We went to Pretoria that evening, and I found my children in well-
being. It was pleasant after such a long separation. I had been two
days in Pretoria when a son was born to my daughter. Then I
became a grandmother, for this was the first grand-child. He was
christened Jacobus Herklaas De la Rey. I felt very grateful. In all the
bitterness of those weary days I was able to say, “But He knoweth
the way that I take; when He hath tried me, I shall come forth as
gold” (Job xxiii. 10); and also as in Job ii. 10 (the last part), “What?
Shall we receive good at the hand of God and shall we not receive
evil? In all this did not Job sin with his lips.”
Now, dear brothers and sisters, since we are sure that God shall
prove His Word, let us stand steadfast in our faith and wait for the
salvation of the Lord. This is the time of trial; now will the Lord see if
we are worthy that He should make His wonders manifest unto us
before the eyes of the whole world.

“Though He tarry, He knows His hour,


He comes, He surely comes:
And all eyes shall contemplate Him,
All hearts shall realise the deeds
Which He wrought here upon earth.”

Mesdames Ferreira and Bezuidenhout.


One of the three ladies standing up (in the accompanying
photograph) is my daughter, now Mrs Ferreira; the other two are two
Misses Bezuidenhout. They have there the very flags of the
Transvaal and Orange Free State, which are their great treasure.
The two Misses Bezuidenhout are the daughters of the Widow
Bezuidenhout, my husband’s eldest sister. She is the mother of nine
children; she too had her house burned down and all that she
possessed taken away from her. I have often found her in tears, and
she would say to me, “What is to become of us all? I and my children
have not one single animal left alive, and everything on my farm has
been destroyed!”
Her daughter (Mrs Bodenstein) is also a mother of seven children,
and when I saw her she told me that she had nothing for her
children, not a single blanket had been left to her. When she went to
complain to the English officer, he only asked her, “Who is the man
who did it?” The man who did it was nowhere to be found. But the
harm had been done, and in the same way not one, but many had to
suffer. May no other people in the world ever have to endure another
such war so long as the world may last. I do not think that it can be
forgotten. I cannot say who it was who were wrong, we or our
adversaries. But this I can say, that it was terrible to bear. Never
could I have thought that human beings could treat each other in
such a way. I know well that war is one of the blackest things upon
earth, still I cannot depart from all sense of justice and put down
every sort of barbarity to war and consider it right. As we were
known to the whole world as two Christian nations, I had thought that
such things could not be allowed. But I have been taught that
suffering and misery can go on increasing to the bitter end, and that
in war no deed can be too hard or cruel to be committed.

Four of Mrs De la Rey’s children, with two little girl friends.


Here are four of my children who during all my wanderings were
with me all the time. A Frenchman near my waggon took this
photograph; he found it very amusing to take the children in that way.
The girl with the revolver is the daughter of Field-Cornet Meyer, at
whose place I was then staying. The Frenchman slung round the
children all the guns and revolvers and field-glasses.
At times the children thoroughly enjoyed life out on the veldt. It
was often trying for them with all the hardships we were constantly
encountering, but on the whole the children came off better than their
elders. When there was danger they would begin to cry, but soon it
would all be forgotten. They were always so ready to help me and to
make everything easier for me as far as they could. It was very hard
for them sometimes to look after themselves and keep their clothes
in order. And as they always went on growing they kept growing out
of their clothes and wore them out, so that they had to help to patch
and make the most of them. Sometimes they would have no cups or
mugs to drink out of—then they would set to work with “jambliks” and
try to make mugs out of them. My brother, Jan Greef, who used
sometimes to be near my waggon, would help to make mugs. He
was very clever at it. Fortunately, I managed to keep always one or
two cups and saucers. Cups which could break so easily during all
that driving about had to be taken great care of. We were always
trying to find new ways of doing things. If we wanted cotton for
sewing we would take a piece of sailcloth, unravel it and use the
threads, or else undo our shoe-strings and unravel them. The girls
made a large number of socks, stockings, bands, etc. We would take
some sheep’s wool and card it; then the Kaffir boys made little
wooden “machines,” and with them wove the wool into strands.
Three of Mrs De la Rey’s children.
Here are also three more of my children, the three youngest. The
little boy in the middle, Gabriel Johannes, is now six. He often used
to say to the English soldiers, “Hands up,” and then they would laugh
at him. It was a pleasure to him to be flying away; whenever we had
been long in one place, he would say, “I wish we had to run away
again; not from the ‘khakis,’ but just so as to be driving.” When he
heard that the burghers had to give up their arms, he said, “I am not
going to give my gun to ‘khakis’; I would rather break it in pieces; the
‘khakis’ shall never have it.” There he has it in his hand. It was such
a momentous time, even the youngest felt the weight of it.
If I am still to go through many days of hardship in this life, let me
then cast a glance upon those that are past. Because once we had
so many good and happy hours, and now find ourselves so bitterly
oppressed, I have often heard people say that they doubted whether
there could be a God.
What I used to think of in my days that were most full of sorrow,
was just that it could be no mortal that was guiding us through
everything; my past life rose constantly before my eyes. My father,
Hendrik Adrian Greef, who is now long since dead, was one of the
Voortrekkers of Transvaal, and he went then to live in Lichtenburg,
where I remained from my fourth year, till the day I was driven away
by the war. My father too had had to go through a great deal; for in
those days the whole place was a wilderness. When my father went
on a shooting expedition, as we were living alone on the farm, my
mother would not let him leave her behind; so she used to go with
him with her four children, of which I was the eldest, then twelve
years old. It was a wild country; one met only Kaffirs and other
savage people. They were like wild beasts; as soon as they caught
sight of us they would run off as hard as they could; thus very often
my father could not get anybody to show him where there was water.
He went on then to the Lake, and from there to Damaraland, so that
this journey kept us for a year on the veldt. We were often in danger
from the black Kaffirs, who tried to get hold of us; and often we came
to places where there was no water to be found.
When passing through these later days of trial I often thought of
my father, for a child does not easily forget what its father has done.
When living in prosperity, it had always been my wish to walk in the
ways of the Lord, and now, however hard the storm of injustice might
strike me, never would it be heavy enough to part me from the love
of Jesus. In the days when all was well with me I often wondered
how it would be if I had to bear great troubles; and if I should not
then despair. When one is happy it is easy enough to be a Christian.
And though, like everyone else upon this earth, I have my cross, yet
the Saviour has always made it lighter for me, so that I am able to
bear it.
And this weary war has taught me to see one thing clearly;—that
the bent reed will not be broken. For many a time when I thought,
“Now all is over,” I would seem to hear God’s voice answering me,
and saying, “He who persists unto the end shall be saved.” Thus I
cannot come to understand how in such times men can drift away
from God instead of drawing closer to Him.
So I can say about myself, “I shall still praise and thank the Lord
for His Fatherly guidance”; for who was it that watched over me
when I wandered around for many a stormy day and night?—It was
our gracious God. Who was it that kept me and mine from falling into
the hands of our adversary?—It was the Lord, before whom we must
all of us come one day.
How can I then prove unfaithful? No—however great may be our
sufferings, I shall always put my trust in the salvation of the Lord.

Praise the Lord, who ever will forgive your sins. How many they
may be, He will graciously forgive. He knows your sufferings and will
lovingly cure them. He will cleanse your life from stain, and will
crown you with goodness and mercy as He saved you in your need.
Jacoba Elizabeth De la Rey,
(born Greef).

COLSTON AND COY. LIMITED, PRINTERS, EDINBURGH


*** END OF THE PROJECT GUTENBERG EBOOK A WOMAN'S
WANDERINGS AND TRIALS DURING THE ANGLO-BOER WAR ***

Updated editions will replace the previous one—the old editions


will be renamed.

Creating the works from print editions not protected by U.S.


copyright law means that no one owns a United States copyright
in these works, so the Foundation (and you!) can copy and
distribute it in the United States without permission and without
paying copyright royalties. Special rules, set forth in the General
Terms of Use part of this license, apply to copying and
distributing Project Gutenberg™ electronic works to protect the
PROJECT GUTENBERG™ concept and trademark. Project
Gutenberg is a registered trademark, and may not be used if
you charge for an eBook, except by following the terms of the
trademark license, including paying royalties for use of the
Project Gutenberg trademark. If you do not charge anything for
copies of this eBook, complying with the trademark license is
very easy. You may use this eBook for nearly any purpose such
as creation of derivative works, reports, performances and
research. Project Gutenberg eBooks may be modified and
printed and given away—you may do practically ANYTHING in
the United States with eBooks not protected by U.S. copyright
law. Redistribution is subject to the trademark license, especially
commercial redistribution.

START: FULL LICENSE


THE FULL PROJECT GUTENBERG LICENSE
PLEASE READ THIS BEFORE YOU DISTRIBUTE OR USE THIS WORK

To protect the Project Gutenberg™ mission of promoting the


free distribution of electronic works, by using or distributing this
work (or any other work associated in any way with the phrase
“Project Gutenberg”), you agree to comply with all the terms of
the Full Project Gutenberg™ License available with this file or
online at www.gutenberg.org/license.

Section 1. General Terms of Use and


Redistributing Project Gutenberg™
electronic works
1.A. By reading or using any part of this Project Gutenberg™
electronic work, you indicate that you have read, understand,
agree to and accept all the terms of this license and intellectual
property (trademark/copyright) agreement. If you do not agree to
abide by all the terms of this agreement, you must cease using
and return or destroy all copies of Project Gutenberg™
electronic works in your possession. If you paid a fee for
obtaining a copy of or access to a Project Gutenberg™
electronic work and you do not agree to be bound by the terms
of this agreement, you may obtain a refund from the person or
entity to whom you paid the fee as set forth in paragraph 1.E.8.

1.B. “Project Gutenberg” is a registered trademark. It may only


be used on or associated in any way with an electronic work by
people who agree to be bound by the terms of this agreement.
There are a few things that you can do with most Project
Gutenberg™ electronic works even without complying with the
full terms of this agreement. See paragraph 1.C below. There
are a lot of things you can do with Project Gutenberg™
electronic works if you follow the terms of this agreement and
help preserve free future access to Project Gutenberg™
electronic works. See paragraph 1.E below.
1.C. The Project Gutenberg Literary Archive Foundation (“the
Foundation” or PGLAF), owns a compilation copyright in the
collection of Project Gutenberg™ electronic works. Nearly all the
individual works in the collection are in the public domain in the
United States. If an individual work is unprotected by copyright
law in the United States and you are located in the United
States, we do not claim a right to prevent you from copying,
distributing, performing, displaying or creating derivative works
based on the work as long as all references to Project
Gutenberg are removed. Of course, we hope that you will
support the Project Gutenberg™ mission of promoting free
access to electronic works by freely sharing Project
Gutenberg™ works in compliance with the terms of this
agreement for keeping the Project Gutenberg™ name
associated with the work. You can easily comply with the terms
of this agreement by keeping this work in the same format with
its attached full Project Gutenberg™ License when you share it
without charge with others.

1.D. The copyright laws of the place where you are located also
govern what you can do with this work. Copyright laws in most
countries are in a constant state of change. If you are outside
the United States, check the laws of your country in addition to
the terms of this agreement before downloading, copying,
displaying, performing, distributing or creating derivative works
based on this work or any other Project Gutenberg™ work. The
Foundation makes no representations concerning the copyright
status of any work in any country other than the United States.

1.E. Unless you have removed all references to Project


Gutenberg:

1.E.1. The following sentence, with active links to, or other


immediate access to, the full Project Gutenberg™ License must
appear prominently whenever any copy of a Project
Gutenberg™ work (any work on which the phrase “Project
Gutenberg” appears, or with which the phrase “Project
Gutenberg” is associated) is accessed, displayed, performed,
viewed, copied or distributed:

This eBook is for the use of anyone anywhere in the United


States and most other parts of the world at no cost and with
almost no restrictions whatsoever. You may copy it, give it
away or re-use it under the terms of the Project Gutenberg
License included with this eBook or online at
www.gutenberg.org. If you are not located in the United
States, you will have to check the laws of the country where
you are located before using this eBook.

1.E.2. If an individual Project Gutenberg™ electronic work is


derived from texts not protected by U.S. copyright law (does not
contain a notice indicating that it is posted with permission of the
copyright holder), the work can be copied and distributed to
anyone in the United States without paying any fees or charges.
If you are redistributing or providing access to a work with the
phrase “Project Gutenberg” associated with or appearing on the
work, you must comply either with the requirements of
paragraphs 1.E.1 through 1.E.7 or obtain permission for the use
of the work and the Project Gutenberg™ trademark as set forth
in paragraphs 1.E.8 or 1.E.9.

1.E.3. If an individual Project Gutenberg™ electronic work is


posted with the permission of the copyright holder, your use and
distribution must comply with both paragraphs 1.E.1 through
1.E.7 and any additional terms imposed by the copyright holder.
Additional terms will be linked to the Project Gutenberg™
License for all works posted with the permission of the copyright
holder found at the beginning of this work.

1.E.4. Do not unlink or detach or remove the full Project


Gutenberg™ License terms from this work, or any files
containing a part of this work or any other work associated with
Project Gutenberg™.
1.E.5. Do not copy, display, perform, distribute or redistribute
this electronic work, or any part of this electronic work, without
prominently displaying the sentence set forth in paragraph 1.E.1
with active links or immediate access to the full terms of the
Project Gutenberg™ License.

1.E.6. You may convert to and distribute this work in any binary,
compressed, marked up, nonproprietary or proprietary form,
including any word processing or hypertext form. However, if
you provide access to or distribute copies of a Project
Gutenberg™ work in a format other than “Plain Vanilla ASCII” or
other format used in the official version posted on the official
Project Gutenberg™ website (www.gutenberg.org), you must, at
no additional cost, fee or expense to the user, provide a copy, a
means of exporting a copy, or a means of obtaining a copy upon
request, of the work in its original “Plain Vanilla ASCII” or other
form. Any alternate format must include the full Project
Gutenberg™ License as specified in paragraph 1.E.1.

1.E.7. Do not charge a fee for access to, viewing, displaying,


performing, copying or distributing any Project Gutenberg™
works unless you comply with paragraph 1.E.8 or 1.E.9.

1.E.8. You may charge a reasonable fee for copies of or


providing access to or distributing Project Gutenberg™
electronic works provided that:

• You pay a royalty fee of 20% of the gross profits you derive from
the use of Project Gutenberg™ works calculated using the
method you already use to calculate your applicable taxes. The
fee is owed to the owner of the Project Gutenberg™ trademark,
but he has agreed to donate royalties under this paragraph to
the Project Gutenberg Literary Archive Foundation. Royalty
payments must be paid within 60 days following each date on
which you prepare (or are legally required to prepare) your
periodic tax returns. Royalty payments should be clearly marked
as such and sent to the Project Gutenberg Literary Archive
Foundation at the address specified in Section 4, “Information
about donations to the Project Gutenberg Literary Archive
Foundation.”

• You provide a full refund of any money paid by a user who


notifies you in writing (or by e-mail) within 30 days of receipt that
s/he does not agree to the terms of the full Project Gutenberg™
License. You must require such a user to return or destroy all
copies of the works possessed in a physical medium and
discontinue all use of and all access to other copies of Project
Gutenberg™ works.

• You provide, in accordance with paragraph 1.F.3, a full refund of


any money paid for a work or a replacement copy, if a defect in
the electronic work is discovered and reported to you within 90
days of receipt of the work.

• You comply with all other terms of this agreement for free
distribution of Project Gutenberg™ works.

1.E.9. If you wish to charge a fee or distribute a Project


Gutenberg™ electronic work or group of works on different
terms than are set forth in this agreement, you must obtain
permission in writing from the Project Gutenberg Literary
Archive Foundation, the manager of the Project Gutenberg™
trademark. Contact the Foundation as set forth in Section 3
below.

1.F.

1.F.1. Project Gutenberg volunteers and employees expend


considerable effort to identify, do copyright research on,
transcribe and proofread works not protected by U.S. copyright
law in creating the Project Gutenberg™ collection. Despite
these efforts, Project Gutenberg™ electronic works, and the
medium on which they may be stored, may contain “Defects,”
such as, but not limited to, incomplete, inaccurate or corrupt
data, transcription errors, a copyright or other intellectual
property infringement, a defective or damaged disk or other
medium, a computer virus, or computer codes that damage or
cannot be read by your equipment.

1.F.2. LIMITED WARRANTY, DISCLAIMER OF DAMAGES -


Except for the “Right of Replacement or Refund” described in
paragraph 1.F.3, the Project Gutenberg Literary Archive
Foundation, the owner of the Project Gutenberg™ trademark,
and any other party distributing a Project Gutenberg™ electronic
work under this agreement, disclaim all liability to you for
damages, costs and expenses, including legal fees. YOU
AGREE THAT YOU HAVE NO REMEDIES FOR NEGLIGENCE,
STRICT LIABILITY, BREACH OF WARRANTY OR BREACH
OF CONTRACT EXCEPT THOSE PROVIDED IN PARAGRAPH
1.F.3. YOU AGREE THAT THE FOUNDATION, THE
TRADEMARK OWNER, AND ANY DISTRIBUTOR UNDER
THIS AGREEMENT WILL NOT BE LIABLE TO YOU FOR
ACTUAL, DIRECT, INDIRECT, CONSEQUENTIAL, PUNITIVE
OR INCIDENTAL DAMAGES EVEN IF YOU GIVE NOTICE OF
THE POSSIBILITY OF SUCH DAMAGE.

1.F.3. LIMITED RIGHT OF REPLACEMENT OR REFUND - If


you discover a defect in this electronic work within 90 days of
receiving it, you can receive a refund of the money (if any) you
paid for it by sending a written explanation to the person you
received the work from. If you received the work on a physical
medium, you must return the medium with your written
explanation. The person or entity that provided you with the
defective work may elect to provide a replacement copy in lieu
of a refund. If you received the work electronically, the person or
entity providing it to you may choose to give you a second
opportunity to receive the work electronically in lieu of a refund.
If the second copy is also defective, you may demand a refund
in writing without further opportunities to fix the problem.

1.F.4. Except for the limited right of replacement or refund set


forth in paragraph 1.F.3, this work is provided to you ‘AS-IS’,
WITH NO OTHER WARRANTIES OF ANY KIND, EXPRESS
OR IMPLIED, INCLUDING BUT NOT LIMITED TO
WARRANTIES OF MERCHANTABILITY OR FITNESS FOR
ANY PURPOSE.

1.F.5. Some states do not allow disclaimers of certain implied


warranties or the exclusion or limitation of certain types of
damages. If any disclaimer or limitation set forth in this
agreement violates the law of the state applicable to this
agreement, the agreement shall be interpreted to make the
maximum disclaimer or limitation permitted by the applicable
state law. The invalidity or unenforceability of any provision of
this agreement shall not void the remaining provisions.

1.F.6. INDEMNITY - You agree to indemnify and hold the


Foundation, the trademark owner, any agent or employee of the
Foundation, anyone providing copies of Project Gutenberg™
electronic works in accordance with this agreement, and any
volunteers associated with the production, promotion and
distribution of Project Gutenberg™ electronic works, harmless
from all liability, costs and expenses, including legal fees, that
arise directly or indirectly from any of the following which you do
or cause to occur: (a) distribution of this or any Project
Gutenberg™ work, (b) alteration, modification, or additions or
deletions to any Project Gutenberg™ work, and (c) any Defect
you cause.

Section 2. Information about the Mission of


Project Gutenberg™
Project Gutenberg™ is synonymous with the free distribution of
electronic works in formats readable by the widest variety of
computers including obsolete, old, middle-aged and new
computers. It exists because of the efforts of hundreds of
volunteers and donations from people in all walks of life.

Volunteers and financial support to provide volunteers with the


assistance they need are critical to reaching Project
Gutenberg™’s goals and ensuring that the Project Gutenberg™
collection will remain freely available for generations to come. In
2001, the Project Gutenberg Literary Archive Foundation was
created to provide a secure and permanent future for Project
Gutenberg™ and future generations. To learn more about the
Project Gutenberg Literary Archive Foundation and how your
efforts and donations can help, see Sections 3 and 4 and the
Foundation information page at www.gutenberg.org.

Section 3. Information about the Project


Gutenberg Literary Archive Foundation
The Project Gutenberg Literary Archive Foundation is a non-
profit 501(c)(3) educational corporation organized under the
laws of the state of Mississippi and granted tax exempt status by
the Internal Revenue Service. The Foundation’s EIN or federal
tax identification number is 64-6221541. Contributions to the
Project Gutenberg Literary Archive Foundation are tax
deductible to the full extent permitted by U.S. federal laws and
your state’s laws.

The Foundation’s business office is located at 809 North 1500


West, Salt Lake City, UT 84116, (801) 596-1887. Email contact
links and up to date contact information can be found at the
Foundation’s website and official page at
www.gutenberg.org/contact

Section 4. Information about Donations to


the Project Gutenberg Literary Archive
Foundation
Project Gutenberg™ depends upon and cannot survive without
widespread public support and donations to carry out its mission
of increasing the number of public domain and licensed works
that can be freely distributed in machine-readable form

You might also like