Professional Documents
Culture Documents
W1 L11 Introduction
W1 L11 Introduction
management: introduction
H Batatia
Dubai Teaching Team
2
Timetable (Dubai)
Lecture session
Lab session
3
Course content – part 1
• Semantic web
• Linked data
• Knowledge graphs
• Knowledge representation: ontologies, inference – RDFS, OWL
• Knowledge retrieval: SPARQL, triplestores
• Data integration
Course content – part 2
• Big data
• NoSQL data models
• Graph model (neo4j, Cypher QL)
• Key-value model (Redis)
• Document model (MongoDB, JavaScript API)
• Wide-column (Cassandra, CQL)
• New SQL
• Geospatial data
• vector and raster models
• Geospatial SQL
Tools needed
• Part 1
• Protégé: ontology editor with OWL, RDFS..., SPARQL querying
• Fuseki JENA: triplestore, SPARQL querying
• H2 or MYSQL: data integration
• A few more tools online (viewer, validator…)
• Part 2
• neo4j: graph database
• MongoDB: document store
• Cassandra: wide- column store
• PostgreSQL + PostGIS + Qgis: geographical information systems
Prior knowledge
• Extensive knowledge of Relational Database Systems
• Relational model and algebra
• SQL
• ACID Transactions
• First order logic
• Conjunction, disjunction, and negation
• Familiarity with distributed system concepts
• Messaging issues
• Client-server
• Peer-to-peer
Expectations
• F21DB/F20DB is a Technical course
• Hours (per week):
• 2 hours lecture (selected topics)
• 1 hour lab/tutorial
• 7-10 hours private study
Assessment
• Assignments 30%
• Part 1 (15%): class test
• Coursework - Stage 2 (15%)
• Exam 70%
• Part 1 – ontologies, owl, sparql queries
• Part 2 – noSQL models, GIS, data wrangling
Questions?
Traditional computer science
Data, information, and knowledge concerned with this
Statistical
Inference New data
prediction
Properties (estimates)
Prediction, Classification, regression
Of the phenomenon
Big Data
• Facebook
• Youtube: Videos
• Instagram: Images
• Twitter: words, sentences etc. Categorical Data: words
17
Questions
19
Characteristics of big data
Semantics Processing
(semantics hard coded in the
programs)
• A term invented by the French
philosopher Michel Bréal to explain
how terms may have various Information
meanings for different people
• In computer science, semantics
refers to the “meaning and Source 1 Source 2 Source 3
practical use of data” (data objects Information Information Information
used to represent a concept or
entity)
semantics Reasoning
• Semantic technologies are need
when data from different sources
Decision
and types is combined (integrated)
to perform some decision making