Professional Documents
Culture Documents
09 Tools For Data Science GGurtner
09 Tools For Data Science GGurtner
Dr Gerald Gurtner
12 Sept. 2019
Engage summer school, FTTE, Belgrade
The data scientist’s job
• Ex.:
Kolmogorov-
Smirnov test
(modularity based)
• Relational database
• Advantage
• Structured Query Language (SQL)
• Easy to join data
• Server based solutions
• Secure access
• View on data
• Explore data with interface
• Structure of database (UML)
• API for programming languages
• ACID principle (Atomicity, Consistency,
Isolation, Durability)
• Disadvantages
• Might be difficult to scale
• Slower access
• Dynamic definition of data structure
• Harder to keep traceability
• Server configuration for perfomances
Engage summer school , Belgrade, 12/09/2019 27
Data storage
• Relational databases
- SQLite
- Different implementations /
- Apache CASSANDRA alternatives
- Research online!
- Apache HBASE
- mongoDB
- Amazon Dynamo
• Java:
• Dedicated frameworks for ABMs (Java): Jade, Jadex.
• Faster than Python.
• Less obvious syntax, harder to learn.
• Not so used for data handling and data visualisation.
Thank you
This network has received funding from the SESAR Joint Undertaking
under the European Union’s Horizon 2020 research and innovation
programme under grant agreement No 783287.