Professional Documents
Culture Documents
SQL Masterclass Odsc Apac 2021
SQL Masterclass Odsc Apac 2021
SQL Masterclass
for Data Scientists
Danny Ma
Sept 2021
02
About me
My Data Science Journey
03
(so far...)
bit.ly/dwd-info
COMMON SQL MYTHS
06
"SQL is such an old "I can do everything in "SQL is so slow and poor
language, I only want to Python, I don't need performing, I can't rely
learn modern tools so it SQL for any of my data on it because sometimes
looks better on my science tasks" the database can't
resume" handle my queries"
"The only SQL I need is "SQL is going away soon "SQL is only for data
SELECT * FROM and I don't want to analysts, as a data
because I always need waste my time learning scientist - I don't need
the entire dataset from it - everything is moving to use SQL anymore
the database" to NoSQL anyway" because it's below me"
SQL FACTS
07
65% of data scientists The modern data stack SQL databases can also
and data analysts use is moving to SQL-centric store semi-structured
SQL according to a 2020 tools such as Snowflake, data like JSON objects
Stack Overflow survey Amazon Redshift and with expanding support
Google BigQuery for images and other
newer data sources
THE PLAN FOR TODAY
08
SQL General
Masterclass Q&A
08
bit.ly/odsc-sql
09
SQL
Masterclass
10
General
Q&A