Download as pdf or txt
Download as pdf or txt
You are on page 1of 9

Manage Data

with Delta Lake

©2022 Databricks Inc. — All rights reserved 1


Module Agenda
Manage Data with Delta Lake

What is Delta Lake


DE 3.1 - Schemas and Tables
DE 3.2 - Version and Optimize Delta Tables
DE 3.3L - Manipulate Delta Tables Lab
DE 3.4 - Set Up Delta Tables
DE 3.5 - Load Data into Delta Lake
DE 3.6 - Load Data Lab

©2022 Databricks Inc. — All rights reserved 2


What is Delta
Lake?

©2022 Databricks Inc. — All rights reserved 3


Delta Lake is an open-source
project that enables building a
data lakehouse on top of
existing storage systems

©2022 Databricks Inc. — All rights reserved 4


Delta Lake Is Not…

• Proprietary technology
• Storage format
• Storage medium
• Database service or data warehouse

©2022 Databricks Inc. — All rights reserved 5


Delta Lake Is…

• Open source
• Builds upon standard data formats
• Optimized for cloud object storage
• Built for scalable metadata handling

©2022 Databricks Inc. — All rights reserved 6


Delta Lake brings ACID to object storage

▪ Atomicity
▪ Consistency
▪ Isolation
▪ Durability

©2022 Databricks Inc. — All rights reserved 7


Problems solved by ACID
1. Hard to append data
2. Modification of existing data difficult
3. Jobs failing mid way
4. Real-time operations hard
5. Costly to keep historical data versions

©2022 Databricks Inc. — All rights reserved


Delta Lake is the default for all
tables created in Databricks

©2022 Databricks Inc. — All rights reserved 9

You might also like