Professional Documents
Culture Documents
BRK250
BRK250
Introduction
Speakers
Einat Orr, PhD.
CEO & Co-founder
Treeverse
@EinatOrr
Adi Polak
Senior Cloud Advocate
Microsoft
@AdiPolak
Object storage is the present
and future of data lakes
The data lake advantage
Scalability and cost effectiveness
High throughput
IN A PERFECT WORLD
revert
experiment-1
main
Continuous data integration
Ingest new data safely
merge changeset:
✓ 001.parquet
✓ 002.parquet
x random.csv
new-data-1
main
Continuous data deployment
Prevent data quality issues: by testing production
data before exposing it to users
Commit metadata
topic_name = events
topic_offset = 1761348
job_git_commit = 60c3fa
stream
main
Demo
Einat Orr
3