Delta Lake: The Definitive Guide: Modern Data Lakehouse Architectures with Data Lakes
Description:
Discover how Delta Lake simplifies the process of building data lakehouses and data pipelines at scale. With this practical guide, data engineers, data scientists, and data analysts will explore key data reliability challenges and learn to apply modern data engineering and management techniques. You'll also understand how ACID transactions bring reliability to data lakehouses at scale.
This book helps you:
- Understand key data reliability challenges
- Examine data management and engineering techniques using the modern data stack
- Realize data reliability improvements using Delta Lake
- Concurrently run streaming and batch jobs against your data lake
- Execute update, delete, and merge commands
- Use time travel to roll back and examine previous versions of your data
- Build a streaming data quality pipeline following the medallion construct