Delta Lake: Up and Running

Delta Lake: Up and Running
Author :
Publisher : "O'Reilly Media, Inc."
Total Pages : 271
Release :
ISBN-10 : 9781098139681
ISBN-13 : 1098139682
Rating : 4/5 (81 Downloads)

Book Synopsis Delta Lake: Up and Running by : Bennie Haelen

Download or read book Delta Lake: Up and Running written by Bennie Haelen and published by "O'Reilly Media, Inc.". This book was released on 2023-10-16 with total page 271 pages. Available in PDF, EPUB and Kindle. Book excerpt: With the surge in big data and AI, organizations can rapidly create data products. However, the effectiveness of their analytics and machine learning models depends on the data's quality. Delta Lake's open source format offers a robust lakehouse framework over platforms like Amazon S3, ADLS, and GCS. This practical book shows data engineers, data scientists, and data analysts how to get Delta Lake and its features up and running. The ultimate goal of building data pipelines and applications is to gain insights from data. You'll understand how your storage solution choice determines the robustness and performance of the data pipeline, from raw data to insights. You'll learn how to: Use modern data management and data engineering techniques Understand how ACID transactions bring reliability to data lakes at scale Run streaming and batch jobs against your data lake concurrently Execute update, delete, and merge commands against your data lake Use time travel to roll back and examine previous data versions Build a streaming data quality pipeline following the medallion architecture


Delta Lake: Up and Running Related Books

Delta Lake: Up and Running
Language: en
Pages: 271
Authors: Bennie Haelen
Categories: Computers
Type: BOOK - Published: 2023-10-16 - Publisher: "O'Reilly Media, Inc."

DOWNLOAD EBOOK

With the surge in big data and AI, organizations can rapidly create data products. However, the effectiveness of their analytics and machine learning models dep
Data Engineering with Apache Spark, Delta Lake, and Lakehouse
Language: en
Pages: 480
Authors: Manoj Kukreja
Categories: Computers
Type: BOOK - Published: 2021-10-22 - Publisher: Packt Publishing Ltd

DOWNLOAD EBOOK

Understand the complexities of modern-day data engineering platforms and explore strategies to deal with them with the help of use case scenarios led by an indu
Building the Data Lakehouse
Language: en
Pages: 256
Authors: Bill Inmon
Categories:
Type: BOOK - Published: 2021-10 - Publisher: Technics Publications

DOWNLOAD EBOOK

The data lakehouse is the next generation of the data warehouse and data lake, designed to meet today's complex and ever-changing analytics, machine learning, a
Data Lakehouse in Action
Language: en
Pages: 206
Authors: Pradeep Menon
Categories: Computers
Type: BOOK - Published: 2022-03-17 - Publisher: Packt Publishing Ltd

DOWNLOAD EBOOK

Propose a new scalable data architecture paradigm, Data Lakehouse, that addresses the limitations of current data architecture patterns Key FeaturesUnderstand h
Trino: The Definitive Guide
Language: en
Pages: 310
Authors: Matt Fuller
Categories: Computers
Type: BOOK - Published: 2021-04-14 - Publisher: "O'Reilly Media, Inc."

DOWNLOAD EBOOK

Perform fast interactive analytics against different data sources using the Trino high-performance distributed SQL query engine. With this practical guide, you'