The Linux Foundation Projects
Delta Lake

Delta Lake Blogs

Thumbnail for Working with Apache Sedona

Working with Apache Sedona

By Avril Aysha

Learn how to use Apache Sedona with Delta Lake

Thumbnail for Understanding Open Table Formats

Understanding Open Table Formats

By Avril Aysha

Learn about open table formats

Thumbnail for Delta Lake Liquid Clustering

Delta Lake Liquid Clustering

By Avril Aysha

Learn how to use Delta Lake Liquid Clustering feature

Thumbnail for Delta Lake on Azure Data Lake Storage

Delta Lake on Azure Data Lake Storage

By Avril Aysha

Learn how to use Delta Lake on Azure Data Lake Storage

Thumbnail for Delta Lake Upsert

Delta Lake Upsert

By Avril Aysha

Learn how to perform upserts with Delta Lake

Thumbnail for Delta Lake on GCP

Delta Lake on GCP

By Avril Aysha

Learn how to use Delta Lake on GCP

Thumbnail for Building the Medallion Architecture with Delta Lake

Building the Medallion Architecture with Delta Lake

By Matthew Powers

Using the Medallion Architecture with Delta Lake

Thumbnail for Delta Lake Clone

Delta Lake Clone

By Avril Aysha

Learn how to clone Delta tables

Thumbnail for Delta Lake on S3

Delta Lake on S3

By Avril Aysha

Learn how to use Delta Lake on S3

Thumbnail for Delta Lake for ETL

Delta Lake for ETL

By Avril Aysha

Learn how to use Delta Lake for ETL workloads

Thumbnail for Delta Lake 4.0 Preview

Delta Lake 4.0 Preview

By Tathagata Das , Allison Portis , Scott Sandre , Susan Pierce , Carly Akerly

We are pleased to announce the preview release of Delta Lake 4.0 (release notes) on Apache Spark™ 4.0 Preview.

Thumbnail for Delta Lake Optimize

Delta Lake Optimize

By Avril Aysha

Learn how to optimize your Delta Lake tables

Thumbnail for Unlocking the Power of Delta Lake 3.0+: Introducing the New StarTree Connector with Delta Kernel

Unlocking the Power of Delta Lake 3.0+: Introducing the New StarTree Connector with Delta Kernel

By Vibhuti Bhushan

In the rapidly evolving landscape of data management, staying up-to-date with the latest advancements is key to maintaining a competitive edge.

Thumbnail for Delta Kernel - Building Delta Lake connectors, made simple

Delta Kernel - Building Delta Lake connectors, made simple

By Nick Lanham , Tathagata Das

Delta Lake recently hit an impressive milestone of being downloaded more than 20M times per month!

Thumbnail for Unifying the open table formats with Delta Lake Universal Format (UniForm) and Apache XTable

Unifying the open table formats with Delta Lake Universal Format (UniForm) and Apache XTable

By Jonathan Brito , Kyle Weller

Delta Lake Universal Format (UniForm) enables Delta tables to be read by any engine that supports Delta, Iceberg, and now, through code contributed by Apache XTable, Hudi.

Thumbnail for Query Delta Lake natively using BigQuery

Query Delta Lake natively using BigQuery

By Gaurav Saxena , Justin Levandoski

Users working with Delta Lake tables can now easily integrate their workloads with BigQuery, ensuring secure and more managed interoperability.

Thumbnail for A Guide to Delta Lake Sessions at Data+AI Summit

A Guide to Delta Lake Sessions at Data+AI Summit

By Carly Akerly

The Data+AI Summit returns to San Francisco from June 10-13, 2024.

Thumbnail for Delta Lake without Spark

Delta Lake without Spark

By Avril Aysha

Learn how to use Delta Lake without Spark

Thumbnail for Use Delta Lake from Jupyter Notebook

Use Delta Lake from Jupyter Notebook

By Avril Aysha

Learn how to use Delta Lake from a Jupyter Notebook

Thumbnail for Scaling Graph Data Processing with Delta Lake: Lessons from a Real-World Use Case

Scaling Graph Data Processing with Delta Lake: Lessons from a Real-World Use Case

By Yeshwanth Vijayakumar

The Adobe Experience Platform includes a set of analytics, social, advertising, media optimization, targeting, Web experience management, journey orchestration, and content management products.

Thumbnail for Delta Lake vs Data Lake - What's the Difference?

Delta Lake vs Data Lake - What's the Difference?

By Avril Aysha

Understand the difference between Delta Lake and a data lake

Thumbnail for Delta Lake 3.2

Delta Lake 3.2

By Carly Akerly

We are pleased to announce the release of Delta Lake 3.2 (release notes) on Apache Spark 3.5, with features that improve the performance and interoperability of Delta Lake.

Thumbnail for Efficient Delta Vacuum with File Inventory

Efficient Delta Vacuum with File Inventory

By Arun Ravi M V (Grab)

Today, Delta Lake is rapidly making its mark as a highly popular hybrid data format, earning widespread adoption across various organizations.

Thumbnail for Rivian expands the Delta Lake ecosystem with Delta-Go

Rivian expands the Delta Lake ecosystem with Delta-Go

By Chelsea Jones , Rahul Madnawat , Jason Shiverick

Real-time data ingestion for high-volume transactions, now available in open source

Thumbnail for Pros and cons of Hive-style partitioning

Pros and cons of Hive-style partitioning

By Matthew Powers , Martin Bode

This post discusses the pros and cons of Hive-style partioning.

Thumbnail for Structured Spark Streaming with Delta Lake: A Comprehensive Guide

Structured Spark Streaming with Delta Lake: A Comprehensive Guide

By Delta Lake

The webinar demonstrates how to embrace structured streaming seamlessly from data emission to your final Delta table destination.

Thumbnail for High-Performance Querying on Massive Delta Lake Tables with Daft

High-Performance Querying on Massive Delta Lake Tables with Daft

By Clark Zinzow , Jay Chia

This post introduces the distributed + parallel Delta Lake reader in Daft.

Thumbnail for Delta Lake - State of the Project - Part 2

Delta Lake - State of the Project - Part 2

By Tathagata Das , Susan Pierce , Carly Akerly

Delta Lake, a project hosted under The Linux Foundation, has been growing by leaps and bounds. To celebrate the achievements of the project, we’re publishing a 2-part series on Delta Lake.

Thumbnail for Delta Lake Announces Pandas Enhancement: Real Pandas to Optimize Data Lakehouse Performance

Delta Lake Announces Pandas Enhancement: Real Pandas to Optimize Data Lakehouse Performance

By Carly Akerly

The Delta Lake project is thrilled to announce its latest and most exciting collaboration with the Pandas community!

Thumbnail for Delta Lake - State of the Project - Part 1

Delta Lake - State of the Project - Part 1

By Tathagata Das , Susan Pierce , Carly Akerly

Delta Lake, a project hosted under The Linux Foundation, has been growing by leaps and bounds. To celebrate the achievements of the project, we’re publishing a 2-part series on Delta Lake.

Thumbnail for Delta Lake 3.1.0

Delta Lake 3.1.0

By Carly Akerly

This post describes the exiting features in the Delta Lake 3.1.0 release

Thumbnail for Delta Lake 3.3

Delta Lake 3.3

By Allison Portis , Susan Pierce

We are pleased to announce the release of Delta Lake 3.3 on Apache Spark 3.5, with features that improve the performance and interoperability of Delta Lake.

Thumbnail for Delta Lake replaceWhere

Delta Lake replaceWhere

By Matthew Powers

Selectively overriding rows or partitions of a Delta Lake table with replaceWhere.

Thumbnail for Delta Lake Performance

Delta Lake Performance

By Joe Harris

This post shows explains why Delta Lake is fast and describes improvements to Delta Lake performance over time.

Thumbnail for Writing a Kafka Stream to Delta Lake with Spark Structured Streaming

Writing a Kafka Stream to Delta Lake with Spark Structured Streaming

By Bo Gao , Matthew Powers

This blog post explains how to write a Kafka stream to a Delta table with Spark Structured Streaming.

Thumbnail for Using Delta Lake with AWS Glue

Using Delta Lake with AWS Glue

By Keerthi Josyula , Matthew Powers

This post shows how to register Delta tables in the AWS Glue Data Catalog with the AWS Glue Crawler.

Thumbnail for New features in the Python deltalake 0.12.0 release

New features in the Python deltalake 0.12.0 release

By Ion Koutsouris

This post explains the new features in the Python deltalake 0.12.0 release

Thumbnail for Delta Lake 3.0.0

Delta Lake 3.0.0

By Carly Akerly

This post describes the exiting features in the Delta Lake 3.0.0 release

Thumbnail for Delta Lake vs. Parquet Comparison

Delta Lake vs. Parquet Comparison

By Matthew Powers

This post compares the stengths and weaknesses of Delta Lake vs Parquet.

Thumbnail for Delta Lake vs. ORC Comparison

Delta Lake vs. ORC Comparison

By Avril Aysha

This post compares the stengths and weaknesses of Delta Lake vs ORC.

Thumbnail for Unlock Delta Lakes for PyTorch Training with DeltaTorch

Unlock Delta Lakes for PyTorch Training with DeltaTorch

By Daniel Liden , Michael Shtelma

This post demonstrates how to create PyTorch DataLoaders using Delta tables as data sources for training deep learning models.

Thumbnail for Introducing Delta Lake Table Features

Introducing Delta Lake Table Features

By Nick Karpov

This introduces Delta Lake Table Features, a discrete feature-based compatibility scheme that replaces the traditional integer protocol versioning for Delta Lake tables and clients.

Thumbnail for Delta Lake Change Data Feed (CDF)

Delta Lake Change Data Feed (CDF)

By Nick Karpov , Matthew Powers

This blog shows how to enable and use the Delta Lake Change Data Feed.

Thumbnail for Delta Lake’s transaction log protocol and its implementations

Delta Lake’s transaction log protocol and its implementations

By Matthew Powers

This blog explains the Delta Lake transaction log protocol and its various implementation.

Thumbnail for Delta Lake Deletion Vectors

Delta Lake Deletion Vectors

By Nick Karpov

This blog introduces the new Deletion Vectors table feature for Delta Lake tables, and explains how Deletion Vectors speed up operations that modify existing data in your lakehouse.

Thumbnail for Using Ibis with PySpark on Delta Lake tables

Using Ibis with PySpark on Delta Lake tables

By Marlene Mhangami , Matthew Powers

This post explains how to use Ibis to query Delta tables with PySpark

Thumbnail for Delta Lake Z Order

Delta Lake Z Order

By Matthew Powers

This post explains how to use Delta Lake Z Order to make your queries run faster

Thumbnail for Delta Lake 2.3.0 Released

Delta Lake 2.3.0 Released

By Allison Portis , Matthew Powers

This post explains some of the key features in the Delta Lake 2.3.0 release

Thumbnail for Open source self-hosted Delta Sharing server

Open source self-hosted Delta Sharing server

By Shingo Okawa

This post explains Kotosiro Delta Sharing server basic instructions

Thumbnail for How Delta Lake uses metadata to make certain aggregations much faster

How Delta Lake uses metadata to make certain aggregations much faster

By Matthew Powers , Scott Sandre

This post explains Delta Lake performance optimizations that make some aggregations execute quicker

Thumbnail for How to use Delta Lake generated columns

How to use Delta Lake generated columns

By Matthew Powers

How to create Delta Lake tables with generated columns and the benefits of this feature

Thumbnail for Introducing Support for Delta Lake Tables in AWS Lambda

Introducing Support for Delta Lake Tables in AWS Lambda

By Nick Karpov

How to use deltalake in AWS Lambda with AWS SDK for pandas

Thumbnail for How to create and append to Delta Lake tables with pandas

How to create and append to Delta Lake tables with pandas

By Matthew Powers

This post explains how to create and append to Delta Lake tables with pandas

Thumbnail for Running ML Workflows with Delta Lake and Ray

Running ML Workflows with Delta Lake and Ray

By Jim Hibbard

This post explains how you can read Delta Lake with the Ray compute framework

Thumbnail for How to Convert from CSV to Delta Lake

How to Convert from CSV to Delta Lake

By Matthew Powers

This post explains how to convert from a CSV data lake to Delta Lake, which offers much better features.

Thumbnail for Getting started contributing to Delta Lake Spark

Getting started contributing to Delta Lake Spark

By Nick Karpov

This post explains the full development loop with the Delta Lake Spark connector. You'll learn how to retrieve and navigate the codebase, make changes, and package and debug custom builds.

Thumbnail for New features in the Python deltalake 0.7.0 release of delta-rs

New features in the Python deltalake 0.7.0 release of delta-rs

By Will Jones , Matthew Powers

This post explains the new features in the deltalake 0.7.0 release

Thumbnail for Delta Lake Merge

Delta Lake Merge

By Nick Karpov

This post shows how to use MERGE with Delta tables.

Thumbnail for Delta Lake Schema Evolution

Delta Lake Schema Evolution

By Matthew Powers

This post shows how to enable schema evolution in Delta tables and when this is a good option.

Thumbnail for Delta Lake Time Travel

Delta Lake Time Travel

By Matthew Powers

This post shows how to time travel between different versions of a Delta table.

Thumbnail for Delta Lake Small File Compaction with OPTIMIZE

Delta Lake Small File Compaction with OPTIMIZE

By Matthew Powers

This post shows compact small files in Delta tables with OPTMIZE.

Thumbnail for Adding and Deleting Partitions in Delta Lake tables

Adding and Deleting Partitions in Delta Lake tables

By Matthew Powers , Ryan Zhu

This post shows add partitions and remove partitions from Delta Lake tables.

Thumbnail for Remove old files with the Delta Lake Vacuum Command

Remove old files with the Delta Lake Vacuum Command

By Matthew Powers , Nick Karpov

This blog post explains how to remove files marked for deletion from storage with the Delta Lake Vacuum command.

Thumbnail for Reading Delta Lake Tables into Polars DataFrames

Reading Delta Lake Tables into Polars DataFrames

By Matthew Powers , Chitral Verma

This post shows how to read Delta Lake tables into Polars DataFrames.

Thumbnail for Building a more efficient data infrastructure for machine learning with Open Source using Delta Lake, Amazon SageMaker, and EMR

Building a more efficient data infrastructure for machine learning with Open Source using Delta Lake, Amazon SageMaker, and EMR

By Vedant Jain , Denny Lee

In this blog, we’ll explore how connecting Delta Lake, Amazon SageMaker Studio, and Amazon EMR can simplify the end-to-end workflow required to support data engineering and data science projects.

Thumbnail for How to Delete Rows from a Delta Lake Table

How to Delete Rows from a Delta Lake Table

By Matthew Powers

This post teaches you how to delete rows from a Delta Lake table and how the operation is implemented under the hood.

Thumbnail for Delta Lake Constraints and Checks

Delta Lake Constraints and Checks

By Matthew Powers

This post shows how to add constraints to your Delta table to avoid certain types of values from getting appended.

Thumbnail for Delta Lake Schema Enforcement

Delta Lake Schema Enforcement

By Matthew Powers

This post teaches you about schema enforcement in Delta Lake and why it's better than what's offered by data lakes

Thumbnail for Why PySpark append and overwrite write operations are safer in Delta Lake than Parquet tables

Why PySpark append and overwrite write operations are safer in Delta Lake than Parquet tables

By Matthew Powers

This post shows you why PySpark overwrite operations are safer with Delta Lake and how the different save mode operations are implemented under the hood.

Thumbnail for How to Create Delta Lake Tables

How to Create Delta Lake Tables

By Matthew Powers

This post shows you how to create Delta Lake tables with Python, SQL, and PySpark.

Thumbnail for How to Version Your Data with pandas and Delta Lake

How to Version Your Data with pandas and Delta Lake

By Matthew Powers

This post shows you how to version your pandas datasets and the benefits you'll enjoy with versioned data.

Thumbnail for Sharing a Delta Table’s Change Data Feed with Delta Sharing 0.5.0

Sharing a Delta Table’s Change Data Feed with Delta Sharing 0.5.0

By Will Girten

We are excited to announce the release of Delta Sharing 0.5.0.

Thumbnail for How to Rollback a Delta Lake Table to a Previous Version with Restore

How to Rollback a Delta Lake Table to a Previous Version with Restore

By Matthew Powers

This post shows you how to rollback Delta Lake tables to previous versions with restore.

Thumbnail for Converting from Parquet to Delta Lake

Converting from Parquet to Delta Lake

By Matthew Powers

This post shows how to convert a Parquet table to a Delta Lake.

Thumbnail for Why we migrated to a Data Lakehouse on Delta Lake for T-Mobile Data Science and Analytics Team

Why we migrated to a Data Lakehouse on Delta Lake for T-Mobile Data Science and Analytics Team

By Robert Thompson , Geoff Freeman

In this post, we will discuss the how and why we migrated from databases and data lakes to a data lakehouse on Delta Lake. Our lakehouse architecture allows reading and writing of data without blocking and scales out linearly. Business partners can easily adopt advanced analytics and derive new insights. These new insights promote innovation across disparate workstreams and solidify the decentralized approach to analytics taken by T-Mobile.

Thumbnail for How to drop columns from a Delta Lake table

How to drop columns from a Delta Lake table

By Matthew Powers

This post shows you two ways to drop columns from Delta Lake tables.

Thumbnail for Apache Flink Source Connector for Delta Lake tables

Apache Flink Source Connector for Delta Lake tables

By Krzysztof Chmielewski , Scott Sandre , Denny Lee

We are excited to announce the release of Delta Connectors 0.5.0, which introduces the new Flink/Delta Source Connector on Apache Flink™ 1.13 that can read directly from Delta tables using Flink’s DataStream API.

Thumbnail for Delta 2.0 - The Foundation of your Data Lakehouse is Open

Delta 2.0 - The Foundation of your Data Lakehouse is Open

By Tathagata Das , Denny Lee

We are happy to announce the release of the Delta Lake 2.0 on Apache Spark™ 3.2! The significance of Delta Lake 2.0 is not just a number - though it is timed quite nicely with Delta Lake’s 3rd birthday. It reiterates our collective commitment to the open-sourcing of Delta Lake, as announced by Michael Armbrust’s Day 1 keynote at Data + AI Summit 2022.

Thumbnail for Multi-cluster writes to Delta Lake Storage in S3

Multi-cluster writes to Delta Lake Storage in S3

By Scott Sandre , Denny Lee , Mariusz Kryński

While Delta Lake has supported concurrent reads from multiple clusters since its inception, there were limitations for multi-cluster writes specifically to Amazon S3. Note, this was not a limitation for Azure ADLSgen2 nor Google GCS, as S3 currently lacks “put-If-Absent” consistency guarantees. Thus, to guarantee ACID transactions on S3, one would need to have concurrent writes originating from the same Apache Spark™ driver. This was one of the most requested issues by the community and we are excited to announce that Delta Lake 1.2 (release notes, blog) now supports writing data from multiple clusters to S3 while maintaining the transactionality of the writes.

Thumbnail for Delta Lake 1.2 - More Speed, Efficiency and Extensibility Than Ever

Delta Lake 1.2 - More Speed, Efficiency and Extensibility Than Ever

By Venki Korukanti , Scott Sandre , Tathagata Das , Allison Portis , Denny Lee , Vini Jaiswal

Introducing performance optimizations that will supercharge your data pipelines at any scale.

Thumbnail for Writing to Delta Lake from Apache Flink

Writing to Delta Lake from Apache Flink

By Fabian Paul , Pawel Kubit , Scott Sandre , Tathagata Das , Denny Lee

Learn more about how you can write from Apache Flink to Delta Lake about the latest release of the open-source project Delta Sharing and how it enables sharing on Google Cloud Storage, among other enhancements.

Thumbnail for Extending Delta Sharing to Google Cloud Storage

Extending Delta Sharing to Google Cloud Storage

By Will Girten , Shixiong Zhu

Learn more about the latest release of the open-source project Delta Sharing and how it enables sharing on Google Cloud Storage, among other enhancements.

Thumbnail for Delta Connectors 0.3.0 Released

Delta Connectors 0.3.0 Released

By Allison Portis

We are excited to announce the release of Delta Connectors 0.3.0.

Thumbnail for Delta Lake 1.1.0 Released

Delta Lake 1.1.0 Released

By Scott Sandre

We are excited to announce the release of Delta Lake 1.1.0.

Thumbnail for Delta Sharing 0.3.0 Released

Delta Sharing 0.3.0 Released

By Lin Zhou

We are excited to announce the release of Delta Sharing 0.3.0.

Thumbnail for Power BI Delta Sharing Connector

Power BI Delta Sharing Connector

By Denny Lee

We are excited about the recently announced preview of the Power BI Delta Sharing connector

Thumbnail for Delta Lake User Survey (2021 H2)

Delta Lake User Survey (2021 H2)

By Denny Lee

We would like to invite you to provide your feedback on Delta Lake OSS.

Thumbnail for Delta Lake 1.0.0 Released

Delta Lake 1.0.0 Released

By Tathagata Das

We are excited to announce the release of Delta Lake 1.0.0 on Apache Spark 3.1.

Thumbnail for AMA: Growing the Delta Lake ecosystem

AMA: Growing the Delta Lake ecosystem

By Denny Lee

On March 11th, 2021 9:00 am PT, join us for this fun Delta Lake AMA session where we discuss with QP Hou, Christian Williams, and Alexander Kushnir from Scribd on growing the Delta Lake open-source ecosystem.

Thumbnail for Salesforce Engineering: Delta Lake Tech Talk Series

Salesforce Engineering: Delta Lake Tech Talk Series

By Denny Lee

We are happy to announce the Salesforce Engineering Delta Lake Tech Talk Series for March and April 2021.

Thumbnail for Delta Lake 0.8.0 Released

Delta Lake 0.8.0 Released

By Denny Lee

We are excited to announce the release of Delta Lake 0.8.0.

Thumbnail for Salesforce Engineering: Delta Lake Blog Series

Salesforce Engineering: Delta Lake Blog Series

By Denny Lee

Salesforce Engineering has published a series of blogs on how they use Delta Lake.

Thumbnail for Salesforce Engineering: Global Synchronousness and Ordering in Delta Lake

Salesforce Engineering: Global Synchronousness and Ordering in Delta Lake

By Denny Lee

At Salesforce, we maintain a platform to capture customer activity — various kinds of sales events such as emails, meetings, and videos. These events are either consumed by downstream products in real time or stored in our data lake, which we built using Delta Lake.

Thumbnail for Getting Started with Delta Lake

Getting Started with Delta Lake

By Denny Lee

Want to learn more about Delta Lake? Check out this series of Delta Lake videos.

Thumbnail for Delta Lake Sessions at Spark+AI Summit North America 2020

Delta Lake Sessions at Spark+AI Summit North America 2020

By Denny Lee

We're really excited for the numerous Delta Lake training and conference sessions that will be showcased throughout Spark+AI Summit NA 2020.

Thumbnail for Delta Lake 0.7.0 Released

Delta Lake 0.7.0 Released

By Denny Lee

We are excited to announce the release of Delta Lake 0.7.0 on Apache Spark 3.0. This is the first release on Spark 3.x and adds support for metastore-defined tables and SQL DDLs.

Thumbnail for Delta Lake 0.6.1 Released

Delta Lake 0.6.1 Released

By Denny Lee

We are excited to announce the release of Delta Lake 0.6.1, which fixes a few critical bugs in merge operation and operation metrics. If you are using version 0.6.0, it is strongly recommended that you upgrade to version 0.6.1.

Thumbnail for Delta Lake 0.6.0 Released

Delta Lake 0.6.0 Released

By Denny Lee

We are excited to announce the release of Delta Lake 0.6.0, which introduces schema evolution and performance improvements in merge, and operation metrics in table history.

Thumbnail for Delta Lake Newsletter: 2020-03-20 Edition

Delta Lake Newsletter: 2020-03-20 Edition

By Denny Lee

For this edition of the Delta Lake Newsletter, find out more about the latest and upcoming tech talks and videos.

Thumbnail for Diving into Delta Lake Online Tech Talk Series

Diving into Delta Lake Online Tech Talk Series

By Denny Lee

For our next series of Delta Lake online tech talks, we're excited to dive into the internals with our Diving into Delta Lake series. This will be a fun set of tech talks with live demos and Q&A. Check them out!

Thumbnail for Delta Lake Online Tech Talks

Delta Lake Online Tech Talks

By Denny Lee

We’re excited to announce the next series of Delta Lake online tech talks over the next few weeks. This will be a fun set of tech talks with live demos and Q&A. Check them out!

Thumbnail for Delta Lake 0.5.0 Released

Delta Lake 0.5.0 Released

By Denny Lee

We are excited to announce the release of Delta Lake 0.5.0, which introduces Presto/Athena support and improved concurrency.

Thumbnail for Delta Lake Newsletter: 2019-10-03 Edition (incl. SAIS EU 2019 Sessions)

Delta Lake Newsletter: 2019-10-03 Edition (incl. SAIS EU 2019 Sessions)

By Denny Lee

This edition of the Delta Lake Newsletter, find out more about the latest and upcoming webinars, meetups, and publications. For this edition, we will also focus on the many sessions at Spark+AI Summit EU 2019 in Amsterdam.

Thumbnail for Delta Lake 0.4.0 Released

Delta Lake 0.4.0 Released

By Denny Lee

We are excited to announce the release of Delta Lake 0.4.0 which introduces Python APIs for manipulating and managing data in Delta tables.

Thumbnail for Delta Lake 0.3.0 Released

Delta Lake 0.3.0 Released

By Denny Lee

We are happy to announce the availability of Delta Lake 0.3.0! Features include: Scala Java APIs for DML commands, Scala/Java APIs for query commit history, and Scala/Java APIs for vacuuming old files.

Thumbnail for Delta Lake 0.2.0 Released

Delta Lake 0.2.0 Released

By Denny Lee

We are happy to announce the availability of Delta Lake 0.2.0! It brings support for cloud storage (e.g. Amazon S3 and Azure Blob Storage) and improved concurrency.

Thumbnail for Delta Lake 0.1.0 Released

Delta Lake 0.1.0 Released

By Denny Lee

We are happy to announce the availability of Delta Lake 0.1.0! Initial version of the open source Delta Lake.