Last Week in a Byte on Delta Lake | 2023-03-21
Delta Lake
Delta Lake is an open-source storage framework that enables building a Lakehouse architecture.
You can watch or read the latest?#DeltaLake?news a week late (2023-03-21 edition)!
Recent releases and contributions
delta_table = DeltaTable.forPath(spark, path
mack.latest_version(delta_table)
>> 2
# import Jodie library
ChangeDataFeedHelper(deltaTablePath,0,25)
.dryRun()
.readCDF()
Upcoming events
We are happy to partner with Blueprint on their Velocity Tour to bring you demos, meet and greets, speaking sessions, and more! They will be at Data Council Austin 2023, PyCon US 2023 in Salt Lake City, and PyData Seattle 2023 in Seattle for March and April. Check out the Velocity Tour for all of their dates!
Latest community blogs
Robert Kossendey published the fourth blog in his insightful series on his journey to the #lakehouse with the post Lakehouse - A resumé.
Overall, we are more than satisfied with the outcome of our Lakehouse migration. We reduced our overall costs by?80%?while improving our developer experience drastically. We don’t have to maintain a Redshift cluster anymore. Instead, we store all the data in a?single place, S3. Further, the core of our infrastructure is powered by?open source, namely Apache Spark and Delta Lake. That empowers us to move away from Databricks if we are ever unhappy with the service.
For more information, check out the vidcast D3L2: The Journey Unifying Data Lake and Data Warehouse with Robert Kossendey at Claimsforce. cc claimsforce