Data Council 2022: Building Lakehouse with Delta Lake
Vini Jaiswal
Top Voice in AI + Data | Open Source, Fortune 500 & Unicorns Advisor | Databricks, Citi Alumni | Speaker
I am building machine learning models, but my data is siloed”. “I need to ensure that the models I am building are based on reliable data, so my company can make quality decisions.” “I need to ensure that I am serving the right data to the right audience.” “I need to ensure governance so I can be prepared for the audit and GDPR.” “I also need to ensure that I build efficient performant pipelines as the data volume grows.” ---- says Data Engineers, Data Scientists, Data Architects, ML Practitioners, and so on.
?Does any of this sound like the architecture considerations when you go through building data architectures? This is what I can resonate with, too. So I want to take you through a journey of how we can solve these data engineering problems. Come learn at this workshop at Data Council on why Delta Lake checks the boxes on solutions to all these problems and why lakehouse architectures have become the modern architectures for companies building analytics and AI applications.
There is no shortage of challenges associated with building data pipelines, and this workshop walks through how to tackle them and make data pipelines robust and reliable. This allows downstream users to both realize the significant value and rely on their data to make critical data-driven decisions.?
Given the location of the event and considering that the housing market has been trending in Austin, it's perfect to use lending club data for our workshop to see how we qualify for a loan. For the workshop, we will use the Databricks Community edition so we can have the data and storage easily accessible for hands-on lab.
We will go through the following cool features about Delta Lake:
领英推荐
Conclusion
Delta Lake is used by 5000+ organizations in production to power their Lakehouse reliably. This workshop is curated so that you can leave feeling good about getting started with Delta Lake, learn about its benefits, we will also have a Q&A at the end to provide you the opportunity to ask us questions.
Very excited to see you at the Data Community at Data Council on Mach 23rd at 11 AM CST. Here's the link to the event: https://www.datacouncil.ai/austin.
If you are not attending the Data Council conference, I will provide the notebooks and useful links afterward. Also if you would like to stay tuned with the innovations in delta lake or want to contribute to the project, please reach the community on slack, google group, linkedin, github or you can find us on youtube doing an AMA or community event. Thank you!
?? Building bridges @naas.ai Universal Data & AI Platform | Research Associate in Applied Ontology | Senior Advisor Data & AI Services
3 年cannot come :) but is there something online?