DATA Pill #007 - learn DATA Mesh, take part in the Kaggle competition and be like Bond in the DATA world
Hi everyone!
Last week was a juicy week with lots of valuable content to share with you.
Selecting the ‘creme de la creme’ was a struggle for me.
So please, grab a coffee and enjoy DATA Pill 007
ARTICLES?
The State of Data Engineering 2022 | 10 min read | Data Engineering | Einat Orr | lakeFS
A map of categorized tools and technologies with comments and an explanation.
What drives your customer’s decisions? Find answers with Machine Learning Models! H&M’s Kaggle competition | 11 min read | ML | ?? Adrian Dembek | GetInData?
GetInData recently took part in the Kaggle H&M Personalized Fashion Recommendations competition where they were challenged to build a recommendation engine that would predict which articles a customer would buy in a particular week.
In this blog post Adrian presents:
The CDP as we know it is dead: Introducing the Unbundled CDP | 7 min read | Data Warehouse | Tejas Manohar | Hightouch
"We’re predicting that most companies’ CDPs will be rebuilt on top of the data warehouse and look like this:"
Our journey towards an open data platform | 8 min read | Data Platform Engineering | Doran Parat
The journey of shaping Yotpo’s data platform architecture:
"Navigating the flooded data technologies market can be confusing at times. We find ourselves mixing managed, open-source and self-development solutions to build a balanced stack. So many decisions to make along the way — all made under one clear principle — keeping our data platform as open as possible."
NEWS?
Terraform Cloud Adds Drift Detection for Infrastructure Management | 5 min read | HasiCorp Blog?
Drift Detection provides continuous checks against an infrastructure state to detect and notify when there are changes.
Introducing the dbt Certification Program | 2 min read | dbt
Introducing the new dbt Certification Program and the first dbt Analytics Engineering Certification exam.
领英推荐
DISCUSS?
?Streaming Pseudonymization by tokenization | 10 min read | Streaming, Architecture | Robert Sahlin?
Robert shared his presentation on LinkedIn from the Heroes of Data meetup.
?
DATAtube?
Data Analytics Democratization: How ING Data Analytics Platform Bootstraps New Data Driven Products | 49 min | Krzysztof Adamski from ING | The Linux Foundation
?Three years ago, ING (banking industry) took on the challenge of gathering a curated portfolio of internal data sources together with a large scale compute platform.
The idea core:
In this presentation you will discoverthe results, what the key elements of the strategy are and what is still ahead of the ING Data Analytics Platform.
?
?PODCAST
?Data Journey with Max Schultze (Zalando) - Data Mesh | 1 h | ?? Radio DaTa by GetInData?
I talk with Max about how Zalando use data and analytics and how their data platform has evolved over the last few years
Data Mesh - what it is, what it is NOT, how it helps Zalando to become an even more data-driven company, if, when & how to introduce it to your organization.
?CONFS AND MEETUPS
DATA + AI SUMMIT | 27-30 June | San Francisco Hybrid
??
We need now a little break.?
Thank you once again for this first week with DATA Pill.
There are almost a thousand of us in the DATA Pill community now (here and on datapill.tech). It makes me very happy and I start to thinking that maybe we should have some space to communicate on a regular basis.?
Github? Slack? Any ideas??
Adam Kawa from GetInData