DATA Pill #007 - learn DATA Mesh, take part in the Kaggle competition and be like Bond in the DATA world

DATA Pill #007 - learn DATA Mesh, take part in the Kaggle competition and be like Bond in the DATA world

Hi everyone!

Last week was a juicy week with lots of valuable content to share with you.

Selecting the ‘creme de la creme’ was a struggle for me.

So please, grab a coffee and enjoy DATA Pill 007


ARTICLES?

The State of Data Engineering 2022 | 10 min read | Data Engineering | Einat Orr | lakeFS

A map of categorized tools and technologies with comments and an explanation.


What drives your customer’s decisions? Find answers with Machine Learning Models! H&M’s Kaggle competition | 11 min read | ML | ?? Adrian Dembek | GetInData?

GetInData recently took part in the Kaggle H&M Personalized Fashion Recommendations competition where they were challenged to build a recommendation engine that would predict which articles a customer would buy in a particular week.

In this blog post Adrian presents:

  • a humans' decision making process as if it was an algorithm which processes multidimensional input information and generates outputs in the form of decisions.
  • how to represent the decision-making process with numbers
  • how to match machine learning algorithms to real-life concepts.


The CDP as we know it is dead: Introducing the Unbundled CDP | 7 min read | Data Warehouse | Tejas Manohar | Hightouch

"We’re predicting that most companies’ CDPs will be rebuilt on top of the data warehouse and look like this:"


Our journey towards an open data platform | 8 min read | Data Platform Engineering | Doran Parat

The journey of shaping Yotpo’s data platform architecture:

"Navigating the flooded data technologies market can be confusing at times. We find ourselves mixing managed, open-source and self-development solutions to build a balanced stack. So many decisions to make along the way — all made under one clear principle — keeping our data platform as open as possible."

{ MORE LINKS }


NEWS?

Terraform Cloud Adds Drift Detection for Infrastructure Management | 5 min read | HasiCorp Blog?

Drift Detection provides continuous checks against an infrastructure state to detect and notify when there are changes.


Introducing the dbt Certification Program | 2 min read | dbt

Introducing the new dbt Certification Program and the first dbt Analytics Engineering Certification exam.

{ MORE LINKS }


DISCUSS?

?Streaming Pseudonymization by tokenization | 10 min read | Streaming, Architecture | Robert Sahlin?

Robert shared his presentation on LinkedIn from the Heroes of Data meetup.

?

DATAtube?

Data Analytics Democratization: How ING Data Analytics Platform Bootstraps New Data Driven Products | 49 min | Krzysztof Adamski from ING | The Linux Foundation

?Three years ago, ING (banking industry) took on the challenge of gathering a curated portfolio of internal data sources together with a large scale compute platform.

The idea core:

  • allowing internal projects to get access to a rich toolset of open source and industry standard frameworks
  • preprocessed data to validate business ideas in a secure exploration environment.

In this presentation you will discoverthe results, what the key elements of the strategy are and what is still ahead of the ING Data Analytics Platform.

?

?PODCAST

?Data Journey with Max Schultze (Zalando) - Data Mesh | 1 h | ?? Radio DaTa by GetInData?

I talk with Max about how Zalando use data and analytics and how their data platform has evolved over the last few years

Data Mesh - what it is, what it is NOT, how it helps Zalando to become an even more data-driven company, if, when & how to introduce it to your organization.

{ MORE LINKS }


?CONFS AND MEETUPS

DATA + AI SUMMIT | 27-30 June | San Francisco Hybrid

??

We need now a little break.?

Thank you once again for this first week with DATA Pill.

There are almost a thousand of us in the DATA Pill community now (here and on datapill.tech). It makes me very happy and I start to thinking that maybe we should have some space to communicate on a regular basis.?

Github? Slack? Any ideas??

Adam Kawa from GetInData


要查看或添加评论,请登录

Adam Kawa的更多文章

社区洞察

其他会员也浏览了