DATA Pill #041 - Streamlining Data Science Workflows, Machine Learning Models in LoL, and more…
Hi
Another week, another DATA Pill.
In this one, we’ll focus on:
Navigating the Data mesh, Machine Learning Models in the gaming world,?
and a little on A/B testing.
Enjoy the read!
ARTICLES
Navigating the Data Mesh: Organizational Challenges and Opportunities | 10 min | Pierre-Alain Genilloud | Data Engineering | ELCA IT?
Most of you without doubt have heard of the Data Mesh. Let’s take a deeper look at some implications in terms of organization and agility, challenges and opportunities. Also, let’s discuss the opportunities and open questions brought by the Data Mesh:
Opportunities:
Open questions:
dbt run real-time analytics on Apache Flink. Announcing the dbt-flink-adapter! | 23 min? | Data Analytics | Grzegorz Liter, Krzysztof Zarzycki, Micha? Soszko | GetInData | Part of Xebia Blog
We would like to announce the dbt-flink-adapter, that allows running pipelines defined in SQL in a dbt project on Apache Flink! Check out the newest blog post and find out:
Also, we deal with the myth that real-time analytics is not worth the cost.?
Streamlining Data Science Workflows with a Feature Catalog | 5 min | Roel Bertens | Data Engineering | GoDataDriven | Now Xebia Blog
Dealing with confusion and duplicative work in your data science team can be exhausting. In this post, Roel explores ways to overcome these challenges and improve collaboration, consistency and speed within your data science team. Read about the Feature Catalog that can help data science teams work together better.
In MORE LINKS you will find content about Machine Learning Models into League of Legends and layering.?
TUTORIAL
Snowflake Data Mesh: Step-by-Step Setup Guide, with Detailed Notes on Scaling and Maintenance | 25 min | Data Mesh | Atlan Blog
Data Mesh can be hard to implement. It requires an org-wide mindset shift toward decentralization and product thinking. Team Atlan attempted to demonstrate a reference Data Mesh implementation in a growth-stage organization with a complex business domain.
领英推荐
NEWS
Uber Ditches On-Prem and Hooks Future to GCP and Oracle Cloud | 4 min | Cloud | Lisa D Sparks | Data Center Knowledge
Uber joins the cloud! It was a long resisted move by one of the largest Hadoop users. And now they are also converting & over the 7 next years they will migrate all of that over to GCP or Oracle. Data & Data workloads will probably go to GCP. There is a lot of news about it, but this piece seems to put forward an interesting view.
In MORE LINKS you will find better Airflow with Metaflow.
VIDEO
Make Your A:B Testing More Effective and Efficient | 50 min | Analytics | Anjali Mehra | DataCamp
One of the toughest parts of any data project is experimentation, not just because you need to choose the right testing method to confirm the project’s effectiveness, but also because you need to make sure you are testing the right hypothesis and measuring the right KPIs to ensure you receive accurate results.?
One of the most effective methods for data experimentation is A/B testing, and Anjali Mehra is no stranger to how A/B testing can impact multiple parts of any organization.
Since we are talking about analytics, there is an interesting job offer available in that area.
PODCAST
Implementing Patterns And Practices For Infrastructure as Code | 56 min | Hosts: Ned Bellavance, Ethan Banks Guest: Rosemary Wang | Cloud | Day Two Cloud Podcast
A one hour talk with the Developer Advocate at HashiCorp and author of Infrastructure as Code, Patterns and Practices. Listen to more about Infrastructure as Code (IaC)including about the patterns and practices you might want to put in place. So you might want to apply some software development practices to it, particularly for the parts of your team who know what they’re doing with infrastructure but may not be familiar with things like repositories, re-usability, unit tests and so on.
Since we are talking about analytics, there is an interesting job offer available in that area.
?
CONFS EVENTS AND MEETUPS
Upgrade your Scaleup from using Spreadsheets to Data Platform | 14th March 2023 | Online
Do you want to know how to increase your data capabilities and become a data-driven company? Join the first webinar in series ‘Building a Data-Driven Company’ and learn what an implemented Modern Data Platform can look like and how it can assist you during your journey into modern analytics.
Webinar online 2023 - Big Data Technology Warsaw Summit | 9th March 2023 | Online
On March 9th you will have the opportunity to listen to presentations given by Mariusz Strzelecki from GetInData | Part of Xebia and Juan Cano from QuantumBlack:
________________________
Have any interesting content to share in the DATA Pill newsletter?
? Join us on GitHub
? Dig previous editions of DataPill?
Adam from the GetInData | Part of Xebia
Editor | Writer
2 年Nice analysis, Adam Kawa. Thanks for sharing my work. I have similar coverage here: https://lisadsparks.substack.com/