Event Report (#131): Data Engineering Meetup - dbt & Kubernetes API
screenshot taken from: https://www.getdbt.com/product/what-is-dbt

When: Thursday, 13th March 2025, 6:30pm to 9:30pm

Where: Diconium Office, Skalitzer Strasse 126, Berlin, Germany

Hosting Organization: applydata Berlin

Participation Fee: Free Entrance

Agenda: Socializing, Host Intro, Talk 1, Talk 2, Socializing & Food

Topics covered:

  • Host Intro: Diconium, applydata Berlin's Data Engineering Meetup Series & Ongoing Search for Speakers
  • Talk 1: Employing dbt to Scale Customer Behaviour Analytics
  • Talk 2: Saving Computing Costs by Providing a Kubernetes API for Data Processing Jobs

I've learned something today:

  • dbt (data build tool) is ideal for data analysts building reliable, testable data models in an ELT pipeline: it lets them work directly in SQL without complex programming or orchestration. Its simple structure, SQL queries with declared dependencies, keeps transformations intuitive (a minimal model sketch appears at the end of this post). Apache Airflow, in contrast, is better suited for data engineers managing complex ETL workflows with tools like Spark; its Python-based orchestration can introduce more complexity than analysts should have to manage.
  • Macros in dbt are like reusable SQL functions, written in the Jinja templating language. They let developers automate repetitive logic and standardize transformations across multiple models. By following the DRY (Don't Repeat Yourself) principle, macros reduce redundancy, improve maintainability, and keep code consistent (a macro sketch appears at the end of this post).
  • Azure VNet injection deploys services like Databricks into a private network inside a company's Azure tenant, isolating them from the public internet for stronger security and compliance. However, the complex subnet and private-endpoint configurations it requires can lead to over-provisioning, unnecessary resource consumption, higher costs, and additional latency.
  • Hard limits on public IP addresses in cloud environments such as Azure restrict how many new clusters or services can be created, which blocks the scaling of big data workloads: once the limit is reached, additional compute resources cannot be provisioned, leading to deployment failures and processing bottlenecks.
  • Karpenter improves Azure Kubernetes Service (AKS) scaling by provisioning nodes dynamically instead of relying on predefined node pools, which restrict flexibility to fixed VM types. By selecting the most cost-effective available VMs in real time, Karpenter enables faster, more efficient scaling (a NodePool sketch appears at the end of this post).
  • The Diconium Office provided a welcoming and well-equipped event venue:

picture taken at venue
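
To make the dbt point concrete, here is a minimal sketch of two dbt models; the source, model, and column names (tracking, raw_events, stg_events, daily_events_per_user) are hypothetical. dbt builds its dependency graph from the {{ source() }} and {{ ref() }} calls, so analysts write nothing but SQL:

```sql
-- models/staging/stg_events.sql
-- Hypothetical staging model: cleans the raw event stream.
select
    event_id,
    user_id,
    lower(event_type) as event_type,
    event_timestamp
from {{ source('tracking', 'raw_events') }}
where event_id is not null
```

```sql
-- models/marts/daily_events_per_user.sql
-- Hypothetical downstream model: the ref() call tells dbt to build
-- stg_events first, with no orchestration code required.
select
    user_id,
    date_trunc('day', event_timestamp) as event_date,
    count(*) as event_count
from {{ ref('stg_events') }}
group by 1, 2
```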
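
The macro point, sketched with the same caveat that the names are made up; the {% macro %} / {% endmacro %} Jinja syntax is standard dbt:

```sql
-- macros/cents_to_euros.sql
-- Hypothetical macro: the cents-to-euros conversion lives in one
-- place instead of being repeated in every model (DRY).
{% macro cents_to_euros(column_name, precision=2) %}
    round({{ column_name }} / 100.0, {{ precision }})
{% endmacro %}
```

Any model can then call {{ cents_to_euros('amount_cents') }} in its select list, and dbt expands the macro at compile time.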
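
Finally, a sketch of what the Karpenter approach looks like as a NodePool manifest. The overall shape follows Karpenter's v1 API, but the Azure-specific label and node-class names are assumptions based on the AKS provider and may differ by release:

```yaml
# Hypothetical NodePool for Karpenter on AKS: rather than pinning a
# fixed VM size, it declares constraints and lets Karpenter pick the
# cheapest available SKU that fits the pending pods.
apiVersion: karpenter.sh/v1
kind: NodePool
metadata:
  name: general-purpose
spec:
  template:
    spec:
      requirements:
        # Allow spot capacity so cheaper VMs can be chosen.
        - key: karpenter.sh/capacity-type
          operator: In
          values: ["spot", "on-demand"]
        # Azure provider label (assumed): restrict to D/E series SKUs.
        - key: karpenter.azure.com/sku-family
          operator: In
          values: ["D", "E"]
      nodeClassRef:
        group: karpenter.azure.com   # Azure node class (assumed)
        kind: AKSNodeClass
        name: default
  limits:
    cpu: "64"                        # cap total provisioned CPU
  disruption:
    consolidationPolicy: WhenEmptyOrUnderutilized
```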

