Micropipelines: A microservice approach for DAG authoring in Apache Airflow

We have wanted to add Data awareness to Airflow for a long time and it's finally here!

With Airflow 2.4, you can now break down big, monolithic pipelines—in which long-running tasks can delay the completion of time-critical ones—into smaller, loosely coupled “micropipelines” that can be orchestrated together. Conceptually, these are microservices for data, a transformative development made possible by the new release’s introduction of Data Driven Scheduling and a new Airflow abstraction called Datasets.?

I am excited by the capabilities that micropipelines enable such as:

  • Helping teams tune their entire data ecosystem to balance time criticality vs. cost using
  • Enable collaboration between data team members and make it easier for them to use the right language—e.g., Python or SQL—for the job.

I am also thrilled by the easier authoring process for data pipelines aka DAGs enabled by the AstroSDK. This open source DAG authoring framework building on Airflow is amazing!

Kudos to all the people who have worked on this across both Airflow and AstroSDK: Andrey Anshin, Ankit Chaurasia , Ash Berlin-Taylor , Bartlomiej Hirsz, Brent Bovenzi , Daniel Imberman , Daniel Standish , Drew Hubl, Elad Kalif , Ephraim Ewele Anierobi , Ernest Rohit katta , Jarek Potiuk , Jedidiah Cunningham , Josh Fell , Kaxil Naik , Mark Norman Francis, Matt Rixman , Mike Shwe , Niko, Pankaj Koti , Pankaj Singh , Phani Kumar V , Rahul Vats , Tatiana Al-Chueyr Martins , Tzu-ping Chung , Vincent, Woojciech Januszek, Chethanuk-plutoflume, PierreJeambrun and everyone else (all 152) who committed! Thank you all for making Airflow the most popular Apache project (measured by contributors) ever!

#apacheairflow #opensource #dataengineering #airflow

Iuliia Dankovych

Senior Business Development Manager / Account Manager | Women in Tech

1 年

Dear Vikram Koka pls check your private messages ??

回复

Congratulations to you and the team

Love to see it. 2.4 is big!

回复
Matt M.

Revenue Operations Leader | Revenue Engines Optimized by AI

2 年

Vikram Koka you gave an excellent presentation on this several weeks ago in New York, I enjoyed it and learned a lot Data awareness is life in 2022

Marc Lamberti

Head of Customer Education @Astronomer | Best Selling Instructor @Udemy

2 年

Wonderful article! A must read

要查看或添加评论,请登录

社区洞察

其他会员也浏览了