Mustafa Akur and Andrew Lamb published an interesting post on how to get better execution plans in #datafusion using orderings. Please check it out on the #datafusion blog at the link below: https://lnkd.in/gwh7aDXg
Apache DataFusion
Software Development
Apache DataFusion is a fast, feature rich and extensible query engine built on the Apache Arrow memory model.
About us
Apache DataFusion is a fast, feature rich and extensible query engine built on the Apache Arrow memory model. “Out of the box,” DataFusion offers SQL and Dataframe APIs, excellent performance, built-in support for CSV, Parquet, JSON, and Avro, extensive customization, and a great community. Python Bindings are also available. DataFusion features a full query planner, a columnar, streaming, multi-threaded, vectorized execution engine, and partitioned data sources. You can customize DataFusion at almost all points including additional data sources, query languages, functions, custom operators and more. See the Architecture section for more details.
- Website: https://datafusion.apache.org
- Industry: Software Development
- Company size: 51-200 employees
- Type: Nonprofit
- Founded: 2020
Updates
-
There is an awesome opportunity coming up! ParadeDB #paradedb is hiring OLAP database engineers to build faceted search/columnar analytics in Postgres, in #rust. They just raised a $12M Series A and work with customers like Alibaba, Modern Treasury, BILT, and others. If you're curious to learn more, Philippe Noël spoke at CMU on their architecture: https://lnkd.in/gbXpPTzS The job postings: https://lnkd.in/g45WV4yu
ParadeDB – Postgres for Search and Analytics (Philippe Noël)
https://www.youtube.com/
-
Thanks #synnada for making this #gsoc happen with #apache #datafusion
Exciting news! Apache DataFusion has been selected as a mentoring organization for Google Summer of Code (GSoC) 2025! With 11 project ideas and 14 possible mentors, we’re excited to welcome new contributors into open-source software development through this global program. Synnada is proud to contribute by mentoring 5 projects, supporting the next wave of open-source developers. https://lnkd.in/gHre-c3 #GSoC2025 #ApacheDataFusion #OpenSource #DataEngineering
-
#datafusion #rust #queryengine #sql #bigdata Apache DataFusion 45.0.0 Released! We are very proud to announce DataFusion 45.0.0. This blog highlights some of the many major improvements since we released DataFusion 40.0.0 and a preview of what the community is thinking about in the next 6 months https://lnkd.in/gnsir6X4
-
InfluxData Staff Engineer Andrew Lamb has an ambitious goal: 1,000 projects powered by Apache DataFusion. And he thinks 2025 could be the year we hit that mark. Here’s why he’s betting big. https://bit.ly/42eSEzg #InfluxDB
-
Apache DataFusion is now on LinkedIn! Introducing Apache DataFusion: The Fast, Extensible Query Engine. We're excited to introduce Apache DataFusion, an open-source, high-performance query engine designed for modern analytics workloads. Built with Rust, DataFusion enables fast, efficient, and scalable data processing with SQL and DataFrame APIs. Key Benefits of DataFusion: High Performance – Leverages Rust’s safety and speed for optimal query execution. Extensibility – Customizable execution plans, user-defined functions, and integration with various storage backends. Open-Source & Community-Driven – Backed by a growing community of developers and data enthusiasts. If you're passionate about databases, query engines, data processing, analytics, and open-source innovation, follow our page for updates, technical insights, and community highlights! Follow us and be part of data processing with Apache DataFusion! #ApacheDataFusion #BigData #RustLang #SQL #DataProcessing #OpenSource #Analytics #DataEngineering
-
#apache #datafusion #comet #spark Apache Spark native accelerator: Apache DataFusion Comet 0.6.0 released
Apache DataFusion Comet 0.6.0 has been released. Comet is an accelerator for Apache Spark that provides 2x or better speedups for many workloads without requiring any code changes or specialized hardware. This is a smaller release than usual now that we have moved to an approximately monthly release cadence to match core DataFusion. Read more in the announcement blog post: https://lnkd.in/gFz5UQzW