Overview of Apache Flink: the 4G of Big Data Analytics Frameworks
Slim Baltagi
I help with Snowflake AI Data Cloud Cost & Performance Optimization, and Administration
My proposal on Apache Flink to the Hadoop Summit Europe 2016 is a Community Choice Winner of 'The Future of Hadoop' track!! Thanks to all of you who voted for it and made this happen.
This is an introductory level talk about Apache Flink: a multi-purpose Big Data analytics framework leading a movement towards the unification of batch and stream processing or stream processing-first in the open source. With the many technical innovations it brings along with its unique vision and philosophy, Apache Flink is considered the 4 G (4th Generation) of Big Data analytics frameworks providing the only hybrid (Real-Time Streaming + Batch) open source distributed data processing engine supporting many use cases.
In this talk, you will learn more about:
1. What is Apache Flink stack? Its streaming dataflow execution engine, APIs and domain-specific libraries for batch, streaming, machine learning and graph processing.
2. How Apache Flink integrates with Hadoop and other open source tools for data input and output as well as deployment?
3. Why Apache Flink is an alternative to Apache Hadoop MapReduce, Apache Storm and Apache Spark?
4. How Apache Flink is used at Capital One?
Thanks
Slim Baltagi
Senior Engineer
9 年Voted
Senior Java Developer
9 年Voted.
Entrepreneur | Space Data Enthusiast | Sustainable Marine Ecosystem | Blue Economy | Forbes-W-Power 2022
9 年Voted . Will you be uploading the video online after the Summit for offline viewers ?
Executive Director / Associate Partner(Enterprise solution and Technical Delivery)
9 年Voted.