
Apache Flink: What, How, Why, Who, Where?

Slim Baltagi

I help with Snowflake AI Data Cloud Cost & Performance Optimization, and Administration

Published: February 13, 2016

On February 2nd, 2016, I gave a talk titled "Apache Flink: What, How, Why, Who, Where?" at the New York City (NYC) Apache Flink Meetup, which I founded on December 23, 2015. The event, which took place at the NYC Civic Hall, was sponsored by Bloomberg and Capital One.

Here are the video recording and the slides of my talk.

This introductory-level talk was about Apache Flink: a multi-purpose Big Data analytics framework leading a movement towards the unification of batch and stream processing in open source.

With the many technical innovations it brings, along with its unique vision and philosophy, it is considered the 4G (4th generation) of Big Data analytics frameworks. It provides the only hybrid (real-time streaming + batch) open source distributed data processing engine, supporting many use cases: batch, streaming, relational queries, machine learning, and graph processing.

In this talk, you will learn about:

1. What the Apache Flink stack is and how it fits into the Big Data ecosystem.

2. How Apache Flink integrates with Hadoop and other open source tools for data input and output, as well as for deployment.

3. Why Apache Flink is an alternative to Apache Hadoop MapReduce, Apache Storm, and Apache Spark.

4. Who is using Apache Flink.

5. Where to learn more about Apache Flink.

Your comments, here or on the websites hosting the video recording and the slides of my talk, are much appreciated.

Raja K Thaw

Big Data & Cloud Technical Architect (now more of an advisory role)

9 years ago

Spark's micro-batching is an issue for ultra-low-latency real-time data requirements, but it is accepted because of a strong marketing lobby. As a result, it gains momentum and has more contributors and adoption... Storm is slowly subsiding as Spark ignites. Now we need to see Flink. Proprietary products like StreamBase, Apama, and IBM InfoSphere Streams have seen the drawbacks of open source. They have strong integration IDEs coupled with rich libraries to connect to assorted sources, databases, market data, streaming analytics, statistics applications, live data, operational BI... Some have specific ultra-low-latency application servers, such as in-memory though not in-chip (which may come eventually). The rate of change in open source releases seems scary and difficult to keep up with. Deprecated features need to be amended most of the time :)

Sreeram Madhu Chintalapudi

Chief Data Officer - FSS DTT@ IBM | Data and Advanced Analytics | MIT Data Science

9 years ago

Great Post Slim..

Kumar Chinnakali

Reimagining contact center as a hands-on architect bridging users, clients, developers, and business executives in their context.

9 years ago

The best information


