登录查看更多内容

Hadoop’s Legacy: No more fear of data

Tony Baer

Principal at dbInsight LLC

发布日期: 2021年4月27日

A couple weeks back, the Apache community pulled the plug on a slew of dormant Hadoop projects. Comrade-in-arms Andrew Brust gave the rundown in his ZDnet post. The tagline of Hadoop being dead is hardly new; it's dated back at least 5 years when Gartner announced a drastic slowdown in new Hadoop projects.

There's no question that activity for implementing Hadoop in the raw has practically ceased. That doesn't mean that the underlying technology is dead: Surviving projects continue to live on in the Cloudera Data Platform, which unlike previous Hadoop generations, is a coherent platform packaged through its own binaries rather than a string of disparate zoo animal open source projects.

The more important point, however, is that Hadoop opened the world of what we used to call "Big Data." Before Hadoop, and the associated discoveries with file systems that could live on commodity hardware and almost perfectly linear scale-out processing that Google published as research papers, the notion of going outside the data warehouse to bring non-relational data -- and lots of it (in the terabyte and petabyte) range was practically unthinkable. Hadoop introduced us to the art of the possible, and given that today, there are a variety of paths for analyzing and building machine learning models on all that data, and that relational data is no longer the limitation, are all thanks to Hadoop.

For our take on how this happened and what it made possible, check out our detailed post on ZDnet. And stay tuned, from all our research over the past decade, we'll be putting together a timeline on exactly how this all unfolded.

Daniel Kirsch

Enabling businesses to make better decisions with #AI & #CausalAI | Head of Community, Geminos Software

3 年

AI and Machine Learning needs context and connections to all data. In our hybrid/multi cloud world Hadoop as a data source is going to live on for decades. Like software, databases never die! #data #ai #machinelearning #hadoop #multicloud #analytics

Eric Newcomer

3 年

Tony - great article, for me especially the foundation in the first principles of cloud native data and its origins. This helped create a good perspective for the future, placing Hadoop in context of its origins and what is happening now and into the future.

2 次回应

Ashwin V.

3 年

Wow Apache Sentry and Chukwa were from the early part of the last decade! Darwin has spoken.

1 次回应

Steve Friedberg

Delivering the "moments of WOW!" through corporate magic

3 年

Tape is dead, too. At least, that's what I've been hearing for the past 20 years.

2 次回应

Saksham Agrawal

Fintech Data Products

3 年

Very true!

1 次回应

查看更多评论

要查看或添加评论，请登录

Tony Baer的更多文章

SAP’s key to generation change is its best-kept secret

2022年5月12日

SAP’s key to generation change is its best-kept secret

SAP is no stranger to generation change. Turning 50 years old this year, it wasn’t until SAP reached middle age that it…
Data remains the lifeblood of Innovation

2022年3月30日

Data remains the lifeblood of Innovation

We're reprinting in full our March 29, 2022 farewell post on ZDnet as it provided a good opportunity to reassess how…

1 条评论
Data is On the Move

2022年3月29日

Data is On the Move

It's a big day. Am jazzed to announce that Big on Data, the series co-authored by Andrew J.
What's happening in data 2022?

2022年1月10日

What's happening in data 2022?

It's always 5:00 somewhere, and on the last Friday of each month at 5:00 Eastern, our group of data analysts gathers on…
Data Outlook 2022: Will the cloud finally get easier?

2022年1月4日

Data Outlook 2022: Will the cloud finally get easier?

And will streaming get off its island? Will data mesh see the light of day? In some ways it seems like Groundhog Day…

3 条评论
Serverless at re:Invent: Where should Amazon Redshift go?

2021年12月6日

Serverless at re:Invent: Where should Amazon Redshift go?

With tens of thousands of customers, Amazon Redshift was more known as a market rather than a technology leader. The…
Will AWS re:Invent break our digital exhaustion?

2021年11月29日

Will AWS re:Invent break our digital exhaustion?

The past 18 months have been a slog, to put it quite mildly. For those of us that could work virtually, we’ve gotten…

2 条评论
Data Mesh: Should you try this at home?

2021年11月16日

Data Mesh: Should you try this at home?

To centralize or distribute data management? That's been a perennial question in the data world ever since…
Informatica IPO: Better the second time around?

2021年10月27日

Informatica IPO: Better the second time around?

As they say, the definition of insanity is doing the same thing again but hoping that this time will be different. For…

2 条评论
What's the next act for Databricks cofounder Ion Stoica?

2021年6月22日

What's the next act for Databricks cofounder Ion Stoica?

A decade ago, Ion Stoica and his colleagues at UC Berkeley's school of computing identified the roadblock to performing…

1 条评论

See all articles

Hadoop’s Legacy: No more fear of data

Tony Baer

Principal at dbInsight LLC

Tony Baer的更多文章

社区洞察

其他会员也浏览了

The History of Hadoop and Big Data

Understanding Hadoop: A Foundation for Big Data Processing

Hadoop 2.x

Harnessing Hadoop: Empowering Data-Driven Innovation

Let’s research and the world the know about the Myths of Hadoop

#bigdata 32e?—?Hadoop: The platform of choice

Spark Or Hadoop : Which Is The Best Big Data Framework?

Hadoop and LVM

Doug Cutting Reflects on Hadoop’s Impact, Future

INTEGRATING LVM WITH HADOOP AND PROVIDING ELASTICITY TO DATA NODE STORAGE

Tony Baer的更多文章

SAP’s key to generation change is its best-kept secret

Data remains the lifeblood of Innovation

Data is On the Move

What's happening in data 2022?

Data Outlook 2022: Will the cloud finally get easier?

Serverless at re:Invent: Where should Amazon Redshift go?

Will AWS re:Invent break our digital exhaustion?

Data Mesh: Should you try this at home?

Informatica IPO: Better the second time around?

What's the next act for Databricks cofounder Ion Stoica?

社区洞察

其他会员也浏览了

The History of Hadoop and Big Data

Understanding Hadoop: A Foundation for Big Data Processing

Hadoop 2.x

Harnessing Hadoop: Empowering Data-Driven Innovation

Let’s research and the world the know about the Myths of Hadoop

#bigdata 32e?—?Hadoop: The platform of choice

Spark Or Hadoop : Which Is The Best Big Data Framework?

Hadoop and LVM

Doug Cutting Reflects on Hadoop’s Impact, Future

INTEGRATING LVM WITH HADOOP AND PROVIDING ELASTICITY TO DATA NODE STORAGE