Hadoop’s Legacy: No more fear of data

A couple of weeks back, the Apache community pulled the plug on a slew of dormant Hadoop projects. Comrade-in-arms Andrew Brust gave the rundown in his ZDNet post. The "Hadoop is dead" tagline is hardly new; it dates back at least five years, to when Gartner announced a drastic slowdown in new Hadoop projects.

There's no question that activity for implementing Hadoop in the raw has practically ceased. That doesn't mean that the underlying technology is dead: surviving projects live on in the Cloudera Data Platform, which, unlike previous Hadoop generations, is a coherent platform packaged through its own binaries rather than a string of disparate zoo-animal open source projects.

The more important point, however, is that Hadoop opened up the world of what we used to call "Big Data." Before Hadoop, and the associated discoveries that Google published as research papers (file systems that could live on commodity hardware and almost perfectly linear scale-out processing), the notion of going outside the data warehouse to bring in non-relational data, and lots of it (in the terabyte and petabyte range), was practically unthinkable. Hadoop introduced us to the art of the possible. That today there are a variety of paths for analyzing and building machine learning models on all that data, and that relational data is no longer the limitation, is all thanks to Hadoop.

For our take on how this happened and what it made possible, check out our detailed post on ZDNet. And stay tuned: drawing on all our research over the past decade, we'll be putting together a timeline of exactly how this all unfolded.

Daniel Kirsch

Enabling businesses to make better decisions with #AI & #CausalAI | Head of Community, Geminos Software

3 years ago

AI and Machine Learning need context and connections to all data. In our hybrid/multi-cloud world, Hadoop as a data source is going to live on for decades. Like software, databases never die! #data #ai #machinelearning #hadoop #multicloud #analytics

Tony - great article, for me especially the foundation in the first principles of cloud native data and its origins. This helped create a good perspective for the future, placing Hadoop in context of its origins and what is happening now and into the future.

Wow Apache Sentry and Chukwa were from the early part of the last decade! Darwin has spoken.

Steve Friedberg

Delivering the "moments of WOW!" through corporate magic

3 years ago

Tape is dead, too. At least, that's what I've been hearing for the past 20 years.

Saksham Agrawal

Fintech Data Products

3 years ago

Very true!
