Hadoop vs. MPP

Hadoop vs. MPP

With Big Data Map/Reduce is always the first term that comes to mind. But it’s not the only way to handle large amounts of data. There are database systems especially built to deal with huge amounts of data and they are called Massively Parallel Processing (MPP) databases.

MPP database systems have been around for a longer time than Map/Reduce and its most popular integration Hadoop and are based on a shared nothing architecture. The data is partitioned across several nodes of hardware and queries are processed via a network interconnect on a central server. They often use commodity hardware that is as inexpensive as hardware for Map/Reduce. For working with data they have the advantage to make use of SQL as their interface, the language used by most Data Scientists and other analytic professionals so far.

Map/Reduce provides a Java interface to analyze the data, which comes with more time to implement than just write an SQL statement. Hadoop has some projects, that provide a SQL similar query language, like Hive which provides HiveQL, an SQL-like query language, as an interface.

Since both systems handle data, there will be a lot gained, when both are combined. There are already projects working on that, like Aster Data nCluster or Teradata and Hortonworks.

There is even a new product bringing both worlds together as one product, Hadapt. With this product, you can access all your data, structured or unstructured, in a single platform. Each node has space for SQL as well as for Map/Reduce.

Last but not least a list of some MPP databases available right now:

·        GreenPlum

·        Stado

·        ParAccel

·        Vertica

Depending on your business needs, you may not need a Map/Reduce cluster, but an MPP database or both to benefit from their respective strengths in your implementation.

Best Regards,

Anoop Goyal

Email: [email protected]

要查看或添加评论,请登录

Anoop Goyal的更多文章

  • WHERE & WHY BIG DATA PROJECTS FAIL

    WHERE & WHY BIG DATA PROJECTS FAIL

    Over the past 3 Years I have seen the number of big data projects go up significantly and most of the companies I have…

  • E Learning Trends & BIG 4

    E Learning Trends & BIG 4

    Different industries across the globe realizing the value of investing in a strong, robust eLearning infrastructure…

  • Five key cloud trends to look forward to in 2017

    Five key cloud trends to look forward to in 2017

    There’s no denying that cloud computing has changed the way both large enterprises and small businesses operate. The…

  • Top Free Online Sales Tools- Discover how to Increase your Sales

    Top Free Online Sales Tools- Discover how to Increase your Sales

    Let’s face it — prospecting can be a painful, expensive, time-draining activity if it’s not managed effectively. For…

  • HOW TO CHOOSE A BIG DATA SOLUTION: STEP BY STEP

    HOW TO CHOOSE A BIG DATA SOLUTION: STEP BY STEP

    CHOOSING A BIG DATA SOLUTION: SEVEN STEPS Clearly, choosing a big data solution isn’t easy. As companies of all sizes…

  • Salesforce Practices

    Salesforce Practices

    Hi, Hope you are doing great. Actually, I wanted to touch base with you to understand your interest around Salesforce…

社区洞察

其他会员也浏览了