How Spark is complementing Hadoop

Spark is an in-memory analytics engine designed to complement MapReduce for low-latency and iterative jobs.
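
As a minimal sketch of why this matters for iterative work, the snippet below (pasteable into spark-shell; the HDFS path and the one-number-per-line input format are assumptions) caches a dataset once and then re-reads it from memory on every pass, instead of going back to disk the way a chain of MapReduce jobs would:

```scala
// Sketch: cache a dataset in memory and reuse it across iterations.
// The HDFS path is a placeholder; the file is assumed to hold one numeric value per line.
import org.apache.spark.sql.SparkSession

val spark = SparkSession.builder().appName("iterative-sketch").getOrCreate()

val points = spark.sparkContext
  .textFile("hdfs:///data/points.txt")
  .map(_.toDouble)
  .cache()                                // keep the parsed data in memory

var estimate = 0.0
for (_ <- 1 to 10) {                      // each pass reads from memory, not from HDFS
  estimate = points.map(p => math.abs(p - estimate)).mean()
}
println(s"Estimate after 10 passes: $estimate")
```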

Spark is a next-generation distributed processing engine for Big Data.

Spark has never been a competitor to Hadoop; in fact, it broadens the scope of a Big Data project to types of jobs that are difficult to do in Hadoop alone.

With its streaming support, Spark has become a good tool for online, near-real-time analytics.
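
As an illustration, here is a minimal Structured Streaming sketch that keeps a running word count over lines arriving on a socket; the host, port, and console sink are placeholder choices for demonstration:

```scala
// Sketch: running word count over a text socket with Structured Streaming.
// Host/port are placeholders; output goes to the console for demonstration.
import org.apache.spark.sql.SparkSession

val spark = SparkSession.builder().appName("streaming-sketch").getOrCreate()
import spark.implicits._

val lines = spark.readStream
  .format("socket")
  .option("host", "localhost")
  .option("port", 9999)
  .load()

val counts = lines.as[String]
  .flatMap(_.split("\\s+"))     // split each line into words
  .groupBy("value")             // the Dataset[String] column is named "value"
  .count()

counts.writeStream
  .outputMode("complete")       // emit the full updated counts on each trigger
  .format("console")
  .start()
  .awaitTermination()
```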

Spark is very fast and follows a top-down approach for streaming and analytics.

Spark is independent of the underlying file system and can integrate with any Hadoop-compatible file system, NoSQL database, or cloud platform.
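
For example, only the URI scheme changes when the same read is pointed at different storage. The paths below are placeholders, and the relevant connector (e.g. hadoop-aws for s3a://) is assumed to be on the classpath:

```scala
// Sketch: the same Parquet read works against HDFS, S3, or the local file system;
// only the URI scheme changes. All paths are placeholders.
import org.apache.spark.sql.SparkSession

val spark = SparkSession.builder().appName("storage-sketch").getOrCreate()

val fromHdfs  = spark.read.parquet("hdfs://namenode:8020/warehouse/events")
val fromS3    = spark.read.parquet("s3a://my-bucket/warehouse/events")   // needs hadoop-aws on the classpath
val fromLocal = spark.read.parquet("file:///tmp/events")
```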

Spark has a very good SQL module, Spark SQL, which can read from and write to almost any database.
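
A minimal sketch of reading from and writing back to a relational database over JDBC follows; the URL, table names, and credentials are placeholders, and the matching JDBC driver jar is assumed to be supplied via --jars or --packages:

```scala
// Sketch: Spark SQL over JDBC — read a table, aggregate it, write the result back.
// Connection details and table names are placeholders.
import org.apache.spark.sql.SparkSession

val spark = SparkSession.builder().appName("spark-sql-jdbc-sketch").getOrCreate()

val orders = spark.read
  .format("jdbc")
  .option("url", "jdbc:postgresql://dbhost:5432/shop")
  .option("dbtable", "public.orders")
  .option("user", "reporting")
  .option("password", "secret")
  .load()

orders.createOrReplaceTempView("orders")
val daily = spark.sql("SELECT order_date, SUM(amount) AS total FROM orders GROUP BY order_date")

daily.write
  .format("jdbc")
  .option("url", "jdbc:postgresql://dbhost:5432/shop")
  .option("dbtable", "public.daily_totals")
  .option("user", "reporting")
  .option("password", "secret")
  .mode("append")
  .save()
```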

Spark supports machine learning through MLlib and supports R through SparkR.
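
As a small MLlib sketch, the snippet below trains a logistic regression model on a tiny in-memory dataset; the feature values and labels are made up purely for illustration:

```scala
// Sketch: train a logistic regression model with MLlib on a toy dataset.
// Labels and feature vectors are illustrative values only.
import org.apache.spark.ml.classification.LogisticRegression
import org.apache.spark.ml.linalg.Vectors
import org.apache.spark.sql.SparkSession

val spark = SparkSession.builder().appName("mllib-sketch").getOrCreate()

val training = spark.createDataFrame(Seq(
  (1.0, Vectors.dense(0.0, 1.1, 0.1)),
  (0.0, Vectors.dense(2.0, 1.0, -1.0)),
  (0.0, Vectors.dense(2.0, 1.3, 1.0)),
  (1.0, Vectors.dense(0.0, 1.2, -0.5))
)).toDF("label", "features")

val lr = new LogisticRegression().setMaxIter(10).setRegParam(0.01)
val model = lr.fit(training)
model.transform(training).select("label", "prediction").show()
```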

Spark integrates very well with the Hadoop ecosystem tools.
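
One common integration point is the Hive metastore. The sketch below enables Hive support so Spark SQL can query existing Hive tables; the database and table names are placeholders and a reachable metastore is assumed:

```scala
// Sketch: query a Hive table through Spark with Hive support enabled.
// "weblogs.page_views" is a placeholder table; a configured Hive metastore is assumed.
import org.apache.spark.sql.SparkSession

val spark = SparkSession.builder()
  .appName("hive-sketch")
  .enableHiveSupport()
  .getOrCreate()

spark.sql("SELECT page, COUNT(*) AS hits FROM weblogs.page_views GROUP BY page").show()
```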
