Setting Up a Hadoop Cluster: Uploading, Reading, and Deleting Files Through the Cluster
Shivani Agarwal
Software Developer @ThinkTech Software Inc. | Full Stack | React | Python | DSA
A Hadoop cluster is used to solve Big Data problems. Hadoop is an Apache project and is very popular in the Big Data world; Facebook, for example, uses it to store its users' data.
So let's see how to build a Hadoop cluster to solve a Big Data problem.
To create this Hadoop cluster we need some basic things first:
- An OS (we are using RedHat 8)
- The latest Hadoop release
- JDK (Java Development Kit)
- More than one machine (you can use VMs or cloud instances)
That's it. Let's do this:
- First, download the Hadoop and JDK packages.
- Install both on every node with the RedHat command rpm -i <package> (once for Hadoop, once for the JDK).
- Now configure Hadoop's core-site.xml and hdfs-site.xml files to create the cluster; a sketch of these steps follows this list.
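Here is a minimal sketch of those steps on the master (NameNode). The RPM file names, the property names (this is the Hadoop 1.x style that matches the port 50070 WebUI used below), and the paths /nn and /dn are my assumptions; adjust them to your versions and layout:

```
# Install the JDK and Hadoop RPMs (file names are examples)
rpm -i jdk-8u171-linux-x64.rpm
rpm -i hadoop-1.2.1-1.x86_64.rpm

# core-site.xml on every node: point HDFS at the NameNode
# (<namenode-ip> is a placeholder for the master's IP)
cat > /etc/hadoop/core-site.xml <<'EOF'
<configuration>
  <property>
    <name>fs.default.name</name>
    <value>hdfs://<namenode-ip>:9001</value>
  </property>
</configuration>
EOF

# hdfs-site.xml on the master: where the NameNode stores its metadata
cat > /etc/hadoop/hdfs-site.xml <<'EOF'
<configuration>
  <property>
    <name>dfs.name.dir</name>
    <value>/nn</value>
  </property>
</configuration>
EOF
# On each slave, set dfs.data.dir (e.g. /dn) instead of dfs.name.dir.

# Format the NameNode once, then start the daemons
hadoop namenode -format
hadoop-daemon.sh start namenode    # on the master
hadoop-daemon.sh start datanode    # on each slave
```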
After that, set up a client the same way, upload a file, and use the tcpdump command to check the upload; the basic file operations are sketched below.
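On the client (configured with the same core-site.xml), the upload, read, and delete operations from the title look roughly like this; demo.txt is just an example file:

```
# Upload (write) a file into HDFS
hadoop fs -put demo.txt /demo.txt

# Read it back and list the root directory
hadoop fs -cat /demo.txt
hadoop fs -ls /

# Delete it when done
hadoop fs -rm /demo.txt
```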
Now we'll see the files in the WebUI; for this, open ip:50070 in a browser.
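If you prefer the terminal, the same listing can be fetched over WebHDFS on the same port, assuming WebHDFS is enabled (the root path / here is an example):

```
curl -i "http://<namenode-ip>:50070/webhdfs/v1/?op=LISTSTATUS"
```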
Now we check the file upload on our slave node; for this we use tcpdump -i enp0s3 -n -X.
Check on your slave node that packets from the client's IP are arriving; this confirms the client writes the file's blocks directly to the DataNode.
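A slightly narrower capture than the one above, filtering on the DataNode's default data-transfer port (50010 in this Hadoop line, an assumption to verify against your configuration); enp0s3 is the interface name on my VM and may differ on yours:

```
# Run on the slave node: show only packets hitting the DataNode port,
# with IPs left unresolved (-n) and a hex/ASCII dump of the payload (-X)
tcpdump -i enp0s3 -n -X port 50010
```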
If you have more than one slave node and you stop one of them, Hadoop will still upload the file because of its replication feature.
To prove this, we uploaded a big file and inspected its blocks; each block of the file is replicated across the remaining DataNodes.
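You can check this yourself with Hadoop's fsck tool, which reports each block and the DataNodes holding its replicas (/bigfile is an example path):

```
# Show the file, its blocks, and which DataNodes hold each replica
hadoop fsck /bigfile -files -blocks -locations
```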
You can perform this practical on your own; just create more than one instance. I did the same on AWS instances.
Thank you