Setup Of Hadoop Cluster ; Uploading, Reading, Deleting Files Through Cluster.

Setup Of Hadoop Cluster ; Uploading, Reading, Deleting Files Through Cluster.

Hadoop Cluster is use to solve BigData problems. Hadoop is product of Apache and it is very popular in BigData world, facebook is using this to solve there BigData problem to Store there data of user.

In 2012, Facebook has revealed that it is generating around 500+ terabytes of data every day. In which 2.7 billion were likes and around 300 million photos per day. Another exciting thing is Facebook is scanning around 105 terabytes of data per each half hour.


So let's see how to build Hadoop cluster to solve bigdata problem

No alt text provided for this image


To create this hadoop cluster we need some basic things first;

  1. O.S. ( we are using RedHat 8 )
  2. Hadoop Latest Version
  3. JDK ( Java Development Kit )
  4. More than one OS ( You can use VM, Cloud )

That's it let's do this

  • First Download Hadoop and JDK
No alt text provided for this image


  • Now Install this with redhat cmd "rpm -i file(hadoop/jdk)"
No alt text provided for this image
No alt text provided for this image
  • Now Configure Hadoop Core and HDFS file to create Cluster;
No alt text provided for this image
No alt text provided for this image

After same as this create a client and upload some file and use tcpdump cmd to check uploading of file.

No alt text provided for this image

Now we'll see files on WebUI, for this type ip:50070 on browser;

No alt text provided for this image
No alt text provided for this image

Now we check uploading of file in our slave node for this we use tcpdump -i enp0s3 -n -X

No alt text provided for this image

Now Check in your slave node that client IP is coming in your pc to upload file;

No alt text provided for this image

If you have more than one slave node and if you stopped one of slave node then also hadoop will upload file because of it's replication feature.


To prove this we uploaded a big file to see replication, block is showing in the pic is the replication of file;

No alt text provided for this image


You can perform this practical alone just create more than instance, like this I done this same on AWS Instance;?

No alt text provided for this image
No alt text provided for this image




Thank you

要查看或添加评论,请登录

Shivani Agarwal的更多文章

社区洞察

其他会员也浏览了