Contribute a Limited Amount of Storage from a DataNode in a Hadoop Cluster
Task :-
In a Hadoop cluster, how do we contribute only a limited/specific amount of storage from a slave (DataNode) to the cluster?
* To complete this task, we have to use the Linux partition concept.
* I will follow the steps below for this task -
<1>. Add a New Hard Disk to the DataNode
<2>. Create a Partition on the Added Device at the DataNode
<3>. Format & Mount the Partition at the DataNode
<4>. Configure the NameNode
<5>. Configure the DataNode
<6>. Check the Contribution of the DataNode to the Distributed File Storage of the Hadoop Cluster
I have a Hadoop cluster in which one NameNode and one DataNode are present.
IP of NameNode - 192.168.43.106, hostname "NN1"
IP of DataNode - 192.168.43.65, hostname "DN1"
Step - 1: Add a New Hard Disk to the DataNode -
I am using Oracle VirtualBox, so we don't need to purchase a new physical hard disk; we will use a virtual hard disk instead.
We are adding a new hard disk because we don't have any unallocated space left to create partitions.
* You can also refer to this video for a better understanding -
To add a new hard disk, the DataNode VM must be in the "Stopped" (powered off) state. Then follow these steps -
(A) Go to Storage in the Settings of the DataNode VM -
(B) Click on "Controller: SATA" and then click the "+" icon next to "Controller: SATA" -
(C) Click on "Create" -
(D) Click on "Next" -
(E) Click "Next" again -
(F) Choose your hard disk size & click "Create" -
In my case the hard disk size is 10 GiB.
(G) Select "DN_1.vdi" (in my case the new hard disk is named "DN_1.vdi") and choose it -
(H) Now our hard disk is attached -
(I) To check whether the hard disk is attached, run the "fdisk -l" command -
You will see an entry like "/dev/sdb : 10 GiB".
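A minimal sketch of this check (the device name /dev/sdb and the 10 GiB size are from my setup; yours may differ):

fdisk -l                # list all disks attached to the DataNode
# the new virtual disk should appear roughly as:
#   Disk /dev/sdb: 10 GiB, 10737418240 bytes, 20971520 sectors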
Step - 2: Create a Partition on the Added Device at the DataNode -
* You can also refer to this video for a better understanding -
(A) Run "fdisk /dev/sdb" command -
"/dev/sdb" is name of added device in previous step.
(B) Run "n" to create new partition -
(C) Run "p" -
* Here I want to create Primary Partitions.
(D) Press "Enter" -
(E) Again press "Enter" -
(F) Give the value of the last sector -
* I want to create a 2 GiB partition so that the DataNode can contribute only 2 GiB to the Hadoop cluster.
(G) Run "w" to Save this Partition -
(H) Run "fdisk -l /dev/sdb" to check partition -
(I) Run "udevadm settle" to load Driver for Partition -
* Whenever New device is added in Computer then we have to load respectively driver so that we can communicate with that device.
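Here is a rough sketch of the whole fdisk dialogue for this step. The "+2G" last-sector value is what I use to get a 2 GiB partition; adjust it to whatever size you want the DataNode to contribute:

fdisk /dev/sdb          # open the new disk in fdisk
#   Command (m for help): n          -> new partition
#   Partition type:       p          -> primary
#   Partition number:     <Enter>    -> accept the default (1)
#   First sector:         <Enter>    -> accept the default
#   Last sector:          +2G        -> make the partition 2 GiB
#   Command (m for help): w          -> write the table and exit

fdisk -l /dev/sdb       # verify that /dev/sdb1 now exists with ~2 GiB
udevadm settle          # wait until the kernel/udev has created /dev/sdb1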
Step - 3: Format & Mount the Partition at the DataNode -
* You can also refer to this video for a better understanding -
(A) Run "mkfs.ext4 /dev/sdb1" to format the partition -
* In my case I am using the "ext4" filesystem type; you can choose another type if you prefer.
(B) Create a directory where you want to mount the partition -
* I will use this directory for the Hadoop cluster's distributed file storage.
(C) Mount the partition at the "/DataNode" directory -
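A sketch of the commands for this step (ext4 and the /DataNode mount point are my choices, not requirements):

mkfs.ext4 /dev/sdb1         # format the 2 GiB partition with ext4
mkdir /DataNode             # create the mount point
mount /dev/sdb1 /DataNode   # mount the partition
df -h /DataNode             # verify: the size should show roughly 2 GiB

Note that a mount done this way does not survive a reboot; add an entry to /etc/fstab if you want it to be permanent.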
Step - 4: Configure the NameNode -
(A) Make a directory "/nn" -
(B) Configure the "hdfs-site.xml" file -
(C) Configure the "core-site.xml" file -
(D) Format the NameNode -
(E) Start the NameNode -
Check with the "jps" command whether the NameNode is running or not.
(F) Stop firewalld -
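To make these sub-steps concrete, here is a minimal sketch of the NameNode side. I am assuming Hadoop 1.x-style property names (dfs.name.dir, fs.default.name), port 9001, and config files under /etc/hadoop/ - adjust the path, port, and property names (dfs.namenode.name.dir / fs.defaultFS on Hadoop 2+) to your installation:

mkdir /nn                                  # (A) directory for the NameNode metadata

# (B) hdfs-site.xml (assumed path /etc/hadoop/hdfs-site.xml)
cat > /etc/hadoop/hdfs-site.xml <<'EOF'
<?xml version="1.0"?>
<configuration>
  <property>
    <name>dfs.name.dir</name>
    <value>/nn</value>
  </property>
</configuration>
EOF

# (C) core-site.xml (assumed path; 9001 is an assumed port)
cat > /etc/hadoop/core-site.xml <<'EOF'
<?xml version="1.0"?>
<configuration>
  <property>
    <name>fs.default.name</name>
    <value>hdfs://192.168.43.106:9001</value>
  </property>
</configuration>
EOF

hadoop namenode -format                    # (D) format the NameNode
hadoop-daemon.sh start namenode            # (E) start the NameNode daemon
jps                                        # should list "NameNode"
systemctl stop firewalld                   # (F) stop firewalld so the DataNode can connect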
Step - 5: Configure the DataNode -
(A) Configure the "hdfs-site.xml" file -
(B) Configure the "core-site.xml" file -
(C) Stop firewalld -
(D) Start the DataNode -
Check with the "jps" command whether the DataNode is running or not.
Step - 6: Check the Contribution of the DataNode to the Distributed File Storage of the Hadoop Cluster -
* Run the command "hadoop dfsadmin -report" -
You can see that the DataNode is contributing around 2 GiB. Thus we can limit the amount of storage a DataNode contributes to a Hadoop cluster.
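The relevant part of the report looks roughly like this (the values below are illustrative; "Configured Capacity" reflects the size of the mounted partition, minus a little filesystem overhead):

hadoop dfsadmin -report
#   Datanodes available: 1
#   Name: 192.168.43.65:50010
#   Configured Capacity: ~2 GiB    (the /dev/sdb1 partition mounted at /DataNode)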
Thank you for giving your time to my article.