Limiting The Storage In Hadoop Cluster By Data Node

!! Hello Connections !!

Welcome, everyone, to my article based on a task of ARTH - The School Of Technologies.

TASK DESCRIPTION:

In a Hadoop cluster, find how to contribute a limited/specific amount of storage as a slave node to the cluster.

Hint: Use of partitions.

TASK COMPLETION:

1) I have built the Hadoop cluster on AWS Cloud. So, let's start the Hadoop services, i.e., the Name Node and the Data Node.
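A minimal sketch of the start-up commands, assuming a Hadoop 1.x setup that is already configured (script names and paths may differ in your distribution):

# on the master node
hadoop-daemon.sh start namenode

# on the slave node
hadoop-daemon.sh start datanode

# confirm the daemons are running
jps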


2) Any folder or directory lives inside a storage drive (for example, the C: drive on Windows), and by default that folder can grow to the full size of the drive. In a Hadoop cluster, the Data Node contributes its storage through a directory, which by default shares the entire storage of the drive on which that directory resides.
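For reference, this directory is the one set in hdfs-site.xml on the Data Node; a minimal sketch, assuming Hadoop 1.x (in Hadoop 2.x and later the property is dfs.datanode.data.dir) and the /dn1 directory used later in this article:

<configuration>
  <property>
    <!-- local directory where the Data Node stores HDFS blocks -->
    <name>dfs.data.dir</name>
    <value>/dn1</value>
  </property>
</configuration>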

We can check the size of the drive on which the Data Node storage directory is present using the command:

df -h

This Linux command shows the used and available space on every mounted disk or volume.


We can see that the size of the / drive is 10 GB. In my case the Data Node directory resides on the / drive, so let's verify that the Data Node shares the total storage of /:

hadoop dfsadmin -report

3) Now we need a way to control or limit this Data Node storage in the Hadoop cluster. For this we use the concept of disk partitions. So, let's create an EBS volume of some size and attach it to the Data Node instance.
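I attached the volume from the AWS web console, but the same can be done with the AWS CLI; a rough sketch (the availability zone, volume ID, and instance ID below are illustrative placeholders, not the real ones from my setup):

# create a 15 GB EBS volume in the Data Node's availability zone
aws ec2 create-volume --size 15 --availability-zone ap-south-1a --volume-type gp2

# attach it to the Data Node instance as device /dev/xvdf
aws ec2 attach-volume --volume-id vol-0123456789abcdef0 --instance-id i-0123456789abcdef0 --device /dev/xvdf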


We can see all the disks and partitions present on our Data Node using the Linux command:

fdisk -l

4) We have attached the 15 GB EBS volume /dev/xvdf to the Data Node. To create a partition on it, we use the command:

fdisk  device_name

Inside fdisk we create a new primary partition and specify its size via the last sector. In my case I created a partition of size 8 GB, as sketched below.
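For reference, the interactive fdisk session looks roughly like this (each comment marks the keystroke entered at the prompt; +8G tells fdisk to place the last sector 8 GB after the first):

fdisk /dev/xvdf
# n       -> new partition
# p       -> primary partition
# 1       -> partition number 1
# <Enter> -> accept the default first sector
# +8G     -> size of the partition (last sector)
# w       -> write the partition table and exit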

5) Let's check whether the partition was created successfully. Running fdisk -l again, we see that a partition named /dev/xvdf1 of size 8 GB has been created.


6) Now we have to format the partition. To format it with the ext4 filesystem, we use the command:

mkfs.ext4  device_name
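In our case the device is the new partition, so the command becomes:

mkfs.ext4 /dev/xvdf1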

7) After formatting the partition, we have to mount it on our Data Node directory. To mount the partition, we use the command:

mount  device_name  directory_name
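In our case, with /dn1 as the Data Node directory:

mount /dev/xvdf1 /dn1

Note that mount is temporary and does not survive a reboot; to make it permanent, an entry for /dev/xvdf1 and /dn1 would go in /etc/fstab.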

8) We can see that we now have control over, or a limit on, the Data Node storage: the /dn1 Data Node directory can only access the 8 GB partition we provided to it.
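To verify, we can restart the Data Node daemon so it picks up the freshly mounted directory and re-run the cluster report (a quick sketch, assuming the same Hadoop 1.x daemon script as before):

# restart the Data Node so it registers the newly mounted /dn1
hadoop-daemon.sh stop datanode
hadoop-daemon.sh start datanode

# the Configured Capacity should now be roughly 8 GB instead of 10 GB
hadoop dfsadmin -report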


We see that the Data Node no longer takes the total 15 GB of the EBS volume; we have given it a specific amount of storage using the concept of disk partitions.

In this way I successfully completed this task of ARTH - The School Of Technologies.

I would like to thank Mr. Vimal Daga for giving such a research-based task, which helped me explore my core concepts of Big Data - Hadoop.

For any queries or suggestions, DM me.

!! Thank you all for visiting my article !!

Keep Sharing, Keep Learning

