Limiting The Storage In Hadoop Cluster By Data Node

TASK DESCRIPTION:

In a Hadoop cluster, how can a slave node contribute only a limited/specific amount of its storage to the cluster?

Hint: Linux partitions

TASK COMPLETION:

=> First, we create the Hadoop cluster on AWS and start the Name Node and Data Node services.

1. Initially the slave node contributes the storage of its '/' drive to the cluster. Storage of the '/' drive of the data node = 8GB. Let's check the storage shared by the data node -

hadoop dfsadmin -report

2. To share only a limited amount of the slave node's storage with the cluster, we use the concept of partitions. Create an EBS volume and then attach it to the data node.

Check the disk partitions on the data node -
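Interactively, `lsblk` or `fdisk -l` (as root) lists the attached disks. For use in a script, a minimal sketch of the same check can read the kernel's /proc/partitions table instead. The helper name partitions_of is hypothetical, and the disk name xvdf is an assumption inferred from the /dev/xvdf1 partition named in step 4.

```shell
# Print the name and size in MiB of a disk and its partitions.
# DISK is a kernel device name without /dev/ (for example xvdf).
# The optional second argument lets the table be read from another
# file; it defaults to /proc/partitions.
partitions_of() {
    disk="$1"
    table="${2:-/proc/partitions}"
    # Columns are: major, minor, #blocks (1 KiB units), name.
    # The first two lines are the header and a blank line.
    awk -v d="$disk" 'NR > 2 && index($4, d) == 1 {
        printf "%s %d MiB\n", $4, $3 / 1024
    }' "$table"
}

# Usage on the data node (xvdf is an assumed device name):
#   partitions_of xvdf
```

Running it before and after step 3 shows whether the new partition has appeared under the disk.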

3. Create a partition on the attached volume -

fdisk <device_name>
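fdisk asks its questions interactively. As a rough sketch, the answers for creating one primary partition can also be piped in. The helper name fdisk_answers is hypothetical, and the 2G size and the /dev/xvdf device are assumptions based on the /dev/xvdf1 partition of 2GB in step 4. The prompts differ between fdisk versions and partition table types, so it is safer to walk through the sequence by hand once before scripting it.

```shell
# Print the answers an interactive fdisk session expects when
# creating one primary partition of the given size (for example 2G)
# on a disk with a DOS/MBR partition table.
fdisk_answers() {
    size="$1"
    # n      = new partition
    # p      = primary
    # 1      = partition number
    # (empty)= accept the default first sector
    # +SIZE  = last sector, given as a size
    # w      = write the table to disk and exit
    printf 'n\np\n1\n\n+%s\nw\n' "$size"
}

# Usage as root (this changes the partition table of the disk):
#   fdisk_answers 2G | fdisk /dev/xvdf
```

Keeping the answers in a function makes it easy to inspect them with `fdisk_answers 2G` before anything is written to the disk.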

4. Check that the new partition /dev/xvdf1 with size 2GB has been created -

5. Format the partition -

mkfs.ext4 <device_name>

6. Mount the new partition /dev/xvdf1 of size 2GB on the data node directory -
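A minimal sketch of this step, assuming the data node directory is /dn1 (the name used in step 7): mount the partition on it and point the data node at that directory in hdfs-site.xml. The helper name datanode_site_xml is hypothetical. The property is called dfs.data.dir in Hadoop 1.x and dfs.datanode.data.dir in Hadoop 2.x and later, so use the one that matches the installed version.

```shell
# Print a minimal hdfs-site.xml that points the data node at DIR.
# dfs.data.dir is the Hadoop 1.x property name; Hadoop 2.x and later
# use dfs.datanode.data.dir instead.
datanode_site_xml() {
    dir="$1"
    cat <<EOF
<?xml version="1.0"?>
<configuration>
  <property>
    <name>dfs.data.dir</name>
    <value>${dir}</value>
  </property>
</configuration>
EOF
}

# Usage as root on the data node (/dn1 is an assumed directory name):
#   mkdir /dn1
#   mount /dev/xvdf1 /dn1
#   datanode_site_xml /dn1     # merge the <property> into hdfs-site.xml
#   hadoop-daemon.sh stop datanode
#   hadoop-daemon.sh start datanode
```

Because the data node stores its blocks inside the mounted directory, it cannot contribute more than the size of the partition behind it.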

7. Verify that the slave node (data node directory /dn1) now contributes only a limited/specific amount of storage to the cluster -

Total storage of EBS volume = 5GB
Shared/contributed storage = 2GB
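The report prints the capacity in bytes, so the check can be written as a small comparison. This is only a sketch: the helper name within_limit is hypothetical, and the usage lines assume the report labels its first capacity line `Configured Capacity:` followed by the byte count, and that the cluster has a single data node so this line equals that node's contribution.

```shell
# Succeed if BYTES is no larger than LIMIT_GIB gibibytes.
within_limit() {
    bytes="$1"
    limit_gib="$2"
    # 1 GiB = 1024 * 1024 * 1024 bytes
    [ "$bytes" -le $((limit_gib * 1024 * 1024 * 1024)) ]
}

# Usage on a node of the running cluster:
#   bytes=$(hadoop dfsadmin -report |
#           awk '/^Configured Capacity:/ { print $3; exit }')
#   within_limit "$bytes" 2 && echo "contribution is within 2GB"
```

The reported figure is normally a little below the partition size, because the ext4 filesystem keeps part of the space for its own metadata.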

Here we have successfully performed the task.

Thanks to Mr. Vimal Daga (Mentor) for giving such a research-oriented task, which helped me explore the core concepts of Big Data Hadoop.



