登录查看更多内容

Controlling Hadoop Storage

Sujagi Verma

Werkstudentin @ Siemens Healthineers

发布日期: 2020年10月21日

>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>

Big Data :- Big data is a collection of large datasets that cannot be processed using traditional computing techniques. It is not a single technique or a tool, rather it has become a complete subject, which involves various tools, techniques and frameworks.

Hadoop :- Hadoop is an open source, Java based framework used for storing and processing big data. The data is stored on inexpensive commodity servers that run as clusters. Hadoop uses the MapReduce programming model for faster storage and retrieval of data from its nodes.

>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>

In a Hadoop Cluster, a limited/specific amount of storage can be contributed. This can easily be initiated with the help of partitions in the following ways:-

Create a Hadoop cluster containing Name Node and a Data Node

The Data Node in this case with 10GB of storage is mounted on / drive contributing all of its storage to the Master Node. For controlling the storage attach an EBS volume of desired size with the Data Node.

All the disks present in our Data Node can be viewed as follows.

lsblk  or  fdisk -l

An EBS volume /dev/xvdf of 20GB has been attached to the Data Node. Now, its time to create a physical partition of desired size. I have created a partition of size 2GB.

fdisk /dev/xvdf

The next step is formatting the partition being created.

mkfs.ext4 /dev/xvdf1

The final step , mount the partition to our Data Node directory.

mount /dev/xvdf1 /dn1

The limited storage of 2GB being shared can now be viewed easily.

THANK YOU

Toshine Garg

UTS Australia | Chandigarh University | Pianist

4 年

Excellent work ????Sujagi Verma

1 次回应

Prasant Mahato

DevOps, Cloud & Performance Engineer| DevOps Engineer

4 年

Good work Sujagi Verma

1 次回应

Saurabh Sharma

Open for Contracts | Love Startups

4 年

Short and straight to point. ?

1 次回应

查看更多评论

要查看或添加评论，请登录

Sujagi Verma的更多文章

Ransomware Attacks in the Cloud

2024年8月9日

Ransomware Attacks in the Cloud

A Growing Threat to AWS Users and How to Stay Safe Ransomware has evolved from a nuisance to a top-tier threat…

3 条评论
WordPress Application using Amazon RDS as a backend !!

2021年7月3日

WordPress Application using Amazon RDS as a backend !!

What is WordPress ? WordPress is a free, open-source website creation platform which requires access to database to…
Instance-Volume Attachment using Terraform

2021年6月26日

Instance-Volume Attachment using Terraform

A terraform code for all the below mentioned steps is being provided- Create a key pair Create a security group Launch…

3 条评论
Companies that got benefitted from AWS - KIA MOTORS

2021年6月25日

Companies that got benefitted from AWS - KIA MOTORS

AWS has significantly more services, and more features within those services, than any other cloud provider–from…
Automate a dynamic Infrastructure over AWS using Terraform

2021年6月1日

Automate a dynamic Infrastructure over AWS using Terraform

What is Terraform : Terraform is an open-source infrastructure as code software tool that enables you to safely and…
Automate K8s Multi Node Cluster Over AWS using Ansible

2021年5月30日

Automate K8s Multi Node Cluster Over AWS using Ansible

Kubernetes Cluster - A Kubernetes cluster is a set of node machines for running containerized applications. If you’re…
High Availability Architecture using AWS CLI

2021年2月11日

High Availability Architecture using AWS CLI

The Architecture includes- Webserver configured on EC2 Instance. Document Root (/var/www/html) made persistent by…

2 条评论
Getting started with AWS-CLI !

2020年10月13日

Getting started with AWS-CLI !

The AWS Command Line Interface (CLI) is a unified tool to manage your AWS services. With just one tool to download and…
Deploying WordPress on top of Google Cloud Platform along with Kubernetes Integration

2020年8月26日

Deploying WordPress on top of Google Cloud Platform along with Kubernetes Integration

So finally here I am with the successful completion of Google Cloud Platform task and here goes the self reflection of…

8 条评论

See all articles

Controlling Hadoop Storage

Sujagi Verma

Werkstudentin @ Siemens Healthineers

Sujagi Verma的更多文章

社区洞察

其他会员也浏览了

Harnessing the Power of Hadoop A Guide to Effective Data Management

Harnessing the Power of Hadoop A Guide to Effective Data Management

Hadoop

#bigdata 30e?—?Apache Flume and Sqoop

Hadoop

Oozie

Hadoop

Oozie

How To Create Hadoop Cluster In Just 10 Minutes ?

Integrating LVM with Hadoop and providing Elasticity to DataNode Storage

Sujagi Verma的更多文章

Ransomware Attacks in the Cloud

WordPress Application using Amazon RDS as a backend !!

Instance-Volume Attachment using Terraform

Companies that got benefitted from AWS - KIA MOTORS

Automate a dynamic Infrastructure over AWS using Terraform

Automate K8s Multi Node Cluster Over AWS using Ansible

High Availability Architecture using AWS CLI

Getting started with AWS-CLI !

Deploying WordPress on top of Google Cloud Platform along with Kubernetes Integration

社区洞察

其他会员也浏览了

Harnessing the Power of Hadoop A Guide to Effective Data Management

Harnessing the Power of Hadoop A Guide to Effective Data Management

Hadoop

#bigdata 30e?—?Apache Flume and Sqoop

Hadoop

Oozie

Hadoop

Oozie

How To Create Hadoop Cluster In Just 10 Minutes ?

Integrating LVM with Hadoop and providing Elasticity to DataNode Storage