CONFIGURE HADOOP AND START CLUSTER SERVICES USING ANSIBLE PLAYBOOK:-
Udit Agarwal
REDHAT ANSIBLE:-
Ansible is an open-source automation tool used for IT tasks such as configuration management, application deployment, intra-service orchestration, and provisioning. Automation is crucial these days: IT environments are too complex, and often need to scale too quickly, for system administrators and developers to keep up if they had to do everything manually. Automation simplifies complex tasks, not just making developers' jobs more manageable but allowing them to focus on other work that adds value to the organization. In other words, it frees up time and increases efficiency.
HADOOP CLUSTER:-
Hadoop is an Apache open-source framework written in Java that allows distributed processing of large datasets across clusters of computers using simple programming models. A Hadoop application works in an environment that provides distributed storage and computation across clusters of computers. Hadoop is designed to scale up from a single server to thousands of machines, each offering local computation and storage.
It uses a master-slave architecture, also known as the NameNode-DataNode architecture.
Task 11.1 Description:-
Configure Hadoop and start cluster services using an Ansible playbook.
Let's start...
My controller node's IP is 192.168.0.102; Ansible is installed there, and it also serves as my Hadoop Master/NameNode.
My target/managed nodes' IPs are 192.168.0.104 and 192.168.0.103, which are my Hadoop Slave/DataNode and Client respectively.
STEP 1:- Check the Ansible version.
ansible --version
STEP 2:- Update the inventory file and then verify connectivity with the ping module.
gedit /root/ip.txt
ansible all --list-hosts
ansible all -m ping
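The inventory file lists the managed nodes Ansible should control. A minimal sketch of `/root/ip.txt` matching the IPs above — the group names and the connection variables are assumptions, adjust them to your environment:

```
[datanode]
192.168.0.104

[client]
192.168.0.103

# Connection settings are illustrative; use SSH keys where possible
[all:vars]
ansible_user=root
```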
STEP 3:- Update the Ansible configuration file.
gedit /etc/ansible/ansible.cfg
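The article does not show the exact contents of the configuration file, but for a setup like this the key settings are typically the inventory path and host-key checking. A minimal sketch (values assumed from the inventory path used above):

```
[defaults]
# Point Ansible at the inventory file created in STEP 2
inventory = /root/ip.txt
# Skip interactive SSH host-key prompts on first connection
host_key_checking = False
```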
STEP 4:- Write the Ansible playbook.
gedit task11-1.yml
To configure the Hadoop cluster we need the JDK and the Hadoop software, so the playbook first copies and installs both packages on the NameNode, DataNode, and Client.
It then configures the NameNode, DataNode, and Client individually.
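The software-distribution part of the playbook described above could be sketched as follows. This is a minimal illustration, not the author's actual playbook: the variable names (`jdk_rpm`, `hadoop_rpm`) and package file names are assumptions, and the real task list also handles the per-role configuration:

```yaml
# Sketch: copy and install the JDK and Hadoop packages on every node.
- hosts: all
  vars_files:
    - var.yml
  tasks:
    - name: Copy JDK installer to the node
      copy:
        src: "{{ jdk_rpm }}"        # assumed variable, e.g. a JDK rpm file name
        dest: /root/

    - name: Copy Hadoop installer to the node
      copy:
        src: "{{ hadoop_rpm }}"     # assumed variable, e.g. hadoop-1.2.1-1.x86_64.rpm
        dest: /root/

    - name: Install the JDK
      command: "rpm -i /root/{{ jdk_rpm }}"

    - name: Install Hadoop
      command: "rpm -i /root/{{ hadoop_rpm }} --force"
```

After this common part, separate plays (or task blocks) would template the XML configuration files onto the NameNode, DataNode, and Client, format the NameNode, and start the daemons.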
gedit var.yml
It contains all the variables and their values used in the playbook.
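A hypothetical shape for `var.yml` — these names and values are illustrative only, chosen to line up with the IPs mentioned earlier; the real file may differ:

```yaml
# Assumed variable file; adjust names and values to your setup.
jdk_rpm: jdk-8u171-linux-x64.rpm          # assumed JDK package name
hadoop_rpm: hadoop-1.2.1-1.x86_64.rpm     # assumed Hadoop 1.x package name
namenode_ip: 192.168.0.102                # controller node / NameNode from above
namenode_dir: /nn                         # assumed NameNode storage directory
datanode_dir: /dn                         # assumed DataNode storage directory
```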
gedit hdfs-site.xml
The hdfs-site.xml files for the NameNode and DataNode, used as templates in the playbook, are stored in their respective folders, namenode_files and datanode_files.
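For Hadoop 1.x, the two hdfs-site.xml templates typically differ only in which storage property they set. A sketch of the NameNode variant, assuming the variable names from a `var.yml` like the one described above:

```xml
<!-- namenode_files/hdfs-site.xml (sketch); the DataNode template would
     instead set dfs.data.dir to the DataNode's storage directory. -->
<configuration>
  <property>
    <name>dfs.name.dir</name>
    <value>{{ namenode_dir }}</value>
  </property>
</configuration>
```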
gedit core-site.xml
The core-site.xml file for the NameNode, DataNode, and Client, used as a template in the playbook, is stored in the workspace.
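All three roles can share one core-site.xml template because they all need the same thing: the address of the NameNode. A sketch, assuming a `namenode_ip` variable and Hadoop 1.x property names; the port is an assumption, commonly 9001 in Hadoop 1.x tutorials:

```xml
<!-- core-site.xml (sketch), templated onto NameNode, DataNode, and Client -->
<configuration>
  <property>
    <name>fs.default.name</name>
    <value>hdfs://{{ namenode_ip }}:9001</value>
  </property>
</configuration>
```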
STEP 5:- Run the Ansible playbook.
ansible-playbook task11-1.yml
STEP 6:- Verify that the directories were created and the services started, then generate a report.
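On the NameNode, the verification could be done with commands like these (Hadoop 1.x CLI; the exact output depends on your cluster):

```
# List the running Java daemons; NameNode should appear here,
# and DataNode on the slave node
jps

# Summarize the cluster: configured capacity, live DataNodes, free space
hadoop dfsadmin -report
```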
We can clearly see that one DataNode (slave) has been successfully added to the NameNode (master), contributing 49.98 GB of configured capacity, of which 43.87 GB is available. To grow the cluster, we just need to add the IPs of additional DataNodes to the inventory file and rerun the playbook.
TASK COMPLETED SUCCESSFULLY!
Thanks for reading!