登录查看更多内容

Configure Hadoop and start cluster services using Ansible Playbook

Govind Bhardwaj

Software Engineer at miniOrange

发布日期: 2020年12月4日

+ 关注

ARTH - Task 11.1 ???????

Task Description??

?? Configure Hadoop and start cluster services using Ansible Playbook

Introduction To Hadoop -

* Big Data is not a technology we can say that it is just a umbrella of problems which occurs because of huge amount of data and in different formats.

* Visit this article to know more about BigData -

* To solve BigData problem Hadoop is used . Apache Hadoop is a collection of open-source software utilities that facilitates using a network of many computers to solve problems involving massive amounts of data and computation. It provides a software framework for distributed storage and processing of big data using the MapReduce programming model

* To know how to configure Hadoop Cluster Visit this Article -

* In this practical I will create All ansible playbooks in "/root/HadoopCluster/" -

* For this practical I will use three Virtual Machine on My Local System. My NameNode is (IP - 192.168.43.129) & DataNode is (IP - 192.168.43.94).

* Ansible Configuration File "/etc/ansible/ansibe.cfg" at Controller Node -

* Inventory File "/root/HadoopCluster/hosts.txt" at Controller Node ( In my case I create Two groups of Managed Node First --> NameNode Second --> DataNode) -

Visit my this article to know about Ansible Playbook & Grouping of Inventories -

* Check List of Managed Node with ansible command -

  To get all Managed Node IPs list -
  # ansible all --list-hosts

  To get "NameNode" group IPs list -
  # ansible NameNode --list-hosts

  To get "DataNode" group IPs list -
  # ansible DataNode --list-hosts

* Check Connectivity all Managed Node with Controller Node -

  # ansible all -m ping

* For this practical I will create many file but our main file (playbook) is "hadoopcluster.yml" . We will create all files "hadoopcluster.yml" , "install_package.yml" , "Configure_Node.yml" , "namenode.yml" , "datanode.yml" . Overview of our Ansible Playbooks -

Step - 1 Create "hadoopcluster.yml" file -

1(A) Task 1 -

1(B) Task 2 -

Visit this article to know more about loop -

1(C) Task 3 -

1(D) Task 4 -

Step -2 Gather All Data For setup Hadoop Cluster -

* I create a separate variable file "hadoop_var.yml".

2(A) "Hadoop_Package_Requirement" is variable which have requied package name & their command -

2(B) . "Node_MetaData" is a variable which have some metadata about Nodes

2(C) . "Node" is variable which have Hadoop Configuration Properties for NameNode & DataNodes -

Step - 3 Write Tasks in separate file "install_package.yml" to install Required Packages for establish Hadoop Cluster -

3(A) . Task 1

3(B) . Task 2

I have "hadoop-1.2.1-1.x86_64.rpm" & "jdk-8u171-linux-x64.rpm" also in "/root/HadoopCluster" directory.

3(C) . Task 3

Step - 4 Create "Configure_Node.yml" -

Step -5 Create "namenode.yml" file -

5(A) . Task 1

5(B) . Task 2

5(C) . Task 3

5(D) . Task 4

5(E) . Task 5

5(F) . Task 6

Step - 6 Create "datanode.yml" -

6(A) . Task 1

6(B) . Task 2

6(C) . Task 3

6(D) . Task 4

6(E) . Task 5

Step - 7 Run "hadoopcluster.yml" playbook -

    # ansible-playbook hadoopcluster.yml

Step - 8 Check Hadoop Cluster setup is done or not -

  At NameNode
  # jps
  # hadoop dfsadmin -report

  At DataNode
  # jps

* Upload File from namenode to Hadoop Cluster ( To check )-

  # hadoop fs -put home.html
  # hadoop fs -ls /

Our Hadoop Cluster is working good

Task is successfully done.

要查看或添加评论，请登录

Govind Bhardwaj的更多文章

RedHat OpenShift Case Study : Ford Motor

2021年3月13日

RedHat OpenShift Case Study : Ford Motor

Ford Motor Company is a global company based in Dearborn, Michigan. The company designs, manufactures, markets and…
Jenkins Case Study: Avoris Travel

2021年3月12日

Jenkins Case Study: Avoris Travel

Speed matters when your mission is to reinvent the travel business: for your agents, your customers, and, especially…
Create Chat Server Using Python

2021年2月16日

Create Chat Server Using Python

Hello Guys , In this practical we will create a chat server using Python . The Description of Practical or Task is…
Kubernetes Multi-Node Cluster Setup on AWS with Ansible

2021年2月15日

Kubernetes Multi-Node Cluster Setup on AWS with Ansible

Hello Guys , In this article we will setup Kubernetes Cluster on Amazon Web Services (AWS) with the help of Ansible…
WordPress Deployment On EC2 Instance And Use RDS For Database

2021年2月14日

WordPress Deployment On EC2 Instance And Use RDS For Database

Hello Guys , In this article we will deploy WordPress on AWS EC2 Instance and we will use AWS RDS Service For Database…
Apache Web Server On Docker Container With Ansible (Dynamic Inventory ) using Container IP

2021年2月12日

Apache Web Server On Docker Container With Ansible (Dynamic Inventory ) using Container IP

Hello Guys , In this article we will configure Apache Web Server On Docker Container with Ansible . We will use…
Dynamically Load Variable According to OS Type In Ansible

2021年2月11日

Dynamically Load Variable According to OS Type In Ansible

Hello Guys , In this article we will load a variable file in Ansible according to Operating System Type ( Means If we…
Automation With Ansible

2020年12月29日

Automation With Ansible

Today’s my blog is to share, what I have learnt from session which I as ARTH Learner had with two best industry expert…
How IBM use Kubernetes to Solve their challenge

2020年12月26日

How IBM use Kubernetes to Solve their challenge

Introduction to IBM - > International Business Machines Corporation (IBM) is an American Multinational Technology and…
Configure Routing Table In Such a Way So That We Can Ping Only Google IP Not FaceBook Ip

2020年12月18日

Configure Routing Table In Such a Way So That We Can Ping Only Google IP Not FaceBook Ip

ARTH - Task 13 ??????? Task Description?? ?? Create a Setup so that you can ping google but not able to ping Facebook…

See all articles

Introduction To Hadoop -

Step - 1 Create "hadoopcluster.yml" file -

1(A) Task 1 -

1(B) Task 2 -

1(C) Task 3 -

1(D) Task 4 -

Step -2 Gather All Data For setup Hadoop Cluster -

Step - 3 Write Tasks in separate file "install_package.yml" to install Required Packages for establish Hadoop Cluster -

3(A) . Task 1

3(B) . Task 2

3(C) . Task 3

Step - 4 Create "Configure_Node.yml" -

Step -5 Create "namenode.yml" file -

5(A) . Task 1

5(B) . Task 2

5(C) . Task 3

5(D) . Task 4

5(E) . Task 5

5(F) . Task 6

Step - 6 Create "datanode.yml" -

6(A) . Task 1

6(B) . Task 2

6(C) . Task 3

6(D) . Task 4

6(E) . Task 5

Step - 7 Run "hadoopcluster.yml" playbook -

Step - 8 Check Hadoop Cluster setup is done or not -

Govind Bhardwaj的更多文章

RedHat OpenShift Case Study : Ford Motor

Jenkins Case Study: Avoris Travel

Create Chat Server Using Python

Kubernetes Multi-Node Cluster Setup on AWS with Ansible

WordPress Deployment On EC2 Instance And Use RDS For Database

Apache Web Server On Docker Container With Ansible (Dynamic Inventory ) using Container IP

Dynamically Load Variable According to OS Type In Ansible

Automation With Ansible

How IBM use Kubernetes to Solve their challenge

Configure Routing Table In Such a Way So That We Can Ping Only Google IP Not FaceBook Ip

社区洞察

其他会员也浏览了

CONFIGURE HADOOP AND START CLUSTER SERVICES USING ANSIBLE PLAYBOOK:-

Integrating LVM with Hadoop and providing Elasticity to DataNode Storage

Enroll For Interactive Free Hadoop Online Demo Session & Interact With Big Data Experts On 15th March At 7: 00 AM (IST) At Kelly Technologies.

Setup a Multi-Node Hadoop Cluster using Docker

Contribute Limited Amount of Storage of Data Node in Hadoop Cluster

"Getting Started with Hadoop on Ubuntu: Installation Made Easy"

YARN & MapR, YARN Requirements and YARN Frameworks

Building a Hadoop Cluster from the Powerful Automation Tool: Ansible

Task 9.2: Create a Web Menu Using Python-CGI and API :"Integrating all the different technologies..!!"

Setting Up Hadoop Cluster Using Ansible