登录查看更多内容

Configuration of Hadoop Cluster using Ansible

Ganesh Chaudhari

Software Engineer at Dassault Systèmes || Frontend Engineer

发布日期: 2020年11月29日

+ 关注

Task 11.1

Configure Hadoop cluster using Ansible

1. Installation of Hadoop Requirements 2.Configuration of Name Node & Data Node 3. Starting Hadoop Services

lets start, Ansible

Ansible is configuration management tool. It works on push mechanism and it is agentless. ansible is built on the top of python hence before ansible installation we should have installed python3. we can install ansible using pip3 install ansible. after that configure /etc/ansible/ansible.cfg and inventory files like below

lets write playbook for master nodes

1.Transfer java JDK and install it on target node because Hadoop built from java language then transfer Hadoop library which will be compactiable with java.

2. Creating directory for master node and updating the /etc/hadoop/hdfs-site.xml and /etc/hadoop/core-site.xml file

3. Format the /master directory to store metadata of data nodes. then start service of master node.

4. Then firewall rules like 50070/tcp,50010/tcp and 9001/tcp because 9001 is used for service and 50070 is used for WebUI.

lets start, configuration of datanode

1.Transfer JDK and install it on target node because Hadoop built from java language then transfer Hadoop library which will be compactiable with java.

2. Creating directory for data node and updating the /etc/hadoop/hdfs-site.xml and /etc/hadoop/core-site.xml file

3.start service of data node and add the firewall rules

then lets create file for variables which which was mentioned in above playbook

then check syntax of playbook by ansible-playbook --syntax-check playbook_name then run playbook by ansible-playbook playbook_name

Name node:

Data Node:

Thus I have successfully completed task 11.1

要查看或添加评论，请登录

Ganesh Chaudhari的更多文章

Configuration of K8s Multinode Cluster over AWS by integrating ansible and terraform with dynamic inventory.

2021年2月6日

Configuration of K8s Multinode Cluster over AWS by integrating ansible and terraform with dynamic inventory.

Integration of terraform, ansible, AWS and k8s. lets understand what is terraform, ansible, AWS and k8s.
Case Study of Kubernetes

2020年12月26日

Case Study of Kubernetes

What is Kubernetes? Kubernetes is a portable, extensible, open-source platform for managing containerized workloads and…
AWS Cloud using AWS CLI.

2020年10月13日

AWS Cloud using AWS CLI.

AWS CSA Training Task : 2 1.Create a key pair 2.

6 条评论
Configuration of Load balancer HAPROXY Using Ansible

2020年10月9日

Configuration of Load balancer HAPROXY Using Ansible

Ansible Task-3 Deploy a Load Balancer and multiple Web servers on AWS instances using Ansible. 1.
Big Data

2020年9月17日

Big Data

What do you think, how big companies stored their customers data. You may think about big data but actually it is…

4 条评论
Deployment of Webserver on AWS using Ansible

2020年8月22日

Deployment of Webserver on AWS using Ansible

Deployment of Webserver on AWS through Ansible TASK 2 1.Provision of EC2 instance through Ansible 2.

3 条评论
Integration of Ansible with Docker

2020年8月5日

Integration of Ansible with Docker

Integration of Ansible with Docker Ansible TASK 1 : Write an Ansible playbook to perform following operations in…

2 条评论

See all articles

Configuration of Hadoop Cluster using Ansible

Ganesh Chaudhari

Software Engineer at Dassault Systèmes || Frontend Engineer

Task 11.1

Configure Hadoop cluster using Ansible

Ganesh Chaudhari的更多文章

社区洞察

其他会员也浏览了

Hadoop vs Spark Comparison

Pig Latin and its Operators

Hadoop 3: Comparison with Hadoop 2 and Spark

Apache Hadoop vs Apache Spark

Hadoop Cluster Revealed

Apache Pig Architecture

CONFIGURE HADOOP AND START CLUSTER SERVICES USING ANSIBLE PLAYBOOK:-

Impala

Hadoop – Architecture

Introduction to Hadoop

Task 11.1

Configure Hadoop cluster using Ansible

Ganesh Chaudhari的更多文章

Configuration of K8s Multinode Cluster over AWS by integrating ansible and terraform with dynamic inventory.

Case Study of Kubernetes

AWS Cloud using AWS CLI.

Configuration of Load balancer HAPROXY Using Ansible

Big Data

Deployment of Webserver on AWS using Ansible

Integration of Ansible with Docker

社区洞察

其他会员也浏览了

Hadoop vs Spark Comparison

Pig Latin and its Operators

Hadoop 3: Comparison with Hadoop 2 and Spark

Apache Hadoop vs Apache Spark

Hadoop Cluster Revealed

Apache Pig Architecture

CONFIGURE HADOOP AND START CLUSTER SERVICES USING ANSIBLE PLAYBOOK:-

Impala

Hadoop – Architecture

Introduction to Hadoop