登录查看更多内容

Hadoop Cluster using Ansible

Pranesh Prashar

Associate Software Engineer at Bosch Global Software Technologies

发布日期: 2020年12月19日

Big Data :-

Today in this growing world we need speed and data to predict and manage the desired outcomes. So large companies like Facebook and Google generate huge amount of data and manage them.So to manage these huge data we need some tools so that it is easy to use and can be easy implemented so for that Hadoop comes into play.

Hadoop :-

It is a open-source software made for reliable and scalable distributed system. Hadoop software library is a framework that allows for distributed processing of large data sets across the clusters of computer using simple programing models. That's all to be needed for now.

Ansible :-

Ansible is an open-source software provisioning, configuration management, and application-deployment tool enabling infrastructure as code. It runs on many Unix-like systems, and can configure both Unix-like systems as well as Microsoft Windows. It includes its own declarative language to describe system configuration.

Let's follow this step to configure the hadoop using ansible

For using ansible for configuration we need 1 os for namenode (namenode works as the os conf with ansible for performing and config the systems where we want to perform some operation) and 2nd for target ndoe (target node works as the os where we want to do some operations)

Datanode ip :- (192.168.43.186)

Namenode ip :- (192.168.43.41)

STEP 2 :- (To configure hadoop we need to install its dependencies like JDK and apache hadoop)

to install jdk and hadoop system we need to first install the software on both os (i.e namenode and datanode or target node). We need to write the following code in the task to install the softwares. I am using redhat commands that is why i use the condition to run only on the specified os.

so to complete the configuration throug anisble we are going to write the ansible-playbook in ".yml file". here is code for that

to run this file we need to save the file in .yml file . I have saved as hadoop.yml file

now let's run this and see how it goes.

As in the above hadoop and jdk were already installed previously if I have ran it obviously it will throw error so I used ignore_errors to move forward.

So at last all run successfully and we are good to go.

Thankyou..

Incredible Interns

11 个月

Great job on automating big data Hadoop with Ansible! Your attention to detail in using ansible-playbook and YML file is impressive. To take your skills even further, you might want to explore automating other technologies with Ansible or dive deeper into big data analytics. What other areas in tech are you interested in exploring? Your initiative is a huge step towards becoming a big data engineer. Have you thought about the specific industries or projects you'd like to work on in the future?

要查看或添加评论，请登录

Pranesh Prashar的更多文章

Docker using javascript and python CGI

2021年6月25日

Docker using javascript and python CGI

In this task you have to create a Web Application for Docker (one of the great Containerization Tool which provides the…
JavaScript (programming language): Is JavaScript being used in the industry? Is it worth investing time learning it?

2021年6月16日

JavaScript (programming language): Is JavaScript being used in the industry? Is it worth investing time learning it?

JavaScript is the language to add interaction to the web browser. So if you want any sort of interactive elements other…
A great journey in K8S

2021年6月9日

A great journey in K8S

Initally I thought this would be another so called trainning that we get to see now a days in online platform like…
Creating Helm Chart Containing Wordpress and Mysql

2021年6月9日

Creating Helm Chart Containing Wordpress and Mysql

Task :-Create a Helm chart containing wordpress and mysql environment ii) Publish it on artifacthub.io (This is the…
Launching A WordPress Application With MYSQL Database in K8S Cluster On AWS Using Ansible !

2021年6月9日

Launching A WordPress Application With MYSQL Database in K8S Cluster On AWS Using Ansible !

Steps to do this project: step 1: we have to launch 3 ec2 instances on aws step 2: I have launched an extra instance on…
Live Streaming Video Chat App without voice using cv2 module of Python And Socket Programming

2021年6月8日

Live Streaming Video Chat App without voice using cv2 module of Python And Socket Programming

This Article Is Mainly About Transferring Video using Socket Programming : To Perform This Practical We Have to First…
Confusion Matrix

2021年6月7日

Confusion Matrix

A confusion matrix is a fairly common term when it comes to machine learning. Today I would be trying to relate the…

1 条评论
GUI container on the Docker

2021年5月30日

GUI container on the Docker

Task Description ?? ?? GUI container on the Docker ?? Launch a container on docker in GUI mode ?? Run any GUI software…

1 条评论
Predicting Salary Using Linear Regression Model On Top of Docker Container

2021年5月27日

Predicting Salary Using Linear Regression Model On Top of Docker Container

?? Pull the Docker container image of CentOS image from DockerHub and create a new container ?? Install the Python…

1 条评论
How to make HTTPD Service idempotence in Nature using Ansible?

2020年12月26日

How to make HTTPD Service idempotence in Nature using Ansible?

In this blog we are going to solve a problem and that is "RESTARTING OF HTTPD SERVER IN LINUX IS NOT AN IDEMPOTENCE IN…

See all articles

Hadoop Cluster using Ansible

Pranesh Prashar

Associate Software Engineer at Bosch Global Software Technologies

Big Data :-

Hadoop :-

Ansible :-

Let's follow this step to configure the hadoop using ansible

For using ansible for configuration we need 1 os for namenode (namenode works as the os conf with ansible for performing and config the systems where we want to perform some operation) and 2nd for target ndoe (target node works as the os where we want to do some operations)

Datanode ip :- (192.168.43.186)

STEP 2 :- (To configure hadoop we need to install its dependencies like JDK and apache hadoop)

so to complete the configuration throug anisble we are going to write the ansible-playbook in ".yml file". here is code for that

Pranesh Prashar的更多文章

社区洞察

其他会员也浏览了

Harnessing the Power of Hadoop A Guide to Effective Data Management

Harnessing the Power of Hadoop A Guide to Effective Data Management

Hadoop Cluster Revealed

What are the prerequisites to learn Hadoop?

Integrating LVM with Hadoop and providing Elasticity to DataNode Storage

Hadoop Architecture

Deep Dive into Hadoop YARN

Let’s research and the world the know about the Myths of Hadoop

#bigdata 30e?—?Apache Flume and Sqoop

Oozie

Big Data :-

Hadoop :-

Ansible :-

Let's follow this step to configure the hadoop using ansible

For using ansible for configuration we need 1 os for namenode (namenode works as the os conf with ansible for performing and config the systems where we want to perform some operation) and 2nd for target ndoe (target node works as the os where we want to do some operations)

Datanode ip :- (192.168.43.186)

STEP 2 :- (To configure hadoop we need to install its dependencies like JDK and apache hadoop)

so to complete the configuration throug anisble we are going to write the ansible-playbook in ".yml file". here is code for that

Pranesh Prashar的更多文章

Docker using javascript and python CGI

JavaScript (programming language): Is JavaScript being used in the industry? Is it worth investing time learning it?

A great journey in K8S

Creating Helm Chart Containing Wordpress and Mysql

Launching A WordPress Application With MYSQL Database in K8S Cluster On AWS Using Ansible !

Live Streaming Video Chat App without voice using cv2 module of Python And Socket Programming

Confusion Matrix

GUI container on the Docker

Predicting Salary Using Linear Regression Model On Top of Docker Container

How to make HTTPD Service idempotence in Nature using Ansible?

社区洞察

其他会员也浏览了

Harnessing the Power of Hadoop A Guide to Effective Data Management

Harnessing the Power of Hadoop A Guide to Effective Data Management

Hadoop Cluster Revealed

What are the prerequisites to learn Hadoop?

Integrating LVM with Hadoop and providing Elasticity to DataNode Storage

Hadoop Architecture

Deep Dive into Hadoop YARN

Let’s research and the world the know about the Myths of Hadoop

#bigdata 30e?—?Apache Flume and Sqoop

Oozie