Configure Hadoop and start cluster services using Ansible Playbook !!!
Anushka Visapure
Solution-Oriented DevOps Engineer || Skilled in Kubernetes | Terraform | Ansible | Docker | Git and GitHub | GitHub Action || Expanding Capabilities in AWS | GCP | Linux.
What is Ansible ?
- Ansible is an open-source software provisioning, configuration management, and application-deployment tool enabling infrastructure as code. It runs on many Unix-like systems, and can configure both Unix-like systems as well as Microsoft Windows. It includes its own declarative language to describe system configuration.
What is Hadoop ?
The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-availability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-available service on top of a cluster of computers, each of which may be prone to failures.
Pre-requisites:
- For the Integratiom of Ansible with Hadoop we require three redhat8 system with yum configuration
- * Controller Node
- * Managed Node
So let's Start ......
So first we have to start both the system ..then go to the controller node and install the ansible . Ansible tool work on the python thus to install the ansible we have to use the pip3 command
command : pip3 install ansible
Then we have check weather the Ansible is install or not .
command : ansible --version
Go to the managed and get the IP of managed node then go to the controller node and create the inventory file
After that create the configuration file by using the command :
" vim /etc/ansible/"name_of_file".cfg "
Now check the connection between the controller node and managed node by using the command : ansible all -m ping
Now create the ansible-playbook for Namenode using the following command :
vim namenode.yml
Now run the namenode.yml playbook to setup the namenode :
command : ansible-playbook namenode.yml
Now create the ansible-playbook for Datanode using the following command :
vim datanode.yml
Now run the datanode.yml playbook to setup the datanode :
command : ansible-playbook datanode.yml
Once the installtion is finished we can see that the Hadoop is successfully installed and the Namenode and Datanode services are running on the respective target nodes
And Finally we successfully configured the Namenode and Datanode with the help of Ansible !!!
And finally Task completed successfully...
Thank you for Reading !!!
Former SDE Intern @Raja Software Labs, Pune
4 年Great Work
Jr. Soft Engg @Cognizant, EDI-Maps Developer, 2X OCI, 1xAWS Certified, 1X Aviatrix Certified, AT&T Summer Learning Academy Extern, LW summer Research Intern, ARTH Learner, 1X Gitlab Certified Associate, ARTH 2.0 LW_TV
4 年Keep it up Anushka!!
MTS 1 @Cohesity | Ex-Veritas | Kubernetes | Docker | Golang | Python
4 年Well done Anushka ?