Hadoop Installation on Ubuntu
Ravindra soran
Aspiring DevOps Enthusiast || RedHat Certified RHCSA & RHCE || AWS, TERRAFORM
In this article, I will walk you through installing Hadoop on a single-node system. The tutorial uses a custom script that installs the necessary dependencies and sets up Hadoop for you.
The approach assumes that you are using a fresh Ubuntu virtual machine (VM). I'll explain how to download, prepare, and run the script. Let's get started!
Step-by-Step Instructions
1. Setting Up the Ubuntu VM
2. Update Your System
Before proceeding, make sure your system is up to date with the latest packages. In your terminal, run the following (the && ensures the upgrade only starts if the package list refresh succeeds):
sudo apt update && sudo apt upgrade -y
Important: Do not install any additional software or packages at this stage, as the script provided will take care of all the required dependencies for you.
3. Download the Installation Script
The next step is to download the Hadoop installation script. To do this, use wget to fetch the file directly from the GitHub repository:
wget https://raw.githubusercontent.com/rsoran/ns3-install/main/hadoop.sh
This command will download the hadoop.sh script to your current directory.
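Since the script will run commands on your machine, it can be worth a quick look before you execute it. The sketch below is one way to do that, assuming hadoop.sh is a Bash script. The helper name preview_script is my own and is not part of the installation script.

```shell
# preview_script is a hypothetical helper, not part of the tutorial's script.
# It prints the first lines of a downloaded script and checks that it parses.
preview_script() {
    script="$1"
    # -s is true only if the file exists and is not empty
    if [ ! -s "$script" ]; then
        echo "missing or empty: $script" >&2
        return 1
    fi
    # Show the opening lines so you can see the interpreter and first steps
    head -n 20 "$script"
    # bash -n parses the file without executing any of it
    bash -n "$script" && echo "syntax OK: $script"
}

# Usage (in the directory where you ran wget):
#   preview_script hadoop.sh
```

A syntax check does not tell you what the script does, so reading through it with less hadoop.sh is still the more thorough option.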
4. Make the Script Executable
After the script is downloaded, we need to make it executable so we can run it. Use the chmod command:
chmod +x hadoop.sh
5. Run the Script
Now that the script is executable, you can run it to begin the installation of Hadoop. Execute the following command:
./hadoop.sh
The script takes care of the necessary installation steps for you. Once it completes successfully, Hadoop will be installed on your machine.
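If something goes wrong during the run, a saved copy of the output makes it much easier to see where. Here is a minimal sketch of a wrapper that shows the output live and also writes it to a log file. The name run_logged and the log file name are assumptions of mine, not something the script provides.

```shell
# run_logged is a hypothetical helper: it runs a command, shows its output,
# saves a copy to a log file, and returns the command's own exit status.
run_logged() {
    log="$1"
    shift
    # The exit status is written to a side file because a plain pipeline
    # would otherwise report the status of tee, not of the command.
    { "$@" 2>&1; echo "$?" > "$log.status"; } | tee "$log"
    return "$(cat "$log.status")"
}

# Usage (instead of running ./hadoop.sh directly):
#   run_logged hadoop-install.log ./hadoop.sh
```

After the run, hadoop-install.log holds everything the script printed, including error messages.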
6. Check the Installation
After the installation process, you can verify that Hadoop is installed and its daemons are running by using the jps command, which lists the Java processes on the machine:
jps
The output should list Hadoop-related processes such as NameNode, DataNode, and ResourceManager, which indicates that the Hadoop daemons are up and the installation was successful.
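Rather than reading the jps output by eye, you could check it with a few lines of shell. This is a small sketch that only looks for the three daemons named above. The helper name missing_daemons is hypothetical, and your setup may run additional processes that it does not check.

```shell
# missing_daemons is a hypothetical helper: it reads jps output on stdin
# and prints each expected daemon that does not appear in it.
missing_daemons() {
    output=$(cat)
    for daemon in NameNode DataNode ResourceManager; do
        # -w matches whole words, so NameNode does not match SecondaryNameNode
        echo "$output" | grep -qw "$daemon" || echo "$daemon"
    done
}

# Usage:
#   jps | missing_daemons
```

If the command prints nothing, all three daemons were found. Any name it prints is a daemon that is not running.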
Optional: Installing NetBeans for Hadoop Development
If you plan to develop applications using Hadoop and would like to use an Integrated Development Environment (IDE), you can install NetBeans.
Install OpenJDK 17
Hadoop requires Java, and for this tutorial, we’ll use OpenJDK 17. Install it using:
sudo apt install openjdk-17-jdk -y
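To confirm which Java version ends up active after the install, you can look at the output of java -version. The sketch below pulls out just the major version number. The helper name java_major is my own, and it assumes the usual version "..." line that OpenJDK prints.

```shell
# java_major is a hypothetical helper: it reads `java -version` output on
# stdin and prints the major version number.
java_major() {
    # Extract the quoted version string, e.g. 17.0.9 or 1.8.0_392
    ver=$(sed -n 's/.*version "\([^"]*\)".*/\1/p' | head -n 1)
    case "$ver" in
        1.*) echo "$ver" | cut -d. -f2 ;;  # legacy scheme: 1.8.x means Java 8
        *)   echo "$ver" | cut -d. -f1 ;;
    esac
}

# Usage (java -version writes to stderr, hence the 2>&1):
#   java -version 2>&1 | java_major
```

If more than one JDK is installed, the number printed is the one your shell currently picks up.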
Install NetBeans
Once Java is set up, you can install NetBeans via the Snap package manager:
sudo snap install netbeans --classic
This will install NetBeans on your system, which you can use to develop, debug, and manage your Hadoop applications.
Conclusion
By following these steps, you should now have Hadoop installed and running on a single-node Ubuntu machine. The installation is automated through a simple script, making the process more accessible and faster.
Additionally, by installing NetBeans and OpenJDK 17, you’ll have a complete development environment ready for building and testing Hadoop-related applications.
If you run into any issues or have questions about the setup, feel free to refer to the official Hadoop documentation or ask for support in relevant forums.
Happy Hadoop-ing!