VSS Lab Assignment #01

This document provides a detailed guide on installing and setting up Hadoop on Ubuntu, including steps for installing Java, creating a user, configuring SSH, and downloading Hadoop. It outlines the necessary configurations for Hadoop's environment variables and various XML configuration files to ensure proper functionality. Finally, it describes how to start the Hadoop cluster and access the web interface.

Uploaded by

aqib.ilyas90f

Lahore Garrison University

Virtual Systems & Services - Lab

Assignment # 01

Submitted To:
Mam Alishwa Amin
Submitted By:
Syed M. Ali Hamza
Roll No:
Fa-21/BS-IT/057
(Sec-B)

Installation and Setup of Hadoop on Ubuntu

To install Hadoop, you will go through the following steps:

• Installing Java and configuring environment variables

• Creating a user and configuring SSH

• Installing and configuring Hadoop

Step 1: Installing Java on Ubuntu

To install Java on Ubuntu, install the default JDK and JRE packages. The -y flag answers the confirmation prompt automatically:

sudo apt install -y default-jdk

sudo apt install -y default-jre

To verify the installation, check the java version:

java -version

Step 2: Create a user for Hadoop and configure SSH

First, create a new user named hadoop:

sudo adduser hadoop

To give the new user superuser privileges, add it to the sudo group:

sudo usermod -aG sudo hadoop

Once done, switch to the hadoop user:

sudo su - hadoop

Next, install the OpenSSH server and client:

sudo apt install openssh-server openssh-client -y

Now, use the following command to generate private and public keys:

ssh-keygen -t rsa

Here, it will ask you:

• Where to save the key (hit enter to save it inside your home directory)
• Create passphrase for keys (leave blank for no passphrase)

Now, add the public key to authorized_keys:

cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys

Use the chmod command to change the file permissions of authorized_keys:

sudo chmod 640 ~/.ssh/authorized_keys
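As a quick check of the permissions set above, the following sketch confirms that a file has an exact mode. It runs against a scratch file rather than the real ~/.ssh directory. The has_mode helper and the demo_dir variable are illustrative names introduced here, not part of OpenSSH.

```shell
# has_mode FILE MODE: succeed only if FILE has exactly the octal MODE.
# (Hypothetical helper for illustration.)
has_mode() {
  # find prints the path only when the permission bits match exactly
  [ -n "$(find "$1" -perm "$2" 2>/dev/null)" ]
}

# Scratch directory standing in for ~/.ssh
demo_dir="$(mktemp -d)"
touch "$demo_dir/authorized_keys"
chmod 640 "$demo_dir/authorized_keys"

if has_mode "$demo_dir/authorized_keys" 640; then
  echo "authorized_keys has mode 640"
else
  echo "authorized_keys has a different mode"
fi
```

To check the real file, the same helper can be called as has_mode ~/.ssh/authorized_keys 640.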

Finally, verify the SSH configuration:

ssh localhost

If you did not set a passphrase, all you have to do is type yes and press Enter. If you added a passphrase for the keys, you will be asked to enter it here.

Step 3: Download and install Apache Hadoop on Ubuntu

If you have created a user for Hadoop, first, log in as the hadoop user:

sudo su - hadoop

Then download the Hadoop release (this guide uses version 3.4.1):

wget [Link]

Once the download is complete, extract the archive using the following command:

tar -xvzf [Link]

Next, move the extracted directory to /usr/local/hadoop using the following command:

sudo mv hadoop-3.4.1 /usr/local/hadoop

Now, use the mkdir command to create a directory to store logs:

sudo mkdir /usr/local/hadoop/logs

Finally, change the ownership of /usr/local/hadoop to the hadoop user:

sudo chown -R hadoop:hadoop /usr/local/hadoop

Step 4: Configure Hadoop on Ubuntu

First, open the .bashrc file using the following command:

sudo nano ~/.bashrc

Jump to the end of the file in the nano text editor, and paste the following:

export HADOOP_HOME=/usr/local/hadoop

export HADOOP_INSTALL=$HADOOP_HOME

export HADOOP_MAPRED_HOME=$HADOOP_HOME

export HADOOP_COMMON_HOME=$HADOOP_HOME

export HADOOP_HDFS_HOME=$HADOOP_HOME

export YARN_HOME=$HADOOP_HOME

export HADOOP_COMMON_LIB_NATIVE_DIR=$HADOOP_HOME/lib/native

export PATH=$PATH:$HADOOP_HOME/sbin:$HADOOP_HOME/bin

export HADOOP_OPTS="-[Link]=$HADOOP_HOME/lib/native"

Save changes and exit from the nano text editor.

To enable the changes, source the .bashrc file:

source ~/.bashrc
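A minimal sketch for confirming that the Hadoop directories ended up on PATH is shown here. It sets the two relevant variables itself so it can run on its own; path_contains is a hypothetical helper name introduced for this example.

```shell
# Same values as in the .bashrc lines above
export HADOOP_HOME=/usr/local/hadoop
export PATH=$PATH:$HADOOP_HOME/sbin:$HADOOP_HOME/bin

# path_contains DIR: succeed if DIR is one of the PATH entries.
# (Hypothetical helper for illustration.)
path_contains() {
  case ":$PATH:" in
    *":$1:"*) return 0 ;;
    *) return 1 ;;
  esac
}

for dir in "$HADOOP_HOME/sbin" "$HADOOP_HOME/bin"; do
  if path_contains "$dir"; then
    echo "on PATH: $dir"
  else
    echo "missing from PATH: $dir"
  fi
done
```

Wrapping PATH in colons before matching means an entry is found only as a whole component, so /usr/local/hadoop/bin does not match a longer path that merely starts with it.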

Step 5: Configure Java environment variables

To use Hadoop, you need to enable its core components, which include YARN, HDFS, MapReduce, and Hadoop-related project settings.

To do that, you will have to define Java environment variables in [Link].

Edit the [Link]

First, open the [Link]:

sudo nano $HADOOP_HOME/etc/hadoop/[Link]

Press Alt + / to jump to the end of the file, then paste the following lines to set the Java path. Replace java-21 with the JDK version installed on your system:

export JAVA_HOME=/usr/lib/jvm/java-21-openjdk-amd64

export HADOOP_CLASSPATH+=" $HADOOP_HOME/lib/*.jar"

Save changes and exit from the text editor.
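The JAVA_HOME value depends on which JDK is installed. A minimal sketch for deriving a candidate path from the java binary on PATH follows. The helper name java_home_from_binary is hypothetical, and the sketch assumes the binary lives at <JAVA_HOME>/bin/java, which is the usual layout for Ubuntu's OpenJDK packages.

```shell
# java_home_from_binary PATH: strip the trailing /bin/java component.
# (Hypothetical helper; assumes the <JAVA_HOME>/bin/java layout.)
java_home_from_binary() {
  printf '%s\n' "${1%/bin/java}"
}

if command -v java >/dev/null 2>&1; then
  # readlink -f follows the symlink chain from /usr/bin/java to the real binary
  resolved="$(readlink -f "$(command -v java)")"
  echo "Suggested JAVA_HOME: $(java_home_from_binary "$resolved")"
else
  echo "java not found on PATH; install the JDK first"
fi
```

Listing /usr/lib/jvm with ls is another way to see which JDK directories are present.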

Next, change your current working directory to /usr/local/hadoop/lib:

cd /usr/local/hadoop/lib

Here, download the javax activation file:

sudo wget [Link]api/1.2.0/[Link]

Once done, check the Hadoop version in Ubuntu:

hadoop version

Next, you will have to edit the [Link] to specify the URL for the name node.

Edit the [Link]

First, open the [Link] using the following command:

sudo nano $HADOOP_HOME/etc/hadoop/[Link]

And add the following lines in between <configuration> </configuration>:

<description>The default file system URI</description>

</property>

Save the changes and exit from the text editor.
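Since the property name and value are not legible in the lines above, here is a hedged sketch of what a complete entry might look like. It assumes the file is core-site.xml, the property is fs.defaultFS, and the NameNode address is hdfs://localhost:9000. All three are assumptions for illustration, not values taken from this guide. The file is written to a scratch directory so nothing under $HADOOP_HOME is modified.

```shell
# Scratch directory standing in for $HADOOP_HOME/etc/hadoop
conf_dir="$(mktemp -d)"

# Assumed file name, property name, and URI; adjust to your setup.
cat > "$conf_dir/core-site.xml" <<'EOF'
<configuration>
  <!-- Assumption: NameNode on this machine, port 9000 -->
  <property>
    <name>fs.defaultFS</name>
    <value>hdfs://localhost:9000</value>
    <description>The default file system URI</description>
  </property>
</configuration>
EOF

cat "$conf_dir/core-site.xml"
```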

Next, create a directory to store node metadata using the following command:

sudo mkdir -p /home/hadoop/hdfs/{namenode,datanode}

And change the ownership of the created directory to the hadoop user:

sudo chown -R hadoop:hadoop /home/hadoop/hdfs

Edit the [Link] configuration file

By configuring [Link], you define the locations for storing node metadata and the fs-image file.

So first open the configuration file:

sudo nano $HADOOP_HOME/etc/hadoop/[Link]

And paste the following line in between <configuration> ... </configuration>:

</property>

<value>[Link]

</property>
<property>

<value>[Link]

</property>

Save changes and exit from the [Link].
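The property names in the fragment above did not survive, so the following is only a sketch under stated assumptions: the file is taken to be hdfs-site.xml, the properties are assumed to be dfs.replication, dfs.namenode.name.dir, and dfs.datanode.data.dir, and the paths reuse the /home/hadoop/hdfs directories created earlier. The emit_property helper is a hypothetical name, and output goes to a scratch directory.

```shell
# emit_property NAME VALUE: print one <property> block.
# (Hypothetical helper for illustration.)
emit_property() {
  printf '  <property>\n    <name>%s</name>\n    <value>%s</value>\n  </property>\n' "$1" "$2"
}

# Scratch directory standing in for $HADOOP_HOME/etc/hadoop
conf_dir="$(mktemp -d)"

# Assumed property names and values for a single-node setup
{
  echo '<configuration>'
  emit_property dfs.replication 1
  emit_property dfs.namenode.name.dir file:///home/hadoop/hdfs/namenode
  emit_property dfs.datanode.data.dir file:///home/hadoop/hdfs/datanode
  echo '</configuration>'
} > "$conf_dir/hdfs-site.xml"

cat "$conf_dir/hdfs-site.xml"
```

A replication factor of 1 is chosen here because a single machine has only one DataNode to hold each block.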

Edit the [Link] file

By editing the [Link], you can define the MapReduce values.

To do that, first, open the configuration file using the following command:

sudo nano $HADOOP_HOME/etc/hadoop/[Link]

And paste the following line in between <configuration> ... </configuration>:

</property>

Save and exit from the nano text editor.
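Only a closing tag remains of the entry above. As a hedged illustration, the sketch below assumes the file is mapred-site.xml and that the intended setting is mapreduce.framework.name with the value yarn, which tells MapReduce jobs to run on YARN. Both the name and the value are assumptions, and the file is written to a scratch directory.

```shell
# Scratch directory standing in for $HADOOP_HOME/etc/hadoop
conf_dir="$(mktemp -d)"

# Assumed property: run MapReduce jobs on YARN
cat > "$conf_dir/mapred-site.xml" <<'EOF'
<configuration>
  <property>
    <name>mapreduce.framework.name</name>
    <value>yarn</value>
  </property>
</configuration>
EOF

cat "$conf_dir/mapred-site.xml"
```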

Edit the [Link] file

This is the last configuration file that needs to be edited to use the Hadoop service.

The purpose of editing this file is to define the YARN settings.

First, open the configuration file:

sudo nano $HADOOP_HOME/etc/hadoop/[Link]

Paste the following in between <configuration> ... </configuration>:

<name>[Link]-services</name>

<value>mapreduce_shuffle</value>

</property>
Save changes and exit from the config file.
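Pasted fragments can easily leave an unmatched tag behind, so a rough balance check may help before formatting the NameNode. The sketch below assumes the file is yarn-site.xml and the property is yarn.nodemanager.aux-services (only the "-services" suffix and the mapreduce_shuffle value are visible above). The tags_balanced helper is a hypothetical name, and it only counts lines, so it is not a full XML validator.

```shell
# tags_balanced FILE: succeed if the number of lines with <property>
# equals the number of lines with </property>. (Hypothetical helper.)
tags_balanced() {
  opens="$(grep -c '<property>' "$1" || true)"
  closes="$(grep -c '</property>' "$1" || true)"
  [ "$opens" -eq "$closes" ]
}

# Scratch directory standing in for $HADOOP_HOME/etc/hadoop
conf_dir="$(mktemp -d)"

# Assumed property name; the value is the one shown in this guide
cat > "$conf_dir/yarn-site.xml" <<'EOF'
<configuration>
  <property>
    <name>yarn.nodemanager.aux-services</name>
    <value>mapreduce_shuffle</value>
  </property>
</configuration>
EOF

if tags_balanced "$conf_dir/yarn-site.xml"; then
  echo "property tags are balanced"
else
  echo "property tags are NOT balanced"
fi
```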

Finally, use the following command to validate the Hadoop configuration and to format the
HDFS NameNode:

hdfs namenode -format

Step 6: Start the Hadoop cluster

To start the Hadoop cluster, you will have to start the previously configured nodes.

Start the NameNode and DataNode first:

[Link]

Next, start the node manager and resource manager:

[Link]

To verify whether the services are running as intended, use the following command:

jps
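To make the jps check less manual, the sketch below reports which daemons are absent from a jps listing. The list of expected names (NameNode, DataNode, SecondaryNameNode, ResourceManager, NodeManager) is an assumption for a single-node setup, and missing_daemons is a hypothetical helper name.

```shell
# missing_daemons: read jps-style lines ("<pid> <name>") from stdin and
# print each expected daemon that does not appear. (Hypothetical helper;
# the expected list is an assumption for a single-node setup.)
missing_daemons() {
  listing="$(cat)"
  for daemon in NameNode DataNode SecondaryNameNode ResourceManager NodeManager; do
    # Compare the second field exactly, so NameNode does not match
    # SecondaryNameNode
    if ! printf '%s\n' "$listing" | awk -v d="$daemon" '$2 == d { found = 1 } END { exit !found }'; then
      echo "$daemon"
    fi
  done
}

if command -v jps >/dev/null 2>&1; then
  jps | missing_daemons
else
  echo "jps not found on PATH; it ships with the JDK"
fi
```

An empty result means every expected daemon was listed.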

Step 7: Access the Hadoop Web Interface

To access the Hadoop web interface, find your machine's IP address and enter it in your browser's address bar, followed by port number 9870.

My IP is [Link] so I will be entering the following:

[Link]
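As a small sketch of the step above, the following builds the address from the machine's first reported IP. The namenode_ui_url helper is a hypothetical name, the http scheme is an assumption (a default, non-TLS setup), and hostname -I is Linux-specific.

```shell
# namenode_ui_url IP: print the web interface address for the given IP.
# (Hypothetical helper; assumes plain http on port 9870.)
namenode_ui_url() {
  printf 'http://%s:9870\n' "$1"
}

# hostname -I lists the host's addresses; take the first one.
ip="$(hostname -I 2>/dev/null | awk '{print $1}' || true)"
if [ -n "$ip" ]; then
  namenode_ui_url "$ip"
else
  echo "could not determine an IP address; try: ip addr"
fi
```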
