HADOOP 1.X Installation Steps On Ubuntu

This document outlines 46 steps to install Hadoop 1.x in pseudo-distributed mode on Ubuntu. It covers installing Java, configuring environment variables, creating a dedicated Hadoop user, setting up SSH keys for passwordless access, downloading and extracting the Hadoop tarball, configuring the core-site.xml, hdfs-site.xml, and mapred-site.xml files, and starting the daemons that make up the fully configured Hadoop installation.


HADOOP INSTALLATION STEPS (Pseudo-Distributed Mode)

1. $ uname -m  to find whether the OS is 32-bit or 64-bit


2. $ sudo apt-get update  to refresh the package lists in Ubuntu
3. $ sudo apt-get install openjdk-7-jre  to install the JRE
4. $ java -version  to confirm that Java installed successfully and to see its version
5. $ which java  to find the path where Java is installed
6. $ su root  to switch to the root user
7. $ export JAVA_HOME=/usr  to set JAVA_HOME (based on the path found in step 5)
8. $ export PATH=$JAVA_HOME/bin:$PATH  to put JAVA_HOME on the PATH
9. $ echo $PATH  to confirm that the path is correctly set for Java
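
A quick way to double-check steps 7-9 before moving on (a minimal sketch; it assumes Java landed under /usr as found in step 5):

$ echo $JAVA_HOME               # expect /usr
$ $JAVA_HOME/bin/java -version  # should print the same version as step 4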
10. $ sudo apt-get install openssh-server  SSH is required for inter-machine connections
11. $ sudo apt-get install openssh-client  SSH is required for inter-machine connections
12. $ sudo addgroup hadoop  create a dedicated group for Hadoop users
13. $ sudo adduser --ingroup hadoop hduser  create a user named hduser and add it to the hadoop group
14. $ su hduser  switch to hduser
15. $ ssh-keygen -t rsa -P ""  create a passwordless RSA key for hduser
16. $ cat $HOME/.ssh/id_rsa.pub >> $HOME/.ssh/authorized_keys  append the generated public key to the machine's authorized keys, so that communication happens seamlessly
17. $ ssh localhost  add localhost to the list of known hosts for seamless communication between machines
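
If step 17 still prompts for a password, the usual cause is overly open permissions on the .ssh files (OpenSSH refuses keys it considers unsafe). A hedged fix-up, assuming the default StrictModes setting:

$ chmod 700 $HOME/.ssh
$ chmod 600 $HOME/.ssh/authorized_keys
$ ssh localhost   # should now log in without asking for a password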
18.
19. $ ifconfig  to find the IP address of the VM
20. Copy or download the Hadoop tarball (hadoop-1.x.tar.gz) from www.apache.org
21. $ sudo cp <hadoop-1.x.tar.gz from the download folder> /usr/local/  copy the tarball to the standard folder /usr/local/
22. $ su <user>  switch to a user who has sudo privileges
23. $ cd /usr/local/  get into the folder where Hadoop is to be installed
24. $ sudo tar xzf hadoop-1.x.tar.gz  unpack the Hadoop tar file (it creates a folder)
25. $ sudo mv hadoop-1.x hadoop  rename the folder carrying the version number to plain hadoop (for simplicity)
26. $ sudo chown -R hduser:hadoop hadoop  give hduser of the hadoop group full ownership of the hadoop folder
27. $ cd /home/hduser  get to the home directory of hduser
28. $ ls -al  list all files, including hidden ones
29. $ gedit .bashrc  modify hduser's .bashrc to set the path and environment for hduser
30. $ export HADOOP_HOME=/usr/local/hadoop  (add to .bashrc, together with the lines in steps 31-32)

31. $ export JAVA_HOME=/usr  (add to .bashrc)
32. $ export PATH=$PATH:$HADOOP_HOME/bin  (add to .bashrc)
33. $ sudo mkdir -p /app/hadoop/tmp  this folder is created to act as temporary storage for Hadoop
34. $ sudo chown -R hduser:hadoop /app/hadoop/tmp  give hduser full privileges on the folder
35. $ cd /usr/local/hadoop/conf/  change to the folder where Hadoop configuration is done
36. $ nano hadoop-env.sh  open the file in the editor and add the line below
    a. $ export JAVA_HOME=/usr  setting JAVA_HOME
37. $ nano core-site.xml  open in the editor and add the following lines

<property>
<name>hadoop.tmp.dir</name>
<value>/app/hadoop/tmp</value>
<description>A base for other temporary directories.</description>
</property>
<property>
<name>fs.default.name</name>
<value>hdfs://localhost:54310</value>
<description>Default FS, NN Machine and the port#</description>
</property>
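
The fs.default.name value makes hdfs://localhost:54310 the filesystem that every hadoop command talks to by default. A small sanity check, runnable once the daemons are up (steps 42-43); both commands below should list the same contents:

$ hadoop fs -ls /                          # uses fs.default.name implicitly
$ hadoop fs -ls hdfs://localhost:54310/    # spells the same filesystem out explicitly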

38. $ nano hdfs-site.xml  open in the editor and add the following lines

<property>
<name>dfs.replication</name>
<value>1</value>
<description>Default block replication.
The actual number of replications can be specified when the file is created.
The default is used if replication is not specified in create time.
</description>
</property>
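
dfs.replication is set to 1 because a pseudo-distributed cluster has only one DataNode to hold block copies. As the description notes, it is only a default; a sketch of overriding it per file (hypothetical paths, and a factor above 1 only makes sense on a real multi-node cluster):

$ hadoop fs -D dfs.replication=2 -put localfile /user/hduser/remotefile  # set at create time
$ hadoop fs -setrep -w 2 /user/hduser/remotefile                         # change it afterwards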

39. $ nano mapred-site.xml  open in the editor and add the following lines

<property>
<name>mapred.job.tracker</name>
<value>localhost:54311</value>
<description>The host and port that the MapReduce job tracker runs
at. If "local", then jobs are run in-process as a single map
and reduce task.
</description>
</property>
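
Besides the RPC ports configured above (54310 for the NameNode, 54311 for the JobTracker), the Hadoop 1.x daemons also serve web UIs, by default on port 50070 (NameNode) and 50030 (JobTracker). A quick check once the daemons are running (steps 42-43):

$ curl -s http://localhost:50070 | head   # NameNode status page
$ curl -s http://localhost:50030 | head   # JobTracker status page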

40. $ su hduser  change to the user under whom Hadoop is configured to run
41. $ hadoop namenode -format  create the HDFS structure for Hadoop
42. $ start-dfs.sh  a shell script that starts the DFS-related daemons
43. $ start-mapred.sh  a shell script that starts the MapReduce-related daemons
44. $ jps  show all the active Hadoop daemons and their process IDs
45. $ su <user>  change to a user with sudo privileges
46. $ sudo apt-get install openjdk-7-jdk  to install jps (in case it is not installed)
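
In a healthy pseudo-distributed setup, step 44 typically reports the five Hadoop 1.x daemons plus jps itself (the process IDs shown here are illustrative; yours will differ):

$ jps
2287 NameNode
2421 DataNode
2563 SecondaryNameNode
2651 JobTracker
2789 TaskTracker
2890 Jps

As a final end-to-end check, the bundled examples jar can run a tiny MapReduce job (the jar name carries the release's version number, so adjust the pattern to match the tarball that was unpacked):

$ hadoop jar $HADOOP_HOME/hadoop-examples-*.jar pi 2 5   # estimates pi with 2 maps, 5 samples each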

####################################################################################################
