
CDH3 Pseudo installation on Ubuntu

1) Do not create a user named hadoop; the CDH3 packages reserve that name (they create a hadoop group and dedicated hdfs and mapred users), and a pre-existing hadoop user will cause conflicts during installation.


2) Install Java
Download the JDK self-extracting binary (jdk-6u30-linux-x**, where ** depends on your architecture) to the desktop, then copy and run it:
$ cd ~/Desktop
$ sudo cp jdk-6u30-linux-x** /usr/local
$ cd /usr/local
$ sudo sh jdk-6u30-linux-x**
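The binary unpacks into /usr/local/jdk1.6.0_30, which is the path used throughout this guide. Verify the extraction with:
$ /usr/local/jdk1.6.0_30/bin/java -version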
3) Install the CDH3 repository package
Go to http://www.cloudera.com/content/cloudera-content/cloudera-docs/CDH3/CDH3u6/CDH3Installation-Guide/CDH3-Installation-Guide.html
Click on "Installing CDH3 on Ubuntu and Debian Systems", then click "this link for a Maverick system" to download the repository package.
Install it with the GDebi package installer, or save it and run:
$ sudo dpkg -i Downloads/cdh3-repository_1.0_all.deb
$ sudo apt-get update
4) Install Hadoop
$ apt-cache search hadoop    (must list all available Hadoop packages)
$ sudo apt-get install hadoop-0.20 hadoop-0.20-native
$ sudo apt-get install hadoop-0.20-<daemon type>    (repeat for each daemon type to install all daemons; see the loop below)
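A pseudo-distributed node needs all five daemons. As a shortcut, this loop installs each daemon package individually (package names as shown by apt-cache search above):
$ for d in namenode secondarynamenode datanode jobtracker tasktracker; do sudo apt-get install -y hadoop-0.20-$d; done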
5) Set Java and Hadoop Home
Edit ~/.bashrc:

$ gedit ~/.bashrc

# Set Hadoop-related environment variables


export HADOOP_HOME=/usr/lib/hadoop

export PATH=$PATH:/usr/lib/hadoop/bin

# Set JAVA_HOME
export JAVA_HOME=/usr/local/jdk1.6.0_30
export PATH=$PATH:/usr/local/jdk1.6.0_30/bin
Close all terminals, open a new one (or run: source ~/.bashrc), and test:
$ echo $JAVA_HOME
$ echo $HADOOP_HOME
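If both variables print correctly, the hadoop command from HADOOP_HOME/bin should also resolve on the PATH:
$ hadoop version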
6) Add the dedicated users hdfs and mapred (created by the CDH3 packages) to the hadoop group

$ sudo gpasswd -a hdfs hadoop


$ sudo gpasswd -a mapred hadoop
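Verify the group membership took effect:
$ id hdfs     (should list hadoop among the groups)
$ id mapred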
7) Configuration
$ cd /usr/lib/hadoop/conf
Set Java Home in hadoop-env.sh
$ sudo gedit hadoop-env.sh
export JAVA_HOME=/usr/local/jdk1.6.0_30

8) core-site.xml
$ sudo gedit core-site.xml
Add the following properties inside the <configuration> element:
<property>
  <name>hadoop.tmp.dir</name>
  <value>/usr/lib/hadoop/tmp</value>
</property>
<property>
  <name>fs.default.name</name>
  <value>hdfs://localhost:8020</value>
</property>
$ sudo mkdir /usr/lib/hadoop/tmp
$ sudo chmod 750 /usr/lib/hadoop/tmp/
$ sudo chown hdfs:hadoop /usr/lib/hadoop/tmp/
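For reference, the complete core-site.xml after this step should look roughly as follows; the stock file already provides the <configuration> element, and hdfs-site.xml and mapred-site.xml in the next steps follow the same pattern:
<?xml version="1.0"?>
<configuration>
  <property>
    <name>hadoop.tmp.dir</name>
    <value>/usr/lib/hadoop/tmp</value>
  </property>
  <property>
    <name>fs.default.name</name>
    <value>hdfs://localhost:8020</value>
  </property>
</configuration>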
9) hdfs-site.xml
$ sudo gedit hdfs-site.xml
Add the following properties inside the <configuration> element:
<property>
  <name>dfs.permissions</name>
  <value>false</value>
</property>
<property>
  <name>dfs.name.dir</name>
  <value>/storage/name</value>
</property>
<property>
  <name>dfs.data.dir</name>
  <value>/storage/data</value>
</property>
<property>
  <name>dfs.replication</name>
  <value>1</value>
</property>
$ sudo mkdir /storage
$ sudo chmod 775 /storage/
$ sudo chown hdfs:hadoop /storage/
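Optionally pre-create the name and data directories so the hdfs user owns them outright; otherwise the namenode format and datanode startup will create them under the writable /storage:
$ sudo mkdir -p /storage/name /storage/data
$ sudo chown -R hdfs:hadoop /storage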

10) mapred-site.xml
$ sudo gedit mapred-site.xml
Add the following properties inside the <configuration> element:
<property>
  <name>mapred.job.tracker</name>
  <value>hdfs://localhost:8021</value>
</property>
<property>
  <name>mapred.system.dir</name>
  <value>/mapred/system</value>
</property>
<property>
  <name>mapred.local.dir</name>
  <value>/mapred/local</value>
</property>
<property>
  <name>mapred.temp.dir</name>
  <value>/mapred/temp</value>
</property>
$ sudo mkdir /mapred

$ sudo chmod 775 /mapred


$ sudo chown mapred:hadoop /mapred
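Optionally pre-create the local and temp subdirectories; otherwise the MapReduce daemons will create them under the writable /mapred:
$ sudo mkdir -p /mapred/local /mapred/temp
$ sudo chown -R mapred:hadoop /mapred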
11) User Assignment
Set the daemon user variables (e.g. in ~/.bashrc, alongside the exports from step 5):

export HADOOP_NAMENODE_USER=hdfs
export HADOOP_SECONDARYNAMENODE_USER=hdfs
export HADOOP_DATANODE_USER=hdfs
export HADOOP_JOBTRACKER_USER=mapred
export HADOOP_TASKTRACKER_USER=mapred

12) Format namenode


$ cd /usr/lib/hadoop/bin/

$ sudo -u hdfs hadoop namenode -format


You should see a "successfully formatted" message; otherwise, read the error, correct the cause, and re-run the format.
13) Start Daemons
$ sudo /etc/init.d/hadoop-0.20-namenode start
$ sudo /etc/init.d/hadoop-0.20-secondarynamenode start
$ sudo /etc/init.d/hadoop-0.20-jobtracker start
$ sudo /etc/init.d/hadoop-0.20-datanode start
$ sudo /etc/init.d/hadoop-0.20-tasktracker start
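Note that mapred.system.dir from step 10 lives in HDFS, not on the local disk. If the JobTracker log shows it failing to create that directory, create it manually once HDFS is up and restart the JobTracker:
$ sudo -u hdfs hadoop fs -mkdir /mapred/system
$ sudo -u hdfs hadoop fs -chown mapred:hadoop /mapred/system
$ sudo /etc/init.d/hadoop-0.20-jobtracker restart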

Check each daemon's log under /var/log/hadoop-0.20 for errors.

Check that all ports are listening:
$ sudo netstat -plten
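You can also list the running Java processes with jps, which ships with the JDK (path from step 2):
$ sudo /usr/local/jdk1.6.0_30/bin/jps
NameNode, SecondaryNameNode, DataNode, JobTracker and TaskTracker should all appear.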
14) Check the Web UIs
http://localhost:50070 - NameNode (HDFS admin)
http://localhost:50030 - JobTracker (MapReduce)
