
How to Execute a WordCount Program in MapReduce Using Cloudera Distribution Hadoop (CDH)

For the Lab on Wednesday (27/9/23)


The following steps show how to write and run a MapReduce word count program.

Input:

Hello I am GeeksforGeeks
Hello I am an Intern

Output:

GeeksforGeeks 1
Hello 2
I 2
Intern 1
am 2
an 1
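
Before writing the Hadoop classes, it can help to see the same computation in plain Java. The sketch below is only an illustration (the class name WordCountSketch is an assumption, not part of the lab): it splits each line on spaces and adds 1 to a running count per word, which is exactly what the mapper and reducer below do together, and it prints the output listed above.

import java.util.Map;
import java.util.TreeMap;

public class WordCountSketch {
    public static void main(String[] args) {
        String[] lines = { "Hello I am GeeksforGeeks", "Hello I am an Intern" };

        // TreeMap keeps the words sorted, matching the order of the expected output
        Map<String, Integer> counts = new TreeMap<String, Integer>();

        for (String line : lines) {
            // The mapper emits (word, 1) for every word; the reducer sums the 1s per word
            for (String word : line.split(" ")) {
                if (word.length() > 0) {
                    Integer old = counts.get(word);
                    counts.put(word, old == null ? 1 : old + 1);
                }
            }
        }

        for (Map.Entry<String, Integer> entry : counts.entrySet()) {
            System.out.println(entry.getKey() + " " + entry.getValue());
        }
    }
}

In the actual job, this counting is split between mapper and reducer tasks that run on data stored in HDFS, but the result is the same.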

Steps:

•  First open Eclipse -> then select File -> New -> Java Project -> name it WordCount -> then click Finish.
•  Create three Java classes in the project. Name them WCDriver (containing the main function), WCMapper, and WCReducer.
•  You have to include two reference libraries for that: right-click the project -> select Build Path -> click Configure Build Path.
•  In the Configure Build Path dialog, click the Add External JARs option on the right-hand side and add the two files mentioned below. You can find them under /usr/lib/:
1. /usr/lib/hadoop-0.20-mapreduce/hadoop-core-2.6.0-mr1-cdh5.13.0.jar
2. /usr/lib/hadoop/hadoop-common-2.6.0-cdh5.13.0.jar

Mapper Code (Java): Copy and paste this program into the WCMapper class file.

// Importing libraries
import java.io.IOException;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapred.MapReduceBase;
import org.apache.hadoop.mapred.Mapper;
import org.apache.hadoop.mapred.OutputCollector;
import org.apache.hadoop.mapred.Reporter;

public class WCMapper extends MapReduceBase implements Mapper<LongWritable, Text, Text, IntWritable> {

    // Map function
    public void map(LongWritable key, Text value,
                    OutputCollector<Text, IntWritable> output, Reporter rep) throws IOException
    {
        String line = value.toString();

        // Splitting the line on spaces
        for (String word : line.split(" "))
        {
            if (word.length() > 0)
            {
                output.collect(new Text(word), new IntWritable(1));
            }
        }
    }
}
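
If you want to sanity-check the mapper outside of a full Hadoop job, a small driver like the hypothetical WCMapperCheck below (not part of the lab submission, and assuming the two CDH jars above are on the classpath) can call map() directly and print whatever it emits.

import java.io.IOException;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapred.OutputCollector;

public class WCMapperCheck {
    public static void main(String[] args) throws IOException {
        // Print every (word, 1) pair instead of sending it to the shuffle phase
        OutputCollector<Text, IntWritable> printer = new OutputCollector<Text, IntWritable>() {
            public void collect(Text key, IntWritable value) {
                System.out.println(key + "\t" + value);
            }
        };

        // Reporter is not used by WCMapper, so null is enough for this quick check
        new WCMapper().map(new LongWritable(0), new Text("Hello I am GeeksforGeeks"), printer, null);
        // Expected console output: Hello 1, I 1, am 1, GeeksforGeeks 1 (tab-separated, one per line)
    }
}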

Reducer Code (Java): Copy and paste this program into the WCReducer class file.

// Importing libraries
import java.io.IOException;
import java.util.Iterator;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapred.MapReduceBase;
import org.apache.hadoop.mapred.OutputCollector;
import org.apache.hadoop.mapred.Reducer;
import org.apache.hadoop.mapred.Reporter;

public class WCReducer extends MapReduceBase implements Reducer<Text, IntWritable, Text, IntWritable> {

    // Reduce function
    public void reduce(Text key, Iterator<IntWritable> value,
                       OutputCollector<Text, IntWritable> output,
                       Reporter rep) throws IOException
    {
        int count = 0;

        // Counting the frequency of each word
        while (value.hasNext())
        {
            IntWritable i = value.next();
            count += i.get();
        }

        output.collect(key, new IntWritable(count));
    }
}
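
In a real job, the framework groups the mapper output by key before calling reduce(), so the reducer for "Hello" receives an iterator over the values [1, 1]. The hypothetical WCReducerCheck below (again, only an illustration and not part of the lab) simulates that call and should print "Hello 2".

import java.io.IOException;
import java.util.Arrays;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapred.OutputCollector;

public class WCReducerCheck {
    public static void main(String[] args) throws IOException {
        // Simulate the grouped shuffle output for the key "Hello": two occurrences, each counted as 1
        new WCReducer().reduce(
                new Text("Hello"),
                Arrays.asList(new IntWritable(1), new IntWritable(1)).iterator(),
                new OutputCollector<Text, IntWritable>() {
                    public void collect(Text key, IntWritable value) {
                        System.out.println(key + "\t" + value); // prints "Hello  2"
                    }
                },
                null); // Reporter is not used by WCReducer
    }
}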

Driver Code (Java): Copy and paste this program into the WCDriver class file.

// Importing libraries
import java.io.IOException;
import org.apache.hadoop.conf.Configured;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapred.FileInputFormat;
import org.apache.hadoop.mapred.FileOutputFormat;
import org.apache.hadoop.mapred.JobClient;
import org.apache.hadoop.mapred.JobConf;
import org.apache.hadoop.util.Tool;
import org.apache.hadoop.util.ToolRunner;

public class WCDriver extends Configured implements Tool {

    public int run(String args[]) throws IOException
    {
        if (args.length < 2)
        {
            System.out.println("Please give valid inputs");
            return -1;
        }

        JobConf conf = new JobConf(WCDriver.class);
        FileInputFormat.setInputPaths(conf, new Path(args[0]));
        FileOutputFormat.setOutputPath(conf, new Path(args[1]));
        conf.setMapperClass(WCMapper.class);
        conf.setReducerClass(WCReducer.class);
        conf.setMapOutputKeyClass(Text.class);
        conf.setMapOutputValueClass(IntWritable.class);
        conf.setOutputKeyClass(Text.class);
        conf.setOutputValueClass(IntWritable.class);
        JobClient.runJob(conf);
        return 0;
    }

    // Main Method
    public static void main(String args[]) throws Exception
    {
        int exitCode = ToolRunner.run(new WCDriver(), args);
        System.out.println(exitCode);
    }
}

•  Now you have to make a jar file. Right-click the project -> click Export -> select Jar File as the export destination -> name the jar file (WordCount.jar) -> click Next -> finally click Finish. Now copy this jar file into the workspace directory of Cloudera.

•  Open the terminal on CDH and change to the workspace directory; you can do this with the "cd workspace/" command. Now create a text file (WordCCountFinal.txt) and move it to HDFS. For that, run the commands below in the terminal (remember, you should be in the same directory as the jar file you just created).

•  cat >> WordCCountFinal.txt

Type your own text here. After finishing the text, press Ctrl+D to save the file and return to the prompt.

•  Then create a directory in HDFS using the command below:

sudo -u hdfs hadoop dfs -mkdir /WordCCount

•  Add WordCCountFinal.txt to Hadoop (HDFS) using the command below:

sudo -u hdfs hadoop dfs -put WordCCountFinal.txt /WordCCount/WordCCountFinal.txt

•  To view the contents of the file in HDFS:

sudo -u hdfs hadoop dfs -cat /WordCCount/WordCCountFinal.txt
•  Now run the jar file using the command below:

sudo -u hdfs hadoop jar /home/cloudera/workspace/WordCount.jar WCDriver /WordCCount/WordCCountFinal.txt /WordCCount/OutputWC

•  After executing the jar file, run the command below to see the output:

hadoop fs -ls /WordCCount

•  You can see the output directory OutputWC listed there. The results are stored in the part file(s) inside it; to print them, type:

sudo -u hdfs hadoop dfs -cat /WordCCount/OutputWC/part-*

Thanks and Regards


Kimmi Kumari
