
Exp2 Hadoop

The document shows the steps taken to run a word count MapReduce job on a Hadoop cluster. It includes starting HDFS and YARN, placing an input file in HDFS, running the wordcount example job, and viewing the output. The job counts the number of occurrences of each word in the input file and writes the results to the output directory in HDFS.

Uploaded by

Abdul Wajeed

C:\Windows\System32>cd..

C:\Windows>cd..

C:\>cd C:\hadoop-3.2.4\bin

C:\hadoop-3.2.4\bin>cd..

C:\hadoop-3.2.4>cd sbin

C:\hadoop-3.2.4\sbin>start-dfs

C:\hadoop-3.2.4\sbin>start-yarn
starting yarn daemons

C:\hadoop-3.2.4\sbin>jps
13156 Jps
3828 DataNode
5224 NameNode
8424 NodeManager
3388 ResourceManager

C:\hadoop-3.2.4\sbin>hadoop fs -mkdir /input

C:\hadoop-3.2.4\sbin>hadoop fs -put C:/hadoop-3.2.4/data.txt /input

C:\hadoop-3.2.4\sbin>hadoop fs -ls /input/


Found 1 items
-rw-r--r-- 1 ASUS supergroup 39 2024-01-30 15:43 /input/data.txt

C:\hadoop-3.2.4\sbin>hadoop dfs -cat /input/data.txt


DEPRECATED: Use of this script to execute hdfs command is deprecated.
Instead use the hdfs command for it.
Hello
Hi
Hello
Good morning
Teacher
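Before running the actual Hadoop job below, the word count logic itself can be sketched locally. The following is a minimal Python simulation of the map and reduce phases (not the real Java `wordcount` example shipped with Hadoop), applied to the five lines of `data.txt` shown above:

```python
from collections import Counter

# The five lines of /input/data.txt as printed by hadoop fs -cat
lines = ["Hello", "Hi", "Hello", "Good morning", "Teacher"]

# Map phase: emit a (word, 1) pair for every whitespace-separated token
pairs = [(word, 1) for line in lines for word in line.split()]

# Reduce phase: sum the counts for each distinct word
counts = Counter()
for word, n in pairs:
    counts[word] += n

# Hadoop emits keys in sorted order; ASCII sorting puts uppercase
# words ("Good", "Hello", ...) before lowercase ones ("morning")
for word in sorted(counts):
    print(word, counts[word])
```

The sorted output matches the job's final `/out` contents shown at the end of this transcript, including the uppercase-before-lowercase ordering.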
C:\hadoop-3.2.4\sbin>hadoop jar

C:\hadoop-3.2.4\sbin>hadoop jar C:/hadoop-3.2.4/share/hadoop/mapreduce/hadoop-mapreduce-examples-3.2.4.jar
An example program must be given as the first argument.
Valid program names are:
aggregatewordcount: An Aggregate based map/reduce program that counts the words in the input files.
aggregatewordhist: An Aggregate based map/reduce program that computes the histogram of the words in the input files.
bbp: A map/reduce program that uses Bailey-Borwein-Plouffe to compute exact digits of Pi.
dbcount: An example job that count the pageview counts from a database.
distbbp: A map/reduce program that uses a BBP-type formula to compute exact bits of Pi.
grep: A map/reduce program that counts the matches of a regex in the input.
join: A job that effects a join over sorted, equally partitioned datasets
multifilewc: A job that counts words from several files.
pentomino: A map/reduce tile laying program to find solutions to pentomino problems.
pi: A map/reduce program that estimates Pi using a quasi-Monte Carlo method.
randomtextwriter: A map/reduce program that writes 10GB of random textual data per node.
randomwriter: A map/reduce program that writes 10GB of random data per node.
secondarysort: An example defining a secondary sort to the reduce.
sort: A map/reduce program that sorts the data written by the random writer.
sudoku: A sudoku solver.
teragen: Generate data for the terasort
terasort: Run the terasort
teravalidate: Checking results of terasort
wordcount: A map/reduce program that counts the words in the input files.
wordmean: A map/reduce program that counts the average length of the words in the input files.
wordmedian: A map/reduce program that counts the median length of the words in the input files.
wordstandarddeviation: A map/reduce program that counts the standard deviation of the length of the words in the input files.

C:\hadoop-3.2.4\sbin>hadoop jar C:/hadoop-3.2.4/share/hadoop/mapreduce/hadoop-mapreduce-examples-3.2.4.jar wordcount /input /out
2024-01-30 15:53:02,627 INFO client.RMProxy: Connecting to ResourceManager at /0.0.0.0:8032
2024-01-30 15:53:03,652 INFO mapreduce.JobResourceUploader: Disabling Erasure Coding for path: /tmp/hadoop-yarn/staging/ASUS/.staging/job_1706608858316_0001
2024-01-30 15:53:03,937 INFO input.FileInputFormat: Total input files to process : 1
2024-01-30 15:53:04,042 INFO mapreduce.JobSubmitter: number of splits:1
2024-01-30 15:53:04,229 INFO mapreduce.JobSubmitter: Submitting tokens for job: job_1706608858316_0001
2024-01-30 15:53:04,231 INFO mapreduce.JobSubmitter: Executing with tokens: []
2024-01-30 15:53:04,462 INFO conf.Configuration: resource-types.xml not found
2024-01-30 15:53:04,462 INFO resource.ResourceUtils: Unable to find 'resource-types.xml'.
2024-01-30 15:53:05,138 INFO impl.YarnClientImpl: Submitted application application_1706608858316_0001
2024-01-30 15:53:05,187 INFO mapreduce.Job: The url to track the job: http://DESKTOP-1969SCE:8088/proxy/application_1706608858316_0001/
2024-01-30 15:53:05,188 INFO mapreduce.Job: Running job: job_1706608858316_0001
2024-01-30 15:53:19,741 INFO mapreduce.Job: Job job_1706608858316_0001 running in uber mode : false
2024-01-30 15:53:19,755 INFO mapreduce.Job: map 0% reduce 0%
2024-01-30 15:53:27,941 INFO mapreduce.Job: map 100% reduce 0%
2024-01-30 15:53:36,070 INFO mapreduce.Job: map 100% reduce 100%
2024-01-30 15:53:36,079 INFO mapreduce.Job: Job job_1706608858316_0001 completed successfully
2024-01-30 15:53:36,225 INFO mapreduce.Job: Counters: 54
File System Counters
FILE: Number of bytes read=66
FILE: Number of bytes written=478269
FILE: Number of read operations=0
FILE: Number of large read operations=0
FILE: Number of write operations=0
HDFS: Number of bytes read=140
HDFS: Number of bytes written=40
HDFS: Number of read operations=8
HDFS: Number of large read operations=0
HDFS: Number of write operations=2
HDFS: Number of bytes read erasure-coded=0
Job Counters
Launched map tasks=1
Launched reduce tasks=1
Data-local map tasks=1
Total time spent by all maps in occupied slots (ms)=5401
Total time spent by all reduces in occupied slots (ms)=5719
Total time spent by all map tasks (ms)=5401
Total time spent by all reduce tasks (ms)=5719
Total vcore-milliseconds taken by all map tasks=5401
Total vcore-milliseconds taken by all reduce tasks=5719
Total megabyte-milliseconds taken by all map tasks=5530624
Total megabyte-milliseconds taken by all reduce tasks=5856256
Map-Reduce Framework
Map input records=5
Map output records=6
Map output bytes=60
Map output materialized bytes=66
Input split bytes=101
Combine input records=6
Combine output records=5
Reduce input groups=5
Reduce shuffle bytes=66
Reduce input records=5
Reduce output records=5
Spilled Records=10
Shuffled Maps =1
Failed Shuffles=0
Merged Map outputs=1
GC time elapsed (ms)=142
CPU time spent (ms)=809
Physical memory (bytes) snapshot=465309696
Virtual memory (bytes) snapshot=646627328
Total committed heap usage (bytes)=317194240
Peak Map Physical memory (bytes)=287711232
Peak Map Virtual memory (bytes)=400330752
Peak Reduce Physical memory (bytes)=185565184
Peak Reduce Virtual memory (bytes)=252416000
Shuffle Errors
BAD_ID=0
CONNECTION=0
IO_ERROR=0
WRONG_LENGTH=0
WRONG_MAP=0
WRONG_REDUCE=0
File Input Format Counters
Bytes Read=39
File Output Format Counters
Bytes Written=40
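The framework counters above line up with the input file. A small Python sanity check (a simulation of the counter arithmetic, not Hadoop itself) shows where the key numbers come from: 5 input lines, 6 emitted tokens, and 5 distinct words after the combiner merges the duplicate "Hello":

```python
# The five lines of /input/data.txt
lines = ["Hello", "Hi", "Hello", "Good morning", "Teacher"]

# Tokens emitted by the map phase, one (word, 1) pair each
tokens = [word for line in lines for word in line.split()]

num_input_records = len(lines)        # Map input records = 5
num_map_outputs = len(tokens)         # Map output records = Combine input records = 6
num_unique_words = len(set(tokens))   # Combine output = Reduce input/output records = 5
```

"Spilled Records=10" is consistent with this as well: 5 combined records spilled on the map side plus 5 on the reduce side.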

C:\hadoop-3.2.4\sbin>hadoop fs -cat /out/*


Good 1
Hello 2
Hi 1
Teacher 1
morning 1

C:\hadoop-3.2.4\sbin>
