ccpractical 7

Uploaded by

Akshay Rathod

Cloud Computing Lab

Practical-7
Aim: Demonstrate the use of map and reduce tasks.
Theory:
MapReduce
MapReduce is a programming model and data processing tool used to process large data
sets in parallel across a distributed cluster. It was introduced in 2004 in the Google paper
"MapReduce: Simplified Data Processing on Large Clusters." The paradigm has two phases:
the mapper phase and the reducer phase. The mapper takes its input as key-value pairs, and
its output is fed to the reducer as input. The reduce function runs only after the mappers
have finished. The reducer also takes its input as key-value pairs, and its output is the
final output.

Steps in MapReduce:

The map function takes data as key-value pairs and returns a list of key-value pairs. The
keys in this list are not necessarily unique. The Hadoop framework then applies sort and
shuffle to the map output: it groups the pairs by key and emits each unique key together
with the list of values associated with it. The output of sort and shuffle is sent to the
reducer phase. The reducer applies a user-defined function to the list of values for each
unique key, and the final output is stored or displayed.
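
The three stages described above can be sketched in plain Python. This is only an in-memory illustration of the data flow for a word count, not the Hadoop API: the function names (map_phase, shuffle_and_sort, reduce_phase, word_count) and the sample input are assumptions made for this example.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line.

    The same key can appear many times in the output.
    """
    return [(word.lower(), 1) for word in line.split()]


def shuffle_and_sort(pairs):
    """Shuffle and sort: group values by key and order the unique keys."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return sorted(grouped.items())


def reduce_phase(key, values):
    """Reduce: collapse the list of values for one unique key into a sum."""
    return key, sum(values)


def word_count(lines):
    """Run map, then shuffle and sort, then reduce over a list of lines."""
    mapped = [pair for line in lines for pair in map_phase(line)]
    return [reduce_phase(key, values) for key, values in shuffle_and_sort(mapped)]


if __name__ == "__main__":
    sample = ["hadoop map reduce", "map reduce map"]  # hypothetical input
    for word, count in word_count(sample):
        print(f"{word}\t{count}")
```

In a real Hadoop job the map and reduce calls run on different nodes and the framework performs the grouping; here everything runs in one process so that each stage can be inspected on its own.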

Step 1: Install Java JDK 8

First, install Java JDK 8 on your system. On a Debian- or Ubuntu-based system, run the
following command:
sudo apt install openjdk-8-jdk

Matoshri Pratishthan’s Group Of Institutions,Nanded

The same package can also be installed with apt-get, confirming automatically with -y:
sudo apt-get install openjdk-8-jdk -y

Verify the Java installation:
java -version

Step 3: Create a dedicated Hadoop user

sudo useradd hadoop

sudo passwd hadoop

Now add this configuration to the core-site.xml file.

Step 4: Open .bashrc for editing and install the ssh package
sudo nano .bashrc
sudo apt-get install ssh

Step 5: Download the latest Hadoop version from the Apache Hadoop

Step 6: Add the configuration to each of the following files

hdfs-site.xml
core-site.xml
mapred-site.xml
yarn-site.xml

Step 7: MapReduce

• It can be used for distributed pattern-based searching.

• It can also be used in machine learning.
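
As an illustration of pattern-based searching, the sketch below expresses a grep-style search as a map step and a reduce step. It runs in a single Python process; the function names (grep_map, grep_reduce, distributed_grep), the file names, and the sample lines are all hypothetical and are not part of Hadoop.

```python
import re
from collections import defaultdict


def grep_map(filename, line_number, line, pattern):
    """Map: emit (filename, line_number) if the line matches the pattern."""
    if re.search(pattern, line):
        return [(filename, line_number)]
    return []


def grep_reduce(filename, line_numbers):
    """Reduce: collect the matching line numbers for one file, in order."""
    return filename, sorted(line_numbers)


def distributed_grep(documents, pattern):
    """Apply the map step to every line, group by file, then reduce."""
    grouped = defaultdict(list)
    for filename, lines in documents.items():
        for number, line in enumerate(lines, start=1):
            for key, value in grep_map(filename, number, line, pattern):
                grouped[key].append(value)
    return dict(grep_reduce(key, values) for key, values in sorted(grouped.items()))


if __name__ == "__main__":
    # Hypothetical documents; on a cluster each file would be a separate input split.
    docs = {
        "a.txt": ["error: disk full", "all good"],
        "b.txt": ["no issues", "ERROR ignored", "fatal error"],
    }
    print(distributed_grep(docs, r"error"))
```

The search is case-sensitive because re.search is called without flags, so the line "ERROR ignored" does not match the pattern "error". Files with no matching line do not appear in the result at all.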

Step 8: Output of the MapReduce job

Step 9: Open mpgi

Step 10: Open wordcountTutorial

Step 11: Open input and output

Step 12: Open the input file, input.text

Step 13: Open the output file, part-r-00000
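
A reducer output file such as part-r-00000 can also be read programmatically. The sketch below assumes the job used Hadoop's default text output, where each line holds a key and a value separated by a tab, and that the values are integer counts as in a word count job. The function name parse_reducer_output and the sample text are hypothetical; they do not reproduce the actual output of this practical.

```python
def parse_reducer_output(text):
    """Parse tab-separated "key<TAB>count" lines into a dictionary.

    Assumes one record per line and an integer count after the last tab.
    Blank lines are skipped.
    """
    counts = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        key, value = line.rsplit("\t", 1)
        counts[key] = int(value)
    return counts


if __name__ == "__main__":
    # Hypothetical file content, for illustration only.
    sample = "hadoop\t1\nmap\t3\nreduce\t2\n"
    for word, count in parse_reducer_output(sample).items():
        print(word, count)
```

To use it on a file copied to the local machine, read the file as text and pass the contents to parse_reducer_output.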

Conclusion:
Thus, we have successfully demonstrated the use of map and reduce tasks.
