17CSCS2T2

This document provides information about the course "Big Data Analytics" including: 1) The course aims to provide foundational training in basic and advanced big data methods and tools like MapReduce and Hadoop. 2) Key course outcomes include learning big data use cases, building scalable distributed systems using Hadoop, applying MapReduce concepts, and using Hadoop ecosystem components. 3) The course covers topics like introduction to big data and Hadoop, Hadoop architecture and storage, MapReduce, and the Hadoop ecosystem including components like Hive and HBase.

Uploaded by

Soundarrajan OGP

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views

17CSCS2T2

Uploaded by

Soundarrajan OGP

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

PVP 17

Prasad V. Potluri Siddhartha Institute of Technology:: Vijayawada.

Department of Computer Science and Engineering
I/II M.Tech. (CSE) - (Second Semester)
17CSCS2T2 BIG DATA ANALYTICS Credits: 4
Lecture: 4 Periods/week Internal Assessment: 40 Marks
Semester end examination: 60 Marks
__________________________________________________________________________________

Course Description

This course provides practical foundation level training that enables immediate and effective
participation in big data projects. The course provides grounding in basic and advanced methods
to big data technology and tools, including MapReduce and Hadoop and its ecosystem.

Course Outcomes:
At the end of the course, students should be able to:

CO1: Learn tips and tricks for Big data use cases and solutions
CO2: Learn about build and maintain reliable, scalable, distributed systems in big data
using Apache Hadoop
CO3: Apply MapReduce concepts in Distributed environemnt
CO4: Able to apply Hadoop ecosystem components

Unit-1

Introduction to Big data and Hadoop: Introduction – Distributed file system, Big data and
its importance, Six V’s, Drivers for Big Data, Big data Analytics, Applications of Big data,
algorithms using MapReduce.
Introduction to Hadoop: Big Data – Apache Hadoop & Hadoop EcoSystem – Moving Data
in and out of Hadoop – Understanding inputs and outputs of MapReduce - Data Serialization.

Unit-2
Hadoop Architecture, Hadoop Storage: HDFS, Common Hadoop Shell commands ,
Anatomy of File Write and Read., NameNode, Secondary NameNode, and DataNode,
Hadoop MapReduce paradigm, Map and Reduce tasks, Job, Task trackers - Cluster Setup –
SSH & Hadoop Configuration – HDFS Administering –Monitoring & Maintenance.
PVP 17

Unit-3
MAP REDUCE: Introduction – distributed file system – algorithms using map reduce,
Matrix-Vector Multiplication by Map Reduce – Hadoop - Understanding the Map Reduce
architecture - Writing Hadoop MapReduce Programs - Loading data into HDFS - Executing
the Map phase - Shuffling and sorting - Reducing phase execution.

Unit-4
HADOOP ECOSYSTEM AND YARN: Hadoop ecosystem components - Schedulers - Fair
and Capacity, Hadoop 2.0 New Features- NameNode High Availability, HDFS Federation,
MRv2, YARN, Running MRv1 in YARN. Introduction to Hive, HBASE, HiveQL,
Zookeeper.

Text Books:

1. Boris lublinsky, Kevin t. Smith, Alexey Yakubovich, “Professional Hadoop

Solutions”, Wiley, ISBN: 9788126551071, 2015.
2. Chris Eaton, Dirk deroos et al. , “Understanding Big data ”, McGraw Hill, 2012.
3. Tom White, “HADOOP: The definitive Guide” , O Reilly 2012.

Reference Books:

1. Vignesh Prajapati, “Big Data Analytics with R and Haoop”, Packet Publishing
2013.
2. Tom Plunkett, Brian Macdonald et al, “Oracle Big Data Handbook”, Oracle Press,
2014. http://www.bigdatauniversity.com/
3. Jy Liebowitz, “Big Data and Business analytics”,CRC press, 2013.
4. Big Data and Analytics, Seema Acharya, Subhashini Chellappan, Wiley
Publications, 2015.

Salesforce AI Specialist
No ratings yet
Salesforce AI Specialist
7 pages
Management Information Systems 7th Edition Sousa Oz Solution Manual
100% (44)
Management Information Systems 7th Edition Sousa Oz Solution Manual
16 pages
20IT503 - Big Data Analytics - Unit4
No ratings yet
20IT503 - Big Data Analytics - Unit4
73 pages
Software Engineering Manual
No ratings yet
Software Engineering Manual
85 pages
r18 - Big Data Analytics - Cse (DS)
0% (1)
r18 - Big Data Analytics - Cse (DS)
1 page
CC ZG522 Course Handout
No ratings yet
CC ZG522 Course Handout
6 pages
BigData and Hadoop - Syllabus
No ratings yet
BigData and Hadoop - Syllabus
2 pages
Big Data Analytics Syllabus
No ratings yet
Big Data Analytics Syllabus
2 pages
BIG DATA ANALYTICS (1)
No ratings yet
BIG DATA ANALYTICS (1)
20 pages
Coursera Report Divyansh Sahai CSF443
No ratings yet
Coursera Report Divyansh Sahai CSF443
7 pages
Koe097big Data
No ratings yet
Koe097big Data
1 page
Big Data Syllabus
No ratings yet
Big Data Syllabus
2 pages
BDA - Unit-1
No ratings yet
BDA - Unit-1
24 pages
Big Data Analytics Syllabus
No ratings yet
Big Data Analytics Syllabus
2 pages
BCA-BIGDATA-FIFTH_SEM-APPROVED-SYLLABUS
No ratings yet
BCA-BIGDATA-FIFTH_SEM-APPROVED-SYLLABUS
23 pages
LP BigData
No ratings yet
LP BigData
5 pages
B.Tech. CS_CE and CSE Syllabus 3rd Year 2024-25
No ratings yet
B.Tech. CS_CE and CSE Syllabus 3rd Year 2024-25
2 pages
Big Data Technology E1UJ502B
No ratings yet
Big Data Technology E1UJ502B
11 pages
22IS61 Big data analytics 2025
No ratings yet
22IS61 Big data analytics 2025
4 pages
IOT Analytics - AI361
No ratings yet
IOT Analytics - AI361
3 pages
Bigdata
No ratings yet
Bigdata
2 pages
DATA ANALYTICS Lab
No ratings yet
DATA ANALYTICS Lab
3 pages
Appendix-74
No ratings yet
Appendix-74
42 pages
BDS Course Handout - Intuit PDF
No ratings yet
BDS Course Handout - Intuit PDF
6 pages
Big Data Analytics With Lab
No ratings yet
Big Data Analytics With Lab
3 pages
Big Data Analytics
No ratings yet
Big Data Analytics
2 pages
Bite411l Big-data-Analytics TH 1.0 73 Bite411l 67 Acp
No ratings yet
Bite411l Big-data-Analytics TH 1.0 73 Bite411l 67 Acp
2 pages
Sample Tlep
No ratings yet
Sample Tlep
12 pages
Syllabus Big Data Analytics
No ratings yet
Syllabus Big Data Analytics
2 pages
Ccs334 Big Data Analytics
0% (1)
Ccs334 Big Data Analytics
2 pages
BDA-UNIT-1
No ratings yet
BDA-UNIT-1
32 pages
4.Syllabus_Copy
No ratings yet
4.Syllabus_Copy
2 pages
Coursera Report Ishaan Taneja 1000016551
No ratings yet
Coursera Report Ishaan Taneja 1000016551
7 pages
Unit 1
No ratings yet
Unit 1
19 pages
Big Data Analytics
No ratings yet
Big Data Analytics
3 pages
Cap456-Introduction To Big Data
No ratings yet
Cap456-Introduction To Big Data
1 page
big data analytics syallabus
No ratings yet
big data analytics syallabus
3 pages
Syllabus
No ratings yet
Syllabus
2 pages
CCS334 BDA Syllabus
No ratings yet
CCS334 BDA Syllabus
5 pages
big data sv publication
No ratings yet
big data sv publication
142 pages
Essentials of Big Data Griet
No ratings yet
Essentials of Big Data Griet
2 pages
BIG Data Syllabus
No ratings yet
BIG Data Syllabus
2 pages
Big Data Analytics
No ratings yet
Big Data Analytics
2 pages
Ccs334 - Big Data Analytics
75% (4)
Ccs334 - Big Data Analytics
2 pages
Training For Bigdata and Hadoop: #I Background and Introduction
No ratings yet
Training For Bigdata and Hadoop: #I Background and Introduction
9 pages
Big Data and Hadoop Developer
No ratings yet
Big Data and Hadoop Developer
7 pages
Big Data Analytics Digital Notes
No ratings yet
Big Data Analytics Digital Notes
119 pages
MCAD2232 (PRESS) BIG DATA and Its Applications
No ratings yet
MCAD2232 (PRESS) BIG DATA and Its Applications
140 pages
20CT1152
No ratings yet
20CT1152
3 pages
Big Data and Analytics Syllabus 2021
No ratings yet
Big Data and Analytics Syllabus 2021
3 pages
Bda Unit 2
No ratings yet
Bda Unit 2
57 pages
Digital Notes of Big Data Analytics Dated 5.1.2024
No ratings yet
Digital Notes of Big Data Analytics Dated 5.1.2024
175 pages
CS8091 BDA Unit1
No ratings yet
CS8091 BDA Unit1
63 pages
Big Data Analytics Course Outline (Fall 2020) : Dr. Tariq Mahmood 830 Am - 11 Am (Monday) Scope
No ratings yet
Big Data Analytics Course Outline (Fall 2020) : Dr. Tariq Mahmood 830 Am - 11 Am (Monday) Scope
3 pages
Syllabus of Big Data Analysis - Proposed
No ratings yet
Syllabus of Big Data Analysis - Proposed
2 pages
Blda Pract 2024
No ratings yet
Blda Pract 2024
59 pages
Big Data analyticsNEW SYLLABUS FRAMING
No ratings yet
Big Data analyticsNEW SYLLABUS FRAMING
3 pages
Big Data Syllabus
No ratings yet
Big Data Syllabus
1 page
2022-23-BDA-LAB Manual
No ratings yet
2022-23-BDA-LAB Manual
59 pages
Big Data Analytics- sem 7 CVMU
No ratings yet
Big Data Analytics- sem 7 CVMU
4 pages
Mastering Big Data and Hadoop: From Basics to Expert Proficiency
From Everand
Mastering Big Data and Hadoop: From Basics to Expert Proficiency
William Smith
No ratings yet
Hadoop Ecosystem for Big Data
From Everand
Hadoop Ecosystem for Big Data
Dr. Zemelak Goraga
No ratings yet
Advanced Hadoop Techniques: A Comprehensive Guide to Mastery
From Everand
Advanced Hadoop Techniques: A Comprehensive Guide to Mastery
Adam Jones
No ratings yet
Types of Information System: TPS, DSS & Pyramid Diagram
No ratings yet
Types of Information System: TPS, DSS & Pyramid Diagram
16 pages
1.0 Company Profile Electric Library 2021 - Compressed
No ratings yet
1.0 Company Profile Electric Library 2021 - Compressed
27 pages
General Description Features: High-Current, High-Performance Drmos Power Module
No ratings yet
General Description Features: High-Current, High-Performance Drmos Power Module
18 pages
DELTA IA-TC DTM B EN-DIN 20181004 Web
No ratings yet
DELTA IA-TC DTM B EN-DIN 20181004 Web
4 pages
Application Wise Process Steps For Stability Control - Fuji (HMI)
No ratings yet
Application Wise Process Steps For Stability Control - Fuji (HMI)
7 pages
H-Series-Video-Wall-Splicers-User-Manual-V1.12.0-2
No ratings yet
H-Series-Video-Wall-Splicers-User-Manual-V1.12.0-2
95 pages
Hotmail (2.319)
No ratings yet
Hotmail (2.319)
54 pages
How To Configure The SRTP Live View
No ratings yet
How To Configure The SRTP Live View
8 pages
Cambridge IGCSE™: Computer Science 0478/13
No ratings yet
Cambridge IGCSE™: Computer Science 0478/13
10 pages
Revised Obtaf For Approval (Re: Cover The Turnover of Sdcs in Camps Bushra, Bilal and Salman On Feb. 24 To 28, 2024)
100% (1)
Revised Obtaf For Approval (Re: Cover The Turnover of Sdcs in Camps Bushra, Bilal and Salman On Feb. 24 To 28, 2024)
4 pages
CompactLogix 5370 Controllers
No ratings yet
CompactLogix 5370 Controllers
336 pages
Best Software Company in Lucknow
No ratings yet
Best Software Company in Lucknow
4 pages
Difference Between Java Bean and EJB
100% (2)
Difference Between Java Bean and EJB
2 pages
Get (Ebook) Quantitative analysis for management by Badri, T.N.; Hale, Trevor S.; Hanna, Michael E.; Render, Barry; Stair, Ralph M ISBN 9789332568587, 9789332578692, 9332568588, 9332578699 PDF ebook with Full Chapters Now
100% (11)
Get (Ebook) Quantitative analysis for management by Badri, T.N.; Hale, Trevor S.; Hanna, Michael E.; Render, Barry; Stair, Ralph M ISBN 9789332568587, 9789332578692, 9332568588, 9332578699 PDF ebook with Full Chapters Now
55 pages
CIVI6731 Lecture (Week1)
No ratings yet
CIVI6731 Lecture (Week1)
30 pages
Chapter 18_sale of Goods_ Commentaries on European Contract Laws
No ratings yet
Chapter 18_sale of Goods_ Commentaries on European Contract Laws
112 pages
Description_ERDI12_0v9
No ratings yet
Description_ERDI12_0v9
43 pages
Computer Science Grade 1 - 3rd Term
No ratings yet
Computer Science Grade 1 - 3rd Term
4 pages
RoboCare Manual en
No ratings yet
RoboCare Manual en
17 pages
Rakesh M. Verma - David J. Marchette - Cybersecurity Analytics-CRC Press (2020)
No ratings yet
Rakesh M. Verma - David J. Marchette - Cybersecurity Analytics-CRC Press (2020)
357 pages
AWS - Route 53 Notes
No ratings yet
AWS - Route 53 Notes
4 pages
Accenture Everest Group Healthcare Data and Analytics Services PEAK Matrix Assessment 2023
No ratings yet
Accenture Everest Group Healthcare Data and Analytics Services PEAK Matrix Assessment 2023
13 pages
ARLC6 Service Manual
No ratings yet
ARLC6 Service Manual
28 pages
Adobe Scan 01-May-2022
No ratings yet
Adobe Scan 01-May-2022
9 pages
Exception Taken How France Has Defied Hollywood s New World Order Jonathan Buchsbaum download pdf
No ratings yet
Exception Taken How France Has Defied Hollywood s New World Order Jonathan Buchsbaum download pdf
24 pages
ERP - Case
No ratings yet
ERP - Case
13 pages
WCM Business Process - ACB
100% (1)
WCM Business Process - ACB
61 pages

17CSCS2T2

Uploaded by

17CSCS2T2

Uploaded by

PVP 17

Prasad V. Potluri Siddhartha Institute of Technology:: Vijayawada.

1. Boris lublinsky, Kevin t. Smith, Alexey Yakubovich, “Professional Hadoop

You might also like