0% found this document useful (0 votes)

65 views

Mid-Term Exam (30%) PROFESSOR: Oussama Derbel SECTION: 11112

The document is a mid-term exam for a Big Data course that includes: - Student identification information at the top - Exam rules listing allowed materials and prohibiting things like phones/devices and cheating - Two main exercises worth 40% and 70% of the grade: - The first asks students to define terms like data, big data, information, and Hadoop and list components of the Hadoop ecosystem - The second asks students to describe the Map, Combine, and Reduce operations to count word occurrences across 5 sample text files The summary covers the key elements of the exam document: the student and course information, exam rules, and overview of the two main exercises and their requirements.

Uploaded by

Gurvinder Chahal

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

65 views

Mid-Term Exam (30%) PROFESSOR: Oussama Derbel SECTION: 11112

Uploaded by

Gurvinder Chahal

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 4

FILLED BY THE STUDENT:

Tarandeep singh
_______________________________________________________________________________________
_

Student’s Name

5354120
_______________________________________________________________________________________
_

Student’s ID Number

2022-01-03
_______________________________________________________________________________________

Mid-Term Exam (30%) _

Date

PROFESSOR: Oussama Derbel

SECTION: 11112

EXAM RULES: FILLED BY THE PROFESSOR:

 All students must have an ID to confirm their identity.
 No student will be allowed to enter the evaluation room Evaluated Competencies:
20 minutes after the evaluation has started.
 Students may not leave the evaluation room during the
Use Hadoop components
exam period for any reason.
 Any student who arrives late will not be given any extra Time Allowed: 1h30 Hours
time to complete his or her evaluation.
 Students may be assigned a specific desk/location by the Materials Allowed: Yes
teacher.
 Students may not bring any food or drink other than
water into the evaluation room. Total Mark: 100
 All communication devices including but not limited to
cell phones, smart phones, smart watches, iPods, pagers Mark Obtained:
and Web-accessible electronic devices must be turned off
and left at a place designated by the teacher. Failure to
do so may lead to the removal of the evaluation.
 Cheating attempts or any assistance offered to others will
merit a mark of zero on the evaluation. This includes but
not limited to speaking or looking around the evaluation
room. In this case, the teacher will seize the evaluation
documents and submit a written report to the Program
Coordinator.
Big Data
420-BZ2-GX
STUDENT’S NAME:_____________ _____________________________________________________________________________________________________________

This Exam paper should be uploaded on Omnivox via Lea (No Mio)

Exercise 1 (40%):

a- What is Data ?
Ans- On a computer, data is information that is translated into a form that
works well for movement or processing. With regard to modern computers
and transmission media, data is information that is converted into a digital
binary form. It is acceptable for data to be used as a singular or plural topic.
Raw data is a term used to describe data in its basic digital format.
b- What is Big Data?
Ans- Big data refers to large, diverse sets of information growing at ever-
increasing prices. It covers the amount of information, speed or speed at
which it is built and collected, as well as the variety or scope of data points
to be combined.
c- What is information?
Ans- Big data involves managing data sets that are so large and
sophisticated that software processing software is not enough to capture,
filter, manage, and process data over a reasonable amount of time. Big
data can be used to predict and analyze user behavior.
d- What is Hadoop?
Ans- Apache Hadoop is an open source framework used to store and
process large data sets ranging from gigabyte to petabytes of data. Instead
of using a single large computer to store and process data, Hadoop allows
multiple computers to analyze large data sets for faster compliance.
e- List the 5 components of the Hadoop ecosystems and briefly describe the
functionality of each component:
Ans- Following are the components that collectively form a Hadoop
ecosystem:
 HDFS: Hadoop Distributed File System.
 YARN: Yet Another Resource Negotiator.
 MapReduce: Programming based Data Processing.
 Spark: In-Memory data processing.
 PIG, HIVE: Query based processing of data services.

2|Page
Big Data
420-BZ2-GX
STUDENT’S NAME:_____________ _____________________________________________________________________________________________________________

Exercise 3 (70%):

Describes the three operations (Map, Combine, Reduce) to count occurrences of

each word across these four files:

File 1: I’m Indian student.

File 2: I live in Montreal.

File 3: I’m learning Big data subject.

File 4: I love India.

File 5: I love Canada.

3|Page
Big Data
420-BZ2-GX
STUDENT’S NAME:_____________ _____________________________________________________________________________________________________________

4|Page

Mid Term Exam Big Data - 2
No ratings yet
Mid Term Exam Big Data - 2
4 pages
Cisco Hosted Collaboration Solution: Hcs Virtual Machine Requirements
No ratings yet
Cisco Hosted Collaboration Solution: Hcs Virtual Machine Requirements
17 pages
FBI Explanation of Silk Road Vulnerability
100% (2)
FBI Explanation of Silk Road Vulnerability
10 pages
Ese - Dec2020 - Socs - B Tech Cse Iotsc - Sem Vii - Csba4001 - Big Data Analytics
No ratings yet
Ese - Dec2020 - Socs - B Tech Cse Iotsc - Sem Vii - Csba4001 - Big Data Analytics
2 pages
3170722 W-23
No ratings yet
3170722 W-23
1 page
BDA Merged
No ratings yet
BDA Merged
7 pages
202410161135359598
No ratings yet
202410161135359598
1 page
KCS061-BIG-DATA
No ratings yet
KCS061-BIG-DATA
2 pages
Btech Oe 8 Sem Big Data Koe 097 2023
No ratings yet
Btech Oe 8 Sem Big Data Koe 097 2023
2 pages
GTU Big data analysis question paper Summer 2022
No ratings yet
GTU Big data analysis question paper Summer 2022
1 page
3161607 SUMMER 2023
No ratings yet
3161607 SUMMER 2023
2 pages
Gujarat Technological University
No ratings yet
Gujarat Technological University
1 page
3170722-w2023
No ratings yet
3170722-w2023
1 page
3161607 WINTER 2023
No ratings yet
3161607 WINTER 2023
1 page
Bda
No ratings yet
Bda
2 pages
BIG 22
No ratings yet
BIG 22
2 pages
@vtucode - in 18CS72 Previous Year Paper
No ratings yet
@vtucode - in 18CS72 Previous Year Paper
2 pages
ACSC31_qp
No ratings yet
ACSC31_qp
2 pages
3161607 WINTER 2024
No ratings yet
3161607 WINTER 2024
2 pages
Big-Data-Koe097 2021 - 22 8th Sem
No ratings yet
Big-Data-Koe097 2021 - 22 8th Sem
2 pages
W'22
No ratings yet
W'22
1 page
3161607 SUMMER 2024
No ratings yet
3161607 SUMMER 2024
1 page
Modal Question Paper
No ratings yet
Modal Question Paper
1 page
BDA PYQ
No ratings yet
BDA PYQ
4 pages
Model Paper BDA R20 VII Sem
No ratings yet
Model Paper BDA R20 VII Sem
3 pages
syllabus
No ratings yet
syllabus
4 pages
QP23EP1 - 290: Time: 3 Hours Total Marks: 100
No ratings yet
QP23EP1 - 290: Time: 3 Hours Total Marks: 100
1 page
6th sem DS syllabus 2022 scheme
No ratings yet
6th sem DS syllabus 2022 scheme
54 pages
Mca 3 Sem Big Data Kca022 Mar 2024
No ratings yet
Mca 3 Sem Big Data Kca022 Mar 2024
1 page
Pue Big Data
No ratings yet
Pue Big Data
2 pages
6-& 11 M - Big Data Analytics-VII Set1 BAO-Set 2 ECOM RA
No ratings yet
6-& 11 M - Big Data Analytics-VII Set1 BAO-Set 2 ECOM RA
4 pages
ST1 KCS061 - Updated
No ratings yet
ST1 KCS061 - Updated
2 pages
BDA Practical File
No ratings yet
BDA Practical File
61 pages
6th sem AIDS syllabus 2022 scheme
No ratings yet
6th sem AIDS syllabus 2022 scheme
52 pages
Syallaus 6 Final
No ratings yet
Syallaus 6 Final
16 pages
7th Cssyll
No ratings yet
7th Cssyll
49 pages
Final Exam Big Data - 11112
No ratings yet
Final Exam Big Data - 11112
6 pages
Big Data Analystics
No ratings yet
Big Data Analystics
4 pages
CP7019-Managing Big Data-Anna University - Question Paper
75% (4)
CP7019-Managing Big Data-Anna University - Question Paper
4 pages
BigDataSystems Regular HO
No ratings yet
BigDataSystems Regular HO
6 pages
Gujarat Technological University
No ratings yet
Gujarat Technological University
2 pages
Big Data
No ratings yet
Big Data
4 pages
2022-23-BDA-LAB Manual
No ratings yet
2022-23-BDA-LAB Manual
59 pages
CC ZG522 Course Handout
No ratings yet
CC ZG522 Course Handout
6 pages
BCA-BIGDATA-FIFTH_SEM-APPROVED-SYLLABUS
No ratings yet
BCA-BIGDATA-FIFTH_SEM-APPROVED-SYLLABUS
23 pages
Cs442 Dsa Unit Test 1
No ratings yet
Cs442 Dsa Unit Test 1
2 pages
2022-23-BDA-LAB Manual
No ratings yet
2022-23-BDA-LAB Manual
59 pages
BIG DATA ANALYTICS March 2023
No ratings yet
BIG DATA ANALYTICS March 2023
2 pages
Big Data Nit067
No ratings yet
Big Data Nit067
1 page
Big Data Cat Questions
No ratings yet
Big Data Cat Questions
7 pages
Blda Pract 2024
No ratings yet
Blda Pract 2024
59 pages
Notes
No ratings yet
Notes
11 pages
22684-S24
100% (1)
22684-S24
2 pages
Big Data Analytics 18ITC27
No ratings yet
Big Data Analytics 18ITC27
2 pages
Big Data Question Bank
No ratings yet
Big Data Question Bank
11 pages
170854b Dec 19
No ratings yet
170854b Dec 19
2 pages
Introduction To Big data-21CS753-syllabus
No ratings yet
Introduction To Big data-21CS753-syllabus
3 pages
Btech Cs 6 Sem Big Data Kcs 061 2023
No ratings yet
Btech Cs 6 Sem Big Data Kcs 061 2023
2 pages
BDA NOV-DEC 2022
No ratings yet
BDA NOV-DEC 2022
2 pages
IV Yr II Sem Lesson Plans
No ratings yet
IV Yr II Sem Lesson Plans
19 pages
Hadoop BIG DATA Interview Questions You'll Most Likely Be Asked
From Everand
Hadoop BIG DATA Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet
Touchpad Information Technology Class 10
From Everand
Touchpad Information Technology Class 10
Sanjay Jain
5/5 (1)
Python Cheatsheet: Strftime
No ratings yet
Python Cheatsheet: Strftime
3 pages
mx4gvr Ol e
No ratings yet
mx4gvr Ol e
111 pages
Skyline Queries Project Report
No ratings yet
Skyline Queries Project Report
10 pages
Administration It Professional 2015
No ratings yet
Administration It Professional 2015
2 pages
Indexing and Hashing: B.Ramamurthy
No ratings yet
Indexing and Hashing: B.Ramamurthy
24 pages
CP Assignment 2
No ratings yet
CP Assignment 2
6 pages
CLASS-X - IT UNIT - 2-Revision Questions-2024
No ratings yet
CLASS-X - IT UNIT - 2-Revision Questions-2024
5 pages
Famacy
No ratings yet
Famacy
15 pages
Home Lab
No ratings yet
Home Lab
2 pages
Sevlet
No ratings yet
Sevlet
14 pages
14,(Traffic Management)
No ratings yet
14,(Traffic Management)
11 pages
Hass Apps
No ratings yet
Hass Apps
73 pages
Share Full stack development Roadmap
No ratings yet
Share Full stack development Roadmap
4 pages
NMCC
No ratings yet
NMCC
6 pages
Positrex Presentation ENG
No ratings yet
Positrex Presentation ENG
13 pages
10 Best Offline Grammar Checker Software - Rigorous Themes
No ratings yet
10 Best Offline Grammar Checker Software - Rigorous Themes
34 pages
Sample Instructional Outline
No ratings yet
Sample Instructional Outline
2 pages
VNX7600 Parts Location Guide
No ratings yet
VNX7600 Parts Location Guide
66 pages
01-PAM-ADMIN-Introduction to-CyberArk-PAM
No ratings yet
01-PAM-ADMIN-Introduction to-CyberArk-PAM
41 pages
AZ 800T00A ENU ChangeLog
No ratings yet
AZ 800T00A ENU ChangeLog
5 pages
FMG-FAZ 5.4.5 Event Log Reference
No ratings yet
FMG-FAZ 5.4.5 Event Log Reference
37 pages
Project Report Manish
No ratings yet
Project Report Manish
16 pages
560F - Vxrail 14g e Series Owners Manual
No ratings yet
560F - Vxrail 14g e Series Owners Manual
25 pages
Crime File System Project Report
67% (3)
Crime File System Project Report
79 pages
Why Blockchain Unit 1
No ratings yet
Why Blockchain Unit 1
40 pages
Product Compatibility Sheet Rev 1 4
No ratings yet
Product Compatibility Sheet Rev 1 4
18 pages
Krunal Mahajan: Role of Intellectual Property in E-Commerce
No ratings yet
Krunal Mahajan: Role of Intellectual Property in E-Commerce
1 page
RSD (G1)
No ratings yet
RSD (G1)
40 pages

Mid-Term Exam (30%) PROFESSOR: Oussama Derbel SECTION: 11112

Uploaded by

Mid-Term Exam (30%) PROFESSOR: Oussama Derbel SECTION: 11112

Uploaded by

FILLED BY THE STUDENT:

Mid-Term Exam (30%) _

PROFESSOR: Oussama Derbel

EXAM RULES: FILLED BY THE PROFESSOR:

Describes the three operations (Map, Combine, Reduce) to count occurrences of

File 1: I’m Indian student.

File 2: I live in Montreal.

File 3: I’m learning Big data subject.

File 4: I love India.

File 5: I love Canada.

You might also like