Question Bank - 1

This document contains 20 questions on big data concepts and technologies: the characteristics, forms, and platforms of big data; MapReduce architecture and functionality; Hadoop components such as HDFS and its operations; data ingestion tools in Hadoop; HDFS features; MongoDB querying and indexing; the differences between SQL and NoSQL; CRUD operations in MongoDB; Spark and its applications; Pig Latin and its execution modes; HBase; Hive architecture; and Hive join queries with examples.

Uploaded by

Priyanshu Kumar

Question Bank

Big Data

1. Explain Big Data architecture and its characteristics (the 5 V's).
2. Describe the Big Data features: security, compliance, auditing and protection, privacy, and ethics.
3. What are the different forms of Big Data?
4. List the Big Data platforms and their drivers.
5. Differentiate between:
a) Analysis and reporting
b) Analysis and data analytics
6. Discuss the anatomy of a MapReduce job run.
7. What are the features of MapReduce?
8. Sketch the architecture of MapReduce and explain it in detail.
9. Discuss job scheduling.
10. Describe:
a) Components of Hadoop
b) Two nodes of HDFS
c) Shuffle and sorting in MapReduce
d) Scale in and scale out
11. How does HDFS store, read, and write data files?
12. Discuss:
a) Data Replication
b) Default block size of HDFS
c) Data abstraction in HDFS
d) AVRO
e) Benefits and challenges of HDFS
13. Describe in detail the data ingestion tools in Hadoop.
14. Provide an overview of:
a) New features in Hadoop 2.0
b) HDFS Federation
c) Querying in MongoDB
d) Indexing in MongoDB
15. Differentiate between SQL and NoSQL.
16. Elaborate in detail on CRUD operations in MongoDB.
17. What is Spark, and what are its applications?
18. Write in detail about:
a) Pig Latin
b) Grunt in Pig
c) Execution modes of Pig
d) HBase
19. Discuss the Hive architecture.
20. What are the Hive queries for JOIN? Explain with examples.
