BDA Assignment - 231012 - 151952
BDA Assignment - 231012 - 151952
Question Bank
Assignment I
1. What is Big Data? Explain Characteristics of Big Data.
2. What is Big Data Analytics? Explain 5 ‘V’s of Big Data.
3. Explain any 4 big data distribution packages
4. List top big data technologies.
5. Explain applications of big data.
Assignment II
1. With neat sketch explain HDFS?
2. Describe the working of map reduce with a relevant example.
3. Discuss architecture and application workflow of Hadoop YARN in detail.
4. What is zookeeper? Explain the architecture of zookeeper in detail.
5. Illustrate the architecture of big data stack.
6. Illustrate Paxos algorithm and its working in detail.
7. Explain HBASE architecture.
Assignment III
1. Explain different types of big data pipeline architecture with suitable diagram.
2. Explain spark streaming architecture with neat diagram.
3. What are the major components of Kafka? Explain with neat diagram.
MCQ
1. ___________ is a collection of data that is used in volume, yet growing exponentially
with time
A. Big Database
B. Big DBMS
C. Big Datafile
D. Big Data
A. Apache Pytorch
B. Apache Kafka
C. Apache Hadoop
D. Apache Spark
6. What is the minimum amount of data that a disk can read or write in HDFS?
A. Byte size
B. Block size
C. Heap
D. None of the above
A. structured data
B. unstructured datat
C. Both A and B
D. None of the above
13. Which step is executed by the data scientist after obtaining the data?
(A). Data Replication
(B). Data Integration
(C). Data Cleansing
(D). All of these
(E). None of these