BDA_Assignment 1.docx
Dept of CSE
Assignment 1
Course Outcomes
CO 2. Investigate the Hadoop framework, the Hadoop Distributed File System, and essential Hadoop tools.
CO 3. Illustrate the concepts of NoSQL using MongoDB and Cassandra for Big Data.
1 Define data, web data, and big data. Explain structured, semi-structured, and unstructured data. 2 1 8
3 What is meant by the 3Vs characteristics of Big Data? What challenges arise from the large growth in the volume of data? 2 1 6
4 With a neat diagram, explain the functions of each of the five layers in Big Data architecture design. 2 1 8
7 List the features of grid computing. How does it differ from cluster computing and cloud computing? 2 1 8
10 With a neat diagram, explain the Hadoop main components and ecosystem components. 2 2 7
11 Briefly describe the features of Hadoop HDFS. Also explain the functions of the NameNode and DataNode. 2 2 7
13 Explain the following: i. HDFS block replication ii. HDFS safe mode iii. Rack awareness iv. NameNode high availability 2 2 12
14 Discuss the Apache Sqoop import and export methods with a neat diagram. 2 2 8
15 List and compare the features of the Big Table, RC, ORC, and Parquet data stores. 2 3 10
16 With an example, explain key-value pairs. 2 3 8
17 Discuss the usage of MongoDB, Cassandra, CouchDB, Oracle NoSQL, and Riak. 2 3 8