bigdata imp ques
bigdata imp ques
YOUTUBE : ShortNotes4U
TELEGRAM : ShortNotes4u
UNIT 1:
1. List any five Big Data platform.
2. Discuss in detail the different forms of Big Data.
3. Elaborate various components of Big Data ecosystem.
4. Detail about the three dimensions of Big Data.
5. Briefly discuss the history of Big Data and its innovation.
6. Write down any four industry examples for Big Data.
7. What are the Big Data technology components?
8. Discuss the advantages and disadvantages of Big Data.
9. Explain the conventional systems. List some of the
challenges of conventional systems.
10. Differentiate between analysis and reporting.
UNIT2:
1. What is the role of Sort and Shuffle in Map-Reduce?
2. Illustrate the architecture of Map-Reduce.
3. Differentiate “Scale up & Scale out”. Explain with an
example how Hadoop uses Scale
out features to improve the performance.
4. Explain the Anatomy of MapReduce job run.
5. Discuss the different types of input formats of Map Reduce
with examples.
6. Explain how to develop a Map Reduce application?
7. Explain the components of Hadoop in detail.
8. Discuss Map Reduce features in detail.
9. Write a short note on
a) Hadoop streaming
b) Hadoop pipes,
10. Give a real life example of Map-Reduce.
Unit 3:
1. Examine how a client read and write data in HDFS.
2. Demonstrate the design of HDFS and discuss in detail.
3. Write the benefits and challenges of HDFS.
4. Describe the role of secondary NameNode in HDFS
Architecture. Is it a substitute to
the NameNode?
5. Explain Avro file-based data structures in detail.
6. List out some limitations of Hadoop archives.
7. Explain data ingestion with Flume and Scoop
8. Discuss security in Hadoop.
9. What are file-based data structures?
10. What is the advantage of Hadoop in cloud computing?
UNIT 4:
Q1. Summarize the role of indexing in MongoDB using an
example.
Q2. Explain Resilient Distributed Databases in Spark.
Q3. Explain Fair and Capacity scheduler in detail.
Q4. What are the Hadoop Ecosystem Components?
Q5: Classify and detail the different types of NoSQL.
Q6: Compare and contrast No SQL Relational Databases.
Q7: Does MongoDB support ACID properties. Justify your
answer.
Q8: With the help of suitable example, explain how CRUD
operations are performed in
MongoDB.
Q9. Explain Data Sharing using Spark RDD.
Q10. Describe inheritance mechanism in Spark.
UNIT 5:
Q1: Define Scheme.
Q2: Discuss the different types of data that can be handled
with HIVE.
Q3. Differentiate between Map-Reduce, Pig and Hive.
Q4. Explain various execution models of PIG.
Q5. Design and explain the detailed architecture of HIVE.
Q6. How does Zookeeper helps in monitoring a cluster?
Q7. Explain several Big Data strategy given by IBM.
Q8. Differentiate between Hbase and RDBMS.
Q9. . Differentiate between Pig and SQL.
Q10. Explain Metastore.