Assignment 2 IBDA
Assignment 2 IBDA
VIDHYADEEP INSTITUTE OF
ENGINEERING AND TECHNOLOGY
Vidhyadeep Campus, Anita (Kim), Ta. Olpad,
Dist. Surat
Assignment_2
UNIT: II
1. Discuss the history of Hadoop. How did it evolve, and what key developments led to its widespread
adoption in the Big Data ecosystem?
2. Explain the architecture of the Hadoop Distributed File System (HDFS). Describe the physical
organization of compute nodes in HDFS.
3. What are the key components of Hadoop? Discuss the role of HDFS, MapReduce, and YARN in
the Hadoop ecosystem.
4. How is data analyzed using Hadoop? Discuss the process of data processing in Hadoop using
MapReduce and how it scales to handle large datasets.
5. What is the concept of "scaling out" in Hadoop? Explain how Hadoop achieves scalability and fault
tolerance in distributed data processing.
6. What is Hadoop Streaming? How does it allow the integration of non-Java programming languages
with Hadoop for data processing?
7. Describe the design of HDFS. How does the master-slave architecture of HDFS contribute to fault
tolerance and efficient data storage in Hadoop?