The document contains questions about big data concepts like HDFS, MapReduce, YARN and the four V's of big data. It also contains questions about Hadoop components like Namenode, Datanodes and MapReduce programming model.
Download as DOCX, PDF, TXT or read online on Scribd
0 ratings0% found this document useful (0 votes)
56 views
Bits
The document contains questions about big data concepts like HDFS, MapReduce, YARN and the four V's of big data. It also contains questions about Hadoop components like Namenode, Datanodes and MapReduce programming model.
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 2
1. What are the main components of big data?
a)HDFS b)MapReduce c)YARN d) all
2. On which of the following platforms does Hadoop run? a) Cross-desk b) cross-platform c)cross-platforrm d)unix 3. What are the four V’s of Big Data? (a)Volume (b)Velocity (c)Variety d)All the above 4. ___ is general-purpose computing model and runtime system for distributed data analytics. (a)Mapreduce (b)Drill (c)Oozie (d)None of the above 5. The examination of large amounts of data to see what patterns or other useful information can be found is known as (a)Data examination (b)Information analysis (c)Big data analytics (d)Data analysis 6.Big data analysis does the following except (a)Collects data (b)Spreads data (c)Organizes data (d)Analyzes data 7. Concerning the Forms of Big Data, which one of these is odd? (a)Structured b)Unstructured (c)Processed (d)Semi-Structured 8.In which language is Hadoop written? a)C++ b)Java c)Python d) Rust 9._____ is a collection of data that is used in volume, yet growing exponentially with time a)BigDatafile b)BigDBMS c)BigData d) big database 10. __________ has the world’s largest Hadoop cluster. a) Apple b) Datamatics c) Facebook d) None of the mentioned 11. A ________ node acts as the Slave and is responsible for executing a Task assigned to it by the JobTracker. a) MapReduce b) Mapper c) TaskTracker d) JobTracker 12.__________ part of the MapReduce is responsible for processing one or more chunks of data and producing the output results. a) Maptask b) Mapper c) Task execution d) All of the mentioned 13. Hadoop is a framework that works with a variety of related tools. Common cohorts include ____________ a) MapReduce, Hive and HBase b) MapReduce, MySQL and Google Apps c) MapReduce, Hummer and Iguana d) MapReduce, Heron and Trumpet
14.All of the following accurately describe Hadoop, EXCEPT ____________
a) Open-source b) Real-time c) Java-based d) Distributed computing approach 15. What is MapReduce? a) A distributed data processing framework b).A database management system c). A network routing algorithm. d)A single computer data processing framework 16.What is the purpose of the map function in MapReduce? A. To convert input data into key-value pairs B. To sort the input data C. To combine the input data D. To summarize the input data
17.What is the purpose of the reduce function in MapReduce?
A. To sort the input data B. To combine the input data C. To summarize the input data D. To convert input data into key-value pairs
18.Which of the following is a key feature of Hadoop?
A. High latency B. Real-time processing C. Fault tolerance D. Limited scalability
19. Clients access the blocks directly from ________for read and write a)datanodes b)namenode c)secondarynamenode d) none
20. Information about locations of the blocks of a file is stored at __________
a)datanodes b)namenode c)secondarynamenode d) None 21.which node stores the checksum? a)datanodes b)namenode c)secondarynamenode d) none 22. Mapreduce programming model is ______ a)Platform dependent but not language specific b) Neither platform–nor-language specific c) Platform independent but language-specific d) Platform dependent and language-specific 23. The MapReduce algorithm contains two important tasks, namely __________. A. mapped, reduce B. mapping, Reduction C. Map, Reduction D. Map, Reduce 24. In how many stages the MapReduce program executes? A. 2 B. 3 C. 4 D. 5 25. Which of the following is used for an execution of a Mapper or a Reducer on a slice of data? A. Task B. Job C. Mapper D. PayLoad 26. The number of maps is usually driven by the total size of ____________ A. Inputs B. Output C. Task D. None of the above