0% found this document useful (0 votes)
443 views2 pages

Question Paper Code:: (10×2 20 Marks)

This document is a question paper for an exam on big data analytics. It contains 3 parts with multiple choice and descriptive questions. Part A contains 10 short answer questions worth 2 marks each, covering topics like the definition of big data, data analytics tools, MapReduce, and issues in stream processing. Part B has 5 longer answer questions worth 13 marks each, requiring explanations of challenges in conventional systems, Apache Hadoop features, clustering algorithms, and stream processing. Part C is a single 15 mark question requiring analysis of MapReduce, HDFS architecture, or the use of Hive with Hadoop.

Uploaded by

Ponraj Park
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
443 views2 pages

Question Paper Code:: (10×2 20 Marks)

This document is a question paper for an exam on big data analytics. It contains 3 parts with multiple choice and descriptive questions. Part A contains 10 short answer questions worth 2 marks each, covering topics like the definition of big data, data analytics tools, MapReduce, and issues in stream processing. Part B has 5 longer answer questions worth 13 marks each, requiring explanations of challenges in conventional systems, Apache Hadoop features, clustering algorithms, and stream processing. Part C is a single 15 mark question requiring analysis of MapReduce, HDFS architecture, or the use of Hive with Hadoop.

Uploaded by

Ponraj Park
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

*X86564*

Reg. No. :

Question Paper Code : X86564


M.E./M.Tech. Degree Examinations, April/may 2021
Second Semester
Computer Science and Engineering
CP5293 – big data analytics
(Common to M.E. Mobile and Pervasive Computing/M.E. Software
Engineering/M.Tech. Information Technology)
(Regulations 2017)

Time : Three Hours Maximum : 100 Marks

Answer all questions

Part – a (10×2=20 Marks)

1. What is Big Data ?

2. List out the data analytics tools.

3. Define Map reduce.

4. How big data and Hadoop related to each other ?

5. Distinguish between correlation and regression.

6. What is R in programming language ?

7. List out the issues in stream processing.


8. What is sampling data in a stream ?

9. What is NoSQL ?

10. Identify in which pig programs can be executed.

Part – B (5×13=65 Marks)

11. a) Explain in detail about the challenges of conventional system.

(OR)
b) Discuss about the tools, trends and technology in big data.
X86564 *X86564*

12. a) Discuss the features of Apache Hadoop in detail with neat diagram.
(OR)

b) i) Describe Map Reduce framework in detail. Draw the architectural


diagram for physical organization of compute nodes. (7)
ii) What are the different configuration files in Hadoop ? (6)

13. a) Describe the various hierarchical methods of cluster analysis.


(OR)
b) Explain the K-means clustering algorithm with an example.

14. a) What are streams ? Explain stream data model with its architecture.
(OR)
b) Discuss about decaying window in detail.

15. a) List the classification of NoSQL Databases and explain about Key Value Stores.
(OR)
b) What is HBase ? Discuss about features of HBase in detail.

Part – C (1×15=15 Marks)

16. a) State the significances of MapReduce and discuss about Hadoop distributed
file system architecture with neat diagram.

(OR)
b) Analyze the use of Hive. How does Hive interact with Hadoop explain in
detail ?

–––––––––––––

You might also like