0% found this document useful (0 votes)
66 views

Siddaganga Institute of Technology, Tumakuru - 572 103

The document is a question paper for the M.Tech Computer Science and Engineering examination on Big Data and Data Analytics. It contains 6 questions with 3 sub-questions each. The questions assess different aspects of big data systems including the four elements of big data, Hadoop ecosystem components, MapReduce approach, Hive commands, data distribution models, R scripts, and social media analytics.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
66 views

Siddaganga Institute of Technology, Tumakuru - 572 103

The document is a question paper for the M.Tech Computer Science and Engineering examination on Big Data and Data Analytics. It contains 6 questions with 3 sub-questions each. The questions assess different aspects of big data systems including the four elements of big data, Hadoop ecosystem components, MapReduce approach, Hive commands, data distribution models, R scripts, and social media analytics.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

USN 1 S I 1SCSE3

Siddaganga Institute of Technology, Tumakuru – 572 103


(An Autonomous Institution affiliated to VTU, Belagavi, Approved by AICTE, Programmes Accredited by NBA, New Delhi, An ISO9001:2008 Certified Institute)

First Semester M.Tech.- Computer Science & Engg. Examinations Jan. 2017
Big Data & Data Analytics
Time: 3 Hours Max. Marks: 100
Note : 1. Answer any 5 full questions

1 a) List and explain the four elements of Big Data. 4


b) With a suitable example explain how use of Big Data prevents fradulent activities. 4
Explain how the following technologies help organisations to analyse data under varying
circumstances:
c) i) In-Memory computing Technology.
ii) Hybrid cloud.
iii) HDFS and Map-reduce. 12

2 a) Draw a neat diagram that shows the interaction between various tools and components in a
Hadoop Ecosystem. Explain any two components. 6
b) Discuss the role of following layers in the Big Data stack:
i) Ingestion layer.
ii) Storage layer.
iii) Visualization layer. 6
c) With neat diagrams compare the execution of a query in RDBMS and Big Data processing
solution. 8

3 a) Assume that a pharmaceutical company wants to track the stock of a specific medicine in all
its ware houses. Describe the working of an M-R Approach to achieve the task. 6
b) Discuss any three major guidelines used in the implementation of M-R application. 6
c) How can the following be used to customize M-R execution to improve the performance of
the cluster network:
i) Implementing Input Format for Compute Intensive Applications.
ii) Optimizing M-R Execution with combines. 8

4 a) Write HIVE commands for the following :


i) Create a database with any two database properties.
ii) Create an external table.
iii) Copy the book-title column from table Lib-info to table list-titles.
iv) Display the Cartesian product of two tables. 8
b) Explain the concepts of Map-side join in Hive with a suitable diagram. 6
c) List any two functions of a Oozie co-ordinator. Discuss the types of time-based co-ordinators. 6

5 a) Discuss the Data Distribution Models used with Aggregate-Oriented Databases. What is the
importance of CAP theorem in such distributed databases? 8
b) What is the relevance of the following in Big Data Analytics?
i) Operational Analytics.
ii) Monetized Analytics. 6

-1- Please Turn Over


-2- 1SCSE3
c) Compare the Analytical tools with respect to the following features:
i) Decision Making.
ii) File Management.
iii) Data Management. 6

6 a) Assume the following data sets:


i) 200 random numbers.
ii) Iris data.
Write R scripts to display groups for the above data sets. 4
b) Discuss the importance of following functions in R with suitable examples:
i) ls( ) ii) save ( ) iii) load ( ) 6
c) With suitable examples discuss the following with respect to social media analytics:
i) Text mining process.
ii) Sentiment Analysis. 10
________

You might also like