DATA ANALYTICS AND R PROGRAMMING- CMCA22ET3
DATA ANALYTICS AND R PROGRAMMING- CMCA22ET3
IMPORTANT QUESTIONS
UNIT - 1
PART – A
1. Discuss the challenges of conventional systems
2. Define Big data. Describe with an example.
3. Describe the significance of Intelligent Data analysis.
PART – B
1. Differentiate: Analysis and Reporting in Data Analytics.
2. “Big Data revolves around 4Vs”. Discuss them in detail with their role in Big Data
Analytics.
3. Define Sampling. Illustrate the various types of Sampling distribution and techniques in
detail.
4. Narrate the 4V’s involved in Big Data.
5. Enumerate the various types of sampling distribution and techniques in detail.
UNIT - 2
PART – A
1. Write short note on streaming data.
2. Write down the concept of Windowing in Big data.
3. What are the characteristics of stream data?
PART – B
1. Articulate the DGIM algorithm for counting oneness in a window
2. Explain the concept of Filtering Streams in streaming data
3. Outline the various benefits of Real time Sentiment Analysis
4. Elucidate the concept of DGIM algorithm in detail.
5. Observe and summarize the various purposes of Real time Sentiment Analysis.
UNIT - 3
PART – A
1. Narrate the differences between dependent and independent variables in Linear
Regression.
2. Analyze association rule mining, as an unsupervised learning technique with
appropriate example.
3. Discuss the various steps involved in the process of data analysis.
PART – B
1. Demonstrate k-means clustering algorithm with its impact on data analytics.
2. List out and elaborate the various built in data structures procurable in R
programming.
3. “Data Visualization is an integral part of Exploratory Data Analysis”. Assert the
statement with the significance of Infographics.
4. Discuss the various built-in data structures available in R programming.
5. Demonstrate the concept of Naïve Bayes classifier with suitable example
6. Enumerate with example the various built in R programming data structures.
UNIT - 4
PART – A
1. Outline the features of Map reduce technique.
2. Determine the objectives of Hadoop in data analytics.
3. Enumerate various Map reduce types and formats in HDFS.
PART – B
1. Illustrate the process of Hadoop Distributed File system (HDFS) with examples.
2. Is Hadoop a database or framework? Validate your response and discuss the various
components of Hadoop modules in detail.
3. Interpret the working of Map Reduce technique in Hadoop framework.
4. Elucidate the facts of Map Reduce algorithm in detail
5. Signify Hadoop Distributed File system (HDFS) with examples.
UNIT - 5
PART – A
1. What are the objectives of HBase?
2. Narrate the working of Querying Data in Hive
3. What are the fundamentals of Hbase and Zookeeper?
PART – B
1. What are the objectives of HBase?
2. Summarize the applications of Big Data Analytics Using Pig and Hive Frameworks.
3. State and explain the various operators in Hive with suitable examples.
4. Compare and contrast Pig and Hive services in Data Analytics.