0% found this document useful (0 votes)
31 views10 pages

DA Long Questions (12!11!24)

Uploaded by

Harika
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
31 views10 pages

DA Long Questions (12!11!24)

Uploaded by

Harika
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 10

III-I (R22) BATCH 2022-2026

DATA ANALYTICS
UNIT-I Long answer questions
1) What are the constraints and influences that will have an effect on data
Architecture Design? Explain [Feb 2023] [5 marks]
2) Describe the Factors that influence the Data Architecture?
3) Discuss about the Data Reduction as a Data preprocessing step [Feb
2023] [5 marks]
4) Explain the need of data preprocessing with illustrations [Aug 2022] [8
marks]
5) Why preprocess the data? Explain in detail. [March 2021][8 marks]
6) Explain in about Data Pre-processing and Data processing.[15
marks]
7) What is the important of data processing?Describe the steps involved
in data processing? [Aug 2024] [10 marks]
8) What is Secondary Data? Classify the secondary data sources. [5
marks][Feb 2023]
9) List and explain about the secondary sources of data
10) What are primary sources of data? Explain in detail Discuss how data
11)Quality assessment can depend on the Intended use of the Data?
12) How can data be collected from primary and secondary sources.[7
marks]
13) Explain about (or) discuss various sources of data in detail. [march
2021] [8 marks] [Aug 2022] [8 marks]
14) Explain about various constraints and influences that will affect data
15) Briefly describe various sources of data like sensors, signals, GPS in
data management. [Sep 2021] [15 marks]
16) What is data? How to handle large collection of data? [Aug 2022] [7
marks]
17) How to identify data Quality? What are the quality measures of data?
[Aug 2022] [7 marks]
18) What are the parameters to check quality measures of data?[7 marks]
19) Discuss how data quality assessment can depend on the Intended Use
of the data? [Feb 2023] [5 marks]
20) Market researchers have used four experimental design for generating
primary data. Describe them in detail. [Feb 2022] [15 marks]
21) Explain about data quality and data preprocessing. [Feb 2022] [15
marks]
22) Explain about the detection and treatment of Outliers
23) What are hazards? Explain potential sources of hazards in an
organization.[March 2021][8 marks]
24) Outline several sources of data for data collection and compare those
sources of data with advantages and limitations [Aug 2024] [10 marks]
25)Discuss the steps involved in Export job process in Amazon S3[8
marks]
26) How to detect and remove outliers in given data set?[8 marks]
27)Illustrate techniques of missing values treatment with example.[7
marks]
28) Demonstrate data preprocessing techniques in detail.[9 Marks]
29) What is data deduplication? Explain deduplication methods[6
marks]
30)How to manage data which comes from various sources by ensuring
data quality? Explain with real time example[15 marks]
31)Data set D {10K, 15K,22K, 25K,36K,40K,13K,19K, 88K,94K}
represents packages of the students placed in an interview where "K
represents thousand". Identify the outliers in the data set and analyze its
impact in studying the spread of data.[ 8 marks]

UNIT-II Long answer questions

1) a) What are the ways to use the Data Analytics?


b)List and explain the (or) Discuss various data Analytics techniques
with examples[Sep 2021] [15 marks] [Feb 2023] [4 marks]
c) Demonstrate the various steps involved in data analytics and discuss
the tools and environment needed for analytics [Feb 2022] [15 marks]
2) What are the steps involved in Data Analytics? Explain in detail.
3) What are the different primary Analytics Tools for DA? Explain in
detail
4) Illustrate about Apache Spark Built and components in Hadoop.
5a) Contrast the differences about SQL & NOSQL databases.
b) Compare and contrast SQL Databases with NOSQL Databases.
c) What are the types of NOSQL tools based on Data models? Explain
[Feb 2023] [6 marks]
6) Explain in detail about the Missing Imputations (or) what are Missing
Imputations? Explain in detail
7) Illustrate the Model Building Life Cycle in Data Analytics.
8) Explain the various methods to identify the gaps in the Data and their
handling mechanisms [Feb 2023] [6 marks]
9) a)Discuss about the importance (or) Explain the applications of data
modeling in Business[Aug 2022] [8 marks] [Feb 2023] [4 marks]
b) Enumerate your views and observations on different data modeling
techniques [Sep 2021] [15 marks]
10) Explain in detail about need for business modeling. [March 2021][8
marks]
11) Explain how data imputation can be performed [Feb 2022] [15
marks]or Illustrate data imputations techniques[Aug 2022] [7 marks]
12) Contrast nominal, ordinal and ratio-scaled data. [Aug 2022] [7
marks]
Provide examples for the following types of data: nominal, ordinal,
categorical, continuous, discrete.[8 marks]
13) Discuss about the types of data variables? [Aug 2024] [5 marks]
14) Write about the applications of modeling a business and need for
business modeling? [Aug 2024] [5 marks]
15) Summarize different types of data models with suitable examples?
16) Briefly describe application of modeling in business[7 marks]
17)Describe the important features of Cloudera Impala [Aug 2022] [8
marks]
18)What is big data? How to handle big data for analytics?[7 marks]
19)Demonstrate partial imputation using expectation maximization
algorithm[8 marks]
20)Qualitative variables are not categorical. Justify with suitable
example? [7 marks]
21)Discuss storage mechanism of unstructured data in distributed
computing[8 marks]
21)Demonstrate Missing Imputation methods in detail with examples[8
marks]
22) with suitable examples(or) Illustrate Data modeling techniques.[7
marks]

UNIT-III Long answer questions


1) Discuss the significance of ROC analysis and ROC curve [Aug 2022]
[7 marks]
2) Demonstrate variable rationalization in regression [Aug 2022] [8
marks]
3) Illustrate Hosmer–Lemeshow test for goodness of fit of logistic
regression [Aug 2022] [8 marks]
4) Discuss in short about the following Model Fit Statistics: [Feb 2023]
[5 marks]
i) Hosmer Lemeshow Test ii) Error Matrix.
5) Explain ordinary least squares regression with an example [Aug 2022]
[7 marks]
6) a)Demonstrate ordinary least square estimation[Feb 2022] [15 marks]
b) Explain (or) outline neatly the purpose of the Least Square Estimation
in Regression with an example. [Feb 2023] [5 marks] [Aug 2024] [10
marks]
c) Test for the ‘least square estimation’ with appropriate case study. [Sep
2021] [15 marks]
d) What is least square estimate? Illustrate its importance in regression
modeling.
7) Discuss the best unbiased linear estimator property of regression [Feb
2022] [7 marks]
8) What is meant by BLUE property? What are the blue properties of
OLS method? [Feb 2023] [5 marks]
9) Discuss in detail about Multinomial Logistic Regression. [Feb 2023]
[5 marks]
10) What is wrong with linear regression? Explain logistic regression
[March 2021][8 marks]
11) Explain about variable rationalization with examples. [March
2021][7 marks]
12) ‘Logistic regression is an example of non-linear regression’ prove the
statement with suitable case study and experiment. [Sep 2021] [15
marks] [Aug 2024] [10 marks]
13) a)Sketch various analytics applications to various Business Domains.
b) Explain about analytics applications to various Business
Domains[march 2021] [8 marks]
C) Elucidate analytical applications to various business domains. [Feb
2022] [15 marks]
14) What is the role of sigmoid function in regression models? Give
illustrations.[7 marks]
15) Apply logistic regression to demonstrate binary classification.[7
marks]
16) When to choose logistic regression over linear regression? Explain
with examples.[7 marks]
17) Discuss logistic transformation and its components.[8 marks]
18)Explain different types of variables used in Regression modeling.[8
marks]
19) Demonstrate linear regression with suitable example[7 marks]
20) What is the role of regression in data analytics? Illustrate different
types of regression.[15 marks]
21) Make a comparison of ordinal least squares method and maximum
likelihood estimation method with suitable example data.[15 marks]

UNIT-IV Long answer questions


1) Explain decision tree induction approach [Aug 2022] [7 marks]
2) How to extract features from time-series data? Explain with an
example [Aug 2022] [8 marks]
3) What is ETL? List the commercially available ETL tools. [Feb 2022]
[7 marks]
4)Explain ARIMA[march 2021][5 marks]
5) Demonstrate (or) Discuss ARIMA for time series data. [Feb 2022] [8
marks]
6) What is Overfitting? How to Prevent Overfitting? [Feb 2023] [5
marks]
7)Explain the following time series models with examples
Auto regression(AR)
8)Auto Regressive Moving Average(ARMA) [Aug 2024] [10 marks]
9) Compare and Contrast between ARMA and ARIMA[Feb 2023] [5
marks]
10) Differentiate (or)Discuss about supervised and unsupervised learning.
[Feb 2023] [5 marks] [Aug 2024] [5 marks]
11) Discuss the STL approach for Time Series Decomposition [Feb
2023] [5 marks]
12) Discuss about dimensional stacking and tree-map. [March 2021][7
marks]
13) What is STL approach? Explain in detail. [March 2021][7 marks]
[Aug 2024] [5 marks]
14) Describe measures of forecast accuracy. [March 2021][8 marks]
15) Briefly construct the following decision tree algorithms. [Sep 2021]
[15 marks]
CART [8 marks] b) C4.5[7 marks]
16) Distinguish between supervised and unsupervised learning. [March
2021] [7 marks]
17) Explain CHAID algorithm for classification and also discuss its
limitations. [15 marks]
18) Outline major steps of decision tree classification with a suitable
example.[7 marks]
19)What is tree pruning? Illustrate drawback of using separate set of
tuples to evaluate pruning.[8 marks]
20)What is meant by overfitting of a model? How to handle such
situation?
UNIT-V Long answer questions
1) Describe parallel coordinates and land scapes for geometric data
visualization [Aug 2022] [8 marks]
2) Elucidate visualization of complex data and relations [Aug 2022] [7
marks]
3) How to perform visualization of the data using a hierarchical
partitioning into subspaces? Explain with examples. [Feb 2022] [15
marks]
4) Demonstrate Geometric projection visualization techniques [Feb 2023]
[5 marks]
Infer about “Geometric projection visualization techniques” in detail with
case study [Aug 2024] [10 marks]
5) List out various applications of data visualization [Feb 2023] [5
marks]
6) Discuss (or) Describe (or) Write down steps involved in Data
Visualization in Tableau. [Feb 2023] [5 marks][15 marks]
7) Explain about tag cloud visualization technique [Feb 2023] [5 marks]
8) Explain the following:
a)Apache Spark[march 2021][5 marks]
b) Data visualizations [march 2021][5 marks]
9) a)Discuss in detail about visualizing complex data and relations.
[March 2021] [7 marks]
b) How to perform visualization of complex data and relations? Quote
suitable examples.[8 marks]
10) Explain about direct visualization and hyperslice [March 2021] [7
marks]
11) Interpret on ‘pixel-oriented visualization’ with example. [Sep 2021]
[15 marks]
12) Illustrate the usage of Chernoff faces and stick figures in data
visualization.[7 marks]
13) Analyze and outline the importance of scales and dimensions in
spread sheet visualization.[8 marks]
14) Apply dimensional stacking and explain how to visualize multivariate
data. [7 marks]
15)Outline the concept of “Icon-Based visualization Techniques” with
case study. [Aug 2024] [10 marks]

You might also like