DA Long Questions (12!11!24)
DA Long Questions (12!11!24)
DATA ANALYTICS
UNIT-I Long answer questions
1) What are the constraints and influences that will have an effect on data
Architecture Design? Explain [Feb 2023] [5 marks]
2) Describe the Factors that influence the Data Architecture?
3) Discuss about the Data Reduction as a Data preprocessing step [Feb
2023] [5 marks]
4) Explain the need of data preprocessing with illustrations [Aug 2022] [8
marks]
5) Why preprocess the data? Explain in detail. [March 2021][8 marks]
6) Explain in about Data Pre-processing and Data processing.[15
marks]
7) What is the important of data processing?Describe the steps involved
in data processing? [Aug 2024] [10 marks]
8) What is Secondary Data? Classify the secondary data sources. [5
marks][Feb 2023]
9) List and explain about the secondary sources of data
10) What are primary sources of data? Explain in detail Discuss how data
11)Quality assessment can depend on the Intended use of the Data?
12) How can data be collected from primary and secondary sources.[7
marks]
13) Explain about (or) discuss various sources of data in detail. [march
2021] [8 marks] [Aug 2022] [8 marks]
14) Explain about various constraints and influences that will affect data
15) Briefly describe various sources of data like sensors, signals, GPS in
data management. [Sep 2021] [15 marks]
16) What is data? How to handle large collection of data? [Aug 2022] [7
marks]
17) How to identify data Quality? What are the quality measures of data?
[Aug 2022] [7 marks]
18) What are the parameters to check quality measures of data?[7 marks]
19) Discuss how data quality assessment can depend on the Intended Use
of the data? [Feb 2023] [5 marks]
20) Market researchers have used four experimental design for generating
primary data. Describe them in detail. [Feb 2022] [15 marks]
21) Explain about data quality and data preprocessing. [Feb 2022] [15
marks]
22) Explain about the detection and treatment of Outliers
23) What are hazards? Explain potential sources of hazards in an
organization.[March 2021][8 marks]
24) Outline several sources of data for data collection and compare those
sources of data with advantages and limitations [Aug 2024] [10 marks]
25)Discuss the steps involved in Export job process in Amazon S3[8
marks]
26) How to detect and remove outliers in given data set?[8 marks]
27)Illustrate techniques of missing values treatment with example.[7
marks]
28) Demonstrate data preprocessing techniques in detail.[9 Marks]
29) What is data deduplication? Explain deduplication methods[6
marks]
30)How to manage data which comes from various sources by ensuring
data quality? Explain with real time example[15 marks]
31)Data set D {10K, 15K,22K, 25K,36K,40K,13K,19K, 88K,94K}
represents packages of the students placed in an interview where "K
represents thousand". Identify the outliers in the data set and analyze its
impact in studying the spread of data.[ 8 marks]