Question Bank Dau
Question Bank Dau
QUESTION BANK
Module - 1
Short type Questions:
1. What is Data science
2. List different types of data used in data science with example
3. Why should you learn python for data science
4. List the Features of Python
5. Explain purpose of python for data science
6. List and explain components of python used for data science
7. Why top most companies use python is implementation language?
8. Explain limitations of python
Focused short :-
1. Explain in detail about applications of data science?
2. Explain different data analysis and manipulation tools with example
3. Explain life cycle of data science
4. Apply data science case study on your own data set
5. Explain sklearn library in detail
Long Questions:
1. List and explain components of python used for data science
2. Explain in detail about panda’s library
3. Discuss the core libraries and frameworks in Python that are widely used in data science, such
as NumPy, Pandas, and Scikit-learn.
4. What are the advantages of using Python for data visualization, and which libraries are
commonly used for this purpose?
Module - 2
1. Short type Questions:
2. What is data analytics and its applications
3. What is EDA and list its types
4. What is dimensionality reduction and its importance
5. What is data analysis tools explain
6. Is data analysis part of data analytics justifying your answer
7. Explain data analytics process with an example
8. Explain data analytics types with an example
9. Perform EDA process on iris data set
10. Explain types of EDA process with an example
Focused short :-
1. Explain EDA quantitative and graphical techniques with an example
2. Define data analytics process. Explain with an example
3. Explain in detail about data analytics process?
4. Identify types of data with an example?
5. Explain the importance of data analytics process.
6. Describe the role of data preparation in the data analytics process.
7. Study the following bar graph and answer the questions that follow: Total monthly
income = Rs. 50,000
Long Questions:
Module - 3
1. What is feature generation in data science, and why is it important for user retention?
2. How does domain expertise play a role in feature generation?
3. What is the difference between feature generation and feature selection?
4. Name two feature selection algorithms used for user retention.
5. How does feature selection help in improving model performance for user retention?
6. How does domain expertise play a role in feature generation
7. How do you evaluate the importance of features in a data set
8. How do you brainstorm features from a given problem
9. What are the advantages and disadvantages of feature generation
Focused short :-
1. Explain the difference between filter method and wrapper method for feature selection
2. Explain the process of feature generation in data science, highlighting its significance in user
retention.
3. Discuss the role of domain expertise and imagination in feature generation for user retention.
4. Compare and contrast two feature selection algorithms used for user retention, highlighting
their advantages and disadvantages.
5. How does feature selection impact model performance and interpretability in user retention
applications? Explain with examples.
6. Describe a scenario where feature generation and selection can be applied to improve user
retention in a real-world application.
Long Questions:
1. Develop a comprehensive feature generation and selection framework for user retention,
incorporating domain expertise and imagination.
2. Evaluate the performance of two feature selection algorithms on a user retention dataset,
comparing their results and discussing the implications.
4. Discuss the ethical considerations in feature generation and selection for user retention,
highlighting potential biases and mitigation strategies.
5. Create a case study illustrating the application of feature generation and selection in
improving user retention for a real-world organization.
Module - 4
2. Discuss bar chart line chart area fill and pie chart with examples
Focused short :-
1. Explain the concept of data visualization and its significance in communicating insights.
2. Discuss the principles of effective data visualization, highlighting color, size, and
position.
3. Describe two inspiring industry projects that demonstrate effective data visualization.
4. Compare and contrast two data visualization tools, highlighting their strengths and
weaknesses.
5. Create a simple visualization (e.g., bar chart, scatter plot) using a sample dataset.
Long Questions:
3. Evaluate the effectiveness of two data visualizations, discussing clarity, accuracy, and insight
generation.
4. Create a visualization that tells a story with data, using a real-world dataset and appropriate
tools.
5. Conduct a case study on an industry project that demonstrates exceptional data visualization,
analyzing its impact and effectiveness.
Module - 5
6. What are the ethical considerations in using data science for social media analytics?
Focused short:-
1.Discuss the skills required for next-generation data scientists, highlighting the importance
of domain expertise.
2.Explain the role of communication skills in data science, highlighting best practices.
3.Describe the emerging trends in data science and their impact on next-generation data
scientists.
4.Compare and contrast two approaches to integrating domain expertise with data science.
5.Discuss the ethical implications of using data science in healthcare, highlighting privacy
and security concerns.
6.Compare and contrast two ethical frameworks for data science.
7.Discuss the applications of data science in finance, highlighting two specific use cases.
8.Explain how data science is used in healthcare to improve patient outcomes.
Long Questions:
2. Design a data science framework for customer segmentation, including data preprocessing,
model development, and evaluation.
3. Evaluate the applications of data science in finance, highlighting two specific use cases and
their impact.
5. Create a case study on an ethical issue in data science and propose a solution, highlighting the
importance of domain expertise.
6. Develop a data science code of ethics, outlining key principles and consideration