0% found this document useful (0 votes)

16 views

TOPIC WISE DSA QUESTIONS

Uploaded by

Surabhi Raj

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views

TOPIC WISE DSA QUESTIONS

Uploaded by

Surabhi Raj

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 15

Data Science and its Applications (21AD62)

MODULE 1

Data Visualization

1. What is data Visualization? Explain bar chart and line chart. (8)
2. Explain Data Visualization and recognize its use. Sketch Python code segment to
visualize line chart and scatterplot with example. (6)
3. With matplotlib explain simple line chart and bar chart. (8)
4. Write a short note on data visualization. (6)
5. Describe the process of creating a bar chart using matplotlib. What information is
typically conveyed by a bar chart? (4)
6. Explain the concept of correlation and its significance in data analysis. Discuss
Simpson’s Paradox and other correlational caveats with examples. (8)
7. Explain with example the matplotlib library in python. (6)
8. Draw the scatter plot to illustrate the relationship between the number of friends and
the number of minutes spent on every day. (4)
9. Develop a python program to plot a bar chart for the given data. Draw the bar chart
and label x and y axes. (8)
10. Develop a python program to plot a line chart for the given data. Explain the various
attributes of the line chart. Draw the line chart. (6)

Probability Theory & Bayes Theorem

1. Write a note on probability theory as applicable to data science. (8)

2. Describe Bayes’s Theorem in detail with an example. (8)
3. State and explain Bayes’s theorem. (6)
4. Describe Bayes’s Theorem and its significance in statistical inference. How can
Bayes’s Theorem be applied to improve classification models? (8)
5. Describe the following probability concepts:
o Conditional Probability
o Bayes Theorem
o Central Limit Theorem
o Normal Distribution
o Random Variables (8)
6. Find the probability of the given events:
o A single letter is selected at random from the word ‘MACHINE LEARNING’.
The probability that it is a consonant.
o The probability of rolling 2 dice to get a sum of 4 or 7.
o Lottery tokens are numbered from 1 to 25. What is the probability that a token
drawn is a multiple of 5 or 7?
o The probability of getting a face in 52 cards. (8)
Normal Distribution

1. Write a note on normal distribution. (4)

2. Describe Normal Distribution with a Python routine for PDF and CDF. (7)
3. Describe the statement “Correlation is not Causation” with an example in detail. (6)
4. What is the Standard Normal distribution? Explain how to use the Z-score to
standardize a normal random variable. (7)
5. Illustrate normal distribution and continuous distribution in detail. (10)
6. Illustrate Central limit theorem with a neat diagram. (8)
7. State and illustrate the Central Limit Theorem with a python code using a suitable
example. (7)
8. Discuss the Central Limit Theorem and its significance in relation to the Normal
distribution. How is the Normal distribution used in hypothesis testing? (6)

Vectors in Data Science

1. Explain the following:

o Vector addition
o Vector sum
o Vector mean
o Vector multiplication (8)
2. Describe vectors in Data Science and explain any three operations on vectors with
Python routine for each operation. (6)
3. Write a Python Program to add Two Vectors and Multiply a Vector by Scalar. (6)
4. Explain vectors with a code to find the distance between two vectors. (6)
5. Discuss the concept of vectors and matrices in Linear Algebra. Provide examples of
how they are used in data manipulation and machine learning. (10)

Measures of Central Tendency & Dispersion

1. Explain the following statistical techniques:

o Mean
o Median
o Mode
o Interquartile range (8)
2. What are the main measures of central tendency? Describe each one. How do you
represent a vector in Python using libraries like NumPy? (7)
3. What are measures of dispersion, and why are they important? (6)
4. Summarize dispersion. Using Python code snippet explain the various measures of
dispersion. (7)
5. Describe dispersion and variance and write the python code to compute the variance.
(6)
6. Explain standard deviation and interquartile range and write python code to compute
standard deviation and interquartile range. (8)
7. Develop python functions for computing the components of central tendencies with
explanation. (6)
8. Consider the following employees data:


Find the standard deviation of salary of employees in each dept. of a company
and identify the department with the highest standard deviation. (7)
 Find the mean and median salary of employees in each department of the
company. (7)

9. Compute code to compute standard deviation. (6)

Simpson’s Paradox

1. Explain Simpson’s Paradox. (4)

2. Explain Simpson’s paradox with an example. (7)
3. Illustrate Simpson’s paradox with an example. (6)
4. Discuss Simpson’s Paradox and other correlational caveats with examples. (8)

Correlation vs Causation

1. Explain the difference between correlation and causation. Why is it incorrect to infer
causation from correlation alone? Describe an example where correlation does not
imply causation. (7)
2. Describe the statement “Correlation is not Causation” with an example in detail. (6)

Data Science

1. Define Data Science. Explain the Venn diagram of Data Science. (6)
2. What is Data Science? Write a short note on data visualization. (6)
3. What is Data Science? With example explain the role of a data scientist. (8)
4. Who is a Data scientist? Draw the data science life cycle in detail. (8)
Random Variables

1. Discuss Random variables with an example in detail. (6)

2. Discuss random variables with an example in detail. (6)
3. What are random variables? State Bayes’s theorem in detail with an example. (8)
MODULE 2

Gradient Descent:

1. Explain the gradient descent approach in detail with a relevant example.

2. What is gradient descent, and why is it important in machine learning? Explain the
difference between gradient descent and stochastic gradient descent.
3. Explain the way how Gradient descent is used to fit Parameterized models.
4. Explain how gradient descent is used to fit parameterized models.
5. What is gradient descent? Explain the idea behind gradient descent and how it is used
to fit models. Discuss the differences between batch gradient descent, minibatch
gradient descent, and stochastic gradient descent.
6. Compute code to estimate the gradient.
7. Summarize Stochastic and Minibatch Gradient Descent.

Hypothesis Testing & A/B Testing:

1. Explain in detail on hypothesis testing with example.

2. Interpret the importance of power and significance in Statistical Hypothesis Testing
with a suitable Python routine.
3. What is an A/B test, and why is it used in data science? Describe the steps involved in
designing and running an A/B test.
4. Describe A/B test with an example.
5. Explain null and alternative hypothesis by considering the example for a flipping
coin. Write a Python program to flip the coin 1000 times and count the number of
heads and tails. Based on the results, determine if the coin is fair.
6. What is P-Hacking? Describe A/B test with an example.
7. Explain statistical hypothesis testing with examples.
8. Explain A/B testing with an example, with a relevant equation.
9. Write a short note on null and alternative hypothesis by considering the example for a
flipping coin.
10. Describe the process of statistical hypothesis testing. Using the example of flipping a
coin, explain how you would determine if a coin is fair or biased.
11. Illustrate A/B test with an example.
12. Illustrate p-Values with an example.
13. What are p-values and confidence intervals in the context of hypothesis testing?
Discuss their significance and how they are used in making statistical inferences.
14. Explain Confidence Intervals with an example.
15. Write a note on confidence intervals in detail.
16. What is p-hacking? Write a short note running an A/B testing.

Data Cleaning & Munging:

1. Explain data cleaning, data munging, and manipulating Data.

2. Explain cleaning and munging of data with an example.
3. Illustrate cleaning and munging with suitable code.

Web Scraping:

1. Explain the methodologies to extract data from web scraping.

2. Articulate the role of BeautifulSoup in Web scraping using Python snippet.
3. Consider an HTML file. Write a Python program to scrap the page extract values
associated with tags and properties.
4. Write a Python code for scraping an HTML document with an example.
5. Consider an HTML file and build a Python program to scrap the page, extract values
associated with tags and properties.
6. Describe the steps involved in obtaining data from various sources such as stdin,
stdout, reading files, web scraping, and using APIs. Provide a detailed example of
using the Twitter API to gather data.
7. Write a short note on Beautiful Soup library.

Linear Regression & Error Detection:

1. What is Simple Linear Regression? How is error calculated in the Linear Regression
model? How would you detect overfitting in a linear model?
2. Explain the mathematical intuition of Multiple Linear Regression. Explain the steps.
3. Explain how gradient descent is used to fit parameterized models.

Data Handling with Python (CSV, Named Tuples, etc.):

1. Sketch the use of csv.reader, csv.DictReader, and csv.writer in processing Delimited

Files.
2. Brief out Bootstrapping. Explain how manipulation of data is done and brief out what
is named tuples.
3. Write a program that counts the lines it receives and then writes out the count.
4. Explain and write a code using the “NamedTuple” class.
5. Illustrate the difference between named tuples and Data classes with an example.
6. Write a Python code for counting the number of lines and counting the 10 most
repeated words in the given file using stdin and stdout and regular expression.

Dimensionality Reduction:

1. Explain dimensionality reduction in detail.

2. Explain in detail dimensionality reduction with an example.

Miscellaneous:
1. Predict the genre of the ‘Barbie’ movie with IMDB=7.4 and duration 114 using KNN,
considering k=3.

2. Illustrate tqdm Library functions with an example.

3. Illustrate the tqdm library by considering an example.
4. Compute code to explain the beta distributions.
5. Explain with an example the concept of rescaling.
6. Illustrate 1D, 2D, and multi-dimensional data with examples.
7. Describe Bayesian Inference in detail.
8. Compute code to explain the beta distributions.
MODULE 3

Overfitting and Underfitting

1. Explain underfitting and overfitting in detail.

2. Summarize overfitting and underfitting with examples and explain how to resolve
them.
3. Compare overfitting and underfitting the training data in Machine Learning.
4. Explain overfitting and underfitting with examples.
5. Define machine learning and discuss the difference between overfitting and
underfitting. How can these issues be mitigated in model training?
6. Discuss the Bias-Variance tradeoff in detail.

Naive Bayes Algorithm

1. Explain Naive Bayes as a really dumb spam filter.

2. Explain Naïve Bayes Algorithm in the context of classification with functions.
3. Describe theoretically the Naive Bayes theorem to model a sophisticated spam filter
and write a Python program to classify whether a message contains spam or not using
Naive Bayes theorem.
4. Describe theoretically the Naive Bayes theorem to model a sophisticated spam filter.
5. What is the Naive Bayes algorithm? Illustrate its application with an example of a
spam filter.

Logistic Function and Logistic Regression

1. Explain the use of the logistic function in logistic regression in detail.

2. Explain the logistic function in detail.
3. Write a note on simple linear regression using gradient descent.

Simple Linear Regression and Gradient Descent

1. Explain the simple linear regression model in detail and write a Python program to
illustrate gradient descent for a simple linear regression model.
2. Write a note on simple linear regression using gradient descent.

Feature Extraction and Feature Selection

1. What is feature extraction, and why is it important in machine learning? Explain the
difference between feature extraction and feature selection.
2. Write a short note on feature extraction and selection.
3. Illustrate the process of feature extraction and selection in machine learning. Why is
this step important, and what techniques are commonly used?
Support Vector Machines (SVM)

1. How is the support vector machine used to classify the data?

2. Write a program to train an SVM classifier on the iris dataset using sklearn. Try
different kernels and the associated hyperparameters. Train the model with the
following set of hyperparameters: RBF kernel, gamma=0.5, one-vs-rest classifier, no
feature normalization. Also try C=0.01, 1, 10. For the above set of hyperparameters,
find the best classification accuracy along with the total number of support vectors on
the test data.
3. Explain Support Vector Machines in detail.

Iris Dataset

1. Describe the Iris dataset and its significance in machine learning. What are the
features and target variables in the Iris dataset? How is the Iris dataset typically used
to demonstrate classification algorithms?
2. What is the Iris Dataset? Build a model that can predict the class from the first four
measurements.
3. Write a Python program to build a K-nearest neighbor model that can predict the class
from the Iris dataset.

Model and Model Fitting

1. What is a model in the context of machine learning? Explain the difference between
supervised and unsupervised learning models.
2. Discuss the need for fitting the model in Multiple Regression.

Maximum Likelihood Estimation

1. Write a short note on Maximum Likelihood Estimation.

2. Discuss the process of simple linear regression. Explain how gradient descent and
maximum likelihood estimation are used to fit the model.

Digression

1. Explain Digression in detail.

2. Write a short note on digression with code.

Regularization

1. Explain in detail the regularization technique in machine learning.

2. Illustrate regularization.
3.
K-Nearest Neighbors (K-NN)

1. Explain the K-Nearest Neighbors Algorithm using the Iris dataset.

2. Explain the K-Nearest Neighbors (K-NN) algorithm with an example.
3. Write a Python program to build a K-nearest neighbor model that can predict the class
from the Iris dataset.

Standard Errors and Regression Coefficients

1. Explain the Standard errors of Regression Coefficients.

2. Illustrate standard errors of regression coefficients.
MODULE 4

Decision Trees

1. Illustrate the working of decision tree and explain the importance of entropy in
decision trees.
2. Can decision trees handle continuous data? If so, how is entropy used to handle
continuous data in decision trees? What are the limitations of decision trees?
3. Discuss decision trees in detail and provide a Python program to create a decision
tree.
4. Describe the decision tree process with Python and demonstrate the ID3 algorithm.
5. Consider the following dataset. Write a program to demonstrate the working of the
decision tree based ID3 algorithm.

6. Explain the role of entropy and entropy partition in creating a decision tree with
explanation and Python code.
7. Describe how entropy is used to create a decision tree and provide an example to
illustrate the process.

Feedforward Neural Networks & Backpropagation

1. Define a feedforward neural network and explain the backpropagation method for
training neural networks.
2. Describe the basic architecture of a feedforward neural network and explain the
concept of a loss function.
3. Discuss the role of the backpropagation algorithm in training neural networks.
4. Explain layer abstraction in deep learning and provide a Python program to compute
loss and optimization in deep learning.
5. Illustrate K-Nearest Neighbors with code.
6. Define neural networks and explain implementing AND function using the perceptron
algorithm.
7. Illustrate the backpropagation algorithm, its importance in training neural networks,
and how gradients are computed and weights are updated.
Deep Learning vs. Machine Learning

1. Describe how deep learning differs from machine learning.

2. Explain deep learning and how it differs from traditional machine learning methods.
Include the general architecture of a deep learning model.
3. Define a loss function and discuss its importance in deep learning. Describe common
loss functions used for different applications.

Artificial Neural Networks

1. Illustrate the working of artificial neural networks.

2. Explain neural networks as a sequence of layers with functions.
3. Describe the basic structure and function of a perceptron and its role as a building
block in feedforward neural networks.
4. Illustrate the working of the perceptron using OR Gate and AND Gate as examples.

Clustering & K-Means

1. Define clustering and explain the K-means clustering algorithm in detail.

2. Describe the basic idea behind clustering algorithms using color quantization as an
example.
3. Consider the dataset with coordinates and cluster labels, compute the rand index for
various clustering methods, visualize the dataset, and determine which algorithm
recovers the true clusters.
4. Explain the bottom-up hierarchical clustering approach with examples.
5. Illustrate K-means clustering with examples.

Optimization and Loss Functions

1. Define an optimization algorithm and explain its role in training deep learning
models. Describe gradient descent and its variants.
2. Define entropy and write code for entropy calculation.
3. Write a function to compute gradients for backpropagation.
4. Write code to train a network that computes XOR using a new framework.
5. Write code to generate any number of clusters by performing the appropriate number
of unmerges.
6. Explain the process of training a neural network on the MNIST dataset, including
architecture, input preprocessing, evaluation metrics, and a summary of the network's
performance.

Miscellaneous

1. Build and explain the Random Forests algorithm.

2. Compute tensors in deep learning by implementing concepts in Python.
3. Construct linear layers with implementation in Python.
4. Write a Python program to train a network that can compute XOR.
MODULE 5

Gibbs Sampling & Topic Modeling

1. Define and explain Gibbs Sampling with an example.

2. Describe Gibbs Sampling and its application in machine learning or statistical
modeling. Provide an example of using Gibbs Sampling to estimate parameters in a
Bayesian model.
3. Summarize topic modeling with reference to topic-word distribution and document-
topic distribution.
4. Build with relevant Python code and explain topic modeling for natural language
processing.

Recurrent Neural Networks (RNNs)

1. Write a note on Recurrent Neural Networks (RNNs).

2. What are Recurrent Neural Networks (RNNs), and how do they differ from
feedforward neural networks? What are Long Short-Term Memory (LSTM)
networks?
3. Explain the architecture and function of recurrent neural networks (RNNs). Provide
an example of using a character-level RNN in a text generation task.
4. Describe the architecture of a recurrent neural network (RNN) and its application in
sequential data modeling. Implement a simple character-level RNN using Python and
train it on a text dataset.
5. Explain Recurrent Neural Network in detail.

Word Clouds & n-Gram Language Models

1. Explain Word Clouds and n-Gram Language Models.

2. Discuss word clouds and write a Python program to generate word clouds.
3. Explain Word cloud approach in data visualization using Python code snippet.
4. What is an n-gram in the context of language modeling? Explain the differences
between unigrams, bigrams, and trigrams.
5. Describe n-Gram language models in detail.
6. Discuss n-gram language models and their application in NLP. How do these models
help in understanding the context within a text? Provide an example.
7. Explain how grammars are used in modeling languages.

Recommender Systems

1. Write a note on recommender systems.

2. Explain item-based collaborative filtering and matrix factorization.
3. Describe item-based collaborative filtering and how it differs from user-based
collaborative filtering.
4. How does item-based collaborative filtering generate recommendations in a
recommendation system?
5. Explain matrix factorization in the context of recommender systems. Discuss how it is
used to improve recommendation accuracy and provide an example.
6. Discuss the techniques used for recommender systems: (i) User-based collaborative
filtering (ii) Item-based collaborative filtering.
7. Write a code to find the interests most similar to Big Data (interest 0) using item-
based collaborative filtering.

Centrality Measures & Network Analysis

1. Write a note on betweenness centrality and eigenvector centrality.

2. Discuss the following metrics used for network analysis: (i) Betweenness centrality
(ii) Closeness centrality (iii) Eigenvector centrality.
3. With an example, explain the DataSciencester network sized by betweenness
centrality.
4. Write a code to find an eigenvector using matrix_times_vector.
5. Write a code to explain the DataSciencester network sized by PageRank.
6. Define network analysis and how two centrality measures are used to evaluate node
importance in a network. Calculate the degree centrality and betweenness centrality of
nodes in a small social network graph.
7. Describe the function of a recurrent layer in a recurrent neural network (RNN). (Note:
This question is related to RNNs but can be cross-referenced here for context.)

PageRank Algorithm & Graphs

1. Illustrate the PageRank algorithm and its application in directed graphs. How does it
work and what is its significance in network analysis?
2. Develop a Python function for the PageRank algorithm for a directed graph.

3. Explain PageRank with the Hypertext Induced Topic Selection algorithm in terms of
their underlying principles and use cases.

Additional Topics

1. Compare singular value decomposition with probabilistic matrix factorization in

terms of their suitability for recommendation systems.
2. Write a code to generate sentences using bigrams.

The AI Wealth Creation Blueprint PDF
67% (3)
The AI Wealth Creation Blueprint PDF
50 pages
The Age of AI and Our Human Future (Henry Kissinger, Eric Schmidt Etc.) (Z-Library)
100% (8)
The Age of AI and Our Human Future (Henry Kissinger, Eric Schmidt Etc.) (Z-Library)
148 pages
How To Hack Atm
87% (15)
How To Hack Atm
1 page
Christopher Langan - CTMU, The Cognitive-Theoretic Model of The Universe, A New Kind of Reality Theory
88% (8)
Christopher Langan - CTMU, The Cognitive-Theoretic Model of The Universe, A New Kind of Reality Theory
56 pages
Data Structure and Algorithmic Thinking With Python Data Structure and Algorithmic Puzzles PDF
95% (20)
Data Structure and Algorithmic Thinking With Python Data Structure and Algorithmic Puzzles PDF
471 pages
Movie Recommendation Project Report
90% (10)
Movie Recommendation Project Report
30 pages
Gayle Laakmann McDowell - Cracking The Coding Interview - 189 Programming Questions and Solutions (2015, CareerCup)
81% (48)
Gayle Laakmann McDowell - Cracking The Coding Interview - 189 Programming Questions and Solutions (2015, CareerCup)
708 pages
Gödel, Escher, Bach - An Eternal Golden Braid (20th Anniversary Edition) by Douglas R. Hofstadter (Charm-Quark) PDF
100% (10)
Gödel, Escher, Bach - An Eternal Golden Braid (20th Anniversary Edition) by Douglas R. Hofstadter (Charm-Quark) PDF
821 pages
Cracking The Coding Interview - 189 Programming Questions and Solutions (6th Edition) (EnglishOnlineClub - Com)
100% (10)
Cracking The Coding Interview - 189 Programming Questions and Solutions (6th Edition) (EnglishOnlineClub - Com)
708 pages
Chris Bailey - Hyperfocus - The New Science of Attention, Productivity, and Creativity-Viking (2018)
100% (25)
Chris Bailey - Hyperfocus - The New Science of Attention, Productivity, and Creativity-Viking (2018)
306 pages
The Art of Asking ChatGPT For High-Quality Answers A Complete Guide To Prompt Engineering Techniques (Ibrahim John) (Z-Library)
100% (24)
The Art of Asking ChatGPT For High-Quality Answers A Complete Guide To Prompt Engineering Techniques (Ibrahim John) (Z-Library)
52 pages
The Fabric of Reality
100% (1)
The Fabric of Reality
6 pages
Banana Pancakes - Ukulele Chord Chart
100% (1)
Banana Pancakes - Ukulele Chord Chart
2 pages
75 Productivity Hacks - System Sunday
100% (7)
75 Productivity Hacks - System Sunday
75 pages
Data Science Textbook PDF
100% (2)
Data Science Textbook PDF
646 pages
Cs 229, Autumn 2016 Problem Set #2: Naive Bayes, SVMS, and Theory
No ratings yet
Cs 229, Autumn 2016 Problem Set #2: Naive Bayes, SVMS, and Theory
20 pages
Military Remote Viewing Manual
100% (5)
Military Remote Viewing Manual
72 pages
Machine Learning For Humans
100% (4)
Machine Learning For Humans
97 pages
Dsa QB
No ratings yet
Dsa QB
2 pages
Syllabus AIML
No ratings yet
Syllabus AIML
14 pages
data_science_syllabus
No ratings yet
data_science_syllabus
4 pages
1152CS239-Intro. To Data Science-Syllabus
No ratings yet
1152CS239-Intro. To Data Science-Syllabus
6 pages
Datascience
No ratings yet
Datascience
8 pages
Data Science-1
No ratings yet
Data Science-1
6 pages
Data Science Assignment
No ratings yet
Data Science Assignment
9 pages
Data Science Master
No ratings yet
Data Science Master
11 pages
Data Science Master Program Syllabus
No ratings yet
Data Science Master Program Syllabus
28 pages
6C - Data Science - Syllabus - 01
No ratings yet
6C - Data Science - Syllabus - 01
4 pages
Module_2_Answers_Corrected
No ratings yet
Module_2_Answers_Corrected
5 pages
syllabus sem 6
No ratings yet
syllabus sem 6
6 pages
Data Science New Report
No ratings yet
Data Science New Report
39 pages
EDA - With Python Question Bank
No ratings yet
EDA - With Python Question Bank
3 pages
Machine Learning Assignment Solution
No ratings yet
Machine Learning Assignment Solution
30 pages
Syllabus_Principle of Data Science
No ratings yet
Syllabus_Principle of Data Science
4 pages
DS QB
No ratings yet
DS QB
6 pages
Data+Analytics+Detailed+Syllabus
No ratings yet
Data+Analytics+Detailed+Syllabus
26 pages
ESE-Theory Question -bank
No ratings yet
ESE-Theory Question -bank
6 pages
DS Unit 2
No ratings yet
DS Unit 2
50 pages
30 Data Science Minor
No ratings yet
30 Data Science Minor
18 pages
DS TANSCHE 03.06.2024
No ratings yet
DS TANSCHE 03.06.2024
23 pages
Data Science
No ratings yet
Data Science
6 pages
Assignment Unit I and II
No ratings yet
Assignment Unit I and II
3 pages
Dsbda Lab Manual Merged
No ratings yet
Dsbda Lab Manual Merged
117 pages
Birla Institute of Technology & Science, Pilani: Work Integrated Learning Programmes Part A: Content Design
No ratings yet
Birla Institute of Technology & Science, Pilani: Work Integrated Learning Programmes Part A: Content Design
6 pages
Data Analyst Nanodegree Program - Syllabus
50% (2)
Data Analyst Nanodegree Program - Syllabus
7 pages
Nd002 Syllabus 2018 June v9
No ratings yet
Nd002 Syllabus 2018 June v9
5 pages
Data Science
No ratings yet
Data Science
15 pages
Cat1 QB
No ratings yet
Cat1 QB
2 pages
Data Science Topics
No ratings yet
Data Science Topics
7 pages
Dat QB
No ratings yet
Dat QB
8 pages
365 Data Science Axs
No ratings yet
365 Data Science Axs
103 pages
Full Stack Data Science
No ratings yet
Full Stack Data Science
54 pages
CS3352 FDS
No ratings yet
CS3352 FDS
23 pages
Data Science Course Outline CES LUMS
No ratings yet
Data Science Course Outline CES LUMS
4 pages
Dand Syllabus v7 Terms 1
No ratings yet
Dand Syllabus v7 Terms 1
6 pages
Course Outline - FM217
No ratings yet
Course Outline - FM217
4 pages
FDSA unit 1
No ratings yet
FDSA unit 1
34 pages
Minor Cse Dsv2
No ratings yet
Minor Cse Dsv2
7 pages
Unit - 1& 2 IFS Question Banks
No ratings yet
Unit - 1& 2 IFS Question Banks
3 pages
Data 8 Textbook
No ratings yet
Data 8 Textbook
326 pages
Data_Science_Basics_Module_1
No ratings yet
Data_Science_Basics_Module_1
8 pages
Data Scientist Analyitcs Syllabus - Tech Transition
No ratings yet
Data Scientist Analyitcs Syllabus - Tech Transition
7 pages
Dsbda Lab Manual
No ratings yet
Dsbda Lab Manual
167 pages
Data Science Course in Hyderabad - Innomatics
No ratings yet
Data Science Course in Hyderabad - Innomatics
10 pages
COURSE PLAN - FDS THEORY
No ratings yet
COURSE PLAN - FDS THEORY
8 pages
FDS Important Q
No ratings yet
FDS Important Q
5 pages
End Sem PYQ
No ratings yet
End Sem PYQ
8 pages
DSBDAL Lab Manual
No ratings yet
DSBDAL Lab Manual
26 pages
Data Science & Machine Learning 2024
No ratings yet
Data Science & Machine Learning 2024
2 pages
FDS 2 Marks 50 Questions
No ratings yet
FDS 2 Marks 50 Questions
2 pages
CD 404 Imp Que of Data Science
No ratings yet
CD 404 Imp Que of Data Science
3 pages
DSBDA Lab Plan
No ratings yet
DSBDA Lab Plan
5 pages
Digital Unit Plan Template-1
No ratings yet
Digital Unit Plan Template-1
5 pages
New Microsoft Word Document
No ratings yet
New Microsoft Word Document
5 pages
DS-DS Lab-1
No ratings yet
DS-DS Lab-1
4 pages
Data Science and Its Applications (21AD62) Lab Manual
No ratings yet
Data Science and Its Applications (21AD62) Lab Manual
26 pages
Python for Data Science For Dummies
From Everand
Python for Data Science For Dummies
John Paul Mueller
No ratings yet
The Secrets of A Slot Machine
No ratings yet
The Secrets of A Slot Machine
4 pages
Roadmap How To Learn AI in 2024 (Uncovered AI)
No ratings yet
Roadmap How To Learn AI in 2024 (Uncovered AI)
6 pages
Teas Topics To Study
100% (12)
Teas Topics To Study
6 pages
My Ai Cheat List
100% (11)
My Ai Cheat List
3 pages
Tech Trend 2024 Report-2
No ratings yet
Tech Trend 2024 Report-2
11 pages
2045: The Year Man Becomes Immortal
No ratings yet
2045: The Year Man Becomes Immortal
9 pages
From Music To Mathematic
100% (1)
From Music To Mathematic
4 pages
Mind Control Patents
100% (1)
Mind Control Patents
41 pages
Rationality From AI To Zombies
86% (7)
Rationality From AI To Zombies
1,813 pages
Attention Is All You Need
67% (3)
Attention Is All You Need
11 pages
Python Programming and Maching Learning 2 in 1 B08Y5DPX32
100% (7)
Python Programming and Maching Learning 2 in 1 B08Y5DPX32
145 pages
Wisc V Interpretation
100% (1)
Wisc V Interpretation
8 pages
Current and Future Trends on AI Applications - Mohammed A Al-Sharafi
No ratings yet
Current and Future Trends on AI Applications - Mohammed A Al-Sharafi
456 pages
Psych Unit 7a Practice Quiz
No ratings yet
Psych Unit 7a Practice Quiz
4 pages
Project - Report - Movie Recommendfation System
No ratings yet
Project - Report - Movie Recommendfation System
31 pages
Building A Recommendation Engine With Scala by Saleem A. Ansari
No ratings yet
Building A Recommendation Engine With Scala by Saleem A. Ansari
5 pages
Tvrev Contextual+Targeting+Report Final
No ratings yet
Tvrev Contextual+Targeting+Report Final
61 pages
Matrix Factorization Techniques For Recommender Systems: Collaborative Filtering
No ratings yet
Matrix Factorization Techniques For Recommender Systems: Collaborative Filtering
20 pages
Impact of a i on Media Entertainment Industry
No ratings yet
Impact of a i on Media Entertainment Industry
32 pages
2504 Chapter 1 2 December 4 2023 6 PM
No ratings yet
2504 Chapter 1 2 December 4 2023 6 PM
33 pages
Hybrid Recommender Systems: A Systematic Literature Review: Erion Çano and Maurizio Morisio
No ratings yet
Hybrid Recommender Systems: A Systematic Literature Review: Erion Çano and Maurizio Morisio
38 pages
Pranav VI Jetflix Sem Project
No ratings yet
Pranav VI Jetflix Sem Project
7 pages
Risk PDF
No ratings yet
Risk PDF
41 pages
BIG DATA ANALYTICS IN RETAIL
No ratings yet
BIG DATA ANALYTICS IN RETAIL
18 pages
Muskan Project Report
No ratings yet
Muskan Project Report
16 pages
ARS CH6 Multiplex
No ratings yet
ARS CH6 Multiplex
81 pages
23mba0045 CB (Da)
No ratings yet
23mba0045 CB (Da)
27 pages
Ml-1-Guided-Bus Report
No ratings yet
Ml-1-Guided-Bus Report
35 pages
Analysis of Superstore Database
No ratings yet
Analysis of Superstore Database
23 pages
Drug Recommendation Using Recurrent Neural Networks Augmented With Cellular Automata
No ratings yet
Drug Recommendation Using Recurrent Neural Networks Augmented With Cellular Automata
7 pages
PDF Computational Collective Intelligence 10th International Conference ICCCI 2018 Bristol UK September 5 7 2018 Proceedings Part I Ngoc Thanh Nguyen download
100% (4)
PDF Computational Collective Intelligence 10th International Conference ICCCI 2018 Bristol UK September 5 7 2018 Proceedings Part I Ngoc Thanh Nguyen download
55 pages
AYASKANTA PARIDA - Report
No ratings yet
AYASKANTA PARIDA - Report
116 pages
UNIT 5
No ratings yet
UNIT 5
9 pages
Emotion-Based_Music_Recommendation_System
No ratings yet
Emotion-Based_Music_Recommendation_System
5 pages
ArtificiaI Intelligence Engineer Brochure
No ratings yet
ArtificiaI Intelligence Engineer Brochure
27 pages
THE-GENAI-REVOLUTION-UNLEASHING-THE-ROLE-OF-INFORMATION-TECHNOLOGY-IN-EDUCATION
No ratings yet
THE-GENAI-REVOLUTION-UNLEASHING-THE-ROLE-OF-INFORMATION-TECHNOLOGY-IN-EDUCATION
21 pages
CSD101 Fundamentals of Data Science Session 1 and 2
No ratings yet
CSD101 Fundamentals of Data Science Session 1 and 2
53 pages
Mini Project
No ratings yet
Mini Project
32 pages
Artificial Intelligence in Marketing
No ratings yet
Artificial Intelligence in Marketing
54 pages
Good Good Good - Modeling Behavior Sequence For Personalized Fund Recommendation With Graphical Deep Collaborative Filtering
No ratings yet
Good Good Good - Modeling Behavior Sequence For Personalized Fund Recommendation With Graphical Deep Collaborative Filtering
14 pages
CS8091 Big Data Analytics MCQ
No ratings yet
CS8091 Big Data Analytics MCQ
22 pages
Mini Proj
No ratings yet
Mini Proj
7 pages
Rl9.1 Recommendation System 1
No ratings yet
Rl9.1 Recommendation System 1
14 pages

TOPIC WISE DSA QUESTIONS

Uploaded by

TOPIC WISE DSA QUESTIONS

Uploaded by

Data Science and its Applications (21AD62)

Probability Theory & Bayes Theorem

1. Write a note on probability theory as applicable to data science. (8)

1. Write a note on normal distribution. (4)

Vectors in Data Science

1. Explain the following:

Measures of Central Tendency & Dispersion

1. Explain the following statistical techniques:

9. Compute code to compute standard deviation. (6)

1. Explain Simpson’s Paradox. (4)

1. Discuss Random variables with an example in detail. (6)

1. Explain the gradient descent approach in detail with a relevant example.

Hypothesis Testing & A/B Testing:

1. Explain in detail on hypothesis testing with example.

Data Cleaning & Munging:

1. Explain data cleaning, data munging, and manipulating Data.

1. Explain the methodologies to extract data from web scraping.

Linear Regression & Error Detection:

Data Handling with Python (CSV, Named Tuples, etc.):

1. Sketch the use of csv.reader, csv.DictReader, and csv.writer in processing Delimited

1. Explain dimensionality reduction in detail.

2. Illustrate tqdm Library functions with an example.

Overfitting and Underfitting

1. Explain underfitting and overfitting in detail.

Naive Bayes Algorithm

1. Explain Naive Bayes as a really dumb spam filter.

Logistic Function and Logistic Regression

1. Explain the use of the logistic function in logistic regression in detail.

Simple Linear Regression and Gradient Descent

Feature Extraction and Feature Selection

1. How is the support vector machine used to classify the data?

Model and Model Fitting

Maximum Likelihood Estimation

1. Write a short note on Maximum Likelihood Estimation.

1. Explain Digression in detail.

1. Explain in detail the regularization technique in machine learning.

1. Explain the K-Nearest Neighbors Algorithm using the Iris dataset.

Standard Errors and Regression Coefficients

1. Explain the Standard errors of Regression Coefficients.

Feedforward Neural Networks & Backpropagation

1. Describe how deep learning differs from machine learning.

Artificial Neural Networks

1. Illustrate the working of artificial neural networks.

Clustering & K-Means

1. Define clustering and explain the K-means clustering algorithm in detail.

Optimization and Loss Functions

1. Build and explain the Random Forests algorithm.

Gibbs Sampling & Topic Modeling

1. Define and explain Gibbs Sampling with an example.

Recurrent Neural Networks (RNNs)

1. Write a note on Recurrent Neural Networks (RNNs).

Word Clouds & n-Gram Language Models

1. Explain Word Clouds and n-Gram Language Models.

1. Write a note on recommender systems.

Centrality Measures & Network Analysis

1. Write a note on betweenness centrality and eigenvector centrality.

PageRank Algorithm & Graphs

1. Compare singular value decomposition with probabilistic matrix factorization in

You might also like