Data Science: Concepts and Practice: Course Slides

This document provides an overview of a course on data science concepts and practice. It introduces key topics like classification, regression, clustering, and association analysis. For each topic it provides examples of common algorithms and real-world applications. The course outline also lists core algorithms that will be covered, including decision trees, naive Bayesian, k-means clustering, and Apriori association rules. The document establishes a foundation for understanding common data science tasks, techniques, and applications.

Uploaded by

shabir Ahmad
Copyright
© All Rights Reserved

Data Science: Concepts and Practice

Course slides
Course Book
Data Science: Concepts and Practice
Authors: Vijay Kotu & Bala Deshpande
Publisher: Morgan Kaufmann

Course Software
www.rapidminer.com
Free Download

1. Introduction
What is Data Science
Models
Types of Data Science
Task | Description | Algorithms | Examples
--- | --- | --- | ---
Classification | Predict whether a data point belongs to one of a set of predefined classes. The prediction is based on learning from a known data set. | Decision trees, neural networks, Bayesian models, induction rules, k-nearest neighbors | Assigning voters to known buckets by political party (e.g., soccer moms). Bucketing new customers into one of the known customer groups.
Regression | Predict the numeric target label of a data point. The prediction is based on learning from a known data set. | Linear regression, logistic regression | Predicting the unemployment rate for next year. Estimating an insurance premium.
Anomaly detection | Predict whether a data point is an outlier compared to the other data points in the data set. | Distance-based, density-based, LOF | Detecting fraudulent credit card transactions. Network intrusion detection.
Time series | Predict the value of the target variable for a future time frame based on historical values. | Exponential smoothing, ARIMA, regression | Sales forecasting, production forecasting, virtually any growth phenomenon that needs to be extrapolated.
Clustering | Identify natural clusters within the data set based on inherent properties of the data. | k-means, density-based clustering (DBSCAN) | Finding customer segments in a company based on transaction, web, and customer call data.
Association analysis | Identify relationships within an itemset based on transaction data. | FP-Growth, Apriori | Finding cross-selling opportunities for a retailer based on transaction purchase history.
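
The difference between the first two rows of the table can be made concrete with a small sketch. The code below is an illustration only, not taken from the course material: it uses k-nearest neighbors for classification and ordinary least squares for regression, both written with the Python standard library. The function names `knn_classify` and `fit_line`, and all data values, are hypothetical and chosen just for this example.

```python
from collections import Counter
from math import dist


def knn_classify(train, query, k=3):
    """Classification: predict a predefined class by majority vote
    among the k labeled points closest to the query."""
    nearest = sorted(train, key=lambda item: dist(item[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


def fit_line(xs, ys):
    """Regression: fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical known data set: (annual spend, visits per month) -> customer group
customers = [
    ((1.0, 1.0), "occasional"),
    ((1.5, 2.0), "occasional"),
    ((2.0, 1.5), "occasional"),
    ((8.0, 9.0), "frequent"),
    ((9.0, 8.5), "frequent"),
    ((8.5, 10.0), "frequent"),
]
# The output is one of the predefined classes.
new_customer_group = knn_classify(customers, (7.5, 8.0), k=3)

# Hypothetical history: year index -> unemployment rate (made-up numbers)
years = [1, 2, 3, 4]
rates = [5.0, 5.5, 6.0, 6.5]
slope, intercept = fit_line(years, rates)
# The output is a number, not a class.
next_year_rate = slope * 5 + intercept

print(new_customer_group, next_year_rate)
```

Both functions learn from a known data set, as the table describes; what differs is the kind of target they predict, a class label in one case and a numeric value in the other.
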
Course outline

Process Basics
  Data Science Process
  Data Exploration
  Model Evaluation

Core Algorithms
  Classification
    Decision Trees
    Rule Induction
    k-Nearest Neighbors
    Naïve Bayesian
    Artificial Neural Networks
    Support Vector Machines
    Ensemble Learners
  Regression
    Linear Regression
    Logistic Regression
  Association Analysis
    Apriori
    FP-Growth
  Clustering
    k-Means
    DBSCAN
    Self-Organizing Maps

Common Applications
  Text Mining
  Time Series Forecasting
  Anomaly Detection
  Feature Selection
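
As a taste of the clustering algorithms listed in the outline, here is a minimal sketch of k-means in plain Python. It is an illustration under simplifying assumptions, not the course's or RapidMiner's implementation: the initial centroids are supplied by hand rather than chosen at random, and the function name `k_means` and the sample points are hypothetical.

```python
from math import dist


def k_means(points, centroids, iterations=10):
    """Alternate between assigning each point to its nearest centroid
    and moving each centroid to the mean of its assigned points."""
    centroids = [tuple(c) for c in centroids]
    clusters = [[] for _ in centroids]
    for _ in range(iterations):
        # Assignment step: group points by nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)),
                          key=lambda i: dist(p, centroids[i]))
            clusters[nearest].append(p)
        # Update step: recompute each centroid as the mean of its members.
        new_centroids = []
        for old, members in zip(centroids, clusters):
            if members:
                mean = tuple(sum(c) / len(members) for c in zip(*members))
                new_centroids.append(mean)
            else:
                new_centroids.append(old)  # leave an empty cluster's centroid in place
        if new_centroids == centroids:
            break  # assignments are stable, so the algorithm has converged
        centroids = new_centroids
    return centroids, clusters


# Hypothetical unlabeled data: no class labels are given, unlike classification.
points = [(1.0, 1.0), (1.5, 2.0), (2.0, 1.5),
          (8.0, 9.0), (9.0, 8.5), (8.5, 10.0)]
centroids, clusters = k_means(points, centroids=[(0.0, 0.0), (10.0, 10.0)])
print(centroids)
print(clusters)
```

With these well-separated points the two groups are recovered after one update; on real data the result depends on the starting centroids and on the choice of k.
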
