2 Data Analytics
2 Data Analytics
Data Science
Data/
Types of Data Analytics
• Descriptive Analytics: This is a method for quantitatively
describing the main features of a collection of data
Variables: Categorical, Ratio, Independent,
Dependent
Frequency Distribution: Histogram, Normal
distribution
Measures of Centrality: Mean, Median, Mode
Dispersion of a Distribution: Range, Interquartile
range, Variance, Standard deviation
• Diagnostic Analytics/Causal analysis: Are used for
discovery, or to determine why something happened
Correlation: Pearson’s r correlation
Types of Data Analytics
• Predictive Analytics: These analytics are about
understanding the future using the data and the trends
we have seen in the past, as well as emerging new
contexts and processes. They are done in stages:
Traditional Programming
Data
Computer Output
Program
Machine Learning
Data
Computer Program/
Output Rules
ML Related Fields
data Assignment1:
mining control theory Discuss their relation
to ML
statistics
decision theory
information theory machine
learning
cognitive science
databases
psychological models
evolutionary neuroscience
models
Machine learning is primarily concerned with the accuracy
and effectiveness of the computer system in performing
complex tasks.
Machine Learning Domain
Basic Algorithms
Supervised Vs Unsupervised Learning
Algorithms
Semi-Supervised learning
Machine learning structure
Supervised learning
Machine learning structure
Unsupervised learning
Learning Techniques
Supervised learning categories and techniques
Linear classifier (numerical functions)
Parametric (Probabilistic functions)
Naïve Bayes, Gaussian discriminant analysis (GDA), Hidden Markov
models (HMM), Probabilistic graphical models
Non-parametric (Instance-based functions)
K-nearest neighbors, Kernel regression, Kernel density estimation,
Local regression
Non-metric (Symbolic functions)
Classification and regression tree (CART), decision tree
Aggregation (Ensembling)
Bagging (bootstrap + aggregation), Adaboost, Random forest
Learning Techniques
Unsupervised learning categories and techniques
Clustering
K-means clustering
Spectral clustering
Density Estimation
Gaussian mixture model (GMM)
Graphical models
Dimensionality reduction
Principal component analysis (PCA)
Factor analysis
Deep Learning
Deep Learning
Deep Learning
Weights to learn!
Weights to learn!
• Lots of hidden layers
• Depth = power (usually)
Weights to learn!
Deep Learning
Weights to learn!
Deep Learning- Loos gradients
• Denoted as (diff notations):