EE353 - 769 06 Intro To ML

ML for Smart Monkeys

Amit Sethi
Faculty member, IIT Bombay

Image source: Pixabay.com


ML is…
• The practice of using related data to automatically estimate models
that make useful predictions about new data, where the model is too
complex for standard statistical analysis, e.g.
• Improve the accuracy of image classification using labeled images
• Improve the win percentage of AlphaGo using many simulated game-move
sequences and their results
• Improve the Turing-test confusion between human and machine in NLP Q&A
using a large sample of text that includes Q&A
When not to use ML
• Possible inputs are countable and few
• Use look-up tables
• Algorithm is well-known and efficient
• E.g. sorting, Dijkstra's shortest path
• Model is well-known and tractable
• Use statistical estimation
• There is no notion of contiguity
• Use discrete-variable methods or give up
• Lack of data
• Use transfer learning or few-shot learning, or give up
When to use ML
• Possible inputs are many or continuous
• No well-known or efficient algorithm
• Model is not well-known or tractable
• Strong notion of contiguity
• Good amount of data
• Desired output known
• Well-defined inputs
Sweet spot for ML

• Lots of structured data

• Explainability is not critical

• Prediction accuracy is the primary goal

• Underlying model is complex but stationary

Image courtesy: Pixabay.com


ML model training and deployment

• Training on past data → ML gives a model → Prediction on future data

Supervised Machine Learning System - Training

• Elements of a model:
• Input xi
• Function fθ(xi) with parameters θ
• Hyper-parameters (and regularization)
• Loss function
• Target output ti
• Utility of the model:
• Bring fθ(xi) close to ti
• Learning algorithm: minimize the loss L(ti, fθ(xi), θ)
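As a minimal sketch of these components, assuming an illustrative scalar linear model fθ(x) = θx and a squared-error loss (neither is prescribed by the slides):

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.uniform(-1, 1, size=100)           # inputs x_i
t = 3.0 * x + rng.normal(0, 0.1, 100)      # target outputs t_i

theta = 0.0                                # parameter theta, to be learned
lr = 0.1                                   # hyper-parameter: learning rate

for _ in range(200):                       # learning algorithm: gradient descent
    pred = theta * x                       # model output f_theta(x_i)
    grad = np.mean(2 * (pred - t) * x)     # gradient of the squared-error loss
    theta -= lr * grad                     # bring f_theta(x_i) closer to t_i

print(theta)                               # should approach 3.0
```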
Components of a Trained ML System

Supervised Machine Learning System - Testing

• Input xi → Model (with learned parameters θ and fixed hyper-parameters)
→ Output fθ(xi)
Mathematically speaking…
§ Determine f such that ti = f(xi) and g(T, X) is minimized for an unseen
set of (T, X) pairs, where T is the ground truth that cannot be used

§ The form of f is fixed, but some parameters can be tuned:

§ So y = fθ(x), where x is observed and y needs to be inferred

§ E.g. y = 1 if mx > c, and y = 0 otherwise, so θ = (m, c); a sketch of
this rule follows below

§ Machine learning is concerned with designing algorithms that learn
"better" values of θ given "more" x (and t) for a given problem
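A tiny sketch of the thresholding example above, with illustrative values for θ = (m, c):

```python
# y = 1 if m*x > c, else 0, with theta = (m, c); values are illustrative.
def predict(x, m, c):
    return 1 if m * x > c else 0

theta = (2.0, 1.0)             # theta = (m, c)
print(predict(0.8, *theta))    # 2.0 * 0.8 = 1.6 > 1.0, so y = 1
print(predict(0.3, *theta))    # 2.0 * 0.3 = 0.6 <= 1.0, so y = 0
```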
Parameters and Hyperparameters (Key Concept)

● Parameters: Variables whose values are updated during the training
process of the model, e.g.
○ Feature coefficients in a regression model
○ Weights of a neural network
● Hyperparameters: Variables whose values are fixed by the model
developer before the learning process begins, e.g.
○ Number of variables considered at a tree node
○ Height of a tree
○ Number of layers of a neural network
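A hedged illustration with scikit-learn (assumed available): the constructor arguments are hyperparameters fixed before training, while the fitted attributes are parameters learned from data.

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# C is a hyperparameter; coef_ holds the learned feature coefficients.
lr = LogisticRegression(C=1.0, max_iter=200).fit(X, y)
print(lr.coef_.shape)

# max_depth is a hyperparameter; the tree structure is learned in training.
tree = DecisionTreeClassifier(max_depth=3).fit(X, y)
print(tree.tree_.node_count)
```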
Types of ML problems
● Supervised learning: uses labeled data
○ Classification: Labels are discrete
○ Regression: Labels are continuous
○ Ranking: Labels are ordinal
● Unsupervised learning: uses unlabeled data
○ Clustering: Divide data into discrete groups
○ Dimension reduction: Represent data with fewer numbers
● Somewhere in between: fewer labels than one per example
○ Semi-supervised learning: some examples are labeled
○ Weakly supervised learning: groups of examples are labeled
○ Reinforcement learning: Label (reward) is available after a sequence of steps
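A minimal sketch contrasting the first two categories above (scikit-learn assumed): classification uses the labels y, while clustering ignores them.

```python
from sklearn.cluster import KMeans
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression

X, y = load_iris(return_X_y=True)

# Supervised: the labels y guide training.
clf = LogisticRegression(max_iter=200).fit(X, y)

# Unsupervised: groups are found from X alone.
clusters = KMeans(n_clusters=3, n_init=10).fit_predict(X)

print(clf.predict(X[:5]), clusters[:5])
```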
Supervised Learning
● Predictor variables/features and a target variable (label)
● Aim: Predict the target variable (label), given the predictor variables
○ Classification: Target variable (y) consists of categories
○ Regression: Target variable is continuous

Broad types of ML problems

Output →       Categorical      Ordinal             Continuous
Supervised     Classification   Ranking             Regression
(Examples)     {Cats, dogs}     {Low, Med, High}    [-20, +10)
Unsupervised   Clustering                           Dimension reduction

Some popular ML frameworks

               Classification         Regression   Clustering       Dimension reduction
Vector data    Logistic regression,   Linear       K-means,         PCA, k-PCA,
               SVM, RF, NN            regression   Fuzzy C-means,   LLE, ISOMAP
                                                   DB-SCAN
Series, text   RNN, LSTM, Transformer, 1-D CNN, HMM
Images         2-D CNN, MRF
Video, MRI     3-D CNN, CNN+LSTM, MRF
Recipe for ML training

• Decide on the type of the ML problem

• Prepare data

• Shortlist ML frameworks

• Prepare training, validation, and test sets

• Train, validate, repeat

• Use test data only once
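A minimal sketch of the split-preparation step above, using scikit-learn's train_test_split twice (the 70/15/15 percentages follow a later slide):

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)

# First carve out the 15% test set, then split the rest into train/validation.
X_rest, X_test, y_rest, y_test = train_test_split(
    X, y, test_size=0.15, random_state=0)
X_train, X_val, y_train, y_val = train_test_split(
    X_rest, y_rest, test_size=0.15 / 0.85, random_state=0)

print(len(X_train), len(X_val), len(X_test))  # roughly 70% / 15% / 15%
```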


Preparing data

• Remove useless data
• No variance
• Falsely assumed to be available
• Reduce redundancy
• Correlated variables (Pearson and Spearman)
• Handle missing data
• Impute, if sporadic
• Drop, if too frequent
• Transform variables
• Convert discrete to one-hot
• Normalize continuous variables
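A minimal sketch of the transformation step above with pandas and scikit-learn; the column names are illustrative, not from the slides.

```python
import pandas as pd
from sklearn.preprocessing import StandardScaler

df = pd.DataFrame({
    "color": ["red", "green", "red", "blue"],   # discrete variable
    "price": [120.0, 200.0, 150.0, 90.0],       # continuous variable
})

# Convert the discrete variable to one-hot columns.
one_hot = pd.get_dummies(df["color"], prefix="color")

# Normalize the continuous variable to zero mean, unit variance.
df["price_z"] = StandardScaler().fit_transform(df[["price"]])

print(pd.concat([one_hot, df["price_z"]], axis=1))
```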
Examples of structure in the data

• Records, e.g.:

  Product SKU   Price   Margin   Volume
  A123ajkhdf    $120    30%      1,000,000
  B456ddsjh     $200    10%      2,000,000

• Temporal order
• Spatial order
• Web of relationships

Images courtesy: Pixabay.com
Model choice and rigorous validation are very important

[Figure: training and validation error versus model complexity. Training
error keeps decreasing with complexity, while validation error is
U-shaped: underfitting on the left, the sweet spot at its minimum, and
overfitting on the right.]
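A minimal sketch of the curve above (scikit-learn assumed): sweeping a complexity knob, here tree depth, shows training fit improving steadily while validation performance peaks at a sweet spot.

```python
import numpy as np
from sklearn.model_selection import validation_curve
from sklearn.tree import DecisionTreeRegressor

rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(200, 1))
y = np.sin(X).ravel() + rng.normal(0, 0.2, 200)

depths = [1, 2, 4, 8, 16]
train_scores, val_scores = validation_curve(
    DecisionTreeRegressor(), X, y,
    param_name="max_depth", param_range=depths, cv=5)

for d, tr, va in zip(depths, train_scores.mean(1), val_scores.mean(1)):
    print(d, round(tr, 3), round(va, 3))  # training keeps improving; validation peaks
```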
Bias-variance trade-off
Generalization of a model is bounded by two undesirable outcomes: high
bias and high variance.
● Underfitting: High bias, low variance
● Overfitting: Low bias, high variance

Bias occurs when an algorithm has limited flexibility to learn the true signal
from the dataset. High bias can cause an algorithm to miss the relevant
relations between features and target outputs (underfitting).
Variance is an error from sensitivity to small fluctuations in the training set.
High variance can cause an algorithm to model the random noise in the
training data, rather than the intended outputs (overfitting).
Regularization is a key concept in ML
● Regularization means constraining the model

● More constraints may reduce the model's fit on the training data

● However, they may improve its fit on validation and test data

● The training performance of a more constrained model is more likely to
reflect its test performance
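A minimal sketch of this idea with ridge regression (scikit-learn assumed), where alpha sets the strength of the constraint on the coefficients:

```python
import numpy as np
from sklearn.linear_model import LinearRegression, Ridge

rng = np.random.default_rng(0)
X = rng.normal(size=(50, 20))             # more features than the signal needs
y = X[:, 0] + rng.normal(0, 0.5, 50)      # only the first feature matters

unconstrained = LinearRegression().fit(X, y)
constrained = Ridge(alpha=10.0).fit(X, y)  # alpha: regularization strength

print(np.abs(unconstrained.coef_).sum())   # larger: coefficients fit freely
print(np.abs(constrained.coef_).sum())     # smaller: coefficients shrunk
```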
Loss versus performance metric
● Loss is a convenient expression used to guide the learning (optimization)
● Loss is related to the performance metric, but it is not the same
● Loss also includes regularization
● The performance metric is what is used to judge the model
● Only the performance metric on held-out (validation or test) data is
meaningful
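A small illustration of the distinction (scikit-learn assumed): two models with identical accuracy (the performance metric) can have very different log loss.

```python
import numpy as np
from sklearn.metrics import accuracy_score, log_loss

y_true = np.array([0, 0, 1, 1])
p_a = np.array([0.4, 0.4, 0.6, 0.6])   # barely correct probabilities
p_b = np.array([0.1, 0.1, 0.9, 0.9])   # confidently correct probabilities

# Same accuracy, different loss: the metric alone cannot distinguish them.
print(accuracy_score(y_true, (p_a > 0.5).astype(int)), log_loss(y_true, p_a))
print(accuracy_score(y_true, (p_b > 0.5).astype(int)), log_loss(y_true, p_b))
```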
Preparing data for training and validation
• Data splits:
• Training  Used to optimize the parameters (e.g. random 70%)
• Validation  Used to compare models (e.g. random 15%)
• Testing  One final check after multiple rounds of validation (e.g. random 15%)
• Cross-validation:
• K-folds: One fold for validation, K-1 folds for training
• Rotate folds K times
• Select the framework (hyperparameters) with the best average performance
• Re-train best framework on entire data
• Test one final time on held-out data that was not a part of any fold
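A minimal sketch of this recipe with scikit-learn's cross_val_score:

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

X, y = load_iris(return_X_y=True)

model = LogisticRegression(max_iter=200)
scores = cross_val_score(model, X, y, cv=5)  # 5 folds, each used once for validation
print(scores.mean())                          # average performance, used to compare frameworks

model.fit(X, y)                               # re-train the best framework on all the data
```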
Cross-validation
● Model performance measurement depends on how the data is split
● A single split may not represent the model's ability to generalize
● Solution: cross-validation, especially when data is scarce
● Con: more computation
ML can fail to perform in deployment
• Lack of training diversity: data had limited confounders
• Single speaker, author, camera, background, accent, ethnicity, etc.
• Data imbalance between high-value rare and more common examples
• Proxy label leak during training:
• E.g. Only speakers A and B provide emotion “anger,” so ML confused their
voice characteristics with “anger”
• Too much manual cleansing of training data
• Too little training data, and very complex models
• Concept drift: The assumptions behind training are no longer valid
ML life stages

[Figure: stages of the ML life cycle; most ML courses cover only the
model training stage.]
Relation of ML to other fields

[Figure: nested fields: Deep Learning within Neural Networks, within
Machine Learning, within Artificial Intelligence.]
Relation of ML to other fields

[Figure: Machine Learning draws on Probability and Statistics,
Optimization, Linear Algebra, and Programming, and overlaps with Data
Science.]
