ML Notes

This document discusses various machine learning concepts including learning systems, management of information and material flows, CRUD operations in systems, qualitative and quantitative data, the goals of machine learning, defining and processing training data, function approximation algorithms, approximation in different contexts, regression, classification vs clustering, gradient descent, decision trees, and k-fold cross validation. It provides examples to illustrate key machine learning techniques and challenges.

Uploaded by

Pooja Gangapure


11/1- Introduction

18/1-

Learning System-
It is a system that compiles data about resources for learning. Every system has its own
Input-Processing-Output methods. To create and store learning resources, you also need some
way to access them. In our system, the resources are software, hardware, the internet
connection, switches and hubs (any I/O devices).
Management handles the information flow, the material flow and the money flow.
An LMS (Learning Management System) enables you to create, manage and deliver learning
content. Ex. MS Word (a tool for creating content).
When a system allows CRUD (Create, Read, Update, Delete), it allows you to go for simulation.
Here, simulation basically means performing some sort of CRUD operations.
Ex- Moodle

Tom Mitchell – A computer program is said to learn from experience “E” with respect to some
class of tasks “T” and performance measure “P”, if its performance at tasks in T, as measured by
P, improves with experience E. Ex- Email Spam Detection

Dealing with numbers – quantitative (Statistics)

Dealing with numbers, letters and strings – qualitative (Fuzzy logic)
//Fuzzy logic allows multiple truth values: the truth value of a variable or statement can be
any value between 0 and 1

GOAL OF ML (ML is a subset of AI)

Achieve a thorough understanding of the nature of the learning process, both human learning
and other forms of learning.

25/1-
Target- How to define a target? What will we need to decide a target? Which parameters
should be taken into consideration?

4 functions of a target-
1. Regression – how many variables there are, which of them are independent and which are
dependent, and how the dependent variables depend on the independent ones
2. Precision and recall (used to measure how accurate the network's predictions are)
3. Supervised/ Unsupervised
4. ?
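
Precision and recall from item 2 can be computed by counting true positives, false positives and false negatives. The sketch below is only an illustration: the function name `precision_recall` and the spam labels are hypothetical and do not come from the lecture.

```python
def precision_recall(y_true, y_pred, positive=1):
    """Return (precision, recall) for one positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if p == positive and t == positive)  # true positives
    fp = sum(1 for t, p in pairs if p == positive and t != positive)  # false positives
    fn = sum(1 for t, p in pairs if p != positive and t == positive)  # false negatives
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Hypothetical labels: 1 = spam, 0 = not spam (made up for illustration).
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 1, 0, 1, 0]
precision, recall = precision_recall(y_true, y_pred)
print(precision, recall)
```

Precision answers "of the mails flagged as spam, how many really were spam", while recall answers "of the real spam mails, how many were flagged".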

Training data – a (usually large) dataset that is used to teach an ML model

Processed data – information

Function approximation algorithm- It is a technique for estimating an unknown underlying
function using the historical or available observations from the domain.
Current data- OLTP (Online Transaction Processing)
Historical data- OLAP (Online Analytical Processing) - used with a neural network, as we are
looking into approximation

27/1-
Approximation – Approximation is used whenever a numerical value, model, structure or
function is either unknown or difficult to compute.

Approximation when the form of function is known (e.g., for loop or when you have to go to
your native place)

Approximation when the form of function is not known, or when the exact value is numerically
difficult to compute, like the value of pi (e.g., a while loop, where you don't know in advance
how many times the loop is going to run). Here we produce an output which is close to the
value of the true function. E.g. travel to Gujarat – Baroda, Surat, Gandhinagar, Ahmedabad

TAYLOR SERIES of a function is an infinite sum of terms which are computed using the
function's derivatives at a single point. Computing many terms numerically is expensive, so
in practice the sum is truncated.
Examples-
S = 1/1! + 2/2! + 3/3! + …. n/n!
S = 1+ x/1! - x/2! + x/3! - ….
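
A truncated series can be evaluated term by term. The sketch below uses the standard Maclaurin series e^x = 1 + x/1! + x^2/2! + x^3/3! + …, which is an assumption about what the series examples are pointing at and not a transcription of them. It also sums the first example series, which approaches e because k/k! = 1/(k-1)!. The function names are hypothetical.

```python
import math


def exp_taylor(x, n_terms):
    """Approximate e**x with the first n_terms terms of its Maclaurin series."""
    total = 0.0
    term = 1.0  # current term, starting at x**0 / 0!
    for k in range(n_terms):
        total += term
        term *= x / (k + 1)  # turn x**k / k! into x**(k+1) / (k+1)!
    return total


def first_example_sum(n):
    """S = 1/1! + 2/2! + ... + n/n!."""
    return sum(k / math.factorial(k) for k in range(1, n + 1))


approx_e = exp_taylor(1.0, 15)
s_15 = first_example_sum(15)
print(approx_e, s_15, math.e)
```

More terms give a closer value but cost more computation, which is the trade-off the definition describes.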

NEWTON’S METHOD can be used to approximate the roots of a polynomial, making it a useful
technique for approximating quantities such as the square root or the reciprocal of a
number.
Examples-
sqrt(3) = “approximated value” but sqrt(9) = “known” (exactly 3)
X ∝ P (X is directly proportional to P)
X ∝ 1/P (X is inversely proportional to P)
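
As a rough illustration of Newton's method, sqrt(a) can be approximated as the positive root of f(x) = x^2 - a using the update x_next = x - f(x)/f'(x). The function name `newton_sqrt`, the tolerance and the starting guess are assumptions chosen for this sketch.

```python
def newton_sqrt(a, tol=1e-12, max_iter=50):
    """Approximate sqrt(a) as the positive root of f(x) = x**2 - a."""
    if a < 0:
        raise ValueError("a must be non-negative")
    if a == 0:
        return 0.0
    x = a if a >= 1 else 1.0  # starting guess (an arbitrary choice)
    for _ in range(max_iter):
        x_next = 0.5 * (x + a / x)  # same as x - (x**2 - a) / (2 * x)
        if abs(x_next - x) < tol:
            return x_next
        x = x_next
    return x


root3 = newton_sqrt(3)  # irrational, so only an approximation is possible
root9 = newton_sqrt(9)  # the exact answer is known to be 3
print(root3, root9)
```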
APPROXIMATION IN REGRESSION-
• Prediction of an output variable when given a set of inputs. The function that truly maps
the input variables to the outputs is not known. It is assumed that some linear or nonlinear
regression model can approximate the mapping from input to output.
• Predicts future values / predicts values based on the current scenario.
• Input is known and output is unknown. Prediction depends solely upon the inputs.
• Data source: OLTP (current data) or OLAP (historical data, e.g. the last 5 or 10 years)

Example 1-

Input = Calorie intake per day

Output = the corresponding blood sugar level (100-140)
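
Example 1 could be approximated with a straight line fitted by least squares. This is only a sketch: the numbers are made up for illustration and are not real measurements, and the function name `fit_line` is hypothetical.

```python
def fit_line(xs, ys):
    """Least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Made-up illustrative values, NOT real medical data.
calories = [1500, 1800, 2000, 2200, 2500]   # input: calorie intake per day
blood_sugar = [100, 110, 118, 126, 140]     # output: blood sugar reading

slope, intercept = fit_line(calories, blood_sugar)
predicted = slope * 2100 + intercept        # prediction for an unseen input
print(slope, intercept, predicted)
```

The fitted line stands in for the unknown true mapping from input to output, which is the sense in which regression is an approximation.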

Example 2-
Classification:
No. of students per class = 146 / 3 ≈ 49 (roughly 50)
Divisions = 3
Specializations = 7
Analysis = The number of classes is fixed in advance

Can we fix the size of a cluster? (The size of a cluster depends upon a logical condition)
ML Cluster 1 = 0 or 1-146
CAD Cluster 2 = 0 or 1-146
Total no. of students = 146

Willing to join ML? Y or N = 0, 1-146

This shows that the values in classification can be estimated in advance, but the size of a
cluster is relatively difficult to estimate.

CLASSIFICATION VS CLUSTERING

LEARNING ALGO 1- Gradient Descent

The network generates its output in the form of a class label, say O (or O1, O2, O3 … if
there are multiple outputs).

If you want the output to approximate 1 and you are getting .18, .199, .96 etc., you need to
introduce weights ‘W’. The number of weights should be equal to the number of inputs. The
loop keeps executing, recomputing the output and adjusting the weights again and again,
until the output meets the expectation. This is how the neural network works.
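
The loop described above can be sketched for a single linear unit trained with gradient descent on the squared error. The linear unit, the learning rate, the stopping tolerance and the input values are all assumptions made for this illustration and are not stated in the notes.

```python
def train_weights(inputs, target, learning_rate=0.1, tol=1e-6, max_epochs=1000):
    """Gradient descent for one linear unit: output = sum(w_i * x_i)."""
    weights = [0.0] * len(inputs)  # one weight per input
    output = 0.0
    for _ in range(max_epochs):
        output = sum(w * x for w, x in zip(weights, inputs))
        error = output - target
        if abs(error) < tol:  # output is close enough to the target: stop
            break
        # The gradient of 0.5 * error**2 with respect to w_i is error * x_i.
        weights = [w - learning_rate * error * x for w, x in zip(weights, inputs)]
    return weights, output


inputs = [0.5, 0.8, 0.2]  # hypothetical input values
weights, output = train_weights(inputs, target=1.0)
print(weights, output)
```

Each pass recomputes the output and nudges every weight against the gradient, so the error shrinks until the output approximates the target of 1.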

K-means Clustering
1/2-
INDUCTIVE CLASSIFICATION LOGIC –
A learning system that learns first-order logic rules. Helpful for classifying things which
belong to two or more classes. Used to classify unseen examples or interpretations.

Reasoning can be –
• Deductive (from general to specific) ; proven through observation ; difficult to
measure the accuracy.
• Inductive ; extracts a likely premise from specific and limited observations

HYPOTHESIS SPACE-
It has a general-to-specific ordering of hypotheses (assumptions). Goal- find the
best-fitting hypothesis for the training data.

The more constraints, the more complex.

CANDIDATE ELIMINATION ALGO-

The function is known but it is difficult / numerically expensive to compute its exact value.
In this case approximation methods are used to find values which are close to the function’s
actual values.
3/2-
DECISION TREE- Uses a tree representation to solve a problem, in which each leaf node
corresponds to a class label and attributes are represented on the internal nodes of the tree.
It is also a visualization technique that shows how to visualize the outcome.
1. Decision node – denotes a choice
2. Chance node – denotes a probability
3. End node – denotes the outcome

Consider whole training set as root.

Recursive induction of decision tree-

Does not use back propagation.
Example: if A is the input and B is the output, then B should not be connected back to A. If
it is connected back to A, we call it back propagation.
Once the output is generated we should not reiterate.
The decision points of the tree are built in a top-down recursive way, so it is sometimes
referred to as a divide and conquer approach, which resembles the traditional "if yes then
do A, if not then do B".
In ascending order
8/2-
Picking best splitting attribute-
Attributes – with 7-8 candidate splitting attributes, how do we split? How do we decide the
root, right and left nodes?
Computation – entropy & information gain (the Gini index is an alternative impurity measure)

Information gain = (entropy before split) – (entropy after split.)

Or, = (overall entropy at parent node) – (sum of weighted entropy at each child node)

The attribute with maximum information gain is the best split attribute.

Maximum Information Gain ; Minimum Entropy

//Entropy is the average number of bits required to transmit a randomly selected event from a
probability distribution. It is used to decide where to split.

The entropy formula is used to identify the split whose child nodes have the minimum entropy.
At each node along a path, the tree looks for the split with the lowest weighted entropy.
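
The information gain formula can be checked on a small case. The sketch below uses Shannon entropy in bits, H = -sum(p_i * log2(p_i)), and compares two candidate splits. The yes/no labels and the two splits are made up for illustration, and the function names are hypothetical.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy, in bits, of a list of class labels."""
    total = len(labels)
    return -sum((c / total) * math.log2(c / total) for c in Counter(labels).values())


def information_gain(parent, children):
    """Entropy at the parent minus the size-weighted entropy of the children."""
    total = len(parent)
    weighted = sum(len(child) / total * entropy(child) for child in children)
    return entropy(parent) - weighted


parent = ["yes"] * 5 + ["no"] * 5
split_a = [["yes"] * 5, ["no"] * 5]  # separates the classes perfectly
split_b = [["yes", "yes", "no", "no", "yes"], ["no", "no", "yes", "yes", "no"]]

gain_a = information_gain(parent, split_a)
gain_b = information_gain(parent, split_b)
print(gain_a, gain_b)
```

The split with the larger gain leaves the child nodes purer, so the attribute that produces it would be chosen as the splitting attribute.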

Computational complexity depends on the height of the tree. The taller the tree, the higher
the cost.
Time complexity is denoted by O(H), where O is the Big-O notation and H, the height of the
tree, is its parameter.
Example-

AVL tree is a binary search tree with the additional property that the difference between the
heights of the left subtree and the right subtree of any node is at most 1.

Noisy data – Data which contains errors, such as values of inconsistent types or invalid
attribute values
Overfitting – occurs when the tree is designed so as to perfectly fit all samples in the
training data set, so that it performs poorly on unseen data.

15/2-
Iterative Dichotomiser 3 (ID3) ALGORITHM –
A classification algorithm, used for building a decision tree.

Example-
Example2-
25/3-
0 Aspect – Part of records.
1. Training the network
2. Testing the network
Cross validation –
1. Training {80 records}
2. Testing {20 records}
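
The 80/20 division above could look like the sketch below. The shuffle, the fixed seed and the function name `split_records` are assumptions, and the records are placeholder integers rather than real data.

```python
import random


def split_records(records, test_fraction=0.2, seed=0):
    """Shuffle the records and split them into (train, test)."""
    shuffled = list(records)
    random.Random(seed).shuffle(shuffled)  # fixed seed keeps the split repeatable
    n_test = int(round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


records = list(range(100))  # placeholder records
train, test = split_records(records)
print(len(train), len(test))
```

Shuffling first avoids a test set that only contains the last records in their original order.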

1 Aspect – No of records
1. Training
2. Testing
3. Blind spot
K fold cross validation-

30/3-
Eg. Company X claims to produce 1000 units/day
If the claim holds, we retain the null hypothesis
If not, we accept the alternate hypothesis

95% confidence level (i.e. 5% level of significance)

T-A = E
Target – actual = error
1000 – 950 = 50

Error-
Type 1- Alpha ; the null hypothesis is rejected although it is true ; actual error, based on
the actual production of company X
Type 2- Beta ; the null hypothesis is not rejected although it is false ; expected error,
calculated under an assumption

Questions-
State the null hypothesis for each question.
1. Are teens better at math than adults?
Ans. Age has no effect on mathematical ability.

2. Does taking aspirin every day reduce the chance of having a heart attack?
3. Do teens use cellphones to access the internet more than adults?
4. Do cats care about the color of their food?
5. Does chewing willow bark relieve pain?

K fold cross validation-

The data is divided into subsets S1, S2, …, Sk; the total number of subsets must be equal to
the value of k.
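
A minimal sketch of how the k subsets take turns as the test set is shown below. Each fold here is a contiguous block of indices without shuffling, which is an assumption, and the function name `k_fold_indices` is hypothetical.

```python
def k_fold_indices(n_records, k):
    """Yield (train_indices, test_indices) once for each of the k folds."""
    if not 2 <= k <= n_records:
        raise ValueError("k must be between 2 and the number of records")
    indices = list(range(n_records))
    # Spread any remainder over the first folds so that sizes differ by at most 1.
    fold_sizes = [n_records // k + (1 if i < n_records % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test = indices[start:start + size]
        train = indices[:start] + indices[start + size:]
        yield train, test
        start += size


folds = list(k_fold_indices(100, 5))
for train_idx, test_idx in folds:
    print(len(train_idx), len(test_idx))
```

Every record is used for testing exactly once and for training k - 1 times.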
