0% found this document useful (0 votes)

2 views21 pages

Intro MLT 08Jan25

The document provides an introduction to machine learning, explaining its definition, differences from traditional programming, and key terminologies such as model, feature, target, training, and prediction. It outlines types of learning, including supervised and unsupervised learning, along with their respective algorithms and challenges. Additionally, it discusses data characteristics, the knowledge discovery process, and the course objectives and outcomes for a machine learning techniques course.

Uploaded by

vidisha yadav

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views21 pages

Intro MLT 08Jan25

Uploaded by

vidisha yadav

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 21

UEE612 Machine Learning

Techniques
An Introduction
Machine Learning
“Machine Learning” – “Field of study that gives computers the capability to learn
without being explicitly programmed”.

How it is different from traditional

Programming:
➢ In Traditional Programming, we feed the Input,
Program logic and run the program to get
output.

➢ In Machine Learning, we feed the input, output

and run it on machine during training and the
machine creates its own logic, which is being
evaluated while testing.
Terminologies of Machine Learning:

❑ Model: A model is a specific representation learned from data by applying

some machine learning algorithm. A model is also called hypothesis.

❑ Feature: A feature is an individual measurable property of our data. A set of

numeric features can be conveniently described by a feature vector. Feature
vectors are fed as input to the model. For example, in order to predict a fruit,
there may be features like color, smell, taste, etc.

❑ Target(Label): A target variable or label is the value to be predicted by our

model. For the fruit example discussed in the features section, the label with
each set of input would be the name of the fruit like apple, orange, banana,
etc.

❑ Training: The idea is to give a set of inputs(features) and it’s expected

outputs(labels), so after training, we will have a model (hypothesis) that will
then map new data to one of the categories trained on.

❑ Prediction: Once our model is ready, it can be fed a set of inputs to which it will
provide a predicted output(label).
Types of Learning

• Supervised Learning
• Unsupervised Learning
• Semi-Supervised Learning

1. Supervised Learning: Supervised learning is when the model is getting trained on a

labelled dataset. Labelled dataset is one which have both input and output
parameters. In this type of learning both training and validation datasets are
labelled as shown in the figures below.

Regression
Classification
Types of Supervised Learning:
• Classification
• Regression

Classification : It is a Supervised Learning task where output is having defined

labels(discrete value).

It can be either binary or multi class classification. In binary classification,

model predicts either 0 or 1 ; yes or no but in case of multi class classification,
model predicts more than one class.
Example: Gmail classifies mails in more than one classes like social,
promotions, updates, offers.

Regression : It is a Supervised Learning task where output is having continuous

value.

The goal here is to predict a value as much closer to actual output value as
our model can and then evaluation is done by calculating error value. The
smaller the error the greater the accuracy of our regression model.
Example of Supervised Learning Algorithms:

❑ Linear Regression

❑ Nearest Neighbor

❑ Gaussian Naive Bayes

❑ Decision Trees

❑ Support Vector Machine (SVM)

❑ Random Forest
Unsupervised Learning:
Unsupervised learning is the training of machine using information that is
neither classified nor labeled and allowing the algorithm to act on that
information without guidance. Here the task of machine is to group
unsorted information according to similarities, patterns and differences
without any prior training of data. Unsupervised machine learning is more
challenging than supervised learning due to the absence of labels.

Types of Unsupervised Learning:

❑ Clustering

❑ Association
Clustering: A clustering problem is where you want to discover the inherent
groupings in the data, such as grouping customers by purchasing behavior.

Association: An association rule learning problem is where you want to

discover rules that describe large portions of your data, such as people that
buy X also tend to buy Y.

Examples of unsupervised learning algorithms are:

❑ k-means for clustering problems.

❑ Apriori algorithm for association rule learning problems

The most basic disadvantage of any Supervised Learning algorithm is that

the dataset has to be hand-labeled either by a Machine Learning Engineer
or a Data Scientist. This is a very costly process, especially when dealing with
large volumes of data. The most basic disadvantage of any Unsupervised
Learning is that it’s application spectrum is limited.
What is Data
“Data is the new oil. It's valuable, but if unrefined it cannot
really be used. It has to be changed into gas, plastic,
chemicals, etc to create a valuable entity that drives profitable
activity; so must data be broken down, analyzed for it to have
value.” — Clive Humby, 2006.
What is Data ?
Data is distinct information, usually formatted and stored in a way
that is suited for a specific purpose. It can be a collection of :

• Facts,
• Measurements,
• Observations or
• Descriptions of things.
What is Data?
What is Data ?
Attributes

Collection of records and their Tid Refund Marital Taxable

Status Income Cheat
attributes
1 Yes Single 125K No
2 No Married 100K No
3 No Single 70K No
An attribute is a characteristic of 4 Yes Married 120K No
an object 5 No Divorced 95K Yes
Objects 6 No Married 60K No
7 Yes Divorced 220K No
8 No Single 85K Yes
A collection of attributes describe
9 No Married 75K No
an object
10 No Single 90K Yes
10
Qualitative vs Quantitative Data
• Qualitative : descriptive information
• Quantitative : numerical information

Quantitative

• Discrete

• Continuous

Discrete data is counted, Continuous data is measured

What is Data ?
In present communication age, data is commonly refers to information
that is transmitted or stored.
• All data can be human-readable machine-readable, or both.
• Tables, Text, images, graphs, web

Objective of data analysis is to find information that is

valid, novel, potentially useful, understandable

Knowledge Discovery in Data: Process
Knowledge Discovery through Data

Data Mining Interpretation/

Evaluation

Knowledge
Patterns
Data
Structured and Unstructured Data

Structured data : Any data that resides in a fixed field

within a record or file.
Ex : Employee Record, Student Record

Unstructured data: Information that does not reside in

a traditional column-row database like structured data.
Ex : Email, Review, Essay etc.
Knowledge
Challenges toDiscovery in Data: Challenges
Data Analytics
Volume
- Big Data
- Small Data

Data
Variety
Velocity - Transaction
- Data Stream - Temporal
- Static - Spatial
…
5
Data Sources
Data Come from Everywhere

Grocery Markets E-Commerce Stock Exchange

But, they have different form

Hospital Weather Station 8

Social Media
Outline (Part 1)
Introduction to Data
Introduction to Data
Transactional Data
Temporal Data
Spatial & Spatial-Temporal Data

Data Preprocessing
Missing Values
Summarization
What is Data?
Structured Data
Attributes

Collection of records and their Tid Refund Marital Taxable

L T P Cr.
3 0 2 4.0

Course Objective: To understand the need, latest trends and design appropriate machine learning
algorithms for problem solving
Introduction Definition of learning systems, machine learning, training data, concept
representation, function approximation for learning system; Objective functions for classification,
regression, and ranking.
Concept of Optimization: Convex function, gradients and sub-gradients, Unconstrained smooth
convex minimization, gradient descent, Constrained optimization, Stochastic gradient descent
Regression and Supervised learning Linear regression and LMS algorithm, Perceptron and
logistic regression, Nonlinear function estimation, Multilayer perceptron and backpropagation,
recurrent networks, Generalization, Underfitting, overfitting, Cross-validation, Regularization,
mixture of Gaussians
Support Vector Machines: Maximum margin linear separators, solution approach to finding
maximum margin separators, Radial basis function network, Kernels for learning non-linear
functions, support vector regression
Decision Tree Learning: Representing concepts as decision trees, Recursive induction, splitting
attributes, simple trees and computational complexity, Overfitting, noisy data, and pruning.
Bayesian Learning: Probability and Bayes rule, Naive Bayes learning algorithm, Parameter
smoothing, Generative vs. discriminative training, Logisitic regression, Bayes nets and Markov nets
for representing dependencies.
Clustering and Unsupervised Learning: Learning from unclassified data. Clustering. k-means
partitional clustering, Fuzzy C-means, Expectation maximization (EM) for soft clustering, Gaussian
Mixture Model
Dimension Reduction Techniques: Feature selection, Principle Component Analysis (PCA),
Linear Discriminant Analysis (LDA)
Applications to Power System: Some of the Power System applications but not restricted to energy
pricing estimation, energy meter analytics, renewable generation forecasting, load profile and
consumer classification, Controller design for ALFC, Filter design.
Laboratory work: The laboratory work includes supervised learning algorithms, linear regression,
logistic regression, decision trees, k-nearest neighbor, Bayesian learning and the naïve Bayes
algorithm, support vector machines and kernels and neural networks with an introduction to Deep
Learning and basic clustering algorithms.
Course Learning Outcomes (CLO):
After the completion of the course the students will be able to:
1. Demonstrate the concept of optimization for various learning functions
2. Analyze the complexity of machine learning algorithms and their limitations
3. Realize learning algorithms as neural computing machine

Approved in 102nd meeting of the Senate held on November 27, 2020

4. Demonstrate the ability to evaluate and compare learning models and learning algorithms
5. Realize algorithms on power system problems.

Text Books:
1. Mitchell T.M., Machine Learning, McGraw Hill(1997).
2. Alpaydin E., Introduction to Machine Learning, MIT Press(2010).
Reference Books:
1. Bishop C., Pattern Recognition and Machine Learning, Springer-Verlag(2006).
2. Michie D., Spiegelhalter D. J., Taylor C. C., Machine Learning, Neural and Statistical
Classification. Overseas Press (2009).

Evaluation Scheme:
S.
Evaluation Elements Weightage (%)
No.
1. MST 25
2. EST 45
3. Sessional (Assignments/Projects/Tutorials/Quizzes/Lab 30
Evaluations)

Approved in 102nd meeting of the Senate held on November 27, 2020

Research Proposal Final
100% (2)
Research Proposal Final
53 pages
Chapters 1 3 Thesis Sample
100% (7)
Chapters 1 3 Thesis Sample
20 pages
Pre-calculus Demystified, Second Edition
From Everand
Pre-calculus Demystified, Second Edition
Rhonda Huettenmueller
3/5 (5)
Why Supporters Contribute To Reward-Based Crowdfunding: Ijebr 23,2
No ratings yet
Why Supporters Contribute To Reward-Based Crowdfunding: Ijebr 23,2
18 pages
Chapter 01 Introduction to ML
No ratings yet
Chapter 01 Introduction to ML
178 pages
DS&ML 1
No ratings yet
DS&ML 1
9 pages
AIML Unit 2 Introduction To Machine Learning
No ratings yet
AIML Unit 2 Introduction To Machine Learning
32 pages
Module2 ML 22 01 2024 WM
No ratings yet
Module2 ML 22 01 2024 WM
42 pages
Artificial Intelligence and Machine Learning: Subject Code: 21CS54 by Savitha Nagaraju Aiml Dept, Atme
No ratings yet
Artificial Intelligence and Machine Learning: Subject Code: 21CS54 by Savitha Nagaraju Aiml Dept, Atme
80 pages
ML
No ratings yet
ML
12 pages
Data Science Activity
No ratings yet
Data Science Activity
11 pages
MachineLearning Jan2nd
100% (2)
MachineLearning Jan2nd
171 pages
Unit5_ML_introduction
No ratings yet
Unit5_ML_introduction
32 pages
TTDS Lectures
No ratings yet
TTDS Lectures
13 pages
Unit 3
No ratings yet
Unit 3
13 pages
Module2 ch2
No ratings yet
Module2 ch2
36 pages
Data Science Activity
No ratings yet
Data Science Activity
12 pages
Unit 3
No ratings yet
Unit 3
33 pages
DOC-20241106-WA0007
No ratings yet
DOC-20241106-WA0007
48 pages
ML Notes All
No ratings yet
ML Notes All
257 pages
Screenshot 2025-01-03 at 8.05.30 PM
No ratings yet
Screenshot 2025-01-03 at 8.05.30 PM
20 pages
Understanding Data Mining
No ratings yet
Understanding Data Mining
21 pages
Week 12 Intro to DS and ML
No ratings yet
Week 12 Intro to DS and ML
67 pages
Lect3 Machine Learning
No ratings yet
Lect3 Machine Learning
27 pages
ML L1 PDF
No ratings yet
ML L1 PDF
43 pages
2.0 Machine Learning Introduction
No ratings yet
2.0 Machine Learning Introduction
24 pages
Week 4 - Intro to ML
No ratings yet
Week 4 - Intro to ML
37 pages
Supervised and Unsupervised Machine Learning
No ratings yet
Supervised and Unsupervised Machine Learning
3 pages
ML Notes
No ratings yet
ML Notes
7 pages
Data_in_machine_learning
No ratings yet
Data_in_machine_learning
7 pages
Unit 3 and Unit 4 Notes - Data Science - III BCA 2
No ratings yet
Unit 3 and Unit 4 Notes - Data Science - III BCA 2
27 pages
AIYA SESSION 4
No ratings yet
AIYA SESSION 4
42 pages
Data Science_ppt
No ratings yet
Data Science_ppt
45 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
5 pages
ida unit-4
No ratings yet
ida unit-4
19 pages
DS - NLP
No ratings yet
DS - NLP
39 pages
4.1 Machine Learning Basics
No ratings yet
4.1 Machine Learning Basics
26 pages
Unit-3-ML
No ratings yet
Unit-3-ML
119 pages
23ECE205 FoDS 13 Introduction To ML
No ratings yet
23ECE205 FoDS 13 Introduction To ML
41 pages
UNIT 1 - Introduction (Types of Machine Learning)
100% (1)
UNIT 1 - Introduction (Types of Machine Learning)
21 pages
Supervised learning
No ratings yet
Supervised learning
19 pages
1 ML M1503-Introduction - ABP
No ratings yet
1 ML M1503-Introduction - ABP
14 pages
Unit-4 AML (1. Basics and K-NN)
No ratings yet
Unit-4 AML (1. Basics and K-NN)
25 pages
Introducti0n (MLT)
No ratings yet
Introducti0n (MLT)
39 pages
Day 2. Lecture - Machinelearning
No ratings yet
Day 2. Lecture - Machinelearning
32 pages
Unit 2 – Advance Concepts of Modelling in AI
No ratings yet
Unit 2 – Advance Concepts of Modelling in AI
12 pages
Big-Data Unit-3
100% (1)
Big-Data Unit-3
54 pages
AI Unit-4
No ratings yet
AI Unit-4
58 pages
Machine Learning and Deep Learning Supervised Learning 1682688720
No ratings yet
Machine Learning and Deep Learning Supervised Learning 1682688720
121 pages
Unit 3 - DS - 1st year
No ratings yet
Unit 3 - DS - 1st year
5 pages
AI Unit4 Learning Dd83e0ee 7d19 48c7 Bc5d b39decf3b0fc
No ratings yet
AI Unit4 Learning Dd83e0ee 7d19 48c7 Bc5d b39decf3b0fc
19 pages
Decision Trees. These Models Use Observations About Certain
No ratings yet
Decision Trees. These Models Use Observations About Certain
6 pages
Machine Learning
No ratings yet
Machine Learning
57 pages
Tesla Stock Marketing Price Prediction
No ratings yet
Tesla Stock Marketing Price Prediction
62 pages
Machine Learning
No ratings yet
Machine Learning
51 pages
ds unit 2
No ratings yet
ds unit 2
36 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
24 pages
UNIT-1 (Preparing To Model)
No ratings yet
UNIT-1 (Preparing To Model)
82 pages
Unit-1
No ratings yet
Unit-1
24 pages
4.0 Introduction to Data
No ratings yet
4.0 Introduction to Data
16 pages
Unit 1 Machine Learning
No ratings yet
Unit 1 Machine Learning
68 pages
AI LM
No ratings yet
AI LM
72 pages
Introduction to Robotics
From Everand
Introduction to Robotics
Swarnalata Verma
No ratings yet
Template - IJORERNew2023 Fix
No ratings yet
Template - IJORERNew2023 Fix
6 pages
Analyzing VAERS Data for Vaccine Safety
No ratings yet
Analyzing VAERS Data for Vaccine Safety
17 pages
Majorship Area: English Focus: Language and Literature Research LET Competencies
No ratings yet
Majorship Area: English Focus: Language and Literature Research LET Competencies
13 pages
AI Marketing White Paper
No ratings yet
AI Marketing White Paper
15 pages
QM 201 - Quantitative Methods: Flexible Learning Syllabus
100% (1)
QM 201 - Quantitative Methods: Flexible Learning Syllabus
7 pages
Business Analytics
No ratings yet
Business Analytics
4 pages
Mis Unit 4
No ratings yet
Mis Unit 4
10 pages
Biological Reviews - 2007 - Nakagawa - Effect Size Confidence Interval and Statistical Significance A Practical Guide For
No ratings yet
Biological Reviews - 2007 - Nakagawa - Effect Size Confidence Interval and Statistical Significance A Practical Guide For
15 pages
Data Analytics Trend Report Guide PDF
No ratings yet
Data Analytics Trend Report Guide PDF
12 pages
Planning Data Analysis
No ratings yet
Planning Data Analysis
11 pages
Public Sector Reforms in Fiji
No ratings yet
Public Sector Reforms in Fiji
185 pages
Effect_of_work_scheduling_on_employee_pe
No ratings yet
Effect_of_work_scheduling_on_employee_pe
7 pages
Placemaking
No ratings yet
Placemaking
272 pages
Sample Paper For The Machine Learning Course Ajay Sharma
No ratings yet
Sample Paper For The Machine Learning Course Ajay Sharma
19 pages
His Is A Full Class Attendance at The First Day of Class Is Mandatory
100% (1)
His Is A Full Class Attendance at The First Day of Class Is Mandatory
11 pages
Pam Clustering Technique: Bachelor of Technology Computer Science and Engineering
No ratings yet
Pam Clustering Technique: Bachelor of Technology Computer Science and Engineering
12 pages
DAX Functions - Time Intelligence Functions
No ratings yet
DAX Functions - Time Intelligence Functions
7 pages
Multiple Regression Analysis
No ratings yet
Multiple Regression Analysis
61 pages
Log Dump SPSS1
No ratings yet
Log Dump SPSS1
6 pages
Multiple Regression With Serial
No ratings yet
Multiple Regression With Serial
15 pages
les6e_ppt_02_04
No ratings yet
les6e_ppt_02_04
42 pages
Int (2)
No ratings yet
Int (2)
15 pages
Machine learning models for geospatial data
No ratings yet
Machine learning models for geospatial data
54 pages
ILS0044 24 Isi Artikel
No ratings yet
ILS0044 24 Isi Artikel
30 pages
COIS13013 Business Intelligence: Term 1 - 2022
No ratings yet
COIS13013 Business Intelligence: Term 1 - 2022
11 pages
Data Analysis
No ratings yet
Data Analysis
28 pages
A Project Report On "Online Trading" AT: Emkey Global Financial
No ratings yet
A Project Report On "Online Trading" AT: Emkey Global Financial
42 pages

Intro MLT 08Jan25

Uploaded by

Intro MLT 08Jan25

Uploaded by

UEE612 Machine Learning

How it is different from traditional

➢ In Machine Learning, we feed the input, output

❑ Model: A model is a specific representation learned from data by applying

❑ Feature: A feature is an individual measurable property of our data. A set of

❑ Target(Label): A target variable or label is the value to be predicted by our

❑ Training: The idea is to give a set of inputs(features) and it’s expected

1. Supervised Learning: Supervised learning is when the model is getting trained on a

Classification : It is a Supervised Learning task where output is having defined

It can be either binary or multi class classification. In binary classification,

Regression : It is a Supervised Learning task where output is having continuous

❑ Gaussian Naive Bayes

❑ Support Vector Machine (SVM)

Types of Unsupervised Learning:

Association: An association rule learning problem is where you want to

Examples of unsupervised learning algorithms are:

❑ k-means for clustering problems.

The most basic disadvantage of any Supervised Learning algorithm is that

Collection of records and their Tid Refund Marital Taxable

Discrete data is counted, Continuous data is measured

Objective of data analysis is to find information that is

valid, novel, potentially useful, understandable

Data Mining Interpretation/

Structured data : Any data that resides in a fixed field

Unstructured data: Information that does not reside in

Grocery Markets E-Commerce Stock Exchange

Hospital Weather Station 8

Collection of records and their Tid Refund Marital Taxable

Approved in 102nd meeting of the Senate held on November 27, 2020

Approved in 102nd meeting of the Senate held on November 27, 2020

You might also like