Intro MLT 08Jan25

The document provides an introduction to machine learning, explaining its definition, differences from traditional programming, and key terminologies such as model, feature, target, training, and prediction. It outlines types of learning, including supervised and unsupervised learning, along with their respective algorithms and challenges. Additionally, it discusses data characteristics, the knowledge discovery process, and the course objectives and outcomes for a machine learning techniques course.

Uploaded by

vidisha yadav
Copyright: © All Rights Reserved
UEE612 Machine Learning Techniques
An Introduction
Machine Learning
“Machine Learning”: “Field of study that gives computers the capability to learn without being explicitly programmed.”

How is it different from traditional programming?
➢ In traditional programming, we supply the input and the program logic, then run the program to get the output.

➢ In machine learning, we supply the input and the expected output during training, and the machine derives its own logic, which is then evaluated during testing.
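The contrast above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the "is it hot?" task, the temperatures, and the function names are all hypothetical, and the "learning" step is a deliberately simple threshold rule rather than any particular algorithm from the course.

```python
# Traditional programming: the programmer writes the logic by hand.
def is_hot_rule(temp_c: float) -> bool:
    return temp_c > 30.0  # threshold chosen by the programmer (hypothetical)


# Machine learning: the logic (here, a threshold) is derived from
# input/output pairs supplied during training.
def learn_threshold(temps: list[float], labels: list[bool]) -> float:
    """Return the midpoint between the hottest 'not hot' example and the
    coolest 'hot' example. Assumes the two groups do not overlap."""
    hottest_negative = max(t for t, y in zip(temps, labels) if not y)
    coolest_positive = min(t for t, y in zip(temps, labels) if y)
    return (hottest_negative + coolest_positive) / 2.0


# Training data: inputs together with their expected outputs (invented values).
train_temps = [18.0, 22.0, 27.0, 33.0, 36.0, 40.0]
train_labels = [False, False, False, True, True, True]

threshold = learn_threshold(train_temps, train_labels)


def is_hot_learned(temp_c: float) -> bool:
    return temp_c > threshold


# Testing: evaluate the learned logic on examples not used in training.
test_temps = [20.0, 35.0]
test_labels = [False, True]
correct = sum(is_hot_learned(t) == y for t, y in zip(test_temps, test_labels))
print(f"learned threshold = {threshold}, test accuracy = {correct / len(test_temps)}")
```

In the first function the threshold is typed in by a person; in the second it comes out of the training examples, which is the distinction the slide draws.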
Terminologies of Machine Learning:

❑ Model: A model is a specific representation learned from data by applying a machine learning algorithm. A model is also called a hypothesis.

❑ Feature: A feature is an individual measurable property of the data. A set of numeric features can be conveniently described by a feature vector, and feature vectors are fed as input to the model. For example, to predict a fruit, the features might include color, smell, taste, etc.

❑ Target (Label): A target variable, or label, is the value to be predicted by the model. In the fruit example above, the label for each set of inputs would be the name of the fruit, such as apple, orange, or banana.

❑ Training: We give the algorithm a set of inputs (features) and their expected outputs (labels); after training, we have a model (hypothesis) that maps new data to one of the categories it was trained on.

❑ Prediction: Once the model is ready, it can be fed a set of inputs, for which it provides a predicted output (label).
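The five terms can be tied together in a small sketch. It assumes a 1-nearest-neighbour classifier and two invented numeric features (weight in grams, length in centimetres) in place of color, smell, and taste; the feature values and the function names `train` and `predict` are hypothetical.

```python
import math

# Feature vectors: (weight in grams, length in cm). Values are invented.
train_features = [(150.0, 7.0), (160.0, 7.5), (120.0, 18.0), (130.0, 19.0)]
# Targets (labels): the value the model should predict for each vector.
train_labels = ["apple", "apple", "banana", "banana"]


def train(features, labels):
    """Training: for 1-nearest-neighbour, the model simply stores
    the labelled examples it was given."""
    return list(zip(features, labels))


def predict(model, x):
    """Prediction: return the label of the stored feature vector
    closest to x (Euclidean distance)."""
    nearest = min(model, key=lambda pair: math.dist(pair[0], x))
    return nearest[1]


model = train(train_features, train_labels)   # the learned model (hypothesis)
prediction = predict(model, (125.0, 17.0))    # new, unseen input
print(prediction)
```

The stored pairs play the role of the model, each tuple is a feature vector, and the returned fruit name is the predicted label.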
Types of Learning

• Supervised Learning
• Unsupervised Learning
• Semi-Supervised Learning

1. Supervised Learning: In supervised learning, the model is trained on a labelled dataset, that is, a dataset that contains both input and output parameters. In this type of learning, both the training and the validation datasets are labelled, as shown in the figures below.

[Figures: Regression; Classification]
Types of Supervised Learning:
• Classification
• Regression

Classification: A supervised learning task in which the output takes one of a defined set of labels (discrete values).

It can be either binary or multi-class. In binary classification, the model predicts one of two outcomes (0 or 1; yes or no); in multi-class classification, the model chooses among more than two classes.
Example: Gmail sorts mail into several classes, such as social, promotions, updates, and offers.

Regression: A supervised learning task in which the output is a continuous value.

The goal is to predict a value as close as possible to the actual output value; the model is then evaluated by calculating the error. The smaller the error, the more accurate the regression model.
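A minimal sketch of the regression idea, assuming a single input variable, an ordinary least-squares line as the model, and mean squared error (MSE) as the error value. The data points are invented, and MSE is only one of several error measures that could be used.

```python
def fit_line(xs, ys):
    """Least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def mean_squared_error(y_true, y_pred):
    """Average of the squared differences between actual and predicted values."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


# Invented training data: continuous outputs for four inputs.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.1, 4.9, 7.2, 8.8]

slope, intercept = fit_line(xs, ys)
predictions = [slope * x + intercept for x in xs]
mse = mean_squared_error(ys, predictions)

# Baseline for comparison: always predict the mean of the outputs.
mean_y = sum(ys) / len(ys)
baseline_mse = mean_squared_error(ys, [mean_y] * len(ys))

print(f"slope={slope:.2f}, intercept={intercept:.2f}")
print(f"model MSE={mse:.4f}, baseline MSE={baseline_mse:.4f}")
```

Comparing the fitted line against the constant baseline shows what "smaller error" means in practice: the line's MSE is far below that of always predicting the mean.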
Examples of Supervised Learning Algorithms:

❑ Linear Regression
❑ Nearest Neighbor
❑ Gaussian Naive Bayes
❑ Decision Trees
❑ Support Vector Machine (SVM)
❑ Random Forest
Unsupervised Learning:
Unsupervised learning trains a machine on information that is neither classified nor labelled, allowing the algorithm to act on that information without guidance. The machine's task is to group unsorted information according to similarities, patterns, and differences, without any prior training on labelled data. Because there are no labels, unsupervised learning is more challenging than supervised learning.

Types of Unsupervised Learning:

❑ Clustering

❑ Association
Clustering: A clustering problem is where you want to discover the inherent
groupings in the data, such as grouping customers by purchasing behavior.

Association: An association rule learning problem is where you want to discover rules that describe large portions of your data, such as "people who buy X also tend to buy Y".
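How strongly "X implies Y" holds is commonly measured with support and confidence. The sketch below computes both for one rule over a handful of invented shopping baskets; the item names and the helper functions are hypothetical, and this is only the scoring step, not a full rule-mining algorithm.

```python
# Invented transactions: each set is one customer's basket.
transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
    {"bread", "milk", "eggs"},
]


def count_containing(itemset, transactions):
    """Number of transactions that contain every item in itemset."""
    itemset = set(itemset)
    return sum(itemset <= basket for basket in transactions)


def support(itemset, transactions):
    """Fraction of all transactions that contain the itemset."""
    return count_containing(itemset, transactions) / len(transactions)


def confidence(antecedent, consequent, transactions):
    """Of the transactions containing the antecedent, the fraction
    that also contain the consequent."""
    both = count_containing(set(antecedent) | set(consequent), transactions)
    return both / count_containing(antecedent, transactions)


# Rule under test: people who buy bread also tend to buy milk.
rule_support = support({"bread", "milk"}, transactions)
rule_confidence = confidence({"bread"}, {"milk"}, transactions)
print(f"support={rule_support}, confidence={rule_confidence}")
```

Here bread appears in four of the five baskets and bread with milk in three, so the rule has support 3/5 and confidence 3/4.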

Examples of unsupervised learning algorithms are:

❑ k-means for clustering problems.
❑ Apriori algorithm for association rule learning problems.
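As an illustration of clustering, here is a minimal k-means sketch in plain Python. It makes simplifying assumptions that a production implementation would not: the first k points are used as the initial centroids, and the customer data (visits per month, average spend) is invented.

```python
import math


def kmeans(points, k, iterations=10):
    """Basic k-means: alternate between assigning each point to its
    nearest centroid and moving each centroid to the mean of its points."""
    # Simplification: take the first k points as initial centroids.
    centroids = [points[i] for i in range(k)]
    for _ in range(iterations):
        # Assignment step: group points by nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: math.dist(p, centroids[i]))
            clusters[nearest].append(p)
        # Update step: recompute each centroid as the mean of its cluster.
        new_centroids = []
        for i, cluster in enumerate(clusters):
            if cluster:
                new_centroids.append(
                    tuple(sum(coord) / len(cluster) for coord in zip(*cluster))
                )
            else:
                new_centroids.append(centroids[i])  # keep an empty cluster's centroid
        if new_centroids == centroids:  # converged: nothing moved
            break
        centroids = new_centroids
    return centroids, clusters


# Invented customers: (visits per month, average spend).
customers = [
    (1.0, 2.0), (8.0, 9.0), (1.5, 1.8),
    (9.0, 8.5), (1.2, 2.2), (8.5, 9.5),
]

centroids, clusters = kmeans(customers, k=2)
for centroid, cluster in zip(centroids, clusters):
    print(centroid, cluster)
```

No labels are supplied anywhere: the two customer groups emerge only from the distances between points.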

The most basic disadvantage of any supervised learning algorithm is that the dataset has to be hand-labelled, typically by a machine learning engineer or a data scientist. This is a very costly process, especially when dealing with large volumes of data. The most basic disadvantage of unsupervised learning is that its range of applications is limited.
What is Data
“Data is the new oil. It's valuable, but if unrefined it cannot
really be used. It has to be changed into gas, plastic,
chemicals, etc to create a valuable entity that drives profitable
activity; so must data be broken down, analyzed for it to have
value.” — Clive Humby, 2006.
What is Data ?
Data is distinct information, usually formatted and stored in a way
that is suited for a specific purpose. It can be a collection of :

• Facts,
• Measurements,
• Observations or
• Descriptions of things.
What is Data?
Collection of records and their attributes
• An attribute is a characteristic of an object
• A collection of attributes describes an object
• In the table below, columns are attributes and rows are objects

Tid  Refund  Marital Status  Taxable Income  Cheat
1    Yes     Single          125K            No
2    No      Married         100K            No
3    No      Single          70K             No
4    Yes     Married         120K            No
5    No      Divorced        95K             Yes
6    No      Married         60K             No
7    Yes     Divorced        220K            No
8    No      Single          85K             Yes
9    No      Married         75K             No
10   No      Single          90K             Yes
Qualitative vs Quantitative Data
• Qualitative : descriptive information
• Quantitative : numerical information

Quantitative

• Discrete

• Continuous

Discrete data is counted, Continuous data is measured
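A short sketch of how these data types might look in code, using three rows from the tax table shown earlier (Tid 1, 5, and 8). The class name, field names, and the choice to store income in thousands are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class TaxRecord:
    tid: int                  # record identifier
    refund: str               # qualitative (descriptive)
    marital_status: str       # qualitative (descriptive)
    taxable_income_k: float   # quantitative, continuous (measured), in thousands
    cheat: str                # qualitative (descriptive)


# Each record is one object; each field is one attribute.
records = [
    TaxRecord(1, "Yes", "Single", 125.0, "No"),
    TaxRecord(5, "No", "Divorced", 95.0, "Yes"),
    TaxRecord(8, "No", "Single", 85.0, "Yes"),
]

# Discrete data is counted: how many records have marital status "Single"?
single_count = sum(r.marital_status == "Single" for r in records)

# Continuous data is measured: it can take any value in a range and be averaged.
mean_income_k = sum(r.taxable_income_k for r in records) / len(records)

print(single_count, round(mean_income_k, 2))
```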


What is Data?
In the present communication age, data commonly refers to information that is transmitted or stored.
• All data can be human-readable, machine-readable, or both.
• Examples: tables, text, images, graphs, web

The objective of data analysis is to find information that is valid, novel, potentially useful, and understandable.


Knowledge Discovery in Data: Process
[Figure: Data -> Data Mining -> Patterns -> Interpretation/Evaluation -> Knowledge]
Structured and Unstructured Data

Structured data: Any data that resides in a fixed field within a record or file.
Ex: Employee Record, Student Record

Unstructured data: Information that does not reside in a traditional column-row database, unlike structured data.
Ex: Email, Review, Essay, etc.
Knowledge Discovery in Data: Challenges
Challenges to Data Analytics

Volume
- Big Data
- Small Data

Velocity
- Data Stream
- Static

Variety
- Transaction
- Temporal
- Spatial
Data Sources
Data comes from everywhere:
• Grocery Markets
• E-Commerce
• Stock Exchange
• Hospital
• Weather Station
• Social Media

But it comes in different forms.
Outline (Part 1)
Introduction to Data
• Transactional Data
• Temporal Data
• Spatial & Spatial-Temporal Data

Data Preprocessing
• Missing Values
• Summarization
UEE612: MACHINE LEARNING TECHNIQUES

L T P Cr.
3 0 2 4.0

Course Objective: To understand the need for machine learning and its latest trends, and to design appropriate machine learning
algorithms for problem solving.
Introduction: Definition of learning systems, machine learning, training data, concept
representation, function approximation for learning systems; Objective functions for classification,
regression, and ranking.
Concept of Optimization: Convex functions, gradients and sub-gradients, Unconstrained smooth
convex minimization, gradient descent, Constrained optimization, Stochastic gradient descent.
Regression and Supervised Learning: Linear regression and LMS algorithm, Perceptron and
logistic regression, Nonlinear function estimation, Multilayer perceptron and backpropagation,
recurrent networks, Generalization, Underfitting, overfitting, Cross-validation, Regularization,
mixture of Gaussians.
Support Vector Machines: Maximum margin linear separators, solution approach to finding
maximum margin separators, Radial basis function network, Kernels for learning non-linear
functions, support vector regression.
Decision Tree Learning: Representing concepts as decision trees, Recursive induction, splitting
attributes, simple trees and computational complexity, Overfitting, noisy data, and pruning.
Bayesian Learning: Probability and Bayes rule, Naive Bayes learning algorithm, Parameter
smoothing, Generative vs. discriminative training, Logistic regression, Bayes nets and Markov nets
for representing dependencies.
Clustering and Unsupervised Learning: Learning from unclassified data. Clustering. k-means
partitional clustering, Fuzzy C-means, Expectation maximization (EM) for soft clustering, Gaussian
Mixture Model.
Dimension Reduction Techniques: Feature selection, Principal Component Analysis (PCA),
Linear Discriminant Analysis (LDA).
Applications to Power System: Power system applications including, but not restricted to, energy
pricing estimation, energy meter analytics, renewable generation forecasting, load profile and
consumer classification, Controller design for ALFC, Filter design.
Laboratory work: The laboratory work includes supervised learning algorithms, linear regression,
logistic regression, decision trees, k-nearest neighbor, Bayesian learning and the naïve Bayes
algorithm, support vector machines and kernels, neural networks with an introduction to Deep
Learning, and basic clustering algorithms.
Course Learning Outcomes (CLO):
After the completion of the course the students will be able to:
1. Demonstrate the concept of optimization for various learning functions
2. Analyze the complexity of machine learning algorithms and their limitations
3. Realize learning algorithms as neural computing machines
4. Demonstrate the ability to evaluate and compare learning models and learning algorithms
5. Realize algorithms on power system problems.

Approved in 102nd meeting of the Senate held on November 27, 2020

Text Books:
1. Mitchell T.M., Machine Learning, McGraw Hill (1997).
2. Alpaydin E., Introduction to Machine Learning, MIT Press (2010).
Reference Books:
1. Bishop C., Pattern Recognition and Machine Learning, Springer-Verlag (2006).
2. Michie D., Spiegelhalter D. J., Taylor C. C., Machine Learning, Neural and Statistical
Classification, Overseas Press (2009).

Evaluation Scheme:
S. No.  Evaluation Elements                                                  Weightage (%)
1.      MST                                                                  25
2.      EST                                                                  45
3.      Sessional (Assignments/Projects/Tutorials/Quizzes/Lab Evaluations)   30

