0% found this document useful (0 votes)

10 views

Predictive Modeling BI 4

Uploaded by

ikher.shivin

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

10 views

Predictive Modeling BI 4

Uploaded by

ikher.shivin

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 28

Techniques for Predictive Modeling-Learning Objectives

• Understand the concept and definitions of artificial

neural networks (ANN)
• Learn the different types of ANN architectures
• Know how learning happens in ANN
• Become familiar with ANN applications
• Understand the sensitivity analysis in ANN
• Understand the concept and structure of support
vector machines (SVM)
• Comparison

Appln: Predictive Modeling Helps Better Understand and Manage Complex

Medical Procedures, …
A Process Map for
Training and
Testing Four
Predictive Models
The Comparison of Four Models –An example
Neural Network Concepts
• Neural networks (NN): a brain metaphor for
information processing
• Neural computing
• Artificial neural network (ANN)
• Many uses for ANN for
– pattern recognition, forecasting, prediction, and
classification
• Many application areas
– finance, marketing, manufacturing, operations,
information systems, and so on
Application Case 6.1
Neural Networks Are Helping to Save
Lives in the Mining Industry

Questions for
Discussion
1. How did neural networks help save lives in the mining
industry?
2. What were the challenges, the proposed solution, and the
obtained results?
Elements of ANN

• Processing element (PE)

• Network architecture
– Hidden layers
– Parallel processing
• Network information processing
– Inputs
– Outputs
– Connection weights
– Summation function
Elements of ANN

x1 (PE)

x2 Weighted Transfer
(PE) Sum Function
Y1
x3 (S) (f)

(PE)

(PE) (PE)

Output
(PE)
Layer

Hidden
(PE)
Layer Neural Network with
Input
One Hidden Layer
Layer
Elements of ANN

(a) Single neuron (b) Multiple neurons

x1 x1 w11 (PE) Y1
w1
w21
(PE) Y

w1 w12
x2 Y = X 1W1 + X 2W2
x2 w22 (PE) Y2
PE: Processing Element (or neuron)

Y1 = X1W11 + X 2W21
Summation Function for a Single w23
Y2 = X1W12 + X2W22
Neuron (a), and
Y3 = X 2W 23 (PE) Y3
Several Neurons (b)
Elements of ANN
• Transformation (Transfer) Function
– Linear function
– Sigmoid (logical activation) function [0 1]
– Tangent Hyperbolic function [-1 1]

Summation function: Y = 3(0.2) + 1(0.4) + 2(0.1) = 1.2

X1 = 3 Transfer function: YT = 1/(1 + e-1.2) = 0.77
W
1 =0
.2

W2 = 0.4 Processing Y = 1.2

X2 = 1 YT = 0.77
element (PE)
.1
3
=0
W

X3 = 2
❖ Threshold value?
Neural Network Architectures
• Architecture of a neural network is driven by the
task it is intended to address
– Classification, regression, clustering, general
optimization, association, ….
• Most popular architecture: Feedforward, multi-
layered perceptron with backpropagation learning
algorithm
– Used for both classification and regression type
problems
• Others – Recurrent, self-organizing feature maps,
Hopfield networks, …
Neural Network Architectures
Feed-Forward Neural Networks

Feed-forward MLP with 1 Hidden Layer

Socio-demographic
Predicted
= vs. Actual
Religious
Voted “yes” or
“no” to legalizing
Financial gaming

. .
. .
. .
Other

INPUT HIDDEN OUTPUT

LAYER LAYER LAYER
Neural Network Architectures
Recurrent Neural Networks
Testing a Trained ANN Model

• Data is split into three parts

– Training (~60%)
– Validation (~20%)
– Testing (~20%)

• k-fold cross validation

– Less bias
– Time consuming
AN Learning Process
A Supervised Learning Process
ANN
Model
Three-step process:
1. Compute temporary
Compute
output
outputs.
2. Compare outputs with
desired targets.
3. Adjust the weights and
Is desired
Adjust
weights
No
output repeat the process.
achieved?

Yes

Stop
learning
Support Vector Machines (SVM)

• SVM are among the most popular machine-learning techniques.

• SVM belong to the family of generalized linear models… (capable of
representing non-linear relationships in a linear fashion).
• SVM achieve a classification or regression decision based on the value of
the linear combination of input features.
• Because of their architectural similarities, SVM are also closely
associated with ANN.
Support Vector Machines (SVM)

• Goal of SVM: to generate mathematical functions that map input

variables to desired outputs for classification or regression type
prediction problems.
– First, SVM uses nonlinear kernel functions to transform non-linear relationships
among the variables into linearly separable feature spaces.
– Then, the maximum-margin hyperplanes are constructed to optimally separate
different classes from each other based on the training dataset.
• SVM has solid mathematical foundation!
Support Vector Machines (SVM)

• A hyperplane is a geometric concept used to describe the separation

surface between different classes of things.
– In SVM, two parallel hyperplanes are constructed on each side of the separation
space with the aim of maximizing the distance between them.
• A kernel function in SVM uses the kernel trick (a method for using a
linear classifier algorithm to solve a nonlinear problem)
– The most commonly used kernel function is the radial basis function (RBF).
Support Vector Machines (SVM)

L1
M
X2 X2 ar
gi
L2 n

e
an
L3

l
rp
pe
hy
n
gi
ar
-m
um
im
ax
M
X1 X1

➢ Many linear classifiers (hyperplanes) may separate the data

Application Case 6.4

Managing Student Retention with

Predictive Modeling
Questions for Discussion
1. Why is attrition one of the most important issues in
higher education?
2. How can predictive analytics (ANN, SVM, and so forth)
be used to better manage student retention?
3. What are the main challenges and potential solutions to
the use of analytics in retention management?
How Does an SVM Work?

• Following a machine-learning process, an SVM learns from the historic

cases.
• The Process of Building SVM
1. Preprocess the data
• Scrub and transform the data.
2. Develop the model.
• Select the kernel type (RBF is often a natural choice).
• Determine the kernel parameters for the selected kernel type.
• If the results are satisfactory, finalize the model; otherwise change the kernel type and/or kernel
parameters to achieve the desired accuracy level.
3. Extract and deploy the model.
The Process of Building an SVM
Pre-Process the Data
Training
ü Scrub the data
data
“Identify and handle missing,
incorrect, and noisy”
ü Transform the data
“Numerisize, normalize and
standardize the data”

Pre-processed data

Develop the Model

Experimentation
ü Select the kernel type “Training/Testing”
“Choose from RBF, Sigmoid
or Polynomial kernel types”
ü Determine the kernel values
“Use v-fold cross validation or
employ ‘grid-search’”

Validated SVM model

Deploy the Model

Prediction
ü Extract the model coefficients Model
ü Code the trained model into
the decision support system
ü Monitor and maintain the
model
SVM Applications
• SVMs are the most widely used kernel-learning
algorithms for wide range of classification and
regression problems
• SVMs represent the state-of-the-art by virtue of their
excellent generalization performance, superior
prediction power, ease of use, and rigorous theoretical
foundation
• Most comparative studies show its superiority in both
regression and classification type prediction problems.
• SVM versus ANN?
k-Nearest Neighbor Method (k-NN)

• ANNs and SVMs → time-demanding, computationally intensive iterative

derivations
• k-NN is a simplistic and logical prediction method, that produces very
competitive results
• k-NN is a prediction method for classification as well as regression types
(similar to ANN & SVM)
• k-NN is a type of instance-based learning – most of the work takes place
at the time of prediction (not at modeling)
• k : the number of neighbors used
k-Nearest Neighbor Method (k-NN)
Y

k=3

k=5
Yi

The answer depends on

the value of k

Xi X
The Process of k-NN Method

Training Set
Parameter Setting

Historic Data ü Distance measure

ü Value of “k”

Validation Set

Predicting
Classify (or Forecast)
new cases using k
number of most
similar cases

New Data
k-NN Model Parameter
1. Similarity Measure: The Distance Metric

– Numeric versus nominal values?

k-NN Model Parameter
2. Number of Neighbors (the value of k)
– The best value depends on the data
– Larger values reduce the effect of noise but also
make boundaries between classes less distinct
– An “optimal” value can be found heuristically
• Cross Validation is often used to determine the best
value for k and the distance measure
Application Case 6.5

Efficient Image Recognition and

Categorization with kNN

Questions for Discussion

1. Why is image recognition/classification a worthy
but difficult problem?
2. How can k-NN be effectively used for image
recognition/classification applications?

Impact of Business Analytics and Enterprise Systems On Managerial
No ratings yet
Impact of Business Analytics and Enterprise Systems On Managerial
16 pages
Busiess Analytics Data Mining Lecture 7
No ratings yet
Busiess Analytics Data Mining Lecture 7
37 pages
Lecture 4-Machine Learning Applications
No ratings yet
Lecture 4-Machine Learning Applications
52 pages
Chapter 05 - In Class
No ratings yet
Chapter 05 - In Class
39 pages
Naive Bayes Algorithm
No ratings yet
Naive Bayes Algorithm
3 pages
Chapter 05 - Sharda 11e Full Accessible PPT 05
No ratings yet
Chapter 05 - Sharda 11e Full Accessible PPT 05
31 pages
Lecture_3_Machine_learning_Techniques_For_Predictive_Analytics
No ratings yet
Lecture_3_Machine_learning_Techniques_For_Predictive_Analytics
40 pages
06-Classification_Part2
No ratings yet
06-Classification_Part2
34 pages
Sharda dss10 PPT 06
No ratings yet
Sharda dss10 PPT 06
43 pages
This Is
No ratings yet
This Is
7 pages
Week 5 Slides
No ratings yet
Week 5 Slides
25 pages
Chapter 9. Classification: Advanced Methods
No ratings yet
Chapter 9. Classification: Advanced Methods
39 pages
SVM, Neural Network and Random Forest in R
No ratings yet
SVM, Neural Network and Random Forest in R
45 pages
IIS Lecture 3
No ratings yet
IIS Lecture 3
21 pages
DWDM
No ratings yet
DWDM
20 pages
Data analysis ch1
No ratings yet
Data analysis ch1
13 pages
Deep Learning
No ratings yet
Deep Learning
13 pages
Unec 1705121586
No ratings yet
Unec 1705121586
33 pages
Machine-Learning Techniques for Predictive Analytics
No ratings yet
Machine-Learning Techniques for Predictive Analytics
53 pages
Stock Trend Prediction With Neural Network Techniques
No ratings yet
Stock Trend Prediction With Neural Network Techniques
61 pages
Machine Learning Concept1
No ratings yet
Machine Learning Concept1
16 pages
Artificial Neural Network Bao
No ratings yet
Artificial Neural Network Bao
26 pages
AAM book
No ratings yet
AAM book
159 pages
Deep Learning 1
No ratings yet
Deep Learning 1
48 pages
1
No ratings yet
1
42 pages
SVM Unit 2
No ratings yet
SVM Unit 2
12 pages
Lecture 9
No ratings yet
Lecture 9
27 pages
Ijcsea 2
No ratings yet
Ijcsea 2
13 pages
AIYA SESSION 4
No ratings yet
AIYA SESSION 4
42 pages
SVM
No ratings yet
SVM
2 pages
MachineLearning Lecture 2
No ratings yet
MachineLearning Lecture 2
23 pages
"Classifiers": R & D Project by Under The Guidance of
No ratings yet
"Classifiers": R & D Project by Under The Guidance of
59 pages
DL1-Ver1
No ratings yet
DL1-Ver1
49 pages
Term Paper: Dept of CSE, GMRIT
No ratings yet
Term Paper: Dept of CSE, GMRIT
16 pages
Instance Based Learning
No ratings yet
Instance Based Learning
21 pages
12 Advanced Machine Learning Algorithms
No ratings yet
12 Advanced Machine Learning Algorithms
41 pages
Topic: Machine Learning
No ratings yet
Topic: Machine Learning
35 pages
UNIT 1,2,3
No ratings yet
UNIT 1,2,3
17 pages
ML-24-SVM-other info-v.0.1_=15
No ratings yet
ML-24-SVM-other info-v.0.1_=15
20 pages
Prediction in Data Mining
No ratings yet
Prediction in Data Mining
12 pages
Machine Learning
No ratings yet
Machine Learning
32 pages
Sales Forecasting Using Kernel Based Support Vector Machine Algorithm
No ratings yet
Sales Forecasting Using Kernel Based Support Vector Machine Algorithm
6 pages
lec08_Classification_kNN_ANN
No ratings yet
lec08_Classification_kNN_ANN
39 pages
Unit 5
No ratings yet
Unit 5
28 pages
UNIT - 2-1
No ratings yet
UNIT - 2-1
7 pages
CSCI946 w5-classification
No ratings yet
CSCI946 w5-classification
72 pages
DWDM Rit-E22 Unit4
No ratings yet
DWDM Rit-E22 Unit4
139 pages
SUpport Vector Machine
No ratings yet
SUpport Vector Machine
28 pages
Machine Learning-Gkouzionis
No ratings yet
Machine Learning-Gkouzionis
14 pages
Machine Learning in A Nutshell
No ratings yet
Machine Learning in A Nutshell
36 pages
Module 3
No ratings yet
Module 3
11 pages
Summer of Science-Final Report
100% (1)
Summer of Science-Final Report
7 pages
Bda unit 5
No ratings yet
Bda unit 5
11 pages
Machine Learning
No ratings yet
Machine Learning
32 pages
ML 7th Sem AIML ITE Notes Complete LONG
No ratings yet
ML 7th Sem AIML ITE Notes Complete LONG
202 pages
ML RUSA Module 6 Probablistic EM KNN SVM
No ratings yet
ML RUSA Module 6 Probablistic EM KNN SVM
51 pages
AP for NLP-LO2
No ratings yet
AP for NLP-LO2
38 pages
Geometric functions in computer aided geometric design
From Everand
Geometric functions in computer aided geometric design
Oscar Ruiz
No ratings yet
Multiple Integrals, A Collection of Solved Problems
From Everand
Multiple Integrals, A Collection of Solved Problems
Steven Tan
No ratings yet
The Numpy Pocketbook: Essentials on the Go
From Everand
The Numpy Pocketbook: Essentials on the Go
Silas Meadowlark
No ratings yet
Neural Networks
From Everand
Neural Networks
Sasha Kurzweil
No ratings yet
Data Mining - Bi 3
No ratings yet
Data Mining - Bi 3
40 pages
Intrusion Detection System On IoT With 5G Network
No ratings yet
Intrusion Detection System On IoT With 5G Network
13 pages
ET Instrumentation 25 983c7f01af
No ratings yet
ET Instrumentation 25 983c7f01af
1 page
Pre Employment Medical 9aa0a328eb
No ratings yet
Pre Employment Medical 9aa0a328eb
16 pages
urfd
No ratings yet
urfd
14 pages
Cement and Concrete Research: Sciencedirect
No ratings yet
Cement and Concrete Research: Sciencedirect
10 pages
An introduction to IoT Analytics 1st Edition Harry G Perros download
100% (2)
An introduction to IoT Analytics 1st Edition Harry G Perros download
58 pages
A Survey of Predictive Maintenance: Systems, Purposes and Approaches
No ratings yet
A Survey of Predictive Maintenance: Systems, Purposes and Approaches
38 pages
Rapid_analysis_technologies_with_chemometrics_forf
No ratings yet
Rapid_analysis_technologies_with_chemometrics_forf
28 pages
Data Mining for Bioinformatics 1st Edition Sumeet Dua instant download
100% (2)
Data Mining for Bioinformatics 1st Edition Sumeet Dua instant download
53 pages
Personality Prediction Using CV, Deep Learning
No ratings yet
Personality Prediction Using CV, Deep Learning
7 pages
synopsis 3d objects2
No ratings yet
synopsis 3d objects2
21 pages
Soft Computing Unit 1 Notes
No ratings yet
Soft Computing Unit 1 Notes
33 pages
Bessel Sequence and Operator Norm
No ratings yet
Bessel Sequence and Operator Norm
27 pages
(INTI
No ratings yet
(INTI
9 pages
Main
No ratings yet
Main
13 pages
Review On Online Feature Selection
No ratings yet
Review On Online Feature Selection
4 pages
Download Complete Machine Learning for Intelligent Multimedia Analytics Techniques and Applications Pardeep Kumar Amit Kumar Singh Eds PDF for All Chapters
100% (6)
Download Complete Machine Learning for Intelligent Multimedia Analytics Techniques and Applications Pardeep Kumar Amit Kumar Singh Eds PDF for All Chapters
50 pages
7.Tomato Quality Classification Based on Transfer
No ratings yet
7.Tomato Quality Classification Based on Transfer
14 pages
vihari
No ratings yet
vihari
27 pages
SS ZC416 Revised Course Handout
No ratings yet
SS ZC416 Revised Course Handout
6 pages
IEEE-Machine Learning For The Predictive Maintenance of A Jaw Crusher in The Mining Industry
No ratings yet
IEEE-Machine Learning For The Predictive Maintenance of A Jaw Crusher in The Mining Industry
6 pages
Dynamic Modeling of Complex Industrial Processes Data driven Methods and Application Research 1st Edition Chao Shang (Auth.) download
100% (1)
Dynamic Modeling of Complex Industrial Processes Data driven Methods and Application Research 1st Edition Chao Shang (Auth.) download
62 pages
1) Download the binary classification dataset for... - Colab
No ratings yet
1) Download the binary classification dataset for... - Colab
6 pages
Big Data Syllabus
No ratings yet
Big Data Syllabus
17 pages
machine learning LIST OF EXPERIMENTS
No ratings yet
machine learning LIST OF EXPERIMENTS
2 pages
Learning Techniques For NILMTK
No ratings yet
Learning Techniques For NILMTK
9 pages
5th International Conference On Electronics and Sustainable Communication Systems (ICESC 2024)
No ratings yet
5th International Conference On Electronics and Sustainable Communication Systems (ICESC 2024)
15 pages
Ciml Mini Project (1)
No ratings yet
Ciml Mini Project (1)
19 pages
ICOfrauddetectionpaper 1
No ratings yet
ICOfrauddetectionpaper 1
17 pages
Diabetes Prediction Using Machine Learning Classification Techniques
No ratings yet
Diabetes Prediction Using Machine Learning Classification Techniques
34 pages
Trends in Food Science & Technology: Eloisa Bagnulo, Giulia Strocchi, Carlo Bicchi, Erica Liberto
No ratings yet
Trends in Food Science & Technology: Eloisa Bagnulo, Giulia Strocchi, Carlo Bicchi, Erica Liberto
13 pages
Weather Forecasting Basepaper
100% (1)
Weather Forecasting Basepaper
14 pages

Predictive Modeling BI 4

Uploaded by

Predictive Modeling BI 4

Uploaded by

Techniques for Predictive Modeling-Learning Objectives

• Understand the concept and definitions of artificial

Appln: Predictive Modeling Helps Better Understand and Manage Complex

• Processing element (PE)

(a) Single neuron (b) Multiple neurons

Summation function: Y = 3(0.2) + 1(0.4) + 2(0.1) = 1.2

W2 = 0.4 Processing Y = 1.2

Feed-forward MLP with 1 Hidden Layer

INPUT HIDDEN OUTPUT

• Data is split into three parts

• k-fold cross validation

• SVM are among the most popular machine-learning techniques.

• Goal of SVM: to generate mathematical functions that map input

• A hyperplane is a geometric concept used to describe the separation

➢ Many linear classifiers (hyperplanes) may separate the data

Managing Student Retention with

• Following a machine-learning process, an SVM learns from the historic

Develop the Model

Validated SVM model

Deploy the Model

• ANNs and SVMs → time-demanding, computationally intensive iterative

The answer depends on

Historic Data ü Distance measure

– Numeric versus nominal values?

Efficient Image Recognition and

Questions for Discussion

You might also like