SlideShare a Scribd company logo
Special Codemotion
Tech Meetup:
Data Driven Innovation
Antimo Musone
IT Manager
17 Maggio 2016
2
About Me
Presentation title
►>Antimo Musone
 IT Manager / Architect presso EY
 Co - Founder Fifth Ingenum Srls.
 Ing. Informatica II Università degli Studi di Napoli
 email: antimo.musone@it.ey.com
4
What is Machine Learning ?
5
Machine Learning / Predictive Analytics
Vision Analytics
Recommenda-
tion engines
Advertising
analysis
Weather
forecasting for
business
planning
Social network
analysis
Legal
discovery and
document
archiving
Pricing analysis
Fraud
detection
Churn
analysis
Equipment
monitoring
Location-based
tracking and
services
Personalized
Insurance
Machine learning &
predictive analytics are core
capabilities that are needed
throughout your business
6
Machine Learning Overview
► Formal definition: “The field of machine learning is concerned with the
question of how to construct computer programs that automatically improve
with experience” - Tom M. Mitchell
► Another definition: “The goal of machine learning is to program computers to
use example data or past experience to solve a given problem.” – Introduction to
Machine Learning, 2nd Edition, MIT Press
► ML often involves two primary techniques:
► Supervised Learning: Finding the mapping between inputs and outputs using
correct values to “train” a model
► Unsupervised Learning: Finding patterns in the input data (similar to Density
Estimates in Statistics)
7
Machine Learning
Data:
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
Rules, or Algorithms:
about, Learning, language – Spelling and sounding builds words
Learning about language. – Words build sentences
Learning, or Abstraction:
Any new understanding proceeds from previous knowledge.
Data + Rules/ Algorithms = Machine Learning
8
Traditional programming VS Machine
Learning
Computer
Data
Program
Output
Traditional Programming
Data
Output
Program/Algorithms
Machine Learning
Program can predict the output!
Computer
9
ML : No, more like gardening
Gardener = You
Seeds = Algorithms
Nutrients = Data
Plants = Programs
10
ML Sample Application
► Web search
► Computational biology
► Finance
► E-commerce
► Space exploration
► Robotics
► Information extraction
► Social networks
► Debugging
► [Your favorite area]
11
What is Predictive Analytics?
Presentation title
Wikipedia Definition: (http://en.wikipedia.org/wiki/Predictive_analytics)
“Predictive analytics encompasses a variety of techniques from
statistics, modeling, machine learning, and data mining that analyze
current and historical facts to make predictions about future, or
otherwise unknown, events.”
Facts Predictions
Predictive
Analytics
Techniques
12
Breaking it Down
Presentation title
“Predictive analytics encompasses a variety of techniques from
statistics, modeling, machine learning, and data mining that analyze
current and historical facts to make predictions about future, or otherwise
unknown, events.”
Machine Learning Use of
computer algorithms to derive
complex formulations based on
objectives and constraints
Tools and Techniques
Data visualization,
segmentation, correlations
Use in Predictive Analytics
Predictive analytics is often
applied in the context of
datasets that are too large for
manual analysis, so data mining
techniques are required
Statistics Focus on learning
population characteristics based on
samples of data
Tools and Techniques p-values,
confidence intervals, sampling,
ANOVA
Use in Predictive Analytics
Underlying theory behind many
parametric models – observed facts
are a sample from a population
including both known/historic and
unknown/future events
Modeling Representations of systems
used to understand the underlying
dynamics of the system
Tools and Techniques
Symbolic logic, proxies
Use in Predictive Analytics
Complex relationships can be simplified
through modeling – these models can
then be used to analyze relationships
between factors
13
What is a Model?
A model is a simplified representation of observed
effects
Presentation title
Key terms:
 Dependent or target variable – the variable of interest
 Independent or predictor variable(s) – variable(s) used
for explanation/prediction
 Effect – the (quantitative) impact of an independent
variable or combination of independent variables on the
dependent variable
 Main Effect – The direct effect of a single independent variable
on the dependent variable
 Interaction Effect – The effect of a combination of multiple
independent variables on the dependent variable
14
Two types of model
A model is a simplified representation of observed
effects
Presentation title
Statistical
Parametric Models
Effects are well-quantified and can
be examined
An equation can be used to
represent the model
Emphasis on explanation
“What causes the dependent
variable to change?”
Test hypotheses
p-values, confidence intervals
Machine Learning
Non-parametric models
Effects may be unquantified (“black
box”)
No representative equation
Model may be stochastic, so results
my vary
Emphasis on prediction
“What will the value of the next
observation be?”
Generate hypotheses
15
Types of Learning
► Supervised (inductive) learning
► Training data includes desired outputs
► Dependent variable is known
► May be statistical or non-statistical
► Unsupervised learning
► Training data does not include desired outputs
► No dependent variable
► Non-statistical
► Semi-supervised learning
► Training data includes a few desired outputs
16
Machine Learning Problem
Classification or
Categorization
Clustering
Regression
Dimensionality
reduction
Supervised Learning Unsupervised Learning
DiscreteContinuous
17
What is Logistic Regression?
Presentation title
Regression Models are a form of supervised learning that attempt to fit
“linear” functions to training data – the most common type of regression,
linear regression, should be familiar to most of you as a “best fit line”
Logistic Regression is closely related to linear regression, but fits a
different shape function by using a binomial link function on the dependent
variable
18
Machine Learning Example
Predict function F(X) for new examples X
Discrete F(X): Classification
Continuous F(X): Regression
F(X) = Probability(X): Probability
estimation
Given examples of a function (X, F(X))
The probability of an event X, denoted F(X), represents the proportion of all
events that have X as their outcome, and is typically represented as a
decimal 0<P(X)<1
19
Machine Learning Example
Apply a prediction function to a feature representation of the image to get the
desired output:
• Training: given a training set of labeled examples {(x1,y1), …, (xN,yN)}, estimate the prediction
function f by minimizing the prediction error on the training set
• Testing: apply f to a never before seen test example x and output the predicted value y = f(x)
output prediction
function
Image
feature
y = f(x)
F( ) = «apple»
F( ) =«tomato»
F( ) = «dog»
20
Supervised Learning
 Used when you want to predict unknown answers from answers
you already have
 Data is divided into two parts: the data you will use to “teach” the
system (data set), and the data to test the algorithm (test set)
 After you select and clean the data, you select data points that show
the right relationships in the data. The answers are “labels”, the
categories/columns/attributes are “features” and the values
are…values.
 Then you select an algorithm to compute the outcome. (Often you
choose more than one)
 You run the program on the data set, and check to see if you got the
right answer from the test set.
 Once you perform the experiment, you select the best model. This is
the final output – the model is then used against more data to get the
answers you need
21
Supervised Learning
 Car
 Not Car
22
Unsupervised Learning
 Used when you want to find unknown answers –
mostly groupings - directly from data
 No simple way to evaluate accuracy of what you learn
 Evaluates more vectors, groups into sets or classifications
 Start with the data
 Apply algorithm
 Evaluate groups
23
Unsupervised Learning
Example 1 example A Example 2
example B Example 3 example C
example A example B example C
Example 1 Example 2 Example 3
The clustering strategies have more tendency to transitively group points even if
they are not nearby in feature space
24
Cross-Validation and Model Evaluation
Presentation title
Cross-validation is a method of ensuring that models generalize to data they
have not been trained to fit
 Given any collection of data points, a model can be developed that fits the
data exactly; however, this model will have no predictive power
25
Evaluating Predictive Models
Presentation title
Model evaluation involves a combination of objective criteria and
subjective judgment
Objective Measures
Gain or Lift
Sensitivity
Accuracy
Others
Subjective Considerations
Business intuition
Explainability
Simplicity
Usefulness
26
Gain or Lift
Presentation title
Lift is a measure of the effectiveness of a predictive model calculated as
the ratio between the results obtained with and without the predictive
model.
 Cumulative gains and lift charts are visual aids for measuring model
performance
 Both charts consist of a lift curve and a baseline
 The greater the area between the lift curve and the baseline, the
better
27
Sensitivity
Presentation title
A Receiver Operating Characteristic (ROC) curve is a plot of test
sensitivity as a function of (1 - specificity) for several possible (arbitrary)
cut off values. The curve illustrates the trade off between type I and type
II errors in a given test.
 The closer the curve follows the left-hand border and then the top
border of the ROC space, the more accurate the test, and the area
under the curve is a measure of accuracy.
28
IBM Watson
29
Cognitive Services
Presentation title
30
Cortana Suite
Presentation title
31
Cortana Suite
Presentation title
32
Demo Cognitive
33
Microsft Azure Learning Machine
 Data It’s all about the data. Here’s where you will acquire,
compile, and analyze testing and training data sets for use in
creating Azure Machine Learning predictive models.
 Create the model Use various machine learning algorithms
to create new models that are capable of making predictions
based on inferences about the data sets.
 Evaluate the model Examine the accuracy of new predictive
models based on ability to predict the correct outcome, when
both the input and output values are known in advance.
Accuracy is measured in terms of confidence factor
approaching the whole number one.
 Refine and evaluate the model Compare, contrast, and
combine alternate predictive models to find the right
combination(s) that can consistently produce the most
accurate results.
 Deploy the model Expose the new predictive model as a
scalable cloud web service, one that is easily accessible over
the Internet by any web browser or mobile client.
 Test and use the model Implement the new predictive model
web service in a test or production application scenario.
34
Azure Machine Learning algorithms
 Classification algorithms These are used to classify data into
different categories that can then be used to predict one or more
discrete variables, based on the other attributes in the dataset.
 Regression algorithms These are used to predict one or more
continuous variables, such as profit or loss, based on other attributes
in the dataset.
 Clustering algorithms These determine natural groupings and
patterns in datasets and are used to predict grouping classifications
for a given variable.
35
Azure Machine Learning Steps
36
Azure Machine Learning Deploying a
prediction model
The deployment of a new prediction model takes the form of exposing a
web service which can then be invoked via the Representational State
Transfer (REST) protocol.
► Azure Machine Learning web services can be called via two different exposed interfaces:
► Single, request/response-style calls.
► “Batch” style calls, where multiple input records are passed into the web service in a single
call and the corresponding response contains an output list of predictions for each input record.
When a new machine learning prediction model is exposed on the Web, it
performs the following operations:
► New input data is passed into the web service in the form of a JavaScript Object Notation
(JSON) payload.
► The web service then passes the incoming data as inputs into the Azure Machine Learning
prediction model engine.
► The Azure Machine Learning model then generates a new prediction based on the input data
and returns the new prediction results to the caller via a JSON payload.
37
Azure ML Getting Started
Option 1 Take advantage of a free Azure trial offer at
http://azure.microsoft.com/en-us/pricing/free-trial
Option 2 Take advantage of the (free) Azure Machine
Learning trial offer at https://studio.azureml.net/Home
38
Thanks
Presentation title
► Questions?
39
Azure Machine Learning workspaces
 Workspace
 Experiments
 Azure ML Studio
 Web Services
 Datasets
 Modules
40
Create your first Azure Machine Learning
experiment
► Defining the problem you want to solve
► e.g. figure out if you like certain movie what’s another movie you should watch?
(movie recommendation )
► Designing a Solution Using AzureML
1. Creating an experiment
2. Load a Data Set
3. Create the Experiment
4. Add Transformations and Filters
5. Create the Experiment Path and apply Algorithms
6. Save and Run the Model
7. Publish the Model
8. Use the Model
► Saving and Running
► Publishing and Accessing
Ad

More Related Content

What's hot (16)

Supervised learning
Supervised learningSupervised learning
Supervised learning
Alia Hamwi
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
Nazmus Sakib
 
Lecture 9: Machine Learning in Practice (2)
Lecture 9: Machine Learning in Practice (2)Lecture 9: Machine Learning in Practice (2)
Lecture 9: Machine Learning in Practice (2)
Marina Santini
 
L2. Evaluating Machine Learning Algorithms I
L2. Evaluating Machine Learning Algorithms IL2. Evaluating Machine Learning Algorithms I
L2. Evaluating Machine Learning Algorithms I
Machine Learning Valencia
 
internship project1 report
internship project1 reportinternship project1 report
internship project1 report
sheyk98
 
LR2. Summary Day 2
LR2. Summary Day 2LR2. Summary Day 2
LR2. Summary Day 2
Machine Learning Valencia
 
Qt unit i
Qt unit   iQt unit   i
Qt unit i
bhuvana ganesan
 
Anomaly detection workshop
Anomaly detection workshopAnomaly detection workshop
Anomaly detection workshop
gforgovind
 
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Md. Main Uddin Rony
 
Supervised Machine Learning Techniques common algorithms and its application
Supervised Machine Learning Techniques common algorithms and its applicationSupervised Machine Learning Techniques common algorithms and its application
Supervised Machine Learning Techniques common algorithms and its application
Tara ram Goyal
 
Dbm630 lecture08
Dbm630 lecture08Dbm630 lecture08
Dbm630 lecture08
Tokyo Institute of Technology
 
IRJET- Stock Market Prediction using Machine Learning Techniques
IRJET- Stock Market Prediction using Machine Learning TechniquesIRJET- Stock Market Prediction using Machine Learning Techniques
IRJET- Stock Market Prediction using Machine Learning Techniques
IRJET Journal
 
孔令傑 / 給工程師的統計學及資料分析 123 (2016/9/4)
孔令傑 / 給工程師的統計學及資料分析 123 (2016/9/4)孔令傑 / 給工程師的統計學及資料分析 123 (2016/9/4)
孔令傑 / 給工程師的統計學及資料分析 123 (2016/9/4)
台灣資料科學年會
 
Supervised learning and Unsupervised learning
Supervised learning and Unsupervised learning Supervised learning and Unsupervised learning
Supervised learning and Unsupervised learning
Usama Fayyaz
 
Machine Learning and Causal Inference
Machine Learning and Causal InferenceMachine Learning and Causal Inference
Machine Learning and Causal Inference
NBER
 
Modelling and evaluation
Modelling and evaluationModelling and evaluation
Modelling and evaluation
eShikshak
 
Supervised learning
Supervised learningSupervised learning
Supervised learning
Alia Hamwi
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
Nazmus Sakib
 
Lecture 9: Machine Learning in Practice (2)
Lecture 9: Machine Learning in Practice (2)Lecture 9: Machine Learning in Practice (2)
Lecture 9: Machine Learning in Practice (2)
Marina Santini
 
L2. Evaluating Machine Learning Algorithms I
L2. Evaluating Machine Learning Algorithms IL2. Evaluating Machine Learning Algorithms I
L2. Evaluating Machine Learning Algorithms I
Machine Learning Valencia
 
internship project1 report
internship project1 reportinternship project1 report
internship project1 report
sheyk98
 
Anomaly detection workshop
Anomaly detection workshopAnomaly detection workshop
Anomaly detection workshop
gforgovind
 
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Md. Main Uddin Rony
 
Supervised Machine Learning Techniques common algorithms and its application
Supervised Machine Learning Techniques common algorithms and its applicationSupervised Machine Learning Techniques common algorithms and its application
Supervised Machine Learning Techniques common algorithms and its application
Tara ram Goyal
 
IRJET- Stock Market Prediction using Machine Learning Techniques
IRJET- Stock Market Prediction using Machine Learning TechniquesIRJET- Stock Market Prediction using Machine Learning Techniques
IRJET- Stock Market Prediction using Machine Learning Techniques
IRJET Journal
 
孔令傑 / 給工程師的統計學及資料分析 123 (2016/9/4)
孔令傑 / 給工程師的統計學及資料分析 123 (2016/9/4)孔令傑 / 給工程師的統計學及資料分析 123 (2016/9/4)
孔令傑 / 給工程師的統計學及資料分析 123 (2016/9/4)
台灣資料科學年會
 
Supervised learning and Unsupervised learning
Supervised learning and Unsupervised learning Supervised learning and Unsupervised learning
Supervised learning and Unsupervised learning
Usama Fayyaz
 
Machine Learning and Causal Inference
Machine Learning and Causal InferenceMachine Learning and Causal Inference
Machine Learning and Causal Inference
NBER
 
Modelling and evaluation
Modelling and evaluationModelling and evaluation
Modelling and evaluation
eShikshak
 

Similar to Tech meetup Data Driven - Codemotion (20)

Top 20 Data Science Interview Questions and Answers in 2023.pptx
Top 20 Data Science Interview Questions and Answers in 2023.pptxTop 20 Data Science Interview Questions and Answers in 2023.pptx
Top 20 Data Science Interview Questions and Answers in 2023.pptx
AnanthReddy38
 
Supervised learning techniques and applications
Supervised learning techniques and applicationsSupervised learning techniques and applications
Supervised learning techniques and applications
Benjaminlapid1
 
Top 20 Data Science Interview Questions and Answers in 2023.pdf
Top 20 Data Science Interview Questions and Answers in 2023.pdfTop 20 Data Science Interview Questions and Answers in 2023.pdf
Top 20 Data Science Interview Questions and Answers in 2023.pdf
AnanthReddy38
 
Machine Learning by Rj
Machine Learning by RjMachine Learning by Rj
Machine Learning by Rj
Shree M.L.Kakadiya MCA mahila college, Amreli
 
Statistical Machine Learning Lecture notes
Statistical Machine Learning  Lecture notesStatistical Machine Learning  Lecture notes
Statistical Machine Learning Lecture notes
SureshK256753
 
Optimal Model Complexity (1).pptx
Optimal Model Complexity (1).pptxOptimal Model Complexity (1).pptx
Optimal Model Complexity (1).pptx
MurindanyiSudi1
 
Introduction to machine learning-2023-IT-AI and DS.pdf
Introduction to machine learning-2023-IT-AI and DS.pdfIntroduction to machine learning-2023-IT-AI and DS.pdf
Introduction to machine learning-2023-IT-AI and DS.pdf
SisayNegash4
 
Regression with Microsoft Azure & Ms Excel
Regression with Microsoft Azure & Ms ExcelRegression with Microsoft Azure & Ms Excel
Regression with Microsoft Azure & Ms Excel
Dr. Abdul Ahad Abro
 
Machine Learning.pptx
Machine Learning.pptxMachine Learning.pptx
Machine Learning.pptx
NitinSharma134320
 
Machine Learning with Python- Methods for Machine Learning.pptx
Machine Learning with Python- Methods for Machine Learning.pptxMachine Learning with Python- Methods for Machine Learning.pptx
Machine Learning with Python- Methods for Machine Learning.pptx
iaeronlineexm
 
Machine Learning Basics
Machine Learning BasicsMachine Learning Basics
Machine Learning Basics
Suresh Arora
 
Machine Learning_Unit 2_Full.ppt.pdf
Machine Learning_Unit 2_Full.ppt.pdfMachine Learning_Unit 2_Full.ppt.pdf
Machine Learning_Unit 2_Full.ppt.pdf
Dr.DHANALAKSHMI SENTHILKUMAR
 
G. Barcaroli, The use of machine learning in official statistics
G. Barcaroli, The use of machine learning in official statisticsG. Barcaroli, The use of machine learning in official statistics
G. Barcaroli, The use of machine learning in official statistics
Istituto nazionale di statistica
 
Machine Learning: Transforming Data into Insights
Machine Learning: Transforming Data into InsightsMachine Learning: Transforming Data into Insights
Machine Learning: Transforming Data into Insights
pemac73062
 
Data Science Interview Questions PDF By ScholarHat
Data Science Interview Questions PDF By ScholarHatData Science Interview Questions PDF By ScholarHat
Data Science Interview Questions PDF By ScholarHat
Scholarhat
 
5_Model for Predictions_Machine_Learning.ppt
5_Model for Predictions_Machine_Learning.ppt5_Model for Predictions_Machine_Learning.ppt
5_Model for Predictions_Machine_Learning.ppt
VGaneshKarthikeyan
 
DS103 - Unit04 - Part1DS103 - Unit04 - Part1.pptx
DS103 - Unit04 - Part1DS103 - Unit04 - Part1.pptxDS103 - Unit04 - Part1DS103 - Unit04 - Part1.pptx
DS103 - Unit04 - Part1DS103 - Unit04 - Part1.pptx
FutureTechnologies3
 
Lesson 1 - Overview of Machine Learning and Data Analysis.pptx
Lesson 1 - Overview of Machine Learning and Data Analysis.pptxLesson 1 - Overview of Machine Learning and Data Analysis.pptx
Lesson 1 - Overview of Machine Learning and Data Analysis.pptx
cloudserviceuit
 
Chapter 05 Machine Learning.pptx
Chapter 05 Machine Learning.pptxChapter 05 Machine Learning.pptx
Chapter 05 Machine Learning.pptx
ssuser957b41
 
Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)
butest
 
Top 20 Data Science Interview Questions and Answers in 2023.pptx
Top 20 Data Science Interview Questions and Answers in 2023.pptxTop 20 Data Science Interview Questions and Answers in 2023.pptx
Top 20 Data Science Interview Questions and Answers in 2023.pptx
AnanthReddy38
 
Supervised learning techniques and applications
Supervised learning techniques and applicationsSupervised learning techniques and applications
Supervised learning techniques and applications
Benjaminlapid1
 
Top 20 Data Science Interview Questions and Answers in 2023.pdf
Top 20 Data Science Interview Questions and Answers in 2023.pdfTop 20 Data Science Interview Questions and Answers in 2023.pdf
Top 20 Data Science Interview Questions and Answers in 2023.pdf
AnanthReddy38
 
Statistical Machine Learning Lecture notes
Statistical Machine Learning  Lecture notesStatistical Machine Learning  Lecture notes
Statistical Machine Learning Lecture notes
SureshK256753
 
Optimal Model Complexity (1).pptx
Optimal Model Complexity (1).pptxOptimal Model Complexity (1).pptx
Optimal Model Complexity (1).pptx
MurindanyiSudi1
 
Introduction to machine learning-2023-IT-AI and DS.pdf
Introduction to machine learning-2023-IT-AI and DS.pdfIntroduction to machine learning-2023-IT-AI and DS.pdf
Introduction to machine learning-2023-IT-AI and DS.pdf
SisayNegash4
 
Regression with Microsoft Azure & Ms Excel
Regression with Microsoft Azure & Ms ExcelRegression with Microsoft Azure & Ms Excel
Regression with Microsoft Azure & Ms Excel
Dr. Abdul Ahad Abro
 
Machine Learning with Python- Methods for Machine Learning.pptx
Machine Learning with Python- Methods for Machine Learning.pptxMachine Learning with Python- Methods for Machine Learning.pptx
Machine Learning with Python- Methods for Machine Learning.pptx
iaeronlineexm
 
Machine Learning Basics
Machine Learning BasicsMachine Learning Basics
Machine Learning Basics
Suresh Arora
 
G. Barcaroli, The use of machine learning in official statistics
G. Barcaroli, The use of machine learning in official statisticsG. Barcaroli, The use of machine learning in official statistics
G. Barcaroli, The use of machine learning in official statistics
Istituto nazionale di statistica
 
Machine Learning: Transforming Data into Insights
Machine Learning: Transforming Data into InsightsMachine Learning: Transforming Data into Insights
Machine Learning: Transforming Data into Insights
pemac73062
 
Data Science Interview Questions PDF By ScholarHat
Data Science Interview Questions PDF By ScholarHatData Science Interview Questions PDF By ScholarHat
Data Science Interview Questions PDF By ScholarHat
Scholarhat
 
5_Model for Predictions_Machine_Learning.ppt
5_Model for Predictions_Machine_Learning.ppt5_Model for Predictions_Machine_Learning.ppt
5_Model for Predictions_Machine_Learning.ppt
VGaneshKarthikeyan
 
DS103 - Unit04 - Part1DS103 - Unit04 - Part1.pptx
DS103 - Unit04 - Part1DS103 - Unit04 - Part1.pptxDS103 - Unit04 - Part1DS103 - Unit04 - Part1.pptx
DS103 - Unit04 - Part1DS103 - Unit04 - Part1.pptx
FutureTechnologies3
 
Lesson 1 - Overview of Machine Learning and Data Analysis.pptx
Lesson 1 - Overview of Machine Learning and Data Analysis.pptxLesson 1 - Overview of Machine Learning and Data Analysis.pptx
Lesson 1 - Overview of Machine Learning and Data Analysis.pptx
cloudserviceuit
 
Chapter 05 Machine Learning.pptx
Chapter 05 Machine Learning.pptxChapter 05 Machine Learning.pptx
Chapter 05 Machine Learning.pptx
ssuser957b41
 
Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)
butest
 
Ad

More from antimo musone (13)

Multi Cloud essentials
Multi Cloud essentialsMulti Cloud essentials
Multi Cloud essentials
antimo musone
 
Intelligent ChatBot
Intelligent ChatBotIntelligent ChatBot
Intelligent ChatBot
antimo musone
 
Amazon Echo
Amazon EchoAmazon Echo
Amazon Echo
antimo musone
 
AI Machine vs Human
AI Machine vs HumanAI Machine vs Human
AI Machine vs Human
antimo musone
 
Azure Machine Learning Dotnet Campus 2015
Azure Machine Learning Dotnet Campus 2015 Azure Machine Learning Dotnet Campus 2015
Azure Machine Learning Dotnet Campus 2015
antimo musone
 
Cloud Computing - Albicocca University of Milan
Cloud Computing - Albicocca University of MilanCloud Computing - Albicocca University of Milan
Cloud Computing - Albicocca University of Milan
antimo musone
 
Web Cloud Computing SQL Server - Ferrara University
Web Cloud Computing SQL Server  -  Ferrara UniversityWeb Cloud Computing SQL Server  -  Ferrara University
Web Cloud Computing SQL Server - Ferrara University
antimo musone
 
.netcampus 2014 - E Commerce On Cloud
.netcampus 2014 - E Commerce On Cloud.netcampus 2014 - E Commerce On Cloud
.netcampus 2014 - E Commerce On Cloud
antimo musone
 
Microsoft Accademic Tour Web and Cloud Service Bologna
Microsoft Accademic Tour Web and Cloud Service BolognaMicrosoft Accademic Tour Web and Cloud Service Bologna
Microsoft Accademic Tour Web and Cloud Service Bologna
antimo musone
 
.NetCampus Windows Azure Mobile
.NetCampus Windows Azure Mobile.NetCampus Windows Azure Mobile
.NetCampus Windows Azure Mobile
antimo musone
 
Codemotion Windows Azure Mobile
Codemotion Windows Azure MobileCodemotion Windows Azure Mobile
Codemotion Windows Azure Mobile
antimo musone
 
Antimo Musone Graduation Thesis
Antimo Musone Graduation ThesisAntimo Musone Graduation Thesis
Antimo Musone Graduation Thesis
antimo musone
 
Microsoft Accademic Cloud Tour Genova
Microsoft Accademic Cloud Tour GenovaMicrosoft Accademic Cloud Tour Genova
Microsoft Accademic Cloud Tour Genova
antimo musone
 
Multi Cloud essentials
Multi Cloud essentialsMulti Cloud essentials
Multi Cloud essentials
antimo musone
 
Azure Machine Learning Dotnet Campus 2015
Azure Machine Learning Dotnet Campus 2015 Azure Machine Learning Dotnet Campus 2015
Azure Machine Learning Dotnet Campus 2015
antimo musone
 
Cloud Computing - Albicocca University of Milan
Cloud Computing - Albicocca University of MilanCloud Computing - Albicocca University of Milan
Cloud Computing - Albicocca University of Milan
antimo musone
 
Web Cloud Computing SQL Server - Ferrara University
Web Cloud Computing SQL Server  -  Ferrara UniversityWeb Cloud Computing SQL Server  -  Ferrara University
Web Cloud Computing SQL Server - Ferrara University
antimo musone
 
.netcampus 2014 - E Commerce On Cloud
.netcampus 2014 - E Commerce On Cloud.netcampus 2014 - E Commerce On Cloud
.netcampus 2014 - E Commerce On Cloud
antimo musone
 
Microsoft Accademic Tour Web and Cloud Service Bologna
Microsoft Accademic Tour Web and Cloud Service BolognaMicrosoft Accademic Tour Web and Cloud Service Bologna
Microsoft Accademic Tour Web and Cloud Service Bologna
antimo musone
 
.NetCampus Windows Azure Mobile
.NetCampus Windows Azure Mobile.NetCampus Windows Azure Mobile
.NetCampus Windows Azure Mobile
antimo musone
 
Codemotion Windows Azure Mobile
Codemotion Windows Azure MobileCodemotion Windows Azure Mobile
Codemotion Windows Azure Mobile
antimo musone
 
Antimo Musone Graduation Thesis
Antimo Musone Graduation ThesisAntimo Musone Graduation Thesis
Antimo Musone Graduation Thesis
antimo musone
 
Microsoft Accademic Cloud Tour Genova
Microsoft Accademic Cloud Tour GenovaMicrosoft Accademic Cloud Tour Genova
Microsoft Accademic Cloud Tour Genova
antimo musone
 
Ad

Recently uploaded (20)

real illuminati Uganda agent 0782561496/0756664682
real illuminati Uganda agent 0782561496/0756664682real illuminati Uganda agent 0782561496/0756664682
real illuminati Uganda agent 0782561496/0756664682
way to join real illuminati Agent In Kampala Call/WhatsApp+256782561496/0756664682
 
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.pptJust-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
ssuser5f8f49
 
2024-Media-Literacy-Index-Of-Ukrainians-ENG-SHORT.pdf
2024-Media-Literacy-Index-Of-Ukrainians-ENG-SHORT.pdf2024-Media-Literacy-Index-Of-Ukrainians-ENG-SHORT.pdf
2024-Media-Literacy-Index-Of-Ukrainians-ENG-SHORT.pdf
OlhaTatokhina1
 
Modern_Distribution_Presentation.pptx Aa
Modern_Distribution_Presentation.pptx AaModern_Distribution_Presentation.pptx Aa
Modern_Distribution_Presentation.pptx Aa
MuhammadAwaisKamboh
 
03 Daniel 2-notes.ppt seminario escatologia
03 Daniel 2-notes.ppt seminario escatologia03 Daniel 2-notes.ppt seminario escatologia
03 Daniel 2-notes.ppt seminario escatologia
Alexander Romero Arosquipa
 
Deloitte Analytics - Applying Process Mining in an audit context
Deloitte Analytics - Applying Process Mining in an audit contextDeloitte Analytics - Applying Process Mining in an audit context
Deloitte Analytics - Applying Process Mining in an audit context
Process mining Evangelist
 
Principles of information security Chapter 5.ppt
Principles of information security Chapter 5.pptPrinciples of information security Chapter 5.ppt
Principles of information security Chapter 5.ppt
EstherBaguma
 
Chapter-3-PROBLEM-SOLVING.pdf hhhhhhhhhh
Chapter-3-PROBLEM-SOLVING.pdf hhhhhhhhhhChapter-3-PROBLEM-SOLVING.pdf hhhhhhhhhh
Chapter-3-PROBLEM-SOLVING.pdf hhhhhhhhhh
ChrisjohnAlfiler
 
3. Univariable and Multivariable Analysis_Using Stata_2025.pdf
3. Univariable and Multivariable Analysis_Using Stata_2025.pdf3. Univariable and Multivariable Analysis_Using Stata_2025.pdf
3. Univariable and Multivariable Analysis_Using Stata_2025.pdf
axonneurologycenter1
 
Adopting Process Mining at the Rabobank - use case
Adopting Process Mining at the Rabobank - use caseAdopting Process Mining at the Rabobank - use case
Adopting Process Mining at the Rabobank - use case
Process mining Evangelist
 
Suncorp - Integrating Process Mining at Australia's Largest Insurer
Suncorp - Integrating Process Mining at Australia's Largest InsurerSuncorp - Integrating Process Mining at Australia's Largest Insurer
Suncorp - Integrating Process Mining at Australia's Largest Insurer
Process mining Evangelist
 
...lab.Lab123456789123456789123456789123456789
...lab.Lab123456789123456789123456789123456789...lab.Lab123456789123456789123456789123456789
...lab.Lab123456789123456789123456789123456789
Ghh
 
Impact_of_AI_on_Everyday_Life and info.pptx
Impact_of_AI_on_Everyday_Life and info.pptxImpact_of_AI_on_Everyday_Life and info.pptx
Impact_of_AI_on_Everyday_Life and info.pptx
swatibhusari5
 
Process Mining and Data Science in the Financial Industry
Process Mining and Data Science in the Financial IndustryProcess Mining and Data Science in the Financial Industry
Process Mining and Data Science in the Financial Industry
Process mining Evangelist
 
Volkswagen - Analyzing the World's Biggest Purchasing Process
Volkswagen - Analyzing the World's Biggest Purchasing ProcessVolkswagen - Analyzing the World's Biggest Purchasing Process
Volkswagen - Analyzing the World's Biggest Purchasing Process
Process mining Evangelist
 
How to join illuminati Agent in uganda call+256776963507/0741506136
How to join illuminati Agent in uganda call+256776963507/0741506136How to join illuminati Agent in uganda call+256776963507/0741506136
How to join illuminati Agent in uganda call+256776963507/0741506136
illuminati Agent uganda call+256776963507/0741506136
 
indonesia-gen-z-report-2024 Gen Z (born between 1997 and 2012) is currently t...
indonesia-gen-z-report-2024 Gen Z (born between 1997 and 2012) is currently t...indonesia-gen-z-report-2024 Gen Z (born between 1997 and 2012) is currently t...
indonesia-gen-z-report-2024 Gen Z (born between 1997 and 2012) is currently t...
disnakertransjabarda
 
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdfIAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
mcgardenlevi9
 
MASAkkjjkttuyrdquesjhjhjfc44dddtions.docx
MASAkkjjkttuyrdquesjhjhjfc44dddtions.docxMASAkkjjkttuyrdquesjhjhjfc44dddtions.docx
MASAkkjjkttuyrdquesjhjhjfc44dddtions.docx
santosh162
 
新西兰文凭奥克兰理工大学毕业证书AUT成绩单补办
新西兰文凭奥克兰理工大学毕业证书AUT成绩单补办新西兰文凭奥克兰理工大学毕业证书AUT成绩单补办
新西兰文凭奥克兰理工大学毕业证书AUT成绩单补办
Taqyea
 
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.pptJust-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
ssuser5f8f49
 
2024-Media-Literacy-Index-Of-Ukrainians-ENG-SHORT.pdf
2024-Media-Literacy-Index-Of-Ukrainians-ENG-SHORT.pdf2024-Media-Literacy-Index-Of-Ukrainians-ENG-SHORT.pdf
2024-Media-Literacy-Index-Of-Ukrainians-ENG-SHORT.pdf
OlhaTatokhina1
 
Modern_Distribution_Presentation.pptx Aa
Modern_Distribution_Presentation.pptx AaModern_Distribution_Presentation.pptx Aa
Modern_Distribution_Presentation.pptx Aa
MuhammadAwaisKamboh
 
Deloitte Analytics - Applying Process Mining in an audit context
Deloitte Analytics - Applying Process Mining in an audit contextDeloitte Analytics - Applying Process Mining in an audit context
Deloitte Analytics - Applying Process Mining in an audit context
Process mining Evangelist
 
Principles of information security Chapter 5.ppt
Principles of information security Chapter 5.pptPrinciples of information security Chapter 5.ppt
Principles of information security Chapter 5.ppt
EstherBaguma
 
Chapter-3-PROBLEM-SOLVING.pdf hhhhhhhhhh
Chapter-3-PROBLEM-SOLVING.pdf hhhhhhhhhhChapter-3-PROBLEM-SOLVING.pdf hhhhhhhhhh
Chapter-3-PROBLEM-SOLVING.pdf hhhhhhhhhh
ChrisjohnAlfiler
 
3. Univariable and Multivariable Analysis_Using Stata_2025.pdf
3. Univariable and Multivariable Analysis_Using Stata_2025.pdf3. Univariable and Multivariable Analysis_Using Stata_2025.pdf
3. Univariable and Multivariable Analysis_Using Stata_2025.pdf
axonneurologycenter1
 
Adopting Process Mining at the Rabobank - use case
Adopting Process Mining at the Rabobank - use caseAdopting Process Mining at the Rabobank - use case
Adopting Process Mining at the Rabobank - use case
Process mining Evangelist
 
Suncorp - Integrating Process Mining at Australia's Largest Insurer
Suncorp - Integrating Process Mining at Australia's Largest InsurerSuncorp - Integrating Process Mining at Australia's Largest Insurer
Suncorp - Integrating Process Mining at Australia's Largest Insurer
Process mining Evangelist
 
...lab.Lab123456789123456789123456789123456789
...lab.Lab123456789123456789123456789123456789...lab.Lab123456789123456789123456789123456789
...lab.Lab123456789123456789123456789123456789
Ghh
 
Impact_of_AI_on_Everyday_Life and info.pptx
Impact_of_AI_on_Everyday_Life and info.pptxImpact_of_AI_on_Everyday_Life and info.pptx
Impact_of_AI_on_Everyday_Life and info.pptx
swatibhusari5
 
Process Mining and Data Science in the Financial Industry
Process Mining and Data Science in the Financial IndustryProcess Mining and Data Science in the Financial Industry
Process Mining and Data Science in the Financial Industry
Process mining Evangelist
 
Volkswagen - Analyzing the World's Biggest Purchasing Process
Volkswagen - Analyzing the World's Biggest Purchasing ProcessVolkswagen - Analyzing the World's Biggest Purchasing Process
Volkswagen - Analyzing the World's Biggest Purchasing Process
Process mining Evangelist
 
indonesia-gen-z-report-2024 Gen Z (born between 1997 and 2012) is currently t...
indonesia-gen-z-report-2024 Gen Z (born between 1997 and 2012) is currently t...indonesia-gen-z-report-2024 Gen Z (born between 1997 and 2012) is currently t...
indonesia-gen-z-report-2024 Gen Z (born between 1997 and 2012) is currently t...
disnakertransjabarda
 
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdfIAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
mcgardenlevi9
 
MASAkkjjkttuyrdquesjhjhjfc44dddtions.docx
MASAkkjjkttuyrdquesjhjhjfc44dddtions.docxMASAkkjjkttuyrdquesjhjhjfc44dddtions.docx
MASAkkjjkttuyrdquesjhjhjfc44dddtions.docx
santosh162
 
新西兰文凭奥克兰理工大学毕业证书AUT成绩单补办
新西兰文凭奥克兰理工大学毕业证书AUT成绩单补办新西兰文凭奥克兰理工大学毕业证书AUT成绩单补办
新西兰文凭奥克兰理工大学毕业证书AUT成绩单补办
Taqyea
 

Tech meetup Data Driven - Codemotion

  • 1. Special Codemotion Tech Meetup: Data Driven Innovation Antimo Musone IT Manager 17 Maggio 2016
  • 2. 2 About Me Presentation title ►>Antimo Musone  IT Manager / Architect presso EY  Co - Founder Fifth Ingenum Srls.  Ing. Informatica II Università degli Studi di Napoli  email: [email protected]
  • 3. 4 What is Machine Learning ?
  • 4. 5 Machine Learning / Predictive Analytics Vision Analytics Recommenda- tion engines Advertising analysis Weather forecasting for business planning Social network analysis Legal discovery and document archiving Pricing analysis Fraud detection Churn analysis Equipment monitoring Location-based tracking and services Personalized Insurance Machine learning & predictive analytics are core capabilities that are needed throughout your business
  • 5. 6 Machine Learning Overview ► Formal definition: “The field of machine learning is concerned with the question of how to construct computer programs that automatically improve with experience” - Tom M. Mitchell ► Another definition: “The goal of machine learning is to program computers to use example data or past experience to solve a given problem.” – Introduction to Machine Learning, 2nd Edition, MIT Press ► ML often involves two primary techniques: ► Supervised Learning: Finding the mapping between inputs and outputs using correct values to “train” a model ► Unsupervised Learning: Finding patterns in the input data (similar to Density Estimates in Statistics)
  • 6. 7 Machine Learning Data: A B C D E F G H I J K L M N O P Q R S T U V W X Y Z Rules, or Algorithms: about, Learning, language – Spelling and sounding builds words Learning about language. – Words build sentences Learning, or Abstraction: Any new understanding proceeds from previous knowledge. Data + Rules/ Algorithms = Machine Learning
  • 7. 8 Traditional programming VS Machine Learning Computer Data Program Output Traditional Programming Data Output Program/Algorithms Machine Learning Program can predict the output! Computer
  • 8. 9 ML : No, more like gardening Gardener = You Seeds = Algorithms Nutrients = Data Plants = Programs
  • 9. 10 ML Sample Application ► Web search ► Computational biology ► Finance ► E-commerce ► Space exploration ► Robotics ► Information extraction ► Social networks ► Debugging ► [Your favorite area]
  • 10. 11 What is Predictive Analytics? Presentation title Wikipedia Definition: (http://en.wikipedia.org/wiki/Predictive_analytics) “Predictive analytics encompasses a variety of techniques from statistics, modeling, machine learning, and data mining that analyze current and historical facts to make predictions about future, or otherwise unknown, events.” Facts Predictions Predictive Analytics Techniques
  • 11. 12 Breaking it Down Presentation title “Predictive analytics encompasses a variety of techniques from statistics, modeling, machine learning, and data mining that analyze current and historical facts to make predictions about future, or otherwise unknown, events.” Machine Learning Use of computer algorithms to derive complex formulations based on objectives and constraints Tools and Techniques Data visualization, segmentation, correlations Use in Predictive Analytics Predictive analytics is often applied in the context of datasets that are too large for manual analysis, so data mining techniques are required Statistics Focus on learning population characteristics based on samples of data Tools and Techniques p-values, confidence intervals, sampling, ANOVA Use in Predictive Analytics Underlying theory behind many parametric models – observed facts are a sample from a population including both known/historic and unknown/future events Modeling Representations of systems used to understand the underlying dynamics of the system Tools and Techniques Symbolic logic, proxies Use in Predictive Analytics Complex relationships can be simplified through modeling – these models can then be used to analyze relationships between factors
  • 12. 13 What is a Model? A model is a simplified representation of observed effects Presentation title Key terms:  Dependent or target variable – the variable of interest  Independent or predictor variable(s) – variable(s) used for explanation/prediction  Effect – the (quantitative) impact of an independent variable or combination of independent variables on the dependent variable  Main Effect – The direct effect of a single independent variable on the dependent variable  Interaction Effect – The effect of a combination of multiple independent variables on the dependent variable
  • 13. 14 Two types of model A model is a simplified representation of observed effects Presentation title Statistical Parametric Models Effects are well-quantified and can be examined An equation can be used to represent the model Emphasis on explanation “What causes the dependent variable to change?” Test hypotheses p-values, confidence intervals Machine Learning Non-parametric models Effects may be unquantified (“black box”) No representative equation Model may be stochastic, so results my vary Emphasis on prediction “What will the value of the next observation be?” Generate hypotheses
  • 14. 15 Types of Learning ► Supervised (inductive) learning ► Training data includes desired outputs ► Dependent variable is known ► May be statistical or non-statistical ► Unsupervised learning ► Training data does not include desired outputs ► No dependent variable ► Non-statistical ► Semi-supervised learning ► Training data includes a few desired outputs
  • 15. 16 Machine Learning Problem Classification or Categorization Clustering Regression Dimensionality reduction Supervised Learning Unsupervised Learning DiscreteContinuous
  • 16. 17 What is Logistic Regression? Presentation title Regression Models are a form of supervised learning that attempt to fit “linear” functions to training data – the most common type of regression, linear regression, should be familiar to most of you as a “best fit line” Logistic Regression is closely related to linear regression, but fits a different shape function by using a binomial link function on the dependent variable
  • 17. 18 Machine Learning Example Predict function F(X) for new examples X Discrete F(X): Classification Continuous F(X): Regression F(X) = Probability(X): Probability estimation Given examples of a function (X, F(X)) The probability of an event X, denoted F(X), represents the proportion of all events that have X as their outcome, and is typically represented as a decimal 0<P(X)<1
  • 18. 19 Machine Learning Example Apply a prediction function to a feature representation of the image to get the desired output: • Training: given a training set of labeled examples {(x1,y1), …, (xN,yN)}, estimate the prediction function f by minimizing the prediction error on the training set • Testing: apply f to a never before seen test example x and output the predicted value y = f(x) output prediction function Image feature y = f(x) F( ) = «apple» F( ) =«tomato» F( ) = «dog»
  • 19. 20 Supervised Learning  Used when you want to predict unknown answers from answers you already have  Data is divided into two parts: the data you will use to “teach” the system (data set), and the data to test the algorithm (test set)  After you select and clean the data, you select data points that show the right relationships in the data. The answers are “labels”, the categories/columns/attributes are “features” and the values are…values.  Then you select an algorithm to compute the outcome. (Often you choose more than one)  You run the program on the data set, and check to see if you got the right answer from the test set.  Once you perform the experiment, you select the best model. This is the final output – the model is then used against more data to get the answers you need
  • 21. 22 Unsupervised Learning  Used when you want to find unknown answers – mostly groupings - directly from data  No simple way to evaluate accuracy of what you learn  Evaluates more vectors, groups into sets or classifications  Start with the data  Apply algorithm  Evaluate groups
  • 22. 23 Unsupervised Learning Example 1 example A Example 2 example B Example 3 example C example A example B example C Example 1 Example 2 Example 3 The clustering strategies have more tendency to transitively group points even if they are not nearby in feature space
  • 23. 24 Cross-Validation and Model Evaluation Presentation title Cross-validation is a method of ensuring that models generalize to data they have not been trained to fit  Given any collection of data points, a model can be developed that fits the data exactly; however, this model will have no predictive power
  • 24. 25 Evaluating Predictive Models Presentation title Model evaluation involves a combination of objective criteria and subjective judgment Objective Measures Gain or Lift Sensitivity Accuracy Others Subjective Considerations Business intuition Explainability Simplicity Usefulness
  • 25. 26 Gain or Lift Presentation title Lift is a measure of the effectiveness of a predictive model calculated as the ratio between the results obtained with and without the predictive model.  Cumulative gains and lift charts are visual aids for measuring model performance  Both charts consist of a lift curve and a baseline  The greater the area between the lift curve and the baseline, the better
  • 26. 27 Sensitivity Presentation title A Receiver Operating Characteristic (ROC) curve is a plot of test sensitivity as a function of (1 - specificity) for several possible (arbitrary) cut off values. The curve illustrates the trade off between type I and type II errors in a given test.  The closer the curve follows the left-hand border and then the top border of the ROC space, the more accurate the test, and the area under the curve is a measure of accuracy.
  • 32. 33 Microsft Azure Learning Machine  Data It’s all about the data. Here’s where you will acquire, compile, and analyze testing and training data sets for use in creating Azure Machine Learning predictive models.  Create the model Use various machine learning algorithms to create new models that are capable of making predictions based on inferences about the data sets.  Evaluate the model Examine the accuracy of new predictive models based on ability to predict the correct outcome, when both the input and output values are known in advance. Accuracy is measured in terms of confidence factor approaching the whole number one.  Refine and evaluate the model Compare, contrast, and combine alternate predictive models to find the right combination(s) that can consistently produce the most accurate results.  Deploy the model Expose the new predictive model as a scalable cloud web service, one that is easily accessible over the Internet by any web browser or mobile client.  Test and use the model Implement the new predictive model web service in a test or production application scenario.
  • 33. 34 Azure Machine Learning algorithms  Classification algorithms These are used to classify data into different categories that can then be used to predict one or more discrete variables, based on the other attributes in the dataset.  Regression algorithms These are used to predict one or more continuous variables, such as profit or loss, based on other attributes in the dataset.  Clustering algorithms These determine natural groupings and patterns in datasets and are used to predict grouping classifications for a given variable.
  • 35. 36 Azure Machine Learning Deploying a prediction model The deployment of a new prediction model takes the form of exposing a web service which can then be invoked via the Representational State Transfer (REST) protocol. ► Azure Machine Learning web services can be called via two different exposed interfaces: ► Single, request/response-style calls. ► “Batch” style calls, where multiple input records are passed into the web service in a single call and the corresponding response contains an output list of predictions for each input record. When a new machine learning prediction model is exposed on the Web, it performs the following operations: ► New input data is passed into the web service in the form of a JavaScript Object Notation (JSON) payload. ► The web service then passes the incoming data as inputs into the Azure Machine Learning prediction model engine. ► The Azure Machine Learning model then generates a new prediction based on the input data and returns the new prediction results to the caller via a JSON payload.
  • 36. 37 Azure ML Getting Started Option 1 Take advantage of a free Azure trial offer at http://azure.microsoft.com/en-us/pricing/free-trial Option 2 Take advantage of the (free) Azure Machine Learning trial offer at https://studio.azureml.net/Home
  • 38. 39 Azure Machine Learning workspaces  Workspace  Experiments  Azure ML Studio  Web Services  Datasets  Modules
  • 39. 40 Create your first Azure Machine Learning experiment ► Defining the problem you want to solve ► e.g. figure out if you like certain movie what’s another movie you should watch? (movie recommendation ) ► Designing a Solution Using AzureML 1. Creating an experiment 2. Load a Data Set 3. Create the Experiment 4. Add Transformations and Filters 5. Create the Experiment Path and apply Algorithms 6. Save and Run the Model 7. Publish the Model 8. Use the Model ► Saving and Running ► Publishing and Accessing

Editor's Notes

  • #5: Machine learning can be described as computing systems that improve with experience. It can also be described as a method of turning data into software. “The goal of machine learning is to program computers to use example data or past experience to solve a given problem.” Introduction to Machine Learning, 2nd Edition, MIT Press Whatever term is used, the results remain the same; data scientists have successfully developed methods of creating software “models” that are trained from huge volumes of data and then used to predict certain patterns, trends, and outcomes.
  • #6: Predictive analytics is can be simply defined as a way to scientifically use the past to predict the future to help drive desired outcomes.
  • #7: Machine learning can be described as computing systems that improve with experience. It can also be described as a method of turning data into software. Whatever term is used, the results remain the same; data scientists have successfully developed methods of creating software “models” that are trained from huge volumes of data and then used to predict certain patterns, trends, and outcomes. Apprendimento Supervisionato Apprendimento Non Supervisionato
  • #8: Tecniche alla base del machine learning sono utilizzate da noi tutti i giorni….ad esempio il linguaggio Che è composto da un Alfabeto ( dati ) e da regole come il suono di una vocale o di una parola o da regole come la costruzione di una frase La comprensione avviene da una conoscenza pregressa. I Dati + Rule = Machine Learning
  • #9: Under traditional programming models, programs and data are processed by the computer to produce a desired output, such as using programs to process data and produce a report When working with machine learning, the processing paradigm is altered dramatically. The data and the desired output are reverse-engineered by the computer to produce a new program The power of this new program is that it can effectively “predict” the output, based on the supplied input data. The primary benefit of this approach is that the resulting “program” that is developed has been trained (via massive quantities of learning data) and finely tuned (via feedback data about the desired output) and is now capable of predicting the likelihood of a desired output based on the provided data.
  • #10: A classic example of predictive analytics can be found everyday on Amazon.com; there, every time you search for an item, you will be presented with an upsell section on the webpage that offers you additional catalog items because “customers who bought this item also bought” those items. This is a great example of using predictive analytics and the psychology of human buying patterns to create a highly effective marketing strategy
  • #11: Many examples of predictive analytics can be found literally everywhere today in our society: Spam/junk email filters These are based on the content, headers, origins, and even user behaviors (for example, always delete emails from this sender). Mortgage applications Typically, your mortgage loan and credit worthiness is determined by advanced predictive analytic algorithm engines. Various forms of pattern recognition These include optical character recognition (OCR) for routing your daily postal mail, speech recognition on your smart phone, and even facial recognition for advanced security systems. Life insurance Examples include calculating mortality rates, life expectancy, premiums, and payouts. Medical insurance Insurers attempt to determine future medical expenses based on historical medical claims and similar patient backgrounds. Liability/property insurance Companies can analyze coverage risks for automobile and home owners based on demographics. Credit card fraud detection This process is based on usage and activity patterns. In the past year, the number of credit card transactions has topped 1 billion. The popularity of contactless payments via near-field communications (NFC) has also increased dramatically over the past year due to smart phone integration. Airline flights Airlines calculate fees, schedules, and revenues based on prior air travel patterns and flight data. Web search page results Predictive analytics help determine which ads, recommendations, and display sequences to render on the page. Predictive maintenance This is used with almost everything we can monitor: planes, trains, elevators, cars, and yes, even data centers. Health care Predictive analytics are in widespread use to help determine patient outcomes and future care based on historical data and pattern matching across similar patient data sets.
  • #14: Regression analysis is a statistical process for estimating the relationships among variables. It includes many techniques for modeling and analyzing several variables, when the focus is on the relationship between adependent variable and one or more independent variables (or 'predictors')
  • #16: Supervised learning is a type of machine learning algorithm that uses known datasets to create a model that can then make predictions. The known data sets are called and include input data elements along with known response values In the case of unsupervised machine learning, the task of making predictions becomes much harder. In this scenario, the machine learning algorithms are not provided with any kind of known data inputs or known outputs to generate a new predictive model. In the case of unsupervised machine learning, the success of the new predictive model depends entirely on the ability to infer and identify patterns, structures, and relationships in the incoming data set.
  • #17: Classification algorithms These are used to classify data into different categories that can then be used to predict one or more discrete variables, based on the other attributes in the dataset. Regression algorithms These are used to predict one or more continuous variables, such as profit or loss, based on other attributes in the dataset. Clustering algorithms These determine natural groupings and patterns in datasets and are used to predict grouping classifications for a given variable. One of the most common unsupervised learning algorithms is known as which is used to find hidden patterns or groupings within data sets. Some common examples of cluster analysis classifications would include the following: Socioeconomic tiers Income, education, profession, age, number of children, size of city or residence, and so on. Psychographic data Personal interests, lifestyle, motivation, values, involvement. Social network graphs Groups of people related to you by family, friends, work, schools, professional associations, and so on. Purchasing patterns Price range, type of media used, intensity of use, choice of retail outlet, fidelity, buyer or nonbuyer, buying intensity. The other type of approach to unsupervised machine learning is to use a reward system, rather than any kind of teaching aids, as are commonly used in supervised learning. Positive and negative rewards are used to provide feedback to the predictive model when it has been successful.
  • #18: In statistical modeling, regression analysis is a statistical process for estimating the relationships among variables. It includes many techniques for modeling and analyzing several variables, when the focus is on the relationship between adependent variable and one or more independent variables (or 'predictors')
  • #21: Features or vectors Known data that is used as an input element for making a prediction. Labels or supervisory signal Represents the known outcomes for the corresponding features for the input record. Not used (default) Not used by predictive algorithms for inferring a new predictive model
  • #28: In statistics, a receiver operating characteristic (ROC), or ROC curve, is a graphical plot that illustrates the performance of abinary classifier system as its discrimination threshold is varied. The curve is created by plotting the true positive rate (TPR) against the false positive rate (FPR) at various threshold settings. 
  • #29: Watson is the first cognitive system in the world Offer a lot of service like face, voice , image recognition , question & answer But in this moment don’t offer any types or configurations of tool to costumize these services.
  • #30: Natural interaction with humans For example speech and emotions Not only used by web applications, but also in physical world Processing of semi-structured data and big amounts of data For example language classifications and image recognitions Trained and continuously improved via machine and deep learning For example search (retrieve and rank) Leverages context to improve service qualities
  • #31: A fully managed Big Data and Advanced Analytics Suite that enables you to transform your data into intelligent action.
  • #32: From data to decisions and actions
  • #33: http://visual-recognition-demo.mybluemix.net/ http://vision.alchemy.ai/ https://www.microsoft.com/cognitive-services/en-us/computer-vision-api https://tone-analyzer-demo.mybluemix.net/ https://text-analytics-demo.azurewebsites.net/
  • #34: The basic process of creating Azure Machine Learning solutions is composed of a repeatable pattern of workflow steps that are designed to help you create a new predictive analytics solution in no time. The basic steps in the process are summarized in Figure Data It’s all about the data. Here’s where you will acquire, compile, and analyze testing and training data sets for use in creating Azure Machine Learning predictive models. Create the model Use various machine learning algorithms to create new models that are capable of making predictions based on inferences about the data sets. Evaluate the model Examine the accuracy of new predictive models based on ability to predict the correct outcome, when both the input and output values are known in advance. Accuracy is measured in terms of confidence factor approaching the whole number one. Refine and evaluate the model Compare, contrast, and combine alternate predictive models to find the right combination(s) that can consistently produce the most accurate results. Deploy the model Expose the new predictive model as a scalable cloud web service, one that is easily accessible over the Internet by any web browser or mobile client. Test and use the model Implement the new predictive model web service in a test or production application scenario. Add manual or automatic feedback loops for continuous
  • #36: The formula for producing a supervised learning model is expressed in Figure illustrates the general flow of creating new prediction models based on the use of supervised learning along with known input data elements and known outcomes to create an entirely new prediction model. A supervised learning algorithm analyzes the known inputs and known outcomes from training data Predictive models can generally achieve better accuracy results when provided with new (and more recent) datasets. The prediction evaluation process can be expressed as shown in Figure The evaluation process for new prediction models that use supervised learning primarily consists of determining the accuracy of the new generated model. Once a new predictive model has been generated from good training datasets and carefully evaluated for accuracy, then it can be deployed for use in testing or production usage scenarios. The new production prediction process can be expressed as shown in Figur
  • #40: Azure Machine Learning workspaces Represent a discrete “slice” of the Azure Machine Learning tool set that can be partitioned by Workspace name and Workspace owner Azure Machine Learning experiments Experiments are created within Azure Machine Learning workspaces and represent the primary method of enabling an iterative approach to rapidly developing Azure Machine Learning solutions. Azure ML Studio This is the primary interactive predictive analytics workbench that is accessed from within an Azure Machine Learning workspace to allow a data scientist to create Azure Machine Learning experiments via a drag-and-drop visual designer interface. Access to a unique Azure ML Studio environment is governed from within an Azure Machine Learning workspace. Azure Machine Learning web services These represent Azure Machine Learning experiments that have been exposed as public APIs over the Internet in the form of the Azure Machine Learning REST API. These services are generally exposed as a simple web service, or as an OData endpoint. Datasets This is data that has been uploaded to Azure ML Studio so that it can be used in the prediction modeling process. Modules These are algorithms that you can apply to your data. Azure ML Studio has a number of modules ranging from data ingress functions to training, scoring, and validation processes. Here are some examples of included modules: Convert to ARFF Converts a .NET serialized dataset to ARFF format. ARFF is a common machine learning construct and stands for Attribute-Relation File Format. It is commonly defined as an ASCII text file that describes a list of instances sharing a set of attributes. Elementary Statistics Calculates elementary statistics such as mean, standard deviation, and so on. Linear Regression Creates an online gradient, descent-based, linear regression model. Score Model Scores a trained classification or regression model