Regression Analysis & Prediction
Devon Jones
Lead Systems Engineer, Knewton

Gabe Hamilton
Software Engineering Mgr, Revionics
Tech Confluence
For those who work downtown, check out our DOSUG-inspired group.
meetup.com/TechConfluence
3rd Wednesday of the month
At lunch: 12:30 - 1:30pm
The Plan
1. Regression Analysis - Devon
2. Google Prediction API - Gabe
3. Applying Regression - Devon
Google Prediction API

[
  { "label": "awesome", "score": 0.98 },
  { "label": "lame", "score": 0.08 }
]

Gabe Hamilton
What kind of Prediction?
Predict an output value based on some input values.
Things like: Sentiment Analysis, Spam Detection, Today's temperature, GDP Growth
DOSUG Intro to google prediction api
How does Google predict things?
Well, it's Google
Through an intensive breeding program, Google has managed to distribute Punxsutawney Phils
throughout its datacenters across the world. Each Phil is kept in a climate-controlled enclosure that
mimics the conditions of a perfectly average February 2nd. A full-scale digital sundial maps your
problem domain onto the shadow matrix of the enclosure, allowing each Phil to fully interact with
your model. The early spring / long winter emergence probability of each Phil is then sorted and
reduced to determine the final result returned by the Prediction API.
No Really, How do they do it?
Short answer: I have no idea
Long answer: It's a service, so they can do whatever works, swap implementations, run multiple algorithms
Possible Implementations
Regression Analysis
Neural Networks
Support Vector Machine
Monte Carlo Sim
Decision Trees
Evolutionary Algorithms

Basically it is STATISTICS
Types of Prediction you can do
Regression
How do inputs cause an output to vary?
Output is a numeric value: Shopping Cart Size, Stock Price

Classification
Deciding which bucket some input belongs in
Buckets are text values: French, Spanish, English
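
To make the two prediction types concrete, here is a minimal sketch in plain Python. It is not how the Prediction API works internally (the slides are clear that nobody outside Google knows that). The function names, the cart-size numbers, and the word-overlap scoring are all assumptions chosen for illustration.

```python
from collections import Counter

def fit_line(xs, ys):
    """Ordinary least squares for a single input: returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def classify(text, examples):
    """Pick the bucket whose training sentence shares the most words with `text`."""
    words = set(text.lower().split())
    scores = Counter()
    for label, sample in examples:
        scores[label] += len(words & set(sample.lower().split()))
    return scores.most_common(1)[0][0]

# Regression: the output is a number (hypothetical items viewed vs. cart size).
slope, intercept = fit_line([1, 2, 3, 4], [3, 5, 7, 9])
predicted_cart = slope * 5 + intercept

# Classification: the output is a text bucket (hypothetical training sentences).
examples = [
    ("French", "je pense donc je suis"),
    ("Spanish", "pienso luego existo"),
    ("English", "i think therefore i am"),
]
language = classify("je pense que oui", examples)
```

The regression call answers "how much?" with a number, while the classifier answers "which bucket?" with one of the labels seen in training.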
What is Classification good for?
Classification
● Sentiment analysis
● Spam detection
● Language categorization
● Tagging
● Assign priority to bugs
● Predict movie ratings
● Message routing decisions
● <Your brilliant idea here>
Getting Started
Hello World page is great
https://developers.google.com/prediction/docs/hello_world
So you have a big pile of data
Time for some cleanup
90% of the development time is data cleanup

Good talk on data driven projects
http://www.slideshare.net/ryanweald/building-data-drivenproducts-with-ruby-rubyconf-2012
CSV Input file aka Training Set
First column is the expected value (the output).
Columns 2 through N are input values.
"French", "Je pense donc j'essuie", "Paris"
(Output: "French"; an input: "Je pense donc j'essuie"; more input: "Paris")
No header row
250MB max file size
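
A training file in this shape can be produced with Python's standard csv module. The sketch below is only an illustration: the helper name, the second data row, and the reading of "250MB" as 250 * 1024 * 1024 bytes are assumptions, not something stated on the slide.

```python
import csv
import io

# Assumption: the 250MB limit is interpreted here as 250 * 1024 * 1024 bytes.
MAX_BYTES = 250 * 1024 * 1024

def write_training_csv(rows, out):
    """Write (label, input, input, ...) rows with the label first and no header row."""
    writer = csv.writer(out, quoting=csv.QUOTE_NONNUMERIC)
    for label, *inputs in rows:
        if not inputs:
            raise ValueError("each row needs at least one input column")
        writer.writerow([label, *inputs])

rows = [
    ("French", "Je pense donc j'essuie", "Paris"),
    ("English", "I think therefore I am", "London"),  # hypothetical extra row
]

buffer = io.StringIO()
write_training_csv(rows, buffer)
training_csv = buffer.getvalue()

# Check the size before uploading the file anywhere.
fits_limit = len(training_csv.encode("utf-8")) <= MAX_BYTES
```

Writing to an in-memory buffer keeps the example self-contained; passing an open file object instead of `buffer` would write the same rows to disk.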
4 Steps to Prediction
1. Create a CSV file of your training data
2. Create a new Project in the Prediction API
   a. requires entering billing info
3. Upload your CSV file to Google Storage
4. In Prediction API Browser:
   a. insert a new training set (the CSV file)
   b. view your trained set
   c. use trainedmodel.predict to make predictions
See the hello world for details of the method calls
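
Step 4 can be sketched as plain request bodies, without making any network calls. The field names used here (`id`, `storageDataLocation`, `input.csvInstance`) are recalled from the v1.6 API and should be treated as assumptions to check against the hello world page. The model id, the bucket path, and the helper names are hypothetical.

```python
import json

def training_request(model_id, bucket_path):
    """Body for inserting a new training set (assumed field names)."""
    return {"id": model_id, "storageDataLocation": bucket_path}

def predict_request(*inputs):
    """Body for a predict call: input columns only, no label (assumed field names)."""
    return {"input": {"csvInstance": list(inputs)}}

def top_label(scored):
    """Return the label with the highest score from a list of label/score pairs."""
    return max(scored, key=lambda item: float(item["score"]))["label"]

# Hypothetical model id and Google Storage location.
insert_body = json.dumps(training_request("language-model", "my-bucket/language.csv"))
predict_body = json.dumps(predict_request("Je pense donc j'essuie", "Paris"))

# Label/score pairs in the shape shown on the title slide.
best = top_label([
    {"label": "awesome", "score": 0.98},
    {"label": "lame", "score": 0.08},
])
```

A prediction request carries only the input columns; the first (label) column of the training file is what the service is asked to fill in.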
Let's make some predictions...
Live demo screenshots: List Models
Live demo screens: Analyze Model
Live demo: Predict Model Category
Live demo: Predict Model Numeric
Storage for datasets
https://storage.cloud.google.com

API Explorer
https://developers.google.com/apis-explorer/#s/prediction/v1.6/