SlideShare a Scribd company logo
Competitive Advantage. Elegantly Engineered.
A Different
Data Science Methodology
We use data, analytics, and design to help clients
perform at their best.
Machine Intelligence catalyzes innovation, engineers machine learning
applications, and builds enduring capabilities.
We’re creative, rigorous, and efficient. We bring the sophistication of a
large strategy firm with the speed and value of a focused boutique.
We apply proven techniques, designs, and world-class expertise to:
• Improve how companies engage customers
• Optimize machine performance
• Enhance process results
Models reproduce how questions are answered
in training data.
Business, not IT, should design training data.
Most project time is used understanding how
data is generated and building training data sets.
Machine Learning is Simple
Real
world
Training
Data Results
Generally a subset of
scenarios in the real
world.
Data trains models that
reproduce decisions in
the training data with
80-95% accuracy.
The full set of all
consumers, machines, or
business results that a
model will forecast.
A Different Data Science Methodology
Many data science projects jump into
algorithms and technology.
We reverse the usual approach by first
rigorously defining the business question
and understanding data.
The methodology:
• Aligns the whole business
• Sets practical expectations
• Leads change
• Builds sustaining capabilities
Data
Technology
Business question
Business
goals
Time
and
focus
Data
Technology
Steps
Foundation
• Align change across the business
• Understand data
• Define the business question
Results
• Sustain capabilities
• Communicate value
• Build application
Model
• Iterate production model
• Pilot models
• Build training data
1.
2.
3.
Project Phasing
• Most time is spent understanding data and building training data.
• An early pilot is key to refining to training data and building support for change.
• Developing the full application starts early with a UX for the pilot model.
1. Set Foundation
A. Define the business question
B. Align change
C. Understand data
• Learn and set expectations on the data science process and cloud hosting.
• Define precise business questions.
• Model how answering the business question delivers results.
• Link business and regulatory needs to training data design and algorithm selection, e.g. does a
model require easy explainability?
• Build a coalition of sponsors and communicate the vision.
• Define roles for compliance, customer service, finance, marketing, product, and sales.
• Understand the data generating process: genchi genbutsu.
• Visualize the “shape of the data”: distributions, sensitivity, clusters, anomalies, and
sparseness. Identify quality issues.
• Capture rules and map data flows from source systems.
2. Build Models
A. Build training data
B. Pilot models
C. Iterate production models
• Form business and IT team: roles, super-labelers, biases.
• Design the data set’s scenarios and set quality criteria.
• Visualize attributes and confirm with business sponsors.
• Define rules to pre-process data and select open source algorithms.
• Visualize and communicate results. Show an early win. Ideally, prototype the UX.
• Plan enhancements to training data, algos, and applications.
• Refine data (feature shaping and dimensionality reduction).
• Customize rules and algorithms.
• Connect into the broader application starting with the data model.
3. Deliver Results
A. Build application
B. Communicate value
C. Sustain capabilities
• Visualize UX, define data model and APIs.
• Set non-functional requirements such as scalability, latency, and security.
• Define test plan.
• Communicate how the solution makes jobs better and brings value to customers
• Build understanding and support with key influencers
• Use multiple channels (meetings, email, calls) repeatedly to ensure reaching people
• Optimize costs and scalability. Plan for decreased costs.
• Confirm team skills and capacity to evolve the models.
• Set plan for and automate re-training models. Set expectations that models may expand the
range of scenarios covered and/or may improve precision.
Contact
Machine Intelligence Partners LLC serves clients
globally. Our people are centered in Boston,
Bozeman, Grand Rapids, London, New York, San
Francisco, and Washington, D.C.
Client relationship leaders:
New York
Jeremy Lehman
917.225.2011
jeremy.lehman@machineintel.com
Washington, D.C.
Philippe Berckmans
804.405.6009
philippe.berckmans@machineintel.com
Machine Intelligence is an Amazon Technology Partner
and member of the Microsoft Partner Network.
We are a veteran-owned small business.

More Related Content

What's hot (15)

Managing uncertainty in ai performance target setting
Managing uncertainty in ai performance target settingManaging uncertainty in ai performance target setting
Managing uncertainty in ai performance target setting
Noelle Ibrahim
 
Learn How to Make Machine Learning Work
Learn How to Make Machine Learning WorkLearn How to Make Machine Learning Work
Learn How to Make Machine Learning Work
iTrainMalaysia1
 
Establish the right practices for Effective AI
Establish the right practices for Effective AIEstablish the right practices for Effective AI
Establish the right practices for Effective AI
Wee Hyong Tok
 
Indhu resume
Indhu resumeIndhu resume
Indhu resume
Indhumathi R
 
MonetizingStatistics
MonetizingStatisticsMonetizingStatistics
MonetizingStatistics
Aaron Sankey
 
Integrating A.I. and Machine Learning with your Demand Forecast
Integrating A.I. and Machine Learning with your Demand ForecastIntegrating A.I. and Machine Learning with your Demand Forecast
Integrating A.I. and Machine Learning with your Demand Forecast
Steve Sager
 
Resume
ResumeResume
Resume
Brennen Andrews
 
Sudheera_Profile
Sudheera_ProfileSudheera_Profile
Sudheera_Profile
Sudheera Achar
 
Top 5 high demand jobs in data science
Top 5 high demand jobs in data scienceTop 5 high demand jobs in data science
Top 5 high demand jobs in data science
Dr Han Lau
 
Business intelligence prof nikhat fatma mumtaz husain shaikh
Business intelligence  prof nikhat fatma mumtaz husain shaikhBusiness intelligence  prof nikhat fatma mumtaz husain shaikh
Business intelligence prof nikhat fatma mumtaz husain shaikh
Nikhat Fatma Mumtaz Husain Shaikh
 
ceresume
ceresumeceresume
ceresume
Chase England
 
Experiment idea poster-p2
Experiment idea poster-p2Experiment idea poster-p2
Experiment idea poster-p2
Cristiane Namiuti
 
This is AI doing – applying artificial intelligence to business problems by H...
This is AI doing – applying artificial intelligence to business problems by H...This is AI doing – applying artificial intelligence to business problems by H...
This is AI doing – applying artificial intelligence to business problems by H...
Mindtrek
 
resume
resumeresume
resume
Travis Wilburn
 
New patterns of innovation
New patterns of innovationNew patterns of innovation
New patterns of innovation
Vashishtha Vidyarthi
 
Managing uncertainty in ai performance target setting
Managing uncertainty in ai performance target settingManaging uncertainty in ai performance target setting
Managing uncertainty in ai performance target setting
Noelle Ibrahim
 
Learn How to Make Machine Learning Work
Learn How to Make Machine Learning WorkLearn How to Make Machine Learning Work
Learn How to Make Machine Learning Work
iTrainMalaysia1
 
Establish the right practices for Effective AI
Establish the right practices for Effective AIEstablish the right practices for Effective AI
Establish the right practices for Effective AI
Wee Hyong Tok
 
MonetizingStatistics
MonetizingStatisticsMonetizingStatistics
MonetizingStatistics
Aaron Sankey
 
Integrating A.I. and Machine Learning with your Demand Forecast
Integrating A.I. and Machine Learning with your Demand ForecastIntegrating A.I. and Machine Learning with your Demand Forecast
Integrating A.I. and Machine Learning with your Demand Forecast
Steve Sager
 
Top 5 high demand jobs in data science
Top 5 high demand jobs in data scienceTop 5 high demand jobs in data science
Top 5 high demand jobs in data science
Dr Han Lau
 
Business intelligence prof nikhat fatma mumtaz husain shaikh
Business intelligence  prof nikhat fatma mumtaz husain shaikhBusiness intelligence  prof nikhat fatma mumtaz husain shaikh
Business intelligence prof nikhat fatma mumtaz husain shaikh
Nikhat Fatma Mumtaz Husain Shaikh
 
This is AI doing – applying artificial intelligence to business problems by H...
This is AI doing – applying artificial intelligence to business problems by H...This is AI doing – applying artificial intelligence to business problems by H...
This is AI doing – applying artificial intelligence to business problems by H...
Mindtrek
 

Similar to Machine intelligence data science methodology 060420 (20)

DS Life Cycle
DS Life CycleDS Life Cycle
DS Life Cycle
Knoldus Inc.
 
DS Life Cycle
DS Life CycleDS Life Cycle
DS Life Cycle
Knoldus Inc.
 
AI Class Topic 3: Building Machine Learning Predictive Systems (Predictive Ma...
AI Class Topic 3: Building Machine Learning Predictive Systems (Predictive Ma...AI Class Topic 3: Building Machine Learning Predictive Systems (Predictive Ma...
AI Class Topic 3: Building Machine Learning Predictive Systems (Predictive Ma...
Value Amplify Consulting
 
Presentation1the Security Risk Management in.pptx.
Presentation1the Security Risk Management in.pptx.Presentation1the Security Risk Management in.pptx.
Presentation1the Security Risk Management in.pptx.
MahmoudElmahdy32
 
WHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjn
WHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjnWHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjn
WHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjn
RohitKumar639388
 
Smart Data Module 4 d drive_business models
Smart Data Module 4 d drive_business modelsSmart Data Module 4 d drive_business models
Smart Data Module 4 d drive_business models
caniceconsulting
 
Doing Analytics Right - Designing and Automating Analytics
Doing Analytics Right - Designing and Automating AnalyticsDoing Analytics Right - Designing and Automating Analytics
Doing Analytics Right - Designing and Automating Analytics
Tasktop
 
Starter Kit for Collaboration from Karuana @ Microsoft IT
Starter Kit for Collaboration from Karuana @ Microsoft ITStarter Kit for Collaboration from Karuana @ Microsoft IT
Starter Kit for Collaboration from Karuana @ Microsoft IT
Karuana Gatimu
 
How to Build an AI/ML Product and Sell it by SalesChoice CPO
How to Build an AI/ML Product and Sell it by SalesChoice CPOHow to Build an AI/ML Product and Sell it by SalesChoice CPO
How to Build an AI/ML Product and Sell it by SalesChoice CPO
Product School
 
[DSC Europe 23] Josip Saban - Leading AI teams.pptx
[DSC Europe 23] Josip Saban - Leading AI teams.pptx[DSC Europe 23] Josip Saban - Leading AI teams.pptx
[DSC Europe 23] Josip Saban - Leading AI teams.pptx
DataScienceConferenc1
 
Get your data analytics strategy right!
Get your data analytics strategy right!Get your data analytics strategy right!
Get your data analytics strategy right!
SPAN Infotech (India) Pvt Ltd
 
Machine Learning in Customer Analytics
Machine Learning in Customer AnalyticsMachine Learning in Customer Analytics
Machine Learning in Customer Analytics
Course5i
 
Business Analytics Training Catalog - QueBIT Trusted Experts in Business Anal...
Business Analytics Training Catalog - QueBIT Trusted Experts in Business Anal...Business Analytics Training Catalog - QueBIT Trusted Experts in Business Anal...
Business Analytics Training Catalog - QueBIT Trusted Experts in Business Anal...
QueBIT Consulting
 
A Guide to Machine Learning Developer in 2024.pdf
A Guide to Machine Learning Developer in 2024.pdfA Guide to Machine Learning Developer in 2024.pdf
A Guide to Machine Learning Developer in 2024.pdf
JPLoft Solutions
 
how to successfully implement a data analytics solution.pdf
how to successfully implement a data analytics solution.pdfhow to successfully implement a data analytics solution.pdf
how to successfully implement a data analytics solution.pdf
basilmph
 
Simplify Your Analytics Strategy
Simplify Your Analytics StrategySimplify Your Analytics Strategy
Simplify Your Analytics Strategy
Shreya Singireddy
 
Embedded Analytics
Embedded AnalyticsEmbedded Analytics
Embedded Analytics
MITSDEDistance
 
Machine Learning: The First Salvo of the AI Business Revolution
Machine Learning: The First Salvo of the AI Business RevolutionMachine Learning: The First Salvo of the AI Business Revolution
Machine Learning: The First Salvo of the AI Business Revolution
Cognizant
 
Simplify your analytics strategy
Simplify your analytics strategySimplify your analytics strategy
Simplify your analytics strategy
Shaun Kollannur
 
Data Science Introduction by Emerging India Analytics
Data Science Introduction by Emerging India AnalyticsData Science Introduction by Emerging India Analytics
Data Science Introduction by Emerging India Analytics
AyeshaSharma29
 
AI Class Topic 3: Building Machine Learning Predictive Systems (Predictive Ma...
AI Class Topic 3: Building Machine Learning Predictive Systems (Predictive Ma...AI Class Topic 3: Building Machine Learning Predictive Systems (Predictive Ma...
AI Class Topic 3: Building Machine Learning Predictive Systems (Predictive Ma...
Value Amplify Consulting
 
Presentation1the Security Risk Management in.pptx.
Presentation1the Security Risk Management in.pptx.Presentation1the Security Risk Management in.pptx.
Presentation1the Security Risk Management in.pptx.
MahmoudElmahdy32
 
WHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjn
WHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjnWHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjn
WHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjn
RohitKumar639388
 
Smart Data Module 4 d drive_business models
Smart Data Module 4 d drive_business modelsSmart Data Module 4 d drive_business models
Smart Data Module 4 d drive_business models
caniceconsulting
 
Doing Analytics Right - Designing and Automating Analytics
Doing Analytics Right - Designing and Automating AnalyticsDoing Analytics Right - Designing and Automating Analytics
Doing Analytics Right - Designing and Automating Analytics
Tasktop
 
Starter Kit for Collaboration from Karuana @ Microsoft IT
Starter Kit for Collaboration from Karuana @ Microsoft ITStarter Kit for Collaboration from Karuana @ Microsoft IT
Starter Kit for Collaboration from Karuana @ Microsoft IT
Karuana Gatimu
 
How to Build an AI/ML Product and Sell it by SalesChoice CPO
How to Build an AI/ML Product and Sell it by SalesChoice CPOHow to Build an AI/ML Product and Sell it by SalesChoice CPO
How to Build an AI/ML Product and Sell it by SalesChoice CPO
Product School
 
[DSC Europe 23] Josip Saban - Leading AI teams.pptx
[DSC Europe 23] Josip Saban - Leading AI teams.pptx[DSC Europe 23] Josip Saban - Leading AI teams.pptx
[DSC Europe 23] Josip Saban - Leading AI teams.pptx
DataScienceConferenc1
 
Machine Learning in Customer Analytics
Machine Learning in Customer AnalyticsMachine Learning in Customer Analytics
Machine Learning in Customer Analytics
Course5i
 
Business Analytics Training Catalog - QueBIT Trusted Experts in Business Anal...
Business Analytics Training Catalog - QueBIT Trusted Experts in Business Anal...Business Analytics Training Catalog - QueBIT Trusted Experts in Business Anal...
Business Analytics Training Catalog - QueBIT Trusted Experts in Business Anal...
QueBIT Consulting
 
A Guide to Machine Learning Developer in 2024.pdf
A Guide to Machine Learning Developer in 2024.pdfA Guide to Machine Learning Developer in 2024.pdf
A Guide to Machine Learning Developer in 2024.pdf
JPLoft Solutions
 
how to successfully implement a data analytics solution.pdf
how to successfully implement a data analytics solution.pdfhow to successfully implement a data analytics solution.pdf
how to successfully implement a data analytics solution.pdf
basilmph
 
Simplify Your Analytics Strategy
Simplify Your Analytics StrategySimplify Your Analytics Strategy
Simplify Your Analytics Strategy
Shreya Singireddy
 
Machine Learning: The First Salvo of the AI Business Revolution
Machine Learning: The First Salvo of the AI Business RevolutionMachine Learning: The First Salvo of the AI Business Revolution
Machine Learning: The First Salvo of the AI Business Revolution
Cognizant
 
Simplify your analytics strategy
Simplify your analytics strategySimplify your analytics strategy
Simplify your analytics strategy
Shaun Kollannur
 
Data Science Introduction by Emerging India Analytics
Data Science Introduction by Emerging India AnalyticsData Science Introduction by Emerging India Analytics
Data Science Introduction by Emerging India Analytics
AyeshaSharma29
 

Recently uploaded (20)

Data Science Courses in India iim skills
Data Science Courses in India iim skillsData Science Courses in India iim skills
Data Science Courses in India iim skills
dharnathakur29
 
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdfIAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
mcgardenlevi9
 
03 Daniel 2-notes.ppt seminario escatologia
03 Daniel 2-notes.ppt seminario escatologia03 Daniel 2-notes.ppt seminario escatologia
03 Daniel 2-notes.ppt seminario escatologia
Alexander Romero Arosquipa
 
Simple_AI_Explanation_English somplr.pptx
Simple_AI_Explanation_English somplr.pptxSimple_AI_Explanation_English somplr.pptx
Simple_AI_Explanation_English somplr.pptx
ssuser2aa19f
 
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Abodahab
 
Principles of information security Chapter 5.ppt
Principles of information security Chapter 5.pptPrinciples of information security Chapter 5.ppt
Principles of information security Chapter 5.ppt
EstherBaguma
 
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
James Francis Paradigm Asset Management
 
Thingyan is now a global treasure! See how people around the world are search...
Thingyan is now a global treasure! See how people around the world are search...Thingyan is now a global treasure! See how people around the world are search...
Thingyan is now a global treasure! See how people around the world are search...
Pixellion
 
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbbEDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
JessaMaeEvangelista2
 
AI Competitor Analysis: How to Monitor and Outperform Your Competitors
AI Competitor Analysis: How to Monitor and Outperform Your CompetitorsAI Competitor Analysis: How to Monitor and Outperform Your Competitors
AI Competitor Analysis: How to Monitor and Outperform Your Competitors
Contify
 
Molecular methods diagnostic and monitoring of infection - Repaired.pptx
Molecular methods diagnostic and monitoring of infection  -  Repaired.pptxMolecular methods diagnostic and monitoring of infection  -  Repaired.pptx
Molecular methods diagnostic and monitoring of infection - Repaired.pptx
7tzn7x5kky
 
183409-christina-rossetti.pdfdsfsdasggsag
183409-christina-rossetti.pdfdsfsdasggsag183409-christina-rossetti.pdfdsfsdasggsag
183409-christina-rossetti.pdfdsfsdasggsag
fardin123rahman07
 
chapter 4 Variability statistical research .pptx
chapter 4 Variability statistical research .pptxchapter 4 Variability statistical research .pptx
chapter 4 Variability statistical research .pptx
justinebandajbn
 
Digilocker under workingProcess Flow.pptx
Digilocker  under workingProcess Flow.pptxDigilocker  under workingProcess Flow.pptx
Digilocker under workingProcess Flow.pptx
satnamsadguru491
 
Cleaned_Lecture 6666666_Simulation_I.pdf
Cleaned_Lecture 6666666_Simulation_I.pdfCleaned_Lecture 6666666_Simulation_I.pdf
Cleaned_Lecture 6666666_Simulation_I.pdf
alcinialbob1234
 
Geometry maths presentation for begginers
Geometry maths presentation for begginersGeometry maths presentation for begginers
Geometry maths presentation for begginers
zrjacob283
 
CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...
CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...
CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...
ThanushsaranS
 
Flip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptxFlip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptx
mubashirkhan45461
 
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.pptJust-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
ssuser5f8f49
 
Minions Want to eat presentacion muy linda
Minions Want to eat presentacion muy lindaMinions Want to eat presentacion muy linda
Minions Want to eat presentacion muy linda
CarlaAndradesSoler1
 
Data Science Courses in India iim skills
Data Science Courses in India iim skillsData Science Courses in India iim skills
Data Science Courses in India iim skills
dharnathakur29
 
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdfIAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
mcgardenlevi9
 
Simple_AI_Explanation_English somplr.pptx
Simple_AI_Explanation_English somplr.pptxSimple_AI_Explanation_English somplr.pptx
Simple_AI_Explanation_English somplr.pptx
ssuser2aa19f
 
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Abodahab
 
Principles of information security Chapter 5.ppt
Principles of information security Chapter 5.pptPrinciples of information security Chapter 5.ppt
Principles of information security Chapter 5.ppt
EstherBaguma
 
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
James Francis Paradigm Asset Management
 
Thingyan is now a global treasure! See how people around the world are search...
Thingyan is now a global treasure! See how people around the world are search...Thingyan is now a global treasure! See how people around the world are search...
Thingyan is now a global treasure! See how people around the world are search...
Pixellion
 
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbbEDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
JessaMaeEvangelista2
 
AI Competitor Analysis: How to Monitor and Outperform Your Competitors
AI Competitor Analysis: How to Monitor and Outperform Your CompetitorsAI Competitor Analysis: How to Monitor and Outperform Your Competitors
AI Competitor Analysis: How to Monitor and Outperform Your Competitors
Contify
 
Molecular methods diagnostic and monitoring of infection - Repaired.pptx
Molecular methods diagnostic and monitoring of infection  -  Repaired.pptxMolecular methods diagnostic and monitoring of infection  -  Repaired.pptx
Molecular methods diagnostic and monitoring of infection - Repaired.pptx
7tzn7x5kky
 
183409-christina-rossetti.pdfdsfsdasggsag
183409-christina-rossetti.pdfdsfsdasggsag183409-christina-rossetti.pdfdsfsdasggsag
183409-christina-rossetti.pdfdsfsdasggsag
fardin123rahman07
 
chapter 4 Variability statistical research .pptx
chapter 4 Variability statistical research .pptxchapter 4 Variability statistical research .pptx
chapter 4 Variability statistical research .pptx
justinebandajbn
 
Digilocker under workingProcess Flow.pptx
Digilocker  under workingProcess Flow.pptxDigilocker  under workingProcess Flow.pptx
Digilocker under workingProcess Flow.pptx
satnamsadguru491
 
Cleaned_Lecture 6666666_Simulation_I.pdf
Cleaned_Lecture 6666666_Simulation_I.pdfCleaned_Lecture 6666666_Simulation_I.pdf
Cleaned_Lecture 6666666_Simulation_I.pdf
alcinialbob1234
 
Geometry maths presentation for begginers
Geometry maths presentation for begginersGeometry maths presentation for begginers
Geometry maths presentation for begginers
zrjacob283
 
CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...
CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...
CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...
ThanushsaranS
 
Flip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptxFlip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptx
mubashirkhan45461
 
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.pptJust-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
ssuser5f8f49
 
Minions Want to eat presentacion muy linda
Minions Want to eat presentacion muy lindaMinions Want to eat presentacion muy linda
Minions Want to eat presentacion muy linda
CarlaAndradesSoler1
 

Machine intelligence data science methodology 060420

  • 1. Competitive Advantage. Elegantly Engineered. A Different Data Science Methodology
  • 2. We use data, analytics, and design to help clients perform at their best. Machine Intelligence catalyzes innovation, engineers machine learning applications, and builds enduring capabilities. We’re creative, rigorous, and efficient. We bring the sophistication of a large strategy firm with the speed and value of a focused boutique. We apply proven techniques, designs, and world-class expertise to: • Improve how companies engage customers • Optimize machine performance • Enhance process results
  • 3. Models reproduce how questions are answered in training data. Business, not IT, should design training data. Most project time is used understanding how data is generated and building training data sets. Machine Learning is Simple Real world Training Data Results Generally a subset of scenarios in the real world. Data trains models that reproduce decisions in the training data with 80-95% accuracy. The full set of all consumers, machines, or business results that a model will forecast.
  • 4. A Different Data Science Methodology Many data science projects jump into algorithms and technology. We reverse the usual approach by first rigorously defining the business question and understanding data. The methodology: • Aligns the whole business • Sets practical expectations • Leads change • Builds sustaining capabilities Data Technology Business question Business goals Time and focus Data Technology
  • 5. Steps Foundation • Align change across the business • Understand data • Define the business question Results • Sustain capabilities • Communicate value • Build application Model • Iterate production model • Pilot models • Build training data 1. 2. 3.
  • 6. Project Phasing • Most time is spent understanding data and building training data. • An early pilot is key to refining to training data and building support for change. • Developing the full application starts early with a UX for the pilot model.
  • 7. 1. Set Foundation A. Define the business question B. Align change C. Understand data • Learn and set expectations on the data science process and cloud hosting. • Define precise business questions. • Model how answering the business question delivers results. • Link business and regulatory needs to training data design and algorithm selection, e.g. does a model require easy explainability? • Build a coalition of sponsors and communicate the vision. • Define roles for compliance, customer service, finance, marketing, product, and sales. • Understand the data generating process: genchi genbutsu. • Visualize the “shape of the data”: distributions, sensitivity, clusters, anomalies, and sparseness. Identify quality issues. • Capture rules and map data flows from source systems.
  • 8. 2. Build Models A. Build training data B. Pilot models C. Iterate production models • Form business and IT team: roles, super-labelers, biases. • Design the data set’s scenarios and set quality criteria. • Visualize attributes and confirm with business sponsors. • Define rules to pre-process data and select open source algorithms. • Visualize and communicate results. Show an early win. Ideally, prototype the UX. • Plan enhancements to training data, algos, and applications. • Refine data (feature shaping and dimensionality reduction). • Customize rules and algorithms. • Connect into the broader application starting with the data model.
  • 9. 3. Deliver Results A. Build application B. Communicate value C. Sustain capabilities • Visualize UX, define data model and APIs. • Set non-functional requirements such as scalability, latency, and security. • Define test plan. • Communicate how the solution makes jobs better and brings value to customers • Build understanding and support with key influencers • Use multiple channels (meetings, email, calls) repeatedly to ensure reaching people • Optimize costs and scalability. Plan for decreased costs. • Confirm team skills and capacity to evolve the models. • Set plan for and automate re-training models. Set expectations that models may expand the range of scenarios covered and/or may improve precision.
  • 10. Contact Machine Intelligence Partners LLC serves clients globally. Our people are centered in Boston, Bozeman, Grand Rapids, London, New York, San Francisco, and Washington, D.C. Client relationship leaders: New York Jeremy Lehman 917.225.2011 [email protected] Washington, D.C. Philippe Berckmans 804.405.6009 [email protected] Machine Intelligence is an Amazon Technology Partner and member of the Microsoft Partner Network. We are a veteran-owned small business.