SlideShare a Scribd company logo
Using Machine Learning to Optimize
DevOps Practices
Building Learning into Monitoring and Feedback
Peter Varhol
About me
• International speaker and writer
• Degrees in Math, CS, Psychology
• Technology communicator
• Former university professor, tech journalist
• Cat owner and distance runner
• peter@petervarhol.com
Agenda
• What is machine learning?
• How is machine learning applied to DevOps?
• Challenges in training these systems
• What constitutes an issue?
• Summary and conclusions
What is Machine Learning?
• Layered algorithms that change parameters based on feedback
from know data
• Can be linear or nonlinear
• Algorithms can be fixed in production or adaptive
• Fixed – algorithms do not adjust once deployed
• Adaptive – algorithms continually adjust to new data
• Usually part of a larger system
Adaptive Systems
• Airline pricing
• Ticket prices change three times a day based on demand
• It can cost less to go farther
• It can cost less later
• Ecommerce systems
• Recommendations try to discern what else you might want
• Can I incentivize you to fill up the plane?
Why Use Adaptive?
• The “right” result will vary over time
• Trying to optimize a particular result
• Revenue
• The problem domain is not static
Confidential, Dynatrace LLC
How Are Fixed Systems Used?
• Transportation
• Self-driving cars
• Aircraft/Drones
• Ecommerce
• Recommendation engines
• Medical
• Diagnosis systems
Why Use Fixed Machine Learning Systems
• The problem domain is static
• The expectations remain constant
• The right answer is known under most conditions
• The original algorithms remain valid over a long period of time
DevOps Practices Generate Data
• During development
• Agile metrics, JIRA issues, test case metrics
• During continuous integration
• System test metrics
• During continuous deployment
• Quality metrics for deployments
• After deployment and into production
• Application availability and performance
• Usage log files
Focus on Monitoring
• Ongoing data on availability and performance
• RUM
• Synthetic tests
• Application monitoring
• Monitoring tackles the back end of DevOps
• Identifying unhealthy trends
• Diagnoses failures and poor performance
• Recommends action
• Fixed or adaptive depends on your goals
Where Do Predictive Analytics Come In?
• Big data makes possible predictions of future events
• Are we going to fail?
• How will we perform with traffic surges?
• As well as past events
• What went wrong and how do we fix it
• We can rely on past data
• Adaptive systems may not perform as well
• Clear goals needed
What Technologies Are Involved?
• Neural networks
• Genetic algorithms
• Rules engines
Neural Networks
• Set of layered algorithms whose variables can be
adjusted via a learning process
• The learning process involves training with
known inputs and outputs
• The algorithms adjust coefficients to converge on
the correct answer (or not)
• You freeze the algorithms and coefficients, and
deploy
• Or you optimize on a particular set of characteristics
A Sample Neural Network
Genetic Algorithms
• Use the principle of natural selection
• Create a range of possible solutions
• Try out each of them
• Choose and combine two of the better
alternatives
• Rinse and repeat as necessary
Bringing in DevOps
• DevOps has data that can be used to train neural networks
• Health of the application
• Trends in application traffic and responsiveness
• Application failure
Machine Learning Helps DevOps
• Decisions are complex
• Why is the CPU maxed?
• What is causing disk thrashing?
• Why did the network slow?
• Why did the application fail?
• Data is massive
• Potentially thousands of data points a day
How Good Are Decisions?
• Expert versus machine
• Given the same data
• In many domains they tie
• With additional data, the human can be better
• But machine learning will get better
• But only as good as the data
We Want to Do Two Things
• Identify trends that may indicate future problems
• Increasing response times
• More page errors
• Diagnose faults once they have happened
• Why did the application fail?
• How can we fix it as quickly as possible?
Fixed Algorithms Work for Some Problems
• Immediate performance and failure identification
• Diagnosis of failures and performance issues
• These are readily identifiable from known data
Adaptive Systems Supplement These Tools
• Predictions of future events
• Performance
• Availability
• The target is moving
• So we need current data to adjust the algorithms
The Machine Helps the DevOps Expert
• The machine learning app provides:
• Early warning on possible performance issues and failures
• Immediate notification of failure or impending failure
• Trend analysis of data to predict unhealthy outcomes
• The machine learning is an assistant
• It can’t fix anything
• It can’t necessarily identify the root cause
What is the Goal?
• We have many ways of monitoring
• Many of them are represented at this conference
• Each measures something a little different
• Latency, response time, availability, network, DNS . . .
• Too much data can be no better than no data at all
• Machine learning can correlate across
measurements
• Focus to eliminate false positives
Intelligent Systems Are Sometimes Wrong
• The problem domain is ambiguous
• There is no single “right” answer
• “Close enough” is good
• We don’t know quite why the software
responds as it does
• We can’t easily trace code paths
Testing Machine Learning Systems
• Have objective acceptance criteria
• Test with new data
• Don’t count on all results being accurate
• Understand the architecture of the network as a part of
the testing process
• Communicate the level of confidence you have in the
results to management and users
A Cautionary Tale
• All events are not created equal
• AI systems treat events equally
• A failure of a system during busy season is the same as any other
• DevOps pros know otherwise
• And can exert additional effort in response
• And actually fix the problem
• We can’t automate what we don’t understand
• You need the human in the loop
Confidential, Dynatrace LLC
Conclusions
• DevOps is a natural environment for machine learning
systems
• Any activity that generates data and requires a decision is fair game
• Monitoring is low-hanging fruit
• Fixed systems for failure and diagnosis, adaptive for trend
analysis
Confidential, Dynatrace LLC
References
• https://qz.com/989137/when-a-robot-ai-doctor-misdiagnoses-you-
whos-to-blame/
• https://pvarhol.wordpress.com/2017/07/22/what-brought-about-
our-ai-revolution/
• https://pvarhol.wordpress.com/2017/06/21/analytics-dont-apply-in-
the-clutch/
Confidential, Dynatrace LLC
Thank You
Peter Varhol
peter@petervarhol.com

More Related Content

What's hot (20)

PPTX
Optimizing Java
Adam Feldscher
 
PPTX
TransPort Workshop
jwcampbe
 
PPTX
Monitoring Distributed Systems
Aleksandr Tavgen
 
PDF
Software Testing
MusTufa Nullwala
 
PPTX
Grab a coffee and take 5 mins out
Druantia
 
PDF
What Do We Automate First
rrice2000
 
PDF
Automated testing san francisco oct 2013
Solano Labs
 
PDF
Your Data Scientist Hates You
Bradford Stephens
 
PPTX
SharePoint Troubleshooting
Toby McGrail
 
KEY
Make Life Suck Less (Building Scalable Systems)
Bradford Stephens
 
PPTX
New technology new approaches - tmf - july 2016
Stevan Zivanovic
 
PPTX
Software engineering
DivyaSharma458
 
PPTX
Wix Automation - Automation Manager
Efrat Attas
 
PPTX
Performing Oracle Health Checks Using APEX
Datavail
 
PPTX
4 pc repair
Rheigh Henley Calderon
 
PPTX
Solano Labs presented at MassTLC's automated testing
MassTLC
 
PPTX
Grab a coffee and take 5 mins out
Druantia
 
PPTX
Becoma an Ace in Analytics
Ken Goossens
 
PDF
SHEKHAR VERMA
shekhar verma
 
PPTX
Digital Testing Approach
Anand Deshpande
 
Optimizing Java
Adam Feldscher
 
TransPort Workshop
jwcampbe
 
Monitoring Distributed Systems
Aleksandr Tavgen
 
Software Testing
MusTufa Nullwala
 
Grab a coffee and take 5 mins out
Druantia
 
What Do We Automate First
rrice2000
 
Automated testing san francisco oct 2013
Solano Labs
 
Your Data Scientist Hates You
Bradford Stephens
 
SharePoint Troubleshooting
Toby McGrail
 
Make Life Suck Less (Building Scalable Systems)
Bradford Stephens
 
New technology new approaches - tmf - july 2016
Stevan Zivanovic
 
Software engineering
DivyaSharma458
 
Wix Automation - Automation Manager
Efrat Attas
 
Performing Oracle Health Checks Using APEX
Datavail
 
Solano Labs presented at MassTLC's automated testing
MassTLC
 
Grab a coffee and take 5 mins out
Druantia
 
Becoma an Ace in Analytics
Ken Goossens
 
SHEKHAR VERMA
shekhar verma
 
Digital Testing Approach
Anand Deshpande
 

Viewers also liked (9)

PPTX
Using Infrastructure as an Accelerator of DevOps Maturity
Josh Atwell
 
PDF
The API Side of Monitoring
Nordic APIs
 
PPTX
VMUG Melbourne - DevOps - Not Just for Open Source and Unicorns
Josh Atwell
 
PPTX
Managing the Infrastructure Stack with PowerShell
Josh Atwell
 
PPTX
DevOps and Groupthink An Oxymoron?
Qualitest
 
PDF
Philipp Krenn - NoSQL Means No Security?
Kevin Cross
 
PPTX
Work + Family +Self + Fast Paced Industry = ¯\_(ツ)_/¯
Josh Atwell
 
PDF
Josh Atwell - Infrastructure Extensibility at Home and in DevOps
Kevin Cross
 
PPTX
Devopsdays Edinburgh 2017 - Ignite talk - Swarming
Jon Stevens-Hall
 
Using Infrastructure as an Accelerator of DevOps Maturity
Josh Atwell
 
The API Side of Monitoring
Nordic APIs
 
VMUG Melbourne - DevOps - Not Just for Open Source and Unicorns
Josh Atwell
 
Managing the Infrastructure Stack with PowerShell
Josh Atwell
 
DevOps and Groupthink An Oxymoron?
Qualitest
 
Philipp Krenn - NoSQL Means No Security?
Kevin Cross
 
Work + Family +Self + Fast Paced Industry = ¯\_(ツ)_/¯
Josh Atwell
 
Josh Atwell - Infrastructure Extensibility at Home and in DevOps
Kevin Cross
 
Devopsdays Edinburgh 2017 - Ignite talk - Swarming
Jon Stevens-Hall
 
Ad

Similar to Using Machine Learning to Optimize DevOps Practices (20)

PPTX
Correlation does not mean causation
Peter Varhol
 
PPTX
Testing a movingtarget_quest_dynatrace
Peter Varhol
 
PPTX
Testing for cognitive bias in ai systems
Peter Varhol
 
PDF
How to improve your system monitoring
Andrew White
 
PPTX
Not fair! testing AI bias and organizational values
Peter Varhol
 
PPTX
Not fair! testing ai bias and organizational values
Peter Varhol
 
PPTX
YTD Video Downloader Pro Crack [Latest 2025]
rpari1867
 
PPTX
Distributed systems: design, principles and experiencies
Andrés Pérez Gil
 
PPTX
Observability – the good, the bad, and the ugly
Timetrix
 
PDF
Building data intensive applications
Amit Kejriwal
 
PPTX
An Agile Approach to Machine Learning
Randy Shoup
 
PPTX
Making a Mock by Kelsey Shannahan
QA or the Highway
 
PPTX
Making disaster routine
Peter Varhol
 
PDF
Productionising Machine Learning Models
Tash Bickley
 
PPTX
Lucas Gravley - HP - Self-Healing And Monitoring in a DevOps world
DevOps Enterprise Summit
 
PPTX
Design Like a Pro: Machine Learning Basics
Inductive Automation
 
PDF
Implementing Metrics & Completeness Reporting in TMF Management​
Montrium
 
PPTX
The Analysis Part of Integration Projects
BizTalk360
 
PPTX
Avoiding test hell
Yun Ki Lee
 
PPTX
Alphabet Soup: A(utomation), BC (Business Continuity) and DR (Disaster Recovery
Internetwork Engineering (IE)
 
Correlation does not mean causation
Peter Varhol
 
Testing a movingtarget_quest_dynatrace
Peter Varhol
 
Testing for cognitive bias in ai systems
Peter Varhol
 
How to improve your system monitoring
Andrew White
 
Not fair! testing AI bias and organizational values
Peter Varhol
 
Not fair! testing ai bias and organizational values
Peter Varhol
 
YTD Video Downloader Pro Crack [Latest 2025]
rpari1867
 
Distributed systems: design, principles and experiencies
Andrés Pérez Gil
 
Observability – the good, the bad, and the ugly
Timetrix
 
Building data intensive applications
Amit Kejriwal
 
An Agile Approach to Machine Learning
Randy Shoup
 
Making a Mock by Kelsey Shannahan
QA or the Highway
 
Making disaster routine
Peter Varhol
 
Productionising Machine Learning Models
Tash Bickley
 
Lucas Gravley - HP - Self-Healing And Monitoring in a DevOps world
DevOps Enterprise Summit
 
Design Like a Pro: Machine Learning Basics
Inductive Automation
 
Implementing Metrics & Completeness Reporting in TMF Management​
Montrium
 
The Analysis Part of Integration Projects
BizTalk360
 
Avoiding test hell
Yun Ki Lee
 
Alphabet Soup: A(utomation), BC (Business Continuity) and DR (Disaster Recovery
Internetwork Engineering (IE)
 
Ad

More from Peter Varhol (12)

PPTX
DevOps and the Impostor Syndrome
Peter Varhol
 
PPTX
162 the technologist of the future
Peter Varhol
 
PPTX
Digital transformation through devops dod indianapolis
Peter Varhol
 
PPTX
What Aircrews Can Teach Testing Teams
Peter Varhol
 
PPTX
Identifying and measuring testing debt
Peter Varhol
 
PPTX
What aircrews can teach devops teams ignite
Peter Varhol
 
PPTX
Talking to people lightning
Peter Varhol
 
PPTX
Varhol oracle database_firewall_oct2011
Peter Varhol
 
PPTX
Qa test managed_code_varhol
Peter Varhol
 
PDF
Talking to people: the forgotten DevOps tool
Peter Varhol
 
PPTX
How do we fix testing
Peter Varhol
 
PPTX
Moneyball peter varhol_starwest2012
Peter Varhol
 
DevOps and the Impostor Syndrome
Peter Varhol
 
162 the technologist of the future
Peter Varhol
 
Digital transformation through devops dod indianapolis
Peter Varhol
 
What Aircrews Can Teach Testing Teams
Peter Varhol
 
Identifying and measuring testing debt
Peter Varhol
 
What aircrews can teach devops teams ignite
Peter Varhol
 
Talking to people lightning
Peter Varhol
 
Varhol oracle database_firewall_oct2011
Peter Varhol
 
Qa test managed_code_varhol
Peter Varhol
 
Talking to people: the forgotten DevOps tool
Peter Varhol
 
How do we fix testing
Peter Varhol
 
Moneyball peter varhol_starwest2012
Peter Varhol
 

Recently uploaded (20)

PDF
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
PPTX
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
DOCX
Python coding for beginners !! Start now!#
Rajni Bhardwaj Grover
 
PPTX
The Project Compass - GDG on Campus MSIT
dscmsitkol
 
PPT
Ericsson LTE presentation SEMINAR 2010.ppt
npat3
 
PPTX
Future Tech Innovations 2025 – A TechLists Insight
TechLists
 
PDF
NASA A Researcher’s Guide to International Space Station : Physical Sciences ...
Dr. PANKAJ DHUSSA
 
PDF
LOOPS in C Programming Language - Technology
RishabhDwivedi43
 
PDF
Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
PDF
Staying Human in a Machine- Accelerated World
Catalin Jora
 
PDF
POV_ Why Enterprises Need to Find Value in ZERO.pdf
darshakparmar
 
PDF
Go Concurrency Real-World Patterns, Pitfalls, and Playground Battles.pdf
Emily Achieng
 
PDF
Peak of Data & AI Encore AI-Enhanced Workflows for the Real World
Safe Software
 
PDF
“Voice Interfaces on a Budget: Building Real-time Speech Recognition on Low-c...
Edge AI and Vision Alliance
 
PDF
NLJUG Speaker academy 2025 - first session
Bert Jan Schrijver
 
PDF
The Rise of AI and IoT in Mobile App Tech.pdf
IMG Global Infotech
 
PDF
How do you fast track Agentic automation use cases discovery?
DianaGray10
 
PDF
UPDF - AI PDF Editor & Converter Key Features
DealFuel
 
DOCX
Cryptography Quiz: test your knowledge of this important security concept.
Rajni Bhardwaj Grover
 
PDF
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
Python coding for beginners !! Start now!#
Rajni Bhardwaj Grover
 
The Project Compass - GDG on Campus MSIT
dscmsitkol
 
Ericsson LTE presentation SEMINAR 2010.ppt
npat3
 
Future Tech Innovations 2025 – A TechLists Insight
TechLists
 
NASA A Researcher’s Guide to International Space Station : Physical Sciences ...
Dr. PANKAJ DHUSSA
 
LOOPS in C Programming Language - Technology
RishabhDwivedi43
 
Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
Staying Human in a Machine- Accelerated World
Catalin Jora
 
POV_ Why Enterprises Need to Find Value in ZERO.pdf
darshakparmar
 
Go Concurrency Real-World Patterns, Pitfalls, and Playground Battles.pdf
Emily Achieng
 
Peak of Data & AI Encore AI-Enhanced Workflows for the Real World
Safe Software
 
“Voice Interfaces on a Budget: Building Real-time Speech Recognition on Low-c...
Edge AI and Vision Alliance
 
NLJUG Speaker academy 2025 - first session
Bert Jan Schrijver
 
The Rise of AI and IoT in Mobile App Tech.pdf
IMG Global Infotech
 
How do you fast track Agentic automation use cases discovery?
DianaGray10
 
UPDF - AI PDF Editor & Converter Key Features
DealFuel
 
Cryptography Quiz: test your knowledge of this important security concept.
Rajni Bhardwaj Grover
 
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 

Using Machine Learning to Optimize DevOps Practices

  • 1. Using Machine Learning to Optimize DevOps Practices Building Learning into Monitoring and Feedback Peter Varhol
  • 2. About me • International speaker and writer • Degrees in Math, CS, Psychology • Technology communicator • Former university professor, tech journalist • Cat owner and distance runner • [email protected]
  • 3. Agenda • What is machine learning? • How is machine learning applied to DevOps? • Challenges in training these systems • What constitutes an issue? • Summary and conclusions
  • 4. What is Machine Learning? • Layered algorithms that change parameters based on feedback from know data • Can be linear or nonlinear • Algorithms can be fixed in production or adaptive • Fixed – algorithms do not adjust once deployed • Adaptive – algorithms continually adjust to new data • Usually part of a larger system
  • 5. Adaptive Systems • Airline pricing • Ticket prices change three times a day based on demand • It can cost less to go farther • It can cost less later • Ecommerce systems • Recommendations try to discern what else you might want • Can I incentivize you to fill up the plane?
  • 6. Why Use Adaptive? • The “right” result will vary over time • Trying to optimize a particular result • Revenue • The problem domain is not static Confidential, Dynatrace LLC
  • 7. How Are Fixed Systems Used? • Transportation • Self-driving cars • Aircraft/Drones • Ecommerce • Recommendation engines • Medical • Diagnosis systems
  • 8. Why Use Fixed Machine Learning Systems • The problem domain is static • The expectations remain constant • The right answer is known under most conditions • The original algorithms remain valid over a long period of time
  • 9. DevOps Practices Generate Data • During development • Agile metrics, JIRA issues, test case metrics • During continuous integration • System test metrics • During continuous deployment • Quality metrics for deployments • After deployment and into production • Application availability and performance • Usage log files
  • 10. Focus on Monitoring • Ongoing data on availability and performance • RUM • Synthetic tests • Application monitoring • Monitoring tackles the back end of DevOps • Identifying unhealthy trends • Diagnoses failures and poor performance • Recommends action • Fixed or adaptive depends on your goals
  • 11. Where Do Predictive Analytics Come In? • Big data makes possible predictions of future events • Are we going to fail? • How will we perform with traffic surges? • As well as past events • What went wrong and how do we fix it • We can rely on past data • Adaptive systems may not perform as well • Clear goals needed
  • 12. What Technologies Are Involved? • Neural networks • Genetic algorithms • Rules engines
  • 13. Neural Networks • Set of layered algorithms whose variables can be adjusted via a learning process • The learning process involves training with known inputs and outputs • The algorithms adjust coefficients to converge on the correct answer (or not) • You freeze the algorithms and coefficients, and deploy • Or you optimize on a particular set of characteristics
  • 14. A Sample Neural Network
  • 15. Genetic Algorithms • Use the principle of natural selection • Create a range of possible solutions • Try out each of them • Choose and combine two of the better alternatives • Rinse and repeat as necessary
  • 16. Bringing in DevOps • DevOps has data that can be used to train neural networks • Health of the application • Trends in application traffic and responsiveness • Application failure
  • 17. Machine Learning Helps DevOps • Decisions are complex • Why is the CPU maxed? • What is causing disk thrashing? • Why did the network slow? • Why did the application fail? • Data is massive • Potentially thousands of data points a day
  • 18. How Good Are Decisions? • Expert versus machine • Given the same data • In many domains they tie • With additional data, the human can be better • But machine learning will get better • But only as good as the data
  • 19. We Want to Do Two Things • Identify trends that may indicate future problems • Increasing response times • More page errors • Diagnose faults once they have happened • Why did the application fail? • How can we fix it as quickly as possible?
  • 20. Fixed Algorithms Work for Some Problems • Immediate performance and failure identification • Diagnosis of failures and performance issues • These are readily identifiable from known data
  • 21. Adaptive Systems Supplement These Tools • Predictions of future events • Performance • Availability • The target is moving • So we need current data to adjust the algorithms
  • 22. The Machine Helps the DevOps Expert • The machine learning app provides: • Early warning on possible performance issues and failures • Immediate notification of failure or impending failure • Trend analysis of data to predict unhealthy outcomes • The machine learning is an assistant • It can’t fix anything • It can’t necessarily identify the root cause
  • 23. What is the Goal? • We have many ways of monitoring • Many of them are represented at this conference • Each measures something a little different • Latency, response time, availability, network, DNS . . . • Too much data can be no better than no data at all • Machine learning can correlate across measurements • Focus to eliminate false positives
  • 24. Intelligent Systems Are Sometimes Wrong • The problem domain is ambiguous • There is no single “right” answer • “Close enough” is good • We don’t know quite why the software responds as it does • We can’t easily trace code paths
  • 25. Testing Machine Learning Systems • Have objective acceptance criteria • Test with new data • Don’t count on all results being accurate • Understand the architecture of the network as a part of the testing process • Communicate the level of confidence you have in the results to management and users
  • 26. A Cautionary Tale • All events are not created equal • AI systems treat events equally • A failure of a system during busy season is the same as any other • DevOps pros know otherwise • And can exert additional effort in response • And actually fix the problem • We can’t automate what we don’t understand • You need the human in the loop Confidential, Dynatrace LLC
  • 27. Conclusions • DevOps is a natural environment for machine learning systems • Any activity that generates data and requires a decision is fair game • Monitoring is low-hanging fruit • Fixed systems for failure and diagnosis, adaptive for trend analysis Confidential, Dynatrace LLC

Editor's Notes

  • #8: These types of software are becoming increasingly common, in areas such as ecommerce, public transportation, automotive, finance, and computer networks. They have the potential to make decisions given sufficiently well-defined inputs and goals. In some instances, they are characterized as artificial intelligence, in that they seemingly make decisions that were once the purview of a human user or operator.
  • #14: Most machine learning systems are based on neural networks. A neural network is a set of layered algorithms whose variables can be adjusted via a learning process. The learning process involves using known data inputs to create outputs that are then compared with known results. When the algorithms reflect the known results with the desired degree of accuracy, the algebraic coefficients are frozen and production code is generated. Today, this comprises much of what we understand as artificial intelligence.
  • #21: But there is a type of software where having a defined output is no longer the case. Actually, two types. One is machine learning systems. The second is predictive analytics, or adaptive systems.
  • #27: Have objective acceptance criteria. Know the amount of error you and your users are willing to accept. Test with new data. Once you’ve trained the network and frozen the architecture and coefficients, use fresh inputs and outputs to verify its accuracy. Don’t count on all results being accurate. That’s just the nature of the beast. And you may have to recommend throwing out the entire network architecture and starting over. Understand the architecture of the network as a part of the testing process. Few if any will be able to actually follow a set of inputs through the network of algorithms, but understanding how the network is constructed will help testers determine if another architecture might produce better results. Communicate the level of confidence you have in the results to management and users. Machine learning systems offer you the unique opportunity to describe confidence in statistical terms, so use them. One important thing to note is that the training data itself could well contain inaccuracies. In this case, because of measurement error, the recorded wind speed and direction could be off or ambiguous. In other cases, the cooling of the filament likely has some error in its measurement.