Explainable AI
David Gunning
DARPA/I2O
Dramatic success in machine learning has led to an explosion of AI applications. Researchers have
developed new AI capabilities for a wide variety of tasks. Continued advances promise to produce
autonomous systems that will perceive, learn, decide, and act on their own. However, the effectiveness
of these systems will be limited by the machine’s inability to explain its thoughts and actions to human
users. Explainable AI will be essential if users are to understand, trust, and effectively manage this
emerging generation of artificially intelligent partners.
Distribution Statement "A" (Approved for Public Release, Distribution Unlimited) 2
Explainable AI – What Are We Trying To Do?
Figure (image ©University Of Toronto): Training Data → New Learning Process → Explainable Model → Explanation Interface → User with a Task
Explanation Interface:
This is a cat:
• It has fur, whiskers, and claws.
• It has this feature: (image)
User with a Task:
• I understand why
• I understand why not
• I know when you’ll succeed
• I know when you’ll fail
• I know when to trust you
• I know why you erred
Figure: learning techniques plotted by Prediction Accuracy against Explainability
Caption: machine learning techniques that produce more explainable models, while maintaining a high level of learning performance
Techniques shown: Deep Learning, Ensemble Methods, Random Forests, Decision Trees, Bayesian Belief Nets, SRL, CRFs, HBNs, AOGs, MLNs, Markov Models, SVMs, Statistical Models
Deep Explanation
Modified deep learning techniques to learn explainable features
Figure labels: Generate Examples; External Ontology (Cat, Mammal, Dog); Semantic Attributes (Fur, Whiskers, Claws)
Cheng, H., et al. (2014) SRI-Sarnoff AURORA at TRECVID 2014: Multimedia Event Detection and Recounting.
http://www-nlpir.nist.gov/projects/tvpubs/tv14.papers/sri_aurora.pdf
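The idea of explaining a label through semantic attributes can be sketched in a few lines. This is only an illustration of the "This is a cat: it has fur, whiskers, and claws" style of explanation, not the method of the cited work. The function names, the attribute scores, and the 0.5 threshold are all hypothetical; in a real system the scores would come from learned attribute detectors.

```python
def join_names(names):
    """Join names as an English list with a serial comma."""
    if len(names) <= 2:
        return " and ".join(names)
    return ", ".join(names[:-1]) + ", and " + names[-1]


def explain(label, attribute_scores, threshold=0.5):
    """Turn attribute detector scores into a one-sentence explanation.

    attribute_scores maps an attribute name to a score in [0, 1].
    Both the scores and the threshold are illustrative assumptions.
    """
    ranked = sorted(attribute_scores.items(), key=lambda kv: kv[1], reverse=True)
    present = [name for name, score in ranked if score >= threshold]
    if not present:
        return f"This is a {label}, but no attribute scored above {threshold}."
    return f"This is a {label}: it has {join_names(present)}."


# Hypothetical detector outputs for one image.
scores = {"whiskers": 0.90, "fur": 0.95, "wings": 0.03, "claws": 0.70}
print(explain("cat", scores))
```

Attributes are listed strongest first, so the sentence leads with the evidence the detectors were most confident about.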
Learning To Generate Explanations
Bayesian Program Learning
Lake, B.M., Salakhutdinov, R., & Tenenbaum, J.B. (2015). Human-level concept learning through probabilistic program
induction. Science, Vol. 350, 1332-1338.
Learning More Interpretable Models
Figure panel label: 5. Implicit sub-AoG
Si, Z. and Zhu, S. (2013). Learning AND-OR Templates for Object Recognition and Detection. IEEE Transactions On
Pattern Analysis and Machine Intelligence. Vol. 35 No. 9, 2189-2205.
Explainable AI – Performance vs. Explainability
Ribeiro, M.T., Singh, S., and Guestrin, C. (2016). “Why Should I Trust You?” Explaining the Predictions of Any Classifier.
CHI 2016 Workshop on Human Centered Machine Learning. (arXiv:1602.04938v1 [cs.LG] 16 Feb 2016)
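The local-explanation idea behind the Ribeiro et al. citation can be sketched as a proximity-weighted linear surrogate fitted around a single input. This is a simplified illustration, not the LIME implementation: it omits the interpretable representation and the sparse feature selection used in the paper. The `black_box` function, the sampling scale, and the kernel width are all assumptions made for the example.

```python
import math
import random


def black_box(x):
    """Hypothetical opaque model returning a score in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-(3.0 * x[0] - 2.0 * x[1] + 0.5 * x[0] * x[1])))


def solve(a, b):
    """Solve the small linear system a w = b by Gaussian elimination."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    w = [0.0] * n
    for r in range(n - 1, -1, -1):
        w[r] = (m[r][n] - sum(m[r][c] * w[c] for c in range(r + 1, n))) / m[r][r]
    return w


def local_surrogate(model, x0, n_samples=500, scale=0.3, width=0.5, seed=0):
    """Fit a linear model to `model` near x0, weighting samples by proximity."""
    rng = random.Random(seed)
    d = len(x0)
    # Accumulate the weighted normal equations (X^T W X) w = X^T W y.
    xtx = [[0.0] * (d + 1) for _ in range(d + 1)]
    xty = [0.0] * (d + 1)
    for _ in range(n_samples):
        z = [xi + rng.gauss(0.0, scale) for xi in x0]
        dist2 = sum((zi - xi) ** 2 for zi, xi in zip(z, x0))
        weight = math.exp(-dist2 / width ** 2)  # closer samples count more
        row = [1.0] + [zi - xi for zi, xi in zip(z, x0)]  # intercept, centered features
        y = model(z)
        for i in range(d + 1):
            xty[i] += weight * row[i] * y
            for j in range(d + 1):
                xtx[i][j] += weight * row[i] * row[j]
    coefs = solve(xtx, xty)
    return coefs[0], coefs[1:]


intercept, weights = local_surrogate(black_box, [0.2, -0.1])
for name, w in zip(["feature_0", "feature_1"], weights):
    print(f"{name}: local weight {w:+.3f}")
```

The sign and size of each local weight indicate how the black-box score responds to that feature near the chosen input. The weights describe only this neighborhood, not the model as a whole.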
Model Induction
Letham, B., Rudin, C., McCormick, T., and Madigan, D. (2015). Interpretable classifiers using rules and Bayesian
analysis: Building a better stroke prediction model. Annals of Applied Statistics 2015, Vol. 9, No. 3, 1350-137
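The kind of model the Letham et al. citation refers to is an ordered list of if-then rules, where the first rule that fires gives both the prediction and its explanation. The sketch below shows only how such a list is evaluated. The rules, feature names, and risk values are made up for illustration and are not taken from the paper, and the Bayesian procedure that learns the list from data is not shown.

```python
# Hypothetical rule list: (readable condition, test function, predicted risk).
# Every rule and number here is an illustrative assumption.
RULES = [
    ("prior_event and age > 60", lambda p: p["prior_event"] and p["age"] > 60, 0.60),
    ("prior_event", lambda p: p["prior_event"], 0.30),
    ("age > 75", lambda p: p["age"] > 75, 0.15),
]
DEFAULT_RISK = 0.05


def predict_with_reason(patient):
    """Return (risk, explanation) from the first rule that matches."""
    for i, (text, condition, risk) in enumerate(RULES):
        if condition(patient):
            prefix = "if" if i == 0 else "else if"
            return risk, f"{prefix} {text} then risk = {risk:.2f}"
    return DEFAULT_RISK, f"else risk = {DEFAULT_RISK:.2f}"


risk, reason = predict_with_reason({"age": 80, "prior_event": False})
print(reason)
```

Because the prediction is the rule itself, the explanation needs no separate interface: the user reads the exact condition that produced the output.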
Explainable AI – Why Do You Think It Will Be Successful?
Figure (image ©University Of Toronto): Training Data → New Learning Process → Explainable Model → Explanation Interface
Explanation Interface:
This is a cat:
• It has fur, whiskers, and claws.
• It has this feature: (image)
User:
• I understand why
• I understand why not
• I know when you’ll succeed
• I know when you’ll fail
• I know when to trust you
• I know why you erred
Kulesza, T., Burnett, M., Wong, W.-K., & Stumpf, S. (2015). Principles of Explanatory Debugging to Personalize Interactive
Machine Learning. IUI 2015, Proceedings of the 20th International Conference on Intelligent User Interfaces (pp. 126-137).
Measure of Explanation Effectiveness
Explanation Framework
Figure: Task → XAI System (Explainable Model, Explanation Interface) → Recommendation, Decision or Action, with Explanation → Decision
• The system takes input from the current task and makes a recommendation, decision, or action
• The system provides an explanation to the user that justifies its recommendation, decision, or action
• The user makes a decision based on the explanation
User Satisfaction
• Clarity of the explanation (user rating)
• Utility of the explanation (user rating)
Mental Model
• Understanding individual decisions
• Understanding the overall model
• Strength/weakness assessment
• ‘What will it do’ prediction
• ‘How do I intervene’ prediction
Task Performance
• Does the explanation improve the user’s decision, task performance?
• Artificial decision tasks introduced to diagnose the user’s understanding
Trust Assessment
• Appropriate future use and trust
Correctability
• Identifying errors
• Correcting errors
• Continuous training
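One way to put a number on the trust measure above is to compare user behavior with and without explanations. The sketch below assumes a hypothetical study log in which each trial records whether the system was correct and whether the user followed its recommendation. The "appropriate reliance" score and the trial data are illustrative assumptions, not a metric defined in the source.

```python
from statistics import mean


def appropriate_reliance(trials):
    """Fraction of trials where the user followed the system exactly when it was right.

    Each trial is a (system_correct, user_followed) pair of booleans.
    """
    return mean(1.0 if correct == followed else 0.0 for correct, followed in trials)


# Hypothetical trial logs for two study conditions.
with_explanation = [(True, True), (True, True), (False, False), (False, True), (True, True)]
without_explanation = [(True, True), (True, False), (False, True), (False, True), (True, True)]

gain = appropriate_reliance(with_explanation) - appropriate_reliance(without_explanation)
print(f"with: {appropriate_reliance(with_explanation):.2f}")
print(f"without: {appropriate_reliance(without_explanation):.2f}")
print(f"gain: {gain:.2f}")
```

A positive gain would suggest that explanations help users rely on the system when it is right and override it when it is wrong. A real study would also need enough trials and a significance test.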
Explainable AI – Challenge Problem Areas
Multimedia Data
• Classifies items of interest in large data set
• Explains why/why not for recommended items
• Analyst decides which items to report, pursue
Autonomy
Reinforcement Learning Task
Figure labels: Explainable Model, Explanation Interface, Actions, Explanation
• An operator is directing autonomous systems to accomplish a series of missions
ArduPilot & SITL Simulation
©ArduPilot.org ©US Army