SlideShare a Scribd company logo
3
Most read
6
Most read
11
Most read
Stories from the Financial
Service AI Trenches
Lessons learned from building AI models in EY
18 November 2020
Tim Santos, Assistant Director, Client Technology AI
Mustafa Somalya , Assistant Director , Client Technology AI
18 November 2020Page 2 Stories from the Financial Service AI Trenches: Lessons learned from building AI models in EY
1 AI in Financial Services Overview
2 Use Cases and Learnings
Agenda
18 November 2020Page 3
AI in Financial Services
How does an experiment-driven disruptive technology such as AI look like in a highly-regulated industry?
Sources:
https://www.fca.org.uk/publication/research/research-note-on-machine-learning-in-uk-financial-services.pdf
https://ec.europa.eu/digital-single-market/en/high-level-expert-group-artificial-intelligence
http://rms.koenig-solutions.com/Sync_data/Trainer/QMS/1752-2020328106-AuditingArtificialIntelligencereseng1218(1).pdf
Stories from the Financial Service AI Trenches: Lessons learned from building AI models in EY
18 November 2020Page 4
Review data sourcing, profiling,
processing, as well as data
quality and ethical issues
Assess approach and models are fit for
purpose, explainable, reproducible,
and robust, with supporting evidence
Confirm outcomes achieve desired level
of precision and consistency, and are
aligned with ethical, lawful, and fair
design criteria
Ensure solution is scalable and
deployable with the right tech
infrastructure, and
continuously monitored
Ensure business purpose,
governance and stakeholder
engagement are properly
identified and aligned
Solution
Lifecycle
Modelling
Outcome
Analysis
Deployment
and
Monitoring
Data and
Processing
Business and
Governance
Source: https://www.ukfinance.org.uk/system/files/Trust%2C%20Context%20and%20Regulation%20-%20Achieving%20more%20explainable%20AI%20in%20financial%20services.pdf
Stories from the Financial Service AI Trenches: Lessons learned from building AI models in EY
AI in Financial Services
How does an experiment-driven disruptive technology such as AI look like in a highly-regulated industry?
How do you train models for rich
yet highly restricted data that could
be difficult to acquire?
18 November 2020Page 5 Stories from the Financial Service AI Trenches: Lessons learned from building AI models in EY
Use Case : Know Your Customer (KYC)
Page 6
KYC
Social Network
Employment
Information
Self-certification
Forms
Biometric Data
Legal Documents
Open
Banking
Proof of Identity
Digital Footprint
KYC requires a lot of time consuming
repetitive manual work that involves
the processing of a variety of data
sources.
Ubiquity, variety of data sources, and
complexity involved in cognitive tasks
make it a very attractive use case for AI.
Stories from the Financial Service AI Trenches: Lessons learned from building AI models in EY18 November 2020
Use Case : Know Your Customer (KYC)
Page 7
Form Field Detection
2
Handwritten Text Recognition
3
Data Synthesis
1
Stories from the Financial Service AI Trenches: Lessons learned from building AI models in EY18 November 2020
Use Case : Know Your Customer (KYC)
► Data is scarce or highly restricted because of sensitive and personally-identifiable information
► SDLC and DevOps can be inadequate for ML development, consider MLOps
► Treat the scarcity of data as a technological and scientific problem
► When using synthetic or generic datasets, ensure that there’s a feedback mechanism for when live
data becomes available
22 November 2020 Presentation titlePage 8
How do you develop models when
data from clients come from
different geographies, have
different legislations and cross-
border restrictions?
18 November 2020Page 9 Stories from the Financial Service AI Trenches: Lessons learned from building AI models in EY
Use Case: Multi-Jurisdiction Models
Generic models and pipelines are reused, iterations produce bespoke models by incorporating country-specific data
22 November 2020 Presentation titlePage 10
Reusable Components
Standard ML Pipeline: Base Model
► Common laws and treaties
► Similar industry trends and
treatments
► Transactional trends
► Language models
► Common data model
► Generic dataset
► Regional market
► Cross regional market
► National market
Base Model
Country X Country Y
Model Y v1Model X v2Model X v1
Model X
v3
Model Y v2
retrain
increment
Country Z
retrain
Model Z v1
Model Y v3
MODELS XYZ
ML Pipeline Iteration XYZ:
Bespoke Model
► Hyperparameter Tuning
► Country-specific datasets and
enrichment
► Additional categories and features
18 November 2020Page 11 Stories from the Financial Service AI Trenches: Lessons learned from building AI models in EY
Multi-Jurisdiction Models Example – MLOps and AutoML
Modelling Outcome
Analysis
Deployment and MonitoringData and
Processing
Data Prep
Exploratory
Data Analysis
Feature
Engineering
Feature
Selection
Benchmark
Metrics
Model Serving
(Inference)
Drift
Monitoring
(inference)
Model Build and AutoML Pipeline
Hyperparameter
Tuning
Training
CI/CD
Model Serving
(Train Pipeline)
Retraining/
Rollback /
Increment
Data slicing
Model Serving (Training Pipeline)
Experimentation
Feature
Importance
Drift
Monitoring
(Training)
Model Serving (Inference)
model is stale
make predictions
Human in
the loop
Consume/
Interface
High confidence
Low confidence
Model Drift Monitoring (Data Signature)
model is
good
• Create training (baseline) and inference dataset signatures from features
• Create signatures from predictions, also called theories
• Measure the distance of signatures
• Population Stability Index : 𝑃𝑆𝐼 = ∑!(𝐴! − 𝐵!) ln
"!
#!
{𝐴!, 𝐵! − 𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦 𝑐𝑜𝑢𝑛𝑡 𝑝𝑒𝑟 𝑓𝑒𝑎𝑡𝑢𝑟𝑒 𝑏𝑢𝑐𝑘𝑒𝑡}
• Numerical Parametric (should pass normality, homoskedasticity): T-test
• Numerical Non-parametric: Kruskal-Wallis, Wilcoxon, Kolmogorov, Mann-Whitney-U
• Categorical Features and theory testing: Pearson’s Chi-squared test
• Provide pass/warning/fail logic to trigger retraining, rollback, AutoML, reinforcement learning
Training
Dataset
Inference
Dataset
Model
features
features
predictions
Inference
Signature
Score
(Distance)
Training
Signature
Data
Augmentation
Transfer learning and Model Finetuning
18 November 2020Page 12
Use Case: Multi-Jurisdiction Models
Data drift monitoring and MLOps tools
Reproducible end-to-end ML pipelines and AutoML
Leveraging “human in the loop” with MLOps framework
and online learning
Enabling components for Multi-Jurisdiction and ML at scale
Time from Technical and Business
SMEs are valuable, a complementing
operating model and tooling would
be necessary to maximise value
Building and deploying bespoke
models for each jurisdiction is difficult
to scale without an end-to-end
MLOps platform
Stories from the Financial Service AI Trenches: Lessons learned from building AI models in EY
18 November 2020Page 13
The appeal of using AI in FS lies in very rich data,
the same reason that makes data very challenging
to acquire.
AI in FS usually involve clients in multiple
jurisdictions, it is imperative to have MLOps
framework and platform to develop ML at scale.
Stories from the Financial Service AI Trenches: Lessons learned from building AI models in EY
Key Takeaways
Q&A
18 November 2020Page 14
Tim Santos
Assistant Director | Global IT
► Global Client Technology AI
► MLOps Lead
► Timothy.Santos@uk.ey.com
Mustafa Somalya
Assistant Director | Global IT
► Global Client Technology AI
► ML Experimentation Lead
► Mustafa.M.Somalya@uk.ey.com
Stories from the Financial Service AI Trenches: Lessons learned from building AI models in EY
EY | Assurance | Tax | Strategy and Transactions | Consulting
About EY
EY is a global leader in assurance, tax, strategy, transaction and consulting
services. The insights and quality services we deliver help build trust and
confidence in the capital markets and in economies the world over. We develop
outstanding leaders who team to deliver on our promises to all of our
stakeholders. In so doing, we play a critical role in building a better working world
for our people, for our clients and for our communities.
EY refers to the global organization, and may refer to one or more, of the member
firms of Ernst & Young Global Limited, each of which is a separate legal entity.
Ernst & Young Global Limited, a UK company limited by guarantee, does not
provide services to clients. Information about how EY collects and uses personal
data and a description of the rights individuals have under data protection
legislation are available via ey.com/privacy. For more information about our
organization, please visit ey.com.
This news release has been issued by EYGM Limited, a member of the global EY
organization that also does not provide any services to clients.
© 2020 EYGM Limited.
All Rights Reserved.
EYG no.
ED MMYY
This material has been prepared for general informational purposes only and is
not intended to be relied upon as accounting, tax or other professional advice.
Please refer to your advisors for specific advice.

More Related Content

What's hot (20)

PPTX
The FDA and industry: A recipe for collaborating in the New Health Economy
PwC
 
PPTX
Creating a Winning Recipe for a Meal Kits Program
L.E.K. Consulting
 
PDF
Growth-Driving Enterprise Innovation Model | Accenture
accenture
 
PDF
A.T. Kearney Consolidation of the US Banking Industry
Kearney
 
PPTX
Top Trends in Payments 2022
Capgemini
 
PDF
Turning diligence insights into actionable integration steps
Deloitte United States
 
PPTX
Global Challengers 2018: Digital Leapfrogs
Boston Consulting Group
 
PPTX
Reinventing Enterprise Operations
accenture
 
PPTX
Creating an Enterprise AI Strategy
AtScale
 
PDF
Strategy Study 2014 | A.T. Kearney
Kearney
 
PPTX
The 4th Annual New Mobility Study 2019
L.E.K. Consulting
 
PDF
The Decade to Deliver: A Call to Business Action
accenture
 
PPTX
The Diversity Imperative: 14th Annual Australian Chief Executive Study
PwC's Strategy&
 
PDF
PwC's Unlock data possibilities - infographic
PwC
 
PPTX
Seven Forces Reshaping Enterprise Software
Boston Consulting Group
 
PDF
Federal Technology Vision 2021: Full U.S. Federal Survey Findings | Accenture
accenture
 
PDF
Fintech New York: Partnerships, Platforms and Open Innovation
accenture
 
PDF
Value Untangled Slideshare
accenture
 
PDF
Future-proofing SMEs TA vFF.pdf
enterpriseresearchcentre
 
PDF
Make Data Work for You
DATAVERSITY
 
The FDA and industry: A recipe for collaborating in the New Health Economy
PwC
 
Creating a Winning Recipe for a Meal Kits Program
L.E.K. Consulting
 
Growth-Driving Enterprise Innovation Model | Accenture
accenture
 
A.T. Kearney Consolidation of the US Banking Industry
Kearney
 
Top Trends in Payments 2022
Capgemini
 
Turning diligence insights into actionable integration steps
Deloitte United States
 
Global Challengers 2018: Digital Leapfrogs
Boston Consulting Group
 
Reinventing Enterprise Operations
accenture
 
Creating an Enterprise AI Strategy
AtScale
 
Strategy Study 2014 | A.T. Kearney
Kearney
 
The 4th Annual New Mobility Study 2019
L.E.K. Consulting
 
The Decade to Deliver: A Call to Business Action
accenture
 
The Diversity Imperative: 14th Annual Australian Chief Executive Study
PwC's Strategy&
 
PwC's Unlock data possibilities - infographic
PwC
 
Seven Forces Reshaping Enterprise Software
Boston Consulting Group
 
Federal Technology Vision 2021: Full U.S. Federal Survey Findings | Accenture
accenture
 
Fintech New York: Partnerships, Platforms and Open Innovation
accenture
 
Value Untangled Slideshare
accenture
 
Future-proofing SMEs TA vFF.pdf
enterpriseresearchcentre
 
Make Data Work for You
DATAVERSITY
 

Similar to Stories from the Financial Service AI Trenches: Lessons Learned from Building AI Models in EY (20)

PDF
Build Intelligence System with AI. Antimo Musone, Ernst & Young
Data Driven Innovation
 
PDF
Romanko chasopys july2015_post
ProstirChasopys
 
PPTX
Siecap Advisory Automation & Supply Chain Trends
Geoffrey Knowles
 
PDF
The role of the COO in the age of AI
Antony Turner
 
PDF
Machine Learning for Finance Master Class
QuantUniversity
 
PDF
TM Forum AI Program Overview
TMForum
 
PPT
CMG Brasil 2011 Keynote por Adam Grummit
Joao Galdino Mello de Souza
 
PDF
AI for optimizing customer journeys in online betting
Frosmo
 
PPTX
Big Data, customer analytics and loyalty marketing
Kevin May
 
PDF
Digital Platform Economy
Grow VC Group
 
PDF
Bi PowerPoint Presentation Slides
SlideTeam
 
PDF
Trust, Context and, Regulation: Achieving More Explainable AI in Financial Se...
Databricks
 
PPTX
Meet the new champ: Why HR managers go wild for QlikView 9 people intelligenc...
TopLine Strategies
 
PDF
Ml master class cfa poland
QuantUniversity
 
PDF
Career sessions IM
SupportGCI
 
PPTX
apidays LIVE Hong Kong 2021 - Federated Learning for Banking by Isaac Wong, W...
apidays
 
PPT
PowerPoint presentation
webhostingguy
 
PDF
The future of FinTech product using pervasive Machine Learning automation - A...
Shift Conference
 
PDF
Pm.ais ummit 180917 final
Nisha Talagala
 
DOCX
Role of 4IR technologies in the MIS of BEXIMCO COMPANY .docx
BGMEA University Of Fashion And Technology
 
Build Intelligence System with AI. Antimo Musone, Ernst & Young
Data Driven Innovation
 
Romanko chasopys july2015_post
ProstirChasopys
 
Siecap Advisory Automation & Supply Chain Trends
Geoffrey Knowles
 
The role of the COO in the age of AI
Antony Turner
 
Machine Learning for Finance Master Class
QuantUniversity
 
TM Forum AI Program Overview
TMForum
 
CMG Brasil 2011 Keynote por Adam Grummit
Joao Galdino Mello de Souza
 
AI for optimizing customer journeys in online betting
Frosmo
 
Big Data, customer analytics and loyalty marketing
Kevin May
 
Digital Platform Economy
Grow VC Group
 
Bi PowerPoint Presentation Slides
SlideTeam
 
Trust, Context and, Regulation: Achieving More Explainable AI in Financial Se...
Databricks
 
Meet the new champ: Why HR managers go wild for QlikView 9 people intelligenc...
TopLine Strategies
 
Ml master class cfa poland
QuantUniversity
 
Career sessions IM
SupportGCI
 
apidays LIVE Hong Kong 2021 - Federated Learning for Banking by Isaac Wong, W...
apidays
 
PowerPoint presentation
webhostingguy
 
The future of FinTech product using pervasive Machine Learning automation - A...
Shift Conference
 
Pm.ais ummit 180917 final
Nisha Talagala
 
Role of 4IR technologies in the MIS of BEXIMCO COMPANY .docx
BGMEA University Of Fashion And Technology
 
Ad

More from Databricks (20)

PPTX
DW Migration Webinar-March 2022.pptx
Databricks
 
PPTX
Data Lakehouse Symposium | Day 1 | Part 1
Databricks
 
PPT
Data Lakehouse Symposium | Day 1 | Part 2
Databricks
 
PPTX
Data Lakehouse Symposium | Day 2
Databricks
 
PPTX
Data Lakehouse Symposium | Day 4
Databricks
 
PDF
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
Databricks
 
PDF
Democratizing Data Quality Through a Centralized Platform
Databricks
 
PDF
Learn to Use Databricks for Data Science
Databricks
 
PDF
Why APM Is Not the Same As ML Monitoring
Databricks
 
PDF
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
Databricks
 
PDF
Stage Level Scheduling Improving Big Data and AI Integration
Databricks
 
PDF
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Databricks
 
PDF
Scaling your Data Pipelines with Apache Spark on Kubernetes
Databricks
 
PDF
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
Databricks
 
PDF
Sawtooth Windows for Feature Aggregations
Databricks
 
PDF
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Databricks
 
PDF
Re-imagine Data Monitoring with whylogs and Spark
Databricks
 
PDF
Raven: End-to-end Optimization of ML Prediction Queries
Databricks
 
PDF
Processing Large Datasets for ADAS Applications using Apache Spark
Databricks
 
PDF
Massive Data Processing in Adobe Using Delta Lake
Databricks
 
DW Migration Webinar-March 2022.pptx
Databricks
 
Data Lakehouse Symposium | Day 1 | Part 1
Databricks
 
Data Lakehouse Symposium | Day 1 | Part 2
Databricks
 
Data Lakehouse Symposium | Day 2
Databricks
 
Data Lakehouse Symposium | Day 4
Databricks
 
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
Databricks
 
Democratizing Data Quality Through a Centralized Platform
Databricks
 
Learn to Use Databricks for Data Science
Databricks
 
Why APM Is Not the Same As ML Monitoring
Databricks
 
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
Databricks
 
Stage Level Scheduling Improving Big Data and AI Integration
Databricks
 
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Databricks
 
Scaling your Data Pipelines with Apache Spark on Kubernetes
Databricks
 
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
Databricks
 
Sawtooth Windows for Feature Aggregations
Databricks
 
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Databricks
 
Re-imagine Data Monitoring with whylogs and Spark
Databricks
 
Raven: End-to-end Optimization of ML Prediction Queries
Databricks
 
Processing Large Datasets for ADAS Applications using Apache Spark
Databricks
 
Massive Data Processing in Adobe Using Delta Lake
Databricks
 
Ad

Recently uploaded (20)

PDF
5991-5857_Agilent_MS_Theory_EN (1).pdf. pdf
NohaSalah45
 
PPTX
Artificial intelligence Presentation1.pptx
SaritaMahajan5
 
PPTX
Model Evaluation & Visualisation part of a series of intro modules for data ...
brandonlee626749
 
PPTX
Daily, Weekly, Monthly Report MTC March 2025.pptx
PanjiDewaPamungkas1
 
PPTX
Project_Update_Summary.for the use from PM
Odysseas Lekatsas
 
PPTX
Monitoring Improvement ( Pomalaa Branch).pptx
fajarkunee
 
PPTX
microservices-with-container-apps-dapr.pptx
vjay22
 
PDF
Blood pressure (3).pdfbdbsbsbhshshshhdhdhshshs
hernandezemma379
 
PPT
Reliability Monitoring of Aircrfat commerce
Rizk2
 
PDF
5- Global Demography Concepts _ Population Pyramids .pdf
pkhadka824
 
PDF
Microsoft Power BI - Advanced Certificate for Business Intelligence using Pow...
Prasenjit Debnath
 
PPT
intro to AI dfg fgh gggdrhre ghtwhg ewge
traineramrsiam
 
PDF
Business Automation Solution with Excel 1.1.pdf
Vivek Kedia
 
PPTX
Generative AI Boost Data Governance and Quality- Tejasvi Addagada
Tejasvi Addagada
 
PDF
A Web Repository System for Data Mining in Drug Discovery
IJDKP
 
PPTX
english9quizw1-240228142338-e9bcf6fd.pptx
rossanthonytan130
 
PDF
CT-2-Ancient ancient accept-Criticism.pdf
DepartmentofEnglishC1
 
PPTX
covid 19 data analysis updates in our municipality
RhuAyungon1
 
PPTX
RESEARCH-FINAL-GROUP-3, about the final .pptx
gwapokoha1
 
5991-5857_Agilent_MS_Theory_EN (1).pdf. pdf
NohaSalah45
 
Artificial intelligence Presentation1.pptx
SaritaMahajan5
 
Model Evaluation & Visualisation part of a series of intro modules for data ...
brandonlee626749
 
Daily, Weekly, Monthly Report MTC March 2025.pptx
PanjiDewaPamungkas1
 
Project_Update_Summary.for the use from PM
Odysseas Lekatsas
 
Monitoring Improvement ( Pomalaa Branch).pptx
fajarkunee
 
microservices-with-container-apps-dapr.pptx
vjay22
 
Blood pressure (3).pdfbdbsbsbhshshshhdhdhshshs
hernandezemma379
 
Reliability Monitoring of Aircrfat commerce
Rizk2
 
5- Global Demography Concepts _ Population Pyramids .pdf
pkhadka824
 
Microsoft Power BI - Advanced Certificate for Business Intelligence using Pow...
Prasenjit Debnath
 
intro to AI dfg fgh gggdrhre ghtwhg ewge
traineramrsiam
 
Business Automation Solution with Excel 1.1.pdf
Vivek Kedia
 
Generative AI Boost Data Governance and Quality- Tejasvi Addagada
Tejasvi Addagada
 
A Web Repository System for Data Mining in Drug Discovery
IJDKP
 
english9quizw1-240228142338-e9bcf6fd.pptx
rossanthonytan130
 
CT-2-Ancient ancient accept-Criticism.pdf
DepartmentofEnglishC1
 
covid 19 data analysis updates in our municipality
RhuAyungon1
 
RESEARCH-FINAL-GROUP-3, about the final .pptx
gwapokoha1
 

Stories from the Financial Service AI Trenches: Lessons Learned from Building AI Models in EY

  • 1. Stories from the Financial Service AI Trenches Lessons learned from building AI models in EY 18 November 2020 Tim Santos, Assistant Director, Client Technology AI Mustafa Somalya , Assistant Director , Client Technology AI
  • 2. 18 November 2020Page 2 Stories from the Financial Service AI Trenches: Lessons learned from building AI models in EY 1 AI in Financial Services Overview 2 Use Cases and Learnings Agenda
  • 3. 18 November 2020Page 3 AI in Financial Services How does an experiment-driven disruptive technology such as AI look like in a highly-regulated industry? Sources: https://www.fca.org.uk/publication/research/research-note-on-machine-learning-in-uk-financial-services.pdf https://ec.europa.eu/digital-single-market/en/high-level-expert-group-artificial-intelligence http://rms.koenig-solutions.com/Sync_data/Trainer/QMS/1752-2020328106-AuditingArtificialIntelligencereseng1218(1).pdf Stories from the Financial Service AI Trenches: Lessons learned from building AI models in EY
  • 4. 18 November 2020Page 4 Review data sourcing, profiling, processing, as well as data quality and ethical issues Assess approach and models are fit for purpose, explainable, reproducible, and robust, with supporting evidence Confirm outcomes achieve desired level of precision and consistency, and are aligned with ethical, lawful, and fair design criteria Ensure solution is scalable and deployable with the right tech infrastructure, and continuously monitored Ensure business purpose, governance and stakeholder engagement are properly identified and aligned Solution Lifecycle Modelling Outcome Analysis Deployment and Monitoring Data and Processing Business and Governance Source: https://www.ukfinance.org.uk/system/files/Trust%2C%20Context%20and%20Regulation%20-%20Achieving%20more%20explainable%20AI%20in%20financial%20services.pdf Stories from the Financial Service AI Trenches: Lessons learned from building AI models in EY AI in Financial Services How does an experiment-driven disruptive technology such as AI look like in a highly-regulated industry?
  • 5. How do you train models for rich yet highly restricted data that could be difficult to acquire? 18 November 2020Page 5 Stories from the Financial Service AI Trenches: Lessons learned from building AI models in EY
  • 6. Use Case : Know Your Customer (KYC) Page 6 KYC Social Network Employment Information Self-certification Forms Biometric Data Legal Documents Open Banking Proof of Identity Digital Footprint KYC requires a lot of time consuming repetitive manual work that involves the processing of a variety of data sources. Ubiquity, variety of data sources, and complexity involved in cognitive tasks make it a very attractive use case for AI. Stories from the Financial Service AI Trenches: Lessons learned from building AI models in EY18 November 2020
  • 7. Use Case : Know Your Customer (KYC) Page 7 Form Field Detection 2 Handwritten Text Recognition 3 Data Synthesis 1 Stories from the Financial Service AI Trenches: Lessons learned from building AI models in EY18 November 2020
  • 8. Use Case : Know Your Customer (KYC) ► Data is scarce or highly restricted because of sensitive and personally-identifiable information ► SDLC and DevOps can be inadequate for ML development, consider MLOps ► Treat the scarcity of data as a technological and scientific problem ► When using synthetic or generic datasets, ensure that there’s a feedback mechanism for when live data becomes available 22 November 2020 Presentation titlePage 8
  • 9. How do you develop models when data from clients come from different geographies, have different legislations and cross- border restrictions? 18 November 2020Page 9 Stories from the Financial Service AI Trenches: Lessons learned from building AI models in EY
  • 10. Use Case: Multi-Jurisdiction Models Generic models and pipelines are reused, iterations produce bespoke models by incorporating country-specific data 22 November 2020 Presentation titlePage 10 Reusable Components Standard ML Pipeline: Base Model ► Common laws and treaties ► Similar industry trends and treatments ► Transactional trends ► Language models ► Common data model ► Generic dataset ► Regional market ► Cross regional market ► National market Base Model Country X Country Y Model Y v1Model X v2Model X v1 Model X v3 Model Y v2 retrain increment Country Z retrain Model Z v1 Model Y v3 MODELS XYZ ML Pipeline Iteration XYZ: Bespoke Model ► Hyperparameter Tuning ► Country-specific datasets and enrichment ► Additional categories and features
  • 11. 18 November 2020Page 11 Stories from the Financial Service AI Trenches: Lessons learned from building AI models in EY Multi-Jurisdiction Models Example – MLOps and AutoML Modelling Outcome Analysis Deployment and MonitoringData and Processing Data Prep Exploratory Data Analysis Feature Engineering Feature Selection Benchmark Metrics Model Serving (Inference) Drift Monitoring (inference) Model Build and AutoML Pipeline Hyperparameter Tuning Training CI/CD Model Serving (Train Pipeline) Retraining/ Rollback / Increment Data slicing Model Serving (Training Pipeline) Experimentation Feature Importance Drift Monitoring (Training) Model Serving (Inference) model is stale make predictions Human in the loop Consume/ Interface High confidence Low confidence Model Drift Monitoring (Data Signature) model is good • Create training (baseline) and inference dataset signatures from features • Create signatures from predictions, also called theories • Measure the distance of signatures • Population Stability Index : 𝑃𝑆𝐼 = ∑!(𝐴! − 𝐵!) ln "! #! {𝐴!, 𝐵! − 𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦 𝑐𝑜𝑢𝑛𝑡 𝑝𝑒𝑟 𝑓𝑒𝑎𝑡𝑢𝑟𝑒 𝑏𝑢𝑐𝑘𝑒𝑡} • Numerical Parametric (should pass normality, homoskedasticity): T-test • Numerical Non-parametric: Kruskal-Wallis, Wilcoxon, Kolmogorov, Mann-Whitney-U • Categorical Features and theory testing: Pearson’s Chi-squared test • Provide pass/warning/fail logic to trigger retraining, rollback, AutoML, reinforcement learning Training Dataset Inference Dataset Model features features predictions Inference Signature Score (Distance) Training Signature Data Augmentation
  • 12. Transfer learning and Model Finetuning 18 November 2020Page 12 Use Case: Multi-Jurisdiction Models Data drift monitoring and MLOps tools Reproducible end-to-end ML pipelines and AutoML Leveraging “human in the loop” with MLOps framework and online learning Enabling components for Multi-Jurisdiction and ML at scale Time from Technical and Business SMEs are valuable, a complementing operating model and tooling would be necessary to maximise value Building and deploying bespoke models for each jurisdiction is difficult to scale without an end-to-end MLOps platform Stories from the Financial Service AI Trenches: Lessons learned from building AI models in EY
  • 13. 18 November 2020Page 13 The appeal of using AI in FS lies in very rich data, the same reason that makes data very challenging to acquire. AI in FS usually involve clients in multiple jurisdictions, it is imperative to have MLOps framework and platform to develop ML at scale. Stories from the Financial Service AI Trenches: Lessons learned from building AI models in EY Key Takeaways
  • 14. Q&A 18 November 2020Page 14 Tim Santos Assistant Director | Global IT ► Global Client Technology AI ► MLOps Lead ► [email protected] Mustafa Somalya Assistant Director | Global IT ► Global Client Technology AI ► ML Experimentation Lead ► [email protected] Stories from the Financial Service AI Trenches: Lessons learned from building AI models in EY
  • 15. EY | Assurance | Tax | Strategy and Transactions | Consulting About EY EY is a global leader in assurance, tax, strategy, transaction and consulting services. The insights and quality services we deliver help build trust and confidence in the capital markets and in economies the world over. We develop outstanding leaders who team to deliver on our promises to all of our stakeholders. In so doing, we play a critical role in building a better working world for our people, for our clients and for our communities. EY refers to the global organization, and may refer to one or more, of the member firms of Ernst & Young Global Limited, each of which is a separate legal entity. Ernst & Young Global Limited, a UK company limited by guarantee, does not provide services to clients. Information about how EY collects and uses personal data and a description of the rights individuals have under data protection legislation are available via ey.com/privacy. For more information about our organization, please visit ey.com. This news release has been issued by EYGM Limited, a member of the global EY organization that also does not provide any services to clients. © 2020 EYGM Limited. All Rights Reserved. EYG no. ED MMYY This material has been prepared for general informational purposes only and is not intended to be relied upon as accounting, tax or other professional advice. Please refer to your advisors for specific advice.