SlideShare a Scribd company logo
AI Journey at
Vision Banco
Paraguay
Rubén Díaz
Data Scientist
Vision Banco
@rubuntu
#H2OWORLD
Rafael Coss
Community Maker
H2O.ai
@racoss
#H2OWORLD
AI journey at Vision Banco
Paraguay
Ruben Diaz
@rubuntu
Data Scientist, Vision Banco
Rafael Coss
@racoss
Community Maker, H2O.ai
#H2OWORLD
Agenda
Introduction
• Vision Banco
• Journey so far
Automatic Machine Learning 101
Journey to Deployment
Lessons Learned
Looking to the future
Tweet @h2oai @visionbanco @rubuntu @racoss #h2oworld
#H2OWORLD
Vision Banco
Who are we?
What are our ML use Cases?
What’s our env?
#H2OWORLD
Paraguay
Population: 6.811 million (2017)
Area: 406,752 km2 / 157,048 sq mi (4.1% of USA area )
Official languages: Spanish & Guarani
ROI: 2nd best (22%) in Latin America
Point of interest: Asuncion, Itaipu Dam , Jesuit Missions, Chaco, Pantanal
Exports:
• 1st global Electric power exporter
• 1st global Organic sugar exporter
• 2nd global Stevia exporter
• 3rd global Yerba Mate exporter (Ilex paraguaiensis)
• 4th global Soybean exporter
• 5th global Chia exporter
• 6th global Meat exporter
#H2OWORLD
• Paraguayan bank member of
"Global Alliance for Banking on Values"
• Top 10 in the country
• The largest number of agencies (95), non-
banking correspondents (2,492) and largest
employer of the financial system (1,758
employees) in Paraguay
• Inclusive: ⅓ of population have accounts
• 800,000+ customers
• Microfinance & SMBs
Tus metas nos inspiran / Your goals inspire us
#H2OWORLD
Vision Banco ML Journey
• 1st Generation Credit Scoring Models, using Logistic Regression
Models developed with IBM SPSS, deployed as a stored procedure on IBM Db2 for i (AS/400) database
• 2nd Generation, state of the art algos: Random Forest, GBM, etc.
Developed with SPSS, KNIME and R, exported to the standard PMML format and implemented as REST
web services using openscoring.io
• 3rd Generation. H2O.ai open source. More algos: XGBoost, Deep Learning, Ensembles, etc.
The open source H2O platform surprised us with its speed to train models. The migration to H2O
involved changing the deployment of models in PMML format to H2O’s POJOS & MOJOS
• 4th Generation. Auto ML with Driverless AI on IBM Power System AC922 server accelerated with
NVIDIA® Tesla® V100 GPUs
Each Step improved the Accuracy and Speed of Model Building & Deploying
#H2OWORLD
Vision Banco Use Cases
Risk Management
• Credit Scoring
• Default Prediction
• Fraud Detection
Business
• Propensity to Purchase (Predictive Lead Scoring)
• Customer Churn Prediction
• Customer segmentation
• Recommendation Engines
#H2OWORLD
Automatic Machine Learning: Driverless AI !!!
Joined the Driverless AI Beta circa November 2017
What is Driverless AI?
• Automates a large part of the Data Science process
POC
• Env: Cloud VM with GPU
• Scenario: Entered the AnalyticsVidhya.com contest "Data
Science Hackathon: Churn Prediction"
• Result: Surprisingly, I got 8th place!
https://www.linkedin.com/pulse/how-get-eighth-place-data-science-competition-using-driverless-diaz/
#H2OWORLD
Driverless AI
Features Targe
t
Data Quality and
Transformation
Modeling
Table
Model
Building
Model
Data Integration
+
Driverless AI:
Automates Data Science and ML Workflows
Confidential11 Confidential11
Automatic Machine Learning 101
SQL
Local
Amazon S3
HDFS
X Y
Automatic Model Optimization
Automatic
Scoring Pipeline
Machine learning
Interpretability
Deploy
Low-latency
Scoring to
Production
Modelling
Dataset
Model Recipes
• i.i.d. data
• Time-series
• More on the way
Advanced
Feature
Engineering
Algorithm
Model
Tuning+ +
Survival of the Fittest
Understand the data
shape, outliers,
missing values, etc.
Powered by GPU Acceleration
1 Drag and Drop Data
2 Automatic Visualization
Use best practice model recipes
and the power of high performance
computing to iterate across
thousands of possible models
including advanced feature
engineering and parameter tuning
3 Automatic Model Optimization
Deploy ultra-low latency
Python or Java Automatic
Scoring Pipelines that include
feature transformations and
models
4 Automatic Scoring Pipelines
Bring data in from
cloud, big data and
desktop systems
Google BigQuery
Azure Blog Storage
Snowflake
Model
Documentation
#H2OWORLD
Solution Architecture (Hardware)
The IBM Power System AC922 server
● Faster I/O - up to 5.6x more I/O bandwidth
than x86 servers
● The best GPUs - 2-6 NVIDIA® Tesla® V100
GPUs with NVLink
● Extraordinary CPUs - 2x POWER9 CPUs,
designed for AI
● Simplest AI architecture - Share RAM across
CPUs & GPUs
● Enterprise-ready - PowerAI DL frameworks
with IBM support
● Next Gen PCIe - PCIe Gen4 2x faster vs PCIe
Gen3 in x86
● Built for the world's biggest AI challenges
The best server for enterprise AI
Powered by GPU Acceleration
Confidential13
Infrastructure
Modeling Deployment
AI
Solutions
Data
Sources
Train
Test
Production
Data
Model
Mgt
Batch
Scoring
Real-time
Scoring
BI
Solutions
dev/ops
On-Prem Cloud
Machine Learning
Workflow
AI Architecture
Data
Prep
Confidential14
AI
Solutions
Model
MgtData
Prep
Infrastructure
Modeling Deployment
Data
Sources
Train
Test
Production
Data Batch
Scoring
Real-time
Scoring
BI
Solutions
dev/ops
On-Prem Cloud
Vision Banco AI Architecture
IBM Notes
Confidential15
AI App Scoring Flow (Model Deployment)
AI
Solutions
Real-time
Scoring
MOJO
Scoring Pipeline
FeaturesScore
Results
See: https://github.com/rubuntu/h2o_scorer
App
Captures
Features
Score
Take Action
1
2
3
4
5
Confidential16
Machine Learning Interpretability
Why Should I Trust Your Model?
Confidential17
Machine Learning Interpretability (cont.)
#H2OWORLD
Lesson Learned
• The automation of the process of Data Science reduces time
and "costs less money"
• It is very important that machine learning models are
interpretable to explain the decisions made by machine
learning algorithms to business people and even to the
company's customers
#H2OWORLD
Looking to the future
Use Cases
• Money laundering prevention
• Time series forecasting
• NLP
• Chatbots
• Voice / Sound recognition
• Image recognition
• Video detection
AI for good.
Democratize AI for Everyone.
H2O is the open leader in AI.
#H2OWORLD

More Related Content

PPTX
Martin Stein, G5 - Driving Marketing Performance with H2O Driverless AI - H2O...
PPTX
Tom Aliff, Equifax - Configurable Modeling for Maximizing Business Value - H2...
PPTX
Robert Coop, Stanley Black & Decker - Optimizing Manufacturing with Driverles...
PPTX
Krish Swamy + Balaji Gopalakrishnan, Wells Fargo - Building a World Class Dat...
PDF
Patrick Hall, H2O.ai - Human Friendly Machine Learning - H2O World San Francisco
PPTX
Automatic Model Documentation with H2O
PDF
A Look Under the Hood of H2O Driverless AI, Arno Candel - H2O World San Franc...
PDF
Introducción al Aprendizaje Automatico con H2O-3 (1)
Martin Stein, G5 - Driving Marketing Performance with H2O Driverless AI - H2O...
Tom Aliff, Equifax - Configurable Modeling for Maximizing Business Value - H2...
Robert Coop, Stanley Black & Decker - Optimizing Manufacturing with Driverles...
Krish Swamy + Balaji Gopalakrishnan, Wells Fargo - Building a World Class Dat...
Patrick Hall, H2O.ai - Human Friendly Machine Learning - H2O World San Francisco
Automatic Model Documentation with H2O
A Look Under the Hood of H2O Driverless AI, Arno Candel - H2O World San Franc...
Introducción al Aprendizaje Automatico con H2O-3 (1)

What's hot (20)

PPTX
Custom Machine Learning Recipes for the Enterprise
PDF
Human-Centered AI: Scalable, Interactive Tools for Interpretation and Attribu...
PDF
Get Behind the Wheel with H2O Driverless AI - Hands on Lab - H2O World San Fr...
PDF
Scalable Automatic Machine Learning with H2O
PDF
Pm.ais ummit 180917 final
PPTX
Keynote by Mike Gualtieri, Forrester Research - Making AI Happen Without Gett...
PPTX
Scaling & Managing Production Deployments with H2O ModelOps
PDF
Accelerate ML Deployment with H2O Driverless AI on AWS
PDF
Data Warehousing Trends
PDF
Using Apache Spark for Intelligent Services by Alexis Roos
PDF
Productionising Machine Learning Models
PDF
Seldon: Deploying Models at Scale
PPTX
Rahul Bhuman, Tech Mahindra - Truck roll prediction using Driverless AI - H2O...
PDF
Apply MLOps at Scale
PDF
Bring Your Own Recipes Hands-On Session
PPTX
Content Analytics Studio – The visualization, machine learning and applicatio...
PDF
TechKnow Fiesta 2021 - Powered by Amazon Web Service in September 2021
PPTX
Driverless AI - Arno Candel, H2O.ai
PDF
UX Analytics for Data-driven Product Development
PDF
Olivier Blais: Want to adopt AI in your business: good luck!
Custom Machine Learning Recipes for the Enterprise
Human-Centered AI: Scalable, Interactive Tools for Interpretation and Attribu...
Get Behind the Wheel with H2O Driverless AI - Hands on Lab - H2O World San Fr...
Scalable Automatic Machine Learning with H2O
Pm.ais ummit 180917 final
Keynote by Mike Gualtieri, Forrester Research - Making AI Happen Without Gett...
Scaling & Managing Production Deployments with H2O ModelOps
Accelerate ML Deployment with H2O Driverless AI on AWS
Data Warehousing Trends
Using Apache Spark for Intelligent Services by Alexis Roos
Productionising Machine Learning Models
Seldon: Deploying Models at Scale
Rahul Bhuman, Tech Mahindra - Truck roll prediction using Driverless AI - H2O...
Apply MLOps at Scale
Bring Your Own Recipes Hands-On Session
Content Analytics Studio – The visualization, machine learning and applicatio...
TechKnow Fiesta 2021 - Powered by Amazon Web Service in September 2021
Driverless AI - Arno Candel, H2O.ai
UX Analytics for Data-driven Product Development
Olivier Blais: Want to adopt AI in your business: good luck!
Ad

Similar to Ruben Diaz, Vision Banco + Rafael Coss, H2O ai + Luis Armenta, IBM - AI journey at Vision Banco (20)

PDF
Start Getting Your Feet Wet in Open Source Machine and Deep Learning
PPTX
ISV Showcase: End-to-end Machine Learning using H2O on Azure
PDF
Introducción al Machine Learning Automático
PPTX
Machine Learning for Smarter Apps - Jacksonville Meetup
PDF
H2O at BelgradeR Meetup
PDF
Belgrade R - Intro to H2O and Deep Water
PPTX
Auto ai for skillsfuture
PPTX
Project "Deep Water"
PPTX
AI and AutoML: Debunking Myths
PDF
H2o.ai presentation at 2nd Virtual Pydata Piraeus meetup
PDF
Driverless AI - Intro + Interactive Hands-on Lab
PDF
H2O PySparkling Water
PDF
Machine Learning on Google Cloud with H2O
PDF
H2O at Berlin R Meetup
PDF
Berlin R Meetup
PDF
ArnoCandelScalabledatascienceanddeeplearningwithh2o_gotochg
PDF
Your AI Transformation
PDF
Intro to Machine Learning with H2O and Python - Denver
PDF
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
PDF
[코세나, kosena] Auto ML, H2O.ai의 제조분야 AI 활용 사례
Start Getting Your Feet Wet in Open Source Machine and Deep Learning
ISV Showcase: End-to-end Machine Learning using H2O on Azure
Introducción al Machine Learning Automático
Machine Learning for Smarter Apps - Jacksonville Meetup
H2O at BelgradeR Meetup
Belgrade R - Intro to H2O and Deep Water
Auto ai for skillsfuture
Project "Deep Water"
AI and AutoML: Debunking Myths
H2o.ai presentation at 2nd Virtual Pydata Piraeus meetup
Driverless AI - Intro + Interactive Hands-on Lab
H2O PySparkling Water
Machine Learning on Google Cloud with H2O
H2O at Berlin R Meetup
Berlin R Meetup
ArnoCandelScalabledatascienceanddeeplearningwithh2o_gotochg
Your AI Transformation
Intro to Machine Learning with H2O and Python - Denver
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
[코세나, kosena] Auto ML, H2O.ai의 제조분야 AI 활용 사례
Ad

More from Sri Ambati (20)

PDF
H2O Label Genie Starter Track - Support Presentation
PDF
H2O.ai Agents : From Theory to Practice - Support Presentation
PDF
H2O Generative AI Starter Track - Support Presentation Slides.pdf
PDF
H2O Gen AI Ecosystem Overview - Level 1 - Slide Deck
PDF
An In-depth Exploration of Enterprise h2oGPTe Slide Deck
PDF
Intro to Enterprise h2oGPTe Presentation Slides
PDF
Enterprise h2o GPTe Learning Path Slide Deck
PDF
H2O Wave Course Starter - Presentation Slides
PDF
Large Language Models (LLMs) - Level 3 Slides
PDF
Data Science and Machine Learning Platforms (2024) Slides
PDF
Data Prep for H2O Driverless AI - Slides
PDF
H2O Cloud AI Developer Services - Slides (2024)
PDF
LLM Learning Path Level 2 - Presentation Slides
PDF
LLM Learning Path Level 1 - Presentation Slides
PDF
Hydrogen Torch - Starter Course - Presentation Slides
PDF
Presentation Resources - H2O Gen AI Ecosystem Overview - Level 2
PDF
H2O Driverless AI Starter Course - Slides and Assignments
PPTX
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
PPTX
Generative AI Masterclass - Model Risk Management.pptx
PDF
AI and the Future of Software Development: A Sneak Peek
H2O Label Genie Starter Track - Support Presentation
H2O.ai Agents : From Theory to Practice - Support Presentation
H2O Generative AI Starter Track - Support Presentation Slides.pdf
H2O Gen AI Ecosystem Overview - Level 1 - Slide Deck
An In-depth Exploration of Enterprise h2oGPTe Slide Deck
Intro to Enterprise h2oGPTe Presentation Slides
Enterprise h2o GPTe Learning Path Slide Deck
H2O Wave Course Starter - Presentation Slides
Large Language Models (LLMs) - Level 3 Slides
Data Science and Machine Learning Platforms (2024) Slides
Data Prep for H2O Driverless AI - Slides
H2O Cloud AI Developer Services - Slides (2024)
LLM Learning Path Level 2 - Presentation Slides
LLM Learning Path Level 1 - Presentation Slides
Hydrogen Torch - Starter Course - Presentation Slides
Presentation Resources - H2O Gen AI Ecosystem Overview - Level 2
H2O Driverless AI Starter Course - Slides and Assignments
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Generative AI Masterclass - Model Risk Management.pptx
AI and the Future of Software Development: A Sneak Peek

Recently uploaded (20)

PDF
Review of recent advances in non-invasive hemoglobin estimation
PPTX
Big Data Technologies - Introduction.pptx
PDF
GamePlan Trading System Review: Professional Trader's Honest Take
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Smarter Business Operations Powered by IoT Remote Monitoring
PPTX
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....
PDF
Chapter 2 Digital Image Fundamentals.pdf
PPTX
Telecom Fraud Prevention Guide | Hyperlink InfoSystem
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
CIFDAQ's Market Wrap: Ethereum Leads, Bitcoin Lags, Institutions Shift
PDF
Advanced IT Governance
PDF
CIFDAQ's Teaching Thursday: Moving Averages Made Simple
PDF
HCSP-Presales-Campus Network Planning and Design V1.0 Training Material-Witho...
PDF
Electronic commerce courselecture one. Pdf
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Newfamily of error-correcting codes based on genetic algorithms
PDF
solutions_manual_-_materials___processing_in_manufacturing__demargo_.pdf
PPTX
MYSQL Presentation for SQL database connectivity
Review of recent advances in non-invasive hemoglobin estimation
Big Data Technologies - Introduction.pptx
GamePlan Trading System Review: Professional Trader's Honest Take
Dropbox Q2 2025 Financial Results & Investor Presentation
Smarter Business Operations Powered by IoT Remote Monitoring
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....
Chapter 2 Digital Image Fundamentals.pdf
Telecom Fraud Prevention Guide | Hyperlink InfoSystem
NewMind AI Weekly Chronicles - August'25 Week I
20250228 LYD VKU AI Blended-Learning.pptx
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
CIFDAQ's Market Wrap: Ethereum Leads, Bitcoin Lags, Institutions Shift
Advanced IT Governance
CIFDAQ's Teaching Thursday: Moving Averages Made Simple
HCSP-Presales-Campus Network Planning and Design V1.0 Training Material-Witho...
Electronic commerce courselecture one. Pdf
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Newfamily of error-correcting codes based on genetic algorithms
solutions_manual_-_materials___processing_in_manufacturing__demargo_.pdf
MYSQL Presentation for SQL database connectivity

Ruben Diaz, Vision Banco + Rafael Coss, H2O ai + Luis Armenta, IBM - AI journey at Vision Banco

  • 1. AI Journey at Vision Banco Paraguay Rubén Díaz Data Scientist Vision Banco @rubuntu #H2OWORLD Rafael Coss Community Maker H2O.ai @racoss
  • 2. #H2OWORLD AI journey at Vision Banco Paraguay Ruben Diaz @rubuntu Data Scientist, Vision Banco Rafael Coss @racoss Community Maker, H2O.ai
  • 3. #H2OWORLD Agenda Introduction • Vision Banco • Journey so far Automatic Machine Learning 101 Journey to Deployment Lessons Learned Looking to the future Tweet @h2oai @visionbanco @rubuntu @racoss #h2oworld
  • 4. #H2OWORLD Vision Banco Who are we? What are our ML use Cases? What’s our env?
  • 5. #H2OWORLD Paraguay Population: 6.811 million (2017) Area: 406,752 km2 / 157,048 sq mi (4.1% of USA area ) Official languages: Spanish & Guarani ROI: 2nd best (22%) in Latin America Point of interest: Asuncion, Itaipu Dam , Jesuit Missions, Chaco, Pantanal Exports: • 1st global Electric power exporter • 1st global Organic sugar exporter • 2nd global Stevia exporter • 3rd global Yerba Mate exporter (Ilex paraguaiensis) • 4th global Soybean exporter • 5th global Chia exporter • 6th global Meat exporter
  • 6. #H2OWORLD • Paraguayan bank member of "Global Alliance for Banking on Values" • Top 10 in the country • The largest number of agencies (95), non- banking correspondents (2,492) and largest employer of the financial system (1,758 employees) in Paraguay • Inclusive: ⅓ of population have accounts • 800,000+ customers • Microfinance & SMBs Tus metas nos inspiran / Your goals inspire us
  • 7. #H2OWORLD Vision Banco ML Journey • 1st Generation Credit Scoring Models, using Logistic Regression Models developed with IBM SPSS, deployed as a stored procedure on IBM Db2 for i (AS/400) database • 2nd Generation, state of the art algos: Random Forest, GBM, etc. Developed with SPSS, KNIME and R, exported to the standard PMML format and implemented as REST web services using openscoring.io • 3rd Generation. H2O.ai open source. More algos: XGBoost, Deep Learning, Ensembles, etc. The open source H2O platform surprised us with its speed to train models. The migration to H2O involved changing the deployment of models in PMML format to H2O’s POJOS & MOJOS • 4th Generation. Auto ML with Driverless AI on IBM Power System AC922 server accelerated with NVIDIA® Tesla® V100 GPUs Each Step improved the Accuracy and Speed of Model Building & Deploying
  • 8. #H2OWORLD Vision Banco Use Cases Risk Management • Credit Scoring • Default Prediction • Fraud Detection Business • Propensity to Purchase (Predictive Lead Scoring) • Customer Churn Prediction • Customer segmentation • Recommendation Engines
  • 9. #H2OWORLD Automatic Machine Learning: Driverless AI !!! Joined the Driverless AI Beta circa November 2017 What is Driverless AI? • Automates a large part of the Data Science process POC • Env: Cloud VM with GPU • Scenario: Entered the AnalyticsVidhya.com contest "Data Science Hackathon: Churn Prediction" • Result: Surprisingly, I got 8th place! https://www.linkedin.com/pulse/how-get-eighth-place-data-science-competition-using-driverless-diaz/
  • 10. #H2OWORLD Driverless AI Features Targe t Data Quality and Transformation Modeling Table Model Building Model Data Integration + Driverless AI: Automates Data Science and ML Workflows
  • 11. Confidential11 Confidential11 Automatic Machine Learning 101 SQL Local Amazon S3 HDFS X Y Automatic Model Optimization Automatic Scoring Pipeline Machine learning Interpretability Deploy Low-latency Scoring to Production Modelling Dataset Model Recipes • i.i.d. data • Time-series • More on the way Advanced Feature Engineering Algorithm Model Tuning+ + Survival of the Fittest Understand the data shape, outliers, missing values, etc. Powered by GPU Acceleration 1 Drag and Drop Data 2 Automatic Visualization Use best practice model recipes and the power of high performance computing to iterate across thousands of possible models including advanced feature engineering and parameter tuning 3 Automatic Model Optimization Deploy ultra-low latency Python or Java Automatic Scoring Pipelines that include feature transformations and models 4 Automatic Scoring Pipelines Bring data in from cloud, big data and desktop systems Google BigQuery Azure Blog Storage Snowflake Model Documentation
  • 12. #H2OWORLD Solution Architecture (Hardware) The IBM Power System AC922 server ● Faster I/O - up to 5.6x more I/O bandwidth than x86 servers ● The best GPUs - 2-6 NVIDIA® Tesla® V100 GPUs with NVLink ● Extraordinary CPUs - 2x POWER9 CPUs, designed for AI ● Simplest AI architecture - Share RAM across CPUs & GPUs ● Enterprise-ready - PowerAI DL frameworks with IBM support ● Next Gen PCIe - PCIe Gen4 2x faster vs PCIe Gen3 in x86 ● Built for the world's biggest AI challenges The best server for enterprise AI Powered by GPU Acceleration
  • 15. Confidential15 AI App Scoring Flow (Model Deployment) AI Solutions Real-time Scoring MOJO Scoring Pipeline FeaturesScore Results See: https://github.com/rubuntu/h2o_scorer App Captures Features Score Take Action 1 2 3 4 5
  • 18. #H2OWORLD Lesson Learned • The automation of the process of Data Science reduces time and "costs less money" • It is very important that machine learning models are interpretable to explain the decisions made by machine learning algorithms to business people and even to the company's customers
  • 19. #H2OWORLD Looking to the future Use Cases • Money laundering prevention • Time series forecasting • NLP • Chatbots • Voice / Sound recognition • Image recognition • Video detection
  • 20. AI for good. Democratize AI for Everyone. H2O is the open leader in AI.