SlideShare a Scribd company logo
Toward Continual Learning
on the Edge
Talk @ University of Pisa
14-02-2020
Vincenzo Lomonaco
vincenzo.lomonaco@unibo.it
Postdoctoral Researcher @ University of Bologna
Supervisor: Davide Maltoni
About me
• AI/Continual Learning Post-Doc @
University of Bologna
• Co-Founder, President and Co-Director
of Research at ContinualAI.org
• Teaching Assistant of the courses
Machine Learning and Computer
Architectures @ UniBo
• Author andTechnical reviewer of the
online course Deep Learning with R and
book R Deep Learning Essentials.
• Co-Founder and Board Member of
AIforPeople.org
What’s ContinualAI?
• ContinualAI is a non-profit research organization and
the largest research community on Continual Learning
for AI.
• It counts more than 600+ members in 17 different
time-zones and from top-notch research institutions.
• Learn more about ContinualAI at www.continualai.org
Machine Intelligence @ BioLab
Davide Maltoni
Vincenzo Lomonaco Lorenzo Pellegrini Gabriele Graffieti
Outline
1. Introduction and Motivation
2. Rehearsal-free and Task-agnostic
Real-Time Continual Learning
3. Latent Replay for Real-Time Continual Learning
4. Real-world Deployment on Smartphone Devices
Introduction and
Motivation
Machine Learning: State-of-the-art
• Deep Learning holds state-of-the-art performances in
many tasks.
• Mainly supervised training with huge and fixed datasets.
The Curse of Dimensionality
# of possible 227x227 RGB images
The Curse of Dimensionality
# of possible 227x227 RGB images
The Curse of Dimensionality
# of possible 227x227 RGB images
How can we improve AI
scalability and adaptability?
(Hence ubiquitousness and autonomy)
Continual Learning
Continual Learning
Continual Learning (CL)
• Higher and realistic time-scale where data (and tasks)
become available only during time.
• No access to previously encountered data.
• Constant computational and memory resources.
• Incremental development of ever more complex
knowledge and skills.
Practical Applications
• Embedded systems and Robotics
(+Privacy, +efficiency, +fast adaptation, +on the edge,
-Internet connection)
• AutoML and CI systems for AI models
(+scalability, +efficiency, +fast adaptation, -energy
consumption, -$$$)
• Bias Removal
(+Security patches, +fairness patches, +fast update)
The Stability-Plasticity Dilemma
Stability-Plasticity Dilemma:
• Remember past concepts
• Learn new concepts
• Generalize
Biggest Problem in Deep Learning:
• Catastrophic Forgetting
Continual Learning: Approaches
T. Lesort,V. Lomonaco et al. Continual Learning for Robotics. pre-print arxiv arXiv:1907.00182 .
Rehearsal-free and Task-agnostic
Real-Time Continual Learning
3 Short-term Research Objectives for CL
1. Rehearsal-Free: Raw data cannot be stored and re-used
for rehearsal.
2. Task Agnostic: No use of supplementary task supervised
signal “t”.
3. Real-Time: Bounded computational and memory
overheads, efficient, real-time updates.
T. Lesort,V. Lomonaco et al. Continual Learning for Robotics. pre-print arxiv arXiv:1907.00182 .
Task Agnostic Continual Learning
1. New Instances (NI)
2. New Classes (NC)
3. New Instances and Classes (NIC)
Initial Batch Incremental Batches
Τ
. . .
CORe50Website
Dataset, Benchmark, code and additional
information freely available at:
vlomonaco.github.io/core50
LomonacoV. and Maltoni D. CORe50: a New Dataset and Benchmark for Continuous Object Recognition. CoRL2017.
LomonacoV. and Maltoni D. CORe50: a New Dataset and Benchmark for Continuous Object Recognition. CoRL2017.
CORe50: aVideo Benchmark for CL
and Object Recognition/Detection
# Images 164,866
Format RGB-D
Image size 350x350
128x128
# Categories 10
# Obj. x Cat. 5
# Sessions 11
# img. x Sess. ~300
# Outdoor Sess. 3
Acquisition Sett. Hand held
LomonacoV. and Maltoni D. CORe50: a New Dataset and Benchmark for Continuous Object Recognition. CoRL2017.
CORe50: aVideo Benchmark for CL
and Object Recognition/Detection
Fine-Grained Continual Learning
LomonacoV., Maltoni D., Pellegrini L. Fine-Grained Continual Learning. Preprint arxiv arXiv:1907.03799, 2019.
AR-1*
Rehearsal-free andTask Agnostic
Real-Time Continual Learning
Maltoni D. and LomonacoV. Continuous Learning in Single-Incremental-Task Scenarios. Neural Networks Journal, 2019.
LomonacoV., Maltoni D., Pellegrini L. Fine-Grained Continual Learning. Preprint arxiv arXiv:1907.03799, 2019.
AR-1*: Overview (with MobileNet-V1)
Data layer
Output layer (classes)
Low-level
generic
features
Class specific
discriminative
features
LomonacoV., Maltoni D., Pellegrini L. Fine-Grained Continual Learning. Preprint arxiv arXiv:1907.03799, 2019.
AR-1*: Supervised / Unsupervised
Pre-Training Phase
Data layer
Output layer (classes)
Low-level
generic
features
Class specific
discriminative
features
● Supervised or
Unsupervised
Pre-Training from
ImageNet.
● Slowly Fine-tuned or
kept fixed.
● future direction:
unsupervised
co-training from
scratch.
LomonacoV., Maltoni D., Pellegrini L. Fine-Grained Continual Learning. Preprint arxiv arXiv:1907.03799, 2019.
AR-1*: Regularization Phase
Data layer
Output layer (classes)
Low-level
generic
features
Class specific
discriminative
features
LomonacoV., Maltoni D., Pellegrini L. Fine-Grained Continual Learning. Preprint arxiv arXiv:1907.03799, 2019.
AR-1*: Regularization Phase
Data layer
Output layer (classes)
Low-level
generic
features
Class specific
discriminative
features
● Computational
Efficient (independent
from the number of
training batches)
● Just one Fisher matrix
(running sum + max
clip)
● Importance of Batch
ReNormalization
LomonacoV., Maltoni D., Pellegrini L. Fine-Grained Continual Learning. Preprint arxiv arXiv:1907.03799, 2019.
AR-1*: Architectural Phase
Data layer
Output layer (classes)
Low-level
generic
features
Class specific
discriminative
features
● CWR*: generalization of
CWR+ to handle
agnostically NI, NC and
NIC settings
● Dual-Memory system for
memory consolidation.
● Based on zero-init for new
classes, weights
consolidation and
finetuning for already
encountered classes.
LomonacoV., Maltoni D., Pellegrini L. Fine-Grained Continual Learning. Preprint arxiv arXiv:1907.03799, 2019.
CORe50 - NICv2 Results
LomonacoV., Maltoni D., Pellegrini L. Fine-Grained Continual Learning. Preprint arxiv arXiv:1907.03799, 2019.
CORe50 - NICv2 Results
LomonacoV., Maltoni D., Pellegrini L. Fine-Grained Continual Learning. Preprint arxiv arXiv:1907.03799, 2019.
● AR1* has a speed-up
of x69 w.r.t. the
Cumulative Strategy.
● AR1* is x393 more
memory efficient than
the Cumulative
strategy.
● AR1* computation and
memory overhead is
fixed and independent
from the number of
training batches.
Latent Replay for Real-Time
Continual Learning
AR-1*: Closing the Accuracy Gap with
Latent Replay
Data layer
Output layer (classes)
Low-level
generic
features
Class specific
discriminative
features
Backwardpass
(allpatterns)
Forwardpass
(allpatterns)
Forwardpass
(nativepatterns)
Concat
External
storage
(rehearsal
patterns)
(at minibatch
level)
Pellegrini L., Graffieti G. , LomonacoV. and Maltoni D. Latent Replay for Real-Time Continual Learning.To be published.
Backwardpass
(nativepatterns)
AR-1*: Closing the Accuracy Gap with
Latent Replay
Pellegrini L., Graffieti G. , LomonacoV. and Maltoni D. Latent Replay for Real-Time Continual Learning.To be published.
AR-1*: Closing the Accuracy Gap with
Latent Replay
Pellegrini L., Graffieti G. , LomonacoV. and Maltoni D. Latent Replay for Real-Time Continual Learning.To be published.
AR-1*: Sparse Representations
● Imposing Sparsity of
the activactivation
does not affect
accuracy from ~55% to
~35%.
● It has been shown that
sparsity may help the
CL process.
● Less memory overhead
for latent replay.
Pellegrini L., Graffieti G. , LomonacoV. and Maltoni D. Latent Replay for Real-Time Continual Learning.To be published.
Real-world Deployment on
Smartphone Devices
CORe Android App
● AR1*free with Latent
Replay (RM=500).
● 20 sec to gather new
session images (at 5 fps).
● Near real-time training
updates CPUs-only
(about 1 second).
● Inference time of about
5fps without any specific
optimization.
Pellegrini L., Graffieti G. , LomonacoV. and Maltoni D. Latent Replay for Real-Time Continual Learning.To be published.
CORe Android App
Pellegrini L., Graffieti G. , LomonacoV. and Maltoni D. Latent Replay for Real-Time Continual Learning.To be published.
FutureWorks and Research Direction
1. Latent Generative Replay
2. Lowering the amount of Supervision (Unsupervised
Reinforcement Learning, Active Learning)
3. Infer or make use of the sparse “task signal” (context
modulation)
4. Sequence Learning/ Temporal Coherence Integration
5. Improve robustness in real-world embedded
applications (Smartphone devices, Nao Robot, …)
Maltoni D. and LomonacoV. Semi-SupervisedTuning fromTemporal Coherence. ICPR 2016.
LomonacoV., Desai K., Maltoni D. and Culurciello, E. Continual Reinforcement Learning in 3D non-stationary
environments. preprint arxiv arXiv:1905.10112, 2019.
AR-1*: Closing the Accuracy Gap with
Latent Generative Replay
● Two levels (category + classes)
● Dense connections, two losses
● Supervised pre-training
on a large dataset (e.g.
Imagenet).
● Unsupervised slow
tuning based on i)
temporal coherence or
ii) reconstruction of
input.
● Supervised training with latent
rehearsal
● Regularization by temporal
coherence (category level?)
Generative
Model
Pellegrini L., Graffieti G. , LomonacoV. and Maltoni D. Latent Replay for Real-Time Continual Learning.To be published.
Backwardpass
(allpatterns)
Forwardpass
(allpatterns)
Forwardpass
(nativepatterns)
Concat (at minibatch
level)
Backwardpass
(nativepatterns)
Questions?
Talk @ University of Pisa
14-02-2020
Vincenzo Lomonaco
vincenzo.lomonaco@unibo.it
Postdoctoral Researcher @ University of Bologna
Supervisor: Davide Maltoni
Questions?
Talk @ University of Pisa
14-02-2020
Vincenzo Lomonaco
vincenzo.lomonaco@unibo.it
Postdoctoral Researcher @ University of Bologna
Supervisor: Davide Maltoni

More Related Content

What's hot (20)

PDF
Self-supervised Learning for Next-Generation Industry-level Autonomous Drivin...
Gabriele Graffieti
 
PDF
Continual Learning over Small non-i.i.d. Batches of Natural Video Streams
Gabriele Graffieti
 
PDF
Self-supervised Visual Learning 2020 - Xavier Giro-i-Nieto - UPC Barcelona
Universitat Politècnica de Catalunya
 
DOC
Module handout for COM839 - Intelligent Systems [Word format]
butest
 
DOC
COM623M1.doc.doc
butest
 
PDF
Multiple Object Tracking - Laura Leal-Taixe - UPC Barcelona 2018
Universitat Politècnica de Catalunya
 
PDF
MILA DL & RL summer school highlights
Natalia Díaz Rodríguez
 
PPTX
MetaScience: Holistic Approach for Research Modeling and Analysis
Jordi Cabot
 
PDF
Virtual World simulations to support Robot-Mediated Interaction
Michael Vallance
 
PDF
Animal neurophysiology virtual lab: Pedagogical requirements and technologica...
Nicolas Casel
 
PPT
课堂讲义(最后更新:2009-9-25)
butest
 
PDF
Technical computing in Julia
Jiahao Chen
 
PDF
Lecture 3: RNNs - Full Stack Deep Learning - Spring 2021
Sergey Karayev
 
PPTX
Ai overview
Serendipity Seraph
 
PDF
Successes and Frontiers of Deep Learning
Sebastian Ruder
 
PDF
Deep learning and applications in non-cognitive domains II
Deakin University
 
DOCX
Gate
narmathamohan
 
PPTX
Learning End to End
Thilo Stadelmann
 
PPTX
Visualization of Deep Learning
YaminiAlapati1
 
PDF
A Hidden Markov Model Approach to Predict Students' Actions in an Adaptive an...
Simon Bolivar University
 
Self-supervised Learning for Next-Generation Industry-level Autonomous Drivin...
Gabriele Graffieti
 
Continual Learning over Small non-i.i.d. Batches of Natural Video Streams
Gabriele Graffieti
 
Self-supervised Visual Learning 2020 - Xavier Giro-i-Nieto - UPC Barcelona
Universitat Politècnica de Catalunya
 
Module handout for COM839 - Intelligent Systems [Word format]
butest
 
COM623M1.doc.doc
butest
 
Multiple Object Tracking - Laura Leal-Taixe - UPC Barcelona 2018
Universitat Politècnica de Catalunya
 
MILA DL & RL summer school highlights
Natalia Díaz Rodríguez
 
MetaScience: Holistic Approach for Research Modeling and Analysis
Jordi Cabot
 
Virtual World simulations to support Robot-Mediated Interaction
Michael Vallance
 
Animal neurophysiology virtual lab: Pedagogical requirements and technologica...
Nicolas Casel
 
课堂讲义(最后更新:2009-9-25)
butest
 
Technical computing in Julia
Jiahao Chen
 
Lecture 3: RNNs - Full Stack Deep Learning - Spring 2021
Sergey Karayev
 
Ai overview
Serendipity Seraph
 
Successes and Frontiers of Deep Learning
Sebastian Ruder
 
Deep learning and applications in non-cognitive domains II
Deakin University
 
Learning End to End
Thilo Stadelmann
 
Visualization of Deep Learning
YaminiAlapati1
 
A Hidden Markov Model Approach to Predict Students' Actions in an Adaptive an...
Simon Bolivar University
 

Similar to Toward Continual Learning on the Edge (20)

PPTX
Continual Learning with Deep Architectures Workshop @ Computer VISIONers Conf...
Vincenzo Lomonaco
 
PDF
Continual/Lifelong Learning with Deep Architectures, Vincenzo Lomonaco
Data Science Milan
 
PDF
IntroML_1_Introduction_Tagged.pdf
Elio Laureano
 
PDF
IntroML_1_Introduction
Elio Laureano
 
PDF
Deep Learning: a birds eye view
Roelof Pieters
 
PPTX
Deep Learning @ ZHAW Datalab (with Mark Cieliebak & Yves Pauchard)
Thilo Stadelmann
 
PPTX
Remote practical-focused tutorials
Jon Rosewell
 
PPTX
Program Detail - Mastering AI Bootcamp_v2.pptx
khaled125087
 
PDF
Deep Learning, an interactive introduction for NLP-ers
Roelof Pieters
 
PDF
Introduction to Python
MuhammadBakri13
 
PDF
Online Python Resources
Jonathan Fine
 
DOCX
A Decision Tree based Recommendation System for Tourists.docx
spub1985
 
DOCX
Obj report
Manish Raghav
 
PPTX
Python and BIG Data analytics | Python Fundamentals | Python Architecture
Skillspeed
 
PDF
“Understand the Multimodal World with Minimal Supervision,” a Keynote Present...
Edge AI and Vision Alliance
 
PPTX
01 zero lecture lovely professional university CSE111.pptx
mexebi1971
 
PDF
Unit1: Introduction to AI
Tekendra Nath Yogi
 
PDF
BOIL: Towards Representation Change for Few-shot Learning
Hyungjun Yoo
 
PPTX
Thomas Wolf "Transfer learning in NLP"
Fwdays
 
PDF
Robot Learning with Structured Knowledge And Richer Sensing
Akihiko Yamaguchi
 
Continual Learning with Deep Architectures Workshop @ Computer VISIONers Conf...
Vincenzo Lomonaco
 
Continual/Lifelong Learning with Deep Architectures, Vincenzo Lomonaco
Data Science Milan
 
IntroML_1_Introduction_Tagged.pdf
Elio Laureano
 
IntroML_1_Introduction
Elio Laureano
 
Deep Learning: a birds eye view
Roelof Pieters
 
Deep Learning @ ZHAW Datalab (with Mark Cieliebak & Yves Pauchard)
Thilo Stadelmann
 
Remote practical-focused tutorials
Jon Rosewell
 
Program Detail - Mastering AI Bootcamp_v2.pptx
khaled125087
 
Deep Learning, an interactive introduction for NLP-ers
Roelof Pieters
 
Introduction to Python
MuhammadBakri13
 
Online Python Resources
Jonathan Fine
 
A Decision Tree based Recommendation System for Tourists.docx
spub1985
 
Obj report
Manish Raghav
 
Python and BIG Data analytics | Python Fundamentals | Python Architecture
Skillspeed
 
“Understand the Multimodal World with Minimal Supervision,” a Keynote Present...
Edge AI and Vision Alliance
 
01 zero lecture lovely professional university CSE111.pptx
mexebi1971
 
Unit1: Introduction to AI
Tekendra Nath Yogi
 
BOIL: Towards Representation Change for Few-shot Learning
Hyungjun Yoo
 
Thomas Wolf "Transfer learning in NLP"
Fwdays
 
Robot Learning with Structured Knowledge And Richer Sensing
Akihiko Yamaguchi
 
Ad

More from Vincenzo Lomonaco (14)

PDF
2023-08-22 CoLLAs Tutorial - Beyond CIL.pdf
Vincenzo Lomonaco
 
PDF
Continual Learning with Deep Architectures - Tutorial ICML 2021
Vincenzo Lomonaco
 
PDF
Continual Learning for Robotics
Vincenzo Lomonaco
 
PDF
Don't forget, there is more than forgetting: new metrics for Continual Learni...
Vincenzo Lomonaco
 
PPTX
CORe50: a New Dataset and Benchmark for Continual Learning and Object Recogni...
Vincenzo Lomonaco
 
PPTX
Continuous Learning with Deep Architectures
Vincenzo Lomonaco
 
PDF
CORe50: a New Dataset and Benchmark for Continuous Object Recognition Poster
Vincenzo Lomonaco
 
PPTX
Continuous Unsupervised Training of Deep Architectures
Vincenzo Lomonaco
 
PDF
Deep Learning for Computer Vision: A comparision between Convolutional Neural...
Vincenzo Lomonaco
 
PDF
Deep Learning for Computer Vision: A comparision between Convolutional Neural...
Vincenzo Lomonaco
 
PDF
A Framework for Deadlock Detection in Java
Vincenzo Lomonaco
 
PDF
Deep Learning libraries and first experiments with Theano
Vincenzo Lomonaco
 
PDF
Word2vec on the italian language: first experiments
Vincenzo Lomonaco
 
PDF
Machine Learning for Automated Reasoning: An Overview
Vincenzo Lomonaco
 
2023-08-22 CoLLAs Tutorial - Beyond CIL.pdf
Vincenzo Lomonaco
 
Continual Learning with Deep Architectures - Tutorial ICML 2021
Vincenzo Lomonaco
 
Continual Learning for Robotics
Vincenzo Lomonaco
 
Don't forget, there is more than forgetting: new metrics for Continual Learni...
Vincenzo Lomonaco
 
CORe50: a New Dataset and Benchmark for Continual Learning and Object Recogni...
Vincenzo Lomonaco
 
Continuous Learning with Deep Architectures
Vincenzo Lomonaco
 
CORe50: a New Dataset and Benchmark for Continuous Object Recognition Poster
Vincenzo Lomonaco
 
Continuous Unsupervised Training of Deep Architectures
Vincenzo Lomonaco
 
Deep Learning for Computer Vision: A comparision between Convolutional Neural...
Vincenzo Lomonaco
 
Deep Learning for Computer Vision: A comparision between Convolutional Neural...
Vincenzo Lomonaco
 
A Framework for Deadlock Detection in Java
Vincenzo Lomonaco
 
Deep Learning libraries and first experiments with Theano
Vincenzo Lomonaco
 
Word2vec on the italian language: first experiments
Vincenzo Lomonaco
 
Machine Learning for Automated Reasoning: An Overview
Vincenzo Lomonaco
 
Ad

Recently uploaded (20)

PDF
Control and coordination Class 10 Chapter 6
LataHolkar
 
PDF
Pulsar Sparking: What if mountains on the surface?
Sérgio Sacani
 
PDF
Discovery and dynamics of a Sedna-like object with a perihelion of 66 au
Sérgio Sacani
 
PPT
1a. Basic Principles of Medical Microbiology Part 2 [Autosaved].ppt
separatedwalk
 
PPTX
Q1_Science 8_Week4-Day 5.pptx science re
AizaRazonado
 
PPTX
CARBOHYDRATES METABOLSIM, UNIT 2, B Pharm II SEMESTER, BIOCHEMISTRY
ARUN KUMAR
 
PPTX
Internal Capsule_Divisions_fibres_lesions
muralinath2
 
PPTX
Pirimidinas_2025_Curso Ácidos nucleicos. Cinvestav
lalvarezmex
 
PDF
Quorum Sensing and Microbial Communication
Prachi Virat
 
PPTX
Presentation on the LANGERHANS CELLS.pptx
drnikitabaheti
 
PPTX
mirna_2025_clase_genética_cinvestav_Dralvarez
lalvarezmex
 
PPTX
Role of GIS in precision farming.pptx
BikramjitDeuri
 
PDF
A deep Search for Ethylene Glycol and Glycolonitrile in the V883 Ori Protopla...
Sérgio Sacani
 
PPTX
The Toxic Effects of Aflatoxin B1 and Aflatoxin M1 on Kidney through Regulati...
OttokomaBonny
 
PPT
1. Basic Principles of Medical Microbiology Part 1.ppt
separatedwalk
 
PPTX
Metabolismo de Purinas_2025_Luis Alvarez_Biomoleculas 2
lalvarezmex
 
DOCX
Echoes_of_Andromeda_Partial (1).docx9989
yakshitkrishnia5a3
 
PPTX
Quality control test for plastic & metal.pptx
shrutipandit17
 
PPTX
Home Garden as a Component of Agroforestry system : A survey-based Study
AkhangshaRoy
 
PPT
Grade_9_Science_Atomic_S_t_r_u_cture.ppt
QuintReynoldDoble
 
Control and coordination Class 10 Chapter 6
LataHolkar
 
Pulsar Sparking: What if mountains on the surface?
Sérgio Sacani
 
Discovery and dynamics of a Sedna-like object with a perihelion of 66 au
Sérgio Sacani
 
1a. Basic Principles of Medical Microbiology Part 2 [Autosaved].ppt
separatedwalk
 
Q1_Science 8_Week4-Day 5.pptx science re
AizaRazonado
 
CARBOHYDRATES METABOLSIM, UNIT 2, B Pharm II SEMESTER, BIOCHEMISTRY
ARUN KUMAR
 
Internal Capsule_Divisions_fibres_lesions
muralinath2
 
Pirimidinas_2025_Curso Ácidos nucleicos. Cinvestav
lalvarezmex
 
Quorum Sensing and Microbial Communication
Prachi Virat
 
Presentation on the LANGERHANS CELLS.pptx
drnikitabaheti
 
mirna_2025_clase_genética_cinvestav_Dralvarez
lalvarezmex
 
Role of GIS in precision farming.pptx
BikramjitDeuri
 
A deep Search for Ethylene Glycol and Glycolonitrile in the V883 Ori Protopla...
Sérgio Sacani
 
The Toxic Effects of Aflatoxin B1 and Aflatoxin M1 on Kidney through Regulati...
OttokomaBonny
 
1. Basic Principles of Medical Microbiology Part 1.ppt
separatedwalk
 
Metabolismo de Purinas_2025_Luis Alvarez_Biomoleculas 2
lalvarezmex
 
Echoes_of_Andromeda_Partial (1).docx9989
yakshitkrishnia5a3
 
Quality control test for plastic & metal.pptx
shrutipandit17
 
Home Garden as a Component of Agroforestry system : A survey-based Study
AkhangshaRoy
 
Grade_9_Science_Atomic_S_t_r_u_cture.ppt
QuintReynoldDoble
 

Toward Continual Learning on the Edge

  • 1. Toward Continual Learning on the Edge Talk @ University of Pisa 14-02-2020 Vincenzo Lomonaco [email protected] Postdoctoral Researcher @ University of Bologna Supervisor: Davide Maltoni
  • 2. About me • AI/Continual Learning Post-Doc @ University of Bologna • Co-Founder, President and Co-Director of Research at ContinualAI.org • Teaching Assistant of the courses Machine Learning and Computer Architectures @ UniBo • Author andTechnical reviewer of the online course Deep Learning with R and book R Deep Learning Essentials. • Co-Founder and Board Member of AIforPeople.org
  • 3. What’s ContinualAI? • ContinualAI is a non-profit research organization and the largest research community on Continual Learning for AI. • It counts more than 600+ members in 17 different time-zones and from top-notch research institutions. • Learn more about ContinualAI at www.continualai.org
  • 4. Machine Intelligence @ BioLab Davide Maltoni Vincenzo Lomonaco Lorenzo Pellegrini Gabriele Graffieti
  • 5. Outline 1. Introduction and Motivation 2. Rehearsal-free and Task-agnostic Real-Time Continual Learning 3. Latent Replay for Real-Time Continual Learning 4. Real-world Deployment on Smartphone Devices
  • 7. Machine Learning: State-of-the-art • Deep Learning holds state-of-the-art performances in many tasks. • Mainly supervised training with huge and fixed datasets.
  • 8. The Curse of Dimensionality # of possible 227x227 RGB images
  • 9. The Curse of Dimensionality # of possible 227x227 RGB images
  • 10. The Curse of Dimensionality # of possible 227x227 RGB images
  • 11. How can we improve AI scalability and adaptability? (Hence ubiquitousness and autonomy)
  • 14. Continual Learning (CL) • Higher and realistic time-scale where data (and tasks) become available only during time. • No access to previously encountered data. • Constant computational and memory resources. • Incremental development of ever more complex knowledge and skills.
  • 15. Practical Applications • Embedded systems and Robotics (+Privacy, +efficiency, +fast adaptation, +on the edge, -Internet connection) • AutoML and CI systems for AI models (+scalability, +efficiency, +fast adaptation, -energy consumption, -$$$) • Bias Removal (+Security patches, +fairness patches, +fast update)
  • 16. The Stability-Plasticity Dilemma Stability-Plasticity Dilemma: • Remember past concepts • Learn new concepts • Generalize Biggest Problem in Deep Learning: • Catastrophic Forgetting
  • 17. Continual Learning: Approaches T. Lesort,V. Lomonaco et al. Continual Learning for Robotics. pre-print arxiv arXiv:1907.00182 .
  • 19. 3 Short-term Research Objectives for CL 1. Rehearsal-Free: Raw data cannot be stored and re-used for rehearsal. 2. Task Agnostic: No use of supplementary task supervised signal “t”. 3. Real-Time: Bounded computational and memory overheads, efficient, real-time updates. T. Lesort,V. Lomonaco et al. Continual Learning for Robotics. pre-print arxiv arXiv:1907.00182 .
  • 20. Task Agnostic Continual Learning 1. New Instances (NI) 2. New Classes (NC) 3. New Instances and Classes (NIC) Initial Batch Incremental Batches Τ . . .
  • 21. CORe50Website Dataset, Benchmark, code and additional information freely available at: vlomonaco.github.io/core50 LomonacoV. and Maltoni D. CORe50: a New Dataset and Benchmark for Continuous Object Recognition. CoRL2017.
  • 22. LomonacoV. and Maltoni D. CORe50: a New Dataset and Benchmark for Continuous Object Recognition. CoRL2017. CORe50: aVideo Benchmark for CL and Object Recognition/Detection
  • 23. # Images 164,866 Format RGB-D Image size 350x350 128x128 # Categories 10 # Obj. x Cat. 5 # Sessions 11 # img. x Sess. ~300 # Outdoor Sess. 3 Acquisition Sett. Hand held LomonacoV. and Maltoni D. CORe50: a New Dataset and Benchmark for Continuous Object Recognition. CoRL2017. CORe50: aVideo Benchmark for CL and Object Recognition/Detection
  • 24. Fine-Grained Continual Learning LomonacoV., Maltoni D., Pellegrini L. Fine-Grained Continual Learning. Preprint arxiv arXiv:1907.03799, 2019.
  • 25. AR-1* Rehearsal-free andTask Agnostic Real-Time Continual Learning Maltoni D. and LomonacoV. Continuous Learning in Single-Incremental-Task Scenarios. Neural Networks Journal, 2019. LomonacoV., Maltoni D., Pellegrini L. Fine-Grained Continual Learning. Preprint arxiv arXiv:1907.03799, 2019.
  • 26. AR-1*: Overview (with MobileNet-V1) Data layer Output layer (classes) Low-level generic features Class specific discriminative features LomonacoV., Maltoni D., Pellegrini L. Fine-Grained Continual Learning. Preprint arxiv arXiv:1907.03799, 2019.
  • 27. AR-1*: Supervised / Unsupervised Pre-Training Phase Data layer Output layer (classes) Low-level generic features Class specific discriminative features ● Supervised or Unsupervised Pre-Training from ImageNet. ● Slowly Fine-tuned or kept fixed. ● future direction: unsupervised co-training from scratch. LomonacoV., Maltoni D., Pellegrini L. Fine-Grained Continual Learning. Preprint arxiv arXiv:1907.03799, 2019.
  • 28. AR-1*: Regularization Phase Data layer Output layer (classes) Low-level generic features Class specific discriminative features LomonacoV., Maltoni D., Pellegrini L. Fine-Grained Continual Learning. Preprint arxiv arXiv:1907.03799, 2019.
  • 29. AR-1*: Regularization Phase Data layer Output layer (classes) Low-level generic features Class specific discriminative features ● Computational Efficient (independent from the number of training batches) ● Just one Fisher matrix (running sum + max clip) ● Importance of Batch ReNormalization LomonacoV., Maltoni D., Pellegrini L. Fine-Grained Continual Learning. Preprint arxiv arXiv:1907.03799, 2019.
  • 30. AR-1*: Architectural Phase Data layer Output layer (classes) Low-level generic features Class specific discriminative features ● CWR*: generalization of CWR+ to handle agnostically NI, NC and NIC settings ● Dual-Memory system for memory consolidation. ● Based on zero-init for new classes, weights consolidation and finetuning for already encountered classes. LomonacoV., Maltoni D., Pellegrini L. Fine-Grained Continual Learning. Preprint arxiv arXiv:1907.03799, 2019.
  • 31. CORe50 - NICv2 Results LomonacoV., Maltoni D., Pellegrini L. Fine-Grained Continual Learning. Preprint arxiv arXiv:1907.03799, 2019.
  • 32. CORe50 - NICv2 Results LomonacoV., Maltoni D., Pellegrini L. Fine-Grained Continual Learning. Preprint arxiv arXiv:1907.03799, 2019. ● AR1* has a speed-up of x69 w.r.t. the Cumulative Strategy. ● AR1* is x393 more memory efficient than the Cumulative strategy. ● AR1* computation and memory overhead is fixed and independent from the number of training batches.
  • 33. Latent Replay for Real-Time Continual Learning
  • 34. AR-1*: Closing the Accuracy Gap with Latent Replay Data layer Output layer (classes) Low-level generic features Class specific discriminative features Backwardpass (allpatterns) Forwardpass (allpatterns) Forwardpass (nativepatterns) Concat External storage (rehearsal patterns) (at minibatch level) Pellegrini L., Graffieti G. , LomonacoV. and Maltoni D. Latent Replay for Real-Time Continual Learning.To be published. Backwardpass (nativepatterns)
  • 35. AR-1*: Closing the Accuracy Gap with Latent Replay Pellegrini L., Graffieti G. , LomonacoV. and Maltoni D. Latent Replay for Real-Time Continual Learning.To be published.
  • 36. AR-1*: Closing the Accuracy Gap with Latent Replay Pellegrini L., Graffieti G. , LomonacoV. and Maltoni D. Latent Replay for Real-Time Continual Learning.To be published.
  • 37. AR-1*: Sparse Representations ● Imposing Sparsity of the activactivation does not affect accuracy from ~55% to ~35%. ● It has been shown that sparsity may help the CL process. ● Less memory overhead for latent replay. Pellegrini L., Graffieti G. , LomonacoV. and Maltoni D. Latent Replay for Real-Time Continual Learning.To be published.
  • 39. CORe Android App ● AR1*free with Latent Replay (RM=500). ● 20 sec to gather new session images (at 5 fps). ● Near real-time training updates CPUs-only (about 1 second). ● Inference time of about 5fps without any specific optimization. Pellegrini L., Graffieti G. , LomonacoV. and Maltoni D. Latent Replay for Real-Time Continual Learning.To be published.
  • 40. CORe Android App Pellegrini L., Graffieti G. , LomonacoV. and Maltoni D. Latent Replay for Real-Time Continual Learning.To be published.
  • 41. FutureWorks and Research Direction 1. Latent Generative Replay 2. Lowering the amount of Supervision (Unsupervised Reinforcement Learning, Active Learning) 3. Infer or make use of the sparse “task signal” (context modulation) 4. Sequence Learning/ Temporal Coherence Integration 5. Improve robustness in real-world embedded applications (Smartphone devices, Nao Robot, …) Maltoni D. and LomonacoV. Semi-SupervisedTuning fromTemporal Coherence. ICPR 2016. LomonacoV., Desai K., Maltoni D. and Culurciello, E. Continual Reinforcement Learning in 3D non-stationary environments. preprint arxiv arXiv:1905.10112, 2019.
  • 42. AR-1*: Closing the Accuracy Gap with Latent Generative Replay ● Two levels (category + classes) ● Dense connections, two losses ● Supervised pre-training on a large dataset (e.g. Imagenet). ● Unsupervised slow tuning based on i) temporal coherence or ii) reconstruction of input. ● Supervised training with latent rehearsal ● Regularization by temporal coherence (category level?) Generative Model Pellegrini L., Graffieti G. , LomonacoV. and Maltoni D. Latent Replay for Real-Time Continual Learning.To be published. Backwardpass (allpatterns) Forwardpass (allpatterns) Forwardpass (nativepatterns) Concat (at minibatch level) Backwardpass (nativepatterns)
  • 43. Questions? Talk @ University of Pisa 14-02-2020 Vincenzo Lomonaco [email protected] Postdoctoral Researcher @ University of Bologna Supervisor: Davide Maltoni
  • 44. Questions? Talk @ University of Pisa 14-02-2020 Vincenzo Lomonaco [email protected] Postdoctoral Researcher @ University of Bologna Supervisor: Davide Maltoni