SlideShare a Scribd company logo
Alma Mater Studiorum - University of Bologna
School of Science
Department of Computer Science and Engineering DISI
Deep Learning for Computer Vision
Candidate
dott. Vincenzo Lomonaco
Supervisor
prof. Davide Maltoni
Co-examiner
prof. Mauro Gaspari
A comparison between Convolutional Neural
Networks and Hierarchical Temporal Memories on
object recognition tasks
08.09.15 Vincenzo Lomonaco 2
ContentsBackground & Motivations
Objectives
Introduction
CNN and HTM
Key features
Implementations
NORB-sequences
Original NORB dataset
New benchmark design
Experiments and Results
Experiments design
Results
Conclusions
Contents
08.09.15 Vincenzo Lomonaco 3
ContentsBackground & Motivations
Objectives
Introduction
CNN and HTM
Key features
Implementations
NORB-sequences
Original NORB dataset
New benchmark design
Experiments and Results
Experiments design
Results
Conclusions
Contents
08.09.15 Vincenzo Lomonaco 4
Deep Learning
In the last decade, Deep Learning techniques have shown to
perform incredibly well on a large variety of problems both in
Computer Vision and Natural Language Processing, resulting in
the state of the art in many tasks.
08.09.15 Vincenzo Lomonaco 5
Deep Learning advantages
Deep Learning is a branch of machine learning based on a set of
algorithms that attempt to model high-level abstractions in data by
using model architectures composed of multiple non-linear
transformations.
08.09.15 Vincenzo Lomonaco 6
Deep Learning disadvantages
● Poorly understood surrounding theory
● Non-optimal method
● Very difficult to train
● Huge quantity of data needed
● High Performance Computing environment needed
Possible limitations:
08.09.15 Vincenzo Lomonaco 7
Objectives
Proving that taking inspiration from biological learning
systems can help again in advancing the field of DL.
Proving that, with less data, it is however possible to reach
good levels of accuracy.
08.09.15 Vincenzo Lomonaco 8
How
We would like to show that, with a lower quantity of available
data, HTM can outperfom CNN on these tasks remaining
comparable in terms of training times.
Comparing two very different deep learning algorithms on
object recognition tasks:
– CNN: classical approach, state-of-the-art for object
recognition
– HTM: new biologically inspired approach
08.09.15 Vincenzo Lomonaco 9
NORB-sequences
Experiments and Results
Conclusions
ContentsBackground & Motivations
Objectives
Introduction
CNN and HTM
Key features
Implementations
Original NORB dataset
New benchmark design
Experiments design
Results
Contents
08.09.15 Vincenzo Lomonaco 10
CNN
CNNs are MLP variants where individual neurons are tiled in
such a way that they respond to overlapping regions in the
visual field. They are architectural inspired by Hubel and
Wiesel’s early work on the cat’s visual cortex.
● Python
● Using Theano
● 11 source files, 2550+ lns
● Pure supervised method
● Sparse Connectivity
● Shared Weights
Key features: Implementation:
08.09.15 Vincenzo Lomonaco 11
HTM
HTM is known as a new emerging paradigm that is more
biologically inspired. It tries to incorporate concepts like time,
context and attention during the learning process that are
typical of the human brain.
● C#, OPENMP version
● Provided by Biometric
System Lab (DISI)
● Mainly unsupervised method
● Top down and bottom-up
information flow
● Bayesian probabilistic
formulation
Key features: Implementations:
08.09.15 Vincenzo Lomonaco 12
Experiments and Results
Conclusions
Original NORB dataset
New benchmark design
ContentsBackground & Motivations
Objectives
Introduction
CNN and HTM
Key features
Implementations
Experiments design
Results
Contents
NORB-sequences
08.09.15 Vincenzo Lomonaco 13
NORB-Sequences
Since the computer vision community is starting to investigate
object recognition algorithms on videos, we would like to move
our comparison to that direction.
To this purpose, a new benchmark of a large collection of image
sequences starting from the well-know small NORB DATASET
has been created.
THE original NORB DATASET:
● Stores 48,600 96x96 image (5 categories, 10 instances, 6 lightings,
9 elevations, and 18 azimuths).
● Is well-know and accepted by the research community in the
context of object-recognition
08.09.15 Vincenzo Lomonaco 14
original NORB DATASET
Training instances Test instances
08.09.15 Vincenzo Lomonaco 15
Java sequencer
NORB-sequences is made possible thanks to a Java software
that takes in input the small NORB DATASET, and given a
number of different tuning parameters, return a number of
training and a test image sequences.
time
● The sequences are created ad hoc to simulate a camera moving
around a specific object including changes in the surround lighting.
● Integrated KNN baseline, GUI, 10 source files, 2600+ lns
Key features:
08.09.15 Vincenzo Lomonaco 16
NORB Sequences GUI
08.09.15 Vincenzo Lomonaco 17
NORB-sequences
Conclusions
ContentsBackground & Motivations
Objectives
Introduction
CNN and HTM
Key features
Implementations
Original NORB dataset
New benchmark design
Experiments design
Results
Contents
Experiments and Results
08.09.15 Vincenzo Lomonaco 18
Experiments design
1) Validate the CNN implementation on the NORB dataset
2) Evaluate the performance of both algorithms on the plain
NORB dataset
3) Evaluate the performance of both algorithms on the NORB
sequences
08.09.15 Vincenzo Lomonaco 19
CNN validation
In order to validate the new implementation,the goal was to
reproduce Y. LeCun original results on the plain NORB
DATASET.
08.09.15 Vincenzo Lomonaco 20
Plain NORB results
Accuracy results comparison between CNN and HTM on the
plain NORB dataset.
08.09.15 Vincenzo Lomonaco 21
Training times
Training times comparison between CNN and HTM on the
NORB sequences.
Training size CNN times HTM times
100 + 800jit 10.94 m 21.19 m
250 + 2000jit 31.15 m 23.13 m
500 + 4000jit 38.24 m 22.14 m
1000 + 4000jit 91.26 m 26.04 m
2500 + 4000jit 94.90 m 61.08 m
5000 + 4000jit 124.7 m 89.58 m
10000 + 4000jit 187.7 m 143.5 m
24300 + 4000jit 51.31 m 596.2 m
● CNN: GPU Tesla C2075 Fermi
(GPU speedup x3.2)
● HTM: CPU Xeon W3550, 4
cores.
Architectures:
08.09.15 Vincenzo Lomonaco 22
NORB sequences results
Accuracy results comparison between CNN and HTM on the
NORB sequences.
Train Test dist. CNN HTM
2x20 1 84.1% 86.36%
2x20 2 82.96% 86.16%
2x20 3 82.26% 86.16%
2x20 4 81.35% 84.20%
3x20 1 89.27% 91.0%
3x20 2 88.46% 89.52%
3x20 3 88.42% 88.68%
3x20 4 88.0% 85.88%
5x20 1 95.11% 92.69%
5x20 2 94.4% 91.62%
5x20 3 93.1% 92.86%
5x20 4 92.13% 91.23%
Train Test dist. CNN HTM
2x20 1 37.56% 37.08%
2x20 2 34.38% 37.82%
2x20 3 30.71% 33.17%
2x20 4 25.47% 28.89%
3x20 1 49.89% 43.68%
3x20 2 48.01% 44.08%
3x20 3 40.56% 37.93%
3x20 4 33.77% 34.93%
5x20 1 55.17% 52.57%
5x20 2 52.55% 49.74%
5x20 3 45.86% 45.52%
5x20 4 40.08% 41.30%
5-classes 50-classes
08.09.15 Vincenzo Lomonaco 23
NORB-sequences
Experiments and Results
Conclusions
ContentsBackground & Motivations
Objectives
Introduction
CNN and HTM
Key features
Implementations
Original NORB dataset
New benchmark design
Experiments design
Results
Contents
08.09.15 Vincenzo Lomonaco 24
Conclusions
In this dissertation three different milestones have been
achieved:
1) A LeNet-7 with Theano has been successfully implemented.
2) A new benchmark for object recognition in image
sequences has been created.
3) HTM and CNN have been compared on different object
recognition tasks.
It has been proven that the HTM bio-inspired approach can
be highly competitive and could be instrumental for
advancing the field of Deep Learning
08.09.15 Vincenzo Lomonaco 25
The End
http://vincenzolomonaco.com
vincenzo.lomonaco@studio.unibo.it
“If we want machines to think, we need to teach them to see”
Fei-Fei Li, Stanford Computer Vision Lab
Thank you for your attention
Vincenzo Lomonaco

More Related Content

What's hot (20)

PPTX
TensorFlow Tutorial Part2
Sungjoon Choi
 
PPTX
TensorFlow Tutorial Part1
Sungjoon Choi
 
PDF
Deep Learning - 인공지능 기계학습의 새로운 트랜드 :김인중
datasciencekorea
 
PDF
Introduction to Convolutional Neural Networks
Hannes Hapke
 
PDF
Deep learning in Computer Vision
David Dao
 
PDF
AI&BigData Lab 2016. Александр Баев: Transfer learning - зачем, как и где.
GeeksLab Odessa
 
PPTX
Transformer in Vision
Sangmin Woo
 
PPTX
Convolutional neural networks deepa
deepa4466
 
PPTX
Neuroevolution and deep learing
Accenture
 
PPTX
Learn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
PyData
 
PPTX
AlexNet
Bertil Hatt
 
PDF
AI&BigData Lab. Артем Чернодуб "Распознавание изображений методом Lazy Deep ...
GeeksLab Odessa
 
PPTX
CNN Tutorial
Sungjoon Choi
 
PDF
Review : Prototype Mixture Models for Few-shot Semantic Segmentation
Dongmin Choi
 
PPTX
Deep Learning Tutorial
Ligeng Zhu
 
PPTX
Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015
Turi, Inc.
 
PPTX
150424 Scalable Object Detection using Deep Neural Networks
Junho Cho
 
PPTX
160205 NeuralArt - Understanding Neural Representation
Junho Cho
 
PDF
Transfer Learning (D2L4 Insight@DCU Machine Learning Workshop 2017)
Universitat Politècnica de Catalunya
 
PPTX
Efficient Neural Network Architecture for Image Classfication
Yogendra Tamang
 
TensorFlow Tutorial Part2
Sungjoon Choi
 
TensorFlow Tutorial Part1
Sungjoon Choi
 
Deep Learning - 인공지능 기계학습의 새로운 트랜드 :김인중
datasciencekorea
 
Introduction to Convolutional Neural Networks
Hannes Hapke
 
Deep learning in Computer Vision
David Dao
 
AI&BigData Lab 2016. Александр Баев: Transfer learning - зачем, как и где.
GeeksLab Odessa
 
Transformer in Vision
Sangmin Woo
 
Convolutional neural networks deepa
deepa4466
 
Neuroevolution and deep learing
Accenture
 
Learn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
PyData
 
AlexNet
Bertil Hatt
 
AI&BigData Lab. Артем Чернодуб "Распознавание изображений методом Lazy Deep ...
GeeksLab Odessa
 
CNN Tutorial
Sungjoon Choi
 
Review : Prototype Mixture Models for Few-shot Semantic Segmentation
Dongmin Choi
 
Deep Learning Tutorial
Ligeng Zhu
 
Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015
Turi, Inc.
 
150424 Scalable Object Detection using Deep Neural Networks
Junho Cho
 
160205 NeuralArt - Understanding Neural Representation
Junho Cho
 
Transfer Learning (D2L4 Insight@DCU Machine Learning Workshop 2017)
Universitat Politècnica de Catalunya
 
Efficient Neural Network Architecture for Image Classfication
Yogendra Tamang
 

Viewers also liked (20)

PPTX
¿Qué es Slide Share?
Family
 
PPTX
Magazine Genre Research
daltonsenger
 
DOCX
Curriculum Vita1
Negash Alemayehu
 
DOC
Tab
Bura Jun
 
PPT
Sarcina nr 2 roma
florinaarsenie
 
PDF
2016-03-31 FAA Visual Arts at MDC Frontenac_May 2016_Press Kit_EN
Kakim Goh
 
PDF
A Framework for Deadlock Detection in Java
Vincenzo Lomonaco
 
PPTX
SCHEMATIC REPORT
Anthony Head
 
PDF
Kato Mivule: An Overview of Adaptive Boosting – AdaBoost
Kato Mivule
 
PPTX
Gsm architecture
Agrima Kothari
 
PDF
CVPR2016読み会 Sparsifying Neural Network Connections for Face Recognition
Koichi Takahashi
 
PPTX
Machine learning with ADA Boost
Aman Patel
 
PPTX
Captcha as graphical passwords a new security primitive based on hard ai prob...
IGEEKS TECHNOLOGIES
 
PPTX
Face recognization using artificial nerual network
Dharmesh Tank
 
PPTX
Face recognition technology
SARATHGOVINDKK
 
PDF
Deep Learning for Computer Vision (2/4): Object Analytics @ laSalle 2016
Universitat Politècnica de Catalunya
 
PPTX
Detection and recognition of face using neural network
Smriti Tikoo
 
PPTX
FACE RECOGNITION USING NEURAL NETWORK
codebangla
 
PPTX
Captcha as graphical password
Gopinath Ramanna
 
¿Qué es Slide Share?
Family
 
Magazine Genre Research
daltonsenger
 
Curriculum Vita1
Negash Alemayehu
 
Sarcina nr 2 roma
florinaarsenie
 
2016-03-31 FAA Visual Arts at MDC Frontenac_May 2016_Press Kit_EN
Kakim Goh
 
A Framework for Deadlock Detection in Java
Vincenzo Lomonaco
 
SCHEMATIC REPORT
Anthony Head
 
Kato Mivule: An Overview of Adaptive Boosting – AdaBoost
Kato Mivule
 
Gsm architecture
Agrima Kothari
 
CVPR2016読み会 Sparsifying Neural Network Connections for Face Recognition
Koichi Takahashi
 
Machine learning with ADA Boost
Aman Patel
 
Captcha as graphical passwords a new security primitive based on hard ai prob...
IGEEKS TECHNOLOGIES
 
Face recognization using artificial nerual network
Dharmesh Tank
 
Face recognition technology
SARATHGOVINDKK
 
Deep Learning for Computer Vision (2/4): Object Analytics @ laSalle 2016
Universitat Politècnica de Catalunya
 
Detection and recognition of face using neural network
Smriti Tikoo
 
FACE RECOGNITION USING NEURAL NETWORK
codebangla
 
Captcha as graphical password
Gopinath Ramanna
 
Ad

Similar to Deep Learning for Computer Vision: A comparision between Convolutional Neural Networks and Hierarchical Temporal Memories on object recognition tasks - Slides (20)

PDF
Deep Learning for Computer Vision: A comparision between Convolutional Neural...
Vincenzo Lomonaco
 
PPTX
Computer vision lab seminar(deep learning) yong hoon
Yonghoon Kwon
 
PPTX
The deep learning technology on coco framework
JIEMS Akkalkuwa
 
PPTX
Deep learning and computer vision
MeetupDataScienceRoma
 
PDF
Finding the best solution for Image Processing
Tech Triveni
 
PDF
Object_Recognition_using_Deeplearning_Report.pptx
Yaopeng (Gyoho) Wu
 
PDF
A Survey of Convolutional Neural Network Architectures for Deep Learning via ...
ijtsrd
 
PPTX
Deep learning
Rajgupta258
 
PDF
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
Tulipp. Eu
 
PPTX
L7_finetuning on tamil technologies.pptx
Meganath7
 
PDF
From Conventional Machine Learning to Deep Learning and Beyond.pptx
Chun-Hao Chang
 
PPTX
Lecture 3: Convolutional Neural Networks
Mohamed Loey
 
PPTX
FINAL_Team_4.pptx
nitin571047
 
PDF
Modelling Framework of a Neural Object Recognition
IJERA Editor
 
PDF
Deep-Learning-with-PydddddddddddddTorch.pdf
drjigarsoni28
 
PPTX
Introduction to computer vision
Marcin Jedyk
 
PPTX
Introduction to Deep learning
leopauly
 
PDF
Comparison of convolutional neural network models for user’s facial recognition
IJECEIAES
 
PDF
CNN Algorithm
georgejustymirobi1
 
PPTX
Introduction to computer vision with Convoluted Neural Networks
MarcinJedyk
 
Deep Learning for Computer Vision: A comparision between Convolutional Neural...
Vincenzo Lomonaco
 
Computer vision lab seminar(deep learning) yong hoon
Yonghoon Kwon
 
The deep learning technology on coco framework
JIEMS Akkalkuwa
 
Deep learning and computer vision
MeetupDataScienceRoma
 
Finding the best solution for Image Processing
Tech Triveni
 
Object_Recognition_using_Deeplearning_Report.pptx
Yaopeng (Gyoho) Wu
 
A Survey of Convolutional Neural Network Architectures for Deep Learning via ...
ijtsrd
 
Deep learning
Rajgupta258
 
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
Tulipp. Eu
 
L7_finetuning on tamil technologies.pptx
Meganath7
 
From Conventional Machine Learning to Deep Learning and Beyond.pptx
Chun-Hao Chang
 
Lecture 3: Convolutional Neural Networks
Mohamed Loey
 
FINAL_Team_4.pptx
nitin571047
 
Modelling Framework of a Neural Object Recognition
IJERA Editor
 
Deep-Learning-with-PydddddddddddddTorch.pdf
drjigarsoni28
 
Introduction to computer vision
Marcin Jedyk
 
Introduction to Deep learning
leopauly
 
Comparison of convolutional neural network models for user’s facial recognition
IJECEIAES
 
CNN Algorithm
georgejustymirobi1
 
Introduction to computer vision with Convoluted Neural Networks
MarcinJedyk
 
Ad

More from Vincenzo Lomonaco (18)

PDF
2023-08-22 CoLLAs Tutorial - Beyond CIL.pdf
Vincenzo Lomonaco
 
PDF
Continual Learning with Deep Architectures - Tutorial ICML 2021
Vincenzo Lomonaco
 
PDF
Toward Continual Learning on the Edge
Vincenzo Lomonaco
 
PDF
Continual Learning: Another Step Towards Truly Intelligent Machines
Vincenzo Lomonaco
 
PDF
Tutorial inns2019 full
Vincenzo Lomonaco
 
PDF
Continual Reinforcement Learning in 3D Non-stationary Environments
Vincenzo Lomonaco
 
PDF
Continual/Lifelong Learning with Deep Architectures
Vincenzo Lomonaco
 
PDF
Continual Learning for Robotics
Vincenzo Lomonaco
 
PDF
Don't forget, there is more than forgetting: new metrics for Continual Learni...
Vincenzo Lomonaco
 
PDF
Open-Source Frameworks for Deep Learning: an Overview
Vincenzo Lomonaco
 
PPTX
Continual Learning with Deep Architectures Workshop @ Computer VISIONers Conf...
Vincenzo Lomonaco
 
PPTX
CORe50: a New Dataset and Benchmark for Continual Learning and Object Recogni...
Vincenzo Lomonaco
 
PPTX
Continuous Learning with Deep Architectures
Vincenzo Lomonaco
 
PDF
CORe50: a New Dataset and Benchmark for Continuous Object Recognition Poster
Vincenzo Lomonaco
 
PPTX
Continuous Unsupervised Training of Deep Architectures
Vincenzo Lomonaco
 
PDF
Deep Learning libraries and first experiments with Theano
Vincenzo Lomonaco
 
PDF
Word2vec on the italian language: first experiments
Vincenzo Lomonaco
 
PDF
Machine Learning for Automated Reasoning: An Overview
Vincenzo Lomonaco
 
2023-08-22 CoLLAs Tutorial - Beyond CIL.pdf
Vincenzo Lomonaco
 
Continual Learning with Deep Architectures - Tutorial ICML 2021
Vincenzo Lomonaco
 
Toward Continual Learning on the Edge
Vincenzo Lomonaco
 
Continual Learning: Another Step Towards Truly Intelligent Machines
Vincenzo Lomonaco
 
Tutorial inns2019 full
Vincenzo Lomonaco
 
Continual Reinforcement Learning in 3D Non-stationary Environments
Vincenzo Lomonaco
 
Continual/Lifelong Learning with Deep Architectures
Vincenzo Lomonaco
 
Continual Learning for Robotics
Vincenzo Lomonaco
 
Don't forget, there is more than forgetting: new metrics for Continual Learni...
Vincenzo Lomonaco
 
Open-Source Frameworks for Deep Learning: an Overview
Vincenzo Lomonaco
 
Continual Learning with Deep Architectures Workshop @ Computer VISIONers Conf...
Vincenzo Lomonaco
 
CORe50: a New Dataset and Benchmark for Continual Learning and Object Recogni...
Vincenzo Lomonaco
 
Continuous Learning with Deep Architectures
Vincenzo Lomonaco
 
CORe50: a New Dataset and Benchmark for Continuous Object Recognition Poster
Vincenzo Lomonaco
 
Continuous Unsupervised Training of Deep Architectures
Vincenzo Lomonaco
 
Deep Learning libraries and first experiments with Theano
Vincenzo Lomonaco
 
Word2vec on the italian language: first experiments
Vincenzo Lomonaco
 
Machine Learning for Automated Reasoning: An Overview
Vincenzo Lomonaco
 

Recently uploaded (20)

DOCX
Accomplishment Report on YES- O SY 2025 2026.docx
WilsonVillamater
 
PPTX
atom : it is the building unit of the structure of any matter
abdoy2605
 
PPTX
Earthquake1214435435665467576786587867876867888.pptx
JohnMarkBarrientos1
 
PPTX
Organisms of oncogenic Potential.pptx
mrkoustavjana2003
 
PDF
The First Detection of Molecular Activity in the Largest Known Oort Cloud Com...
Sérgio Sacani
 
PDF
Isro (Indian space research organization)
parineetaparineeta23
 
PDF
Can Consciousness Live and Travel Through Quantum AI?
Saikat Basu
 
PDF
HOW TO DEAL WITH THREATS FROM THE FORCES OF NATURE FROM OUTER SPACE.pdf
Faga1939
 
DOCX
Transportation in plants and animals.docx
bhatbashir421
 
PPTX
Single-Cell Multi-Omics in Neurodegeneration p1.pptx
KanakChaudhary10
 
PDF
The MUSEview of the Sculptor galaxy: survey overview and the planetary nebula...
Sérgio Sacani
 
PDF
Integrating Conversational Agents and Knowledge Graphs within the Scholarly D...
Angelo Salatino
 
PPTX
(Normal Mechanism)physiology of labour.pptx
DavidSalman2
 
PDF
Enzyme Kinetics_Lecture 8.5.2025 Enzymology.pdf
ayeshaalibukhari125
 
PPTX
Cancer
Vartika
 
PPTX
1699424534480_FOREST_SOCIETY_and_COLONIALISM (15).pptx
kavishtiwari2009
 
PDF
Evidence for a sub-Jovian planet in the young TWA 7 disk
Sérgio Sacani
 
PDF
Sujay Rao Mandavilli public profile June 2025.pdf
Sujay Rao Mandavilli
 
PPTX
MEDICINAL CHEMISTRY PROSPECTIVES IN DESIGN OF EGFR INHIBITORS.pptx
40RevathiP
 
PDF
POLISH JOURNAL OF SCIENCE №87 (2025)
POLISH JOURNAL OF SCIENCE
 
Accomplishment Report on YES- O SY 2025 2026.docx
WilsonVillamater
 
atom : it is the building unit of the structure of any matter
abdoy2605
 
Earthquake1214435435665467576786587867876867888.pptx
JohnMarkBarrientos1
 
Organisms of oncogenic Potential.pptx
mrkoustavjana2003
 
The First Detection of Molecular Activity in the Largest Known Oort Cloud Com...
Sérgio Sacani
 
Isro (Indian space research organization)
parineetaparineeta23
 
Can Consciousness Live and Travel Through Quantum AI?
Saikat Basu
 
HOW TO DEAL WITH THREATS FROM THE FORCES OF NATURE FROM OUTER SPACE.pdf
Faga1939
 
Transportation in plants and animals.docx
bhatbashir421
 
Single-Cell Multi-Omics in Neurodegeneration p1.pptx
KanakChaudhary10
 
The MUSEview of the Sculptor galaxy: survey overview and the planetary nebula...
Sérgio Sacani
 
Integrating Conversational Agents and Knowledge Graphs within the Scholarly D...
Angelo Salatino
 
(Normal Mechanism)physiology of labour.pptx
DavidSalman2
 
Enzyme Kinetics_Lecture 8.5.2025 Enzymology.pdf
ayeshaalibukhari125
 
Cancer
Vartika
 
1699424534480_FOREST_SOCIETY_and_COLONIALISM (15).pptx
kavishtiwari2009
 
Evidence for a sub-Jovian planet in the young TWA 7 disk
Sérgio Sacani
 
Sujay Rao Mandavilli public profile June 2025.pdf
Sujay Rao Mandavilli
 
MEDICINAL CHEMISTRY PROSPECTIVES IN DESIGN OF EGFR INHIBITORS.pptx
40RevathiP
 
POLISH JOURNAL OF SCIENCE №87 (2025)
POLISH JOURNAL OF SCIENCE
 

Deep Learning for Computer Vision: A comparision between Convolutional Neural Networks and Hierarchical Temporal Memories on object recognition tasks - Slides

  • 1. Alma Mater Studiorum - University of Bologna School of Science Department of Computer Science and Engineering DISI Deep Learning for Computer Vision Candidate dott. Vincenzo Lomonaco Supervisor prof. Davide Maltoni Co-examiner prof. Mauro Gaspari A comparison between Convolutional Neural Networks and Hierarchical Temporal Memories on object recognition tasks
  • 2. 08.09.15 Vincenzo Lomonaco 2 ContentsBackground & Motivations Objectives Introduction CNN and HTM Key features Implementations NORB-sequences Original NORB dataset New benchmark design Experiments and Results Experiments design Results Conclusions Contents
  • 3. 08.09.15 Vincenzo Lomonaco 3 ContentsBackground & Motivations Objectives Introduction CNN and HTM Key features Implementations NORB-sequences Original NORB dataset New benchmark design Experiments and Results Experiments design Results Conclusions Contents
  • 4. 08.09.15 Vincenzo Lomonaco 4 Deep Learning In the last decade, Deep Learning techniques have shown to perform incredibly well on a large variety of problems both in Computer Vision and Natural Language Processing, resulting in the state of the art in many tasks.
  • 5. 08.09.15 Vincenzo Lomonaco 5 Deep Learning advantages Deep Learning is a branch of machine learning based on a set of algorithms that attempt to model high-level abstractions in data by using model architectures composed of multiple non-linear transformations.
  • 6. 08.09.15 Vincenzo Lomonaco 6 Deep Learning disadvantages ● Poorly understood surrounding theory ● Non-optimal method ● Very difficult to train ● Huge quantity of data needed ● High Performance Computing environment needed Possible limitations:
  • 7. 08.09.15 Vincenzo Lomonaco 7 Objectives Proving that taking inspiration from biological learning systems can help again in advancing the field of DL. Proving that, with less data, it is however possible to reach good levels of accuracy.
  • 8. 08.09.15 Vincenzo Lomonaco 8 How We would like to show that, with a lower quantity of available data, HTM can outperfom CNN on these tasks remaining comparable in terms of training times. Comparing two very different deep learning algorithms on object recognition tasks: – CNN: classical approach, state-of-the-art for object recognition – HTM: new biologically inspired approach
  • 9. 08.09.15 Vincenzo Lomonaco 9 NORB-sequences Experiments and Results Conclusions ContentsBackground & Motivations Objectives Introduction CNN and HTM Key features Implementations Original NORB dataset New benchmark design Experiments design Results Contents
  • 10. 08.09.15 Vincenzo Lomonaco 10 CNN CNNs are MLP variants where individual neurons are tiled in such a way that they respond to overlapping regions in the visual field. They are architectural inspired by Hubel and Wiesel’s early work on the cat’s visual cortex. ● Python ● Using Theano ● 11 source files, 2550+ lns ● Pure supervised method ● Sparse Connectivity ● Shared Weights Key features: Implementation:
  • 11. 08.09.15 Vincenzo Lomonaco 11 HTM HTM is known as a new emerging paradigm that is more biologically inspired. It tries to incorporate concepts like time, context and attention during the learning process that are typical of the human brain. ● C#, OPENMP version ● Provided by Biometric System Lab (DISI) ● Mainly unsupervised method ● Top down and bottom-up information flow ● Bayesian probabilistic formulation Key features: Implementations:
  • 12. 08.09.15 Vincenzo Lomonaco 12 Experiments and Results Conclusions Original NORB dataset New benchmark design ContentsBackground & Motivations Objectives Introduction CNN and HTM Key features Implementations Experiments design Results Contents NORB-sequences
  • 13. 08.09.15 Vincenzo Lomonaco 13 NORB-Sequences Since the computer vision community is starting to investigate object recognition algorithms on videos, we would like to move our comparison to that direction. To this purpose, a new benchmark of a large collection of image sequences starting from the well-know small NORB DATASET has been created. THE original NORB DATASET: ● Stores 48,600 96x96 image (5 categories, 10 instances, 6 lightings, 9 elevations, and 18 azimuths). ● Is well-know and accepted by the research community in the context of object-recognition
  • 14. 08.09.15 Vincenzo Lomonaco 14 original NORB DATASET Training instances Test instances
  • 15. 08.09.15 Vincenzo Lomonaco 15 Java sequencer NORB-sequences is made possible thanks to a Java software that takes in input the small NORB DATASET, and given a number of different tuning parameters, return a number of training and a test image sequences. time ● The sequences are created ad hoc to simulate a camera moving around a specific object including changes in the surround lighting. ● Integrated KNN baseline, GUI, 10 source files, 2600+ lns Key features:
  • 16. 08.09.15 Vincenzo Lomonaco 16 NORB Sequences GUI
  • 17. 08.09.15 Vincenzo Lomonaco 17 NORB-sequences Conclusions ContentsBackground & Motivations Objectives Introduction CNN and HTM Key features Implementations Original NORB dataset New benchmark design Experiments design Results Contents Experiments and Results
  • 18. 08.09.15 Vincenzo Lomonaco 18 Experiments design 1) Validate the CNN implementation on the NORB dataset 2) Evaluate the performance of both algorithms on the plain NORB dataset 3) Evaluate the performance of both algorithms on the NORB sequences
  • 19. 08.09.15 Vincenzo Lomonaco 19 CNN validation In order to validate the new implementation,the goal was to reproduce Y. LeCun original results on the plain NORB DATASET.
  • 20. 08.09.15 Vincenzo Lomonaco 20 Plain NORB results Accuracy results comparison between CNN and HTM on the plain NORB dataset.
  • 21. 08.09.15 Vincenzo Lomonaco 21 Training times Training times comparison between CNN and HTM on the NORB sequences. Training size CNN times HTM times 100 + 800jit 10.94 m 21.19 m 250 + 2000jit 31.15 m 23.13 m 500 + 4000jit 38.24 m 22.14 m 1000 + 4000jit 91.26 m 26.04 m 2500 + 4000jit 94.90 m 61.08 m 5000 + 4000jit 124.7 m 89.58 m 10000 + 4000jit 187.7 m 143.5 m 24300 + 4000jit 51.31 m 596.2 m ● CNN: GPU Tesla C2075 Fermi (GPU speedup x3.2) ● HTM: CPU Xeon W3550, 4 cores. Architectures:
  • 22. 08.09.15 Vincenzo Lomonaco 22 NORB sequences results Accuracy results comparison between CNN and HTM on the NORB sequences. Train Test dist. CNN HTM 2x20 1 84.1% 86.36% 2x20 2 82.96% 86.16% 2x20 3 82.26% 86.16% 2x20 4 81.35% 84.20% 3x20 1 89.27% 91.0% 3x20 2 88.46% 89.52% 3x20 3 88.42% 88.68% 3x20 4 88.0% 85.88% 5x20 1 95.11% 92.69% 5x20 2 94.4% 91.62% 5x20 3 93.1% 92.86% 5x20 4 92.13% 91.23% Train Test dist. CNN HTM 2x20 1 37.56% 37.08% 2x20 2 34.38% 37.82% 2x20 3 30.71% 33.17% 2x20 4 25.47% 28.89% 3x20 1 49.89% 43.68% 3x20 2 48.01% 44.08% 3x20 3 40.56% 37.93% 3x20 4 33.77% 34.93% 5x20 1 55.17% 52.57% 5x20 2 52.55% 49.74% 5x20 3 45.86% 45.52% 5x20 4 40.08% 41.30% 5-classes 50-classes
  • 23. 08.09.15 Vincenzo Lomonaco 23 NORB-sequences Experiments and Results Conclusions ContentsBackground & Motivations Objectives Introduction CNN and HTM Key features Implementations Original NORB dataset New benchmark design Experiments design Results Contents
  • 24. 08.09.15 Vincenzo Lomonaco 24 Conclusions In this dissertation three different milestones have been achieved: 1) A LeNet-7 with Theano has been successfully implemented. 2) A new benchmark for object recognition in image sequences has been created. 3) HTM and CNN have been compared on different object recognition tasks. It has been proven that the HTM bio-inspired approach can be highly competitive and could be instrumental for advancing the field of Deep Learning
  • 25. 08.09.15 Vincenzo Lomonaco 25 The End http://vincenzolomonaco.com [email protected] “If we want machines to think, we need to teach them to see” Fei-Fei Li, Stanford Computer Vision Lab Thank you for your attention Vincenzo Lomonaco