Temporal Activity Detection in Untrimmed Videos with Recurrent Neural Networks
Alberto Montes
July 15th, 2016
Xavi Giró
Amaia Salvador
Outline
1. Introduction
2. Related Work
3. Methodology
4. Results
5. Conclusions and Future Work
2
Motivation
3
Problem Definition
5
Videos
Problem Definition
6
Videos
Activity Classification
Longboarding
Problem Definition
7
Videos
Activity Temporal Localization
Longboarding
Problem Definition
8
How?
Problem Definition
9
Neural Network
Activity
Problem Definition
10
Activity
CNN + RNN
11
Dataset
Large-Scale Activity Recognition Challenge
Stats:
● 19,994 Videos
● 200 Activities
● 660 hours of video
● 313 hours of activities
● 65.6 million frames
12
2. Related Work
13
Literature Approaches
14
Activity
CNN + RNN
Convolutional Neural Network
15
Convolutional Layer
Recurrent Neural Network
16
c0
c1
c2
Literature Approaches
17
Activity
CNN + RNN
3D Convolution
18
Tran, D., Bourdev, L., Fergus, R., Torresani, L., & Paluri, M. (2015). Learning spatiotemporal features with 3D convolutional networks. In IEEE International Conference on Computer Vision (ICCV) 2015 (pp. 4489-4497).
3D Convolution
19
● 16-frame video clip as input
● 80 million parameters
● 3x3x3 filter size at all conv layers
Tran, D., Bourdev, L., Fergus, R., Torresani, L., & Paluri, M. (2015). Learning spatiotemporal features with 3D convolutional networks. In IEEE International Conference on Computer Vision (ICCV) 2015 (pp. 4489-4497).
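To make the 3x3x3 filter concrete, here is a minimal sketch of a single-channel 3D convolution applied to a 16-frame clip. It is a toy illustration only: it uses no padding, a stride of 1, and plain Python lists, and the function name `conv3d_valid` is hypothetical rather than part of C3D or Caffe.

```python
def conv3d_valid(volume, kernel):
    """Naive single-channel 3D convolution (cross-correlation), no padding, stride 1.

    volume: nested list indexed [t][y][x]; kernel: nested list [kt][ky][kx].
    """
    T, H, W = len(volume), len(volume[0]), len(volume[0][0])
    kT, kH, kW = len(kernel), len(kernel[0]), len(kernel[0][0])
    out = []
    for t in range(T - kT + 1):
        plane = []
        for y in range(H - kH + 1):
            row = []
            for x in range(W - kW + 1):
                acc = 0.0
                # The filter spans time as well as the two spatial axes.
                for dt in range(kT):
                    for dy in range(kH):
                        for dx in range(kW):
                            acc += volume[t + dt][y + dy][x + dx] * kernel[dt][dy][dx]
                row.append(acc)
            plane.append(row)
        out.append(plane)
    return out


# Toy clip: 16 frames of 8x8 pixels, all ones; 3x3x3 averaging kernel.
clip = [[[1.0] * 8 for _ in range(8)] for _ in range(16)]
kernel = [[[1.0 / 27] * 3 for _ in range(3)] for _ in range(3)]
features = conv3d_valid(clip, kernel)
print(len(features), len(features[0]), len(features[0][0]))  # 14 6 6
```

Because the kernel also slides along the time axis, each output value mixes information from three consecutive frames, which is what lets the network capture motion as well as appearance.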
Literature Approaches
20
Activity
CNN + RNN
Segments Proposals
22
Shou, Z., Wang, D., & Chang, S. F. (2016). Temporal Action Localization in Untrimmed Videos via Multi-stage CNNs. CVPR 2016.
Literature Approaches
23
Activity
CNN + RNN
RNN for Activity Localization
24
Yeung, S., Russakovsky, O., Mori, G., & Fei-Fei, L. (2016). End-to-end Learning of Action Detection from Frame Glimpses in Videos. CVPR 2016.
3. Methodology
25
Architecture Overview
26
Each 16-frame clip → 200 activities + background
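The clip-level view above can be sketched as follows. This is a minimal sketch, assuming non-overlapping 16-frame clips and assuming that trailing frames which do not fill a whole clip are dropped; the slides only state the clip length, and the function name `split_into_clips` is hypothetical.

```python
def split_into_clips(num_frames, clip_length=16):
    """Return (start, end) frame indices of consecutive non-overlapping clips.

    end is exclusive. Trailing frames that do not fill a whole clip are
    dropped (an assumption made for this sketch).
    """
    if clip_length <= 0:
        raise ValueError("clip_length must be positive")
    num_clips = num_frames // clip_length
    return [(i * clip_length, (i + 1) * clip_length) for i in range(num_clips)]


clips = split_into_clips(num_frames=70)
print(clips)  # four full clips; the last 6 frames are dropped
```

Each clip then yields one prediction over the 200 activities plus the background class.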
Outline
3. Methodology
a. Extracting C3D Features
b. Audio Features
c. Network Architecture
d. Training Methodology
e. Post-Processing
27
3. Methodology
a. Extracting C3D Features
28
C3D Network
29
Caffe +
by
feature vector
published on:
C3D Network
30
Caffe
by
feature vector
3. Methodology
b. Audio Features
31
Audio Features
32
C3D
Recurrent Neural Network Input
Audio Features:
● MFCC
● Spectral
concat
video features
Provided by
Ignasi Esquerra
3. Methodology
c. Network Architecture
33
Network Architecture
34
Network Architecture
35
Network Architecture
36
LSTM with previous output feedback
3. Methodology
d. Training Methodology
37
Training Methodology
Categorical Cross Entropy Loss
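For reference, the standard categorical cross-entropy for a single clip can be written as below. The symbols are our own notation, not taken from the slides: K is the number of classes (here the 200 activities plus background), y is the one-hot ground truth, and p is the predicted distribution.

```latex
\mathcal{L}(y, p) = -\sum_{k=1}^{K} y_k \log p_k
```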
38
Training Methodology
For unbalanced data, weighted loss:
39
660 hours of video
313 hours of activities
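Since only 313 of the 660 hours contain activities, roughly half of all clips fall into the single background class. A minimal sketch of a class-weighted cross-entropy for one clip follows. The weight values and the choice of index 0 for background are hypothetical, chosen only for illustration, and are not the settings used in this work.

```python
import math


def weighted_cross_entropy(probs, target_index, class_weights):
    """Cross-entropy for one clip, scaled by the weight of the true class."""
    eps = 1e-12  # guards against log(0)
    return -class_weights[target_index] * math.log(max(probs[target_index], eps))


# Index 0 is background in this sketch; its weight (0.3) is a hypothetical value.
class_weights = [0.3, 1.0, 1.0]
probs = [0.5, 0.25, 0.25]

loss_background = weighted_cross_entropy(probs, 0, class_weights)
loss_activity = weighted_cross_entropy(probs, 1, class_weights)
print(loss_background, loss_activity)
```

Down-weighting the dominant class keeps the network from lowering the loss simply by predicting background everywhere.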
3. Methodology
e. Post-Processing
40
Classification Post-Processing
43
Per-clip output probabilities: classes (Background, Activity 1, Activity 2, …, Activity 200) × clips (Clip1, Clip2, Clip3, …, ClipN)
Average over clips
Max Probability
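The averaging and max-probability steps can be sketched as follows. This is a minimal sketch, assuming the background class sits at index 0 and is excluded from the final ranking; both are assumptions, and the function name `classify_video` is hypothetical.

```python
def classify_video(clip_probs, background_index=0, top_k=3):
    """Average per-clip class probabilities and rank the activity classes.

    clip_probs: list of per-clip probability lists (same length each).
    Returns a list of (class_index, mean_probability), best first, with the
    background class excluded (an assumption made for this sketch).
    """
    num_clips = len(clip_probs)
    num_classes = len(clip_probs[0])
    mean_probs = [sum(clip[c] for clip in clip_probs) / num_clips
                  for c in range(num_classes)]
    ranked = sorted(
        ((c, p) for c, p in enumerate(mean_probs) if c != background_index),
        key=lambda item: item[1],
        reverse=True,
    )
    return ranked[:top_k]


# Three clips, classes: 0 = background, 1 and 2 = activities.
clip_probs = [
    [0.6, 0.3, 0.1],
    [0.2, 0.7, 0.1],
    [0.1, 0.8, 0.1],
]
prediction = classify_video(clip_probs, top_k=1)
print(prediction[0][0])  # 1
```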
Detection Post-Processing
44
Per-clip output probabilities: classes (Background, Activity 1, Activity 2, …, Activity 200) × clips (Clip1, Clip2, Clip3, …, ClipN)
A mean filter of k samples is applied along the time axis
Detection Post-Processing
45
Background vs. Activity probability per clip (Clip1, Clip2, Clip3, …, ClipN)
Activity threshold γ
Detection Post-Processing
46
γ
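The detection post-processing (smoothing, thresholding, then reading off segments) can be sketched as below. It is a minimal sketch under stated assumptions: the filter is a centered moving average truncated at the borders, k is odd, the input is the per-clip probability of the predicted activity, and `clip_seconds` (the duration of one clip) is a hypothetical parameter.

```python
def mean_filter(values, k):
    """Smooth a sequence with a centered moving average of odd window size k.

    Windows are truncated at the sequence borders (an assumption).
    """
    half = k // 2
    smoothed = []
    for i in range(len(values)):
        window = values[max(0, i - half): i + half + 1]
        smoothed.append(sum(window) / len(window))
    return smoothed


def detect_segments(activity_probs, k, gamma, clip_seconds):
    """Return (start_s, end_s) intervals where the smoothed probability exceeds gamma."""
    smoothed = mean_filter(activity_probs, k)
    segments, start = [], None
    for i, p in enumerate(smoothed):
        if p > gamma and start is None:
            start = i                      # a segment opens at this clip
        elif p <= gamma and start is not None:
            segments.append((start * clip_seconds, i * clip_seconds))
            start = None
    if start is not None:                  # segment runs to the end of the video
        segments.append((start * clip_seconds, len(smoothed) * clip_seconds))
    return segments


probs = [0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0]
print(detect_segments(probs, k=3, gamma=0.5, clip_seconds=1.0))  # [(2.0, 6.0)]
```

Smoothing before thresholding suppresses isolated spikes, so short false activations are less likely to produce spurious segments.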
4. Results
47
Classification: Audio Features
48
mAP = 0.5755 mAP = 0.5938
Music unrelated to the activity is often added to the videos in post-processing,
causing a decrease in performance when audio and video features are combined.
Classification: Depth Analysis
49
mAP = 0.5938 mAP = 0.5492 mAP = 0.5635
Deeper networks overfit
Classification Results Per Activity
50
Classification Results Per Activity
51
Using the Pommel Horse
Sailing
Playing Ice Hockey
Rock Climbing
BMX
Classification Results Per Activity
52
Drinking Coffee
Peeling Potatoes
Having an Ice Cream
Rock-Paper-Scissors
Polishing shoes
Top Level Classification
53
Detection
54
mAP = 0.2251 mAP = 0.2067
Model with feedback did not improve results
Training with feedback
55
When training: concat(video features, previous ground truth [0 0 1 0 0 0]) → 512-LSTM
Training with feedback
56
When testing: concat(video features, previous prediction [0 0.1 0.6 0.2 0.1 0]) → 512-LSTM
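The difference between the two modes can be sketched as follows. This is a minimal sketch: the helper names are hypothetical, `predict_step` stands in for one step of the trained LSTM, and feeding an all-zero vector at the first clip is an assumption.

```python
def one_hot(index, num_classes):
    """Return a one-hot list with a 1.0 at position index."""
    return [1.0 if i == index else 0.0 for i in range(num_classes)]


def build_training_inputs(video_features, labels, num_classes):
    """Concatenate each clip's features with the previous clip's ground truth.

    The first clip has no predecessor, so it receives an all-zero vector
    (an assumption made for this sketch).
    """
    inputs = []
    previous = [0.0] * num_classes
    for features, label in zip(video_features, labels):
        inputs.append(list(features) + previous)
        previous = one_hot(label, num_classes)
    return inputs


def run_with_feedback(video_features, predict_step, num_classes):
    """At test time, feed back the model's own previous prediction instead."""
    outputs = []
    previous = [0.0] * num_classes
    for features in video_features:
        probs = predict_step(list(features) + previous)
        outputs.append(probs)
        previous = probs
    return outputs


features = [[0.5, 0.1], [0.4, 0.2], [0.3, 0.3]]
labels = [2, 2, 0]
train_inputs = build_training_inputs(features, labels, num_classes=3)
print(train_inputs[1])  # [0.4, 0.2, 0.0, 0.0, 1.0]
```

During training the fed-back vector is always a clean one-hot label, whereas at test time it is a soft and possibly wrong distribution. That mismatch is one plausible reason a feedback model may fail to help.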
Comparing Post-Processing
57
γ
Grid search for optimal parameters
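A grid search over the two post-processing parameters can be sketched as below. The scoring function here is a hypothetical stand-in: in practice it would run the post-processing with the given k and γ and return the detection mAP on a validation set.

```python
from itertools import product


def grid_search(score_fn, k_values, gamma_values):
    """Return the (k, gamma) pair with the highest score, and that score."""
    best_params, best_score = None, float("-inf")
    for k, gamma in product(k_values, gamma_values):
        score = score_fn(k, gamma)
        if score > best_score:
            best_params, best_score = (k, gamma), score
    return best_params, best_score


# Hypothetical stand-in for "run post-processing and compute validation mAP".
def toy_score(k, gamma):
    return -((k - 5) ** 2) - (gamma - 0.2) ** 2


best_params, best_score = grid_search(toy_score, [1, 5, 9], [0.1, 0.2, 0.3])
print(best_params)  # (5, 0.2)
```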
Detection Results per Activity
58
Detection Results per Activity
59
Windsurfing
Riding Bumper Cars
Playing Racquetball
Using the Pommel Horse
Using Parallel Bars
Detection Results per Activity
60
Drinking Coffee
Putting on Shoes
Rock-Paper-Scissors
Removing Curlers
Smoking a Cigarette
Top Level Detection
61
Qualitative Evaluation
62
Ground Truth:
Playing water polo
Prediction:
0.765 Playing water polo
0.202 Swimming
0.007 Springboard diving
Qualitative Evaluation
63
Ground Truth:
Hopscotch
Prediction:
0.848 Running a marathon
0.023 Triple jump
0.022 Javelin throw
Qualitative Evaluation
64
Qualitative Evaluation
65
Challenge Results
66
Classification Task (24 participants)
mAP, results over test subset:
● Baseline: 42.20%
● UPC Team: 58.74%
● Average Performance: 66.26%
● Winner: 93.23%
Slide Design by Issey Masuda
Challenge Results
67
Detection Task (6 participants)
mAP, results over test subset:
● Baseline: 9.70%
● UPC Team: 22.36%
● Average Performance: 29.94%
● Winner: 42.47%
Slide Design by Issey Masuda
5. Conclusions and Future Work
68
Conclusions
69
Classification:
Longboarding
Detection:
42.7s – 193.5s Longboarding
Conclusions
70
Video
Spatial Net
Temporal Net
Output
Winning entry for
ActivityNet
Classification task
Wang, L., Xiong, Y., Wang, Z., & Qiao, Y. (2015). Towards good practices for very deep two-stream ConvNets. arXiv preprint arXiv:1507.02159.
Conclusions
72
Best results were obtained for sport categories, due to the pretraining of C3D with the Sports-1M dataset
Future Work: E2E Training
73
Training the whole pipeline end-to-end would reduce the bias towards sport categories
Future Work: Attention Models
74
Temporal
Attention
Filters
Neural Network
Challenge Submission
75
Open Sourced Contributions
76
github.com/imatge-upc/activitynet-2016-cvprw
Thank you for your attention
77
78
Questions?
79
Support Slides
Metrics
80
Classification: Hit@3
Detection: IoU
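Both metrics are simple to state in code. The sketch below uses hypothetical function names. The ground-truth interval and the labels are taken from the examples elsewhere in this deck, while the predicted interval (40.0 s to 200.0 s) is made up for illustration.

```python
def temporal_iou(pred, truth):
    """Intersection over Union of two (start, end) intervals in seconds."""
    intersection = max(0.0, min(pred[1], truth[1]) - max(pred[0], truth[0]))
    union = (pred[1] - pred[0]) + (truth[1] - truth[0]) - intersection
    return intersection / union if union > 0 else 0.0


def hit_at_k(ranked_labels, true_label, k=3):
    """True if the ground-truth label is among the k highest-ranked predictions."""
    return true_label in ranked_labels[:k]


# Hypothetical prediction against the 42.7 s to 193.5 s Longboarding segment.
print(temporal_iou((40.0, 200.0), (42.7, 193.5)))
print(hit_at_k(["Running a marathon", "Triple jump", "Javelin throw"], "Hopscotch"))  # False
```

A detection typically counts as correct only when its IoU with a ground-truth segment exceeds a chosen threshold and the predicted label matches.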
Smoothing Effect Comparison
81
Post-Processing Effect
82
Smoothing Filter:
Post-Processing Effect
83
Activity Threshold:
Activities Duration
84
AP and Video Appearance Correlation
85
AP and Video Appearance Correlation
86
Preparing Data
87
batch 1
batch 2
Preparing Data
88
Sequence of Video Vector Features
Sequence of Activities
time
Preparing Data
89
time
timesteps
Preparing Data
90
Preparing Data
91
Gradient Propagation
Gathering Audio Features
92
16-Frame Clip
10 ms MFCC Features (a sequence of frames along time t)
Spectral Features
Gathering Audio Features
94
16-Frame Clip
mean MFCC Features
std MFCC Features
Spectral Features
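The per-clip audio descriptor can be sketched as follows. This is a minimal sketch, assuming the MFCC frames falling inside one 16-frame clip are already extracted. The use of the population standard deviation, the function names, and the concatenation order are assumptions.

```python
from statistics import mean, pstdev


def clip_audio_descriptor(mfcc_frames):
    """Summarise the MFCC frames of one clip by per-coefficient mean and std.

    mfcc_frames: list of MFCC vectors (one per 10 ms audio frame).
    Returns [mean_0, ..., mean_{D-1}, std_0, ..., std_{D-1}].
    """
    coefficients = list(zip(*mfcc_frames))  # transpose to per-coefficient series
    means = [mean(series) for series in coefficients]
    stds = [pstdev(series) for series in coefficients]
    return means + stds


def build_rnn_input(video_features, mfcc_frames, spectral_features):
    """Concatenate video features with the audio descriptors of the same clip."""
    return list(video_features) + clip_audio_descriptor(mfcc_frames) + list(spectral_features)


# Toy clip: 3 MFCC frames with 2 coefficients each (real MFCC vectors are longer).
mfcc = [[1.0, 2.0], [3.0, 2.0], [5.0, 2.0]]
descriptor = clip_audio_descriptor(mfcc)
print(descriptor[:2])  # [3.0, 2.0]
```

Reducing the variable number of 10 ms frames to a fixed-length mean and std vector gives every clip an audio descriptor of the same size, so it can be concatenated with the video features.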
Convolutional Neural Network
95
Convolutional Layer
Convolutional Neural Network
96
Pooling Layer
Convolutional Neural Network
97
Fully-Connected Layer
Qualitative Evaluation
98
