PHD MOT CNN Proposal

Uploaded by

Soufiane Khaddadi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

22 views3 pages

PHD MOT CNN Proposal

Uploaded by

Soufiane Khaddadi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

PhD Proposal: Deep Learning methods

for human behavior monitoring

INRIA Sophia Antipolis, STARS group

2004, route des Lucioles, BP93
06902 Sophia Antipolis Cedex – France
http://www-sop.inria.fr/members/Francois.Bremond/

1. Scientific context

STARS group works on automatic video monitoring and human behavior understanding for
health applications. The Deep Learning platform developed in STARS, detects mobile objects,
tracks their trajectory and recognizes related behaviors predefined by experts. This platform
contains several techniques for the detection of people and for the recognition of human
postures/gestures using conventional cameras. However, there are scientific challenges in
people tracking when dealing with real word scenes: cluttered scenes, handling wrong and
incomplete person segmentation, handling static and dynamic occlusions, low contrasted
objects, moving contextual objects (e.g. chairs), similar appearance of clothes among different
people ...
Multiple Object Tracking (MOT) is a fundamental task that aims at associating the same objects
across multiple frames in a video clip. A robust and accurate MOT algorithm is indispensable
in broad applications, such as people monitoring and video surveillance. An end-to-end MOT
algorithm can be divided into three different but closely related tasks; single frame detection
of objects, short term tracking and long-term tracking of said objects, the latter two are usually
merged together into a problem commonly known as data association. This gave rise to the
dominant paradigm in MOT, tracking-by-detection, which first obtains bounding boxes by
detection frame by frame, and then generates trajectories by associating the same objects
between frames. While these tasks are part of the same MOT problem, they are often treated
apart, either trained separately or the data association step is not a deep learning-based approach
which hinders the whole process.
On top of the aforementioned issue of separated training, short term tracking and long-term
tracking have the same objective (data association) but they have different inputs. Short term
tracking deals with per frame feature representation of an object and long-term tracking needs
to deal with a historic feature representation that encapsulates the myriad of changes of an
object across a larger frame span. In other words, we need a memory that tracks said changes,
that is differentiable and can back-propagate the information all the way up to the detection
task.

2. General objectives of the PhD

This work consists in designing efficient People Joint Detection and Tracking algorithms. One
potential approach could use differentiable Memory Banks to build a Deep Learning memory-
based architecture that can be trained to learn a feature representation of a tracklet. Therefore,
the main difference with respect to the current state-of-the-art is that this MemoryTracker will
be conceived to mitigate the loss of information from training separately both detection, short
term tracking and long-term tracking tasks. Designing an efficient memory-based architecture
is far from evident. Indeed, the first challenge is to be able to infer dense representations (i.e.
tracklet vectors). To do so, we propose the use of ROI-alignment from the pipeline of
deformable DETR detector. We also can take advantage of joint detection and short-term
tracking by using 3D CNNs, this can allow us to have temporal and spatial information that is
not available with vanilla 2D CNNs. The use of 3DCNNs can output more reliable tracklets
over a small number of frames and use that information to better update the MemoryBank.
In addition to allowing a truly end-to-end pipeline, the MemoryTracker could overcome the
batch training problem by storing the tracklet feature vector with an intra-batch loss and an out-
of-batch loss. Both losses could be based on triplet loss functions that depend on the current
input sequence (intra batch) and the following sequences (out-of-batch). However, while the
features of the current frames are given to the detection pipeline, the features of the previous
frames are given to the MemoryBank.

To validate the work, we will assess the proposed algorithms on video-monitoring applications
and homecare videos from Nice Hospital and from public places, such as the ones in MOT20
https://motchallenge.net/data/MOT20/.

3. Pre-requisites

Master 2 (or Engineer) in Computer Vision or Mathematics,

With theoretical knowledge in Computer Vision, Mathematics, and Deep Learning (PyTorch,
TensorFlow), and technical background in C++ and Python programming, Linux.

Place of PhD: Inria Sophia Antipolis

4. Schedule
1st year:
● Study the limitations of existing DL People Tracking algorithms.
● Proposing a new approach for People Tracking using Joint Detection and Tracking.
2nd year:
● Start to Improve the proposed DL People Tracking approach.
● Writing papers
rd
3 year:
● Evaluate, improve and optimize proposed DL People Tracking approach.
● Writing papers and PhD manuscript.

5. Contact

[email protected]
6. Bibliography

1. JD. Zuniga, Ujjwal and F. Bremond. DeTracker: A Joint Detection and Tracking
Framework. In Proceedings of the 17th International Joint Conference on Computer
Vision, Imaging and Computer Graphics Theory and Applications, VISAPP 2022,
Virtual, February 6-8, 2022.
2. Berclaz, J., Fleuret, F., Turetken, E., and Fua, P. (2011). Multiple object tracking
using k-shortest paths optimization. IEEE transactions on pattern analysis and
machine intelligence.
3. Bergmann, P., Meinhardt, T., and Leal-Taix´e, L. (2019). Tracking without bells and
whistles. In the IEEE International Conference on Computer Vision (ICCV).
4. Chu, P. and Ling, H. (2019). Famnet: Joint learning of feature, affinity and multi-
dimensional assignment for online multiple object tracking. 2019 IEEE/CVF
International Conference on Computer Vision (ICCV), pages 6171–6180.
5. Dehghan, A., Modiri Assari, S., and Shah, M. (2015). Gmmcp tracker: Globally
optimal generalized maximum multi clique problem for multiple object tracking. In
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition,
pages 4091–4099.
6. Dong, X. and Shen, J. (2018). Triplet loss in siamese network for object tracking. In
Proceedings of the European Conference on Computer Vision (ECCV), pages 459–
474.
7. Fabbri, M., Lanzi, F., Calderara, S., Palazzi, A., Vezzani, R., and Cucchiara, R.
(2018). Learning to detect and track visible and occluded body joints in a virtual
world. In European Conference on Computer Vision (ECCV).
8. Feichtenhofer, C., Pinz, A., and Zisserman, A. (2017). Detect to track and track to
detect. In 2017 IEEE International Conference on Computer Vision (ICCV), pages
3057–3065.
9. Feng, W., Hu, Z., Wu, W., Yan, J., and Ouyang, W. (2019). Multi-object tracking
with multiple cues and switcheraware classification. CoRR, abs/1901.06129.
10. He, K., Gkioxari, G., Doll´ar, P., and Girshick, R. (2017). Mask r-cnn. In
Proceedings of the IEEE international conference on computer vision, pages 2961–
2969.

Wpcsin 0 1 2 3 4 5 6 7 Wptools Wpgate: Domain Directory Directories Description
No ratings yet
Wpcsin 0 1 2 3 4 5 6 7 Wptools Wpgate: Domain Directory Directories Description
43 pages
Self-Supervised Deep Correlation Tracking
No ratings yet
Self-Supervised Deep Correlation Tracking
10 pages
Multi Object Tracking - A Literature Review
No ratings yet
Multi Object Tracking - A Literature Review
23 pages
Real-Time Multiple Object Tracking Using Deep Learning Methods2021
No ratings yet
Real-Time Multiple Object Tracking Using Deep Learning Methods2021
30 pages
1910.09761
No ratings yet
1910.09761
25 pages
2307.07635v3
No ratings yet
2307.07635v3
24 pages
Lecture5 -Query_Processing 1
No ratings yet
Lecture5 -Query_Processing 1
23 pages
Thesis Object Tracking
100% (3)
Thesis Object Tracking
4 pages
2411.09388v1
No ratings yet
2411.09388v1
10 pages
2023-1-7
No ratings yet
2023-1-7
8 pages
Tag Draft Especializado
No ratings yet
Tag Draft Especializado
14 pages
2207.04551v2
No ratings yet
2207.04551v2
38 pages
Applsci 12 09597 v2
No ratings yet
Applsci 12 09597 v2
16 pages
Karaev 等 - 2023 - CoTracker It is Better to Track Together
No ratings yet
Karaev 等 - 2023 - CoTracker It is Better to Track Together
13 pages
Deep_Learning-Based_Person_Detection_and_Classification_for_Far_Field_Video_Surveillance
No ratings yet
Deep_Learning-Based_Person_Detection_and_Classification_for_Far_Field_Video_Surveillance
4 pages
ppr (1)
No ratings yet
ppr (1)
15 pages
SRS Project
No ratings yet
SRS Project
10 pages
Advantages of A PROFINET-Switch - Version 4 EN
No ratings yet
Advantages of A PROFINET-Switch - Version 4 EN
6 pages
ZARZOSO PHD TD AF
No ratings yet
ZARZOSO PHD TD AF
3 pages
Extendable Multiple Nodes Recurrent Tracking Framework With 11e0w4nc
No ratings yet
Extendable Multiple Nodes Recurrent Tracking Framework With 11e0w4nc
16 pages
(SOTA) Deep Learning in Multi-Object Detection and Tracking State of The Art
No ratings yet
(SOTA) Deep Learning in Multi-Object Detection and Tracking State of The Art
30 pages
Guia de instalação e-SmartDX - Linux
No ratings yet
Guia de instalação e-SmartDX - Linux
4 pages
Sujet PostDoc J.Raffort
No ratings yet
Sujet PostDoc J.Raffort
2 pages
Signal Processing Onramp Quick Reference
No ratings yet
Signal Processing Onramp Quick Reference
6 pages
IET Computer Vision - 2019 - Xu - Deep Learning For Multiple Object Tracking A Survey
No ratings yet
IET Computer Vision - 2019 - Xu - Deep Learning For Multiple Object Tracking A Survey
14 pages
Human Detection and Tracking With Deep Convolutional Neural Networks
No ratings yet
Human Detection and Tracking With Deep Convolutional Neural Networks
24 pages
Manual en
No ratings yet
Manual en
85 pages
Single Object Tracking A Survey of Methods Dataset
No ratings yet
Single Object Tracking A Survey of Methods Dataset
15 pages
ShivamRai Resume
No ratings yet
ShivamRai Resume
1 page
Multiple Object Tracking in Recent Times a Literat
No ratings yet
Multiple Object Tracking in Recent Times a Literat
18 pages
FMR-5000-C-13
No ratings yet
FMR-5000-C-13
4 pages
X X FX X X X FX X X A: Mathmanti Ou
No ratings yet
X X FX X X X FX X X A: Mathmanti Ou
1 page
Articulo 2
No ratings yet
Articulo 2
5 pages
Experiment 3 - Interfacing Analog Inputs
No ratings yet
Experiment 3 - Interfacing Analog Inputs
10 pages
Fast CNN-Based Object Tracking Using Localization Layers and Deep Features Interpolation
No ratings yet
Fast CNN-Based Object Tracking Using Localization Layers and Deep Features Interpolation
6 pages
PHP Variables
No ratings yet
PHP Variables
7 pages
11
No ratings yet
11
19 pages
Ethical Hacker
No ratings yet
Ethical Hacker
3 pages
Number Guessing Game
No ratings yet
Number Guessing Game
14 pages
Object Tracking Methods-A Review
No ratings yet
Object Tracking Methods-A Review
7 pages
Object Detection With Deep Learning_ A Review Summary
No ratings yet
Object Detection With Deep Learning_ A Review Summary
11 pages
SQL Query To Find Second Highest Salary - GeeksforGeeks
No ratings yet
SQL Query To Find Second Highest Salary - GeeksforGeeks
6 pages
Documentation6 13 15
No ratings yet
Documentation6 13 15
3 pages
1.mot Ijsae
No ratings yet
1.mot Ijsae
10 pages
Real Time Robust Human Detection and Tracking System: Jianpeng Zhou and Jack Hoang I3DVR International Inc
No ratings yet
Real Time Robust Human Detection and Tracking System: Jianpeng Zhou and Jack Hoang I3DVR International Inc
8 pages
Reloj Mitsubishi
No ratings yet
Reloj Mitsubishi
8 pages
Guidelines and Mechanics For Disassemble - Assemble - Crimping
No ratings yet
Guidelines and Mechanics For Disassemble - Assemble - Crimping
3 pages
Electronics 10 02406 v2
No ratings yet
Electronics 10 02406 v2
31 pages
Navigation in Crowded Spaces Using Trajectory Prediction
No ratings yet
Navigation in Crowded Spaces Using Trajectory Prediction
3 pages
Tianyu Yang Learning Dynamic Memory ECCV 2018 Paper
No ratings yet
Tianyu Yang Learning Dynamic Memory ECCV 2018 Paper
16 pages
Hat Certified System Administrator in Red Hat Openstack Exam
No ratings yet
Hat Certified System Administrator in Red Hat Openstack Exam
3 pages
YOLO Based Real Time Human Detection Using Deep Learning
No ratings yet
YOLO Based Real Time Human Detection Using Deep Learning
9 pages
30 Jenkins Interview Questions and Answers
No ratings yet
30 Jenkins Interview Questions and Answers
5 pages
Multiple Object Tracking For Video Analysis and Surveillance A Literature Survey
No ratings yet
Multiple Object Tracking For Video Analysis and Surveillance A Literature Survey
10 pages
Decision Making and Branching: Explain All About If Statements?
No ratings yet
Decision Making and Branching: Explain All About If Statements?
21 pages
Study On MOT
No ratings yet
Study On MOT
17 pages
ds-Dot-Matrix-Printer-DL7400-Pro
No ratings yet
ds-Dot-Matrix-Printer-DL7400-Pro
4 pages
CORT: Class-Oriented Real-Time Tracking For Embedded Systems
No ratings yet
CORT: Class-Oriented Real-Time Tracking For Embedded Systems
10 pages
Design of An Effective Multiple Objects Tracking Framework For Dynamic Video Scenes
No ratings yet
Design of An Effective Multiple Objects Tracking Framework For Dynamic Video Scenes
13 pages
Trackformer
No ratings yet
Trackformer
16 pages
An_Investigation_of_Deep_Neural_Network_based_Techniques_for_Object_Detection_an
No ratings yet
An_Investigation_of_Deep_Neural_Network_based_Techniques_for_Object_Detection_an
6 pages
TCET-SEM 5 Syllabus
No ratings yet
TCET-SEM 5 Syllabus
39 pages
Ijaerv10n9spl 339
No ratings yet
Ijaerv10n9spl 339
9 pages
Multiple Object Tracking: A Literature Review
No ratings yet
Multiple Object Tracking: A Literature Review
36 pages
CSQ3 Soln
No ratings yet
CSQ3 Soln
2 pages
Topic: 1.3.6 Operating Systems: Loading An Operating System
No ratings yet
Topic: 1.3.6 Operating Systems: Loading An Operating System
5 pages
SOYAL Controllers ID
No ratings yet
SOYAL Controllers ID
2 pages
CNNTracking TNN10 Human
No ratings yet
CNNTracking TNN10 Human
14 pages
Computer Vision Paper
No ratings yet
Computer Vision Paper
3 pages
Social Distance
No ratings yet
Social Distance
18 pages
Real Time Object Detection and Tracking Using Deep Learning and Opencv
No ratings yet
Real Time Object Detection and Tracking Using Deep Learning and Opencv
4 pages
1 PB
No ratings yet
1 PB
8 pages
Cisco Secure Network Server Ordering Guide
No ratings yet
Cisco Secure Network Server Ordering Guide
7 pages
Multi Object Tracking in Traffic Environments: A Systematic Literature
No ratings yet
Multi Object Tracking in Traffic Environments: A Systematic Literature
13 pages
SPARC T7-T8 Battle Card 13 Sept 2017 - Partner
No ratings yet
SPARC T7-T8 Battle Card 13 Sept 2017 - Partner
2 pages
Programming Techniques Project
No ratings yet
Programming Techniques Project
6 pages
Adc 0804
100% (1)
Adc 0804
5 pages
Booth Multiplier On 23 06 10
No ratings yet
Booth Multiplier On 23 06 10
25 pages
Object Detection - Deep Learning: Jamia Hamdard
No ratings yet
Object Detection - Deep Learning: Jamia Hamdard
26 pages
YOLO Object Detection Explained: Definitive Reference for Developers and Engineers
From Everand
YOLO Object Detection Explained: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Supervised Machine Learning for Science: How to stop worrying and love your black box
From Everand
Supervised Machine Learning for Science: How to stop worrying and love your black box
Christoph Molnar
No ratings yet
Introduction to Data Science Using R
From Everand
Introduction to Data Science Using R
Prema Alla
No ratings yet
Data Mining: Concepts, Fundamentals And Applications
From Everand
Data Mining: Concepts, Fundamentals And Applications
Enrico Guardelli
No ratings yet
Activity Recognition: Fundamentals and Applications
From Everand
Activity Recognition: Fundamentals and Applications
Fouad Sabry
No ratings yet
Pedestrian Detection: Please, suggest a subtitle for a book with title 'Pedestrian Detection' within the realm of 'Computer Vision'. The suggested subtitle should not have ':'.
From Everand
Pedestrian Detection: Please, suggest a subtitle for a book with title 'Pedestrian Detection' within the realm of 'Computer Vision'. The suggested subtitle should not have ':'.
Fouad Sabry
No ratings yet
Articulated Body Pose Estimation: Unlocking Human Motion in Computer Vision
From Everand
Articulated Body Pose Estimation: Unlocking Human Motion in Computer Vision
Fouad Sabry
No ratings yet
Percept: Fundamentals and Applications
From Everand
Percept: Fundamentals and Applications
Fouad Sabry
No ratings yet
Optical Braille Recognition: Empowering Accessibility Through Visual Intelligence
From Everand
Optical Braille Recognition: Empowering Accessibility Through Visual Intelligence
Fouad Sabry
No ratings yet
Computer Vision: Exploring the Depths of Computer Vision
From Everand
Computer Vision: Exploring the Depths of Computer Vision
Fouad Sabry
No ratings yet
Visual Sensor Network: Exploring the Power of Visual Sensor Networks in Computer Vision
From Everand
Visual Sensor Network: Exploring the Power of Visual Sensor Networks in Computer Vision
Fouad Sabry
No ratings yet
Automatic Target Recognition: Fundamentals and Applications
From Everand
Automatic Target Recognition: Fundamentals and Applications
Fouad Sabry
No ratings yet
Automatic Target Recognition: Advances in Computer Vision Techniques for Target Recognition
From Everand
Automatic Target Recognition: Advances in Computer Vision Techniques for Target Recognition
Fouad Sabry
No ratings yet
Computer Vision: Fundamentals and Applications
From Everand
Computer Vision: Fundamentals and Applications
Fouad Sabry
No ratings yet
Object Detection: Advances, Applications, and Algorithms
From Everand
Object Detection: Advances, Applications, and Algorithms
Fouad Sabry
No ratings yet