Course guide

230726 - CVDL - Computer Vision with Deep Learning

Last modified: 24/05/2024

Unit in charge: Barcelona School of Telecommunications Engineering
Teaching unit: 739 - TSC - Department of Signal Theory and Communications.

Degree: MASTER'S DEGREE IN TELECOMMUNICATIONS ENGINEERING (Syllabus 2013). (Optional subject).

MASTER'S DEGREE IN ADVANCED TELECOMMUNICATION TECHNOLOGIES (Syllabus 2019). (Optional subject).

Academic year: 2024    ECTS Credits: 5.0    Languages: English

LECTURER

Coordinating lecturer: See here:

https://telecos.upc.edu/ca/estudis/curs-actual/professorat-responsables-coordinadors/responsables-assignatura

Others: See here:

https://telecos.upc.edu/ca/estudis/curs-actual/professorat-responsables-coordinadors/professorat-assignat-idioma

PRIOR SKILLS

Important: To follow the course, you should already be familiar with the following:
- Image processing: pixels, color spaces, histograms, frequency domain representation
- Digital signal processing: linear filters, convolution
- Vector and matrix algebra

Some familiarity with Python is useful, but it can easily be acquired during the course.

DEGREE COMPETENCES TO WHICH THE SUBJECT CONTRIBUTES

Specific:
1. Ability to apply information theory methods, adaptive modulation and channel coding, as well as advanced techniques of digital
signal processing to communication and audiovisual systems.
2. Ability to integrate Telecommunication Engineering technologies and systems, as a generalist, and in broader and multidisciplinary
contexts, such as bioengineering, photovoltaic conversion, nanotechnology and telemedicine.

Transversal:
3. TEAMWORK: Being able to work in an interdisciplinary team, whether as a member or as a leader, with the aim of contributing to
projects pragmatically and responsibly and making commitments in view of the resources that are available.

4. EFFECTIVE USE OF INFORMATION RESOURCES: Managing the acquisition, structuring, analysis and display of data and information
in the chosen area of specialisation and critically assessing the results obtained.

5. FOREIGN LANGUAGE: Achieving a level of spoken and written proficiency in a foreign language, preferably English, that meets the
needs of the profession and the labour market.

Date: 30/05/2024 Page: 1 / 4

TEACHING METHODOLOGY

- Lectures
- Practical work
- Individual work (distance)
- Exercises
- Mid-term and final-term exams

LEARNING OBJECTIVES OF THE SUBJECT

Learning objectives of the subject:

This course gives an overview of the concepts and applications of computer vision, covering both classical and deep learning
methods. We introduce low-level techniques such as feature extraction and matching, edge detection, camera and projection
models, and optical flow; mid-level topics such as video segmentation and feature tracking; and high-level methods such as object
tracking. Example applications, such as face and object recognition, are then presented.

Learning results of the subject:

- Ability to understand and use techniques for image and video analysis: feature extraction, video segmentation, stereo, object
detection.
- Ability to use computer vision algorithms to implement high-level applications.

STUDY LOAD

Type                 Hours    Percentage
Self study           86.0     68.80
Hours large group    39.0     31.20

Total learning time: 125 h

CONTENTS

1. Introduction

Description:
- Motivation, types of problems in CV
- Image formation, perception, 3D sensors

Full-or-part-time: 7h
Theory classes: 3h
Self study: 4h

2. Image Structure

Description:
- Color, texture, filtering, and contours
- Detection and representation of interesting points and 'blobs'
- Modeling: RANSAC, Hough transform
- Saliency maps

Full-or-part-time: 29h
Theory classes: 9h
Guided activities: 4h
Self study: 16h

3. Stereo and 3D applications

Description:
- Single-camera geometry, camera calibration
- Epipolar geometry, homography
- Camera pose estimation and sensor registration using deep learning

Full-or-part-time: 30h
Theory classes: 9h
Guided activities: 4h
Self study: 17h

4. Video tracking

Description:
- Optical flow: Lucas-Kanade, Shi-Tomasi, Deep Learning methods
- Bayesian tracking: Kalman, Particle filters
- Deep Learning tracking methods

Full-or-part-time: 29h
Theory classes: 9h
Guided activities: 4h
Self study: 16h

5. Detection and recognition

Description:
- Introduction to visual recognition. Review of machine learning. Deep learning and convolutional neural networks
- Image classification: Bag of words model. Image classification using CNNs
- Object detection: Sliding windows and local features. Object detection using CNNs
- Object segmentation: Semantic segmentation. Instance segmentation

Full-or-part-time: 30h
Theory classes: 9h
Guided activities: 4h
Self study: 17h

ACTIVITIES

EXERCISES

Description:
- Detecting contours and modeling shapes: Canny, Hough, RANSAC, Deep Learning
- Finding correspondences between images: Harris, SIFT
- Fundamental matrix estimation
- Application of homography: panorama creation
- Object detection & recognition

Full-or-part-time: 6h
Self study: 6h

EXTENDED ANSWER TEST

Description:
Mid-term examination.

Full-or-part-time: 2h
Theory classes: 2h

EXTENDED ANSWER TEST

Description:
Second-term examination.

Full-or-part-time: 2h
Theory classes: 2h

GRADING SYSTEM

First-term examination: 40%
Second-term examination: 40%
Laboratory/Exercises assessments: 20%

BIBLIOGRAPHY

Basic:
- Szeliski, R. Computer vision: algorithms and applications [online]. London: Springer, 2011 [Consultation: 20/10/2014]. Available
on: http://site.ebrary.com/lib/upcatalunya/docDetail.action?docID=10421311. ISBN 9781848829350.
- Forsyth, D.A.; Ponce, J. Computer vision: a modern approach [online]. 2nd ed. Boston, Mass.: Pearson Education, 2012
[Consultation: 09/09/2020]. Available on: https://ebookcentral.proquest.com/lib/upcatalunya-ebooks/detail.action?docID=5173504.
ISBN 9780273764144.

Complementary:
- Hartley, R.; Zisserman, A. Multiple view geometry in computer vision. 2nd ed. Cambridge: Cambridge University Press, 2003. ISBN
0521540518.
- Wang, Y.; Ostermann, J.; Zhang, Y.-Q. Video processing and communications. Upper Saddle River: Prentice Hall, 2002. ISBN
9788131733646.
- Hanjalic, A. Content-based analysis of digital video [online]. Boston: Kluwer Academic, 2004 [Consultation: 29/07/2013]. Available
on: http://link.springer.com/book/10.1007/b106003/page/1. ISBN 978-1402081149.

RESOURCES

Other resources:
Google Colab
