Paper Review of Five Machine Vision Topics

Uploaded by

sashidhar avuthu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

19 views

Paper Review of Five Machine Vision Topics

Uploaded by

sashidhar avuthu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

Paper Review of Five Machine Vision Topics

Sashidhar Reddy Avuthu

Sa2220

1. Introduction of Background
The field of machine vision has seen exponential growth with the integration of deep learning,
providing breakthroughs across diverse applications like object detection, image restoration,
segmentation, facial recognition, and scene understanding. This review covers five papers
highlighting different aspects of machine vision, exploring methodologies, proposed solutions,
and their contributions to real-world scenarios.

2. Related Works for Quick Comparison

- Object Detection Using Deep Learning, CNNs, and Vision Transformers has built upon
foundational works like YOLO and Faster R-CNN, extending capabilities through Vision
Transformers for more accurate object representation.
- Image Restoration via Neural Networks and GANs relates to traditional image processing
methods such as wavelet transforms but moves beyond through GANs' capacity to generate
visually plausible content.
- Semantic Segmentation of Urban Scenes Using Deep Networks contrasts with earlier
segmentation techniques by employing a modified U-Net with dense connections, addressing
challenges in urban scene complexity.
- Facial Recognition Using Sparse Representation and Deep Learning enhances previous
sparse representation models with deep learning layers for robust occlusion handling.
- Scene Understanding with Self-Supervised Learning leverages self-supervised
frameworks, contrasting with fully supervised methods that demand extensive labeled datasets.

3. Proposed Methods and Results

- Object Detection: This paper proposed combining CNNs and Vision Transformers, enabling
models to capture long-range dependencies and achieve state-of-the-art detection accuracy on
public benchmarks like COCO.
- Results: Significant improvement in object localization and classification metrics compared to
traditional CNN-based models.
- Image Restoration with GANs: The proposed approach utilized a GAN-based architecture for
denoising and inpainting, generating clearer images than conventional methods.
- Results: Achieved notable reductions in noise levels with visual quality improvements on
benchmark datasets like ImageNet.
- Semantic Segmentation: The improved U-Net architecture, coupled with dense connections,
optimized pixel-level classification tasks in urban scenes.
- Results: Outperformed baseline models in terms of accuracy and robustness on the
Cityscapes dataset.
- Facial Recognition: The hybrid model integrated sparse representation with deep
convolutional layers for enhanced recognition capabilities.
- Results: High recognition accuracy in datasets with occluded faces, outperforming models
that used only sparse representation or deep learning.
- Scene Understanding with Self-Supervised Learning: By leveraging contrastive learning,
this method reduced reliance on labeled data for scene feature extraction.
- Results: Effective cross-domain generalization on several scene understanding benchmarks,
albeit with a slight performance gap compared to supervised models.

4. Analysis and Summarization of Pros and Cons

Object Detection Using Deep Learning, CNNs, and Vision Transformers:
- Pros: Superior object representation; scalable model structure.
- Cons: Computationally intensive; real-time deployment challenges.
Image Restoration via Neural Networks and GANs:
- Pros: High visual fidelity for image restoration; adaptable to various image degradation types.
- Cons: Training complexity and stability issues.
Semantic Segmentation of Urban Scenes:
- Pros: Robust in complex urban settings; accurate segmentation.
- Cons: High memory and computational costs; slower inference speed.
Facial Recognition Using Sparse Representation and Deep Learning:
- Pros: Effective against occlusions; high recognition accuracy.
- Cons: Limited scalability to real-time systems.
Scene Understanding with Self-Supervised Learning:
- Pros: Cost-effective data labeling; good cross-domain performance.
- Cons: Lower performance on fine-grained details compared to supervised models.

References
1. Amjoud, Ayoub Benali, and Mustapha Amrouch. "Object detection using deep learning, CNNs and
vision transformers: A review." IEEE Access 11 (2023): 35479-35516.
2. Rama, P., et al. "Advancement in Image Restoration Through GAN-based Approach." 2024 15th
International Conference on Computing Communication and Networking Technologies (ICCCNT). IEEE,
2024.
3. Li, Yanyi, Jian Shi, and Yuping Li. "Real-Time Semantic Understanding and Segmentation of Urban
Scenes for Vehicle Visual Sensors by Optimized DCNN Algorithm." Applied Sciences 12.15 (2022): 7811.
4. Wright, John, et al. "Robust face recognition via sparse representation." IEEE transactions on pattern
analysis and machine intelligence 31.2 (2008): 210-227.
5. Jiang, Huaizu, et al. "Self-supervised relative depth learning for urban scene understanding."
Proceedings of the european conference on computer vision (eccv). 2018.

Salesforce exam practice test
No ratings yet
Salesforce exam practice test
15 pages
Computer Vision55
100% (1)
Computer Vision55
268 pages
CVlecture 6
No ratings yet
CVlecture 6
33 pages
Technologies 12 00015
No ratings yet
Technologies 12 00015
40 pages
Isassignment
No ratings yet
Isassignment
10 pages
sensors-25-00035-v2
No ratings yet
sensors-25-00035-v2
6 pages
Literature Survey For Robotics
No ratings yet
Literature Survey For Robotics
6 pages
Journal Review (Is)
No ratings yet
Journal Review (Is)
7 pages
Machine Learning: Machine Learning (ML) Applications in Computer Vision (CV)
No ratings yet
Machine Learning: Machine Learning (ML) Applications in Computer Vision (CV)
6 pages
Anand Bhat PHD Thesis
No ratings yet
Anand Bhat PHD Thesis
173 pages
9781638280712-summary
No ratings yet
9781638280712-summary
65 pages
(PDF) Overview of Computer Vision
No ratings yet
(PDF) Overview of Computer Vision
4 pages
Research Paper
No ratings yet
Research Paper
7 pages
Object Detection Techniques A Review
No ratings yet
Object Detection Techniques A Review
9 pages
Fin Irjmets1684232858
No ratings yet
Fin Irjmets1684232858
9 pages
PART B ETI-1
No ratings yet
PART B ETI-1
7 pages
Last Lab Report
No ratings yet
Last Lab Report
6 pages
TFG_Gabriel-Ciprian_Dinu_2019
No ratings yet
TFG_Gabriel-Ciprian_Dinu_2019
60 pages
4
No ratings yet
4
5 pages
IET Computer Vision - 2024 - Massoud - Learnable fusion mechanisms for multimodal object detection in autonomous vehicles
No ratings yet
IET Computer Vision - 2024 - Massoud - Learnable fusion mechanisms for multimodal object detection in autonomous vehicles
13 pages
Overview_of_object_detection_based_on_deep_learnin
No ratings yet
Overview_of_object_detection_based_on_deep_learnin
7 pages
18 TallapallyHarini 162-170
No ratings yet
18 TallapallyHarini 162-170
9 pages
Master's Thesis Deep Learning For Visual Recognition: Remi Cadene Supervised by Nicolas Thome and Matthieu Cord
No ratings yet
Master's Thesis Deep Learning For Visual Recognition: Remi Cadene Supervised by Nicolas Thome and Matthieu Cord
58 pages
LiDar Re
No ratings yet
LiDar Re
13 pages
Research Paper UGR_Team-07
No ratings yet
Research Paper UGR_Team-07
16 pages
Ijlbps 6620dd20c5747
No ratings yet
Ijlbps 6620dd20c5747
8 pages
An_Investigation_of_Deep_Neural_Network_based_Techniques_for_Object_Detection_an
No ratings yet
An_Investigation_of_Deep_Neural_Network_based_Techniques_for_Object_Detection_an
6 pages
Object Detection in Pytorch Using Mask R-CNN
No ratings yet
Object Detection in Pytorch Using Mask R-CNN
4 pages
E3sconf Iconnect2023 04032
No ratings yet
E3sconf Iconnect2023 04032
11 pages
Object Detection With Deep Learning_ A Review Summary
No ratings yet
Object Detection With Deep Learning_ A Review Summary
11 pages
remotesensing-14-03324-v2
No ratings yet
remotesensing-14-03324-v2
15 pages
pepar 1
No ratings yet
pepar 1
13 pages
Thesis AlexanderJaus BIBTEX
No ratings yet
Thesis AlexanderJaus BIBTEX
9 pages
Real Time Object Detection With Deep Learning and OpenCV
No ratings yet
Real Time Object Detection With Deep Learning and OpenCV
5 pages
Image Restoration Using Deep Learning
No ratings yet
Image Restoration Using Deep Learning
12 pages
AI Models for 3D Object Detection in Autonomous Systems: Leveraging LiDAR and Depth Sensing
No ratings yet
AI Models for 3D Object Detection in Autonomous Systems: Leveraging LiDAR and Depth Sensing
8 pages
Objectdetection
No ratings yet
Objectdetection
7 pages
Real Time Object Detection System
No ratings yet
Real Time Object Detection System
31 pages
Deep Learning Research Paper
No ratings yet
Deep Learning Research Paper
1 page
Seminar
No ratings yet
Seminar
23 pages
Computer Vision
No ratings yet
Computer Vision
2 pages
Object Recognition On The REEM Robot
No ratings yet
Object Recognition On The REEM Robot
88 pages
Comparative analysis of feature descriptors and classifiers for real-time object detection
No ratings yet
Comparative analysis of feature descriptors and classifiers for real-time object detection
11 pages
Realtime Visual Recognition in Deep Convolutional Neural Networks
No ratings yet
Realtime Visual Recognition in Deep Convolutional Neural Networks
13 pages
Comprehensive_Review_of_R-CNN_and_its_Variant_Arch
No ratings yet
Comprehensive_Review_of_R-CNN_and_its_Variant_Arch
8 pages
maahi_over-6-54
No ratings yet
maahi_over-6-54
49 pages
Predicting Images Using Convolutional Networks - Visual Scene Understanding With Pixel Maps
No ratings yet
Predicting Images Using Convolutional Networks - Visual Scene Understanding With Pixel Maps
149 pages
MVS_Expt7 Different Technique of Object Recognition
No ratings yet
MVS_Expt7 Different Technique of Object Recognition
6 pages
Manuscript Template 2
No ratings yet
Manuscript Template 2
13 pages
Yu Et Al - 2016 - Recent Developments On Deep Big Vision
No ratings yet
Yu Et Al - 2016 - Recent Developments On Deep Big Vision
2 pages
2802 8020 1 PB
No ratings yet
2802 8020 1 PB
3 pages
A_review_of_advances_in_image_recognition_models_F
No ratings yet
A_review_of_advances_in_image_recognition_models_F
5 pages
Transformer-Based Visual Segmentation - A Survey
No ratings yet
Transformer-Based Visual Segmentation - A Survey
23 pages
Maahi Rajpoot Project
No ratings yet
Maahi Rajpoot Project
21 pages
Dint A 00062
No ratings yet
Dint A 00062
16 pages
Li 2021 J. Phys.: Conf. Ser. 1827 012085
No ratings yet
Li 2021 J. Phys.: Conf. Ser. 1827 012085
11 pages
Image Sorting Using Object Detection and Face Recognition
No ratings yet
Image Sorting Using Object Detection and Face Recognition
6 pages
Object Detection For Indoor Localization System
No ratings yet
Object Detection For Indoor Localization System
3 pages
Machine Learning - Advanced Concepts
From Everand
Machine Learning - Advanced Concepts
Derrick Mwiti
No ratings yet
Pyramid Image Processing: Exploring the Depths of Visual Analysis
From Everand
Pyramid Image Processing: Exploring the Depths of Visual Analysis
Fouad Sabry
No ratings yet
Object Detection: Advances, Applications, and Algorithms
From Everand
Object Detection: Advances, Applications, and Algorithms
Fouad Sabry
No ratings yet
Life 2e AmE WB3 U05
No ratings yet
Life 2e AmE WB3 U05
4 pages
How To Rank - Bank FREE APPs - Mobile Games
No ratings yet
How To Rank - Bank FREE APPs - Mobile Games
38 pages
Texto presentación Inglés
No ratings yet
Texto presentación Inglés
4 pages
CK College of Engineering & Technology
No ratings yet
CK College of Engineering & Technology
8 pages
NCERT Solutions For Class 12 Maths Chapter 1 Relations and Functions Exercise 1.3
No ratings yet
NCERT Solutions For Class 12 Maths Chapter 1 Relations and Functions Exercise 1.3
8 pages
Student Record-Keeping Management System
No ratings yet
Student Record-Keeping Management System
17 pages
DOCMAP Review Forms
No ratings yet
DOCMAP Review Forms
8 pages
ESIM384 EN Instal WEB v1.2
No ratings yet
ESIM384 EN Instal WEB v1.2
198 pages
Bizhub C451 Spec & Install V2
No ratings yet
Bizhub C451 Spec & Install V2
13 pages
TAMID Stock Pitch
No ratings yet
TAMID Stock Pitch
31 pages
Linux Magazine USA - Issue 266 January 2023
100% (2)
Linux Magazine USA - Issue 266 January 2023
102 pages
Statement List (STL) For S7-300 and S7-400 Programming: October 2015
No ratings yet
Statement List (STL) For S7-300 and S7-400 Programming: October 2015
4 pages
chap_16
No ratings yet
chap_16
17 pages
Java Imp
No ratings yet
Java Imp
8 pages
Reading Material 2 Lesson 2
No ratings yet
Reading Material 2 Lesson 2
15 pages
DS-K1T343EFX Face Recognition Terminal
No ratings yet
DS-K1T343EFX Face Recognition Terminal
4 pages
Demi Unit-5 Notes
No ratings yet
Demi Unit-5 Notes
23 pages
GSPINTRO105 Navigating The Interface Script
No ratings yet
GSPINTRO105 Navigating The Interface Script
2 pages
Protocols For QoS
No ratings yet
Protocols For QoS
66 pages
Level 1 Visual Fox Pro
No ratings yet
Level 1 Visual Fox Pro
66 pages
Ios2601 Student Guideline For MCQ Exams 20201
No ratings yet
Ios2601 Student Guideline For MCQ Exams 20201
2 pages
Sap User List North America 2012: Organizations Contacts Emails
No ratings yet
Sap User List North America 2012: Organizations Contacts Emails
1 page
Numbersystem Assignment PDF
No ratings yet
Numbersystem Assignment PDF
13 pages
Unit 3 - Theory of Computation - WWW - Rgpvnotes.in
No ratings yet
Unit 3 - Theory of Computation - WWW - Rgpvnotes.in
14 pages
Model Theory Examinations: Ime: 3 Hours Answer ALL Questions Max. Marks 100
No ratings yet
Model Theory Examinations: Ime: 3 Hours Answer ALL Questions Max. Marks 100
2 pages
Classification of Computer Software
No ratings yet
Classification of Computer Software
4 pages
Analogy - Verbal Reasoning Questions and Answers Page 4
No ratings yet
Analogy - Verbal Reasoning Questions and Answers Page 4
2 pages
A Neighborhood of Infinity - You Could Have Invented Monads! (And Maybe You Already Have PDF
No ratings yet
A Neighborhood of Infinity - You Could Have Invented Monads! (And Maybe You Already Have PDF
30 pages

Paper Review of Five Machine Vision Topics

Uploaded by

Paper Review of Five Machine Vision Topics

Uploaded by

Paper Review of Five Machine Vision Topics

Sashidhar Reddy Avuthu

2. Related Works for Quick Comparison

3. Proposed Methods and Results

4. Analysis and Summarization of Pros and Cons

You might also like