GAN Script
followed by the limitations of object detection models and how GANs can help overcome them.
These are some of the papers that provided insights into the advancements and applications of GANs.
A GAN is a deep learning generative model that has recently gained a great deal of popularity.
GANs are a relatively recent technique for learning in both supervised and semi-supervised modes.
The GAN architecture has two main neural networks that are trained simultaneously.
They play a minimax game in which one network tries to reduce the loss while the other tries to increase it.
With the obtained losses the models are trained again, and over time the generator learns how to convince the
discriminator that its output is real.
Discriminator loss: -[ln(D(real)) + ln(1 - D(fake))]
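That loss is the standard binary cross-entropy applied to the discriminator's predictions. A minimal pure-Python sketch (the function names are my own, not from the script):

```python
import math

def discriminator_loss(pred_real, pred_fake):
    """Binary cross-entropy for the discriminator:
    -[ln(D(real)) + ln(1 - D(fake))]."""
    return -(math.log(pred_real) + math.log(1.0 - pred_fake))

def generator_loss(pred_fake):
    """Non-saturating generator loss: -ln(D(fake)).
    The generator wants the discriminator to output 1 for fakes."""
    return -math.log(pred_fake)

# An undecided discriminator that outputs 0.5 for everything
# gives loss 2*ln(2) ~ 1.386:
print(discriminator_loss(0.5, 0.5))  # ~1.3863
```

As the discriminator gets better (real scores near 1, fake scores near 0), its loss shrinks; the generator's loss then pushes it to raise the fake scores back up, which is the minimax game described above.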
Generator Model Layers
Input layer
FC (fully connected) layer
upsampling
activation function
output layer
Discriminator Model Layers
input layer
conv layer
pooling layer
FC layer
output layer
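To make the two layer stacks concrete, here is a minimal sketch of the standard output-size arithmetic as the data flows through them; the specific kernel, stride, and image sizes are illustrative assumptions, not values from the script:

```python
def conv_out(size, kernel=3, stride=1, pad=1):
    """Spatial output size of a convolution layer."""
    return (size + 2 * pad - kernel) // stride + 1

def pool_out(size, kernel=2, stride=2):
    """Spatial output size of a max-pooling layer."""
    return (size - kernel) // stride + 1

def upsample_out(size, factor=2):
    """Spatial output size after nearest-neighbour upsampling."""
    return size * factor

# Discriminator path: each conv -> pool block halves the resolution.
size = 64
for _ in range(3):
    size = pool_out(conv_out(size))
print(size)  # 64 -> 32 -> 16 -> 8

# Generator path: repeated upsampling grows a small seed to image size.
seed = 8
for _ in range(3):
    seed = upsample_out(seed)
print(seed)  # 8 -> 16 -> 32 -> 64
```

The two paths mirror each other: the generator expands a compact representation into an image, while the discriminator compresses an image back down to a single real/fake score.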
This is the basic general architecture of a GAN, and it has many variants depending on the application.
Conditional GAN: the architecture differs in that, along with the noise, additional data is sent to
both networks. It acts as a guide for creating images and can be any condition.
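A minimal sketch of that conditioning step, assuming the condition is a one-hot class label (one common choice; the script does not fix a specific kind of condition). The extra information is simply appended to the noise vector before it enters the generator:

```python
import random

def one_hot(label, num_classes):
    """Encode a class index as a one-hot vector."""
    vec = [0.0] * num_classes
    vec[label] = 1.0
    return vec

def conditional_input(noise_dim, label, num_classes, rng=random):
    """Build the conditional generator input: noise ++ condition."""
    noise = [rng.gauss(0.0, 1.0) for _ in range(noise_dim)]
    return noise + one_hot(label, num_classes)

z = conditional_input(noise_dim=100, label=3, num_classes=10)
print(len(z))  # 110: 100 noise values + a 10-way one-hot condition
```

The discriminator receives the same condition alongside its image input, so both networks are "guided" by the label.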
CycleGAN: this is usually used for domain shifting. The change in its architecture is that it has
two generators and two discriminators, one pair for each domain.
DCGAN: instead of FC layers it uses convolutional layers (transposed convolutions, or convolutions
with strides) to improve data quality and training stability.
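The reason transposed convolutions can replace the separate upsampling step is that they learn the upsampling itself. The standard output-size formula below (with illustrative kernel/stride values) shows how a stride-2 transposed convolution doubles the spatial resolution at each layer:

```python
def tconv_out(size, kernel=4, stride=2, pad=1):
    """Spatial output size of a transposed convolution."""
    return (size - 1) * stride - 2 * pad + kernel

# A typical DCGAN-style generator grows a 4x4 seed to a 32x32 image
# with three stride-2 transposed convolutions:
size = 4
for _ in range(3):
    size = tconv_out(size)
print(size)  # 4 -> 8 -> 16 -> 32
```

Because the upsampling weights are learned rather than fixed, the generator can recover sharper detail than fixed interpolation, which is part of the quality and stability gain mentioned above.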
That is the general overview of what a GAN is; now let's look into the limitations of object detection models.
It has been observed that no matter how accurate a DNN model is at object detection, it tends to
underperform under a few conditions.
Sensitivity to variations in image quality: when a model is trained only on high-resolution (HR)
images and then encounters a low-resolution (LR) image, which is common in real-time applications,
it can give wrong results.
Difficulty with small objects: most of the major datasets available for object detection focus on
large objects, and in deep CNN architectures the deeper the feature map, the lower its resolution,
which is counterproductive when the object is so small that it may be lost along the way.
Limited scalability: traditional data augmentation techniques offer only limited options, and it has
been shown that common re-scaling functions can distort the image so that it ends up looking
completely different from a real-world image.
Class imbalance: there is a high chance that a dataset contains underrepresented classes of
images, and the DNN might not train well on them.
We observed that in all these cases the common problem is the availability of proper data.
That is why we use GANs to generate data that helps our model train well.
They can be used in domain adaptation for bridging domain gaps, e.g. with the help of CycleGAN.
For example, this technique can be used in the medical sector to convert MRI scans to CT scans,
which is a form of domain adaptation.
Data augmentation: helps us create more realistic images than traditional techniques.
Synthetic data generation: completely new data can be generated from existing data, which makes
training enhanced, cost-effective, and customizable.
Rare/small object detection: a GAN can create LR objects from HR ones, which is useful in aerial
applications where objects appear very small in real time.
It uses the generator model along with an image translation network to achieve this task.
1 Reshape Layer
1 Output Layer
and a bounding-box mask sampled from a uniform distribution and used to resize the object patch.
The resized object patch is then combined with the clean image using the image translation network.
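A minimal sketch of that sampling step, assuming the uniform distribution is over a resize factor applied to the patch dimensions (the (low, high) range and function names are illustrative assumptions):

```python
import random

def sample_patch_size(width, height, low=0.5, high=1.5, rng=random):
    """Sample a resize factor uniformly and apply it to an object
    patch, clamping so the patch never shrinks below one pixel."""
    scale = rng.uniform(low, high)
    return max(1, round(width * scale)), max(1, round(height * scale))

rng = random.Random(0)
print(sample_patch_size(40, 24, rng=rng))
```

Randomising the patch size this way means the translation network sees the same object at many scales, which is exactly what a small-object detector needs more of.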
Upsampling layer: image reconstruction and detail recovery, achieved using deconvolutional layers.
The network can integrate features from the object patch and the background image at various
stages of the upsampling process. By concatenating or adding feature maps from the object and the
background, the network can learn to blend them effectively.
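The two blending options just mentioned, concatenation and addition, differ in how the channel counts combine. A minimal sketch, with feature maps reduced to flat lists of channel values purely for illustration:

```python
def blend_concat(obj_channels, bg_channels):
    """Concatenate feature maps along the channel axis:
    the channel count is the sum of both inputs."""
    return obj_channels + bg_channels

def blend_add(obj_channels, bg_channels):
    """Add feature maps element-wise: channel counts must match."""
    assert len(obj_channels) == len(bg_channels)
    return [o + b for o, b in zip(obj_channels, bg_channels)]

obj = [2, 5, 1]   # 3 channels (one value per channel, for brevity)
bg = [4, 0, 3]
print(len(blend_concat(obj, bg)))  # 6 channels
print(blend_add(obj, bg))          # [6, 5, 4]
```

Concatenation keeps both sources intact and lets later layers learn how to mix them; addition is cheaper but forces the object and background features into the same channel layout.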
The main goal of DS-GAN is to create smaller versions of high-resolution (HR) objects.
When we use traditional methods to reduce the size, important features may be lost, so DS-GAN
downsamples while preserving those important features.
Generator :
It takes a high-resolution (HR) object as input, along with some random noise.
The encoder extracts the important features from the HR object and compresses the
information.
The decoder then takes that compressed information and generates a smaller version while
keeping key features.
The middle part of the generator (the bottleneck) represents the most compressed form of the
image features. It captures the high-level, abstract information that is necessary to regenerate the
image with proper detail.
Discriminator :
The discriminator receives both real LR objects from the LR dataset and generated objects from the
generator.
The discriminator reduces the image size gradually while increasing the depth (number of channels).
This structure helps it detect high-level features that differentiate real images from generated ones.
Moving on to applications:
GANs can be used in autonomous vehicles, where they can create data with small or occluded objects,
making detection more robust for safer navigation.
In healthcare, they can create synthetic medical scans, which are hard to obtain.
In a nutshell, a GAN is a powerful data augmentation tool for object detection models, enabling
them to be robust to the variations that are common in real-time scenarios.
The standard GAN has a mode collapse problem, where the generator learns to produce only a
limited variety of outputs, effectively "collapsing" to a few modes.
This can be mitigated by changing the loss function; research is ongoing on the Wasserstein loss,
which can potentially overcome this.
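A minimal sketch of the Wasserstein objective, which replaces cross-entropy with a difference of mean scores (pure-Python, names illustrative):

```python
def critic_loss(real_scores, fake_scores):
    """Wasserstein critic loss: the critic pushes real scores up and
    fake scores down, so it minimises mean(fake) - mean(real)."""
    mean = lambda xs: sum(xs) / len(xs)
    return mean(fake_scores) - mean(real_scores)

def wgan_generator_loss(fake_scores):
    """The generator pushes fake scores up: minimise -mean(fake)."""
    return -sum(fake_scores) / len(fake_scores)

print(critic_loss([2.0, 3.0], [0.0, 1.0]))  # -2.0: critic separates well
```

Note that the critic's scores are unbounded (there is no sigmoid); in practice the critic must also be kept Lipschitz-constrained, e.g. via weight clipping or a gradient penalty, for this loss to approximate the Wasserstein distance.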
Another way to achieve this is to use multiple GANs, ensuring that the weaknesses of one model do
not severely affect the overall output.
Outputs may also lack fine details and appear less realistic due to the model's inability to prioritize
significant features; more realistic images can be produced with the help of attention mechanisms,
which help the model focus on the relevant data.