Abstract
With the remarkable progress of deep CNNs, recent approaches have achieved some success in image generation from scene-level freehand sketches. However, most existing methods adopt a two-stage strategy, generating the foreground and the background of the image separately. In this paper, we propose a novel one-stage GAN-based architecture, named SSGAN, which generates images directly from sketches. Moreover, we design a novel Semantic Fusion Module (SFM) to better learn the intermediate features. Extensive experiments on SketchyCOCO demonstrate that our framework achieves competitive performance compared with state-of-the-art methods.
Figure 1 Overview of the proposed SSGAN for image synthesis from scene-level freehand sketches. Given a scene-level freehand sketch, we obtain its semantic mask with a pre-trained segmentation model. The proposed Semantic Fusion Module (SFM) turns the generative learning problem of sketch-to-image into sketch-to-mask-to-image learning. The bottom-right inset illustrates the SFM.
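As a rough, non-authoritative illustration of this pipeline, the sketch below mirrors Figure 1 in PyTorch. SegmentationNet and SSGANGenerator are hypothetical stand-ins with toy internals; the excerpt only states that a pre-trained segmentation model maps the sketch to a semantic mask and that the generator (with the SFM inside) maps the mask, together with sketch information, to an image.

# Minimal pipeline sketch (assumptions: single-channel sketch input, a toy
# number of semantic classes, and placeholder network bodies).
import torch
import torch.nn as nn

class SegmentationNet(nn.Module):
    # placeholder for the pre-trained segmentation model of Figure 1
    def __init__(self, num_classes=8):
        super().__init__()
        self.head = nn.Conv2d(1, num_classes, kernel_size=3, padding=1)
    def forward(self, sketch):
        return self.head(sketch).softmax(dim=1)  # soft semantic mask of the sketch

class SSGANGenerator(nn.Module):
    # placeholder generator; the real model uses SPADE-style blocks with the SFM
    def __init__(self, num_classes=8):
        super().__init__()
        self.body = nn.Conv2d(num_classes, 64, kernel_size=3, padding=1)
        self.to_rgb = nn.Conv2d(64, 3, kernel_size=3, padding=1)
    def forward(self, semantic_mask):
        return torch.tanh(self.to_rgb(self.body(semantic_mask)))

sketch = torch.rand(1, 1, 256, 256)      # scene-level freehand sketch
mask = SegmentationNet()(sketch)         # sketch -> semantic mask
image = SSGANGenerator()(mask)           # mask -> image, i.e. sketch-to-mask-to-image
print(image.shape)                       # torch.Size([1, 3, 256, 256])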
Mathematically, we define $M_i \in \{0,1\}^{H \times W \times C}$ as the intermediate mask from the $i$-th SPADE layer, $M_s \in \{0,1\}^{H \times W \times C}$ as the input semantic sketch, and $M_{fg} \in \{0,1\}^{H \times W \times C}$ as the foreground segmentation of the sketch, which is zero-padded to the same shape as $M_s$. As illustrated in Figure 1, we first use a convolutional network $\mathcal{F}_1$ to encode the label maps into feature maps:

$$f = \mathcal{F}_1(M_i) \oplus P_{avg}\big(\mathcal{F}_1(M_s, M_{fg})\big) \qquad (4)$$

where $P_{avg}$ denotes average pooling and $\oplus$ denotes element-wise addition. Average pooling is used because it better preserves background information. We then use another convolutional network $\mathcal{F}_2$ to obtain the final updated feature maps.
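The fusion step in Eq. (4) can be written compactly in PyTorch. This is only a minimal sketch under assumptions: kernel sizes, channel widths, the pooling window, and whether $\mathcal{F}_1$ shares weights across its two inputs are not specified in the text, so $\mathcal{F}_1$ is modeled here as two small convolutions (one per input) and $\mathcal{F}_2$ as a third.

import torch
import torch.nn as nn
import torch.nn.functional as F

class SemanticFusionModule(nn.Module):
    # Illustrative sketch of Eq. (4): f = F1(M_i) (+) AvgPool(F1(M_s, M_fg)), then F2(f).
    def __init__(self, num_classes, channels=64, pool=3):
        super().__init__()
        self.f1_mask = nn.Conv2d(num_classes, channels, 3, padding=1)        # F1 on M_i
        self.f1_sketch = nn.Conv2d(2 * num_classes, channels, 3, padding=1)  # F1 on (M_s, M_fg)
        self.f2 = nn.Conv2d(channels, channels, 3, padding=1)                # F2
        self.pool = pool

    def forward(self, m_i, m_s, m_fg):
        sketch_feat = self.f1_sketch(torch.cat([m_s, m_fg], dim=1))
        # stride-1 average pooling keeps the spatial size and, as argued above,
        # preserves background information better than max pooling
        sketch_feat = F.avg_pool2d(sketch_feat, self.pool, stride=1, padding=self.pool // 2)
        fused = self.f1_mask(m_i) + sketch_feat   # element-wise addition of Eq. (4)
        return self.f2(fused)                     # final updated feature maps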
The SketchyCOCO dataset is built from natural images in COCO Stuff [2]: using the segmentation masks of these natural images as reference, the scene sketches were generated by compositing instance freehand sketches from Sketchy [17], TU-Berlin [6], and QuickDraw [8]. SketchyCOCO contains 14,081 images, which we split into two sets, 80% for training and the remaining 20% for testing.

We use two metrics to evaluate the generated images. The first is FID [9], which has been widely used to evaluate the quality of generated images; the lower the FID value, the more realistic the image. The second is the structural similarity metric (SSIM) [23], which quantifies the structural similarity between a generated image and the ground-truth image; the higher the SSIM value, the closer they are.
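Below is a hedged example of how these two metrics are typically computed, using torchmetrics for FID and scikit-image for SSIM; the paper does not state which implementations were used, so treat this as a common recipe rather than the authors' exact evaluation script. Images are assumed to be uint8 tensors of shape (N, 3, H, W).

import numpy as np
import torch
from torchmetrics.image.fid import FrechetInceptionDistance
from skimage.metrics import structural_similarity

def evaluate(real_imgs: torch.Tensor, fake_imgs: torch.Tensor):
    # FID: lower is better (generated distribution closer to the real one)
    fid = FrechetInceptionDistance(feature=2048)
    fid.update(real_imgs, real=True)
    fid.update(fake_imgs, real=False)
    fid_score = fid.compute().item()

    # SSIM: higher is better; averaged over ground-truth / generated pairs
    ssim_scores = [
        structural_similarity(
            r.permute(1, 2, 0).numpy(), f.permute(1, 2, 0).numpy(),
            channel_axis=-1, data_range=255)
        for r, f in zip(real_imgs, fake_imgs)
    ]
    return fid_score, float(np.mean(ssim_scores))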
The GauGAN model trained using semantic maps covers all categories in the COCO Stuff dataset, whereas our model is trained on SketchyCOCO, which contains only a subset of the categories in the ground truth. Compared with the GauGAN model trained using semantic sketches, SSGAN matches GauGAN-semantic sketch in SSIM but yields a better FID, indicating that the SFM can effectively learn fine-grained masks. Compared with the scene-level sketch-based image generation baseline SketchyCOCO, our SSGAN achieves a better FID but a lower SSIM. This may be because SketchyCOCO generates the foreground separately and uses the generated foreground instances as constraints, which provides more explicit spatial guidance.

4.3 Qualitative results
Figure 2 shows the images generated by our method and the comparison methods. Note that we cannot reproduce the results of SketchyCOCO because it only provides the pre-trained foreground generation model, not the pre-trained background generation model. Figure 2 demonstrates that SSGAN is able to generate complex images with multiple objects from simple scene-level freehand sketches, and the generated images respect the constraints of the input sketches. Our approach produces much better results than LostGANs, which uses layouts as input, but slightly worse images than the GauGAN model trained using semantic maps. This is consistent with our analysis of the quantitative results.
In Figure 3 we demonstrate the effectiveness of the proposed SFM: Figure 3(c) shows the semantic masks learned by the SFM. It is clear that our approach represents the foreground objects accurately and infers the background from limited information.
Figure 2 Scene-level comparison. (a) Input layout, (b) Generated images by LostGANs, (c) Input semantic map, (d) Generated images by GauGAN, (e) Input scene-level freehand sketch, (f) Generated images by our SSGAN.

5 Conclusion
In this paper, we propose SSGAN for synthesizing images from scene-level freehand sketches, which uses a joint learning paradigm to transform sketch-to-image generation into sketch-to-mask-to-image generation. Specifically, we present a new module, the SFM, which fuses the segmentation masks of each phase with the semantic sketches to realize the sketch-to-mask-to-image pipeline. Comprehensive experiments on the SketchyCOCO dataset demonstrate the effectiveness of our proposed model.
Table 1 The results of quantitative experiments

Model                     FID↓     SSIM↑
LostGANs-layout           134.6    0.280
GauGAN-semantic map        80.3    0.306
GauGAN-semantic sketch    215.1    0.285
SketchyCOCO-scene         164.8    0.288
Ours                      123.8    0.285

References
[1] Brock, Andrew, Jeff Donahue, and Karen Simonyan. "Large Scale GAN Training for High Fidelity Natural Image Synthesis." ArXiv:1809.11096 [Cs], 2018.
[2] Caesar, Holger, Jasper Uijlings, and Vittorio Ferrari. "COCO-Stuff: Thing and Stuff Classes in Context." ArXiv:1612.03716 [Cs], March 28, 2018.
[3] Chen, Tao, Ming-Ming Cheng, Ping Tan, Ariel Shamir, and Shi-Min Hu. "Sketch2Photo: Internet Image Montage." ACM Transactions on Graphics 28, no. 5 (December 2009): 1–10.
[4] Chen, Wengling, and James Hays. "SketchyGAN: Towards Diverse and Realistic Sketch to Image Synthesis." ArXiv:1801.02753 [Cs], April 12, 2018.
[5] Eitz, M., R. Richter, K. Hildebrand, T. Boubekeur, and M. Alexa. "Photosketcher: Interactive Sketch-Based Image Synthesis." IEEE Computer Graphics and Applications 31, no. 6 (November 2011): 56–66.
[6] Eitz, Mathias, James Hays, and Marc Alexa. "How Do Humans Sketch Objects?" ACM Transactions on Graphics 31, no. 4 (August 5, 2012): 1–10.
[7] Gao, Chengying, Qi Liu, Qi Xu, Limin Wang, Jianzhuang Liu, and Changqing Zou. "SketchyCOCO: Image Generation from Freehand Scene Sketches." ArXiv:2003.02683 [Cs], April 7, 2020.
[8] Ha, D., and D. Eck. "A Neural Representation of Sketch Drawings," 2017.
[9] Heusel, M., H. Ramsauer, T. Unterthiner, B. Nessler, and S. Hochreiter. "GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium," 2017.
[10] Hong, Seunghoon, Dingdong Yang, Jongwook Choi, and Honglak Lee. "Inferring Semantic Layout for Hierarchical Text-to-Image Synthesis." ArXiv:1801.05091 [Cs], July 25, 2018.
[11] Isola, Phillip, Jun-Yan Zhu, Tinghui Zhou, and Alexei A. Efros. "Image-to-Image Translation with Conditional Adversarial Networks." In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 1125–34, 2017.
[12] Karras, Tero, Miika Aittala, Janne Hellsten, Samuli Laine, Jaakko Lehtinen, and Timo Aila. "Training Generative Adversarial Networks with Limited Data." ArXiv:2006.06676 [Cs, Stat], October 7, 2020.
[13] Karras, Tero, Samuli Laine, and Timo Aila. "A Style-Based Generator Architecture for Generative Adversarial Networks." ArXiv:1812.04948 [Cs, Stat], March 29, 2019.
[14] Lu, Yongyi, Shangzhe Wu, Yu-Wing Tai, and Chi-Keung Tang. "Image Generation from Sketch Constraint Using Contextual GAN." ArXiv:1711.08972 [Cs], July 25, 2018.
[15] Mirza, Mehdi, and Simon Osindero. "Conditional Generative Adversarial Nets." ArXiv:1411.1784 [Cs, Stat], November 6, 2014.
[16] Park, Taesung, Ming-Yu Liu, Ting-Chun Wang, and Jun-Yan Zhu. "Semantic Image Synthesis with Spatially-Adaptive Normalization." ArXiv:1903.07291 [Cs], November 5, 2019.
[17] Sangkloy, Patsorn, Nathan Burnell, Cusuh Ham, and James Hays. "The Sketchy Database: Learning to Retrieve Badly Drawn Bunnies." ACM Transactions on Graphics 35, no. 4 (July 11, 2016): 1–12.
[18] Sun, Wei, and Tianfu Wu. "Image Synthesis From Reconfigurable Layout and Style," n.d., 10.
[19] Sun, Wei, and Tianfu Wu. "Learning Layout and Style Reconfigurable GANs for Controllable Image Synthesis." ArXiv:2003.11571 [Cs], March 26, 2021.
[20] Sushko, Vadim, Edgar Schönfeld, Dan Zhang, Juergen Gall, Bernt Schiele, and Anna Khoreva. "You Only Need Adversarial Supervision for Semantic Image Synthesis." ArXiv:2012.04781 [Cs, Eess], March 19, 2021.
[21] Tang, Hao, Song Bai, and Nicu Sebe. "Dual Attention GANs for Semantic Image Synthesis." ArXiv:2008.13024 [Cs], August 29, 2020.
[22] Wang, Sheng-Yu, David Bau, and Jun-Yan Zhu. "Sketch Your Own GAN." ArXiv:2108.02774 [Cs], September 20, 2021.
[23] Wang, Z. "Image Quality Assessment: From Error Visibility to Structural Similarity." IEEE Transactions on Image Processing, 2004.
[24] Zhao, Bo, Lili Meng, Weidong Yin, and Leonid Sigal. "Image Generation From Layout." In 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 8576–85. Long Beach, CA, USA: IEEE, 2019.
[25] Zou, Changqing, Haoran Mo, Chengying Gao, Ruofei Du, and Hongbo Fu. "Language-Based Colorization of Scene Sketches." ACM Transactions on Graphics 38, no. 6 (November 8, 2019): 1–16.
[26] Wang, Ting-Chun, Ming-Yu Liu, Jun-Yan Zhu, Andrew Tao, Jan Kautz, and Bryan Catanzaro. "High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs." ArXiv:1711.11585 [Cs], August 20, 2018.