DDE-GAN: Integrating a Data-Driven Design Evaluator Into Generative Adversarial Networks for Desirable and Diverse Concept Generation
Tucker Marion
D’Amore-McKim School of Business,
Department of Industrial Engineering,
Northeastern University,
Boston, MA 02115
e-mail: [email protected]

Mohsen Moghaddam1
Department of Mechanical and Industrial Engineering,
Khoury College of Computer Sciences,
Northeastern University,
Boston, MA 02115
e-mail: [email protected]

Generative adversarial networks (GANs) have shown remarkable success in various generative design tasks, from topology optimization to material design, and shape parametrization. However, most generative design approaches based on GANs lack evaluation mechanisms to ensure the generation of diverse samples. In addition, no GAN-based generative design model incorporates user sentiments in the loss function to generate samples with high desirability from the aggregate perspectives of users. Motivated by these knowledge gaps, this paper builds and validates a novel GAN-based generative design model with an offline design evaluation function to generate samples that are not only realistic but also diverse and desirable. A multimodal data-driven design evaluation (DDE) model is developed to guide the generative process by automatically predicting user sentiments for the generated samples based on large-scale user reviews of previous designs. This paper incorporates DDE into the StyleGAN structure, a state-of-the-art GAN model, to enable data-driven generative processes that are innovative and user-centered. The results of experiments conducted on a large dataset of footwear products demonstrate the effectiveness of the proposed DDE-GAN in generating high-quality, diverse, and desirable concepts.
[DOI: 10.1115/1.4056500]
1 Introduction

The generation of innovative, diverse, and user-centered design concepts is an essential phase in the early stages of the product development process and is known to have a significant impact on the quality and success of the design [1–4]. Creating a wide range of solutions that differ significantly from each other can benefit the ideation process of designers and therefore increase the possibility of creating high-quality concepts [5–8]. Various approaches in the literature focus on automatically developing diverse and innovative concepts. The argument is that a large set of concepts promotes creativity and logically allows the selection of better ideas from the set [5,9]. However, it is difficult for designers to manually generate a large set of samples with great diversity and novelty because designers naturally tend to fixate on specific design specifications [10–12]. Moreover, most existing design problem-solving practices rely heavily on the designers’ experiences and preferences. They lack advanced computing methods to help navigate larger solution spaces by generating more diverse, unexpected, and viable solutions [5,11,13,14].

Developing methods to assess and improve creativity has historically been challenging due to its intangible and subjective nature. Significant research in engineering design is currently focused on studying methods and tools to improve the effectiveness and efficiency of creative tasks, such as concept development [4,15–17]. Creativity is an essential and central part of the ideation process [18]. In human-led design practices, ideation is often an iterative and exploratory process [19], where designers share, modify, and use various stimuli to generate new ideas and concepts [20]. Humans approach this process through various cognitive processes, which research has classified into types and shown to affect the effectiveness of ideation [21]. Over the past 25 years, research on computers and artificial intelligence (AI) has increasingly focused on how these systems can be used to enhance the creative ideation process [22,23]. With its ability to synthesize data and make predictions at great speed, the potential for AI to be a generator of new and creative design ideas and concepts has garnered substantial attention from both academia and industry [16,19].

The methods and frameworks used to apply AI and machine learning in design and engineering are numerous. Deep learning and generative modeling have recently attracted researchers’ attention for their potential impact. Recent advances in AI research have made remarkable progress in the machine’s ability to generate design ideas [24]. AI can be an inspiration tool in the creative process and a generative tool to assist designers in developing design concepts. AI-powered generative design tools can potentially augment designers’ ability to create concepts faster and more efficiently. The power of AI lies in the speed with which it can analyze large amounts of data and suggest design adjustments. The designer can then choose and approve adjustments based on these data.

1 Corresponding author.
Contributed by the Design Theory and Methodology Committee of ASME for publication in the JOURNAL OF MECHANICAL DESIGN. Manuscript received July 12, 2022; final manuscript received December 5, 2022; published online January 10, 2023. Assoc. Editor: Christopher Mccomb.
Journal of Mechanical Design Copyright © 2023 by ASME APRIL 2023, Vol. 145 / 041407-1
An emerging research area on using AI to generate novel and realistic design concepts is the use of generative adversarial networks, or GANs [25]. A typical GAN architecture comprises two neural network architectures: a generator and a discriminator. The generator neural network is trained to generate samples (e.g., images) almost identical to real samples. On the other hand, the discriminator neural network learns to differentiate between them. GANs have made significant progress in synthesizing and generating “realistic” images as their central objective. Several successful GAN architectures have recently been proposed, mainly for synthesizing and generating facial images. Examples include CycleGAN [26], StyleGAN [27], PixelRNN [28], Text2Image [29], and DiscoGAN [30]. These powerful image synthesis models can

design concept with descriptive phrases that can automatically convey a novel design concept remains a challenge. This work merely deploys the pretrained ResNet network of the DDE model [23] to examine and evaluate the visual samples generated. The DDE model, which excludes inputs from the product description, was incorporated into the architecture of the DDE-GAN model presented in Sec. 3. Future research should focus on building a multimodal DDE-GAN model that couples images and descriptions for automated generation and evaluation of design concepts.

The remainder of this paper is organized as follows. Section 2 provides a detailed overview of related work and topics in traditional and GAN-based generative design. Section 3 provides the details of the proposed DDE-GAN model. Section 4 presents the
Fig. 1 Architecture of the integrated automated design evaluator-generative adversarial network (DDE-GAN) model.
WGAN-GP: Wasserstein GAN + Gradient Penalty [70]; DDE: data-driven design evaluation [23].
product. ResNet-50 can represent complex functions and learn features at many different levels of abstraction to understand the connections between orthographic representations of design concepts (inputs) and user sentiment intensity values (outputs). The bidirectional encoder representations from transformers (BERT) model, a different model in the DDE system, extracts and analyzes product descriptions written in natural language [76]. The BERT model can determine the connection between a product’s technical description and the user’s emotional sentiment level. The DDE system then integrates the various meaningful data collected from the Internet platform and models the relationships between images, text, and statistics. The DDE model synthesizes different modes of data using a novel fusion mechanism to develop a more accurate context about the product and the associated user feedback [23]. The DDE model was trained on a large-scale dataset that was scraped from a major online footwear store. In the dataset, each product has four types of information: six orthographic images, one numerical rating score, a list of textual product descriptions, and real textual customer reviews from an e-commerce platform. The images and feature descriptions are the inputs to the DDE model, and the numerical rating score and the sentiment intensity values from customer reviews are the outputs. The dataset comprises 8706 images and 113,391 reviews for 1452 identified shoes. Numerical experiments on this large dataset indicated promising performance by the DDE model with 0.001 MSE loss and over 99.1% accuracy.

The DDE model can accurately predict user sentiments for a new design concept based only on its orthographic images and descriptions and provide numerical design performance values associated with each attribute of the generated concept. This paper builds a novel loss function based on the DDE model, called the DDE loss, into the GAN’s discriminator to enable an accurate and

$$+\ \lambda_{DE}\, \mathbb{E}_{\tilde{x} \sim P_g}\left[ L_{DE}(\tilde{x}) \right] \tag{4}$$

$$L_{DE}(x) = \frac{1}{N} \sum_{i=1}^{N} \left( f_i(x) - \hat{y}_i \right)^2 \tag{5}$$

where $\lambda_{DE}$ is a constant that defines the loss weight, $L_{DE}$ is the DDE loss function that evaluates the generated samples in terms of performance, $f_i(x)$ is the prediction of the DDE model for a generated design $x$, and $\hat{y}_i$ is the desired design evaluation score, which is set to 1 for each attribute so that the models are trained to generate samples with the highest possible expectation. The StyleGAN loss terms maintain the quality of the generated images, and the DDE loss drives the generated samples toward high user sentiment scores. Combining the two elements allows the proposed DDE-GAN model to generate images that are simultaneously high in quality and high in predicted user sentiment. This new set of loss functions provides a more accurate and evaluation-guided generator and discriminator in DDE-GAN compared to previous work and can be easily tuned. Information on the constants and other implementation details is provided in Sec. 4.

4 Experiments and Results

In this section, the dataset and implementation details of the proposed DDE-GAN model are first described, followed by the introduction of metrics established to investigate the effectiveness of the developed DDE-GAN model in generating realistic samples with high desirability and diversity. The results of the experimental analyses are presented next, comparing the outcomes generated by the
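As a minimal sketch of how the DDE loss of Eq. (5) and the weighted term of Eq. (4) could be computed, the following Python snippet assumes that the DDE model has already produced a list of per-attribute scores for each generated sample. The function names `dde_loss` and `total_loss`, the argument `base_gan_loss` (standing in for the StyleGAN loss terms that precede the DDE term), and the numerical values are hypothetical and are not taken from the original implementation.

```python
def dde_loss(predictions, target=1.0):
    """Eq. (5): mean squared gap between the predicted attribute scores
    f_i(x) and the desired score y_i (set to 1 for every attribute)."""
    if not predictions:
        raise ValueError("need at least one attribute prediction")
    return sum((p - target) ** 2 for p in predictions) / len(predictions)


def total_loss(base_gan_loss, batch_predictions, lambda_de):
    """Adds the weighted DDE term of Eq. (4) to a given base loss.

    base_gan_loss     -- placeholder scalar for the StyleGAN loss terms
    batch_predictions -- one list of attribute scores per generated sample
    lambda_de         -- the loss weight lambda_DE
    """
    # Empirical expectation of L_DE over the generated batch.
    expected_dde = sum(dde_loss(p) for p in batch_predictions) / len(batch_predictions)
    return base_gan_loss + lambda_de * expected_dde


# Two generated samples with two attributes each (illustrative values only).
batch = [[0.5, 1.0], [1.0, 1.0]]
loss = total_loss(base_gan_loss=2.0, batch_predictions=batch, lambda_de=0.5)
```

Because the target is fixed at 1, the DDE term vanishes only when every predicted attribute score reaches the maximum, which is why minimizing it pushes the generator toward samples with high predicted sentiment.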
lower FID score than the DDE-GAN model. A lower FID score indicates that the model is more stable and correlates with higher-quality images. However, the FID score of the DDE-GAN model and its standard deviation are close to those of StyleGAN, with only a small difference (0.23), and it is empirically concluded that an FID score below 10 is sufficient to demonstrate the effectiveness of a generative model [82].2 In addition, the difference between DDE-GAN (mean = 6.45) and StyleGAN (mean = 6.22) was verified with a t-test (P = 0.0026). Therefore, the DDE-GAN model performs well in achieving high-quality results. FID can also be explained as a similarity metric, because it calculates the distance between the feature vectors calculated for the real and generated images. Lower scores indicate that the two groups of images are more similar, or have more similar statistics, with a perfect score of 0.0 indicating that the two groups of images are identical. Therefore, from the perspective of similarity, the lower FID of StyleGAN indicates that its generated samples are more similar to real images than those of DDE-GAN, whereas the higher FID of DDE-GAN indicates that its generated samples are more distinct from existing images, which is further validated next.

Diversity test: The primary rationale behind the proposed DDE-GAN model is to promote the diversity of images generated by GAN. The similarity between the produced samples and the original input is calculated using the MMD metric to estimate the

2 https://nealjean.com/ml/frechet-inception-distance/
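To illustrate why FID behaves as a similarity metric, the sketch below computes a Fréchet distance between two sets of feature vectors under a simplifying assumption that is not part of the original evaluation: each feature dimension is treated as independent (diagonal covariance), so the covariance term reduces to the squared difference of per-dimension standard deviations. The standard FID uses the full covariance matrices of Inception features; the function names and toy feature values here are hypothetical.

```python
def mean_and_std(features):
    """Per-dimension mean and sample standard deviation of feature vectors."""
    n = len(features)
    dims = len(features[0])
    means = [sum(row[j] for row in features) / n for j in range(dims)]
    stds = [
        (sum((row[j] - means[j]) ** 2 for row in features) / (n - 1)) ** 0.5
        for j in range(dims)
    ]
    return means, stds


def frechet_distance_diagonal(real, generated):
    """Squared Frechet distance between two Gaussians fitted to the features,
    ASSUMING diagonal covariances (a simplification of the full FID)."""
    mu_r, sd_r = mean_and_std(real)
    mu_g, sd_g = mean_and_std(generated)
    mean_term = sum((a - b) ** 2 for a, b in zip(mu_r, mu_g))
    spread_term = sum((a - b) ** 2 for a, b in zip(sd_r, sd_g))
    return mean_term + spread_term


# Toy feature vectors (illustrative values only).
real_feats = [[0.0, 1.0], [2.0, 3.0], [4.0, 5.0]]
shifted_feats = [[v + 1.0 for v in row] for row in real_feats]

same = frechet_distance_diagonal(real_feats, real_feats)
shifted = frechet_distance_diagonal(real_feats, shifted_feats)
```

Identical feature statistics give a distance of zero, and the distance grows as the means or spreads of the two groups move apart, which matches the interpretation of lower scores as higher similarity.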
Fig. 6 (a) The MMD linear kernel and (b) MMD polynomial kernel results for StyleGAN and
DDE-GAN
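The two kernels named in the caption can be illustrated with a small sketch of the squared maximum mean discrepancy (MMD) between two sets of feature vectors. This is a biased estimator written in plain Python; the polynomial kernel hyperparameters (`degree=2`, `gamma=1.0`, `coef0=1.0`), the function names, and the toy vectors are assumptions made for illustration and are not the settings used in the experiments.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def linear_kernel(a, b):
    return dot(a, b)


def polynomial_kernel(a, b, degree=2, gamma=1.0, coef0=1.0):
    # Hyperparameter defaults are illustrative assumptions.
    return (gamma * dot(a, b) + coef0) ** degree


def mean_kernel(xs, ys, kernel):
    """Average kernel value over all pairs (x, y)."""
    return sum(kernel(x, y) for x in xs for y in ys) / (len(xs) * len(ys))


def mmd_squared(xs, ys, kernel):
    """Biased estimate of MMD^2: E[k(x,x')] + E[k(y,y')] - 2 E[k(x,y)]."""
    return (
        mean_kernel(xs, xs, kernel)
        + mean_kernel(ys, ys, kernel)
        - 2.0 * mean_kernel(xs, ys, kernel)
    )


# Toy feature vectors standing in for real and generated samples.
real = [[0.0, 0.0], [2.0, 0.0]]
generated = [[1.0, 1.0], [1.0, 3.0]]

mmd_linear = mmd_squared(real, generated, linear_kernel)
mmd_poly = mmd_squared(real, generated, polynomial_kernel)
```

With the linear kernel, this estimator equals the squared distance between the two sample means, so it only detects a shift in the mean; the polynomial kernel also responds to differences in higher-order statistics. In both cases a value of zero is obtained when the two sets coincide, and larger values indicate that the generated samples lie further from the reference set.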
Table 2 Results of the DDE test [23] regarding “predicted sentiment values” on 480 randomly selected samples generated by
StyleGAN and DDE-GAN
Model