Grid Search, Random Search, Genetic Algorithm: A Big Comparison for NAS
Abstract
In this paper, we compare three popular algorithms for hyperparameter optimization (Grid Search, Random Search, and Genetic Algorithm) and apply them to neural architecture search (NAS). We use these algorithms to search for the architecture of a convolutional neural network. Experimental results on the CIFAR-10 dataset demonstrate the performance differences between the compared algorithms. The comparison is based on the execution time of each algorithm and the accuracy of the models it proposes.
Keywords: Neural architecture search, grid search, random search, genetic algorithm,
hyperparameter optimization
1. Introduction
Over the last few years, convolutional neural networks (CNNs) and their variants have achieved strong results on a wide variety of machine learning problems and applications (Shin et al. (2016); Krizhevsky et al. (2012); Wu (2019); Xu et al. (2014); Goodfellow et al. (2014)). However, each of the currently known architectures was designed by human experts in machine learning (Simonyan and Zisserman (2014); Szegedy et al. (2014); He et al. (2015a)). Today, the number of tasks that can be solved with neural networks is growing rapidly, and designing a neural network architecture by hand has become a long, slow, and expensive process. Designing a good neural network remains a major challenge.
A typical CNN architecture consists of several convolutional, pooling, and fully-connected layers. While designing a network architecture, an expert has to make many design choices: the number of layers of each type (convolution, pooling, dense, etc.), the ordering of the layers, and the hyperparameters of each layer, such as the receptive field size, stride, and padding of a convolutional layer.
Many research efforts have been made in the field of so-called automated machine learning (Auto-ML) (Zhong et al. (2017); Cai et al. (2018)). Zoph and Le (2016) propose a reinforcement learning-based method for neural architecture search. Hebbal et al. (2019) experiment with Bayesian optimization using deep Gaussian processes. Pham et al. (2018) propose another approach based on parameter sharing. Jin et al. (2018) and Weill et al. (2019) presented two of the most inspiring frameworks for NAS and Auto-ML, called Auto-Keras and AdaNet respectively.

Figure 1: An illustration of different architecture spaces. Each node in the graphs corresponds to a layer in a neural network: a convolutional or pooling layer, etc. An edge from layer Li to layer Lj denotes that Lj receives the output of Li as input. Left: an element of a chain-structured space. Right: an element of a more complex search space with additional layer types, multiple branches, and skip connections (Elsken et al. (2018)).
Despite these advances in automated machine learning, in this paper we attempt to use classic hyperparameter optimization algorithms to find an optimal neural network architecture. We compare the execution time of the above algorithms to determine which one proposes the model with the highest score in the least time.
2. Search space
The search space defines which neural network architectures can be discovered by the algorithm in use. There are many methods and strategies for neural architecture search of CNNs. In most cases, experts build an architecture from scratch by alternating convolutional and fully-connected layers.
The simplest search space is the space of chain-structured neural networks, as illustrated in Figure 1 (Elsken et al. (2018)).
Rather than designing the entire convolutional network, one can design smaller modules and then connect them together to form a network (Pham et al. (2018)). Using this approach, a neural network architecture can be written as a sequence of layers, and the search space is parametrized by the following (a minimal encoding sketch is given after the list):

• n, the number of layers;
• the hyperparameters of every layer (e.g., kernel size and number of filters for a convolutional layer).
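As a concrete illustration, this parametrized search space can be encoded as a dictionary of candidate values per hyperparameter. The names and ranges below are illustrative assumptions for a sketch, not the exact configuration used in our experiments.

# A minimal sketch of a chain-structured search space. Each architecture is
# described by the number of convolutional/dense cells and per-layer
# hyperparameters; the concrete ranges are assumptions for illustration.
search_space = {
    "n_conv_cells": [1, 2],          # convolutional cells to stack
    "n_dense_cells": [1, 2, 3, 4],   # dense cells to stack
    "filters": [32, 64],             # filters per convolutional layer
    "kernel_size": [3],              # receptive field of each convolution
    "dense_units": [256, 512],       # units in each dense cell
    "dropout_rate": [0.2, 0.5],      # dropout applied inside a cell
}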
3. Search strategy
Grid Search. The traditional method of hyperparameter optimization is grid search, which simply performs an exhaustive search over a given subset of the hyperparameter space of the training algorithm (Figure 4). Because the hyperparameter space may include dimensions with real-valued or unbounded ranges, boundaries usually have to be specified before grid search can be applied. Grid search suffers from the curse of dimensionality, but it is often easy to parallelize, since the hyperparameter settings it evaluates are independent of each other.
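A minimal sketch of grid search over the search space above, assuming a hypothetical helper build_and_evaluate(params) that trains a model for a fixed number of epochs and returns its validation accuracy (this helper stands in for the training procedure and is not part of the original code):

import itertools

def grid_search(search_space, build_and_evaluate):
    # Exhaustively evaluate every combination in the search space.
    keys = list(search_space)
    best_params, best_score = None, float("-inf")
    # itertools.product enumerates the full Cartesian product of all candidate
    # values, which is why grid search scales poorly with dimensionality.
    for values in itertools.product(*(search_space[k] for k in keys)):
        params = dict(zip(keys, values))
        score = build_and_evaluate(params)
        if score > best_score:
            best_params, best_score = params, score
    return best_params, best_score

Because each combination is evaluated independently, the loop body can be distributed across workers with no coordination beyond collecting the scores.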
Random Search. Random search replaces the exhaustive enumeration of all combinations with a random selection of them. It is easily applied to discrete settings, and the method generalizes to continuous and mixed spaces. Random search can outperform grid search, especially when only a small number of hyperparameters actually affect the performance of the machine learning algorithm.
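Under the same assumptions (the hypothetical build_and_evaluate helper and the dictionary-style search space), random search only changes how configurations are chosen; the trial budget n_trials below is an assumed value, not one from our experiments:

import random

def random_search(search_space, build_and_evaluate, n_trials=10, seed=42):
    # Evaluate a fixed number of randomly sampled configurations.
    rng = random.Random(seed)
    best_params, best_score = None, float("-inf")
    for _ in range(n_trials):
        # Sample one value per hyperparameter instead of enumerating all of them.
        params = {k: rng.choice(v) for k, v in search_space.items()}
        score = build_and_evaluate(params)
        if score > best_score:
            best_params, best_score = params, score
    return best_params, best_score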
Figure 2: An illustration of a basic CNN architecture (input → Convolution → BiasAdd → ReLU → BatchNorm → MaxPooling → Dropout → Convolution Cell → Flatten → Dense Cell → Dropout → Dense). The search space defines the number of convolutional and dense blocks.
Figure 3: An illustration of a convolutional block (Convolution → BiasAdd → ReLU → BatchNorm → MaxPooling → Dropout, with boolean flags such as biases, pooling, and batch_norm). This block accepts the following parameters as required: input, filters, kernel size. If additional parameters are specified, the output is passed through the BatchNormalization and MaxPooling layers.
Figure 4: An illustration of a grid search space. We manually set ranges of possible parameter values and the algorithm performs an exhaustive search over them. In other words, grid search is pure brute force and can take a very long time to execute.
Additionally, we shift the training images horizontally and vertically, and randomly flip them horizontally.
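A minimal sketch of this augmentation using Keras' ImageDataGenerator; the shift range of 10% is an assumed value, since only the kinds of transformations are stated above:

from tensorflow.keras.preprocessing.image import ImageDataGenerator

# Horizontal/vertical shifts plus random horizontal flips, as described above.
# The 10% shift range is an assumption for illustration.
datagen = ImageDataGenerator(
    width_shift_range=0.1,
    height_shift_range=0.1,
    horizontal_flip=True,
)
# datagen.flow(x_train, y_train, batch_size=128) would then feed model.fit(...).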
Search spaces. We apply each of the algorithms to a single search space: the macro search space over the basic convolutional model, extended by adding convolutional cells to it (Section 2).
Training details. All convolutional kernels are initialized with He uniform initialization (He et al. (2015b)). We also apply L2 weight decay with a rate of 10^-4 and set the kernel size of the convolutions to 3. The first convolutional layer uses 32 filters; each convolutional cell uses 64 filters. All convolutions are followed by BatchNormalization and MaxPooling. The parameters of each network are trained with the Adamax optimizer (Kingma and Ba (2014)), where the learning rate is set to 2e-3 and the other parameters are left at their defaults. We set the dropout rate to 0.2 in each convolutional cell. The dropout rate in the basic model is 0.5 and the number of units in the dense cells is 512. Each architecture search is run for 50 epochs on an Nvidia Tesla K80 GPU. The basic model achieves 76% accuracy after 50 epochs.
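A hedged Keras sketch of this per-model training setup (He uniform initialization, L2 weight decay of 10^-4, 3×3 kernels, 32 filters in the first convolution, Adamax with learning rate 2e-3). The exact layer layout of the basic model is simplified here, and the helper name build_basic_model is ours, not taken from the original code:

from tensorflow import keras
from tensorflow.keras import layers, regularizers

def build_basic_model(num_classes=10):
    # Simplified basic model: one convolutional block plus one dense cell.
    model = keras.Sequential([
        keras.Input(shape=(32, 32, 3)),
        layers.Conv2D(
            32, kernel_size=3, padding="same",
            kernel_initializer="he_uniform",
            kernel_regularizer=regularizers.l2(1e-4),
        ),
        layers.ReLU(),
        layers.BatchNormalization(),
        layers.MaxPooling2D(),
        layers.Dropout(0.2),
        layers.Flatten(),
        layers.Dense(512, activation="relu"),
        layers.Dropout(0.5),
        layers.Dense(num_classes, activation="softmax"),
    ])
    model.compile(
        optimizer=keras.optimizers.Adamax(learning_rate=2e-3),
        # Integer class labels are assumed, as returned by keras.datasets.cifar10.
        loss="sparse_categorical_crossentropy",
        metrics=["accuracy"],
    )
    return model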
Results. The whole training procedure for 2 × 4 = 8 models (length of the conv cells list × length of the dense cells list) took ≈4.3 hours.
[Results table, partially recovered; one surviving row: 2 | 1 | 0.58M | 82 | 0.57]
As can be seen, the best accuracy is about 83%, achieved by a model with 2 convolutional and 2 dense cells.
The best model achieves about 86% accuracy. This algorithm is faster than Grid Search, but with a larger number of runs it would take much longer.
5. Final thoughts
We tested the neural architecture search approach with three popular algorithms: Grid Search, Random Search, and Genetic Algorithm. Almost all of the tested algorithms take a long time to search for the best model. Grid Search is too slow, and Random Search is limited by the distributions chosen over the search space. The most inspiring is the evolutionary algorithm, where we encode the architecture parameters as a genome, although evolution may also take a long time before it finds the best model.
1. We also had to limit the number of MaxPooling layers in the convolutional cells to prevent dimension errors when the input dimension becomes too small. We skip MaxPooling if the dimension of the input falls below (?, 2, 2, ?).
2. We place MaxPooling layers randomly in convolutional blocks to prevent the output from becoming too low-dimensional.
Regarding hyperparameter optimization, it is difficult to say which of the above algorithms will show the best results. If the model and the search space are not too large, then grid search or random search may be a good choice. But if the model has many layers and the search space is large, then the evolutionary algorithm may be the best choice.
5.1 Summary
Grid search is a brute-force algorithm: it performs an exhaustive search over a given subset of the hyperparameter space, that is, it trains and tests every possible combination of the network parameters we provide. If the search space is too large, do not choose this algorithm. Random search may be a little faster, but it does not guarantee the best results.
Finally, if our search space is large, then the best choice is the evolutionary algorithm. It also takes a long time to run, but we can control this through the number of generations and the size of the population. Each individual represents a solution in the search space for a given problem and is coded as a finite-length vector of components. These variable components are analogous to genes, so a chromosome (individual) is composed of several genes (variable components).
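As a hedged illustration of this encoding in our setting, a genome can simply be the vector of architecture hyperparameters. The crossover and mutation operators below are generic textbook variants given as a sketch, not necessarily the exact operators used in our experiments:

import random

rng = random.Random(0)

# Each gene is one hyperparameter; a chromosome (individual) is the full vector.
GENE_POOL = {
    "n_conv_cells": [1, 2],
    "n_dense_cells": [1, 2, 3, 4],
    "dense_units": [256, 512],
    "dropout_rate": [0.2, 0.5],
}

def random_individual():
    return {k: rng.choice(v) for k, v in GENE_POOL.items()}

def crossover(parent_a, parent_b):
    # Uniform crossover: each gene is inherited from one of the two parents.
    return {k: rng.choice([parent_a[k], parent_b[k]]) for k in GENE_POOL}

def mutate(individual, rate=0.1):
    # With a small probability, replace a gene with a fresh random value.
    return {
        k: (rng.choice(GENE_POOL[k]) if rng.random() < rate else v)
        for k, v in individual.items()
    }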
When there are many parameters to optimize, the genetic algorithm tends to perform faster than the others. Choose it, and evolution will do the rest for you.
References
Han Cai, Jiacheng Yang, Weinan Zhang, Song Han, and Yong Yu. Path-level network
transformation for efficient architecture search. CoRR, abs/1806.02639, 2018. URL http://arxiv.org/abs/1806.02639.
Thomas Elsken, Jan Hendrik Metzen, and Frank Hutter. Neural architecture search: A
survey, 2018.
Ian J. Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil
Ozair, Aaron Courville, and Yoshua Bengio. Generative adversarial networks, 2014.
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Deep residual learning for image
recognition, 2015a.
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Delving deep into rectifiers:
Surpassing human-level performance on imagenet classification, 2015b.
Ali Hebbal, Loic Brevault, Mathieu Balesdent, El-Ghazali Talbi, and Nouredine Melab.
Bayesian optimization using deep gaussian processes, 2019.
Haifeng Jin, Qingquan Song, and Xia Hu. Efficient neural architecture search with network
morphism. CoRR, abs/1806.10282, 2018. URL http://arxiv.org/abs/1806.10282.
Diederik P. Kingma and Jimmy Ba. Adam: A method for stochastic optimization, 2014.
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. Imagenet classification with deep
convolutional neural networks. Commun. ACM, 60:84–90, 2012.
Hieu Pham, Melody Y. Guan, Barret Zoph, Quoc V. Le, and Jeff Dean. Efficient neural
architecture search via parameter sharing. CoRR, abs/1802.03268, 2018. URL http://arxiv.org/abs/1802.03268.
Karen Simonyan and Andrew Zisserman. Very deep convolutional networks for large-scale
image recognition, 2014.
Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir
Anguelov, Dumitru Erhan, Vincent Vanhoucke, and Andrew Rabinovich. Going deeper
with convolutions, 2014.
Charles Weill, Javier Gonzalvo, Vitaly Kuznetsov, Scott Yang, Scott Yak, Hanna Mazzawi,
Eugen Hotaj, Ghassen Jerfel, Vladimir Macko, Ben Adlam, Mehryar Mohri, and Corinna
Cortes. Adanet: A scalable and flexible framework for automatically learning ensembles.
CoRR, abs/1905.00080, 2019. URL http://arxiv.org/abs/1905.00080.
Li Xu, Jimmy SJ Ren, Ce Liu, and Jiaya Jia. Deep convolutional neural network for
image deconvolution. In Z. Ghahramani, M. Welling, C. Cortes, N. D. Lawrence,
and K. Q. Weinberger, editors, Advances in Neural Information Processing Systems 27,
pages 1790–1798. Curran Associates, Inc., 2014. URL http://papers.nips.cc/paper/5485-deep-convolutional-neural-network-for-image-deconvolution.pdf.
Zhao Zhong, Junjie Yan, and Cheng-Lin Liu. Practical network blocks design with q-
learning. CoRR, abs/1708.05552, 2017. URL http://arxiv.org/abs/1708.05552.
Barret Zoph and Quoc V. Le. Neural architecture search with reinforcement learning.
CoRR, abs/1611.01578, 2016. URL http://arxiv.org/abs/1611.01578.