
University of Benghazi
Faculty of Education, Almarj
ISSN 2518-5845
Global Libyan Journal
Issue 71, June 2024
Deep Learning in Solving Mathematical Equations
Bassma A. Awad Abdlrazg, Hanan G. A. Atetalla, Sumaia Masoud Abdalla

Department of Mathematics, Faculty of Science, Omar Al-Mokhtar University, Albaida, Libya

[email protected]

Abstract
Solving mathematical equations can be a demanding task, which motivates the use of deep
learning in this field to make the task easier. Deep networks have been applied to various
fields and have shown strong performance in terms of generalization accuracy. In this paper,
a deep network, namely a stacked auto-encoder (SAE) with two hidden layers, is trained to
solve mathematical equations. The network is trained and tested using 200 different
equations, of which 100 are used for training and 100 for testing. Experimentally, the
network showed good generalization accuracy in predicting the solutions of mathematical
equations that were not used during the training of the network.
Keywords: Mathematical equations, Deep learning, Neural networks, Generalization,
Stacked auto-encoder.

1. Introduction

Deep learning is a relatively new branch of machine learning that is closer to artificial
intelligence than earlier, conventional neural network models [1]. In contrast to traditional
shallow neural networks, deep networks build hierarchical representations of data that
capture its complexity and abstractions. In other words, deep learning networks are neural
networks with many hidden layers, which is why they are called deep, and this allows them
to extract features at different levels from the input data. The training and working
principles vary from one deep network to another [2]. For example, the stacked auto-encoder
is a deep network with its own learning procedure, greedy layer-wise training, whereas a
convolutional neural network has a different architecture and training algorithm.
Recently, deep learning has made progress in the field of mathematics. Neural networks were
first applied to solving differential equations by Dissanayake and Phan-Thien [3].
Furthermore, Yosinski et al. [4] characterized the generality of a network layer as its
capacity to perform well across a range of tasks. This notion stemmed from the observation
that the initial layers of image-oriented convolutional neural networks (CNNs) converge to
comparable features across multiple network architectures and applications. We assert that
these concepts are related: when a particular representation supports strong performance
across diverse tasks, networks adapted to any of these tasks are likely to discover analogous
representations.
To illustrate, Li et al. [5] introduced a methodology for gauging layer similarity by
identifying the neuron permutations that maximize the correlation between networks, while
Singular Vector Canonical Correlation Analysis (SVCCA) offers a more computationally
efficient alternative to such correlation-based techniques. Additionally, Berg and Nyström
[6] attempted to unravel the composition of deep neural networks trained on differential
equations by scrutinizing the weights and biases directly; however, their findings were
hampered by the sensitivity of the outcomes to the local minima into which their networks
converged.
Hence, in this work we employ a stacked auto-encoder, a deep network with two hidden layers,
and train it to solve mathematical equation tasks.

2. Neural Networks

The artificial neuron model is a mathematical representation of a biological neuron,
characterized by multiple inputs (x1, ..., xn) and a single output (y). This conception
aligns with the foundational framework presented by McCulloch and Pitts, often described as
a rudimentary neuronal model. The neuron takes an input pattern and weights each of its
components through the corresponding connection weights. The linear threshold unit, an
archetype of this neuron, aggregates the inputs from numerous units and produces an output
value determined by the activation function. This follows the transfer function, which maps
the input (real values) to the output (within a specified interval); this mapping can be
either linear or nonlinear. The McCulloch and Pitts model employs the hard limiter
(threshold) function as its transfer function, denoted by (Ø).
Within the artificial neuron framework, the synaptic connections are termed weights (w),
signifying the links between the inputs and the neuron. In the McCulloch and Pitts model,
the weight (w) and threshold (θ) values are fixed. Notably, the artificial neuron model
separates input sets into two classes, thereby generating a binary output (Figure 1). The
final output (y) of the artificial neuron in the McCulloch and Pitts model is obtained by
summing the products of the weights and inputs (wi · xi) and passing the result through the
activation function Ø [7,8].
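For clarity, this output can be written compactly; a standard textbook formulation consistent with [7,8], with the hard limiter as transfer function, is:

y = \phi\left( \sum_{i=1}^{n} w_i x_i - \theta \right), \qquad
\phi(v) = \begin{cases} 1, & v \ge 0 \\ 0, & v < 0 \end{cases}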

Figure 1: Architecture of artificial-neuron model [8]

In the context of this study, the symbol N denotes the artificial neuron network model,
while the variable x denotes the input parameters under consideration. The weights, or
connection lines between the inputs and the transfer function, are denoted by w, signifying
their role in transmitting information within the network. The activation function, denoted
by Ø, is sigmoidal in nature. Moreover, θ represents the threshold value, a parameter used
to shift the decision boundary, thereby enabling the network to discern patterns that do not
pass through the origin.
In 1957, Rosenblatt introduced the first perceptron, a single-layer construct. This
pioneering development drew inspiration from the McCulloch and Pitts model and from Hebbian
learning. Notably, Rosenblatt's perceptron transcends the constraints of its precursor, the
McCulloch and Pitts artificial neuron, in that it can classify input sets into more than two
categories, whereas the latter accommodates only a binary split of the input sets (Figure 2).
Within the single-layer perceptron model, various activation functions (denoted Ø) have been
explored, including the bipolar function. The weights (w) and thresholds or biases (θ) are
determined either by analytical computation or by means of a learning algorithm. The output
(y) of the single-layer perceptron is mathematically expressed as follows [7,8,9,10].
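A standard form of this output for the jth neuron, consistent with the notation of [7,8], is:

y_j = \phi\left( \sum_{i=1}^{n} w_{ij} x_i - \theta_j \right)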

Figure 2: Architecture of Rosenblatt’s Perceptron [7]

The single-layer perceptron is competent only at solving problems that are linearly
separable. The adjustment of the inter-neuron weights is carried out by a learning
algorithm, a principle underscored by Rosenblatt's perceptron convergence theorem. This
mechanism operates by iteratively adapting the weight values, an iterative process guided by
the error term (Et,j). The operation of the perceptron's learning algorithm can be
articulated as presented below:

Let N represent a network comprising single-layer perceptrons. In this context, xt,i denotes
the ith input of the tth example, and wij denotes the weight connecting the ith input to the
jth neuron. Furthermore, θ denotes the bias or threshold applied to the neuron, and Ø
denotes the transfer or activation function characterizing the neuron's behaviour. The term
Et,j represents the error of the jth output for the tth example: yt,j denotes the desired
(target) output, while y't,j denotes the effective (actual) output produced by the network,
as described by Du and Swamy [8].
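Using this notation, a common textbook form of the perceptron update, consistent with [7,8] (the learning rate η is introduced here for illustration), is:

E_{t,j} = y_{t,j} - y'_{t,j}, \qquad
w_{ij} \leftarrow w_{ij} + \eta \, E_{t,j} \, x_{t,i}, \qquad
\theta_{j} \leftarrow \theta_{j} - \eta \, E_{t,j}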
3. Proposed Stacked Auto-Encoder for Solving Mathematical Equations

This paper introduces a novel approach employing deep learning techniques to address
mathematical equation solving. The method leverages a stacked auto-encoder (SAE) as its
core mechanism for this purpose. The choice of utilizing an SAE was influenced by extensive
research within the mathematical domain, wherein existing networks predominantly
comprised backpropagation neural networks or convolutional neural networks. Consequently,
the adoption of a stacked auto-encoder was prompted by the need to explore and assess its
efficacy in handling mathematical equations, a task characterized by its inherent complexity.
Given the intricate nature of the prediction task, a preprocessing step involving data
normalization was incorporated prior to inputting data into the network. This normalization
step aims to discern pivotal and unique data features, thereby facilitating the learning phase
of the stacked auto-encoder. The anticipated outcome of this preprocessing is enhanced
prediction performance and heightened accuracy.
The architecture of the proposed network is delineated in Figure 3, which encapsulates these
concepts.
The core objective of system identification pertains to establishing a model relationship,
determining system orders, and approximating an elusive function through a neural network
model. This procedure relies on a dataset comprising input and output data pairs. The neural
identifiers utilized in this context are Multi-Layer Feed-Forward artificial neural networks
(MLFF), characterized by an input layer, one or more hidden layers with biases, and a linear
or non-linear output layer. The proposed Black-Box (BB) system specifically employs an
MLFF with a single hidden layer. This configuration can be further elaborated as follows:
a) Input Layer:
Comprising (n+1) neurons, this layer encapsulates the equation coefficients within a linear
algebraic system. Each neuron corresponds to the coefficients of an equation, with an
additional neuron dedicated to representing the equation results.
b) Hidden Layer:
Encompassing 48 neurons, this layer serves as a conduit for individual bits from the
Expansion Permutation process.
c) Output Layer:
Featuring n neurons, this layer corresponds to the desired variable values within the equations
under consideration.
The architectural depiction of the Black-Box Neural Network (BBNN) system pertaining to
three equations with three variables is depicted in Figure 3, outlining the structural
components and their interconnections.
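As a concrete illustration of this layout, the sketch below (in Python with Keras) builds a small fully connected network for the case of three equations in three variables (n = 3); the layer sizes follow the description above, while the optimizer, loss, and activation choices are illustrative assumptions rather than settings reported in this paper.

from tensorflow import keras

n = 3  # number of unknowns (and equations) in the system

# Input layer: (n + 1) values per sample -- the n coefficients of an equation
# plus one value representing the equation result, as described above.
model = keras.Sequential([
    keras.layers.Input(shape=(n + 1,)),
    keras.layers.Dense(48, activation="sigmoid"),  # hidden layer of 48 neurons
    keras.layers.Dense(n, activation="sigmoid"),   # one output neuron per unknown variable
])

# Optimizer and loss are assumptions; the learning rate mirrors the fine-tuning value in Table 2.
model.compile(optimizer=keras.optimizers.SGD(learning_rate=0.18), loss="mse")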


Figure 3: Topology of the proposed stacked auto-encoder for solving mathematical equations.
3.1 SAE Training
This section discusses the training of the proposed SAE and its performance at this stage.
It is important to mention that all data were normalized to the range 0 to 1 before being
fed into the network, which places all inputs on a common scale. Note that the SAE is
trained on 100 of the 200 samples, which include different examples of equations.
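A minimal sketch of the min-max normalization mentioned above is given below (feature-wise scaling is an assumption; the paper does not state whether the scaling is global or per feature):

import numpy as np

def minmax_normalize(X):
    # Scale each column of X to the range [0, 1].
    x_min = X.min(axis=0)
    x_max = X.max(axis=0)
    return (X - x_min) / (x_max - x_min + 1e-12)  # epsilon guards against division by zero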
Since the stacked auto-encoder is a deep network, it needs pre-training. Pre-training is
achieved by training the stacked auto-encoder layer by layer, which is called greedy
layer-wise training. During pre-training, the outputs are not labelled; in this phase the
network learns to extract important features from the input data. This helps the network
perform accurately during fine-tuning, where it is trained to predict the answers of
different mathematical equations.
When the network finishes pre-training, it is ready to be fine-tuned. Fine-tuning is done
using the conventional backpropagation algorithm used by traditional neural networks.
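A minimal sketch of this two-stage procedure is shown below, assuming Python with Keras; the hidden-layer sizes, optimizer, epoch counts, and placeholder data shapes are illustrative assumptions, not the paper's exact settings.

import numpy as np
from tensorflow import keras

def pretrain_autoencoder(data, n_hidden, epochs=500):
    # Train one auto-encoder without labels and return its encoder plus the encoded data.
    inputs = keras.layers.Input(shape=(data.shape[1],))
    encoded = keras.layers.Dense(n_hidden, activation="sigmoid")(inputs)
    decoded = keras.layers.Dense(data.shape[1], activation="sigmoid")(encoded)
    autoencoder = keras.Model(inputs, decoded)
    autoencoder.compile(optimizer="adam", loss="mse")
    autoencoder.fit(data, data, epochs=epochs, verbose=0)  # reconstruct the input (unsupervised)
    encoder = keras.Model(inputs, encoded)
    return encoder, encoder.predict(data)

# Placeholder normalized data: 100 equations, 4 input values, 3 unknowns (assumed shapes).
X_train = np.random.rand(100, 4)
y_train = np.random.rand(100, 3)

# Greedy layer-wise pre-training of the two auto-encoders.
encoder1, h1 = pretrain_autoencoder(X_train, n_hidden=48)
encoder2, h2 = pretrain_autoencoder(h1, n_hidden=24)

# Stack the pre-trained encoders, add an output layer, and fine-tune with backpropagation.
model = keras.Sequential([encoder1, encoder2,
                          keras.layers.Dense(y_train.shape[1], activation="sigmoid")])
model.compile(optimizer="adam", loss="mse")
model.fit(X_train, y_train, epochs=400, verbose=0)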

Table 1: Dataset and data division
Data set                  Number of samples
Training set              100
Testing set               100
Total number of samples   200

Table 1 presents the number of samples in the dataset and their division into training and
testing sets, respectively.
The training parameters of the network during pre-training and fine-tuning are shown in
Table 2.
Table 2: Learning parameters of the network during pre-training and fine-tuning
Learning parameters               Values (pre-training)   Values (fine-tuning)
Number of training samples        100                     100
Number of output neurons          100                     3
Number of network layers          4                       4
Number of hidden layers           2                       2
Learning rate                     0.47                    0.18
Maximum number of iterations      500 \ 500               323 \ 400
Transfer function                 Sigmoid                 Sigmoid

Figure 4 illustrates the learning trajectory of the network throughout its pre-training
phase. The discernible trend shows a reduction in the network error as the iterations
progress. However, it is noteworthy that the error does not converge to an extremely small
value (it settles near 0.04), indicating a residual level of imprecision in the network's
predictive outcomes.


Figure 4: Learning curve of SAE during pre-training


The learning of a network can usually be assessed by examining its learning curve, a graph
that shows how the error varies as the number of training iterations increases. As can be
seen from Figure 4, the developed network learns well, as the error decreases sharply with
the increase in the number of iterations. The network was able to reach an error of 0.040218
within the maximum of 500 iterations.

Figure 5: Learning curve of the SAE during fine-tuning

Table 3: Training network performance
Learning results                        Pre-training   Fine-tuning
Number of training samples              100            100
Training recognition rate               87%            100%
Minimum square error achieved (MSE)     0.0402         0.0009
Iterations required                     500            323
Training time                           250 secs       38 secs

The performance of the designed SAE for solving mathematical equations is summarized in
Table 3. As can be seen, the network was trained on 100 samples during each of pre-training
and fine-tuning; however, its performance was not the same in the two phases. The accuracy
of the network in predicting the answers of different mathematical equations was 100% during
fine-tuning, which is higher than that obtained during pre-training (87%). Note that this
accuracy was achieved in a shorter time (38 secs) than that of pre-training (250 secs).
Moreover, the network required fewer iterations during fine-tuning to reach an error of
0.0009, which is smaller than that reached during pre-training (0.0402). The larger
pre-training error is not considered a major problem, since the performance of the network
is measured during fine-tuning, when it learns to predict the data; during pre-training the
network is only warming up and acquiring the ability to extract and identify features in the
data.
3.2 Deep Models Testing
Once trained, the stacked auto-encoder (SAE) is tested using 100 other samples. Note that
these samples are different from those used for training the network; in addition, they
include samples of all 3 classes. Table 4 shows the performance of the network during
testing, in terms of accuracy.
Table 4 shows that the trained and tested SAE achieved a good generalization ability when
tested on previously unseen data, reaching an accuracy of 91.2%.
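As a usage illustration in the same style as the sketches above (model and minmax_normalize come from the earlier sketches; the names X_test and y_test and the tolerance for counting a prediction as correct are assumptions), the trained model can be evaluated on held-out equations as follows:

import numpy as np

# In practice the minima and maxima of the training set would be reused for scaling.
preds = model.predict(minmax_normalize(X_test))
# Count a prediction as correct when every predicted variable lies within a chosen tolerance.
correct = np.all(np.abs(preds - y_test) < 0.05, axis=1)
print("Testing accuracy: %.1f%%" % (100.0 * correct.mean()))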

Table 4: Classification rates of the network during testing
Deep network    Number of testing data    Prediction rate
SAE             152                       91.2%

4. Results Discussion

A stacked auto-encoder based deep learning model is developed in this study for the purpose
of solving mathematical equations. The designed SAE is first built using two auto-encoders
and is then pre-trained in order to learn to extract useful features from the input data,
which later helps in predicting the answers of different mathematical equations more
accurately. These networks are trained using greedy layer-wise training, an algorithm that
trains the auto-encoders layer by layer in an unsupervised manner. The importance of this
training technique (pre-training) is that it provides the network with good
feature-extraction abilities and learned weights that are subsequently used when fine-tuning
the network.
In this study, the data are normalized before being fed into the network, and then
pre-training starts. The same data are then fed into the network, which is also fine-tuned
to predict the answers of mathematical equations of 3 different classes. Table 5 shows the
deep network's performance.
Table 5: Performance evaluation
Performance parameters                  SAE
Number of training samples              100
Number of testing samples               100
Training recognition rate               100%
Testing recognition rate                91.2%
Minimum square error achieved (MSE)     0.0009
Iterations required                     323
Training time                           288 secs

Table 5 shows the evaluation of the network's performance during both training and testing.
The network was able to achieve a 100% training accuracy; however, it required a relatively
long training time of 288 secs and 323 iterations to reach this accuracy. Note that the
network also achieved a very small error of 0.0009. Moreover, the network was tested on 152
different samples and achieved an accuracy of 91.2%, which can be considered a promising
result for such a mathematical application.
5. Conclusion

The current investigation centers on the use of a deep network, specifically a stacked
auto-encoder, as a tool for solving mathematical equations. The application of a deep
network in this domain is inherently intricate, given the task's demand for solving
equations precisely on the basis of their intrinsic data and attributes. This requirement
for high accuracy is driven by the field's intolerance of even minor error margins. Thus,
the deployment of a sophisticated predictive model such as a deep network can expedite and
facilitate the work of mathematicians.
The chosen approach employs a stacked auto-encoder model, which is trained and tested using
datasets of 100 samples each. The model is initially trained on normalized data derived
directly from the equations. The primary aim of data normalization is to enhance and extract
pertinent, distinctive attributes, enabling the model to discern the unique characteristics
of each sample. This feature-extraction process is crucial, contributing to the model's
proficiency in categorizing the data accurately. In turn, this improvement in classification
accuracy speeds up the network's learning and strengthens its convergence. Subsequently, the
model is assessed on a distinct set of 100 samples, with its performance evaluated in terms
of accuracy, training time, and the error incurred.
In summation, the assessment of the stacked auto-encoder's performance demonstrates its
adeptness at extracting pertinent features and delivering accurate predictions for previously
unseen equations. This underscores the model's potential as an effective predictor for
mathematical equations, given a preliminary preprocessing stage aimed at refining
classification, all while adhering to a stringent error margin.

References
[1] Helwan, A., El-Fakhri, G., Sasani, H., & Uzun Ozsahin, D. (2018). Deep networks in identifying CT brain hemorrhage. Journal of Intelligent & Fuzzy Systems, (Preprint), 1-1.
[2] Helwan, A., & Uzun Ozsahin, D. (2017). Sliding window based machine learning system for the left ventricle localization in MR cardiac images. Applied Computational Intelligence and Soft Computing, 2017.
[3] Dissanayake, M. W. M. G., & Phan-Thien, N. (1994). Neural-network-based approximations for solving partial differential equations. Communications in Numerical Methods in Engineering, 10(3), 195-201. doi: 10.1002/cnm.1640100303.
[4] Yosinski, J., Clune, J., Bengio, Y., & Lipson, H. (2014). How transferable are features in deep neural networks? In Advances in Neural Information Processing Systems, 3320-3328.
[5] Li, Y., Yosinski, J., Clune, J., Lipson, H., & Hopcroft, J. (2015). Convergent learning: Do different neural networks learn the same representations? In Feature Extraction: Modern Questions and Challenges, 196-212.
[6] Berg, J., & Nyström, K. (2017). A unified deep artificial neural network approach to partial differential equations in complex geometries. arXiv preprint arXiv:1711.06464.
[7] Haykin, S. S. (2009). Neural networks and learning machines (Vol. 3). Upper Saddle River, NJ, USA: Pearson.
[8] Du, K. L., & Swamy, M. N. (2013). Neural networks and statistical learning. Springer Science & Business Media.
[9] Zhang, Y., Jiang, D., & Wang, J. (2002). A recurrent neural network for solving Sylvester equation with time-varying coefficients. IEEE Transactions on Neural Networks, 13(5), 1053-1063.
[10] Lagaris, I. E., Likas, A., & Fotiadis, D. I. (1998). Artificial neural networks for solving ordinary and partial differential equations. IEEE Transactions on Neural Networks, 9(5), 987-1000.
