Symbolic Regression
Symbolic Regression
Subhashree Rautray
Group :3540901/91701
School, Institute :Peter the Great St. Petersburg Polytechnic University
1
Introduction
• Term "symbolic regression" (SR) represents process during which are measured
data fitted by suitable mathematical formula like x2 + C etc .
• So the goal of this to generate a function that describes a given set of datapoints
means to discover hidden , non-linear relationships in your data.
2
Evolutionary Algorithms
• With Genetic Programming, this is regression without a model. It will discover the
model from the data during the analysis. Compared to symbolic regression, the other
classic regression techniques are more like optimizations since they are just trying to
find the best coefficients to the variables that are predetermined.
3
Evolutionary Algorithms
4
Conventional vs Symbolic Regression
5
How Does Symbolic Regression Work?
• Then, we determine the terminals, which are our variables and constants,
(e.g. x, e, PI, {1..5}). These serve as our building blocks for constructing the
GP parse trees.
• The Genetic Programming algorithm will discover the function that best fits
the data using an error function such as root mean square as part of the
fitness function.
6
Example
7
Example
8
Example
• At around generation 39, we get our best of run individual, hitting all 20 fitness cases with a very small
overall error.
• This is not perfect solution but pretty close .As it determine on its own a function that matches the data 9
,approximately .
Example 2
• data set called Facebook metrics from the UCI Machine Learning Repository.
• It has been created based on an undisclosed cosmetics brand Facebook page
• The target Total Interactions is a sum of all likes, shares and comments a given post got
after it was published.
• To keep things simple, only the default binary operations are enabled: add, sub, mul, div.
The fittest solution after 20 generations is the following.
10
Output : Tree Structure
11