Regression Test Lesson Notes (Optional Download)
Regression Test Lesson Notes (Optional Download)
In this module, you will learn about linear regression and how to perform
regression given a data.
Learning Objective:
Linear Regression
If you determine that your data sets are associated or correlated with each
other, you can perform linear regression on them. Doing so enables you to
predict additional data based on that regression model. There is single
linear when you have one independent variable (x) and one dependent
variable (y). And there is multiple linear regression when you have
multiple independent variables (x's) but only one dependent variable (y).
You can think of a dependent variable as an event that is influenced by the
independent variable. For example, when comparing how much sunlight a
plant receives versus its growth in length, it is probably better to say that
the plant’s growth is more likely dependent on the amount of sunlight than
the reverse. Thus, the plant’s length is the dependent variable and the
amount of sunlight is the independent variable.
regre
Lets talk about the graph above. So the blue dots are our actual data points
and the red light is the line of best fit for said graph. A linear regression
model assumes that the relationship is linear. So if we are given an y we
can use our linear regression model to predict the y. The difference is
between our actual and the value on the line at that point is error term.
Now that we have our x and our y. We are going to create our model and
then find the coefficient of determination. The coefficient of determination
or R-squared examines how the difference of the two variables compare to
each other. R-square is a value between 0.0 and 1.0. Where 1.0 represent a
perfect fit. The closer to 0.0 the value is the less reliable we can consider
our model to be.
Logistic Regression
Logistic regression is performed when you have categorical data in the
mix. For example, if you want to determine if sex is dependent on the
height of person, you should use logistic regression.