Regression Using Excel
Regression Using Excel
Correlation
The correlation coefficient (a value between -1 and +1) tells you how strongly two
variables are related to each other. We can use the CORREL function or the Analysis
Toolpak add-in in Excel to find the correlation coefficient between two variables.
In Excel, we use regression analysis to estimate the relationships between two or more
variables. There are two basic terms that you need to be familiar with:
The Dependent Variable is the factor you are trying to predict.
The Independent Variable is the factor that might influence the dependent variable.
Summary Output
The summary output tells you how well the calculated
linear regression equation fits your data source.
Standard Error is another goodness-of-fit measure that shows the precision of your
regression analysis.
ANOVA
ANOVA stands for Analysis of Variance. It gives
information about the levels of variability
within your regression model.
You can quickly visualize the relationship between the two variables by creating a graph. To create
linear regression graph, follow these steps:
Select the two variable columns of your data, including the headers.
Go to Insert tab > Charts group > Scatter Plot.
Residuals
The residuals show you how far away the actual data points are fom the predicted data points
(using the equation). For example, the first data point equals 8500. Using the equation, the
predicted data point equals 8536.214 -835.722 * 2 + 0.592 * 2800 = 8523.009, giving a residual
of 8500 - 8523.009 = -23.009.
You can also create a scatter plot of these
residuals.