Linear Regression
• Linear regression is a supervised machine learning algorithm that models the linear relationship between a dependent variable and one or more independent features by fitting a linear equation to observed data.
• Because the relationship is linear, the model describes how the value of the dependent variable changes as the value of the independent variable changes.
• The linear regression model produces a sloped straight line representing the relationship between the variables, as sketched in the example below.
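As a concrete illustration, here is a minimal sketch in Python (assuming NumPy and a small synthetic dataset; all names and values are illustrative, not from the source) that fits the line y = a0 + a1x with the closed-form least-squares estimates:

    import numpy as np

    # Toy data: y is roughly 2x + 1 plus Gaussian noise.
    rng = np.random.default_rng(0)
    x = np.linspace(0, 10, 50)
    y = 2.0 * x + 1.0 + rng.normal(scale=1.0, size=x.shape)

    # Closed-form least-squares estimates for y = a0 + a1 * x.
    a1 = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
    a0 = y.mean() - a1 * x.mean()
    print(f"intercept a0 = {a0:.3f}, slope a1 = {a1:.3f}")

On this toy data the recovered intercept and slope should land close to the true values of 1 and 2.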
Types of Linear Regression
• Linear regression can be further divided into two types of algorithm:
• Simple Linear Regression: If a single independent variable is used to predict the value of a numerical dependent variable, the algorithm is called Simple Linear Regression.
• Multiple Linear Regression: If more than one independent variable is used to predict the value of a numerical dependent variable, the algorithm is called Multiple Linear Regression.

Linear Regression Line
• A straight line showing the relationship between the dependent and independent variables is called a regression line. A regression line can show two types of relationship:
• Positive Linear Relationship: If the dependent variable increases on the Y-axis as the independent variable increases on the X-axis, the relationship is termed a positive linear relationship.
• Negative Linear Relationship: If the dependent variable decreases on the Y-axis as the independent variable increases on the X-axis, the relationship is termed a negative linear relationship.

Finding the Best Fit Line
• When working with linear regression, the main goal is to find the best fit line, which means the error between the predicted values and the actual values should be minimized. The best fit line has the least error.
• For simple linear regression the line is y = a0 + a1x, where a0 is the intercept and a1 is the slope. Different values for the weights, or coefficients of the line (a0, a1), give different regression lines, so we need to calculate the best values for a0 and a1. To do this we use a cost function.

Cost Function
• The cost function is used to estimate the values of the coefficients (a0, a1) that produce the best fit line.
• The cost function optimizes the regression coefficients or weights and measures how well a linear regression model is performing. For linear regression this is typically the Mean Squared Error (MSE), the average of the squared differences between the predicted and actual values (see the sketch at the end of this section).
• The cost function measures the accuracy of the mapping function, which maps the input variable to the output variable. This mapping function is also known as the hypothesis function.

Gradient Descent
• Gradient descent is used to minimize the MSE by calculating the gradient of the cost function.
• A regression model uses gradient descent to update the coefficients of the line so as to reduce the cost function.
• This is done by choosing initial values for the coefficients at random and then iteratively updating them until the cost function reaches its minimum (see the sketch at the end of this section).

Model Performance
• Goodness of fit determines how well the regression line fits the set of observations. The process of finding the best model among various candidates is called optimization. It can be assessed by the method below.
1. R-squared method:
• R-squared is a statistical measure that determines the goodness of fit.
• It measures the strength of the relationship between the dependent and independent variables on a scale of 0 to 100%.
• A high value of R-squared indicates a small difference between the predicted values and the actual values, and hence a good model.
• It is also called the coefficient of determination, or the coefficient of multiple determination for multiple regression.
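To make the cost function concrete, here is a minimal sketch of the MSE described above, reusing the NumPy arrays x and y from the earlier snippet (the function name mse_cost is illustrative):

    def mse_cost(a0, a1, x, y):
        # Mean squared error of the hypothesis h(x) = a0 + a1 * x.
        predictions = a0 + a1 * x                # predicted values from the current line
        return np.mean((y - predictions) ** 2)   # average squared residual

Each choice of (a0, a1) gives a different regression line and therefore a different cost; the best fit line is the one that minimizes this value.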
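Gradient descent can then be sketched as an iterative loop that steps the coefficients against the gradient of this cost (the learning rate, epoch count, and starting values are illustrative choices, not values from the source):

    def gradient_descent(x, y, lr=0.02, epochs=2000):
        a0, a1 = 0.0, 0.0                            # arbitrary initial coefficients
        n = len(x)
        for _ in range(epochs):
            error = (a0 + a1 * x) - y                # residuals of the current line
            grad_a0 = (2.0 / n) * error.sum()        # dMSE/da0
            grad_a1 = (2.0 / n) * (error * x).sum()  # dMSE/da1
            a0 -= lr * grad_a0                       # move against the gradient
            a1 -= lr * grad_a1
        return a0, a1

Each iteration reduces the MSE until the updates settle near the minimum-cost coefficients.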
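Finally, a minimal sketch of the R-squared method, computed as 1 - SS_res / SS_tot (again with illustrative names, reusing x and y from above):

    def r_squared(y, y_pred):
        ss_res = np.sum((y - y_pred) ** 2)       # residual sum of squares
        ss_tot = np.sum((y - y.mean()) ** 2)     # total sum of squares
        return 1.0 - ss_res / ss_tot             # 0-1 scale; multiply by 100 for %

    a0, a1 = gradient_descent(x, y)
    print(f"R^2 = {r_squared(y, a0 + a1 * x):.3f}")

A value near 1 (100%) indicates little difference between the predicted and actual values, i.e., a good model.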