Regression: Unit Iii
Regression: Unit Iii
REGRESSION
Negative relationship
LINEAR MODEL:
Linear regression is a linear approach for modelling the relationship
between a scalar response and one or more explanatory variables (also
known as dependent and independent variables).
TERMINOLOGIES RELATED TO THE REGRESSION
ANALYSIS:
Dependent Variable: The main factor in Regression analysis which we want to
predict or understand is called the dependent variable. It is also called target
variable.
Independent Variable: The factors which affect the dependent variables or which are
used to predict the values of the dependent variables are called independent variable,
also called as a predictor.
Outliers: Outlier is an observation which contains either very low value or very high
value in comparison to other observed values. An outlier may hamper the result, so it
should be avoided.
Outliers are defined as abnormal values in a dataset that don't go with the regular
distribution and have the potential to significantly distort any regression model.
Multicollinearity: If the independent variables are highly correlated with each other
than other variables, then such condition is called Multicollinearity. It should not be
present in the dataset, because it creates problem while ranking the most affecting
variable.
WHY DO WE USE REGRESSION ANALYSIS?
Regression estimates the relationship between the target and the independent
variable.
It is used to find the trends in data.
It helps to predict real/continuous values.
By performing the regression, we can confidently determine the most important
factor, the least important factor, and how each factor is affecting the other
factors.
Types of Regression
Linear Regression
Logistic Regression
Polynomial Regression
Support Vector Regression
Decision Tree Regression
Random Forest Regression
Ridge Regression
Lasso Regression:
LINEAR REGRESSION
R- Sqaured(R^2) Error
MEAN SQUARED ERROR (MSE) :
This improves the drawback we encountered in Mean Error above. Here a
square of the difference between the actual and predicted value is calculated
to avoid any possibility of negative error.
It is measured as the average of the sum of squared differences between
predictions and actual observations.
Calculate MAE,MSE,RMSE?
MAE
Model Performance:
The Goodness of fit determines how the line of
The equation for a model with two input variables can be written as:
y = β0 + β1.x1 + β2.x2
The equation for a model with three input variables can be written as:
y = β0 + β1.x1 + β2.x2 + β3.x3
variance.
ASSIGNMENT NO:03