Report Containing Steps of How I Reached The Final Model
Report Containing Steps of How I Reached The Final Model
Independent variable - Credit-Deposit (%), Net Profit / Total funds (%), Operating Expenses /
Total Funds (%), Log_Advance to Real Estate Sector (Rs. Cr), Log_Advance to Capital Market
Sector (Rs. Cr), % of Net Non-Performing Assets to Net Advance, Return on Assets (%), and
RONW (%)
Control Variable - Repo Rate, Inflation, Gross Domestic Product (GDP)
To find out the effect of independent variables on the dependent variable I run the following
regression model.
Step 1 – Regress all independent variable against NPA
Pooled OLS Regression Model
Again, I checked for multicollinearity using VIF to ensure that there is no multicollinearity
among the variables in the new model. As VIF for each variable is less than 5. Hence,
multicollinearity does not exist anymore.
Step 4– Check for Heteroskedasticity using Breusch Pagan and white test.
Null Hypothesis – Homoscedastic in nature
Alternative hypothesis – Heteroscedastic in nature
As p-value < 0.05 in both the cases, we will reject the null hypothesis. Hence,
Heteroscedasticity exist.
As the data is heteroskedastic the pooled OLS regression will give biased and inconsistent
estimator. Therefore, I used Random effect GLS (RGLS) regression and Fixed effect GLS (FGLS)
regression for constructing new model.
Step 5 – Used RGLS and FGLS model to estimate better coefficients. Also, used Hausman test
to determine the more appropriate model between RGLS and FGLS.
Hausman Test
Null hypothesis – RGLS is appropriate
Alternative Hypothesis – RGLS is not appropriate
Fixed effect GLS (FGLS) regression
As p-value = 0.1006 > 0.05. Therefore, we will accept the null hypothesis i.e. RGLS is an
appropriate model.
Step 6 – As, many expected variables were not coming to be significant. Therefore, I checked
the model for autocorrelation using Wooldridge test and cross-sectional dependence using
Breusch-Pagan LM test of independence
As, p-value < 0.05, we will reject null hypothesis i.e. residuals across entities are not
correlated.
Result of Wooldridge Test
As, p-value < 0.05, we will reject null hypothesis i.e. No first-order autocorrelation exists.
Hence, there exist the problem of autocorrelation and cross-sectional dependence.
Step 7 – Use Cross-sectional time-series FGLS regression which solves the issue of
autocorrelation and cross-sectional dependence
Step 8 – Adding controlled variable to the model in order to remove omitted variable bias.
Included Log (GDP) and LOG (Domestic Credit) in the model as controlled variables as their
presence makes the GDP growth rate a statistically significant variable.