0% found this document useful (0 votes)

3 views

Simple Regression Model

This presentation covers the fundamentals of simple regression models, focusing on one independent variable. It explains key concepts such as the population regression function, ordinary least squares (OLS) estimates, and how to interpret coefficients, using practical examples like the relationship between hourly wage and years of experience. The presentation concludes with an analysis of the Gauss-Markov assumptions that ensure the unbiasedness of estimators.

Uploaded by

phamfred1207

Available Formats

Download as TXT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views

Simple Regression Model

Uploaded by

phamfred1207

Available Formats

Download as TXT, PDF, TXT or read online on Scribd

You are on page 1/ 15

this presentation will be about simple regression model this is a

regression model with one independent variable the outline of this presentation
would be first to review some terminology that we use provide examples and
interpretations of the coefficient then go a little more theoretical talk about
population regression function and we can derive the ols estimates then things
would get more practical with examples of simple regression including how to
interpret the results then we'll talk about variation and r

squared then we will talk about different log transformations for the
dependent and independent variable and we will end with a very important gauss
markov assumptions and how that leads to unbiasedness of the estimators and the
variance formula so let's start with the terminology this is how a simple
regression model looks like y equals beta0 plus beta 1 times x plus u here y is the
dependent variable the one that we're trying to explain x is the independent
variable uh and in this case we have only one for simple

regression u is the error term and beta 0 and beta 1 are the parameters
notice that this regression model is for the population the population is everyone
that we're trying to uh find this relationship for after we obtain sample data we
can estimate the following equation y hat equals beta0 hat plus beta 1 hat times x
here y is called the predicted value beta 0 hat and beta 1 hat are the coefficients
these are the numbers the coefficients estimated from using the sample data and as
a result we have the residual u

hat which is the difference between the actual value of the dependent
variable and the predicted value of the dependent variable which is this one so
here i provided an important distinguishment between population and sample
population is say all the u.s workers that we're interested in sample would be say
only the thousand people that we survey in our data so the parameters which we do
not know beta are for the population the coefficient that we actually get from
using the sample data is called

beta hat and error refers to the population again we don't know that the
residuals can be estimated with the sample data after the regression is is run so
let me provide a practical example of this terminology so here again we have
dependent variable y independent variable x so in this case we for y we would have
hourly wage in dollars and as independent variable we would have years of
experience so let's suppose these are the first four observations and for the first
person they have hourly wage of twenty dollars and one

year of experience for the second person this is the data and so on after
we estimate the model simple linear regression model this would be the equation of
the line of the estimated line so here we would have y hat equals beta 0 hat would
be 20 plus beta 1 hat would be 0.5 times x so suppose this is the equation that we
estimated so because we have this equation we can calculate the predicted value the
predicted value is if we substitute the formula here with x whatever the actual
value of x is

for the first person so the predicted value would be 20 plus 0.5 times 1
and for the second person would be 2 1 and 3. so i'm basically just replacing the x
with the values that we have there so these are now the predicted value the
residuals you had would be the difference between the actual and the predicted
value so this is the actual value uh whatever the person has or hourly wage and
this is what we predict using the model and that's how we calculate the residuals
here so we can now plot this estimated uh regression line

uh on this graph here and this would be actually the equation for it y
hat equals 20 plus 0.5 times x so if you look at this line it would hit here at
exactly a 20 because when x is 0 y would be 20. so that would be considered the
intercept the slope would be 0.5 so this means that for every 1 unit increase here
of x for every 1 year additional experience we would actually see um 20. we would
see 0.5 increase in the hourly wage so basically 50 cents so this is the slope of
this line is rising at 0.5 so

these are the predicted wages so the y hats always follow on this line
and the y the actual values are here so for example let's track this first person
20 is the actual wage so this is the actual wage and the predicted wage is right
here 20.5 so right here on the line for the second person or for the third person
21 is the actual wage with one years of experience so this is this dot is for the
third person here and the predicted value for them is again on this regression line
so again y hats are here and these are

the actual points here if we take the difference between the actual and
the predicted value this would be the residual here so for this person we have a
positive residual for this person we have minus 0.5 a negative residual and the the
way that we came up with this um estimated uh line estimated equation is we
actually try to plot a line here that is as close as possible to all the actual
points in the data and that's how we came up with this line here so again our
simple regression is hourly wage depends on

years of experience and here i have shown you both in a table and in
graph how this looks like so let's go a little bit more generic uh here and talk in
general about actual values predicted values and residuals so if we have the data
points being these uh points right here these are the actual points for y for our
dependent variable let's suppose here is a y i this is an actual point and this is
the predicted value that we have or here is also called the fitted value y hat so
the difference between the actual

value and the predicted value y minus y hat that is the you had the
residual so in this case we have a positive residual so for this point here we have
a negative residual and so on so for this point right here we have this being the
actual value the predicted values right here on the line so in this case the
residual would be this difference right here it would be a negative value so one
important thing to notice is that we will care but a lot about these u hats the
residuals and we would think about their

properties as the this presentation is continuing so again the goal here

is to try to plot this line through all the points in as close as possible to all
of them so how would we interpret the coefficients beta 1 hat that would be the
slope would be delta y divided by delta x so it would be interpreted so it will be
the same as changing y over the change in x the way we interpret beta 1 hat is that
it it measures by how much the dependent variable changes when the independent
variable changes by

one unit this is the generic interpretation beta 1 hat is also called the
slope in the simple linear regression and i showed you why on the graph that we saw
and the reason why we call it the slope is because it's a derivative of a function
is another function that shows the slope so in y equals uh beta 0 plus beta 1 x 1
plus u if we take the derivative basically we would find uh that this would be the
slope would be a this beta 1 that we we talked about so this formula above here
would be correct

only if we have that the change in u given the change in x is zero uh

this means that all other factors are fixed and more on that would be coming up
later in this presentation so let's talk about the population regression function
this is the regression function for the population that we we are actually trying
to estimate with data from the sample here we have expected value of y our
dependent variable given the independent variable equals and now what i will do
here is i would replace y the dependent variable with the

equation that we have seen before um given x and then because of the
properties of the expected value beta 0 and beta 1 x they they don't vary there no
therefore the expectation of beta beta0 is beta zero um so they would just
basically come out of these expectation uh term and then we have plus the expected
value of u given x so if we assume that this value is zero then we would have this
being equal to beta0 plus beta1 x so this assumption is a very important assumption
and we would call it zero conditional

mean later on and i would explain it a lot more so what this population
regression function is showing you that the expected value of y given x for the
population is a linear function of x so it's beta 0 plus beta 1x so um again that's
that's what this represents so let me show you graphically what this means so here
is a population regression function the expected value of y given x equals beta0
plus beta 1x so this one looks like the estimated regression line that we have y
hat equals beta 0 hat plus beta 1 hat

beta 1 hat times u times x but it's a different uh this one is for the
population the other one comes estimated using the sample that we currently have so
again this could be a an actual value but again these are the errors that we truly
do not know because uh they're coming from the population and we do not know and we
would never know the values of these parameters we would just only know
coefficients that are estimated with sample data so what this population regression
function also shows is that for a given value of x that

the expected value of y given x would be right here on this line but the
the value actually could be anywhere along here so it could be here here here here
and many of you can recognize this being as the normal distribution but most likely
will be around here and the expected value or the average of it would be right on
this population regression line one small point here to note is that here x1 x2 and
x3 refers not to different uh variables but just x i a given a given number or a
value of x rather than the variable uh

variable one variable two and variable three so let's uh think about
derivation of the orless estimates for a regression model we have y equals beta0
plus beta 1 x plus u so in order to estimate this regression equation which is y
hat equals beta 0 hat plus beta 1 hat times x we need to find these coefficients we
do not have these coefficients so how are we going to find them is by first
calculating residuals so we have u hat equals y minus y hat so basically the
residuals are the actual minus the predicted value

well this predicted value we can replace with the function that we have
here so we can replace with this expression so now we have a an expression for u
hat for these residuals and how we can find these two parameters is we can find a
random sample of data that has n observations um so this would be x i and y i where
we would have each observation i would be anywhere between 1 to the total number of
observations n and the goal would be to obtain as good fit as possible of this
estimated regression equation so what

does it mean to have as good fit as possible well we will minimize the
sum of squared residuals so this is what this function is doing we're minimizing
the sum of the squared residuals so again minimize the sum of the squared residuals
so we have the residuals we're going to square them and we want those differences
those residuals to be the square of them to be as small as possible so we have
already an expression for this u hat so we're going to substitute it here and the
way we minimize a function we will basically be
taking the derivative of it putting it to zero and so these expressions
are going to come from this minimization process so we would be obtaining the ls
coefficients as beta 1 hat equals and it would be summation of x i minus x bar
times y i minus y bar and in the denominator we would have x i minus x bar squared
so if you look very carefully at this expression this is exactly the formula that
we have for the covariance of x and y so basically we are thinking about how does
each x differ from its mean and how does each

y differ from its mean and in the bottom x minus x bar or x minus the
average x this is a part of the formula for the variance of x so this is what the
slope would be how does x vary with y divided by how does what's the variance of x
or in other words the covariance of x with itself and beta 0 hat this is the
intercept that would be equal to the average value of the dependent variable minus
the estimated coefficient that we have here times the average value of x so onls is
called ordinary least squares

and it's based on minimizing the squared residuals it's least because
it's minimum and squares is because we're squaring them and it's ordinary because
it's not weighted or any other kind uh so that's where the oil s term comes from so
again oil less is the method the ordinary least squares is the method we use to
obtain the coefficients and that's why they're called or ls coefficients and we do
that by minimizing the sum of square residuals so we basically want to be as close
as possible to all of the actual points

for y so based on uh uh the formulas that we have these are some of the
properties of these uh of the ols estimators first of all the we have y bar equals
beta0 had plus beta 1 hat times x this is coming directly from the formula that we
have of beta0 had just rearranged and so what we're saying here is that the sample
average for the dependent variable and for the independent variable are on the
regression line because this is the line we actually estimated estimating and so
the averages would be

on that line the second property that we have here is the summation of u
hat equals zero so basically the summation of the residuals equals zero so these
residuals uh summing up to zero means that if we have like values of the actual uh
y above the regression line we would have as many below at such that these
distances sum up to zero so note that we minimize the sum of the squared residuals
but if we just sum up the residuals they will sum up to zero and the final property
here is that the summation of x

times u hat equals zero so basically what we're saying is that the
independent variable times the residuals the summation of that equals zero in other
words the covariance between the independent variable and this residuals is 0. and
that's a very important property because what that means is that the independent
variable and the residuals would not be correlated in any way so we wouldn't see
that when x is increasing u would also be increasing the residuals would also be
increasing so this was the theoretical uh

introduction of the ols estimators for the simple regression so now let's
turn into proceeding with some examples so let's look into ceo salary and we would
be estimating simple regression model to explain how the return on equity or roe
affects the ceo salary so how we had the regression model before y equals beta0
plus beta 1 x plus u now i replaced y with the salary which is for my particular
example and x is roe so this is the regression model that applies to the population
so these are the parameters we don't know what these

are this is the error we don't know what it is but if we have sample data
which we do um we can estimate this equation salary hat that's the same as the y
hat equals beta 0 hat plus beta 1 hat times r o e and once we have these uh
coefficients uh we would be also looking at the coefficient with the at the
residuals u had which would be the actual value of salary minus the predicted value
of salary so we're going to be estimating this regression to find these
coefficients here and we would be interpreting beta 1 had

as the change in the ceo salary associated with one unit increasing
return on equity holding other factors fixed so here's the estimated equation after
we had the data and we actually ran our model so here beta 0 hat would be 963 and
beta 1 hat would be 18.5 so again these are the estimated coefficients and they
came you know based on our sample data so salary here is measured in thousand
dollars and return on equity is measured in percent it just so it happened to be
measured in percent so again beta 1 measures the change in

the ceo salary associated with one unit increasing roe holding other
effectors fixed so how would we interpret this uh beta 1 coefficient here well we
would say that the ceo salary increases by 18.501 units well what are the units for
salary well this is measured in thousand dollars so that's how we got eighteen
thousand five hundred and one because it's 18.501 thousand dollars is the units and
so for each one unit increase in roe what are the units of the roe they're in
percent so that's why we say

for each one percent increase in roe and that's how beta zero is inter
beta one head is interpreted the interpretation of beta zero head uh that's the
intercept is that if roe is equal to zero like if this is equal to zero then the
predicted value of the salary would be uh 963 units so the 963 units are in
thousand dollars so that's how we would interpret that now i would show you just
one time how we estimated this model and we did that with stata so this is how a
output for a simple regression looks

like this one came from stata but r or sas or any other software would
give you similar regression output and we will be doing the regret regression of
salary on roe and every kind of computer program gives you uh these coefficients
here so this is the dependent variable salary this is the independent variable and
this is the constant or or the intercept and so we would be picking the
coefficients from here so notice that in the previous slide i said that salary
equals 963 plus 18 times roe well this is 963

is the uh the constant here that we have and 18 is the coefficient on roe
and that's what this value is so whether you see a regression output like that that
came from a statistical program or you see an estimated equation like this they
mean exactly the same thing so to make things even more interesting how economists
like to present the results is usually in a table and this is the most common way
we would see this uh later on and here um this is the dependent variable here is
the independent variable roe and

that's constant so i just rewrote uh what is in the previous slide in a

table format so again this is the coefficient here on the roe this is the constant
the intercept that we have and the uh we haven't talked about this yet but these
would be the standard errors that you see in the table and the stars correspond are
based on the uh p values basically uh okay so these are the some of the other
things that we will talk about later um but i have shown you now three alternative
ways of looking at the same uh

the same output uh basically estimating the coefficients so you could put
it like that in an equation you could put it in a table or you can look straight at
the output from a statistical program so in this case um we estimated um the
regression line and that's how it looks like so this is the equation that we
estimated and we know these coefficients after we ran the model notice that the
population regression function for the population is still unknown uh so we will
never know the true values

of these parameters beta0 and beta1 but using our sample exactly of the
ceos that uh and their salaries their that are in our data we calculated this with
sample data so this is coming from the sample well we know and this is the
population well if we get a different sample we would be able to estimate a
different regression line maybe it would come something like that but those would
be different coefficients based on a different sample but this is the true model
that we don't know and we're basically trying to estimate these

parameters and we're getting these coefficients here so here's how the
estimated regression looks like if we estimate the model so we can actually plot
here on the horizontal line it's the x variable return on equity and on the
vertical line here that is our y variable and this is salary in thousands of
dollars and so here's how the data looks like they're like different uh points this
is the actual data and the way we estimated the regression line right here is that
it actually passes as close as

possible to all the points of the data so if we come here or anywhere

else we wouldn't be as close as possible to all of the points because the way we
found this line is we minimized the sum of these squared residuals that you see
here so basically the actual uh value to the predicted value we we minimize the sum
of um of these squared residuals and this is how um the estimated regression line
looks like with with actual data so this is an actual value that's a predicted
value so all the predicted values are on the line and all

the actual values are the dots around it so now how can we calculate the
residuals if these are the predicted values right here right on the line and these
are the actual values well a residual then is the difference between the uh the
actual or the true value and the predicted value right here so a residual would be
a a value such as uh this value that would actually be the residual for for this so
here are the residuals right here in in green and what they're doing is we already
said this these residuals

are actually if you look at that they're centered around zero so have we
seen this before yes we have because the summation of all these residuals is 0. so
it means that we have as many on below to above not not just as far as how many but
the distance here so the sum of these residuals below and above the line they they
should basically sum up to zero and so what we've effectively done is instead of
having the slope here we we did away with this slope and we just got left with the
with the residual uh values

so um everything that we said about this line here is also true here it
would hit the uh the zero here at 900 and something and if you see out of 5000 this
will be hitting right here at 963 or something like that so this is the intercept
that we saw and the slope of this is 18 so again the slope is like for each unit
increase in return on equity we would see 18 thousand dollars increase uh in uh in
in um salary of the ceo okay so to look into this yet another way this is how the
data looks like so we

have the salary uh here's the salary of the ceo this is basically in a
thousand dollars and roe the return equity for their firm is in percent measured in
percent so if we estimate um the predicted value that comes by just looking at the
estimated uh coefficients so this was the intercept plus the slope times the roe so
we can actually uh come up with the predicted value by just uh looking at uh at
this function and so for the first value here we have this number plus this times
the roe is this number

well that's how we got this number uh salary had for the second one we
have this number plus this number times the rose right here 10.9 and that's how we
got this number and so forth this is how we calculate the predicted values for the
dependent variable and the residuals are basically the actual value of salary which
is right here minus the predicted value right this one so this number minus this
number that's the residual this number minus this number that's the residual and
notice that we have negative

or positive residuals and if you sum them all up the mean for these
residuals would be zero if you actually find the mean for this predicted values
that would be exactly the mean as for the salary and for the actual values of of
the salary and these things are not coincidental they come from the properties of
the ols okay so now let's continue with another example this one is for wage and so
we can consider a simple regression model explaining how education affects the
wages for workers so in this case our regression model

would be wage equals beta0 plus beta1 education plus u so we're trying to
explain what is the hourly wage for workers given their education the estimated
equation would be wage hat equals beta0 had plus beta 1 hat times education so
these would be the estimated coefficients here and after we know the coefficients
we can calculate the residuals which are the actual minus the predicted value for a
wage so here beta1 head would be measuring the change in wage associated with one
more year of education

holding other factors uh fixed so you can say also how does uh wage
increase when um education increases by one year notice that as we talked about
before there is no correlation causation here everything is correlation uh when we
interpret these results and we also need to say holding other factors fixed because
in this case we're holding everything that is unobservable in the error term fixed
as well so if we estimate the equation we obtain that the intercept is minus 0.9
and the slope is point 54.

wage is measured in dollars per hour and education is measured in years

so um here we would interpret beta 1 hat as the hourly wage increases by 0.54 units
which is dollars per hour for each additional year of education because education
is measured in years so that's how we got that hourly wage increases by 0.54
dollars for each one additional year in education the interpretation of beta0 is
that if if education is zero the person's wage would be minus uh 90 cents but this
is an artificial con construct

because there's no one in the sample that has um zero education so here's
how regression output looks like uh again this is coming from state it could be
coming from anything else these coefficients uh we see them right here so this is
the intercept minus point nine and this is the slope point point four uh and that's
what we're concentrating on right now so that's that's where to find them in a
regression output okay so let's uh talk then about variations uh that we have so
the first variation that we're going

to be talking about is sum of squares total and that would be the sum of
squares measuring the total variation in the dependent variable so here we would
have the actual value minus the average value squared this will be the total
variation in the dependent variable sst sse would be the explained sum of squares
and that would be the difference between the predicted value and the actual value
squared and the sum of it and then the sum of squared residuals would be the
difference between the actual value and the

fitted value or the predicted value squared the sum of it so if you

actually look at this last term the actual minus the predicted value that is
exactly the residual here so we're coming full circle between these three measures
because sst the total variation in the dependent variable is the sum of squares
that we can explain with the regression and the sum of squares that we cannot
explain with the regression so one very confusing thing here is that some called
the sum of squared explained they call this sum of squared due to

regression and some called the residual sum of squares the r uh call it
sum of squares for the error and so in this case e and the r are very confusingly
reversed so when you see sse and ssr always double check what what the what they
mean okay so let's look at them on a graph so what did we mean by these things so
suppose we have an actual value of the the dependent variable y and it's here so
this is y here this number is y hat the predicted value this number right here is y
bar this is the average value for the

dependent variable so how are these variations uh coming along together

so if we don't have a regression or anything the best predictor for what uh y would
be is our the average value so we want to know by how much this regression actually
help us make a better prediction and so if we look at the predicted value minus the
average value this is the variation that we can explain with the regression if you
look at the actual values minus the predicted value that is actually the variation
that we cannot explain with with the regression

or that's the residual and the total variation is the actual value minus
the average uh the average value so this is the total variation so that's how we
are breaking down the total variation of being what we can explain with the
regression and here is the residual what we cannot explain with the regression and
what we want is as much as possible to be able to explain and as little as possible
that we're not able to explain for the regression okay so this leads us to a
goodness of fit measure we will call this r squared

and so r square is the explained sum of squares divided by the total sum
of squares so this is the explained sum of squares divided by the total sum of
squares and because sse is equal to sst minus ssr this is the same expression as
this so what the r square measures is the proportion of the total variation that is
explained by the regression so this is what we can explain with the regression
divided by the total variation in the dependent variable so r squared of 0.7 would
be interpreted as 70 of the variation is explained by the

regression and the rest is due to error and typically this like a rule of
thumb but not always used but typically an r squared of greater than 0.25 is
considered a good fit so if we can predict with our regression if we can explain at
least 25 percent of the variation we have a very good regression so here is how r
square is calculated if we look at the regression output it would give you ss the
sum of squares this would be the total sum of squares this would be the residual
sum of squares and we call this uh explain but they're

calling here a model sum of squares so this is the same as explained and
that's that value r squared is actually automatically calculated and it's 0.16 but
if we want to calculate it by hand all we have to do is divide the ss for the model
or the explained by the ss total so i will be dividing this number by that number
and i will be getting an r squared of point sixteen so the way to interpret this is
that sixteen percent of the variation in the wage would be explained by the
regression and the rest is due to error

well this is not a very good fit because we can only explain sixteen
percent of the variation with this regression the rest is actually due to error so
it's not it's not a very good fit okay now completely changing gears we would be
talking about log transformation or logging variables so sometimes variables y or x
are expressed in logs such as log of y or log of x and so with logs the
interpretation is not in units but it's in percentage or elasticity so so why would
we use logs well variables that are age or education that
are measured in units such as years should not be logged why because
oftentimes the interpretation is in percentage so what would it mean to be one
percent older right we wouldn't say this we would say one year older this is why
we're not logging variables that are measured in years variables measured in
percentage points such as interest rates also should not be logged because if we
have a one percent as as interest rate we shouldn't be further logging this because
what's a percent increase in in

something that's already a percentage and logs also cannot be used if we

have variables that have zero or negative values well so when our logs used is
actually uh often times to reduce the problems when we have very large values of
the data or so called outliers that are well uh different than the rest of the
values and taking logs we're going to talk about this later on but it helps with
homo schedulity and normality usually and we would be discussing this later on okay
so let's talk about the log log

form so in a in the linear regression model that we have we usually have

the dependent variable y and the independent variable x entering like that with the
log log form we instead of y we would have the log of y like that and instead of x
we would have the log of x here and so it would be the log log form because both
the dependent variable and the independent variable would be logged so the
interpretation of the beta 1 hat coefficient wouldn't be changing y divided by
change in x it would be changing the log of y

over the change in log of x well by the properties of the logs the change
in log of y is actually delta y divided by y and the change in log of x is delta x
divided by x which went into the numerator but if you look at this formula what is
the change of y over the value of y well that is the percent change in y and what's
the change in x divided by x well that is the percent change in x so here we would
be interpreting this beta one coefficient as the dependent variable changes by
beta1 percent

when the independent variable changes by one percent so then we have the
long linear form also called the semi-log form here is where we can log the
dependent variable but the x variable would not be logged so in this case beta 1
hat would be the change in log of y divided by the change in x and so by the
properties of the log here this will have change in y divided by y and here we will
just have the change in x so here we would just have the percent change in y but
here we would have change in x how would we interpret this

coefficient well we would say that the dependent variable changes by beta
1 times 100 percent when the independent variable changes by one unit then we have
the linear log form and so here in the linear log form the y variable is the same
but the x variable is the one that is log the independent variable so this beta 1
would be the change in y over the change in log of x well this change in log of x
is the same as changing x divided by x so here we would have the change in y divide
the percent change in x so the dependent variable y

would be changing by beta 1 hat divided by 100 units when the independent
variable changes by one percent that would be the interpretation here okay so let
me give you examples i will start with the easy example first here we have data on
wages hourly wages and this is the log wage we're taking logs of this wage variable
and this is education years so notice what this log does here we have a very a very
large value and notice how taking the logs made these values much more similar to
the rest of the data

that would be important later on when i show you a graph so here is how
we have the linear form wage on education so here we have the education on the
horizontal line and wage on the vertical line these are all the data points and you
see like how we have some large values here for for the wages and this is the
estimated regression line now if instead of using y the wage as the raw variable we
take logs of y we would have these points you see how like they're a lot more
closer together than say these points so these values

right here that had very high values are now these values right here that
are much much closer uh next to the rest of the points so again that taking logs
kind of helps put the data in in like similar ranges and this is how the log linear
uh form looks like with the estimated regression line so if we estimate the
regression these are the results so here we have wage this the linear model where
we have wage regressed on education and here we have the log wage regressed on
education so this is the

coefficient of point 54 so we've seen that before so the way to interpret

this coefficient is that wage increases by 0.54 units which is in dollars for each
additional year of education okay that's how we interpret the linear form how would
we interpret the log linear form is that when education increases by one year wage
would increase by 8.2 percent so we take this coefficient multiply by a hundred and
this is the percent increase that we we have here so one additional year of
education leads to eight point two

percent increase in uh wages okay so here is a little bit more complex

example where we have data with logs so here we have salary measured in thousand
dollars and the log of salary would be basically taking logs of these values notice
how again these these values here are very different but when we take logs they
seem more similar to each other and same with sales which is measured in million
dollars we have numbers that are very different but when we take log of sales these
numbers are actually closer to each other so

notice again the measurement units as we are about to interpret the

results so here is how the linear versus the log log forms look like this is the
raw data where we have sales is our x variable and salary is our y variable here
and so all the data points are close together but we have a few outliers here on
the salary side and we have few outliers here for the sales part so when we take
logs of the data notice that this is not like sales anymore but log of sales and
you see like how this data points

got closer together and instead of uh salary here we have the log of
salary and these points here got closer together again you don't see these points
here as much anymore so this is how the linear uh form looks like this is how the
log log form looks like and the estimated regression line so the other way are if
we just take log on this side so we brought these points down closer to each other
but we still have these points here in sales there in the original variable or the
other way is you could leave the

y variable as the original variable but take the log of the sale so now
these points are closer to each other but these points are still far away so we can
estimate four different forms depending on which variable we decide to log and
which one to not log let's interpret the coefficients so this is how the table
looks like in terms of different variables so the first the linear form is the
traditional model where we have salary regressed on sales so how we're going to
interpret this coefficient is that

when for each one unit increase in sales well sales is measured in
million dollars so for each 1 million dollars in sales increase we would have that
salary increases by the coefficient and because salary is measured in thousand
dollars by so basically it's increasing by point one five five thousand dollars
well the way to read this better is by saying uh by a hundred and fifty-five
dollars for the log log form uh we would have to interpret and this is an easy
interpretation where if sales increases by one percent then

salary uh would increase by 0.25 now notice that we're not saying the log
salary is increasing by that but the salary itself is increasing by 0.25 percent in
the log linear the interpretation of this coefficient which is very very small is
that salary would increase by point zero zero fifteen percent and that is basically
this coefficient that we see here times hundred for each additional one unit
increase in sales and sales are measured in million of dollars millions of dollars
so with the final

form we have that when uh sales increases by one percent we would see
that salary would increase by 2.64 thousand dollars basically we pick up this
coefficient we divide by 100 and we attach the units behind it so again depending
on which which of these models you want to estimate which of these transformation
you could have very different interpretation of these coefficients and that depends
basically on what what you would be interested in so with that said the review
questions for this part would be to define the

regression model estimated equation and residuals think about what method
is used to obtain the coefficients what are the all s properties how is the r
squared defined and then what does taking of logs of the variables do so now let us
continue with the gauss markov assumptions for the linear regression model these
are the standard assumptions that we need uh for the models and they are one
linearity of parameters two random sampling three no perfect collinearity or we
have the sample variance in the independent variable

this is for the case of simple regression then we have exogeneity or zero
conditional mean and what that means is that the independent variables or
regressors are not correlated with the error term and then the last assumption we
have homoscedasticity which means that the variance of the error term is constant
so let's review each of them in detail assumption number one means linearity in
parameters so here we have the linear form y equals beta 0 plus beta 1 plus x plus
times x plus u here the relationship

between y and x is linear in the population so notice that the beta 0 and
beta 1 parameters are entering linearly in this function so note that the
regression model can have different types of variables that are for example log
variables like the way we saw it with log sales early on or it could be squared
variables such as education squared or in the multiple regression could be
interactions of variables such as education time experience but the beta parameters
are linear so basically if even if you give an x to a computer

program such as theta that is squared or logged or anything like that to

stata it's still a bunch of numbers so it's still linear in the parameters that it
needs to estimate so that's an assumption number one we want these beta0 and beta1
parameters to enter linearly like that and not be multiplied or to the power of or
anything like that assumption number two means that we have random sampling so here
x i and y i are the sample points uh for the x and the y variable and here we have
i each observation is from 1 to n

going through all of the n observations in the sample so we want the data
to be a random sample drawn from the population where each observation would follow
the population equation so let's say we have data on workers so we have wage and
educational workers so suppose all the population is all of the workers in the u.s
about 150 million and the sample is the workers that are selected for the study so
about say a thousand people so we need to draw randomly from the population which
means that each worker would have equal
probability of being selected and that's a very important assumption
sometimes that's not how sampling is done and for example young workers could be
oversampled so you could have more young workers in the sample but that would not
be a random or representative sample and therefore if you try to make inferences
about the population they're not going to be correct if you don't have random
sample to begin with assumption number three is no perfect collinearity basically
what that means is that no two variables

move together exactly in the same way so in the simple regression with
only one independent variable that assumption means that there needs to be sample
variation in the independent variable or the variance of x needs to be positive so
the way to read this is the sum of squares total for x which is each value of x
minus the deviation from the mean squared needs to be positive so not all of the x
needs to be the same number such as if you have education but all everyone in the
sample has 12 years of education

you cannot estimate a model like that and the reason for that is because
this uh variable of education of 12 years for everyone would be perfectly
correlated with the constant in the model so therefore you cannot estimate the
model another way to see why we cannot estimate such model is that this sst or the
deviation of the sample value for observation i minus the mean is in the
denominator so if there's no variation and each one is equal to the sample mean of
x well this value would be a zero we

cannot be uh taking uh we cannot calculate this expression with the

denominator of zero assumption number four is very important and it's called the
zero conditional mean or also means exogeneity so here we we assume that the
expected value of ui given x i is equal to zero so the expected value of the error
term given the independent variable x is 0. so what that means is that the expected
value of the error should not differ based on the values of the independent
variable and we already know that these

errors must sum up to zero but here we're saying that they must stop sum
up to zero for each value of x so basically uh for each value of x you want those
errors to cancel out let me show you an example so let's say we have a regression
model where we have wages regressed on education so we want to explain how wage
varies based on education and the education of a person so now suppose that the
ability of a person is unobserved if such variable is unobserved it will be part of
the error term right

here and it will be included in u so one of the issues that happens here
is that when we have that the ability here being higher that may likely drive that
also the education for that person is higher so here we would have a violation of
the zero conditional mean assumption because when you would you would be higher
when x would be higher we we we do not want that we want that no matter what uh the
value of x is we want you to be independent of that so let me show you an example
of this in graphs so here we have education

and suppose these are the residuals here that we have so what this means
is that for each value of education so pick for example education 12 we have that
the expected value of the residual of the error term equals zero so basically we
could have here these residuals being above zero or below zero but their expected
value is zero so on average we would have uh those uh those residuals being equal
to zero and so independent of where we're at for x each of these would be um the
expected value of of these would be equal to 0.

now i just made up this example where this is not the case so if we have
a case something like that and these are now the residuals notice that here if we
have high value of education we would have higher expected value of the residuals
here if we have lower value of x of education we would have lower lower expected
value of the residual so here is the case where again suppose that ability is
wrapped up in the error term we have people that have higher education also have a
higher ability and that's a

violation of the zero conditional mean assumption and it's the case where
we have endogeneity here we have exogeneity so this is a very important assumption
that we need to have in order for this is very important assumption that we need to
have so why are these assumptions important well the gauss mark of assumptions one
through four which are linearity random sampling no perfect linearity and zero
conditional mean that we talked about they lead to the unbiasedness of the rls
estimators and what unbiasedness means

is that the expected value of beta0 hat equals beta 0 and the expected
value of beta 1 hat equals beta 1. so what this means is that the expected value of
the sample coefficient is the same as the population parameter so again we don't
know what the population parameters are we would never know but when we get a lot
of these sample coefficients with different samples the expected value or the
average value of these is always going to be the population parameter um so again
for any given sample the

coefficients may be different from the population but we want on average

for them to hit the population parameter this is a very very important assumption
we definitely want to have unbiasedness of of our all our oilless estimators and if
we have these four assumptions satisfied this theorem here shows that our
estimators or the coefficients here would not be biased that is very important so
let's continue with assumption five and this is the assumption of homoskedasticity
so what homoscedasticity means is that

the variance of the error term given x is equals to sigma squared or in

other words this is a constant value that does not depend based on x so the zero
conditional mean said that the expected value of u given x is zero uh so it's about
where the mean of the expect the mean of the error term is but this is about the
variance of the error term and we want this to be constant under homo scholasticity
it the opposite of homo scholasticity is heteroscedasticity and here's where the
variance of u given x is not a constant

so it varies based on for each x so let me show you examples of

homoscedasticity and heteroskedasticity so if we have this is the population
regression function that we talked about so if if um you know the the previous
assumption we talked that these um errors need to be centered around zero so
basically we would have that being right here in in the middle but this is about
how widespread they are and so here in the case of homoscedasticity they're equally
spread based on what x is in heteroskedasticity

uh in this case that we have here is that for low values of education we
have much tighter distribution here of the error term and for higher values of
education we have much more spread distribution so with the example that i gave you
of ability if that is in the error term we could have if education is low people
have very may have very similar ability to each other but in the case of say of
high education people may have very different ability from each other and so that
would be again a violation of this homoskeleton

assumption would be heteroskedasticity so these this would be the two

different uh cases so actually to look uh at this with the sample data here's a
case of homoscedasticity and we're looking at these residuals here so okay we
talked about them before centering around zero and we talked about how we want the
expected value or the mean to be zero meaning as much above that much below that's
mean to be centered around zero but now what we're saying is that for any value of
x we want these points

to be about equally distributed above and below the heteroskedasticity

here example that i'm showing you is where we have uh where for high values of
education look how widespread this varian this error term is so the variance is
very high here and look at how there's not enough spread here so the variance is
very low here so here again we have for low level of education the sarah charmer
said the abilities of people are much closer to each other and here for high uh
education values the ability of

people are very much all over the place so from very low to very high
ability so again we want homoscedasticity as our fifth assumption not a case of
heteroskedasticity so if we detect this in our data uh there are some corrections
that we need to do uh with with the data so now um let's talk about estimating the
variance of the error term uh so how we can calculate this variance of the error
term sigma hat squared so uh we we would be using the following formula we would
just get the residuals

we would square these residuals and that would be a formula for a

variance we just need to divide this by n minus 2 and this is a degrees of freedom
corrected correction where we have n the number of observation minus k which is the
independent variables in this case is one independent variable minus one so this
will be n minus two so if we use this formula that is like the variance of these
residuals that is actually this this variance of the error term this estimation has
a very desirable property

and this is that the gauss mark of assumptions one through five that we
talked about so far which is linearity random sampling no perfect linearity zero
means homoscedasticity that lead to the unbiasedness of the error variance so we
would have that the expected value of the sample variance is the same as the
population variance so not only that we have unbiasedness of the coefficients but
we also have this unbiasedness in the error variance as well and that's also a
desirable um desirable outcome here

so these estimated coefficients that we have the regression coefficients

that we talked about earlier on they're random because the sample is random so if
we had different samples we would have very different coefficients from one sample
to the next and so they actually may vary quite a bit from a sample to a sample so
what we need to know here is what is the sample variability in these ols
coefficients and how far away are from the population parameters and so these are
the formulas here for the variance

so let's go over them the variance of the slope coefficient beta 1 hat is
sigma squared this is the variance of the error term divided by the summation of
each data point x minus its mean squared so the bottom we already saw this
expression before that this sst for x this is total sum of squares or this total
variation in in x and so again this is the uh variance of the error term divided by
the total variance of x and for the variance of beta0 hat we have almost similar
formula here but it's also multiplied by this expression

so let's uh the slope is what we really care about here and so let's
think when we're going to get the high variance for the coefficients well the
variance would be high when the variance of the error term is high and this
variance would be high when the total variation in in x is slow is low so do we
want high variance or low variance of our coefficients we want low variance we want
coefficients that are very what we call precise which means they don't vary very
much from sample to sample that's a desirable

property well the way we're going to get such desirable properties that
if we have if we want low variance in the coefficients we need to have low variance
in the error terms so not much variance in in error but we want high variance in
the independent variable so again we want that independent variable to be as
variable as possible like we don't want everyone in the sample to have 12 years of
education we want people with six years or uh people with 20 years of education so
that's a good thing

as far as the the variance because high variation in in the education

years would lead to lower variance and we want also low variance for the error term
not much a lot of not a lot of variance in the air so how can we now find what we
call the standard errors for our coefficients well the standard error is the square
root of the variance and then we can replace this variance with the expression that
we talked about only then instead of sigma for the population we put sigma hat so
this is actually coming

from the sample and so these standard errors um are the ones that are
actually also shown in regression outputs that i showed you early so i showed you
their coefficients and their standard errors next to them and so these are
basically how much the the uh coefficients vary from one sample to the next or how
precisely these coefficients are calculated so with that said the review questions
for the gauss markov assumptions are you need to list and explain the five of them
you need to know which assumptions are

needed to get unbiasedness of the coefficients which assumptions are

needed to calculate the variance of these coefficients and here's a question for
you is it possible to have zero conditional mean but heteroskedasticity so that was
the end of the lecture on simple regression

The Art of Problem Solving Intermediate Algebra
96% (25)
The Art of Problem Solving Intermediate Algebra
720 pages
Digital SAT Math Practice Questions
61% (31)
Digital SAT Math Practice Questions
29 pages
Discovering Geometry Solutions Manual
70% (10)
Discovering Geometry Solutions Manual
304 pages
Woodcock Johson IV Training Manual PDF
100% (2)
Woodcock Johson IV Training Manual PDF
48 pages
Beginner's Step-By-Step Coding Course Learn Computer Programming The Easy Way, UK Edition
98% (46)
Beginner's Step-By-Step Coding Course Learn Computer Programming The Easy Way, UK Edition
360 pages
Introduction To Geometry
90% (21)
Introduction To Geometry
580 pages
The Motivational Interviewing Workbook - Exercises To Decide What You Want and How To Get There
100% (10)
The Motivational Interviewing Workbook - Exercises To Decide What You Want and How To Get There
224 pages
Algebra Cheat Sheet
97% (72)
Algebra Cheat Sheet
2 pages
Workout Log
63% (19)
Workout Log
8 pages
Physics Primer - Homework - 1
95% (42)
Physics Primer - Homework - 1
40 pages
Case 1
100% (2)
Case 1
6 pages
Golf Strategies - Dave Pelz's Short Game Bible PDF
92% (24)
Golf Strategies - Dave Pelz's Short Game Bible PDF
444 pages
Catherine V Holmes - How To Draw Cool Stuff, A Drawing Guide For Teachers and Students
97% (35)
Catherine V Holmes - How To Draw Cool Stuff, A Drawing Guide For Teachers and Students
260 pages
[Algebra Essentials Practice Workbook with Answers Linear and Quadratic Equations Cross Multiplying and Systems of Equations Improve your Math Fluency Series] Chris McMullen - Algebra Essentials Practice Workbook with A.pdf
82% (11)
[Algebra Essentials Practice Workbook with Answers Linear and Quadratic Equations Cross Multiplying and Systems of Equations Improve your Math Fluency Series] Chris McMullen - Algebra Essentials Practice Workbook with A.pdf
207 pages
Parts Work 4th Edition
100% (30)
Parts Work 4th Edition
166 pages
Math 87 Mathematics 8 - 7 Textbook An Incremental Development Stephen Hake John Saxon
100% (10)
Math 87 Mathematics 8 - 7 Textbook An Incremental Development Stephen Hake John Saxon
696 pages
Astrology Cheatsheet
98% (44)
Astrology Cheatsheet
15 pages
Lesson 11 Multiple Linear Regression
No ratings yet
Lesson 11 Multiple Linear Regression
35 pages
Etc 2410 Notes
50% (2)
Etc 2410 Notes
133 pages
Self-System Therapy For Depression Client Workbook
100% (9)
Self-System Therapy For Depression Client Workbook
113 pages
Econometrics: A Simple Introduction
From Everand
Econometrics: A Simple Introduction
K.H. Erickson
3.5/5 (5)
The Colossal Book of Mathematics PDF
100% (11)
The Colossal Book of Mathematics PDF
744 pages
Algebra 8-1studyguide
71% (7)
Algebra 8-1studyguide
110 pages
CH - 02 - Simple Linear Regression - TQT
No ratings yet
CH - 02 - Simple Linear Regression - TQT
61 pages
Linear Regression Model: Man - PN@VNP - Edu.vn
No ratings yet
Linear Regression Model: Man - PN@VNP - Edu.vn
77 pages
What Is Linear Regression
No ratings yet
What Is Linear Regression
14 pages
Multiple regression
No ratings yet
Multiple regression
14 pages
Lecture 2-3
No ratings yet
Lecture 2-3
8 pages
Text On Class
No ratings yet
Text On Class
18 pages
Chapter Two: Bivariate Regression Mode
100% (1)
Chapter Two: Bivariate Regression Mode
54 pages
Lecture 2: Simple Linear Regression Model: Recap
No ratings yet
Lecture 2: Simple Linear Regression Model: Recap
5 pages
Text - On - Class Econometrics
No ratings yet
Text - On - Class Econometrics
17 pages
Manual ML 1
No ratings yet
Manual ML 1
8 pages
Econometric estimation BETA
No ratings yet
Econometric estimation BETA
36 pages
CH - 05 - Further Issues - TQT
No ratings yet
CH - 05 - Further Issues - TQT
35 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
49 pages
Linear Regression
No ratings yet
Linear Regression
20 pages
WEEK2 Simple Regression
No ratings yet
WEEK2 Simple Regression
133 pages
3 Multiple Regression Model
No ratings yet
3 Multiple Regression Model
48 pages
Chapter 3 - Classical Simple Linear Regression
No ratings yet
Chapter 3 - Classical Simple Linear Regression
52 pages
Experiment 1
No ratings yet
Experiment 1
17 pages
Practical # 10
No ratings yet
Practical # 10
5 pages
STAT 445 Regression Analysis
No ratings yet
STAT 445 Regression Analysis
49 pages
Simple Linear and Logistic Regression
No ratings yet
Simple Linear and Logistic Regression
81 pages
Lec Topic2
No ratings yet
Lec Topic2
68 pages
04 16 Simple Regression
No ratings yet
04 16 Simple Regression
47 pages
03 Revisions L Regression
No ratings yet
03 Revisions L Regression
25 pages
Module 3 - Data Analysis_S RM
No ratings yet
Module 3 - Data Analysis_S RM
63 pages
Notes Chapter 2
No ratings yet
Notes Chapter 2
19 pages
Data Science 03 - Regression PDF
No ratings yet
Data Science 03 - Regression PDF
32 pages
STAT22209 - Chapter 02-Regression Analyisis - 2022
No ratings yet
STAT22209 - Chapter 02-Regression Analyisis - 2022
41 pages
BST 32202 LINEAR REGRESSION 6 SLR ASSUMPTIONS LSE
No ratings yet
BST 32202 LINEAR REGRESSION 6 SLR ASSUMPTIONS LSE
20 pages
Lecture 6
No ratings yet
Lecture 6
19 pages
Simple LR Lecture
No ratings yet
Simple LR Lecture
60 pages
Simple LR Lecture
No ratings yet
Simple LR Lecture
60 pages
Lecture 4
No ratings yet
Lecture 4
3 pages
Lecture 8
No ratings yet
Lecture 8
32 pages
Linear Regression2
No ratings yet
Linear Regression2
9 pages
1. Linear regression Model - Applied_Part 1&2
No ratings yet
1. Linear regression Model - Applied_Part 1&2
69 pages
Engineering - Simple Correlation and Regression - 2024
No ratings yet
Engineering - Simple Correlation and Regression - 2024
35 pages
Linear Regression
No ratings yet
Linear Regression
59 pages
Chap 10 Regression Analysis
No ratings yet
Chap 10 Regression Analysis
68 pages
ECON3049 Lecture Notes 1
No ratings yet
ECON3049 Lecture Notes 1
32 pages
Solutions Week 10
No ratings yet
Solutions Week 10
7 pages
Third, Regression Analysis Predicts Trends and Future Values
No ratings yet
Third, Regression Analysis Predicts Trends and Future Values
2 pages
R-programming - Unit 5
No ratings yet
R-programming - Unit 5
43 pages
SimpleLinearRegression PDF
No ratings yet
SimpleLinearRegression PDF
86 pages
Lecture 3 Simple Linear Regression
No ratings yet
Lecture 3 Simple Linear Regression
46 pages
CH 02 Simple Regression TQT
No ratings yet
CH 02 Simple Regression TQT
62 pages
Chapter2 1
No ratings yet
Chapter2 1
55 pages
CH-2 -Part-I- PPT
No ratings yet
CH-2 -Part-I- PPT
78 pages
Unit-4
No ratings yet
Unit-4
18 pages
1170_10045_411513
No ratings yet
1170_10045_411513
55 pages
Linear Regression Model
No ratings yet
Linear Regression Model
3 pages
Regression Analysis
No ratings yet
Regression Analysis
22 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
64 pages
Chapter - 2
No ratings yet
Chapter - 2
59 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
31 pages
EECM3724 Unit 9 ch14 Slides 2023
No ratings yet
EECM3724 Unit 9 ch14 Slides 2023
57 pages
Chapter 2 Regression Analysis Notes
No ratings yet
Chapter 2 Regression Analysis Notes
11 pages
14 Statistics and Probability
No ratings yet
14 Statistics and Probability
37 pages
Fractions
100% (10)
Fractions
50 pages
Algebra 2
95% (19)
Algebra 2
200 pages
How To Read Sheet Music For Beginners
100% (2)
How To Read Sheet Music For Beginners
15 pages
Tarasov Calculus
100% (1)
Tarasov Calculus
179 pages
How To Distinguish ADHD From Typical Toddler Behavior
100% (1)
How To Distinguish ADHD From Typical Toddler Behavior
24 pages
Pre-Algebra and Algebra
100% (23)
Pre-Algebra and Algebra
66 pages
Mathematics Fundamentals
89% (9)
Mathematics Fundamentals
198 pages
Day 2 Math, Distributive Property
No ratings yet
Day 2 Math, Distributive Property
9 pages
M. Aurelius PDF
100% (10)
M. Aurelius PDF
366 pages
Florida Teacher Certificate Examinations (FTCE) Study Guide
0% (2)
Florida Teacher Certificate Examinations (FTCE) Study Guide
20 pages
ECG Rhythm Interpretation 2007
100% (20)
ECG Rhythm Interpretation 2007
533 pages
Introduction To Generalized Linear Models: Logit Model With Categorical Predictors. Before
No ratings yet
Introduction To Generalized Linear Models: Logit Model With Categorical Predictors. Before
24 pages
Logistic Regression
No ratings yet
Logistic Regression
6 pages
Machine Learning
No ratings yet
Machine Learning
22 pages
Penggunaan Model Arima Dalam Peramalan Suhu Udara Di Sekitar Palangkaraya
No ratings yet
Penggunaan Model Arima Dalam Peramalan Suhu Udara Di Sekitar Palangkaraya
37 pages
Familywise Error: Analysis of Variance (ANOVA) Design of Experiment (DOE) Analysis of Covariance (ANCOVA)
No ratings yet
Familywise Error: Analysis of Variance (ANOVA) Design of Experiment (DOE) Analysis of Covariance (ANCOVA)
7 pages
Btech III Year i Semester (Ar20)
No ratings yet
Btech III Year i Semester (Ar20)
7 pages
Multiple Linear Regression (MLR)
No ratings yet
Multiple Linear Regression (MLR)
17 pages
Coefficient Stability
No ratings yet
Coefficient Stability
41 pages
Pengaruh Kepercayaan Diri Terhadap Prestasi Belajar Siswa
No ratings yet
Pengaruh Kepercayaan Diri Terhadap Prestasi Belajar Siswa
8 pages
Instrumental Variables: 2-Stage and 3-Stage Least Squares Regression of A Linear Systems of Equations
No ratings yet
Instrumental Variables: 2-Stage and 3-Stage Least Squares Regression of A Linear Systems of Equations
22 pages
Statistical Properties of The OLS Coefficient Estimators: ECON 351 - NOTE 4
No ratings yet
Statistical Properties of The OLS Coefficient Estimators: ECON 351 - NOTE 4
12 pages
Correlation & Regression Analysis
No ratings yet
Correlation & Regression Analysis
19 pages
Comparing Two Regression Slopes by Means of An ANCOVA
No ratings yet
Comparing Two Regression Slopes by Means of An ANCOVA
4 pages
Juniarti Dan Evelyn
No ratings yet
Juniarti Dan Evelyn
22 pages
lecture_8
No ratings yet
lecture_8
29 pages
Amazon-Fine-Food-Review - K-Means, Agglomerative & DBSCAN Clustering
No ratings yet
Amazon-Fine-Food-Review - K-Means, Agglomerative & DBSCAN Clustering
79 pages
K-Nearest Neighbour (KNN)
No ratings yet
K-Nearest Neighbour (KNN)
14 pages
6 Correlation and Regression
No ratings yet
6 Correlation and Regression
29 pages
Scatter Plot Advertisment Vs Sales
No ratings yet
Scatter Plot Advertisment Vs Sales
5 pages
STA408 Appendix
No ratings yet
STA408 Appendix
2 pages
MR Quiz 9
No ratings yet
MR Quiz 9
3 pages
GRADE 11 - Summative Assessment in Statistics and Probability
100% (2)
GRADE 11 - Summative Assessment in Statistics and Probability
5 pages
Metodologi Penelitian: Prof. Dr. H. UJIANTO, MS
No ratings yet
Metodologi Penelitian: Prof. Dr. H. UJIANTO, MS
11 pages
Lecture Notes - Random Forests PDF
100% (1)
Lecture Notes - Random Forests PDF
4 pages
Naskah Publikasi
No ratings yet
Naskah Publikasi
23 pages
18CSO106T Data Analysis Using Open Source Tool: Question Bank
No ratings yet
18CSO106T Data Analysis Using Open Source Tool: Question Bank
26 pages
VTU Module-4 Chapter-2 Ensemble Learning and Random Forests
No ratings yet
VTU Module-4 Chapter-2 Ensemble Learning and Random Forests
61 pages

Simple Regression Model

Uploaded by

Simple Regression Model

Uploaded by

this presentation will be about simple regression model this is a

properties as the this presentation is continuing so again the goal here

only if we have that the change in u given the change in x is zero uh

that's constant so i just rewrote uh what is in the previous slide in a

possible to all the points of the data so if we come here or anywhere

wage is measured in dollars per hour and education is measured in years

fitted value or the predicted value squared the sum of it so if you

dependent variable so how are these variations uh coming along together

something that's already a percentage and logs also cannot be used if we

form so in a in the linear regression model that we have we usually have

coefficient of point 54 so we've seen that before so the way to interpret

percent increase in uh wages okay so here is a little bit more complex

notice again the measurement units as we are about to interpret the

program such as theta that is squared or logged or anything like that to

cannot be uh taking uh we cannot calculate this expression with the

coefficients may be different from the population but we want on average

the variance of the error term given x is equals to sigma squared or in

so it varies based on for each x so let me show you examples of

assumption would be heteroskedasticity so these this would be the two

to be about equally distributed above and below the heteroskedasticity

we would square these residuals and that would be a formula for a

so these estimated coefficients that we have the regression coefficients

as far as the the variance because high variation in in the education

needed to get unbiasedness of the coefficients which assumptions are

You might also like