SlideShare a Scribd company logo
What is econometrics? Simple, non-technical introduction on Linear Regression/OLS as a technique
About this document… This document is not meant for presentation and is best viewed together in slideshow or printed format.  It is meant to be ‘read’, not ‘presented’ This document also covers the very basics of Econometrics.  Econometrics – as a subject – is theoretically complex.  The goal of this document is to empower the reader with an understanding of econometrics so she/he can discuss the topic with some confidence
About this document This document assumes  ‘zero-knowledge’  in econometrics and in linear regression It may appear to be long-winded at times, but it is designed to be so in order impress upon the reader the concepts that are being discussed herein Some online references and books are at the end of the document for those who are interested in further learning about econometric and statistical modeling
About this document Readers who have either a formal background in, conceptual understanding of, or keen interest in statistics would find this document helpful in ‘transitioning’ towards econometric modeling… A conceptual understanding of linear regression will also be helpful to appreciate econometrics, but this document will assume zero-knowledge in regression Econometrics as a science is founded on complex equations and assumptions based on the theories of probability and statistics – these are not covered in this document.
What is econometrics?
“Econometrics?  Isn’t that difficult?”
It’s full of formulas… and it could be complex
But…
Things must be made as simple as possible – but never simpler
This is an attempt to present econometrics as simple as possible…
What’s required to learn a little bit of econometrics
… lots of curiosity
… a little bit of patience
… a little bit of brains
… confidence in dealing with numbers
… a belief that numbers can tell stories
Let’s start with a little bit of definition What is econometrics?
What is econometrics? Econometrics is an application of statistics and mathematics  …  aimed at identifying and quantifying the relationships between two sets of variables –  (1) the predicted variables and  (2) the predictor variables.  The goal of econometrics is to  test a hypothesized causal relationship  between the predicted and the predictor variables.
What is econometrics? Econometrics is an application of statistics and mathematics  Econometrics is derived from statistics – largely regression and ‘trending’ techniques  - and from mathematics There are differences between statistics and econometrics – but the differences are academic*…  * … but not necessarily moot and unimportant For those interested about the differences, see future tutorials…
What is econometrics? …  aimed at identifying and quantifying the relationships between two sets of variables –  (1) the predicted variables and  (2) the predictor variables.  The basic goal of econometrics is to explain using formulas and numbers the relationship between  a predictor variable  – such as GRPs, adspends, competitive spends, temperature, and seasonality – and  a predicted variable  – such as awareness, sales, revenues, and profits
What is econometrics? This relationship is expressed in an equation – such as y   is the ‘predicted’ variable x  is the ‘predictor’ variable m ,  b  and  u  are the values  that econometrics want to uncover
What is econometrics? This relationship is expressed in an equation – such as y   is the ‘predicted’ variable x  is the ‘predictor’ variable m ,  b  and  u  are the values  that econometrics want to uncover We know the values of y and x Econometrics helps us identify the values of m, b and u
If we were interested in awareness and GRPs…  We can rewrite the first equation taking our interest into consideration as follows awareness = m  •  GRPs + b + u NB.  This is simplifying the relationship between GRPs and awareness drastically. The relationship is far more complex, of course – but let’s assume that this equation is true for now. What econometrics does is “estimate” the values of “m”, “b” and “u” based on the available data on Awareness and GRPs, such that we have an equation that relates Awareness and GRPs. Once m, b and u are identified and estimated, we can then use the equation to explain the movements in awareness with respect to GRPs – and predict how awareness is going to move in the future given different levels of GRPs
There are many econometric techniques…  But the most common technique is  linear regression
What is  linear regression ? A brief introduction to linear regression How to create regression lines? Regression in econometrics and marketing
Introduction to linear regression Let’s assume that x is the evolution of the number of users of a certain product across months (in ‘000), represented by time  t In the first month, for example, we see that there are 4’905 users of the product. By the 5 th  month, that has increased to about 6’800 users – and by the 26 th  month, the number of users have increased to around 34’200 Clearly, there is an increase in the number of users – and it seems, from looking at the data alone that indeed, there is a significant uptrend
If we plotted the data, we would indeed see an upward trend…  Time t, in months Product users ‘000 In the 1 st  month, we see that there are about 5’000 product users By the 30 th  month, the number of users have increased to about 40’000 users
The question If this trend held and continued into the next 12 months, how many more users will we have?
To answer this question…  …  we need to understand first the  past relationship  between the two variables –  time  and  numbers of users . We will then use this understanding of the past to predict what’s going to happen in the next 12 months The Past The Future
What bridges the gap between the past  and the future…  Once we have identified the equation or the model, we will have a better grasp of (1)  the past trends  and (2)  the potentials of the future Linear regression comes into the picture by bridging that gap between the past and the future The Past The Future Linear regression equation
With that in mind, let’s look at the chart again
From mere observation, we see an uptrend in users across time… Time t, in months Product users ‘000
How do we quantify* that uptrend? Time t, in months Product users ‘000 * Remember: In order to project into the future, we need to create a model that quantifies the relationship between time and number of users
There are an infinite number of lines that we could use to characterize the uptrend…  Time t, in months Product users ‘000 Different people have different views – even when viewing the same set of data: I can argue that the best line is the grey line, another can argue that the blue line is best, and still another can argue that the best line is the pink line
Linear regression insists that there is one (and only one) line that would best characterize the trend and the relationship between the two variables
Linear regression also insists that this equation be of the following form: … where  y is the number of users per month ‘000 x is time b is the constant u is the unexplained variance
This one line that best describes the relationship between the two variables is derived through OLS OLS – which stands for  “ordinary least squares”  – is an algorithm that defines the values of  m ,  b  and  u  …  such that the distance between the actual values and the line defined by the final values of m, b and u are at its minimum Huh
Let’s go back a few charts…  What OLS does is it  objectively  goes through these infinite number of lines – and finds the best-fitting line such that the distance between the line and the original data-points are at a minimum OLS does this iteratively – that is, through  trial-and-error  – until it arrives at the values of m, b, and u that define a line with minimum distance between it and the original data.  (Think of OLS as a search-algorithm that tries different m-b-u combinations to achieve the best-fitting line.) Remember: Given any data set, there are an infinite number of lines that can be used to describe the trend.  One can choose the “pink” to be the best and rationalize it; another person can argue that the yellow line is the best, and still another third person can defend the blue line. We can argue indefinitely about the merits of each of these infinite number of lines.
Going back to the data – the best fitting regression line, after applying OLS is…  Time t, in months Product users ‘000
By applying OLS, the equation  «y = 1.416x + 3.6329»  is found to be the best-fitting regression line It is objective and unbiased  By using OLS, we are assured that this is unbiased and objective It is linear  It conforms to the «y= mx + b + u» requirement of econometrics) It is the best-fitting line Because the OLS algorithm is aimed at minimizing the distance between the line and the data points, we are assured that it is the best-fitting line
Now comes the interesting part…  So what does the  equation exactly  mean?
The story behind  «y = 1.416x + 3.6329» This equation suggests the following –  For every  1.416-unit change in x , there is a corresponding  1-unit change in y Applying this to our data, we can say that for every 1.416 months (about 5-6 weeks), there is an additional 1’000 new users of the product 3.6329  is called the constant – it is the number of users when the product was rolled out into the marketplace (at time t = 0) These are perhaps the early adopters of the product or those who have been exposed to the product through free samples
OK, we have an equation – how do we know it’s the correct equation? First, we “eyeball” the line and the actual data Are the data points within ‘reasonable’ distance of the line?  If each of the data points  seem  to be near the trendline, then we can say initially that we have a good fit If there are data-points that are significantly far from the line, then the equation may need to be revisited – or that outlying data-point may be caused by something else apart from time
Let’s eyeball the model: There  seem  to be no data-points that are significantly away from the line…  Time t, in months Product users ‘000
Eyeballing the data, however, brings back  subjective interpretations Time t, in months Product users ‘000 One can argue that point at month 11 is significantly away from the line – and so is data for month 24… We therefore need a more accurate, more objective measurement of “fit”
How else do we know if the equation is valid or not? We look at the  r-squared  (r 2 ) –  0.9391 This suggests that the variable “time” is able to explain 93.91% of the variance or movements in the number of users The other 6.09% are unexplained by the variable “time” – and could be due to other factors that are beyond time The 6.09% unexplained variance could also be because of errors in measurements, or simply ‘random’ errors that we will never be able to uncover An r-squared of 0.75+ is considered to be acceptable as a  ‘rule-of-thumb’ The r-squared is only one of few that measure goodness-of-fit (GIF).  Other measures include adjusted R-squared, AIC/Akaike Information Criteria, RMSE/root-mean squared error, and GLM-ANOVA.  These will not be discussed here.
Will we ever have a r-squared of 1.00? Possible – but highly improbable The higher the r-squared, the better – and it possible to have a 1.00 r-squared, but in the real world, highly-improbable A r-squared of 1.00 will only happen in a perfect scenario where the model  perfectly fits and explains  the data Getting an r-squared of 0.75+ in and of itself will be a challenge
But there are deviations between the line and the data! Why do we have deviations? Because there are other things that we probably are not taking into account in this model
Deviations are not entirely bad…  Actually, the deviations are part of the story… Because these deviations are an indication that  something else apart from time  is at work, it is worth checking  why  these deviations exist This is where  analytics  and  econometrics/statistics  meet –  uncovering why things are explainable and not-explainable by a model .
Let’s go back to the original question:
What have we done so far…? We’ve modeled and derived an equation relating time-t with purchases for the first 30months
What have we done so far…? We’re fairly confident with the model because it explains about 94% of the variance in the number of purchasers, as reflected by the r-squared
Let’s now project what’s going to happen in the next 12 months…  Time t, in months Product users ‘000 At the end of the next 12 months [by month 42], we can expect to have 543’000 users – if all things remain equal
Since we don’t really know what’s going to happen in the future – and we don’t have a perfect model…  We can report ranges instead of just a line… The dashed lines indicate the range of expectations for the next 12 months We can expect that there will be about 470’000 to 616’000 users by month 42
Are you still there?
Take a sigh of relief…
Linear regression through OLS is just amongst of the many techniques in econometrics… For those interested…  Wikipedia’s page on linear regression is  here  and the OLS technique is discussed  here . Specifically on econometrics, Wikipedia’s entry is  here .  An international organization of econometricians – and some information on econometrics – can be found  here . A more detailed introduction to econometrics can be found  here .
Books on econometrics that we’ve found useful…  Econometrics by Samuel Cameron, in  Amazon.Com , is an approachable introduction to the concepts Introductory Econometrics  by Humberto Barreto uses Microsoft Excel® and includes a CD-ROM with interactive files.  A Guide to Econometrics  by Peter Kennedy is considered by most teachers in beginning econometrics and practitioners to be a good guide
Other books that might be helpful Probability plays a major role in econometrics; for those interested, ET Jaynes has an e-book (in PDF)  here .  This is heavy reading, but enlightening.  An HTML version can be found  here Since econometrics builds on statistical theory, try reading chapters on linear regression (bivariate/multivariate) in Stat101 books.  Amazon has  this list  for you to choose from.
Credits for the images use Most of the images in the presentation are from Gettyimages.Com; the ownership of GettyImages over these photos are asserted and no claims are made by the presenter, author, nor by the company on these images. We acknowledge GettyImages’ ownership of copyright over their work in this presentation. We also acknowledge and claim no ownership of the other images that have been used in this presentation/file.
This presentation Author:  Philip Tiongson  [email_address]   Audiences: Staff interested in the basics of econometrics
 
Ad

More Related Content

What's hot (20)

Time value of money
Time value of moneyTime value of money
Time value of money
Rai University
 
Financial Leverage
Financial LeverageFinancial Leverage
Financial Leverage
Farzana Doctor
 
Financial planning process
Financial planning processFinancial planning process
Financial planning process
MUDASSAR AFZAL
 
Simple linear regression
Simple linear regressionSimple linear regression
Simple linear regression
RekhaChoudhary24
 
Probability concept and Probability distribution
Probability concept and Probability distributionProbability concept and Probability distribution
Probability concept and Probability distribution
Southern Range, Berhampur, Odisha
 
Normal as Approximation to Binomial
Normal as Approximation to Binomial  Normal as Approximation to Binomial
Normal as Approximation to Binomial
Long Beach City College
 
elementary statistic
elementary statisticelementary statistic
elementary statistic
Atcharaporn Khoomtong
 
Chapter 7 an introduction to risk and return
Chapter 7 an introduction to risk and returnChapter 7 an introduction to risk and return
Chapter 7 an introduction to risk and return
Chang Keng Kai Kent
 
Normal Distribution, Binomial Distribution, Poisson Distribution
Normal Distribution, Binomial Distribution, Poisson DistributionNormal Distribution, Binomial Distribution, Poisson Distribution
Normal Distribution, Binomial Distribution, Poisson Distribution
Q Dauh Q Alam
 
Binomial distribution
Binomial distributionBinomial distribution
Binomial distribution
Global Polis
 
Financial Statement Analysis
Financial Statement AnalysisFinancial Statement Analysis
Financial Statement Analysis
LJ Foja
 
Continous random variable.
Continous random variable.Continous random variable.
Continous random variable.
Shakeel Nouman
 
Multivariate reg analysis
Multivariate reg analysisMultivariate reg analysis
Multivariate reg analysis
Irfan Hussain
 
Chapter (10).
Chapter (10).Chapter (10).
Chapter (10).
Dr. Muath Asmar
 
Introduction to Probability and Probability Distributions
Introduction to Probability and Probability DistributionsIntroduction to Probability and Probability Distributions
Introduction to Probability and Probability Distributions
Jezhabeth Villegas
 
Cash budget
Cash budgetCash budget
Cash budget
Jagannath Das
 
An overview of financial management
An overview of financial managementAn overview of financial management
An overview of financial management
premarhea
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
Salim Azad
 
Financial Management chapter-2
Financial Management chapter-2Financial Management chapter-2
Financial Management chapter-2
Rakesh Singh
 
Binomial distribution good
Binomial distribution goodBinomial distribution good
Binomial distribution good
Zahida Pervaiz
 
Financial planning process
Financial planning processFinancial planning process
Financial planning process
MUDASSAR AFZAL
 
Chapter 7 an introduction to risk and return
Chapter 7 an introduction to risk and returnChapter 7 an introduction to risk and return
Chapter 7 an introduction to risk and return
Chang Keng Kai Kent
 
Normal Distribution, Binomial Distribution, Poisson Distribution
Normal Distribution, Binomial Distribution, Poisson DistributionNormal Distribution, Binomial Distribution, Poisson Distribution
Normal Distribution, Binomial Distribution, Poisson Distribution
Q Dauh Q Alam
 
Binomial distribution
Binomial distributionBinomial distribution
Binomial distribution
Global Polis
 
Financial Statement Analysis
Financial Statement AnalysisFinancial Statement Analysis
Financial Statement Analysis
LJ Foja
 
Continous random variable.
Continous random variable.Continous random variable.
Continous random variable.
Shakeel Nouman
 
Multivariate reg analysis
Multivariate reg analysisMultivariate reg analysis
Multivariate reg analysis
Irfan Hussain
 
Introduction to Probability and Probability Distributions
Introduction to Probability and Probability DistributionsIntroduction to Probability and Probability Distributions
Introduction to Probability and Probability Distributions
Jezhabeth Villegas
 
An overview of financial management
An overview of financial managementAn overview of financial management
An overview of financial management
premarhea
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
Salim Azad
 
Financial Management chapter-2
Financial Management chapter-2Financial Management chapter-2
Financial Management chapter-2
Rakesh Singh
 
Binomial distribution good
Binomial distribution goodBinomial distribution good
Binomial distribution good
Zahida Pervaiz
 

Viewers also liked (20)

Some Advanced Remarketing Ideas
Some Advanced Remarketing IdeasSome Advanced Remarketing Ideas
Some Advanced Remarketing Ideas
Chris Thomas
 
Wireframes - a brief overview
Wireframes - a brief overviewWireframes - a brief overview
Wireframes - a brief overview
Jenni Leder
 
Understand A/B Testing in 9 use cases & 7 mistakes
Understand A/B Testing in 9 use cases & 7 mistakesUnderstand A/B Testing in 9 use cases & 7 mistakes
Understand A/B Testing in 9 use cases & 7 mistakes
TheFamily
 
How to Plug a Leaky Sales Funnel With Facebook Retargeting
How to Plug a Leaky Sales Funnel With Facebook RetargetingHow to Plug a Leaky Sales Funnel With Facebook Retargeting
How to Plug a Leaky Sales Funnel With Facebook Retargeting
Digital Marketer
 
Optimize Your Sales & Marketing Funnel
Optimize Your Sales & Marketing FunnelOptimize Your Sales & Marketing Funnel
Optimize Your Sales & Marketing Funnel
HubSpot
 
Intro to Facebook Ads
Intro to Facebook AdsIntro to Facebook Ads
Intro to Facebook Ads
Ximena Sanchez
 
Google Analytics Fundamentals: Set Up and Basics for Measurement
Google Analytics Fundamentals: Set Up and Basics for MeasurementGoogle Analytics Fundamentals: Set Up and Basics for Measurement
Google Analytics Fundamentals: Set Up and Basics for Measurement
Orbit Media Studios
 
Using Your Growth Model to Drive Smarter High Tempo Testing
Using Your Growth Model to Drive Smarter High Tempo TestingUsing Your Growth Model to Drive Smarter High Tempo Testing
Using Your Growth Model to Drive Smarter High Tempo Testing
Sean Ellis
 
Lean Community Building: Getting the Most Bang for Your Time & Money
Lean Community Building: Getting the Most Bang for  Your Time & MoneyLean Community Building: Getting the Most Bang for  Your Time & Money
Lean Community Building: Getting the Most Bang for Your Time & Money
Jennifer Lopez
 
10 Mobile Marketing Campaigns That Went Viral and Made Millions
10 Mobile Marketing Campaigns That Went Viral and Made Millions10 Mobile Marketing Campaigns That Went Viral and Made Millions
10 Mobile Marketing Campaigns That Went Viral and Made Millions
Mark Fidelman
 
How Top Brands Use Referral Programs to Drive Customer Acquisition
How Top Brands Use Referral Programs to Drive Customer AcquisitionHow Top Brands Use Referral Programs to Drive Customer Acquisition
How Top Brands Use Referral Programs to Drive Customer Acquisition
Kissmetrics on SlideShare
 
Brenda Spoonemore - A biz dev playbook for startups: Why, when and how to do ...
Brenda Spoonemore - A biz dev playbook for startups: Why, when and how to do ...Brenda Spoonemore - A biz dev playbook for startups: Why, when and how to do ...
Brenda Spoonemore - A biz dev playbook for startups: Why, when and how to do ...
GeekWire
 
10 Ways You're Using AdWords Wrong and How to Correct Those Practices
10 Ways You're Using AdWords Wrong and How to Correct Those Practices 10 Ways You're Using AdWords Wrong and How to Correct Those Practices
10 Ways You're Using AdWords Wrong and How to Correct Those Practices
Kissmetrics on SlideShare
 
No excuses user research
No excuses user researchNo excuses user research
No excuses user research
Lily Dart
 
User experience doesn't happen on a screen: It happens in the mind.
User experience doesn't happen on a screen: It happens in the mind.User experience doesn't happen on a screen: It happens in the mind.
User experience doesn't happen on a screen: It happens in the mind.
John Whalen
 
Stop Leaving Money on the Table! Optimizing your Site for Users and Revenue
Stop Leaving Money on the Table! Optimizing your Site for Users and RevenueStop Leaving Money on the Table! Optimizing your Site for Users and Revenue
Stop Leaving Money on the Table! Optimizing your Site for Users and Revenue
Josh Patrice
 
SQL Tutorial for Marketers
SQL Tutorial for MarketersSQL Tutorial for Marketers
SQL Tutorial for Marketers
Justin Mares
 
HTML & CSS Masterclass
HTML & CSS MasterclassHTML & CSS Masterclass
HTML & CSS Masterclass
Bernardo Raposo
 
How to: Viral Marketing + Brand Storytelling
How to: Viral Marketing + Brand Storytelling How to: Viral Marketing + Brand Storytelling
How to: Viral Marketing + Brand Storytelling
Elle Shelley
 
The Beginners Guide to Startup PR #startuppr
The Beginners Guide to Startup PR #startupprThe Beginners Guide to Startup PR #startuppr
The Beginners Guide to Startup PR #startuppr
Onboardly
 
Some Advanced Remarketing Ideas
Some Advanced Remarketing IdeasSome Advanced Remarketing Ideas
Some Advanced Remarketing Ideas
Chris Thomas
 
Wireframes - a brief overview
Wireframes - a brief overviewWireframes - a brief overview
Wireframes - a brief overview
Jenni Leder
 
Understand A/B Testing in 9 use cases & 7 mistakes
Understand A/B Testing in 9 use cases & 7 mistakesUnderstand A/B Testing in 9 use cases & 7 mistakes
Understand A/B Testing in 9 use cases & 7 mistakes
TheFamily
 
How to Plug a Leaky Sales Funnel With Facebook Retargeting
How to Plug a Leaky Sales Funnel With Facebook RetargetingHow to Plug a Leaky Sales Funnel With Facebook Retargeting
How to Plug a Leaky Sales Funnel With Facebook Retargeting
Digital Marketer
 
Optimize Your Sales & Marketing Funnel
Optimize Your Sales & Marketing FunnelOptimize Your Sales & Marketing Funnel
Optimize Your Sales & Marketing Funnel
HubSpot
 
Google Analytics Fundamentals: Set Up and Basics for Measurement
Google Analytics Fundamentals: Set Up and Basics for MeasurementGoogle Analytics Fundamentals: Set Up and Basics for Measurement
Google Analytics Fundamentals: Set Up and Basics for Measurement
Orbit Media Studios
 
Using Your Growth Model to Drive Smarter High Tempo Testing
Using Your Growth Model to Drive Smarter High Tempo TestingUsing Your Growth Model to Drive Smarter High Tempo Testing
Using Your Growth Model to Drive Smarter High Tempo Testing
Sean Ellis
 
Lean Community Building: Getting the Most Bang for Your Time & Money
Lean Community Building: Getting the Most Bang for  Your Time & MoneyLean Community Building: Getting the Most Bang for  Your Time & Money
Lean Community Building: Getting the Most Bang for Your Time & Money
Jennifer Lopez
 
10 Mobile Marketing Campaigns That Went Viral and Made Millions
10 Mobile Marketing Campaigns That Went Viral and Made Millions10 Mobile Marketing Campaigns That Went Viral and Made Millions
10 Mobile Marketing Campaigns That Went Viral and Made Millions
Mark Fidelman
 
How Top Brands Use Referral Programs to Drive Customer Acquisition
How Top Brands Use Referral Programs to Drive Customer AcquisitionHow Top Brands Use Referral Programs to Drive Customer Acquisition
How Top Brands Use Referral Programs to Drive Customer Acquisition
Kissmetrics on SlideShare
 
Brenda Spoonemore - A biz dev playbook for startups: Why, when and how to do ...
Brenda Spoonemore - A biz dev playbook for startups: Why, when and how to do ...Brenda Spoonemore - A biz dev playbook for startups: Why, when and how to do ...
Brenda Spoonemore - A biz dev playbook for startups: Why, when and how to do ...
GeekWire
 
10 Ways You're Using AdWords Wrong and How to Correct Those Practices
10 Ways You're Using AdWords Wrong and How to Correct Those Practices 10 Ways You're Using AdWords Wrong and How to Correct Those Practices
10 Ways You're Using AdWords Wrong and How to Correct Those Practices
Kissmetrics on SlideShare
 
No excuses user research
No excuses user researchNo excuses user research
No excuses user research
Lily Dart
 
User experience doesn't happen on a screen: It happens in the mind.
User experience doesn't happen on a screen: It happens in the mind.User experience doesn't happen on a screen: It happens in the mind.
User experience doesn't happen on a screen: It happens in the mind.
John Whalen
 
Stop Leaving Money on the Table! Optimizing your Site for Users and Revenue
Stop Leaving Money on the Table! Optimizing your Site for Users and RevenueStop Leaving Money on the Table! Optimizing your Site for Users and Revenue
Stop Leaving Money on the Table! Optimizing your Site for Users and Revenue
Josh Patrice
 
SQL Tutorial for Marketers
SQL Tutorial for MarketersSQL Tutorial for Marketers
SQL Tutorial for Marketers
Justin Mares
 
How to: Viral Marketing + Brand Storytelling
How to: Viral Marketing + Brand Storytelling How to: Viral Marketing + Brand Storytelling
How to: Viral Marketing + Brand Storytelling
Elle Shelley
 
The Beginners Guide to Startup PR #startuppr
The Beginners Guide to Startup PR #startupprThe Beginners Guide to Startup PR #startuppr
The Beginners Guide to Startup PR #startuppr
Onboardly
 
Ad

Similar to Simple (and Simplistic) Introduction to Econometrics and Linear Regression (18)

Glm
GlmGlm
Glm
Berg McFly
 
Statistics for Managers notes.pdf
Statistics for Managers notes.pdfStatistics for Managers notes.pdf
Statistics for Managers notes.pdf
Velujv
 
Real Estate Data Set
Real Estate Data SetReal Estate Data Set
Real Estate Data Set
Sarah Jimenez
 
Pentaho Meeting 2008 - Statistics & BI
Pentaho Meeting 2008 - Statistics & BIPentaho Meeting 2008 - Statistics & BI
Pentaho Meeting 2008 - Statistics & BI
Studio Synthesis
 
Statistics
StatisticsStatistics
Statistics
Learnbay Datascience
 
The future is uncertain. Some events do have a very small probabil.docx
The future is uncertain. Some events do have a very small probabil.docxThe future is uncertain. Some events do have a very small probabil.docx
The future is uncertain. Some events do have a very small probabil.docx
oreo10
 
Shopping Centers, Big Data & I.O.T
Shopping Centers, Big Data & I.O.TShopping Centers, Big Data & I.O.T
Shopping Centers, Big Data & I.O.T
Oscar Cuenca Roca
 
KIT-601 Lecture Notes-UNIT-2.pdf
KIT-601 Lecture Notes-UNIT-2.pdfKIT-601 Lecture Notes-UNIT-2.pdf
KIT-601 Lecture Notes-UNIT-2.pdf
Dr. Radhey Shyam
 
Economics for entrepreneurs
Economics for entrepreneurs Economics for entrepreneurs
Economics for entrepreneurs
Dr. Trilok Kumar Jain
 
Essay On Math 533
Essay On Math 533Essay On Math 533
Essay On Math 533
Kristin Oliver
 
Essentials of machine learning algorithms
Essentials of machine learning algorithmsEssentials of machine learning algorithms
Essentials of machine learning algorithms
Arunangsu Sahu
 
Advanced Econometrics L3-4.pptx
Advanced Econometrics L3-4.pptxAdvanced Econometrics L3-4.pptx
Advanced Econometrics L3-4.pptx
akashayosha
 
The Controversy Between Theory To Measurement And...
The Controversy Between Theory To Measurement And...The Controversy Between Theory To Measurement And...
The Controversy Between Theory To Measurement And...
Tasha Holloway
 
16 USING LINEAR REGRESSION PREDICTING THE FUTURE16 MEDIA LIBRAR.docx
16 USING LINEAR REGRESSION PREDICTING THE FUTURE16 MEDIA LIBRAR.docx16 USING LINEAR REGRESSION PREDICTING THE FUTURE16 MEDIA LIBRAR.docx
16 USING LINEAR REGRESSION PREDICTING THE FUTURE16 MEDIA LIBRAR.docx
novabroom
 
Relationship between Linear Algebra and StatisticsLinear algebra.docx
Relationship between Linear Algebra and StatisticsLinear algebra.docxRelationship between Linear Algebra and StatisticsLinear algebra.docx
Relationship between Linear Algebra and StatisticsLinear algebra.docx
debishakespeare
 
Workbook Project
Workbook ProjectWorkbook Project
Workbook Project
Brian Ryan
 
Chapter five Application to Crosssectional analysis.pptx
Chapter  five  Application to Crosssectional analysis.pptxChapter  five  Application to Crosssectional analysis.pptx
Chapter five Application to Crosssectional analysis.pptx
KifuYe
 
Statistik Chapter 1
Statistik Chapter 1Statistik Chapter 1
Statistik Chapter 1
WanBK Leo
 
Statistics for Managers notes.pdf
Statistics for Managers notes.pdfStatistics for Managers notes.pdf
Statistics for Managers notes.pdf
Velujv
 
Real Estate Data Set
Real Estate Data SetReal Estate Data Set
Real Estate Data Set
Sarah Jimenez
 
Pentaho Meeting 2008 - Statistics & BI
Pentaho Meeting 2008 - Statistics & BIPentaho Meeting 2008 - Statistics & BI
Pentaho Meeting 2008 - Statistics & BI
Studio Synthesis
 
The future is uncertain. Some events do have a very small probabil.docx
The future is uncertain. Some events do have a very small probabil.docxThe future is uncertain. Some events do have a very small probabil.docx
The future is uncertain. Some events do have a very small probabil.docx
oreo10
 
Shopping Centers, Big Data & I.O.T
Shopping Centers, Big Data & I.O.TShopping Centers, Big Data & I.O.T
Shopping Centers, Big Data & I.O.T
Oscar Cuenca Roca
 
KIT-601 Lecture Notes-UNIT-2.pdf
KIT-601 Lecture Notes-UNIT-2.pdfKIT-601 Lecture Notes-UNIT-2.pdf
KIT-601 Lecture Notes-UNIT-2.pdf
Dr. Radhey Shyam
 
Essentials of machine learning algorithms
Essentials of machine learning algorithmsEssentials of machine learning algorithms
Essentials of machine learning algorithms
Arunangsu Sahu
 
Advanced Econometrics L3-4.pptx
Advanced Econometrics L3-4.pptxAdvanced Econometrics L3-4.pptx
Advanced Econometrics L3-4.pptx
akashayosha
 
The Controversy Between Theory To Measurement And...
The Controversy Between Theory To Measurement And...The Controversy Between Theory To Measurement And...
The Controversy Between Theory To Measurement And...
Tasha Holloway
 
16 USING LINEAR REGRESSION PREDICTING THE FUTURE16 MEDIA LIBRAR.docx
16 USING LINEAR REGRESSION PREDICTING THE FUTURE16 MEDIA LIBRAR.docx16 USING LINEAR REGRESSION PREDICTING THE FUTURE16 MEDIA LIBRAR.docx
16 USING LINEAR REGRESSION PREDICTING THE FUTURE16 MEDIA LIBRAR.docx
novabroom
 
Relationship between Linear Algebra and StatisticsLinear algebra.docx
Relationship between Linear Algebra and StatisticsLinear algebra.docxRelationship between Linear Algebra and StatisticsLinear algebra.docx
Relationship between Linear Algebra and StatisticsLinear algebra.docx
debishakespeare
 
Workbook Project
Workbook ProjectWorkbook Project
Workbook Project
Brian Ryan
 
Chapter five Application to Crosssectional analysis.pptx
Chapter  five  Application to Crosssectional analysis.pptxChapter  five  Application to Crosssectional analysis.pptx
Chapter five Application to Crosssectional analysis.pptx
KifuYe
 
Statistik Chapter 1
Statistik Chapter 1Statistik Chapter 1
Statistik Chapter 1
WanBK Leo
 
Ad

Recently uploaded (20)

AlaskaSilver Corporate Presentation Apr 28 2025.pdf
AlaskaSilver Corporate Presentation Apr 28 2025.pdfAlaskaSilver Corporate Presentation Apr 28 2025.pdf
AlaskaSilver Corporate Presentation Apr 28 2025.pdf
Western Alaska Minerals Corp.
 
Salesforce_Architecture_Diagramming_Workshop (1).pptx
Salesforce_Architecture_Diagramming_Workshop (1).pptxSalesforce_Architecture_Diagramming_Workshop (1).pptx
Salesforce_Architecture_Diagramming_Workshop (1).pptx
reinbauwens1
 
Avoiding the China Tariffs: Save Costs & Stay Competitive
Avoiding the China Tariffs: Save Costs & Stay CompetitiveAvoiding the China Tariffs: Save Costs & Stay Competitive
Avoiding the China Tariffs: Save Costs & Stay Competitive
NovaLink
 
TMG - Q3 2025 Earnings Call Slides - v4.pptx
TMG - Q3 2025 Earnings Call Slides - v4.pptxTMG - Q3 2025 Earnings Call Slides - v4.pptx
TMG - Q3 2025 Earnings Call Slides - v4.pptx
Marketing847413
 
Alan Stalcup - The Enterprising CEO
Alan  Stalcup  -  The  Enterprising  CEOAlan  Stalcup  -  The  Enterprising  CEO
Alan Stalcup - The Enterprising CEO
Alan Stalcup
 
www.visualmedia.com digital markiting (1).pptx
www.visualmedia.com digital markiting (1).pptxwww.visualmedia.com digital markiting (1).pptx
www.visualmedia.com digital markiting (1).pptx
Davinder Singh
 
Liberal Price To Buy Verified Wise Accounts In 2025.pdf
Liberal Price To Buy Verified Wise Accounts In 2025.pdfLiberal Price To Buy Verified Wise Accounts In 2025.pdf
Liberal Price To Buy Verified Wise Accounts In 2025.pdf
Topvasmm
 
Smart Home Market Size, Growth and Report (2025-2034)
Smart Home Market Size, Growth and Report (2025-2034)Smart Home Market Size, Growth and Report (2025-2034)
Smart Home Market Size, Growth and Report (2025-2034)
GeorgeButtler
 
Affinity.co Lifecycle Marketing Presentation
Affinity.co Lifecycle Marketing PresentationAffinity.co Lifecycle Marketing Presentation
Affinity.co Lifecycle Marketing Presentation
omiller199514
 
Petslify Turns Pet Photos into Hug-Worthy Memories
Petslify Turns Pet Photos into Hug-Worthy MemoriesPetslify Turns Pet Photos into Hug-Worthy Memories
Petslify Turns Pet Photos into Hug-Worthy Memories
Petslify
 
INTRODUCTION OF MANAGEMENT.pdf CA SUVIDHA CHAPLOT
INTRODUCTION OF MANAGEMENT.pdf CA SUVIDHA CHAPLOTINTRODUCTION OF MANAGEMENT.pdf CA SUVIDHA CHAPLOT
INTRODUCTION OF MANAGEMENT.pdf CA SUVIDHA CHAPLOT
CA Suvidha Chaplot
 
Influence of Career Development on Retention of Employees in Private Univers...
Influence of Career Development on Retention of  Employees in Private Univers...Influence of Career Development on Retention of  Employees in Private Univers...
Influence of Career Development on Retention of Employees in Private Univers...
publication11
 
Disinformation in Society Report 2025 Key Findings
Disinformation in Society Report 2025 Key FindingsDisinformation in Society Report 2025 Key Findings
Disinformation in Society Report 2025 Key Findings
MariumAbdulhussein
 
Kiran Flemish - A Dynamic Musician
Kiran  Flemish  -  A   Dynamic  MusicianKiran  Flemish  -  A   Dynamic  Musician
Kiran Flemish - A Dynamic Musician
Kiran Flemish
 
Accounting_Basics_Complete_Guide_By_CA_Suvidha_Chaplot (1).pdf
Accounting_Basics_Complete_Guide_By_CA_Suvidha_Chaplot (1).pdfAccounting_Basics_Complete_Guide_By_CA_Suvidha_Chaplot (1).pdf
Accounting_Basics_Complete_Guide_By_CA_Suvidha_Chaplot (1).pdf
CA Suvidha Chaplot
 
Comments on Cloud Stream Part II Mobile Hub V1 Hub Agency.pdf
Comments on Cloud Stream Part II Mobile Hub V1 Hub Agency.pdfComments on Cloud Stream Part II Mobile Hub V1 Hub Agency.pdf
Comments on Cloud Stream Part II Mobile Hub V1 Hub Agency.pdf
Brij Consulting, LLC
 
The Rise of Payroll Outsourcing in the UK: Key Statistics for 2025
The Rise of Payroll Outsourcing in the UK: Key Statistics for 2025The Rise of Payroll Outsourcing in the UK: Key Statistics for 2025
The Rise of Payroll Outsourcing in the UK: Key Statistics for 2025
QX Accounting Services Ltd
 
BeMetals_Presentation_May_2025 .pdf
BeMetals_Presentation_May_2025      .pdfBeMetals_Presentation_May_2025      .pdf
BeMetals_Presentation_May_2025 .pdf
DerekIwanaka2
 
2_English_Vocabulary_In_Use_Pre-Intermediate_Cambridge_-_Fourth_Edition (1).pdf
2_English_Vocabulary_In_Use_Pre-Intermediate_Cambridge_-_Fourth_Edition (1).pdf2_English_Vocabulary_In_Use_Pre-Intermediate_Cambridge_-_Fourth_Edition (1).pdf
2_English_Vocabulary_In_Use_Pre-Intermediate_Cambridge_-_Fourth_Edition (1).pdf
ThiNgc22
 
The Fascinating World of Hats: A Brief History of Hats
The Fascinating World of Hats: A Brief History of HatsThe Fascinating World of Hats: A Brief History of Hats
The Fascinating World of Hats: A Brief History of Hats
nimrabilal030
 
Salesforce_Architecture_Diagramming_Workshop (1).pptx
Salesforce_Architecture_Diagramming_Workshop (1).pptxSalesforce_Architecture_Diagramming_Workshop (1).pptx
Salesforce_Architecture_Diagramming_Workshop (1).pptx
reinbauwens1
 
Avoiding the China Tariffs: Save Costs & Stay Competitive
Avoiding the China Tariffs: Save Costs & Stay CompetitiveAvoiding the China Tariffs: Save Costs & Stay Competitive
Avoiding the China Tariffs: Save Costs & Stay Competitive
NovaLink
 
TMG - Q3 2025 Earnings Call Slides - v4.pptx
TMG - Q3 2025 Earnings Call Slides - v4.pptxTMG - Q3 2025 Earnings Call Slides - v4.pptx
TMG - Q3 2025 Earnings Call Slides - v4.pptx
Marketing847413
 
Alan Stalcup - The Enterprising CEO
Alan  Stalcup  -  The  Enterprising  CEOAlan  Stalcup  -  The  Enterprising  CEO
Alan Stalcup - The Enterprising CEO
Alan Stalcup
 
www.visualmedia.com digital markiting (1).pptx
www.visualmedia.com digital markiting (1).pptxwww.visualmedia.com digital markiting (1).pptx
www.visualmedia.com digital markiting (1).pptx
Davinder Singh
 
Liberal Price To Buy Verified Wise Accounts In 2025.pdf
Liberal Price To Buy Verified Wise Accounts In 2025.pdfLiberal Price To Buy Verified Wise Accounts In 2025.pdf
Liberal Price To Buy Verified Wise Accounts In 2025.pdf
Topvasmm
 
Smart Home Market Size, Growth and Report (2025-2034)
Smart Home Market Size, Growth and Report (2025-2034)Smart Home Market Size, Growth and Report (2025-2034)
Smart Home Market Size, Growth and Report (2025-2034)
GeorgeButtler
 
Affinity.co Lifecycle Marketing Presentation
Affinity.co Lifecycle Marketing PresentationAffinity.co Lifecycle Marketing Presentation
Affinity.co Lifecycle Marketing Presentation
omiller199514
 
Petslify Turns Pet Photos into Hug-Worthy Memories
Petslify Turns Pet Photos into Hug-Worthy MemoriesPetslify Turns Pet Photos into Hug-Worthy Memories
Petslify Turns Pet Photos into Hug-Worthy Memories
Petslify
 
INTRODUCTION OF MANAGEMENT.pdf CA SUVIDHA CHAPLOT
INTRODUCTION OF MANAGEMENT.pdf CA SUVIDHA CHAPLOTINTRODUCTION OF MANAGEMENT.pdf CA SUVIDHA CHAPLOT
INTRODUCTION OF MANAGEMENT.pdf CA SUVIDHA CHAPLOT
CA Suvidha Chaplot
 
Influence of Career Development on Retention of Employees in Private Univers...
Influence of Career Development on Retention of  Employees in Private Univers...Influence of Career Development on Retention of  Employees in Private Univers...
Influence of Career Development on Retention of Employees in Private Univers...
publication11
 
Disinformation in Society Report 2025 Key Findings
Disinformation in Society Report 2025 Key FindingsDisinformation in Society Report 2025 Key Findings
Disinformation in Society Report 2025 Key Findings
MariumAbdulhussein
 
Kiran Flemish - A Dynamic Musician
Kiran  Flemish  -  A   Dynamic  MusicianKiran  Flemish  -  A   Dynamic  Musician
Kiran Flemish - A Dynamic Musician
Kiran Flemish
 
Accounting_Basics_Complete_Guide_By_CA_Suvidha_Chaplot (1).pdf
Accounting_Basics_Complete_Guide_By_CA_Suvidha_Chaplot (1).pdfAccounting_Basics_Complete_Guide_By_CA_Suvidha_Chaplot (1).pdf
Accounting_Basics_Complete_Guide_By_CA_Suvidha_Chaplot (1).pdf
CA Suvidha Chaplot
 
Comments on Cloud Stream Part II Mobile Hub V1 Hub Agency.pdf
Comments on Cloud Stream Part II Mobile Hub V1 Hub Agency.pdfComments on Cloud Stream Part II Mobile Hub V1 Hub Agency.pdf
Comments on Cloud Stream Part II Mobile Hub V1 Hub Agency.pdf
Brij Consulting, LLC
 
The Rise of Payroll Outsourcing in the UK: Key Statistics for 2025
The Rise of Payroll Outsourcing in the UK: Key Statistics for 2025The Rise of Payroll Outsourcing in the UK: Key Statistics for 2025
The Rise of Payroll Outsourcing in the UK: Key Statistics for 2025
QX Accounting Services Ltd
 
BeMetals_Presentation_May_2025 .pdf
BeMetals_Presentation_May_2025      .pdfBeMetals_Presentation_May_2025      .pdf
BeMetals_Presentation_May_2025 .pdf
DerekIwanaka2
 
2_English_Vocabulary_In_Use_Pre-Intermediate_Cambridge_-_Fourth_Edition (1).pdf
2_English_Vocabulary_In_Use_Pre-Intermediate_Cambridge_-_Fourth_Edition (1).pdf2_English_Vocabulary_In_Use_Pre-Intermediate_Cambridge_-_Fourth_Edition (1).pdf
2_English_Vocabulary_In_Use_Pre-Intermediate_Cambridge_-_Fourth_Edition (1).pdf
ThiNgc22
 
The Fascinating World of Hats: A Brief History of Hats
The Fascinating World of Hats: A Brief History of HatsThe Fascinating World of Hats: A Brief History of Hats
The Fascinating World of Hats: A Brief History of Hats
nimrabilal030
 

Simple (and Simplistic) Introduction to Econometrics and Linear Regression

  • 1. What is econometrics? Simple, non-technical introduction on Linear Regression/OLS as a technique
  • 2. About this document… This document is not meant for presentation and is best viewed together in slideshow or printed format. It is meant to be ‘read’, not ‘presented’ This document also covers the very basics of Econometrics. Econometrics – as a subject – is theoretically complex. The goal of this document is to empower the reader with an understanding of econometrics so she/he can discuss the topic with some confidence
  • 3. About this document This document assumes ‘zero-knowledge’ in econometrics and in linear regression It may appear to be long-winded at times, but it is designed to be so in order impress upon the reader the concepts that are being discussed herein Some online references and books are at the end of the document for those who are interested in further learning about econometric and statistical modeling
  • 4. About this document Readers who have either a formal background in, conceptual understanding of, or keen interest in statistics would find this document helpful in ‘transitioning’ towards econometric modeling… A conceptual understanding of linear regression will also be helpful to appreciate econometrics, but this document will assume zero-knowledge in regression Econometrics as a science is founded on complex equations and assumptions based on the theories of probability and statistics – these are not covered in this document.
  • 6. “Econometrics? Isn’t that difficult?”
  • 7. It’s full of formulas… and it could be complex
  • 9. Things must be made as simple as possible – but never simpler
  • 10. This is an attempt to present econometrics as simple as possible…
  • 11. What’s required to learn a little bit of econometrics
  • 12. … lots of curiosity
  • 13. … a little bit of patience
  • 14. … a little bit of brains
  • 15. … confidence in dealing with numbers
  • 16. … a belief that numbers can tell stories
  • 17. Let’s start with a little bit of definition What is econometrics?
  • 18. What is econometrics? Econometrics is an application of statistics and mathematics … aimed at identifying and quantifying the relationships between two sets of variables – (1) the predicted variables and (2) the predictor variables. The goal of econometrics is to test a hypothesized causal relationship between the predicted and the predictor variables.
  • 19. What is econometrics? Econometrics is an application of statistics and mathematics Econometrics is derived from statistics – largely regression and ‘trending’ techniques - and from mathematics There are differences between statistics and econometrics – but the differences are academic*… * … but not necessarily moot and unimportant For those interested about the differences, see future tutorials…
  • 20. What is econometrics? … aimed at identifying and quantifying the relationships between two sets of variables – (1) the predicted variables and (2) the predictor variables. The basic goal of econometrics is to explain using formulas and numbers the relationship between a predictor variable – such as GRPs, adspends, competitive spends, temperature, and seasonality – and a predicted variable – such as awareness, sales, revenues, and profits
  • 21. What is econometrics? This relationship is expressed in an equation – such as y is the ‘predicted’ variable x is the ‘predictor’ variable m , b and u are the values that econometrics want to uncover
  • 22. What is econometrics? This relationship is expressed in an equation – such as y is the ‘predicted’ variable x is the ‘predictor’ variable m , b and u are the values that econometrics want to uncover We know the values of y and x Econometrics helps us identify the values of m, b and u
  • 23. If we were interested in awareness and GRPs… We can rewrite the first equation taking our interest into consideration as follows awareness = m • GRPs + b + u NB. This is simplifying the relationship between GRPs and awareness drastically. The relationship is far more complex, of course – but let’s assume that this equation is true for now. What econometrics does is “estimate” the values of “m”, “b” and “u” based on the available data on Awareness and GRPs, such that we have an equation that relates Awareness and GRPs. Once m, b and u are identified and estimated, we can then use the equation to explain the movements in awareness with respect to GRPs – and predict how awareness is going to move in the future given different levels of GRPs
  • 24. There are many econometric techniques… But the most common technique is linear regression
  • 25. What is linear regression ? A brief introduction to linear regression How to create regression lines? Regression in econometrics and marketing
  • 26. Introduction to linear regression Let’s assume that x is the evolution of the number of users of a certain product across months (in ‘000), represented by time t In the first month, for example, we see that there are 4’905 users of the product. By the 5 th month, that has increased to about 6’800 users – and by the 26 th month, the number of users have increased to around 34’200 Clearly, there is an increase in the number of users – and it seems, from looking at the data alone that indeed, there is a significant uptrend
  • 27. If we plotted the data, we would indeed see an upward trend… Time t, in months Product users ‘000 In the 1 st month, we see that there are about 5’000 product users By the 30 th month, the number of users have increased to about 40’000 users
  • 28. The question If this trend held and continued into the next 12 months, how many more users will we have?
  • 29. To answer this question… … we need to understand first the past relationship between the two variables – time and numbers of users . We will then use this understanding of the past to predict what’s going to happen in the next 12 months The Past The Future
  • 30. What bridges the gap between the past and the future… Once we have identified the equation or the model, we will have a better grasp of (1) the past trends and (2) the potentials of the future Linear regression comes into the picture by bridging that gap between the past and the future The Past The Future Linear regression equation
  • 31. With that in mind, let’s look at the chart again
  • 32. From mere observation, we see an uptrend in users across time… Time t, in months Product users ‘000
  • 33. How do we quantify* that uptrend? Time t, in months Product users ‘000 * Remember: In order to project into the future, we need to create a model that quantifies the relationship between time and number of users
  • 34. There are an infinite number of lines that we could use to characterize the uptrend… Time t, in months Product users ‘000 Different people have different views – even when viewing the same set of data: I can argue that the best line is the grey line, another can argue that the blue line is best, and still another can argue that the best line is the pink line
  • 35. Linear regression insists that there is one (and only one) line that would best characterize the trend and the relationship between the two variables
  • 36. Linear regression also insists that this equation be of the following form: … where y is the number of users per month ‘000 x is time b is the constant u is the unexplained variance
  • 37. This one line that best describes the relationship between the two variables is derived through OLS OLS – which stands for “ordinary least squares” – is an algorithm that defines the values of m , b and u … such that the distance between the actual values and the line defined by the final values of m, b and u are at its minimum Huh
  • 38. Let’s go back a few charts… What OLS does is it objectively goes through these infinite number of lines – and finds the best-fitting line such that the distance between the line and the original data-points are at a minimum OLS does this iteratively – that is, through trial-and-error – until it arrives at the values of m, b, and u that define a line with minimum distance between it and the original data. (Think of OLS as a search-algorithm that tries different m-b-u combinations to achieve the best-fitting line.) Remember: Given any data set, there are an infinite number of lines that can be used to describe the trend. One can choose the “pink” to be the best and rationalize it; another person can argue that the yellow line is the best, and still another third person can defend the blue line. We can argue indefinitely about the merits of each of these infinite number of lines.
  • 39. Going back to the data – the best fitting regression line, after applying OLS is… Time t, in months Product users ‘000
  • 40. By applying OLS, the equation «y = 1.416x + 3.6329» is found to be the best-fitting regression line It is objective and unbiased By using OLS, we are assured that this is unbiased and objective It is linear It conforms to the «y= mx + b + u» requirement of econometrics) It is the best-fitting line Because the OLS algorithm is aimed at minimizing the distance between the line and the data points, we are assured that it is the best-fitting line
  • 41. Now comes the interesting part… So what does the equation exactly mean?
  • 42. The story behind «y = 1.416x + 3.6329» This equation suggests the following – For every 1.416-unit change in x , there is a corresponding 1-unit change in y Applying this to our data, we can say that for every 1.416 months (about 5-6 weeks), there is an additional 1’000 new users of the product 3.6329 is called the constant – it is the number of users when the product was rolled out into the marketplace (at time t = 0) These are perhaps the early adopters of the product or those who have been exposed to the product through free samples
  • 43. OK, we have an equation – how do we know it’s the correct equation? First, we “eyeball” the line and the actual data Are the data points within ‘reasonable’ distance of the line? If each of the data points seem to be near the trendline, then we can say initially that we have a good fit If there are data-points that are significantly far from the line, then the equation may need to be revisited – or that outlying data-point may be caused by something else apart from time
  • 44. Let’s eyeball the model: There seem to be no data-points that are significantly away from the line… Time t, in months Product users ‘000
  • 45. Eyeballing the data, however, brings back subjective interpretations Time t, in months Product users ‘000 One can argue that point at month 11 is significantly away from the line – and so is data for month 24… We therefore need a more accurate, more objective measurement of “fit”
  • 46. How else do we know if the equation is valid or not? We look at the r-squared (r 2 ) – 0.9391 This suggests that the variable “time” is able to explain 93.91% of the variance or movements in the number of users The other 6.09% are unexplained by the variable “time” – and could be due to other factors that are beyond time The 6.09% unexplained variance could also be because of errors in measurements, or simply ‘random’ errors that we will never be able to uncover An r-squared of 0.75+ is considered to be acceptable as a ‘rule-of-thumb’ The r-squared is only one of few that measure goodness-of-fit (GIF). Other measures include adjusted R-squared, AIC/Akaike Information Criteria, RMSE/root-mean squared error, and GLM-ANOVA. These will not be discussed here.
  • 47. Will we ever have a r-squared of 1.00? Possible – but highly improbable The higher the r-squared, the better – and it possible to have a 1.00 r-squared, but in the real world, highly-improbable A r-squared of 1.00 will only happen in a perfect scenario where the model perfectly fits and explains the data Getting an r-squared of 0.75+ in and of itself will be a challenge
  • 48. But there are deviations between the line and the data! Why do we have deviations? Because there are other things that we probably are not taking into account in this model
  • 49. Deviations are not entirely bad… Actually, the deviations are part of the story… Because these deviations are an indication that something else apart from time is at work, it is worth checking why these deviations exist This is where analytics and econometrics/statistics meet – uncovering why things are explainable and not-explainable by a model .
  • 50. Let’s go back to the original question:
  • 51. What have we done so far…? We’ve modeled and derived an equation relating time-t with purchases for the first 30months
  • 52. What have we done so far…? We’re fairly confident with the model because it explains about 94% of the variance in the number of purchasers, as reflected by the r-squared
  • 53. Let’s now project what’s going to happen in the next 12 months… Time t, in months Product users ‘000 At the end of the next 12 months [by month 42], we can expect to have 543’000 users – if all things remain equal
  • 54. Since we don’t really know what’s going to happen in the future – and we don’t have a perfect model… We can report ranges instead of just a line… The dashed lines indicate the range of expectations for the next 12 months We can expect that there will be about 470’000 to 616’000 users by month 42
  • 55. Are you still there?
  • 56. Take a sigh of relief…
  • 57. Linear regression through OLS is just amongst of the many techniques in econometrics… For those interested… Wikipedia’s page on linear regression is here and the OLS technique is discussed here . Specifically on econometrics, Wikipedia’s entry is here . An international organization of econometricians – and some information on econometrics – can be found here . A more detailed introduction to econometrics can be found here .
  • 58. Books on econometrics that we’ve found useful… Econometrics by Samuel Cameron, in Amazon.Com , is an approachable introduction to the concepts Introductory Econometrics by Humberto Barreto uses Microsoft Excel® and includes a CD-ROM with interactive files. A Guide to Econometrics by Peter Kennedy is considered by most teachers in beginning econometrics and practitioners to be a good guide
  • 59. Other books that might be helpful Probability plays a major role in econometrics; for those interested, ET Jaynes has an e-book (in PDF) here . This is heavy reading, but enlightening. An HTML version can be found here Since econometrics builds on statistical theory, try reading chapters on linear regression (bivariate/multivariate) in Stat101 books. Amazon has this list for you to choose from.
  • 60. Credits for the images use Most of the images in the presentation are from Gettyimages.Com; the ownership of GettyImages over these photos are asserted and no claims are made by the presenter, author, nor by the company on these images. We acknowledge GettyImages’ ownership of copyright over their work in this presentation. We also acknowledge and claim no ownership of the other images that have been used in this presentation/file.
  • 61. This presentation Author: Philip Tiongson [email_address] Audiences: Staff interested in the basics of econometrics
  • 62.  

Editor's Notes

  • #16: http://www.gettyimages.com/detail/88640738/The-Image-Bank
  • #17: http://www.gettyimages.com/detail/73209874/Rubberball-Productions
  • #30: http://www.flickr.com/photos/cuppini/1194887952/ http://www.flickr.com/photos/fabi42/292643814/
  • #31: http://www.flickr.com/photos/cuppini/1194887952/
  • #56: http://www.gettyimages.com/detail/200539930-010/Stone
  • #57: http://www.gettyimages.com/detail/200498009-001/Photonica