0% found this document useful (0 votes)

48 views

Statistics For Economists: Lecturer: DR Omid Mazdak Email: Omid - Mazdak@kcl - Ac.uk

The document provides an overview of descriptive statistics concepts for economists. It discusses types of economic data including cross-sectional, time series, and panel data. Key summary statistics are introduced, such as measures of central tendency like the mean, median, and mode, and measures of dispersion like variance and standard deviation. Random variables, samples, populations, and random sampling are also covered.

Uploaded by

Finn Wilson

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

48 views

Statistics For Economists: Lecturer: DR Omid Mazdak Email: Omid - Mazdak@kcl - Ac.uk

Uploaded by

Finn Wilson

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 25

Statistics for Economists

Lecture 1
Lecturer: Dr Omid Mazdak
Email: [email protected]

1
Lecture 1 – Descriptive Statistics

• Basic Concepts in Statistics

• Types of economic data
➢ Cross sectional data
➢ Time series data
➢ Panel data
• Key summary statistics:
➢ Measures of central tendency: mean, median, mode,
➢ Percentiles – e.g. quartiles, box plots
➢ Measures of dispersion - variance, standard deviation

2
Purpose of Statistics

Essential purpose of statistics:

• Existing knowledge and theory on which decisions are based are
often incomplete. Thus empirical observation, data collection and
statistics can aid in the decision making process.
• It is also possible to make theoretical inference from data, drawing
upon measures such as absolute frequencies, relative frequencies,
averages, dispersion and correlation.
• Sample statistics can be used to estimate population parameters.
• Statistics are used both to test existing hypothesis and for inductive
analysis.

3
Concept: Random Variable

• A random variable (RV) is any variable whose value/outcome is

non-constant and cannot be predicted exactly.
- E.g. Total exports from the UK is a random variable which varies
over time.

• A discrete random variable takes on only finite (or countably

infinite) number of values. E.g. example, numbers of workers or
students.
• A continuous random variable is a random variable that can take
on any value in some interval of values. E.g. height or weight of
individuals, travel distances etc.

4
Concept: Sample Vs. Population
• A Sample: a subset of observations from variable(s) from the
population.
- E.g. An election exit poll is drawn from a sample of the voter
population.

• The Population: All possible observations from the variable(s)

of interest.
- E.g. the final election result is from the population of the
voters.

5
Concept: Sample Vs. Population (2)
• Sample size is usually indicated by n
and the population size by N with n <
N.
• A parameter: numerical measure that
describes a specific characteristic of a
population.
• A sample statistic/estimator: is
numerical measure that describes a
specific characteristic of a sample
which is used to estimate the
population parameter. The sample
mean is an example of a sample
statistic and is used to estimate the
population mean.
Random Samples

Simple random sampling is a procedure in which

• each member of the population is chosen strictly by chance,

• each member of the population is equally likely to be chosen,
• every possible sample of n objects is equally likely to be
chosen

The resulting sample is called a random sample. Ideally, all

samples are purely random samples so that they give an
unbiased representation of the population.
7
Why use samples?
• In economics and social sciences in general, the population of data
may not be available or too costly and time consuming to collate.
• We analyse samples and obtain statistical information from samples
as a way of estimating characteristics of the population.
• In general, the larger the sample, the better the estimate of the
population parameters becomes.

8
Types of Economic Data

• Variables
• Categorical variables (defined categories or groups, e.g. male/female)
• Numerical variables
• Discrete variables (counted items)
• Continuous variables (measured characteristics)
• Data
• Cross-sectional data
• Time series data
• Panel data

9
Types of Economic Data – Cross Sectional Data

Cross-section data:
Observations from multiple
variables, at a given moment
time.

E.g. of cross sectional data:

GDP of different countries, at
a given time – e.g. 2019.

10
Types of Economic Data – Cross Sectional Data (2)
Example 2 of Cross sectional data: Countries with Largest Trade Surpluses (2019)
Trade Balance (2019)

China
Germany
Russian Federation
Saudi Arabia
Ireland
Netherlands
Italy
Australia
United Arab Emirates
Brazil
Qatar
Taipei, Chinese
Iraq

0 50 100 150 200 250 300 350 400 450 500

US$ Billions Source: International Trade Centre
11
Types of Economic Data – Time Series Data

Time-series UK GDP (1998 - 2019) Current US$ Trilions

3.5
data: 3
Is a set of 2.5
observations of

US$ Trillions
2
a single 1.5
variable over a 1
period of time… 0.5
E.g. UK GDP 0
1998-2019
1998
1999
2000
2001
2002
2003
2004
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
Source: World Bank

12
Types of Economic Data – Panel Data
UK, China, US, GDP (1998 - 2019) Current US$
Panel data: Trillions
Is a set of 25

observations of 20

multiple

US$ Trillions
15
variables over a UK
period of time… 10 China
US
E.g. UK, US 5

and China GDP 0

1998-2019

2007
1998
1999
2000
2001
2002
2003
2004
2005
2006

2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
Year

13
Measures of Central Tendency
Add up all observations of
1 n variable x, from 1 to n,
The Mean: Arithmetic mean = x =  xi and divide by the number
n i =1 of observations, n.

The median: the numerical value corresponding to the

middle observation in a dataset.

The mode: the value of the most frequent (most common)

observation in a dataset.

14
Measures of Central Tendency: Arithmetic Mean

Sample Arithmetic Mean: summing all values from all

observations from the sample and dividing by n:

Population Arithmetic Mean: summing all values from

all observations from the population and dividing by n:

15
Measures of Central Tendency: Median

The median is also known as the 50th Percentile. In other words, 50% of
the observations are below or equal to this value.

•To find the median of a distribution:

1) Arrange all the observations in order from smallest to largest.
2) The location of the median is 0.5(n + 1) observations up from the bottom of
the list.
- If the number of observations n is odd, the median is the centre observation.
- If the number of observations n is even, the median is the mean of the two
centre observations.
16
Measures of Central Tendency: Simple Example Question:

Xi = 1, 2, 5, 5, 6, 9, 11, 15

Find the a) mean, b) median and c) mode of the variable X.

a) Mean = = (1 + 2 + 5 + 5 + 6+ 9 + 11 + 15)/8 = 6.75

b) Median = 0.5(n + 1) observations up from the bottom of the list. In this

case, the median = 0.5(8+1) = 4.5th observation. So, the median is midway
between observation 4 and observation 5, which equals to (5 + 6)/2 = 5.5 =
median

c) Mode = 5 (since 5 is the most common observation)

17
Measures of Central Tendency – Grouped data (1)

Example – calculating the mean from grouped data

x=
 fx i i

UK income survey:
f i

x f fx
Class in £ Mid income point Number in thousand
0-10k 5 2448 12240
10-25k 17.5 1823 31902.5
25-40k 32.5 1375 44687.5
40-50k 45 480 21600
50-60k 55 665 36575
60-80k 70 1315 92050
80-100k 90 1640 147600
100-150k 125 2151 268875
150-200k 175 2215 387625
200-300k 250 1856 464000
300-500k 400 1057 422800
500-1000k 750 439 329250
1000-2000k 1500 122 183000
2000k+ 3000 50 150000
total 17636 2592205
Mean 146983.726 Mean ≈ 147k 18
Measures of Central Tendency – Grouped data (2)

UK income survey:
Class in £ Number in thousand frequency cumulative freq.
Mode ≈ 5k 0-10k 2448 13.88% 13.88%
10-25k 1823 10.34% 24.22%
25-40k 1375 7.80% 32.01%
40-50k 480 2.72% 34.74%
50-60k 665 3.77% 38.51%
Median ≈ 80k 60-80k 1315 7.46% 45.96%
80-100k 1640 9.30% 55.26%
100-150k 2151 12.20% 67.46%
Mean ≈ 147k 150-200k 2215 12.56% 80.02%
200-300k 1856 10.52% 90.54%
300-500k 1057 5.99% 96.54%
500-1000k 439 2.49% 99.02%
1000-2000k 122 0.69% 99.72%
2000k+ 50 0.28% 100.00%
total 17636 100%

19
Measures of Central Tendency – Grouped data (3)
UK Income Survey:
The Mode The Median The Mean
0.25 Histogram

0.2

0.15

0.1

0.05

0
10 60 110 160 210 260

• The mode < the median < the mean. So the distribution is skewed to
the right. (If the reverse was true, it would be skewed to left)
• If mode = the median = the mean, then it would be a symmetrical
distribution 20
Percentiles
A percentile is the percent of observations that are less than or equal to a given
value.
To calculate pth percentile, (for any percentile, p) the observations need to be first
ordered from lowest to highest.
Pth percentile = value located in the (P/100)(n + 1)th ordered position
So, for e.g., the 25th percentile (also known as the first, or lower quartile, Q1):
Q1 = the value in the 0.25(n + 1)th ordered position.

The 50th percentile (also known as the median):

Median = the value in the 0.5(n + 1)th ordered position.

The 75th percentile (also known as the third or upper quartile, Q3):
Q3 = the value in the 0.75(n + 1)th ordered position.
21
BOX PLOT

A Box plot can be used to summarize key percentiles and the

total range of the data. The range is the difference between the
maximum and minimum values.

22
Measures of dispersion: Variance, Standard Deviation, Coefficient of Variation
Variance is a measure of the dispersion of the data from the mean. The larger the
variance, the larger the standard deviation and the larger the coefficient of variation.

• The (sample) variance The (population) variance

𝑛 𝑛
1 1 2
𝑠2 = ෍ 𝑥𝑖 − 𝑥ǉ 2 σ2 = ෍ 𝑥𝑖 − 𝑥ǉ
𝑛−1 𝑛
𝑖=1 𝑖=1

Sample Standard deviation (Population) Standard deviation

s= 𝑠 2 σ = σ2

Sample coefficient of variation (Population) coefficient of variation

𝑠 σ
𝑥ǉ 𝑥ǉ 23
Small vs Large Standard Deviation

When the variance and

standard deviation is
relatively small, most
observations are
relatively close to the
mean.

When the variance and

standard deviation is
relatively large,
observations tend to be
further away from the
mean.

24
Summary
• Statistics can be used both to test theoretical hypothesis, and also to
create new theory from empirical observation.
• Data, can be in the form of cross sectional, time series and panel
data.
• There are three main measures of central tendency, the mean,
median and mode.
• Observations from a variable can be divided into percentiles, to give
an idea of the dispersion, and data distribution can be summarized
using a box plot.
• Variance, and the standard deviation of a variable can be used to
give a formal measure of the dispersion of the observations from the
mean.
• Next lecture, some additional descriptive statistics (covariance,
correlation) is covered, and probability theory is introduced. 25

DAF Manual PDF
No ratings yet
DAF Manual PDF
16 pages
Topic 0
No ratings yet
Topic 0
55 pages
ETA W1
No ratings yet
ETA W1
33 pages
Analitik Data Dalam Bisnis
No ratings yet
Analitik Data Dalam Bisnis
52 pages
Chapter-I
No ratings yet
Chapter-I
16 pages
Topic 1
No ratings yet
Topic 1
66 pages
CHAP01
No ratings yet
CHAP01
18 pages
FIN4333 Group Project: Market Study: Professor Abu Khan
No ratings yet
FIN4333 Group Project: Market Study: Professor Abu Khan
8 pages
Topic 1: Balance of Payments: ECON 1270 International Monetary Economics
No ratings yet
Topic 1: Balance of Payments: ECON 1270 International Monetary Economics
55 pages
Analysis of Financial Data
No ratings yet
Analysis of Financial Data
24 pages
Macroeconomics Chapter 1
No ratings yet
Macroeconomics Chapter 1
24 pages
Week 1a - MT
No ratings yet
Week 1a - MT
40 pages
Panel Data Models
No ratings yet
Panel Data Models
112 pages
7C-Group-5-BADM-ASSIGNMENT
No ratings yet
7C-Group-5-BADM-ASSIGNMENT
22 pages
Statistics Chapter 1 Notes G H J
No ratings yet
Statistics Chapter 1 Notes G H J
5 pages
Demanda Autonoma Sraffa
No ratings yet
Demanda Autonoma Sraffa
48 pages
Chapter 02 - The Structure of Economic Data and Basic Data Handling
No ratings yet
Chapter 02 - The Structure of Economic Data and Basic Data Handling
12 pages
Econ 299 Chapter 1.0
No ratings yet
Econ 299 Chapter 1.0
107 pages
GEE_Lecture03_2024_Manysheva
No ratings yet
GEE_Lecture03_2024_Manysheva
76 pages
Macro Session 1
No ratings yet
Macro Session 1
27 pages
Lecture 2 Feasibility of International Trade ppt
No ratings yet
Lecture 2 Feasibility of International Trade ppt
35 pages
Unit 1 Introduction To Forecasting
No ratings yet
Unit 1 Introduction To Forecasting
36 pages
Manegrial Economics en G66 006
No ratings yet
Manegrial Economics en G66 006
21 pages
Chapter 2
No ratings yet
Chapter 2
32 pages
chap. I..introduction
No ratings yet
chap. I..introduction
32 pages
Chapter 3 - Growth and The Asian Experience
No ratings yet
Chapter 3 - Growth and The Asian Experience
55 pages
1-What-is-macroeconoimics-Measuring-the-macroeconomy
No ratings yet
1-What-is-macroeconoimics-Measuring-the-macroeconomy
58 pages
L4
No ratings yet
L4
33 pages
The Chart Below Shows The Expenditure of Two Countries On Consumer Goods in 2010
100% (1)
The Chart Below Shows The Expenditure of Two Countries On Consumer Goods in 2010
35 pages
Article On Internet Trading
No ratings yet
Article On Internet Trading
13 pages
H2_CollaIdeCozman_ECML_PKDD06
No ratings yet
H2_CollaIdeCozman_ECML_PKDD06
8 pages
Chap1 - Introduction To Macro
No ratings yet
Chap1 - Introduction To Macro
27 pages
Chapter 2 Macrodata 2023 S
No ratings yet
Chapter 2 Macrodata 2023 S
77 pages
Applied Econometrics For HRM2021-23: Pcpadhan@xlri - Ac.in
No ratings yet
Applied Econometrics For HRM2021-23: Pcpadhan@xlri - Ac.in
22 pages
BDM Notes All Weeks
No ratings yet
BDM Notes All Weeks
68 pages
S1-2_BusStats_Intro (1)
No ratings yet
S1-2_BusStats_Intro (1)
28 pages
Chapter 1
No ratings yet
Chapter 1
37 pages
Lecture1StatDataf2016 PDF
No ratings yet
Lecture1StatDataf2016 PDF
54 pages
Econs Topic List Y11
No ratings yet
Econs Topic List Y11
3 pages
Business_Economics_-_Session_1_PPT_VImI2RkxYL
No ratings yet
Business_Economics_-_Session_1_PPT_VImI2RkxYL
49 pages
Eco All Sessions
No ratings yet
Eco All Sessions
295 pages
Hanke, John E. - Wichern, Dean W. - Business Forecasting
No ratings yet
Hanke, John E. - Wichern, Dean W. - Business Forecasting
45 pages
GDP Growth Determinants (CW Econometrics)
No ratings yet
GDP Growth Determinants (CW Econometrics)
9 pages
Banking Primer and Emerging Markets
No ratings yet
Banking Primer and Emerging Markets
44 pages
Big Data in Finance: Bin Fang and Peng Zhang
No ratings yet
Big Data in Finance: Bin Fang and Peng Zhang
22 pages
ch01 BOP
No ratings yet
ch01 BOP
36 pages
Assessment of Statistical Quality of Real Sector Data Categories in India
No ratings yet
Assessment of Statistical Quality of Real Sector Data Categories in India
38 pages
Google
No ratings yet
Google
19 pages
##Some Known Facts About Financial Data
No ratings yet
##Some Known Facts About Financial Data
13 pages
Trends and Patterns: How To Find Them and Can You Believe Them?
No ratings yet
Trends and Patterns: How To Find Them and Can You Believe Them?
20 pages
320Lecture 7 2024
No ratings yet
320Lecture 7 2024
27 pages
Statistics - Introduction Arranging Data
100% (2)
Statistics - Introduction Arranging Data
45 pages
L1 Introduction
No ratings yet
L1 Introduction
57 pages
Lecture Note - 10.11
No ratings yet
Lecture Note - 10.11
23 pages
Unit 1introduction To Report Writing
No ratings yet
Unit 1introduction To Report Writing
34 pages
L5 Data
No ratings yet
L5 Data
14 pages
Statistics For Economics
No ratings yet
Statistics For Economics
79 pages
Uribe y Schmitt Grohé slides_empirics
No ratings yet
Uribe y Schmitt Grohé slides_empirics
56 pages
Introduction To Statistics
No ratings yet
Introduction To Statistics
23 pages
Chapter
No ratings yet
Chapter
64 pages
Asset Rotation: The Demise of Modern Portfolio Theory and the Birth of an Investment Renaissance
From Everand
Asset Rotation: The Demise of Modern Portfolio Theory and the Birth of an Investment Renaissance
Matthew P. Erickson
No ratings yet
Metode Konduktometri - Compressed
No ratings yet
Metode Konduktometri - Compressed
5 pages
Mahesh Resume Word
No ratings yet
Mahesh Resume Word
3 pages
Fourth National Climate Outlook Forum (NCOF) May-June-July (MJJ) Wet Season
No ratings yet
Fourth National Climate Outlook Forum (NCOF) May-June-July (MJJ) Wet Season
34 pages
edTPA task 3 Assessment Commentary
No ratings yet
edTPA task 3 Assessment Commentary
8 pages
Evaluation of Armored Vehicles Flotation Ability
No ratings yet
Evaluation of Armored Vehicles Flotation Ability
9 pages
This Document Will Be Valid For 180 Days From Jun 15, 21 or Until It Has Been Incorporated in The EM, Whichever Occurs First.
No ratings yet
This Document Will Be Valid For 180 Days From Jun 15, 21 or Until It Has Been Incorporated in The EM, Whichever Occurs First.
5 pages
Revco Operation Manual
No ratings yet
Revco Operation Manual
15 pages
Eia - 1007 Full Study-Kamulu Nock Petrol Station
No ratings yet
Eia - 1007 Full Study-Kamulu Nock Petrol Station
72 pages
Yamaha R S700 R S500
No ratings yet
Yamaha R S700 R S500
73 pages
Basic Choreography and Kinesthetics
No ratings yet
Basic Choreography and Kinesthetics
5 pages
Activity 3 C
No ratings yet
Activity 3 C
4 pages
Ethics Lecture 1
100% (1)
Ethics Lecture 1
68 pages
Quiz 2 - Cost Accounting
No ratings yet
Quiz 2 - Cost Accounting
4 pages
Losh Diagram
No ratings yet
Losh Diagram
2 pages
Milestone Trend Analysis (MTA)
No ratings yet
Milestone Trend Analysis (MTA)
2 pages
Ch11 Energy Methods
No ratings yet
Ch11 Energy Methods
19 pages
PXA
No ratings yet
PXA
3 pages
Session 3 The Barangay Development Planning BDP and CapDev Agenda Formulation Process
No ratings yet
Session 3 The Barangay Development Planning BDP and CapDev Agenda Formulation Process
24 pages
USAMO-2024-notes
No ratings yet
USAMO-2024-notes
19 pages
Presentation - Open-High School
No ratings yet
Presentation - Open-High School
15 pages
Chapter 1
No ratings yet
Chapter 1
23 pages
Planning Projects 2. Scheduling Projects: Module-2
No ratings yet
Planning Projects 2. Scheduling Projects: Module-2
23 pages
Netbackup Error Codes Trouble Shoot
No ratings yet
Netbackup Error Codes Trouble Shoot
7 pages
Claim and Counterclaim Bellwork
No ratings yet
Claim and Counterclaim Bellwork
21 pages
Urban Hacking: The Versatile Forms of Cultural Resilience in Hong Kong
No ratings yet
Urban Hacking: The Versatile Forms of Cultural Resilience in Hong Kong
14 pages
DE ASS 1 Key
No ratings yet
DE ASS 1 Key
2 pages
Sap 18
No ratings yet
Sap 18
8 pages
Hall Effect in P-Germanium: L L L L
No ratings yet
Hall Effect in P-Germanium: L L L L
8 pages
Master Circular Savings Bank Account
No ratings yet
Master Circular Savings Bank Account
35 pages

Statistics For Economists: Lecturer: DR Omid Mazdak Email: Omid - Mazdak@kcl - Ac.uk

Uploaded by

Statistics For Economists: Lecturer: DR Omid Mazdak Email: Omid - Mazdak@kcl - Ac.uk

Uploaded by

Statistics for Economists

• Basic Concepts in Statistics

Essential purpose of statistics:

• A random variable (RV) is any variable whose value/outcome is

• A discrete random variable takes on only finite (or countably

• The Population: All possible observations from the variable(s)

Simple random sampling is a procedure in which

• each member of the population is chosen strictly by chance,

The resulting sample is called a random sample. Ideally, all

E.g. of cross sectional data:

0 50 100 150 200 250 300 350 400 450 500

Time-series UK GDP (1998 - 2019) Current US$ Trilions

and China GDP 0

The median: the numerical value corresponding to the

The mode: the value of the most frequent (most common)

Sample Arithmetic Mean: summing all values from all

Population Arithmetic Mean: summing all values from

•To find the median of a distribution:

Find the a) mean, b) median and c) mode of the variable X.

a) Mean = = (1 + 2 + 5 + 5 + 6+ 9 + 11 + 15)/8 = 6.75

b) Median = 0.5(n + 1) observations up from the bottom of the list. In this

c) Mode = 5 (since 5 is the most common observation)

Example – calculating the mean from grouped data

The 50th percentile (also known as the median):

A Box plot can be used to summarize key percentiles and the

• The (sample) variance The (population) variance

Sample Standard deviation (Population) Standard deviation

Sample coefficient of variation (Population) coefficient of variation

When the variance and

When the variance and

You might also like