Non-normality

Uploaded by

seokamilla

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views3 pages

Non-normality

Uploaded by

seokamilla

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

Non-normality

The linear regression model assumes that the error term ui is normally distributed.
This assumption is critical when the sample size is relatively small because the commonly
used significance tests, such as t and F , are based on the assumption of normality. The
OLS estimates are not affected by non-normality, the problem is with the inference.

Causes

Presence of extreme values or outliers

Insufficient data (There is no rule of thumb, but it is ideal to have more than 30
observations)

Two or more processes overlapping

Subset of the main sample

Data follows some other distribution

How can one detect the non-normality?

It is thus important that we check whether the error term is normally distributed.
There are two types of methods:

Informal methods: Normal Q-Q plot of residuals (in Gretl select the residual
variable, click on Variable and Normal Q-Q Plot. In Figure 1 you can see some
patterns in the normal Q-Q plot (b, c, d and e show a non-normal pattern while a
shows a normal pattern). Technically, a normal Q-Q plot compares the distribution
of a data set to the normal distribution. The line represents perfect quantile match-
ing. If the distributions were perfectly matched, all quantile points would lie on the
line. In Figure 2, since most of the points are on the line (except at the extremes),
there is no reason to worry about normality.

Formal methods: Tests of normality

– Doornik-Hansen test (χ2 −goodness of fit test for the normal distribution)
– Shapiro-Wilk test
– Lilliefors test (Kolmogorov-Smirnov test for the normal distribution)
– Jarque-Bera test

The first three tests are studied in the Statistics II course.

The Jarque-Bera test is a large sample test and may not be appropriate in small
samples. The test statistic is given by

S 2 (K − 3)2

JB = n + ∼ χ2 (2)
6 24
where n is the sample size, S = skewness coefficient, and K = kurtosis coefficient.
The null hypothesis is the joint hypothesis that S = 0 and K = 3 ⇔ Normal
distribution.
If the JBexp > χ22;1−α , we reject the null hypothesis that the error term is normally
distributed; otherwise, we cannot reject it. Obviously, we can also make the decision
with the p−value.

Possible solutions

Remove outliers

Increase the sample size

Take transformations of the variables (logarithms, squares, square roots...)

Consider other regression models that take into account the lack of normality (Gen-
eralized linear models, GLM)

And if we still have problems with the normality assumption, what do we do?

That is up to you. If you still find (minor) problems in the residuals after following all
the above recommendations, you have to decide how accurate you want your model to be.
George Box said: ”Basically all models are wrong, but some are useful”. So, in general, a
decent model is better than no model at all.
Figure 1: Patterns in a normal Q-Q plot

Figure 2: Example of a normal Q-Q plot

(The SAGE Quantitative Research Kit) Peter Martin - Linear Regression - An Introduction To Statistical Models-SAGE Publications (2022)
No ratings yet
(The SAGE Quantitative Research Kit) Peter Martin - Linear Regression - An Introduction To Statistical Models-SAGE Publications (2022)
201 pages
Naveen Kumar - Inferential Statistics
No ratings yet
Naveen Kumar - Inferential Statistics
14 pages
Stat 136 Chapter 10 Nonnormality and Heteroskedasticity
No ratings yet
Stat 136 Chapter 10 Nonnormality and Heteroskedasticity
49 pages
Ljung GM, Box GEP. 1978. On A Measure of Lack of Fit in Time Series Models. Biometrica. 65 (2) - 297-303. Doi-10.2307:2335207
No ratings yet
Ljung GM, Box GEP. 1978. On A Measure of Lack of Fit in Time Series Models. Biometrica. 65 (2) - 297-303. Doi-10.2307:2335207
8 pages
Some Thoughts About The Assumption of Normality
No ratings yet
Some Thoughts About The Assumption of Normality
21 pages
Testing For Normality Using SPSS PDF
100% (1)
Testing For Normality Using SPSS PDF
12 pages
Test For Normality PDF
No ratings yet
Test For Normality PDF
30 pages
2012-Assumption and Data Transformationnew
No ratings yet
2012-Assumption and Data Transformationnew
57 pages
Non Parametric Tests
No ratings yet
Non Parametric Tests
27 pages
00000chen - Linear Regression Analysis3
No ratings yet
00000chen - Linear Regression Analysis3
252 pages
Normality Test in Excel
No ratings yet
Normality Test in Excel
5 pages
Testing for Normality & Specification Error
No ratings yet
Testing for Normality & Specification Error
7 pages
7 OLS Assumptions
No ratings yet
7 OLS Assumptions
37 pages
Vii. Assumption of Analysis of Variance
No ratings yet
Vii. Assumption of Analysis of Variance
27 pages
Is Important Because:: TECH 6300 Introduction To Statistical Inference The Normal Distribution
100% (1)
Is Important Because:: TECH 6300 Introduction To Statistical Inference The Normal Distribution
19 pages
Jarque Bera PDF
No ratings yet
Jarque Bera PDF
5 pages
Linear Models Bias
No ratings yet
Linear Models Bias
17 pages
Application of SGT Family Distributions in QMLE
No ratings yet
Application of SGT Family Distributions in QMLE
22 pages
Unit 8
No ratings yet
Unit 8
17 pages
Statistical Analysis Using SPSS and R - Chapter 4 PDF
No ratings yet
Statistical Analysis Using SPSS and R - Chapter 4 PDF
106 pages
Testing For Assumptions (Listed in 6.2) of The Disturbance of The Population Regression
No ratings yet
Testing For Assumptions (Listed in 6.2) of The Disturbance of The Population Regression
10 pages
Tesis - Garcia Anchelia Rodolfo Manuel - Fpycf
No ratings yet
Tesis - Garcia Anchelia Rodolfo Manuel - Fpycf
84 pages
04 Assumptions
No ratings yet
04 Assumptions
53 pages
GoralYadav SMHomework2
No ratings yet
GoralYadav SMHomework2
9 pages
FALK 2010 Comparison of Common Tests For Normality
No ratings yet
FALK 2010 Comparison of Common Tests For Normality
103 pages
Cedlas Wp 178
No ratings yet
Cedlas Wp 178
13 pages
6. Assumption_16_oct18
No ratings yet
6. Assumption_16_oct18
48 pages
Solutions Stat CH 7
No ratings yet
Solutions Stat CH 7
6 pages
Set 4
No ratings yet
Set 4
29 pages
Aitkin
No ratings yet
Aitkin
9 pages
Da SMNR
No ratings yet
Da SMNR
32 pages
Measuring Relationship via Regression Analysis and Correlation-1
No ratings yet
Measuring Relationship via Regression Analysis and Correlation-1
18 pages
Normality
No ratings yet
Normality
3 pages
Testing of Normality: Peter Luk January 17, 2009
No ratings yet
Testing of Normality: Peter Luk January 17, 2009
20 pages
A Generalized Jarque-Bera Test of Conditional Normality: Yi-Ting Chen Chung-Ming Kuan
No ratings yet
A Generalized Jarque-Bera Test of Conditional Normality: Yi-Ting Chen Chung-Ming Kuan
12 pages
STATA Red Tutorial
100% (1)
STATA Red Tutorial
84 pages
Testing Normality Using R/R-Studio: Dean, FCM, BPSMV, Khanpur Kalan, Sonipat, Haryana
No ratings yet
Testing Normality Using R/R-Studio: Dean, FCM, BPSMV, Khanpur Kalan, Sonipat, Haryana
9 pages
Jarque and Bera 1987 - A Test For Normality of Observations and Regression Residuals PDF
No ratings yet
Jarque and Bera 1987 - A Test For Normality of Observations and Regression Residuals PDF
11 pages
Assumption Checking On Linear Regression
No ratings yet
Assumption Checking On Linear Regression
65 pages
Week 2 Lecture 1
No ratings yet
Week 2 Lecture 1
14 pages
1333355396testing For Normality Using SPSS
No ratings yet
1333355396testing For Normality Using SPSS
19 pages
FCDS - RA ch3 Sp21
No ratings yet
FCDS - RA ch3 Sp21
20 pages
Normality Test
No ratings yet
Normality Test
27 pages
Lec 5 - Normality Testing
No ratings yet
Lec 5 - Normality Testing
30 pages
Jarque -Bera Test
No ratings yet
Jarque -Bera Test
3 pages
slides 2 tema 2
No ratings yet
slides 2 tema 2
33 pages
PY1PR1 Stats Lecture 6 Handout
No ratings yet
PY1PR1 Stats Lecture 6 Handout
35 pages
Chapter 4 MLR
No ratings yet
Chapter 4 MLR
17 pages
JB Test
No ratings yet
JB Test
11 pages
Research Method
No ratings yet
Research Method
18 pages
3 Residual Analysis
No ratings yet
3 Residual Analysis
5 pages
Checking the normality of a dataset
No ratings yet
Checking the normality of a dataset
6 pages
Tests For Normality in Linear Panel-Data Models: 15, Number 3, Pp. 822-832
No ratings yet
Tests For Normality in Linear Panel-Data Models: 15, Number 3, Pp. 822-832
11 pages
Financial Market
No ratings yet
Financial Market
36 pages
Is Linear Regression Valid When The Outcome (Dependant Variable) Not Normally Distributed?
No ratings yet
Is Linear Regression Valid When The Outcome (Dependant Variable) Not Normally Distributed?
3 pages
Malnutrition in The World
No ratings yet
Malnutrition in The World
11 pages
Ho - Diagnostics Examples 2 in SPSS
No ratings yet
Ho - Diagnostics Examples 2 in SPSS
4 pages
Unit 2 Sesion 1 and 2
No ratings yet
Unit 2 Sesion 1 and 2
25 pages
Testing For Normality Using SPSS
No ratings yet
Testing For Normality Using SPSS
12 pages
Unit 1 Sesion 1-3
No ratings yet
Unit 1 Sesion 1-3
47 pages
Chapter 14 - Sampling
No ratings yet
Chapter 14 - Sampling
44 pages
Simultaneous Equations
No ratings yet
Simultaneous Equations
11 pages
Community Project: Checking Normality For Parametric Tests in R
No ratings yet
Community Project: Checking Normality For Parametric Tests in R
4 pages
Chapter 5.
No ratings yet
Chapter 5.
14 pages
Ijaerv15n6 12
No ratings yet
Ijaerv15n6 12
16 pages
non parametric tests 002 (1)
No ratings yet
non parametric tests 002 (1)
15 pages
Community Project: Checking Normality For Parametric Tests in SPSS
No ratings yet
Community Project: Checking Normality For Parametric Tests in SPSS
4 pages
transparencias slides
No ratings yet
transparencias slides
35 pages
Assignment 6 Answer
No ratings yet
Assignment 6 Answer
17 pages
Normality Checking 11 Ps
No ratings yet
Normality Checking 11 Ps
4 pages
Efbs Test1 2023 Memo Sem2
No ratings yet
Efbs Test1 2023 Memo Sem2
9 pages
305 - TC 508 Booklet
No ratings yet
305 - TC 508 Booklet
18 pages
L3 Intro OFAT
No ratings yet
L3 Intro OFAT
13 pages
Sec 8 1 Steps in Hypothesis Testing Traditional Method
No ratings yet
Sec 8 1 Steps in Hypothesis Testing Traditional Method
37 pages
SDSC3006_Assignment 1
No ratings yet
SDSC3006_Assignment 1
2 pages
DOE 5.1class Notes
No ratings yet
DOE 5.1class Notes
250 pages
Sample Exam - Indicators Exercise (1)
No ratings yet
Sample Exam - Indicators Exercise (1)
2 pages
James & McCulloch 1990
No ratings yet
James & McCulloch 1990
40 pages
Does Strategic Planning Improve Organizational Performance - A Meta-Analysis 2019
No ratings yet
Does Strategic Planning Improve Organizational Performance - A Meta-Analysis 2019
10 pages
Lind 10e Chap04
No ratings yet
Lind 10e Chap04
30 pages
Questions Chapter 4
No ratings yet
Questions Chapter 4
1 page
Introduction To Statistical Quality Control, 7th Edition by Douglas C. Montgomery. 1
No ratings yet
Introduction To Statistical Quality Control, 7th Edition by Douglas C. Montgomery. 1
42 pages
mgn343 Ca4
No ratings yet
mgn343 Ca4
9 pages
Lesson 2.2 MEASURES OF Central Tendency and Location
No ratings yet
Lesson 2.2 MEASURES OF Central Tendency and Location
24 pages
PPT-MSTT-Trip-Generation
No ratings yet
PPT-MSTT-Trip-Generation
20 pages
WGU C784 - APPLIED HEALTHCARE STATISTICS PRE- ASSESSMENT EXAM QUESTIONS AND ANSWERS UPDATED (2024 - 2025)
No ratings yet
WGU C784 - APPLIED HEALTHCARE STATISTICS PRE- ASSESSMENT EXAM QUESTIONS AND ANSWERS UPDATED (2024 - 2025)
15 pages
ANOVA and MANOVA: Statistics For Psychology
No ratings yet
ANOVA and MANOVA: Statistics For Psychology
34 pages
Regresión Lineal Jorge Andrés Grenett Munita Estadística Instituto IACC 10 de Diciembre 2017
No ratings yet
Regresión Lineal Jorge Andrés Grenett Munita Estadística Instituto IACC 10 de Diciembre 2017
5 pages
DONE QMB 2100.001 Basic Business Statistics - Practice Test #2
100% (1)
DONE QMB 2100.001 Basic Business Statistics - Practice Test #2
14 pages
Extra Unit 1. Sustainable Investment and Financing
No ratings yet
Extra Unit 1. Sustainable Investment and Financing
9 pages
DOX Exam2020 Comprehensive
No ratings yet
DOX Exam2020 Comprehensive
3 pages
5) DOE Design and Analysis Using Minitab
No ratings yet
5) DOE Design and Analysis Using Minitab
48 pages
(Normal Probability) (Discrete Probability) : Xy Xy X y
No ratings yet
(Normal Probability) (Discrete Probability) : Xy Xy X y
1 page
STAT1008 Cheat Sheet
100% (1)
STAT1008 Cheat Sheet
1 page
RIFTY VALLEY University Gada Campus Departement of Accounting
No ratings yet
RIFTY VALLEY University Gada Campus Departement of Accounting
9 pages
Statistics Q4-Summative
No ratings yet
Statistics Q4-Summative
7 pages
Estimation in Statistics
100% (1)
Estimation in Statistics
4 pages
Chi Squared for Beginners
From Everand
Chi Squared for Beginners
Stephanie Glen
No ratings yet
Learn Statistics Fast: A Simplified Detailed Version for Students
From Everand
Learn Statistics Fast: A Simplified Detailed Version for Students
Hesbon R.M
No ratings yet