Cheatsheet Part 2

Basic lvl2 Statistics


Linear equations:

B coefficient of the group variable = difference between the lines at X = 0 (group – reference)
B coefficient of the x variable = slope of the reference line
B coefficient of the interaction = difference in slopes (group – reference). On a plot: start where the lines cross, move 1 to the right, and read the difference between the lines (group – reference)
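
A minimal sketch of how these three B coefficients can be read from a fitted model in R. The variable names (x, group, y) and the numbers are made up for illustration, and the data are noise-free so the estimates equal the values used to build y:

```r
# Simulated, noise-free data; all names and values are illustrative only
x     <- rep(0:9, times = 2)
group <- rep(c(0, 1), each = 10)      # 0 = reference, 1 = group
y     <- 2 + 0.5 * x + 3 * group + 1.5 * x * group

fit <- lm(y ~ x * group)              # main effects plus the x:group interaction
b   <- coef(fit)

b["group"]    # gap between the lines at x = 0 (group - reference)
b["x"]        # slope of the reference line
b["x:group"]  # change in the gap per 1-unit step in x (difference in slopes)
```

Dropping the interaction (y ~ x + group) would force both lines to share one slope, which is the additive model.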

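Addition = both variables independently affect the dependent variable

Interaction = one variable changes the relationship between another independent variable and the dependent variable. With a dichotomous variable, code the category with the lowest expected effect as 0 (the reference)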

Residuals should show: overall normality (check residuals against predicted values; they should be uncorrelated) and constant variance (scatterplot of each variable against the residuals; the points should fill a box, not a funnel)

If residuals are problematic: 1) Change/reconceptualize variables, e.g. take the log of a variable. 2) Change the model or include extra variables (a missing one is called an omitted variable)

Plotted against the observed Y, residuals are negative where Y is low, positive where Y is high, and a mix of negative and positive in the middle. That pattern always appears, so it says nothing about the model. Judge the residuals against the predicted values of Y, not against Y itself.
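
A small sketch of why the predicted values, not the observed Y, are the right axis. The data are simulated and the names (x, y, model) are placeholders:

```r
set.seed(1)
x <- runif(100, 0, 10)
y <- 1 + 2 * x + rnorm(100, sd = 3)   # simulated data, illustrative only
model <- lm(y ~ x)

res1  <- model$residuals
pred1 <- model$fitted.values

cor(res1, pred1)  # essentially 0: least squares makes these uncorrelated
cor(res1, y)      # positive: low y goes with negative residuals, high y with positive
```

Any remaining pattern in a residuals-vs-predicted plot therefore points at the model rather than at this built-in relationship.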

Non-linearity = when the line bends slightly over time. Parabolic = when the line bends all the way back to its starting level over time (quadratic).
Logarithmic = the line flattens and does not bend back. To remove non-linearity: use a squared term or the sqrt of Y
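
One way to try the squared-term fix in R, sketched on simulated parabolic data (all names and numbers are assumptions for illustration):

```r
set.seed(2)
x <- seq(0, 10, length.out = 100)
y <- 1 + 2 * x - 0.3 * x^2 + rnorm(100, sd = 0.5)   # bends back down: parabolic

linear    <- lm(y ~ x)
quadratic <- lm(y ~ x + I(x^2))   # I() keeps ^ as arithmetic inside a formula

summary(linear)$r.squared
summary(quadratic)$r.squared      # higher when the curve really is parabolic
coef(quadratic)["I(x^2)"]         # negative here: the line bends downward
```

For a curve that only flattens, transforming Y (for example lm(sqrt(y) ~ x) when y is non-negative) is the other option named above.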

If a model is good, errors will be random

Discrepancies = the differences between what you would expect to find had the data been normally distributed and what you actually see in the dataset

Shapiro-Wilk = W: ranges between 0 and 1; 1 means perfect normality. The p-value tells you how (un)likely such a W is if the data come from a normal distribution. Null hypothesis = the population is normally distributed. P-value below 0.05: the null hypothesis is rejected = not a normal distribution.
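
A short sketch with shapiro.test() from base R, run on two simulated samples (the sample names are made up):

```r
set.seed(3)
normal_sample <- rnorm(200)   # drawn from a normal distribution
skewed_sample <- rexp(200)    # strongly right-skewed

sw_normal <- shapiro.test(normal_sample)
sw_skewed <- shapiro.test(skewed_sample)

sw_normal$statistic; sw_normal$p.value
sw_skewed$statistic; sw_skewed$p.value   # lower W, p below 0.05: reject normality

# For a regression model the same call is applied to the residuals:
# shapiro.test(residuals(model))
```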

Homoscedasticity = homogeneity (of variances) = equal variance

Heteroscedasticity = heterogeneity (of variance) = unequal variance

Levene's test = mainly for groups: use it to detect whether the error variance in one group differs from that in another. Null hypothesis = equal variances.
Disadvantage: with a big sample the test is almost always significant, even when the unequal variances will not really affect the s.e. estimates.
Breusch-Pagan test = linear models: tests whether the residuals are associated with one or more variables. If there is an association = no homogeneity. Null hypothesis = homogeneity. P-value above 0.05 = equal variance in residuals
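
Both tests can be sketched in base R to show what they compute. This is a hand-rolled illustration on simulated data, not the packaged implementation: the car and lmtest packages provide leveneTest() and bptest() for real use, and the median-centred Levene variant and the studentized Breusch-Pagan form below are choices made for this sketch:

```r
set.seed(4)

# Levene-type test: ANOVA on absolute deviations from each group's median
g <- factor(rep(c("a", "b"), each = 100))
y <- c(rnorm(100, sd = 1), rnorm(100, sd = 3))      # group b has more spread
abs_dev  <- abs(y - ave(y, g, FUN = median))
levene_p <- anova(lm(abs_dev ~ g))[["Pr(>F)"]][1]

# Breusch-Pagan-type test: regress squared residuals on the predictor
x  <- seq(1, 10, length.out = 200)
y2 <- 1 + 2 * x + rnorm(200, sd = 0.5 * x)          # spread grows with x
model   <- lm(y2 ~ x)
aux     <- lm(residuals(model)^2 ~ x)
lm_stat <- length(x) * summary(aux)$r.squared       # n * R^2 of the auxiliary model
bp_p    <- pchisq(lm_stat, df = 1, lower.tail = FALSE)

c(levene = levene_p, breusch_pagan = bp_p)          # both small: unequal variance
```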

Residual = extent to which a point is away from estimated line/model, Leverage = outlier on independent (x)
Influence = extent to which slope of line is affected by datapoint (= high residual + high leverage)

Cook's distance: bigger than 1 = problematic (the point falls outside the Cook's distance lines in the residuals-vs-leverage plot).
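
A sketch of the three diagnostics in base R, using simulated data with one deliberately planted bad point (point 21; all values are invented for illustration):

```r
set.seed(5)
x <- c(1:20, 40)                                    # point 21: far out on x (high leverage)
y <- c(2 + 0.5 * (1:20) + rnorm(20, sd = 0.5), 0)   # and far below the line (large residual)
model <- lm(y ~ x)

diagnostics <- data.frame(
  residual = residuals(model),
  leverage = hatvalues(model),
  cooks_d  = cooks.distance(model)
)
diagnostics[diagnostics$cooks_d > 1, ]   # points flagged by the "bigger than 1" rule

# plot(model, 5) draws residuals vs leverage with the Cook's distance lines
```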

Deterministic bivariate = y is fully explained by one variable. Probabilistic = y is NOT fully explained by one variable.
Calculate N = (critical_value^2 × p × (1 − p)) / margin_of_error^2; if you don't know p, take 0.5 (the most conservative choice)
SE = √(p × (1 − p) / sample size)
CI = ESTIMATE ± MARGIN OF ERROR, MARGIN OF ERROR ≈ 2 × sd of pop or SE of sample (1.96 × SE at 95% confidence)

1) They give the margin of error (moe):

p =
moe =
nanswer = (1.96^2*p*(1-p))/moe^2

2) They don't give the margin of error:

nquestion =
p =

Don't touch this part:

se = sqrt(p*(1-p))/sqrt(nquestion)
lower = p-1.96*se
upper = p+1.96*se
moe = 1.96*se/2   # half of the current margin of error (1.96*se)
nanswer = (1.96^2*p*(1-p))/moe^2
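
A worked run of both templates with made-up inputs (p = 0.5, moe = 0.05, nquestion = 400 are assumptions, not values from any exercise). Case 2 is read here as "what sample size halves the current margin of error", which is what the /2 in the template computes:

```r
# Case 1: margin of error given (hypothetical values)
p   <- 0.5
moe <- 0.05
n_case1 <- ceiling((1.96^2 * p * (1 - p)) / moe^2)   # round up to whole respondents

# Case 2: current sample size given; find n that halves the margin of error
nquestion <- 400
se        <- sqrt(p * (1 - p)) / sqrt(nquestion)
lower     <- p - 1.96 * se                            # current 95% CI
upper     <- p + 1.96 * se
moe_half  <- 1.96 * se / 2
n_case2   <- (1.96^2 * p * (1 - p)) / moe_half^2

c(n_case1 = n_case1, n_case2 = n_case2)
```

Halving the margin of error always needs four times the sample, because n grows with 1/moe^2.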

Non-parametric tests:

Kruskal-Wallis test: three or more independent groups -> determines whether the medians of the groups differ significantly
Mann-Whitney-Wilcoxon test = Wilcoxon rank-sum test: two independent groups with ordinal or continuous data
Wilcoxon signed-rank test: two paired groups with ordinal or continuous data
Sign test: used to determine whether the median of a sample differs from a known hypothesized value
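
The four tests above map onto base R roughly as follows. The samples are simulated, the names are placeholders, and the sign test is written with binom.test() because base R has no dedicated sign-test function:

```r
set.seed(6)
a  <- rnorm(30, mean = 0)
b  <- rnorm(30, mean = 1)
c3 <- rnorm(30, mean = 2)

kw   <- kruskal.test(list(a, b, c3))        # three or more independent groups
mww  <- wilcox.test(a, b)                   # two independent groups (rank sum)
wsr  <- wilcox.test(a, b, paired = TRUE)    # treats a[i] and b[i] as pairs (signed rank)

m0   <- 0.5                                 # hypothesized median, chosen for illustration
sign <- binom.test(sum(a > m0), sum(a != m0), p = 0.5)   # sign test on sample a

c(kruskal = kw$p.value, rank_sum = mww$p.value,
  signed_rank = wsr$p.value, sign = sign$p.value)
```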

R:
library(tidyverse)  # provides %>% and ggplot()
# adding residuals and predicted values
dataname$res1 <- model$residuals
dataname$pred1 <- model$fitted.values
# plot residuals against predicted values (or use plot(model, 1))
dataname %>%
  ggplot(aes(x = pred1, y = res1)) + geom_...

R packages: tidyverse, broom, modelr, car, lmtest, haven, dplyr
