Regression hw3

The document discusses various regression analyses, highlighting the best models for predicting costs and salaries based on different predictors. It emphasizes the significance of variables like PAPER and MACHINE in cost prediction and GENDER in salary analysis, while also addressing model selection techniques such as stepwise regression. Additionally, it presents the results of various statistical tests and models, including coefficients and R-squared values.

Uploaded by

詠芯謝

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views

Regression hw3

Uploaded by

詠芯謝

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

Regression hw3

1.(a)The best model should contain PAPER and MACHINE as the predictors,
which gives the smallest AIC(210.8477).
(b)First step, we compare COST~ MACHINE, COST~PAPER, COST~OVERHEAD
and COST~LABOR. We found that COST~MACHINE is the best.
Second step, we compare COST~ MACHINE + PAPER, COST~ MACHINE +
OVERHEAD and COST~ MACHINE + LABOR. We find the best is COST~
MACHINE + PAPER.
Third step, we compare COST~ MACHINE + PAPER, COST~ MACHINE + PAPER
+ OVERHEAD and COST~ MACHINE + PAPER + LABOR. The best model remain
the same which is COST~ MACHINE + PAPER, therefore, the procedure will be
stop.
(c) COST = 59.432 + 0.949(PAPER)+2.386(MACHINE)

(d) R2 = 0.9987, adjusted R2 = 0.9986, residual standard error = 10.98

(e) The variables chosen are the same included in the final regression model for parts
(a) and(b).

2.(a) Using the all-possible regression technique, when there is a large number of candidate -
X variable, this approach may not be poetically feasible, because of the computational time.
Therefore, we would like to choose stepwise regression.
(b) PROD, FOV and HOUSE, are included in the final Model because they are significant
with SALES.
(c)

3(a) The SALARY is expected to increase 579.76 units for every unit increase in GENDER by 1,
keeping the YEARS, POSITION and EDUCAT constant.
(b) The residual degrees of freedom are d.f.= 47-5-1= 41
(d)

3
a
dataset <- read.csv("hwk3q3.csv")
model <- lm(SALARY ~ YEARS + as.factor(POSITION) + as.factor(EDUCAT)
+ as.factor(GENDER), data = dataset)
summary(model)

##
## Call:
## lm(formula = SALARY ~ YEARS + as.factor(POSITION) +
as.factor(EDUCAT) +
## as.factor(GENDER), data = dataset)
##
## Residuals:
## Min 1Q Median 3Q Max
## -1410.3 -204.5 -103.4 230.3 752.1
##
## Coefficients: (1 not defined because of singularities)
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 1320.86 411.76 3.208 0.00272 **
## YEARS 20.38 41.65 0.489 0.62736
## as.factor(POSITION)2 186.91 479.54 0.390 0.69889
## as.factor(POSITION)3 -223.54 409.34 -0.546 0.58820
## as.factor(POSITION)4 1437.47 521.08 2.759 0.00888 **
## as.factor(POSITION)5 2301.07 518.38 4.439 7.52e-05 ***
## as.factor(EDUCAT)2 133.16 321.02 0.415 0.68063
## as.factor(EDUCAT)3 -685.85 477.76 -1.436 0.15932
## as.factor(EDUCAT)4 NA NA NA NA
## as.factor(GENDER)1 231.36 338.49 0.684 0.49842
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 495 on 38 degrees of freedom
## Multiple R-squared: 0.7504, Adjusted R-squared: 0.6979
## F-statistic: 14.28 on 8 and 38 DF, p-value: 2.407e-09

The estimated coefficient of GENDER indicates that the average expected monthly salary of
men is 231.36 units higher than that of women in the same situation

b
The remaining degrees of freedom of the model can be 47-10=37, which does not match the
R output

c
which(dataset$POSITION == 4 | dataset$POSITION == 5)

## [1] 4 7 8 10 15 16 20 21 24 26 30 33 34 35 41 42 43 45 46 47

which(dataset$EDUCAT == 3 | dataset$EDUCAT == 4)

## [1] 4 7 8 10 15 16 20 21 24 26 30 33 34 35 41 42 43 45 46 47

Data points with “POSITION” values of 4 or 5 and “EDUCAT” values of 3 or 4 appear to be

the same, suggesting that these higher positions and education levels correspond to the
same group of individuals in the data set.

d
full_model <- lm(SALARY ~ YEARS + as.factor(POSITION) +
as.factor(EDUCAT) + as.factor(GENDER), data = dataset)
reduced_model <- lm(SALARY ~ YEARS + as.factor(POSITION) +
as.factor(EDUCAT), data = dataset)
anova(reduced_model, full_model)

## Analysis of Variance Table

##
## Model 1: SALARY ~ YEARS + as.factor(POSITION) + as.factor(EDUCAT)
## Model 2: SALARY ~ YEARS + as.factor(POSITION) + as.factor(EDUCAT)
+ as.factor(GENDER)
## Res.Df RSS Df Sum of Sq F Pr(>F)
## 1 39 9424448
## 2 38 9309984 1 114464 0.4672 0.4984

The results of the partial F test comparing the simplified model (excluding gender) and the
full model (including gender) provide a P-value of 0.4984 for the inclusion of the gender
variable. This P-value is much higher than the common significance level of 0.05, suggesting
that adding a gender variable to the model does not significantly improve the model’s ability
to explain wage differences among BigTex Services employees, that is, gender does not
statistically significantly explain wage differences among employees in the provided dataset.

4(a)
LungCap = 1.05157 + (0.55823-0.0597)(Age) + 0.22601 (Smokeyes)
LungCap = 1.05157 + 0.55823(Age)

Week 3
100% (2)
Week 3
10 pages
Homework 5
No ratings yet
Homework 5
5 pages
PDF
No ratings yet
PDF
9 pages
Bachelor of Science in Statistics Curriculum
No ratings yet
Bachelor of Science in Statistics Curriculum
1 page
Assignment 3
No ratings yet
Assignment 3
10 pages
Problem Set
No ratings yet
Problem Set
8 pages
Categorical Predictor S
No ratings yet
Categorical Predictor S
41 pages
Mid-term-test-2021 2911
No ratings yet
Mid-term-test-2021 2911
5 pages
Mid-term-test-2021_2911
No ratings yet
Mid-term-test-2021_2911
5 pages
Regression in R
No ratings yet
Regression in R
40 pages
Centeno - Alexander PSET2 LBYMET2 Final
No ratings yet
Centeno - Alexander PSET2 LBYMET2 Final
11 pages
Econometrics Trial exam 1
No ratings yet
Econometrics Trial exam 1
15 pages
Ecotrix Assignment
No ratings yet
Ecotrix Assignment
5 pages
Term Paper Sample PDF
No ratings yet
Term Paper Sample PDF
10 pages
Multicollinearity and Oaxaca -Tutorial
No ratings yet
Multicollinearity and Oaxaca -Tutorial
35 pages
PS3 Stata
No ratings yet
PS3 Stata
3 pages
Dummy Variable With Regression
No ratings yet
Dummy Variable With Regression
3 pages
Text On Class
No ratings yet
Text On Class
18 pages
Heckman Selection Model
No ratings yet
Heckman Selection Model
9 pages
ansprac2
No ratings yet
ansprac2
6 pages
Statistics Econometrics Exam Feb
No ratings yet
Statistics Econometrics Exam Feb
8 pages
27.12.10h15 KTLTC De-1
No ratings yet
27.12.10h15 KTLTC De-1
6 pages
Generalized Additive Model
No ratings yet
Generalized Additive Model
10 pages
STA108HW4-1
No ratings yet
STA108HW4-1
5 pages
Problem-Set - 1 Practise Problems From Textbook
No ratings yet
Problem-Set - 1 Practise Problems From Textbook
2 pages
Regn_lect_5
No ratings yet
Regn_lect_5
9 pages
Lecture 01
No ratings yet
Lecture 01
26 pages
Assignment 2
No ratings yet
Assignment 2
5 pages
In-Semester Test - Proposed Solutions
No ratings yet
In-Semester Test - Proposed Solutions
6 pages
ps5 Fall+2015
No ratings yet
ps5 Fall+2015
9 pages
Document (1)
No ratings yet
Document (1)
4 pages
Discrete Choice Modeling: William Greene Stern School of Business New York University
No ratings yet
Discrete Choice Modeling: William Greene Stern School of Business New York University
58 pages
PS2
No ratings yet
PS2
2 pages
BDA MSC It
No ratings yet
BDA MSC It
35 pages
Text - On - Class Econometrics
No ratings yet
Text - On - Class Econometrics
17 pages
Homework4: Jiawei Li Sahil Bhagat Shahrzad Baraeinezhad Input Data
No ratings yet
Homework4: Jiawei Li Sahil Bhagat Shahrzad Baraeinezhad Input Data
13 pages
Solutions Week 10
No ratings yet
Solutions Week 10
7 pages
1
No ratings yet
1
9 pages
Im ch01
No ratings yet
Im ch01
11 pages
CJ Econometrics
No ratings yet
CJ Econometrics
6 pages
Additional Problem Set Units I and II
No ratings yet
Additional Problem Set Units I and II
8 pages
Presentación Modelo 4
No ratings yet
Presentación Modelo 4
27 pages
Instrumental Variable Estimation 2: Implementation in R: Instructor: Yuta Toyama Last Updated: 2021-05-18
No ratings yet
Instrumental Variable Estimation 2: Implementation in R: Instructor: Yuta Toyama Last Updated: 2021-05-18
34 pages
ps1 Build
No ratings yet
ps1 Build
4 pages
CS1B April 2024
No ratings yet
CS1B April 2024
9 pages
Michael Joseph-Introductory Econometrics
No ratings yet
Michael Joseph-Introductory Econometrics
8 pages
Practicefinalsolutions
No ratings yet
Practicefinalsolutions
7 pages
Chapter 9
No ratings yet
Chapter 9
38 pages
Y F (X, Z) : Regression Statistics
No ratings yet
Y F (X, Z) : Regression Statistics
12 pages
Homework 3
No ratings yet
Homework 3
10 pages
Introductory Econometrics A Modern Approach 4th Edition Wooldridge Solutions Manual - Quickly Download For The Best Reading Experience
100% (3)
Introductory Econometrics A Modern Approach 4th Edition Wooldridge Solutions Manual - Quickly Download For The Best Reading Experience
49 pages
ETC1010 S12015 Solution Part 1
No ratings yet
ETC1010 S12015 Solution Part 1
7 pages
Homework 2
100% (1)
Homework 2
12 pages
Quiz 1 - Econometrics 2
No ratings yet
Quiz 1 - Econometrics 2
8 pages
Bayes Regression
No ratings yet
Bayes Regression
16 pages
Pbset1 Dofile
No ratings yet
Pbset1 Dofile
3 pages
Econometrics Assignment HW4
No ratings yet
Econometrics Assignment HW4
8 pages
Section: - This Is An Open-Book and Open-Note Test. However, Sharing of Material Is NOT Permitted
No ratings yet
Section: - This Is An Open-Book and Open-Note Test. However, Sharing of Material Is NOT Permitted
9 pages
Enjoy immediate access to the full Introductory Econometrics A Modern Approach 4th Edition Wooldridge Solutions Manual in PDF.
100% (11)
Enjoy immediate access to the full Introductory Econometrics A Modern Approach 4th Edition Wooldridge Solutions Manual in PDF.
48 pages
The Essential R Reference
From Everand
The Essential R Reference
Mark Gardener
No ratings yet
Advanced C Concepts and Programming: First Edition
From Everand
Advanced C Concepts and Programming: First Edition
Gayatri
3/5 (1)
Introduction to PHP, Part 2, Second Edition
From Everand
Introduction to PHP, Part 2, Second Edition
Adam Majczak
No ratings yet
Oracle Certified Professional Java Programmer OCPJP 1Z0 809
From Everand
Oracle Certified Professional Java Programmer OCPJP 1Z0 809
Manish Soni
No ratings yet
L2A-Multiple Regression a 2022-03-01 15-52-48
No ratings yet
L2A-Multiple Regression a 2022-03-01 15-52-48
25 pages
T2B-Tutorial Problem
No ratings yet
T2B-Tutorial Problem
2 pages
L2D-Multiple Regression D 2022-03-03 21_20_03
No ratings yet
L2D-Multiple Regression D 2022-03-03 21_20_03
31 pages
L2C-Multiple Regression C 2022-03-03 21_20_04
No ratings yet
L2C-Multiple Regression C 2022-03-03 21_20_04
24 pages
A2 copy 2
No ratings yet
A2 copy 2
8 pages
L2B-Multiple Regression B 2022-03-02 08_50_53 2022-03-03 21_20_02
No ratings yet
L2B-Multiple Regression B 2022-03-02 08_50_53 2022-03-03 21_20_02
23 pages
MS4226 Project Progress Report
No ratings yet
MS4226 Project Progress Report
3 pages
ch08_money_mortgage
No ratings yet
ch08_money_mortgage
52 pages
CB3044 Midterm Ch6 Answer.docx
No ratings yet
CB3044 Midterm Ch6 Answer.docx
10 pages
Chapter2_2024
No ratings yet
Chapter2_2024
66 pages
Chapter1_2024
No ratings yet
Chapter1_2024
94 pages
ch09_banking_mutual
No ratings yet
ch09_banking_mutual
52 pages
Lecture 7 Examples
No ratings yet
Lecture 7 Examples
24 pages
Group Assignment
No ratings yet
Group Assignment
7 pages
LESSON 7: Non-Parametric Statistics: Tests of Association & Test of Homogeneity
No ratings yet
LESSON 7: Non-Parametric Statistics: Tests of Association & Test of Homogeneity
21 pages
Practical Research
No ratings yet
Practical Research
25 pages
Module 1 Lesson 1.1 From AAS
No ratings yet
Module 1 Lesson 1.1 From AAS
36 pages
AP Stats 8.1 Practice Quiz
No ratings yet
AP Stats 8.1 Practice Quiz
4 pages
A Study On The Factor of Student Absenteeism at Faculty of Business
No ratings yet
A Study On The Factor of Student Absenteeism at Faculty of Business
15 pages
Lecture Note On Statistics For Physical
No ratings yet
Lecture Note On Statistics For Physical
95 pages
Hypothesis Testing Statistics
No ratings yet
Hypothesis Testing Statistics
59 pages
9 Regression Analysis
No ratings yet
9 Regression Analysis
38 pages
Explanatory Sequential and Exploratory Sequential
No ratings yet
Explanatory Sequential and Exploratory Sequential
35 pages
FORM_TWO__SCHEME_term_2
No ratings yet
FORM_TWO__SCHEME_term_2
10 pages
Legume Legacy TCD
No ratings yet
Legume Legacy TCD
5 pages
Biostatistics for the Biological and Health Sciences 2nd Edition Triola Test Bank download
No ratings yet
Biostatistics for the Biological and Health Sciences 2nd Edition Triola Test Bank download
35 pages
SSC CHSL Syllabus
No ratings yet
SSC CHSL Syllabus
4 pages
Guidelines For Reporting Outcomes in Trial Reports The CONSORT-Outcomes 2022 Extension
No ratings yet
Guidelines For Reporting Outcomes in Trial Reports The CONSORT-Outcomes 2022 Extension
13 pages
Recruitment Brochure 2023-24
No ratings yet
Recruitment Brochure 2023-24
26 pages
EPGP in Data Science Gen AI PDF
No ratings yet
EPGP in Data Science Gen AI PDF
63 pages
Kud ConsumerBehaviour PDF
No ratings yet
Kud ConsumerBehaviour PDF
93 pages
Concept Paper: General Objective
No ratings yet
Concept Paper: General Objective
6 pages
Statistical Method Book For Lectures
No ratings yet
Statistical Method Book For Lectures
348 pages
3rd Quarterly Exam in PR2 (REVIEWER) - 2nd
No ratings yet
3rd Quarterly Exam in PR2 (REVIEWER) - 2nd
101 pages
Data Collection
No ratings yet
Data Collection
112 pages
Exercises chap 9
No ratings yet
Exercises chap 9
4 pages
AAA ECO 3772 20 REPORT
No ratings yet
AAA ECO 3772 20 REPORT
40 pages
GMD 7 1247 2014
No ratings yet
GMD 7 1247 2014
5 pages
Sant Gadge Baba Amravati University: B.Sc. Part-I, Semester-I Examination of Summer-2017
No ratings yet
Sant Gadge Baba Amravati University: B.Sc. Part-I, Semester-I Examination of Summer-2017
31 pages
Analysis of Variance (ANOVA)
No ratings yet
Analysis of Variance (ANOVA)
45 pages
Fear and Loathing of Math
No ratings yet
Fear and Loathing of Math
2 pages

Regression hw3

Uploaded by

Regression hw3

Uploaded by

Regression hw3

(d) R2 = 0.9987, adjusted R2 = 0.9986, residual standard error = 10.98

Data points with “POSITION” values of 4 or 5 and “EDUCAT” values of 3 or 4 appear to be

## Analysis of Variance Table

You might also like