Psy/Educ 6600: Unit 2 Homework

Groundwork for Inference

Your Name
Spring 2018

Contents
Chapter 1. DATA PREPARATION
  Load Packages
  Import Data, Define Factors, and Compute New Variables

Chapter 5. Intro to Hypothesis Testing: 1 Sample z-Test
  5C-3. 1 Sample z-Test compared to historic controls for mathquiz and statquiz
  5C-4. Test for Normality for mathquiz and statquiz
    Skewness and Kurtosis
    Shapiro-Wilk’s Test
    Create Histograms
    Create QQ Plots

Chapter 6. Confidence Interval Estimation: The t Distribution
  6C-1. 1-sample t-tests for anx_base, anx_pre, and anx_post
  6C-2. 1-sample t-tests for hr_base among MEN
  6C-3. 1-sample t-tests for hr_post among FEMALE

Chapter 7. Independent Samples t-Test for Means
  7C-1. Independent Samples t-Test for Mean hr_base by genderF
    Assumption Check: Homogeneity of Variance
    Perform the t-Test for Means in 2 Indep Groups
  7C-5. Independent Samples t-Test for Mean hr_post by coffeeF
    Assumption Check: Homogeneity of Variance
    Perform the t-Test for Means in 2 Indep Groups

Chapter 1. DATA PREPARATION

Load Packages

• Make sure the packages are installed (Package tab)

library(tidyverse) # Loads several very helpful 'tidy' packages
library(readxl) # Read in Excel datasets
library(furniture) # Nice tables (by our own Tyson Barrett)
library(psych) # Lots of nice tid-bits
library(car) # Companion to "Applied Regression"

Import Data, Define Factors, and Compute New Variables

• Make sure the dataset is saved in the same folder as this file
• Make sure that folder is the working directory
NOTE: I added the second line to convert all the variable names to lower case. I still kept the
F as a capital letter at the end of the five factor variables.
data_clean <- read_excel("Ihno_dataset.xls") %>%
  dplyr::rename_all(tolower) %>%   # lower-case every variable name
  dplyr::mutate(genderF = factor(gender,
                                 levels = c(1, 2),
                                 labels = c("Female",
                                            "Male"))) %>%
  dplyr::mutate(majorF = factor(major,
                                levels = c(1, 2, 3, 4, 5),
                                labels = c("Psychology",
                                           "Premed",
                                           "Biology",
                                           "Sociology",
                                           "Economics"))) %>%
  dplyr::mutate(reasonF = factor(reason,
                                 levels = c(1, 2, 3),
                                 labels = c("Program requirement",
                                            "Personal interest",
                                            "Advisor recommendation"))) %>%
  dplyr::mutate(exp_condF = factor(exp_cond,
                                   levels = c(1, 2, 3, 4),
                                   labels = c("Easy",
                                              "Moderate",
                                              "Difficult",
                                              "Impossible"))) %>%
  dplyr::mutate(coffeeF = factor(coffee,
                                 levels = c(0, 1),
                                 labels = c("Not a regular coffee drinker",
                                            "Regularly drinks coffee"))) %>%
  dplyr::mutate(hr_base_bps = hr_base / 60) %>%
  # rowsums() and rowmeans() come from the furniture package
  dplyr::mutate(anx_plus = rowsums(anx_base, anx_pre, anx_post)) %>%
  # pass the columns separately: adding them first would return the sum, not the mean
  dplyr::mutate(hr_avg = rowmeans(hr_base, hr_pre, hr_post)) %>%
  dplyr::mutate(statDiff = statquiz - exp_sqz)
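
As a small illustration of the two ideas used in this step (labelling a numeric code with factor() and combining several columns row by row), here is a sketch in base R. The data frame `toy` and its values are made up for this example and are not taken from the Ihno dataset.

```r
# Hypothetical toy data: three students with made-up codes and heart rates
toy <- data.frame(gender  = c(1, 2, 2),
                  hr_base = c(70, 66, 74),
                  hr_pre  = c(72, 70, 78),
                  hr_post = c(74, 68, 82))

# factor() pairs each numeric level with the label in the same position
toy$genderF <- factor(toy$gender,
                      levels = c(1, 2),
                      labels = c("Female", "Male"))

# A row-wise mean needs the columns kept separate ...
toy$hr_avg <- rowMeans(toy[, c("hr_base", "hr_pre", "hr_post")])

# ... whereas adding the columns gives the row-wise sum
toy$hr_sum <- toy$hr_base + toy$hr_pre + toy$hr_post

print(toy)
```

Comparing hr_avg with hr_sum in the printed output is a quick way to confirm that an "average" variable really is an average.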

Chapter 5. Intro to Hypothesis Testing: 1 Sample z-Test

5C-3. 1 Sample z-Test compared to historic controls for mathquiz and statquiz

TEXTBOOK QUESTION: (A) In the past 10 years, previous stats classes who took the same math quiz
that Ihno’s students took averaged 28 with a standard deviation of 8.5. What is the two-tailed p value
for Ihno’s students with respect to that past population? (Don’t forget that the N for mathquiz is not 100.)
Would you say that Ihno’s class performed significantly better than previous classes? Explain. (B) Redo
part A assuming that the same previous classes had also taken the same statquiz and averaged 6.1 with a
standard deviation of 2.5.
DIRECTIONS: Find the mean (M) and sample size (n) for mathquiz and statquiz and then work the
rest of the statistical test by hand in the printed homework packet.
NOTE: The furniture::table1() function gives the mean, but it only gives the
total n for all variables. Since some students were missing the math quiz but not the stat quiz,
the sample sizes are different. So use the psych::describe() function to get the mean and the
sample size for each variable.
# Find the mean and n for: mathquiz, statquiz
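
To make the hand calculation concrete, the sketch below runs the one-sample z-test arithmetic in base R. The vector `scores` is hypothetical (it is not the course data), and the population values 28 and 8.5 are the ones quoted in part A of the textbook question.

```r
# Hypothetical quiz scores (NOT the course dataset); NA marks a missing quiz
scores <- c(30, 25, NA, 35, 28, 32, NA, 27, 31, 24)

n <- sum(!is.na(scores))          # sample size counts only non-missing values
M <- mean(scores, na.rm = TRUE)   # sample mean, ignoring the missing values

mu    <- 28    # population mean from the textbook question
sigma <- 8.5   # population standard deviation from the textbook question

z <- (M - mu) / (sigma / sqrt(n))   # one-sample z statistic
p <- 2 * pnorm(-abs(z))             # two-tailed p value

c(n = n, M = M, z = z, p = p)
```

Because n counts only the non-missing scores, the standard error sigma / sqrt(n) changes whenever a variable has missing values, which is why the n for each variable matters.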

5C-4. Test for Normality for mathquiz and statquiz

TEXTBOOK QUESTION: Test both the math quiz and stat quiz variables for their resemblance to
normal distributions. Based on skewness, kurtosis, and the Shapiro-Wilk statistic, which variable has a
sample distribution that is not very consistent with the assumption of normality in the population?

Skewness and Kurtosis

DIRECTIONS: Find the skewness and kurtosis for mathquiz and statquiz
NOTE: Yes, you just did this above using the psych::describe() function… so you may skip
it here if you want.
# Find the skewness and kurtosis for: mathquiz, statquiz
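
One possible shape for this step is sketched below on made-up data. The data frame `toy` is hypothetical, and the dplyr and psych packages are assumed to be installed. psych::describe() returns one row per variable, so the per-variable n, mean, skew, and kurtosis can be read off together.

```r
library(dplyr)
library(psych)

# Hypothetical quiz scores (NOT the course dataset); mathquiz has two missing values
toy <- data.frame(mathquiz = c(30, 25, NA, 35, 28, 32, NA, 27, 31, 24),
                  statquiz = c(6, 7, 5, 8, 6, 7, 4, 9, 6, 7))

desc <- toy %>%
  dplyr::select(mathquiz, statquiz) %>%   # keep only the variables of interest
  psych::describe()                       # one row of summary statistics per variable

# Show only the columns needed here
desc[, c("n", "mean", "skew", "kurtosis")]
```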

Shapiro-Wilk’s Test

DIRECTIONS: Use the shapiro.test() function to test for normality in a smallish sample.
NOTE: You must use a dplyr::pull() step to pull out one variable from the dataset before
you can use the shapiro.test() function.
# Shapiro-Wilk's Normality Test for: mathquiz

# Shapiro-Wilk's Normality Test for: statquiz
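
The pull-then-test pattern described in the note might look like the following sketch. The data frame `toy` is hypothetical and dplyr is assumed to be installed. shapiro.test() expects a plain numeric vector, which is what dplyr::pull() hands to it.

```r
library(dplyr)

# Hypothetical quiz scores, NOT the course dataset
toy <- data.frame(statquiz = c(6, 7, 5, 8, 6, 7, 4, 9, 6, 7))

sw <- toy %>%
  dplyr::pull(statquiz) %>%   # extract the column as a numeric vector
  shapiro.test()              # Shapiro-Wilk test of normality

sw$statistic   # the W statistic
sw$p.value     # a small p value suggests a departure from normality
```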

Create Histograms

DIRECTIONS: Use geom_histogram() after setting the ggplot(aes()). Make sure to try different bins
= # or binwidth = # to get a ‘good looking’ plot.
NOTE: For histograms, you do need to specify the variable name as x in the aes(x = variable)
option.
# Histogram for: mathquiz

# Histogram for: statquiz
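
A minimal histogram sketch on made-up data follows. The data frame `toy` and the binwidth of 2 are assumptions chosen only for illustration, and dplyr and ggplot2 are assumed to be installed.

```r
library(dplyr)
library(ggplot2)

# Hypothetical quiz scores, NOT the course dataset
toy <- data.frame(mathquiz = c(30, 25, 35, 28, 32, 27, 31, 24, 29, 33))

p_hist <- toy %>%
  ggplot(aes(x = mathquiz)) +      # histograms map the variable to x
  geom_histogram(binwidth = 2)     # try other binwidth (or bins) values

p_hist
```

Changing binwidth (or switching to bins) redraws the same data with coarser or finer bars, so it is worth trying a few values before settling on one.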

Create QQ Plots

DIRECTIONS: Use geom_qq() after setting the ggplot(aes()).

NOTE: For qq plots, you do need to specify the variable name as sample in the aes(sample =
variable) option.
# QQ plot for: mathquiz

# QQ plot for: statquiz
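
A minimal QQ plot sketch on made-up data follows. The data frame `toy` is hypothetical, and dplyr and ggplot2 are assumed to be installed. The only change from a histogram call is that the variable is mapped to sample rather than x.

```r
library(dplyr)
library(ggplot2)

# Hypothetical quiz scores, NOT the course dataset
toy <- data.frame(statquiz = c(6, 7, 5, 8, 6, 7, 4, 9, 6, 7))

p_qq <- toy %>%
  ggplot(aes(sample = statquiz)) +   # QQ plots map the variable to sample
  geom_qq()                          # sample quantiles vs. normal quantiles

p_qq
```

Points that fall close to a straight line are consistent with a normal distribution.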

Chapter 6. Confidence Interval Estimation: The t Distribution

6C-1. 1-sample t-tests for anx_base, anx_pre, and anx_post

TEXTBOOK QUESTION: Perform one-sample t tests to determine whether the baseline, pre-, or postquiz
anxiety scores of Ihno’s students differ significantly (α = .05, two-tailed) from the mean (µ = 18) found by a
very large study of college students across the country. Find the 95% confidence interval for the population
mean for each of the three anxiety measures.
DIRECTIONS: Use the t.test(mu = #) function to perform a 1 sample t-test. Make sure to specify the
Null hypothesis value for µ.
NOTE: You must use a dplyr::pull() step to pull out one variable from the dataset before
you can use the t.test() function.
# 1-sample t-test for: anx_base

# 1-sample t-test for: anx_pre

# 1-sample t-test for: anx_post
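
The pull-then-test pattern for a one-sample t-test can be sketched as below. The data frame `toy` is hypothetical, dplyr is assumed to be installed, and µ = 18 is the null value given in the textbook question.

```r
library(dplyr)

# Hypothetical anxiety scores, NOT the course dataset
toy <- data.frame(anx_base = c(17, 20, 22, 15, 19, 21, 18, 24, 16, 23))

res <- toy %>%
  dplyr::pull(anx_base) %>%   # extract the column as a numeric vector
  t.test(mu = 18)             # two-sided test against the null value of 18

res$statistic   # t
res$parameter   # degrees of freedom (n - 1)
res$p.value     # two-tailed p value
res$conf.int    # 95% confidence interval by default
```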

6C-2. 1-sample t-tests for hr_base among MEN

TEXTBOOK QUESTION: Perform a one-sample t test to determine whether the average baseline heart
rate of Ihno’s male students differs significantly from the mean heart rate (µ = 70) for college-aged men at
the .01 level, two-tailed. Find the 99% confidence interval for the population mean represented by Ihno’s
male students.
DIRECTIONS: Similar to the last problem, use the t.test(mu = #) function to perform a 1 sample
t-test. This time, make sure to subset the males only with a dplyr::filter() step prior to the
dplyr::pull() step.
NOTE: To change from the default 95% confidence interval, make sure to specify conf.level =
0.99 inside the t.test() function.
# 1-sample t-test for MALES: hr_base
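
Adding a filter step and a non-default confidence level might look like this sketch. The data frame `toy` is hypothetical, dplyr is assumed to be installed, and the factor labels "Female" and "Male" follow the genderF labels defined in the data preparation step.

```r
library(dplyr)

# Hypothetical heart rates, NOT the course dataset
toy <- data.frame(
  genderF = factor(c("Female", "Male", "Male", "Female",
                     "Male", "Male", "Female", "Male")),
  hr_base = c(72, 68, 74, 70, 66, 71, 75, 69)
)

res <- toy %>%
  dplyr::filter(genderF == "Male") %>%   # keep only the male students
  dplyr::pull(hr_base) %>%               # extract the column as a numeric vector
  t.test(mu = 70, conf.level = 0.99)     # null value 70, 99% confidence interval

res$parameter   # degrees of freedom reflect the filtered subgroup only
res$conf.int
```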

6C-3. 1-sample t-tests for hr_post among FEMALE

TEXTBOOK QUESTION: Perform a one-sample t test to determine whether the average postquiz heart
rate of Ihno’s female students differs significantly (α = .05, two-tailed) from the mean resting heart rate
(µ = 72) for college-aged women. Find the 95% confidence interval for the population mean represented by
Ihno’s female students.
DIRECTIONS: This time, subset out WOMEN and choose the post-quiz heart rate. Also, use a different
population null value.
# 1-sample t-test for FEMALES: hr_post

Chapter 7. Independent Samples t-Test for Means

7C-1. Independent Samples t-Test for Mean hr_base by genderF

TEXTBOOK QUESTION: Perform a two-sample t test to determine whether there is a statistically
significant difference in baseline heart rate between the men and the women of Ihno’s class. Do you
have homogeneity of variance? Report your results as they might appear in a journal article. Include the
95% CI for this gender difference.

Assumption Check: Homogeneity of Variance

DIRECTIONS: Before performing the test, check to see if the assumption of homogeneity of variance is
met using Levene’s Test. For an independent samples t-test for means, the men and women need to have
the same amount of spread (SD) in their baseline heart rates.
NOTE: Use the car::leveneTest() function to do this. Inside the function you need to specify
at least three things (separated by commas):
• the formula: continuous_var ~ grouping_var (replace with your variable names)
• the dataset: data = . to pipe it from above
• the center: center = "mean" since we are comparing means
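
The three arguments listed above can be combined as in the following sketch. The data frame `toy` is hypothetical, and the dplyr and car packages are assumed to be installed.

```r
library(dplyr)
library(car)

# Hypothetical heart rates for two groups of six, NOT the course dataset
toy <- data.frame(
  genderF = factor(rep(c("Female", "Male"), each = 6)),
  hr_base = c(70, 72, 68, 74, 71, 69,
              66, 78, 62, 80, 70, 64)
)

lev <- toy %>%
  car::leveneTest(hr_base ~ genderF,   # continuous_var ~ grouping_var
                  data = .,            # the piped-in dataset
                  center = "mean")     # deviations are taken from each group mean

lev   # a small p value suggests the group variances differ
```

The result is an ANOVA-style table. The row with the F value and its p value is the one to read when deciding whether var.equal = TRUE is reasonable.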

Perform the t-Test for Means in 2 Indep Groups

DIRECTIONS: Test if men and women have different baseline heart rates using the t.test() function.
Use the same t.test() function we have used in the prior chapters. This time you need to specify
a few more options:
• the formula: continuous_var ~ grouping_var (replace with your variable names)
• the dataset: data = . to pipe it from above
• independent vs. paired: paired = FALSE (this is the default)
• is homogeneity satisfied: var.equal = TRUE (NOT the default)
• confidence level: conf.level = # (defaults to .95)
# indep groups t-test for means: hr_base by genderF
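
A sketch of the formula interface with a piped dataset is shown below. The data frame `toy` is hypothetical and dplyr is assumed to be installed. var.equal = TRUE is used here only on the assumption that Levene's test gave no evidence against equal variances, and paired is left at its default.

```r
library(dplyr)

# Hypothetical heart rates for two groups of six, NOT the course dataset
toy <- data.frame(
  genderF = factor(rep(c("Female", "Male"), each = 6)),
  hr_base = c(70, 72, 68, 74, 71, 69,
              66, 78, 62, 80, 70, 64)
)

res <- toy %>%
  t.test(hr_base ~ genderF,    # continuous_var ~ grouping_var
         data = .,             # the piped-in dataset
         var.equal = TRUE,     # pooled-variance (Student's) t-test
         conf.level = 0.95)    # 95% CI for the difference in means

res$parameter   # degrees of freedom: n1 + n2 - 2 with pooled variance
res$estimate    # the two group means
res$conf.int    # CI for (first level mean) minus (second level mean)
```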

7C-5. Independent Samples t-Test for Mean hr_post by coffeeF

TEXTBOOK QUESTIONS: Perform a two-sample t test to determine whether coffee drinkers exhibited
significantly higher postquiz heart rates than nondrinkers at the .05 level. Is this t test significant at the
.01 level? Find the 99% confidence interval for the difference of the two population means and explain its
connection to your decision regarding the null hypothesis at the .01 level.

Assumption Check: Homogeneity of Variance

DIRECTIONS: Just like the last question, run Levene’s test first.

Perform the t-Test for Means in 2 Indep Groups

DIRECTIONS: Make sure to change the confidence level to 99%.

# indep groups t-test for means: hr_post by coffeeF
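
The link between the 99% confidence interval and the decision at the .01 level can be checked directly, as in this sketch on made-up data. The data frame `toy` and its short labels "No" and "Yes" are hypothetical, and var.equal = TRUE is an assumption here. For a two-tailed test, the interval excludes zero exactly when the p value is below .01.

```r
# Hypothetical postquiz heart rates, NOT the course dataset
toy <- data.frame(
  coffeeF = factor(rep(c("No", "Yes"), each = 6)),
  hr_post = c(68, 72, 70, 66, 74, 71,
              76, 80, 73, 79, 75, 82)
)

res <- t.test(hr_post ~ coffeeF,
              data = toy,
              var.equal = TRUE,     # assumes Levene's test raised no concern
              conf.level = 0.99)    # 99% CI for mean(No) minus mean(Yes)

# Does the 99% interval exclude zero?
ci_excludes_zero <- res$conf.int[1] > 0 || res$conf.int[2] < 0

# Is the two-tailed test significant at the .01 level?
reject_at_01 <- res$p.value < 0.01

c(ci_excludes_zero = ci_excludes_zero, reject_at_01 = reject_at_01)
```

The two logical values always agree for a two-tailed test, because the interval and the test are built from the same t distribution and standard error.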
