STAT 1000 Assignment - Solutions

This document provides instructions for an assignment in STAT 1000 that involves analyzing student grade data. Students are asked to import a dataset of 500 student grades, isolate the grades for one section of 250 students, and assign letter grades. The document then lists 15 questions for students to answer using the grade data, including calculating descriptive statistics, creating histograms and boxplots, and determining grade point averages.

Uploaded by

Masudul Islam

STAT 1000 - Assignment 1

Solutions

2023-02-06

Setup [3 marks]

Before you begin, set your name and student number in Line 3. [1 mark]

0. Import the FullCourse dataset, available on the UMLearn page. Make sure you have “Heading” set to
“Yes” when you import the data, and make sure you name the object FullCourse. [2 marks]

FullCourse <- read.csv("~/R_Datasets/FullCourse.csv")

This dataset contains Midterm and Final Exam grades for an entire course of 500 students (including two
sections of 250 students each). The Midterm and Final Exam grades are each marked out of 100.
Suppose that you are the instructor for one of the sections of this course. The block of code below will isolate
the grades of the students in your section only, and save them as a dataframe named MyClass. Also, it will
assign letter grades based on their weighted total grade (the midterm is worth 40% of the total grade, while
the final is worth 60%). This is the dataset you will use to answer the assignment questions.
After importing the data, replace 1111111 with your seven-digit student id number in the set.seed function
below, and click the green arrow at the top-right hand side of the code chunk. This part is not worth marks,
but you will receive a five-mark deduction on your assignment if it is not completed correctly.

set.seed(1111111)
MyClass = FullCourse[sample(1:NROW(FullCourse), 250), ]
MyClass$Letters = cut(0.4*MyClass$Midterm + 0.6*MyClass$Final,
                      c(0, 50, 60, 65, 70, 75, 80, 90, 100),
                      c("F", "D", "C", "C+", "B", "B+", "A", "A+"))
rm(FullCourse)

Make sure you complete the setup steps before beginning your assignment!
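
To see what the setup chunk does to each student's grades, here is a minimal sketch using three made-up students. The vectors `midterm`, `final`, and `letter_grades` are hypothetical names introduced only for this illustration and are not part of the FullCourse data.

```r
# Hypothetical grades for three students (not taken from FullCourse).
midterm <- c(40, 70, 95)
final   <- c(55, 80, 90)

# Weighted total: the midterm counts for 40%, the final for 60%.
total <- 0.4 * midterm + 0.6 * final

# Same breaks and labels as the setup chunk.
# cut() uses right-closed intervals by default: (0,50], (50,60], ...
letter_grades <- cut(total,
                     breaks = c(0, 50, 60, 65, 70, 75, 80, 90, 100),
                     labels = c("F", "D", "C", "C+", "B", "B+", "A", "A+"))

print(data.frame(midterm, final, total, letter_grades))
```

Because the intervals are right-closed, a total of exactly 50 would be an F and a total of exactly 0 would fall outside every interval (giving NA) unless `include.lowest = TRUE` were added.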

Questions [22 marks]

For the following questions, use the MyClass dataset that you created in the setup stage.

1. For each of the variables in the MyClass dataset, what is the data type? (I.e., categorical and nominal,
categorical and ordinal, or quantitative?) [2 marks]

Midterms and Final Exams are quantitative, while the Letters are categorical and ordinal.
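
One way to check this in R is to look at how each column is stored. The sketch below builds a small hypothetical data frame named `demo` with the same three column names; it is an illustration only, not the MyClass data.

```r
# Hypothetical three-student data frame mirroring the MyClass columns.
demo <- data.frame(Midterms = c(62, 88, 45), Finals = c(70, 91, 38))
demo$Letters <- cut(0.4 * demo$Midterms + 0.6 * demo$Finals,
                    breaks = c(0, 50, 60, 65, 70, 75, 80, 90, 100),
                    labels = c("F", "D", "C", "C+", "B", "B+", "A", "A+"))

# Numeric columns hold quantitative data; a factor holds categorical data.
print(sapply(demo, class))

# cut() returns an unordered factor by default, even though letter grades
# have a natural order. The levels are still stored from lowest to highest.
print(is.ordered(demo$Letters))
print(levels(demo$Letters))

# An ordered factor makes the ordinal nature explicit.
ordered_letters <- factor(demo$Letters, levels = levels(demo$Letters), ordered = TRUE)
print(is.ordered(ordered_letters))
```

R's storage class does not decide the statistical data type on its own; the letters are ordinal because F < D < ... < A+ is a meaningful ranking.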

2. Create a histogram of the Midterm grades and a histogram of the Final Exam grades. Set the breaks
to 10. Determine an appropriate title for each graph, and an appropriate label for the x-axes (do not
leave them at their default values). Make sure the two histograms are different colours. [3 marks]

hist(MyClass$Midterms, breaks = 10, main = "Midterm Grades", xlab = "Grades", col = "violetred1")

[Figure: histogram titled "Midterm Grades"; x-axis: Grades, y-axis: Frequency]

hist(MyClass$Finals, breaks = 10, main = "Final Exam Grades", xlab = "Grades", col = "tomato2")

[Figure: histogram titled "Final Exam Grades"; x-axis: Grades, y-axis: Frequency]
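
When `breaks` is a single number, `hist()` treats it as a suggestion and picks "pretty" cut points, so the number of bars may not be exactly 10. The sketch below, which uses a hypothetical vector `grades` rather than the assignment data, shows how to inspect the bins that were actually chosen without drawing the plot.

```r
# Hypothetical grades, not the assignment data.
grades <- c(11, 35, 48, 55, 61, 66, 70, 74, 75, 79, 82, 86, 88, 91, 95, 100)

# plot = FALSE returns the histogram object instead of drawing it.
h <- hist(grades, breaks = 10, plot = FALSE)

print(h$breaks)  # the bin boundaries R actually used
print(h$counts)  # number of observations in each bin
```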

3. Describe the shape of the distributions you see (in particular, the direction of the skewness). [1 mark]

Both distributions are left-skewed: the longer tail extends toward the lower grades.

4. Based on your previous answer, do you expect the mean of each variable to be greater than, less than,
or approximately equal to the median? Why? [1 mark]

Since the distributions are left-skewed, the low grades in the tail pull the mean down more than the median, so I expect the mean of each variable to be less than the median.
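
A small made-up example illustrates the idea. The vector `skewed` below is hypothetical; it has one very low value that creates a left tail.

```r
# Hypothetical left-skewed grades: most values are high, one is very low.
skewed <- c(20, 60, 70, 75, 80, 85, 90)

print(mean(skewed))    # pulled down by the low value
print(median(skewed))  # the middle value, unaffected by how low the minimum is
```

Here the mean (about 68.6) falls below the median (75), which is the pattern expected for a left-skewed distribution.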

5. Calculate the means of the Midterm grades and the Final Exam grades. [1 mark]

mean(MyClass$Midterms)

## [1] 71.668

mean(MyClass$Finals)

## [1] 66.108

6. Calculate the medians of the Midterm grades and the Final Exam grades. [1 mark]

median(MyClass$Midterms)

## [1] 74.5

median(MyClass$Finals)

## [1] 67

7. Do your results in Question 5 and Question 6 match your remarks in Question 4? [1 mark]

Yes. For each variable, the mean is less than the median (71.668 < 74.5 for the Midterm and 66.108 < 67 for the Final Exam).

8. Calculate the five number summaries of the Midterm grades and the Final Exam grades. [1 mark]

fivenum(MyClass$Midterms)

## [1] 11.0 61.0 74.5 86.0 100.0

fivenum(MyClass$Finals)

## [1] 5 54 67 81 99
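
As a side note, `fivenum()` reports Tukey's hinges, which can differ slightly from the quartiles returned by `quantile()` with its default settings. The sketch below uses a hypothetical six-value vector `x` to show the difference; it is not based on the assignment data.

```r
# Hypothetical data with an even number of observations.
x <- c(1, 2, 3, 4, 5, 6)

# Tukey's five-number summary: min, lower hinge, median, upper hinge, max.
print(fivenum(x))

# Default quantile() interpolates, so the quartiles are not the hinges here.
print(quantile(x))
```

The minimum, median, and maximum agree, while the lower and upper values differ (2 and 5 from `fivenum()`, 2.25 and 4.75 from `quantile()`).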

9. Calculate the standard deviations of the Midterm grades and the Final Exam grades. [1 mark]

sd(MyClass$Midterms)

## [1] 17.96666

sd(MyClass$Finals)

## [1] 18.64938

10. Based on the shape of each histogram, would it be better to describe these distributions with the
mean and standard deviation, or with the five number summary? Why? [1 mark]

Since the distributions are skewed, it would be preferable to describe them with the five number summary as
opposed to the mean and standard deviation, because the mean and standard deviation are strongly affected
by the extreme values in the tail.
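
The following sketch shows this sensitivity on made-up data. The vectors `symmetric` and `with_low_tail` are hypothetical and differ only in their smallest value.

```r
# Two hypothetical samples that differ only in the lowest grade.
symmetric     <- c(60, 65, 70, 75, 80)
with_low_tail <- c(5, 65, 70, 75, 80)

# The mean and standard deviation both react to the single extreme value.
print(c(mean(symmetric), mean(with_low_tail)))
print(c(sd(symmetric), sd(with_low_tail)))

# In the five-number summary only the minimum changes.
print(fivenum(symmetric))
print(fivenum(with_low_tail))
```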

11. Create a horizontal outlier boxplot of the Midterm grades. Determine an appropriate title for the
graph, and an appropriate label for the x-axis (it is okay to have the same name for the title and the
x-axis). [2 marks]

boxplot(MyClass$Midterms, horizontal = TRUE, xlab = "Grades", main = "Midterm Grades")

[Figure: horizontal outlier boxplot titled "Midterm Grades"; x-axis: Grades]

12. Create a horizontal quantile boxplot of the Midterm grades. Determine an appropriate title for the
graph, and an appropriate label for the x-axis (it is okay to have the same name for the title and the
x-axis). [1 mark]

boxplot(MyClass$Midterms, horizontal = TRUE, xlab = "Grades", main = "Midterm Grades", range = 0)

[Figure: horizontal quantile boxplot titled "Midterm Grades"; x-axis: Grades]
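
The `range` argument of `boxplot()` is passed to `boxplot.stats()` as `coef`, so the difference between the outlier and quantile boxplots can be inspected numerically. The sketch below uses a hypothetical vector `grades`, not the assignment data.

```r
# Hypothetical grades with one unusually low value.
grades <- c(11, 55, 61, 66, 70, 74, 75, 79, 82, 86, 88, 91, 95, 100)

# Default coef = 1.5: points beyond 1.5 * IQR from the hinges are flagged.
outlier_box <- grDevices::boxplot.stats(grades)
print(outlier_box$stats)  # whisker ends stop at the most extreme non-outliers
print(outlier_box$out)    # flagged outliers

# coef = 0: whiskers extend to the data extremes and nothing is flagged.
quantile_box <- grDevices::boxplot.stats(grades, coef = 0)
print(quantile_box$stats)
print(quantile_box$out)
```

With these values the low grade of 11 is flagged in the outlier version, while the quantile version simply draws the whisker down to 11.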

13. Create a side-by-side vertical outlier boxplot comparing the Midterm and Final Exam grades. Determine
an appropriate title for the graph, and an appropriate label for the y-axis (it is okay to have the
same name for the title and the y-axis). Use the names argument to set the names of the individual
boxplots. [2 marks]

boxplot(MyClass$Midterms, MyClass$Finals, ylab = "Grades", main = "Midterm vs Final Exam Grades",
        names = c("Midterm", "Final Exam"))

[Figure: side-by-side vertical outlier boxplots titled "Midterm vs Final Exam Grades"; y-axis: Grades; boxes labelled "Midterm" and "Final Exam"]

To answer the following two questions, just use R as a calculator. You don’t need to use any functions.

14. Using the five number summary calculated in Question 8, calculate and print out the upper and lower
fences used in the construction of the outlier boxplot in the previous question. [2 marks]

LF.midterms = 61 - 1.5*(86 - 61)
UF.midterms = 86 + 1.5*(86 - 61)
LF.finals = 54 - 1.5*(81 - 54)
UF.finals = 81 + 1.5*(81 - 54)

LF.midterms

## [1] 23.5

UF.midterms

## [1] 123.5

LF.finals

## [1] 13.5

UF.finals

## [1] 121.5
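
The assignment only asks for calculator-style arithmetic, but the same rule (lower fence = Q1 - 1.5 × IQR, upper fence = Q3 + 1.5 × IQR) can be wrapped in a small helper as a cross-check. The function `fences()` below is a hypothetical helper written for this illustration; the quartiles come from the five number summaries in Question 8.

```r
# Hypothetical helper: fences from the first and third quartiles.
fences <- function(q1, q3, k = 1.5) {
  iqr <- q3 - q1
  c(lower = q1 - k * iqr, upper = q3 + k * iqr)
}

# Quartiles taken from the five number summaries in Question 8.
midterm_fences <- fences(q1 = 61, q3 = 86)
final_fences   <- fences(q1 = 54, q3 = 81)

print(midterm_fences)
print(final_fences)
```

Both upper fences lie above 100, so no grade can be flagged as a high outlier; only grades below the lower fences can appear as outliers.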

15. Below is a frequency table of the letter grades in this class (knit the file to view):

##
##  F  D  C C+  B B+  A A+
## 32 43 28 17 27 35 50 18

What is the average number of grade points received by students in this class?
Note: the table below displays the letter grade to grade point conversion. Knit this file to PDF and view the
output to see it.

Letter Grade   A+    A     B+    B     C+    C     D     F
Grade Point    4.5   4.0   3.5   3.0   2.5   2.0   1.0   0.0

(0*32 + 1*43 + 2*28 + 2.5*17 + 3*27 + 3.5*35 + 4*50 + 4.5*18)/250

## [1] 2.504
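
This is a weighted mean of the grade points, with the letter-grade frequencies as weights. As a sketch, the same calculation can be written with named vectors; `counts` and `points` are hypothetical names, filled in from the frequency table and the conversion table above.

```r
# Frequencies from the letter-grade table.
counts <- c("F" = 32, "D" = 43, "C" = 28, "C+" = 17,
            "B" = 27, "B+" = 35, "A" = 50, "A+" = 18)

# Grade points from the conversion table.
points <- c("F" = 0, "D" = 1, "C" = 2, "C+" = 2.5,
            "B" = 3, "B+" = 3.5, "A" = 4, "A+" = 4.5)

# Match points to counts by name, then take the frequency-weighted average.
gpa <- sum(counts * points[names(counts)]) / sum(counts)
print(gpa)

# weighted.mean() gives the same result.
print(weighted.mean(points[names(counts)], w = counts))
```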
