MATH 1002 Assignment 1 LPP
MATH 1002 Assignment 1 LPP
Assignment #1
Due Date
Friday, October 11, 2024 at 11:59PM.
Submission
You are required to submit an electronic copy of your assignment to me via Blackboard. Please follow the
submission instructions carefully.
• Assignment #1 consists of five sections (A to E) with several questions per section. The
point value for each section is indicated.
• The “Assignment #1” link will be available at the bottom of the Module 4 content page. From there
you will be presented with questions from each section, copied from the assignment, and you will
answer each section separately. You can prepare answers in one document and then post
individual answers in the appropriate sections.
• If, for whatever reason, you cannot submit your assignment on Blackboard (e.g., Blackboard is
down), please email me the file at [email protected]. However, even if you email me the file, I
would also like you to submit on Blackboard as soon as it is accessible again.
Grading
The assignment is worth 10% of your final grade. The assignment will be graded for: (i) accuracy (i.e.,
statistical analysis was performed correctly and graphs contain all appropriate information – axis titles,
etc.), (ii) your ability to answer the specific questions that are asked, and (iii) overall presentation quality.
Late assignments will be penalized 10% per day, up to 5 days (as per the BScN Student Handbook).
Overview
Each student will be provided with a different dataset to analyze. To download the dataset that is
specific to you, go to the end of this document to find the list of assigned datasets. The dataset files
are attached to the assignment link and you can download your specific dataset.
Although every student will be expected to perform the same analyses, the details of the analysis will
differ. The numerical details of each dataset will vary, but all of them are similar data from a fictional
study, described below.
Suppose you are a researcher who is interested in the Body Mass Index (BMI) of individuals in your
study. You have collected BMI measurements from 900 different study participants. The dataset that you
will be provided is an Excel file that contains a list of these 900 data points. In other words, there are 900
cells with numbers in the data file, and each cell represents the BMI of one patient. Note: Be sure to
scroll down and across to see all 900 data points.
There are five sections (A to E) to this assignment. For each section, you will need to answer specific
questions that are asked (in sentence form) and include any requested output data (tables, graphs,
calculations, etc.).
Section A (9 marks)
Using all 900 data points for the BMI of study participants, answer the following questions:
Question A1: In Excel, construct a complete ungrouped frequency distribution table, including
frequencies, percentages, and cumulative percentages. Paste your table here. (4 marks)
Question A2: What is the most common BMI value among the sample? (1 mark)
Question A3: What is the statistical name given to this value? (1 mark)
Question A5: How many patients have a BMI less than or equal to 26? (1 mark)
Question A6: What percentage of patients have a BMI between 27 and 30, inclusive? (1 mark)
Using all 900 data points for the BMI of study participants, answer the following question:
Question B1: In Excel, using the ungrouped frequency distribution that you generated in Question A1,
generate a properly labeled histogram. Paste your histogram here. (4 marks)
Next, copy all 900 data points and paste them in a blank sheet. To the pasted data points, add 30 additional
participants having a BMI value of 20 (in cells AE1 to AE30). Also, add 30 additional participants having
a BMI value of 36 (in cells AF1 to AF30). Using all 960 data points for the BMI of study participants,
answer the following questions:
Question B2: In Excel, using the now 960 data points, construct a new complete ungrouped frequency
distribution table, including frequencies, percentages, and cumulative percentages. Paste your table here.
(4 marks)
Question B3: In Excel, using new ungrouped frequency distribution that you generated in Question B2,
generate a properly labeled histogram. Paste your histogram here. (4 marks)
Question B4: Comparing the histograms generated in Questions B2 and B3, describe each histogram
(symmetrical, skewed-left or skewed-right) Explain your answers. (2 marks)
Using the original 900 data points for the BMI of study participants, answer the following questions:
Question C1: In Excel, tabulate a grouped frequency distribution table for the BMI of study
participants. Your table will consist of six class intervals. Your grouped frequency distribution should
include frequencies, percentages, and cumulative percentages. Paste your table here. (4 marks)
Question C2: Which class interval has the highest number of participants in it? (1 mark)
Question C3: In Excel, using the grouped frequency distribution that you generated in Question C1, generate
a bar graph showing the percentages for each of the class intervals. Paste your bar graph here. (4 marks)
Question C4: In Excel, using the grouped frequency distribution that you generated in Question C1, generate
a pic chart showing the percentages for each of the class intervals. Paste your pie chart here. (4 marks)
Question C5: You generated both a bar graph and pie chart based on the exact same data. If you were to
present the data in a study, which figure would you use? Explain your choice. (1 mark)
Using the original 900 data points (ie. population) for the BMI of study participants, answer the following
questions:
Question D1: What is the population mean? Include the single Excel command required for determining this
value. (2 marks)
Question D2: What is the population standard deviation? Include the single Excel command required for
determining this value. (2 marks)
Question D3: What is the population median? Include the single Excel command required for determining
this value. (2 marks)
Question D4: What is the population first quartile (Q1) value? Include the single Excel command required
for determining this value. (2 marks)
Question D5: What is the population third quartile (Q3) value? Include the single Excel command required
for determining this value. (2 marks)
Question D6: What is the population interquartile range (IQR) value? (1 mark)
Question D7: In Excel, generate a box-and-whisker plot showing Q1, Q3, median and whiskers. Paste
your plot here. (4 marks)
Question D8: Above what BMI value would be considered a high, extreme outlier? (1 mark)
Section E (6 marks)
Selecting a sample of the BMI of study participants, BMI values in cells AC1 to AC30, answer the
following questions:
Question E1: What is the sample mean? Include the single Excel command required for determining this
value. (2 marks)
Question E2: What is the sample standard deviation? Include the single Excel command required for
determining this value. (2 marks)
Question E3: What is the sample variance? Include the single Excel command required for determining this
value. (2 marks)