0% found this document useful (0 votes)

65 views

Descriptive Statistics Tutorial 1 Solutions

This document provides solutions to descriptive statistics tutorial questions. It calculates summary measures like means, medians, and modes for sample data on birth weights. The mean, median, and percentiles are reported. Variables are also classified as quantitative vs. categorical, and nominal vs. ordinal vs. continuous. Measures of central tendency and variance are calculated separately for male and female birth weights. In summary, it demonstrates calculating and interpreting common descriptive statistics.

Uploaded by

Jinitha Babe

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

65 views

Descriptive Statistics Tutorial 1 Solutions

Uploaded by

Jinitha Babe

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

Introduction to Biostatistics

PUB HLTH 7074 / 7074UAC / 4274

Descriptive Statistics: Tutorial 1 Solutions

1. Using summation symbol 

We will consider the case when these xi values are quantitative data values such as:

x = {2, 2, 5.5, -1, 0, 7, 1, 12} for example.

Using the data above, calculate the following:

5
(a) x
i 1
i  2  2  5.5  1  0  8.5

7
(b) x
i 4
i  1  0  7  1  7

3
(c) x
i 1
2
i  2 2  2 2  (5.5) 2  38.25

2
 3 
(d)   xi   ( 2  2  5.5)  (9.5)  90.25
2 2

 i 1 

2. Using more than one summation symbol

In some situations we may wish to use more than one summation symbol, for instance we might have a
2 2
formula that looks like x
i 1 j 1
ij .

To illustrate, we shall consider a study of proneness to Acute Respiratory Infection, where the following data
were obtained:

Sex Total
Prone male (column 1) female (column 2)
not prone (row 1) 107 124 231
prone (row 2) 157 101 258
Total 264 225 489

Note: It is common for i to index the rows and j to index the columns. As a result, x12 would represent the
number in row 1, column 2, i.e. not prone and female = 124.

Using data from the Acute Respiratory Infection study above, calculate the following:

2
(a) x
j 1
1j  x11  x12  107  124  231
2 2 2
(b)  xij   xi1  xi 2  x11  x12  x21  x22  107  124  157  101  489
i 1 j 1 i 1

Page 1 of 5
3. Using more than one set of data

Using the following data:

x = {2, -3, 6, -1, 0, 7, 1, 12}

y = {1, 0, -1, 2, 8, 4, 10, 3},

calculate the following quantities:

3
(a) x y
i 1
i i  x1 y1  x 2 y 2  x3 y3  (2  1)  (3  0)  (6  1)  2  0  6  4
3 3
(b) x y
i 1
i
i 1
i  (2  3  6)  (1  0  1)  5  0  0

4. Using what we know to calculate a mean and a variance

Let’s say we have three recordings of systolic blood pressure (in mmHg) on the same individual, i.e.

x = { 120, 130, 125 }, where x  {x1 , x2 , x3 } .

(a) Calculate the arithmetic mean for these data and comment on why this may be an informative measure.

x i
x1  x2  x3 (120  130  125)
mean = x i 1
   125 mmHg
n n 3

The arithmetic mean may be a good estimate for the true mean systolic blood pressure but we need to know
a bit more about the data (i.e. under what circumstances were they collected? At times that were close
together, or say in low, medium and high stress environments respectively? You can probably think of other
ways the data could have arisen...)

(b) Calculate the variance of the blood pressure recordings (in mmHg2) and comment on why this may be an
informative measure.

 x  x
2

i 1
i
(120  125 ) 2  (130  125 ) 2  (125  125 ) 2 25  25  0
variance =    25 mmHg2
n 1 3 1 2
The variance may be informative as it gives us an idea of how the data may vary about its mean. However,
its accuracy may be dubious with only three observations - and again, its relevance depends on the context
of the data. As a measure on its own (i.e. 25), this means little to us (at this stage in the course) unless it is
to compare, for instance, with another person who has lower or higher measures of variance (e.g. 10 mmHg2
or 50 mmHg2).

Page 2 of 5
5. Classification of variables

Classify the following variables as either quantitative (then either discrete or continuous) or categorical (then
nominal or ordinal).

(a) A ‘Likert’ scale used in an opinion poll taking values 1 to 5, (where 1 = strongly disagree, ... ,
3 = agree, ... , 5 = strongly agree).

Definition: a widely used questionnaire format named by developer, Rensis Likert. Respondents of
questionnaires are asked to choose from several responses in a range such as ‘strongly agree’, ‘agree’,
‘undecided’, ‘disagree’, and ‘strongly disagree’. Each response receives a number rating. The five-point
Likert scale is most common.

This is a qualitative variable since the actual numbers you use are not important, you could equally state
that A = strongly disagree, ... , C = agree, ... , E = strongly agree.
There is however a clear ordering apparent so the variable is ordinal.

(b) Heartbeats per minute for a newborn baby.

This variable is truly numerical, and as a result, quantitative. In addition it is discrete as it will typically only
take whole numbers [although one could argue continuous].

(c) Time taken to complete an operation.

This variable is also truly numerical, and as a result, quantitative. In addition it is continuous as its
accuracy is only limited by the measuring equipment.

(d) Colour of hair for students enrolled in Introduction to Biostatistics.

This is a qualitative variable since colour of hair is clearly categorical. There is also no clear ordering of
colour so the variable is nominal.

6. Summary measures

Assuming that these babies are a random sample of those born in Australia between midnight and 7am on
18 December 1997, use the sample of birth weights for the nine babies to address the following:

(a) Determine the sample mean, sample median and sample mode.

Sample mean x = 3.10 kg

[ x  (3.8  3.3  ...  3.2) / 9 ]

Sample median = 3.30 kg

[If weights are ordered from smallest to largest, i.e. 1.7, 2.2, 2.8, 3.2, 3.3, 3.6, 3.6, 3.8, and 3.8, the sample
median sits in the middle.]

Sample mode = 3.60 kg and 3.80 kg

[The sample mode is the most frequently occurring value. In this example, two values occur equally
frequently so it is possible to have more than one mode – hence a bimodal distribution.]

Note: it is also perfectly acceptable to have one decimal place for your estimates but for the purposes of this
course we suggest you include one more decimal place than that of the original data.

(b) Discuss which quantity you believe is the most informative measure of central tendency for birth weight
in this example.

The sample median would be most appropriate here since data are left skewed, i.e. a histogram shows

Page 3 of 5
If you weren’t sure about the skewness in the data, there could be some argument for the sample mean
since there sample size is small, and under these circumstances estimates of the mean would be less
variable than estimates of the median.

In other words, if we estimated the mean, median and mode for many samples of the same size from the
same population we would expect:
- estimates of the mean to be closer together (less variable) than
- estimates of the median and
- estimates of the mode

Sorted from smallest to largest, the birth weight values are:

Index (i) Birth weight value

1 1.7
2 2.2
3 2.8
4 3.2
5 3.3
6 3.6
7 3.6
8 3.8
9 3.8

Following the method of Bland on page 49, the median (second quartile) is given by the value
corresponding to index 0.5 x (9+1) = index 5 and so the median is 3.30 (as we already saw in part a).
The 25th percentile (first quartile) is given by 0.25 x (9+1) = index 2.5 and so the 25 th percentile is the
average of the index 2 and index 3 values, given by (2.2+2.8)/2 = 2.50. The 75th percentile is given by
0.75 x (9+1) = index 7.5, and so equal to (3.6+3.8)/2 = 3.70.

(d) Calculate the sample mean birth weight for males and females separately and a measure of sample
variance for both.

For males, x = 3.40 kg and variance = 0.16 kg2.

For females, x = 2.75 kg and variance = 0.94 kg2.

Page 4 of 5
(e) Discuss what the measures calculated in (d) indicate for differences in sex.

The mean weight for males is higher than that for females but variation is a lot smaller for males than
females. So weights for male babies appear to be higher and fairly close to 3.4 kg. Weights for female
babies appear to be generally lower but there is more of variation, i.e. they vary more about the mean
value of 2.75 kg.

We will learn methods for distinguishing whether or not these differences are significant or simply a result
of random variation in later lectures.

Page 5 of 5

French Alphabet Lesson Plan
100% (1)
French Alphabet Lesson Plan
4 pages
Biostatistics and Exercise
100% (8)
Biostatistics and Exercise
97 pages
Big Data Analytics in Genomics Ka Chun Wong PDF
No ratings yet
Big Data Analytics in Genomics Ka Chun Wong PDF
426 pages
Biostatistics Teaching
No ratings yet
Biostatistics Teaching
283 pages
Introduction To Biostatistics
No ratings yet
Introduction To Biostatistics
53 pages
1 Introduct
No ratings yet
1 Introduct
9 pages
Descriptive Statistics and Graphical Techniques-V1
No ratings yet
Descriptive Statistics and Graphical Techniques-V1
52 pages
Bio-Statistics and RD Lecture Note
No ratings yet
Bio-Statistics and RD Lecture Note
176 pages
WK 1b Biostat
No ratings yet
WK 1b Biostat
38 pages
Lecture 1_Online_INTRODUCTION TO BIOSTATISTICS [Compatibility Mode]
No ratings yet
Lecture 1_Online_INTRODUCTION TO BIOSTATISTICS [Compatibility Mode]
28 pages
Biostat Intro
No ratings yet
Biostat Intro
60 pages
Measures of Central Tendency Dispersion and Location
No ratings yet
Measures of Central Tendency Dispersion and Location
3 pages
Biostatistics Nurses Hnd
No ratings yet
Biostatistics Nurses Hnd
125 pages
Lecture4_5_6_7_BCH_4088_2022
No ratings yet
Lecture4_5_6_7_BCH_4088_2022
104 pages
BOT 315 slide
No ratings yet
BOT 315 slide
20 pages
Stat 109
No ratings yet
Stat 109
167 pages
‎⁨نسخة ملزمة-الإحصاء⁩
No ratings yet
‎⁨نسخة ملزمة-الإحصاء⁩
165 pages
Full Slides Beginselen2019
No ratings yet
Full Slides Beginselen2019
364 pages
2statsnotes 1
No ratings yet
2statsnotes 1
24 pages
Lecture 2_Descriptive Statistics
No ratings yet
Lecture 2_Descriptive Statistics
53 pages
Topic 1 - W1-3 Introduction To Biostatistics
No ratings yet
Topic 1 - W1-3 Introduction To Biostatistics
52 pages
5 Introduction To Statistics
No ratings yet
5 Introduction To Statistics
12 pages
Bio Statistics
No ratings yet
Bio Statistics
435 pages
Bio Statistics 3
No ratings yet
Bio Statistics 3
13 pages
Theory Session: Introduction To Biostatistics
No ratings yet
Theory Session: Introduction To Biostatistics
22 pages
2.4 General Epidemiological Measures
No ratings yet
2.4 General Epidemiological Measures
32 pages
20 - Basic Concepts and Terminology in Biostatistics (SepI2020)
No ratings yet
20 - Basic Concepts and Terminology in Biostatistics (SepI2020)
38 pages
Introduction To Biostatistics - Research Etymology: Notes From The Lecture & Orientations
No ratings yet
Introduction To Biostatistics - Research Etymology: Notes From The Lecture & Orientations
2 pages
Lec3&4 02sep2016
No ratings yet
Lec3&4 02sep2016
43 pages
Chapter 1 Introduction to Biostatistics
No ratings yet
Chapter 1 Introduction to Biostatistics
26 pages
Biostatistics Module Sep2023 240520 122333
No ratings yet
Biostatistics Module Sep2023 240520 122333
65 pages
18- Introduction and levels of measurements(2017-18)
No ratings yet
18- Introduction and levels of measurements(2017-18)
41 pages
Biostatistics CN
No ratings yet
Biostatistics CN
79 pages
Introduction to Biostatistics And
No ratings yet
Introduction to Biostatistics And
114 pages
Lecture 1
100% (1)
Lecture 1
33 pages
Introduction (Data Presentation & Summarization
No ratings yet
Introduction (Data Presentation & Summarization
148 pages
Statistics Supplement McEvoy
No ratings yet
Statistics Supplement McEvoy
10 pages
1 - Introduction To Statistics
No ratings yet
1 - Introduction To Statistics
34 pages
Statistics and Biostatistics: Mrs. Khushbu K. Patel Assistant Professor Shri Sarvajanik Pharmacy College
100% (1)
Statistics and Biostatistics: Mrs. Khushbu K. Patel Assistant Professor Shri Sarvajanik Pharmacy College
87 pages
Basic Statistics: Populations and Samples
No ratings yet
Basic Statistics: Populations and Samples
10 pages
COURSE TOPIC-Nures2 CM - CU7
No ratings yet
COURSE TOPIC-Nures2 CM - CU7
11 pages
01_Scales of mesurement_Sumarising numeric data
No ratings yet
01_Scales of mesurement_Sumarising numeric data
26 pages
Biostatistics Series Module 1: Basics of Biostatistics: Resumen
No ratings yet
Biostatistics Series Module 1: Basics of Biostatistics: Resumen
27 pages
Bio Statistics
No ratings yet
Bio Statistics
10 pages
Introduction To Biostatistics1
No ratings yet
Introduction To Biostatistics1
23 pages
Introduction Bio.
No ratings yet
Introduction Bio.
12 pages
Biostatistics in Orthodontics
100% (3)
Biostatistics in Orthodontics
108 pages
Stats
No ratings yet
Stats
2 pages
Contact Details:: Dr. Joy C. Chavez
No ratings yet
Contact Details:: Dr. Joy C. Chavez
54 pages
SPH 2 Lecture - 1 Introduction and Data
No ratings yet
SPH 2 Lecture - 1 Introduction and Data
118 pages
STT034 Lecture
No ratings yet
STT034 Lecture
6 pages
Basic Concepts in Biostatistics-1
No ratings yet
Basic Concepts in Biostatistics-1
40 pages
1 Introduction To Biostatistics
100% (2)
1 Introduction To Biostatistics
52 pages
PSM_2k23
No ratings yet
PSM_2k23
32 pages
Unit 1: Essence of Biostatistics: CS4220: Knowledge Discovery Methods For Bioinformatics
No ratings yet
Unit 1: Essence of Biostatistics: CS4220: Knowledge Discovery Methods For Bioinformatics
114 pages
Different Types of Variable Used in Data Collection
No ratings yet
Different Types of Variable Used in Data Collection
26 pages
Intro to Biostat (1)
No ratings yet
Intro to Biostat (1)
43 pages
5268-1-19590-2-10-20130508
No ratings yet
5268-1-19590-2-10-20130508
3 pages
Intro SRM
No ratings yet
Intro SRM
73 pages
Biostatics and Epidemiology 2022 1
No ratings yet
Biostatics and Epidemiology 2022 1
17 pages
Learn Statistics Fast: A Simplified Detailed Version for Students
From Everand
Learn Statistics Fast: A Simplified Detailed Version for Students
Hesbon R.M
No ratings yet
Factoring and Algebra - A Selection of Classic Mathematical Articles Containing Examples and Exercises on the Subject of Algebra (Mathematics Series)
From Everand
Factoring and Algebra - A Selection of Classic Mathematical Articles Containing Examples and Exercises on the Subject of Algebra (Mathematics Series)
CSPacademic
No ratings yet
Manitou Forklift Tmt315 Fl Parts Manual 547633p
No ratings yet
Manitou Forklift Tmt315 Fl Parts Manual 547633p
25 pages
Exercises II Linear Programming, Duality, Sensitivity Analysis
No ratings yet
Exercises II Linear Programming, Duality, Sensitivity Analysis
2 pages
Engineering Material Specification
100% (1)
Engineering Material Specification
9 pages
PA2 - 12 - BST - Set A Practice
No ratings yet
PA2 - 12 - BST - Set A Practice
10 pages
Service Manual Section
No ratings yet
Service Manual Section
403 pages
Warning Against Extremism by Shaikh Saleh Aalus Shaykh
100% (2)
Warning Against Extremism by Shaikh Saleh Aalus Shaykh
54 pages
CSE 390a: Intro To Shell Scripting
No ratings yet
CSE 390a: Intro To Shell Scripting
22 pages
Mara Resume Dicas
No ratings yet
Mara Resume Dicas
1 page
Daragan v. Short Term Trading Analysis
No ratings yet
Daragan v. Short Term Trading Analysis
61 pages
History of IBM Mainframe Operating Systems
No ratings yet
History of IBM Mainframe Operating Systems
13 pages
2012 Calendar
No ratings yet
2012 Calendar
17 pages
Sohar Islamic's Offices Shift To Ghala
No ratings yet
Sohar Islamic's Offices Shift To Ghala
2 pages
A Soda Bottle Magnetometer
No ratings yet
A Soda Bottle Magnetometer
5 pages
pure substance (1)
No ratings yet
pure substance (1)
15 pages
Farm Management and Production Economics
100% (1)
Farm Management and Production Economics
85 pages
(Ebook) Heavenly Mathematics: The Forgotten Art of Spherical Trigonometry by Glen Van Brummelen ISBN 9781400844807 download
No ratings yet
(Ebook) Heavenly Mathematics: The Forgotten Art of Spherical Trigonometry by Glen Van Brummelen ISBN 9781400844807 download
57 pages
Dongyang - Hydraulic Breakers
100% (1)
Dongyang - Hydraulic Breakers
16 pages
SHS Literature
No ratings yet
SHS Literature
4 pages
Topic 4.1 Worksheet (Answers)
No ratings yet
Topic 4.1 Worksheet (Answers)
3 pages
CHAPTER 38 Neurological and Cognitive Problems - Nanda - PPT
No ratings yet
CHAPTER 38 Neurological and Cognitive Problems - Nanda - PPT
24 pages
IIT-MBPT Report
No ratings yet
IIT-MBPT Report
353 pages
Soal & Jawaban USBN Bahasa Inggris - KTSP Paket A
No ratings yet
Soal & Jawaban USBN Bahasa Inggris - KTSP Paket A
24 pages
14 - Waves General Waves and Wave Intensity - 14
No ratings yet
14 - Waves General Waves and Wave Intensity - 14
4 pages
Controller m70 4u e Datasheet20160815
No ratings yet
Controller m70 4u e Datasheet20160815
5 pages
General Assembly: United Nations
No ratings yet
General Assembly: United Nations
3 pages
RMP-D8: Operations Manual
No ratings yet
RMP-D8: Operations Manual
22 pages
ASRES DIRIBA Project Assignment
No ratings yet
ASRES DIRIBA Project Assignment
15 pages
ServiceNow CSA Dump
No ratings yet
ServiceNow CSA Dump
26 pages

Descriptive Statistics Tutorial 1 Solutions

Uploaded by

Descriptive Statistics Tutorial 1 Solutions

Uploaded by

Introduction to Biostatistics

PUB HLTH 7074 / 7074UAC / 4274

Descriptive Statistics: Tutorial 1 Solutions

1. Using summation symbol 

x = {2, 2, 5.5, -1, 0, 7, 1, 12} for example.

Using the data above, calculate the following:

2. Using more than one summation symbol

Using the following data:

x = {2, -3, 6, -1, 0, 7, 1, 12}

calculate the following quantities:

4. Using what we know to calculate a mean and a variance

x = { 120, 130, 125 }, where x  {x1 , x2 , x3 } .

(b) Heartbeats per minute for a newborn baby.

(c) Time taken to complete an operation.

(d) Colour of hair for students enrolled in Introduction to Biostatistics.

Sample mean x = 3.10 kg

Sample median = 3.30 kg

Sample mode = 3.60 kg and 3.80 kg

Sorted from smallest to largest, the birth weight values are:

Index (i) Birth weight value

For males, x = 3.40 kg and variance = 0.16 kg2.

You might also like