Univariate Data-Lesson Ppt

The document discusses various statistical concepts related to univariate data, including stem-and-leaf plots, measures of central tendency (mean, median, mode), and the importance of data representation. It highlights the implications of lead contamination in Flint River water and food supply disparities between low and high-income countries. Additionally, it emphasizes the role of central tendency in summarizing data and making predictions.

Uploaded by

karen
Copyright
© © All Rights Reserved

Univariate Data

Flint River
Corroded water pipes

Lead
Legionnaires’ disease
• A type of lung disease
Reflect and discuss
• How could just two samples change a conclusion about lead levels in
a river? Explain.

• How much evidence would you need to be convinced that water from
the Flint River is now safe to drink?

• What would it be like to find out that you and your family had been
drinking lead-contaminated water for six months?
Stem and leaf plots
Lead level
• Lead levels were measured at 20 water sources throughout Europe. The
following is a sample of the data collected.

23 17 9 12 14 24 23 12 8 4
20 15 12 8 28 15 11 10 5 22

Can you work in a group to present the data using a stem and leaf plot?
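One way to check a hand-drawn plot is with a short Python sketch. It assumes the tens digit is the stem and the units digit is the leaf; the function name `stem_and_leaf` is only an illustration, not something from the textbook.

```python
from collections import defaultdict

def stem_and_leaf(values):
    """Group each value into a stem (tens digit) and a leaf (units digit)."""
    plot = defaultdict(list)
    for v in sorted(values):          # sorting first keeps the leaves in order
        plot[v // 10].append(v % 10)
    return dict(plot)

# The 20 lead-level values from the slide
lead_levels = [23, 17, 9, 12, 14, 24, 23, 12, 8, 4,
               20, 15, 12, 8, 28, 15, 11, 10, 5, 22]

plot = stem_and_leaf(lead_levels)
for stem in range(min(plot), max(plot) + 1):
    leaves = " ".join(str(leaf) for leaf in plot.get(stem, []))
    print(f"{stem} | {leaves}")
print("Key: 1 | 2 means 12")
```

Every stem between the smallest and the largest is printed, even if it has no leaves, so that gaps in the data stay visible.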
Reflect and discuss
• What types of data (categorical, ordinal, discrete, continuous) can be
presented with a stem and leaf plot? Explain.

Categorical: birthday; favorite sport; hair color

Ordinal: customer satisfaction levels; economic status
Discrete:
Continuous: height
Reflect and discuss
• Why would a bar graph be less effective in representing this set of
data?
Working in pairs
• Represent the following sets of data on a back-to-back stem and leaf
plot. Be sure (make sure) to indicate (write) a key for the plot.
Male height (cm)    Female height (cm)
152                 133
154                 148
148                 132
161                 150
173                 125
159                 128
150                 131
161                 138
166                 119
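After drawing the plot by hand, a pair could compare it with the output of a sketch like the one below. It assumes a stem of hundreds and tens (so 148 has stem 14 and leaf 8) and places the male leaves on the left, reading outward from the stem. The helper name `leaves_by_stem` and the layout are illustrative choices only.

```python
def leaves_by_stem(values):
    """Map each stem (value // 10) to its sorted list of leaves (value % 10)."""
    groups = {}
    for v in sorted(values):
        groups.setdefault(v // 10, []).append(v % 10)
    return groups

# Heights in cm, copied from the table on the slide
male = [152, 154, 148, 161, 173, 159, 150, 161, 166]
female = [133, 148, 132, 150, 125, 128, 131, 138, 119]

m, f = leaves_by_stem(male), leaves_by_stem(female)
lowest = min(min(m), min(f))
highest = max(max(m), max(f))

rows = []
for stem in range(lowest, highest + 1):
    # Left-hand leaves are reversed so they grow outward from the stem
    left = " ".join(str(x) for x in reversed(m.get(stem, [])))
    right = " ".join(str(x) for x in f.get(stem, []))
    rows.append(f"{left:>9} | {stem} | {right}")

print("     Male | St | Female")
print("\n".join(rows))
print("Key: 8 | 14 | 8 means 148 cm (male) and 148 cm (female)")
```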
P164 individual work
• Create a back to back stem and leaf plot (remember to indicate a key)
Table a
Food is available for each person
living in a region (place)
• Crop supply: the number of kilocalories (measurement) consumed
(eaten) per capita (per person) per day
• A higher number means that more food is available for each person
living there.
p264
• Look at the key first, what does the key tell us?
• What is the purpose of having a key for a stem and leaf plot?
• Write down the lowest and highest crop supply values for low and
high income ($$$) countries.
Low income: 2020 & 2520
High income: 3000 & 3640
• Is there any overlap (the same or similar values) in crop
supply values between the two groups?
No
• Explain …

No, the lowest crop supply in high-income
countries is greater than the highest crop supply
in low-income countries.
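The comparison behind this answer can be written as a short check. This is a sketch using only the lowest and highest values quoted on the slide (2020 and 2520 for low income, 3000 and 3640 for high income); the function name `ranges_overlap` is made up for illustration.

```python
# (lowest, highest) crop supply values read from the plot
low_income = (2020, 2520)
high_income = (3000, 3640)

def ranges_overlap(a, b):
    """Two (lowest, highest) ranges overlap when each starts before the other ends."""
    return a[0] <= b[1] and b[0] <= a[1]

overlap = ranges_overlap(low_income, high_income)
gap = high_income[0] - low_income[1]   # distance between the two groups

print(overlap)  # False
print(gap)      # 480
```

The gap of 480 is the difference between the lowest high-income value and the highest low-income value.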
From the data, does it look as though low and high
income countries have equal access to food
supplies? Explain.
• Equal access (fairness): treating everyone the same

• Food supplies
HIV cases (AIDS)
Despite advances in treatments and
education, HIV continues to be an issue
(problem) worldwide. Stem-and-leaf
plots for the number of new HIV cases
in countries categorized as high and
low income are shown (show) in the
textbook, p265.
• What is the key for this stem and leaf plot?

• What conclusion(s) can you draw (say) based on the data?

• Justify (explain with reasons why) your answer.

• What questions do you have about the data that might influence
(change) your conclusion(s)?

• Why do you think there is such a difference between the number of new
HIV cases in the two regions (areas, countries)?

• How effective (good) is a stem-and-leaf plot for representing (showing) data
so that you can draw conclusions (have a better understanding of the
situation)?

• Explain
Practicing:
• Create stem-and-leaf plot
• Interpret the plot
• Draw conclusions
Analysing data: measures of central
tendency
• Mode: the mode tells us the value we are most likely to find when we
sample from a set of data.
• Investigation: what is the mode?
3, 5, 7, 8, 8, 10, 11, 12, 14. Mode is 8
10, 10, 10, 14, 17, 18, 24, 24, 26, 31. Mode is 10

Write down a rule in your maths book to determine (decide) the
mode of a set of data
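One possible rule, sketched in Python: count how often each value appears and report the value or values with the highest count. The function name `modes` is an illustrative choice, and it returns a list because some data sets have more than one mode.

```python
from collections import Counter

def modes(data):
    """Return every value that occurs with the highest frequency."""
    counts = Counter(data)
    highest = max(counts.values())
    return sorted(value for value, count in counts.items() if count == highest)

# The two data sets from the investigation
first = modes([3, 5, 7, 8, 8, 10, 11, 12, 14])
second = modes([10, 10, 10, 14, 17, 18, 24, 24, 26, 31])

print(first)   # [8]
print(second)  # [10]
```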
Work in pairs
• Why would the mode be a useful statistic to represent a set of data?
Explain.

• Is the mode always a value in the data set? Explain

• Some would say that, if a data set has no values that repeat, every
value is the mode. Does this make sense? Explain.
Median
• 3, 5, 7, 8, 8, 10, 11, 12, 14. Median: 8
• 10, 10, 10, 14, 17, 18, 24, 24, 26, 31, 35. Median: 18
• 12, 9, 5. Median: 9
• 3, 5, 2, 7, 18, 10, 14, 21, 15. Median: 10

Write down a rule in your maths book for finding the median of a set of
data
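One possible rule, sketched in Python: put the values in order, then take the middle one. When there is an even number of values, this sketch assumes the usual convention of averaging the two middle values. The last data set, `[1, 2, 3, 4]`, is a made-up example and is not from the slide.

```python
def median(data):
    """Middle value of the ordered data (mean of the two middle values if even)."""
    ordered = sorted(data)            # the data must be in order first
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2

# The four data sets from the slide
a = median([3, 5, 7, 8, 8, 10, 11, 12, 14])
b = median([10, 10, 10, 14, 17, 18, 24, 24, 26, 31, 35])
c = median([12, 9, 5])
d = median([3, 5, 2, 7, 18, 10, 14, 21, 15])
print(a, b, c, d)  # 8 18 9 10

# Made-up example with an even number of values
e = median([1, 2, 3, 4])
print(e)  # 2.5
```

The made-up example gives 2.5, which is not one of the data values.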
Reflect and discuss in your group
• Why would the median be a useful statistic to represent a set of data?
Explain

• Can there be more than one median? Explain.

• Is the median always a data value in the set? Explain using an example.
Practice
Percentage of overweight men    Frequency
62%                             2
63%                             3
64%                             2
65%                             3
66%                             2
67%                             6

What is the mode? 67%
What is the median? 65%
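These two answers can be checked with Python's standard `statistics` module. The sketch assumes the frequencies are 62%: 2, 63%: 3, 64%: 2, 65%: 3, 66%: 2 and 67%: 6, and it writes out each percentage as many times as its frequency before finding the mode and median.

```python
import statistics

# percentage of overweight men -> frequency
frequency = {62: 2, 63: 3, 64: 2, 65: 3, 66: 2, 67: 6}

# Expand the table into a plain list: 62, 62, 63, 63, 63, ...
values = [pct for pct, count in frequency.items() for _ in range(count)]

total = len(values)                      # 18 values in all
mode = statistics.mode(values)           # most frequent percentage
median = statistics.median(values)       # mean of the 9th and 10th values

print(total, mode, median)  # 18 67 65.0
```

With 18 values the median is the mean of the 9th and 10th values, and both of those are 65.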
Central Tendency
• Central tendency is a way to describe a set of data by identifying its
center or typical value. It helps us summarize a large amount of data
with a single representative number. The three main measures of
central tendency are: Mean (Average), Median and Mode
Why do we calculate central
tendency?
• To Summarize Data: Instead of looking at every number in a data set,
we can use central tendency to describe the overall trend with one
number.
• To Compare Data Sets: We can compare different groups (e.g., test
scores of two different classes) using measures of central tendency.
• To Make Predictions: Knowing the average (mean) or most common
value (mode) helps in making future decisions (e.g., estimating how
much time students usually take to finish a math test).
• To Identify Patterns: Central tendency helps in analyzing trends and
drawing conclusions in real-life applications like economics, sports, and
science.
Sentence Starters for Justification
• The best measure of central tendency for this data is
the (mean/median/mode) because…
• The (mean/median/mode) is a better representation because it is not
affected by…
• Since the data has (outliers/skewed distribution/equal distribution),
the best measure is…
• The (mean/median/mode) is more useful in this case because it
accurately reflects…
Which measure of central tendency best represents
the data? Justify your answer.

• Question: A teacher recorded the test scores of 10 students:
45, 50, 50, 60, 60, 60, 75, 80, 90, 100.
• The best measure of central tendency for this data set
is the median because it is less affected by extreme
values. The mean is 67, but it is pulled higher by the
highest scores (such as 100), making it less representative
of most students' scores. The mode is 60, which represents the
most common score, but it does not account for the
overall distribution of scores. The median, 60, is a better
representation because it falls in the middle of the data
and is not influenced by the high score of 100.
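The three values quoted in this answer can be checked with Python's standard `statistics` module. As an extra illustration that is not part of the original slide, the sketch also removes the score of 100 to show how much each measure moves.

```python
import statistics

scores = [45, 50, 50, 60, 60, 60, 75, 80, 90, 100]

mean = statistics.mean(scores)
median = statistics.median(scores)
mode = statistics.mode(scores)
print(mean, median, mode)  # 67 60.0 60

# Illustration only: drop the top score and compare again
without_top = scores[:-1]
mean_without = statistics.mean(without_top)
median_without = statistics.median(without_top)
print(round(mean_without, 2), median_without)  # 63.33 60
```

The mean falls from 67 to about 63.33 when the single score of 100 is removed, while the median stays at 60.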
Watch this video
• https://statisticsbyjim.com/basics/measures-central-tendency-mean-median-mode/
Kahoot
Range
• Often refers to how spread out something is
Go to p 279
• Find a partner and describe how to find the range in words

To calculate the range, you need to find the largest
observed value of a variable (the maximum) and
subtract the smallest observed value (the minimum).
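The same rule as a short Python sketch, applied to two data sets used earlier in the lesson (the lead levels and the test scores). The function name `value_range` is an illustrative choice.

```python
def value_range(data):
    """Range = largest observed value minus smallest observed value."""
    return max(data) - min(data)

lead_levels = [23, 17, 9, 12, 14, 24, 23, 12, 8, 4,
               20, 15, 12, 8, 28, 15, 11, 10, 5, 22]
scores = [45, 50, 50, 60, 60, 60, 75, 80, 90, 100]

lead_range = value_range(lead_levels)   # 28 - 4
score_range = value_range(scores)       # 100 - 45

print(lead_range)   # 24
print(score_range)  # 55
```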
