Comparing Two Distributions Using Box Plots
Comparing Two Distributions Using Box Plots
Mathematics
Lesson Plan
Box plots, which are sometimes called box-and-whisker plots, can Lesson
Explainer
be a good way to visualize differences among groups that have
been measured on the same variable. Lesson Playlist
Before we look at comparing data sets using box plots, however,
Lesson
let us remind ourselves of the key elements of a box plot.
Worksheet
https://www.nagwa.com/en/explainers/812192146073/ 1/18
9/27/22, 9:41 AM Lesson Explainer: Comparing Two Distributions Using Box Plots | Nagwa
https://www.nagwa.com/en/explainers/812192146073/ 2/18
9/27/22, 9:41 AM Lesson Explainer: Comparing Two Distributions Using Box Plots | Nagwa
Answer
In the box plot for tests taken in the morning, the median
score (the vertical line inside the box) sits above the axis
approximately at the 87.5 mark, whereas the median for the
scores of tests taken in the afternoon sits above about 75 on
the axis.
https://www.nagwa.com/en/explainers/812192146073/ 3/18
9/27/22, 9:41 AM Lesson Explainer: Comparing Two Distributions Using Box Plots | Nagwa
Box plots are often but not always drawn with the variable values
on the horizontal axis. Our next example has the
variable values
on the vertical axis.
Answer
To decide whether or not cats are more popular than dogs in
internet searches, we want to know, on average,
which pet
https://www.nagwa.com/en/explainers/812192146073/ 4/18
9/27/22, 9:41 AM Lesson Explainer: Comparing Two Distributions Using Box Plots | Nagwa
https://www.nagwa.com/en/explainers/812192146073/ 5/18
9/27/22, 9:41 AM Lesson Explainer: Comparing Two Distributions Using Box Plots | Nagwa
https://www.nagwa.com/en/explainers/812192146073/ 6/18
9/27/22, 9:41 AM Lesson Explainer: Comparing Two Distributions Using Box Plots | Nagwa
Answer
Part 1
Part 2
https://www.nagwa.com/en/explainers/812192146073/ 7/18
9/27/22, 9:41 AM Lesson Explainer: Comparing Two Distributions Using Box Plots | Nagwa
This means that 75% of rap tracks are less than 4.40 minutes
long and only 25% are longer than 4.40 minutes.
This means that 25% of heavy metal tracks are shorter than
4.40 minutes long and 75% of the tracks are longer than
4.40 minutes.
Answer
We can see that Ramy and Mona traveled similar ranges of
miles
at night. Ramy traveled between 2 and 17 miles
a
night. Mona traveled between 2 and 15 miles
a night.
https://www.nagwa.com/en/explainers/812192146073/ 9/18
9/27/22, 9:41 AM Lesson Explainer: Comparing Two Distributions Using Box Plots | Nagwa
If we look within each box, notice that the box to the left of
Ramy’s median is narrower than the box to the right of the
median. Also note that his left whisker is shorter than his
right whisker (i.e., the whiskers attached to the box, not
Ramy’s actual whiskers!). Both of these features tell us that
Ramy traveled shorter distances, less than
5 miles, 50% of the
https://www.nagwa.com/en/explainers/812192146073/ 10/18
9/27/22, 9:41 AM Lesson Explainer: Comparing Two Distributions Using Box Plots | Nagwa
Similarly, for Mona (as shown in the box plot below), 25% of
the time her distances were concentrated between 11 (her
median) and 12 (Q3)
miles. And 25% were between 12 (Q3)
and 15 miles, her maximum distance.
https://www.nagwa.com/en/explainers/812192146073/ 11/18
9/27/22, 9:41 AM Lesson Explainer: Comparing Two Distributions Using Box Plots | Nagwa
9 miles
(Q1) and 25% between 9 (Q1) miles
and her median,
11 miles.
Note
We have compared three different statistical measures in this
example: the shape, average and spread
of the data. When
looking at whether or not a data set is skewed (or
symmetric), we are looking at the shape of
the data. When
considering the median, we are talking about the “average”
and when looking at the range and
interquartile range, we
are considering the spread of the data.
https://www.nagwa.com/en/explainers/812192146073/ 12/18
9/27/22, 9:41 AM Lesson Explainer: Comparing Two Distributions Using Box Plots | Nagwa
Answer
In comparing the distributions, we will look at the “shape”,
“average,” and “spread” of each data set and compare. We
start with the “average” and the first thing to note that the
median WNBA score is around 85 and the median NBA
score is somewhat higher, at approximately 114. In general,
then, we can say that the NBA winning scores are higher
than those for the WNBA.
The next thing we look at is the spread of the two data sets,
using the range and the interquartile range (IQR). WNBA
scores range from a low of 70 to a high of 110, so their range
is 110 − 70 = 40 , whereas the lowest NBA score is 100 and
the highest,
approximately, is 128, with a calculated range of
128 − 100 = 28 . Hence, overall, the WNBA results have a
wider spread than those for the NBA.
https://www.nagwa.com/en/explainers/812192146073/ 13/18
9/27/22, 9:41 AM Lesson Explainer: Comparing Two Distributions Using Box Plots | Nagwa
https://www.nagwa.com/en/explainers/812192146073/ 14/18
9/27/22, 9:41 AM Lesson Explainer: Comparing Two Distributions Using Box Plots | Nagwa
For the WNBA data, the left whisker is shorter than the
right. This indicates that the data is a little right (or
positively) skewed. That is, the lower scores are more
concentrated within a narrower range of values than those in
the higher range. The box itself is symmetric on either side
of the median, however, so the data is not heavily skewed.
For the NBA data, the whiskers are approximately equal and
there is only a small lack of symmetry on either side of the
median inside the box, indicating that the data is more or
less symmetric about the center.
https://www.nagwa.com/en/explainers/812192146073/ 15/18
9/27/22, 9:41 AM Lesson Explainer: Comparing Two Distributions Using Box Plots | Nagwa
Key Points
When comparing the distributions of two data sets on the
same measurement using box plots, we can compare the
“shape”, “average,” and “spread” of the data sets.
https://www.nagwa.com/en/explainers/812192146073/ 16/18
9/27/22, 9:41 AM Lesson Explainer: Comparing Two Distributions Using Box Plots | Nagwa
Note
Box plots can be used to compare multiple data sets where
the values are measurements of the same variable. Also, box
plots can be drawn vertically as opposed to horizontally, as
illustrated below.
https://www.nagwa.com/en/explainers/812192146073/ 18/18