0% found this document useful (0 votes)
12 views

HW1

The document outlines Homework 2 for IEDA 2540, due on February 21, and consists of various statistical problems involving data analysis. Topics include calculating means, medians, modes, variances, standard deviations, and detecting outliers across different datasets. Additionally, it includes optional practice questions on graphical representations and percentiles.

Uploaded by

aiden050917
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
12 views

HW1

The document outlines Homework 2 for IEDA 2540, due on February 21, and consists of various statistical problems involving data analysis. Topics include calculating means, medians, modes, variances, standard deviations, and detecting outliers across different datasets. Additionally, it includes optional practice questions on graphical representations and percentiles.

Uploaded by

aiden050917
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

IEDA 2540 Homework 2 Due Feb 21, end of day

1. (20 points) Weights of Apples in an Orchard. An orchard manager sampled 12 apples to


assess the uniformity of the produce. The weights (in grams) are recorded as:
148, 152, 150, 149, 151, 155, 150, 153, 152, 150, 154, 151.
Answer the following:
(a) Compute the sample mean of the apple weights.
(b) Determine the sample median.
(c) Find the sample mode.
(d) Compute the sample midrange (defined as min +2 max ).
2. (30 points) Annual Rainfall in a Region. A meteorological study recorded the annual
rainfall (in millimeters) for 25 cities in a region. The data are:
500, 750, 620, 480, 950, 870, 1100, 400, 680, 720, 560, 490,
1030, 860, 750, 680, 600, 920, 710, 830, 1050, 990, 650, 780, 800.
Answer the following:
(a) Find the range of the rainfall amounts.
(b) Compute the sample variance and sample standard deviation. (Calculator assistance
is allowed.)
(c) Determine the inter-quartile range (IQR).
(d) Compute the sample coefficient of variation (CV).
3. (20 points) Outlier Detection in Exam Scores. A teacher wishes to check for outliers in
three sets of exam scores (on a 100-point scale) using the 1.5 × IQR rule.
Note: For the quartiles, you may use any method you prefer. If you learn something that
differ from our formula, you can use it. However, please make sure to include all steps in your
calculation, otherwise you will not get full credit.
(a) Data Set (a): 78, 85, 92, 88, 91, 87, 73.
(b) Data Set (b): 55, 60, 62, 59, 58, 95.
(c) Data Set (c): 45, 47, 48, 52, 53, 75, 80, 46, 50.
4. (20 points) Daily Step Counts of Individuals. A fitness study recorded the number of
steps taken in one day by 30 individuals. The data (in steps) are:
11850, 12120, 11930, 11780, 12250, 12000, 12340, 11990, 11670, 12100,
11800, 12030, 12210, 11750, 11960, 12090, 12180, 11820, 11900, 12270,
11790, 12040, 12150, 11880, 11980, 12020, 12110, 11920, 12050, 11890.
Answer the following:
(a) Compute the sample mean and sample standard deviation of the step counts.
(b) Count how many individuals’ step counts fall within 1, 2, and 3 standard deviations of the
mean.
5. (10 points) Graphical Representations for Daily Step Counts. Using the data from
Question 4, construct a stem-and-leaf diagram and describe a histogram.
IEDA 2540 Homework 2, Page 2 of 3

Optional Practice questions


1. Analysis of Exam Scores: Boxplot and Percentiles. A statistics instructor recorded the
scores (out of 100) of 15 students on a recent exam. The data (already arranged in increasing
order) are:
55, 62, 68, 70, 72, 75, 76, 78, 80, 82, 85, 88, 90, 92, 95.
Answer the following:
(a) Compute the five-number summary (minimum, first quartile Q1 , median, third quartile
Q3 , and maximum) of the data.
(b) Estimate the 10th percentile and the 90th percentile of the data. (Hint: Use the
formula Pk = data value at position (k/100)(n + 1) with interpolation if needed.)
(c) Compute the interdecile range (the difference between the 90th and 10th percentiles)
and briefly comment on the spread of the middle 80% of the scores.
(d) Describe how you would construct a boxplot using the five-number summary.
2. Ogive and Percentile Estimation from Grouped Data. A researcher collected data on
the number of hours per week spent on leisure reading by a sample of 50 individuals. The data
were grouped into the following intervals with corresponding frequencies:

Hours (Interval) Frequency


0–2 8
2–4 15
4–6 12
6–8 10
8–10 5

Answer the following:


(a) Construct the cumulative frequency distribution for the data.
(b) Explain briefly how you would use the cumulative frequency data to draw an ogive.
(c) Use the ogive method to estimate (a) the median (50th percentile) and (b) the 75th
percentile of the leisure reading hours.
Note: An ogive, also known as a cumulative frequency graph, is a line graph that repre-
sents the accumulation of data points as you move through the range of a dataset. It is
constructed by plotting the upper boundary of each class interval on the horizontal axis
against the corresponding cumulative frequency on the vertical axis. This graph provides
a visual summary of the data, making it easy to estimate medians, quartiles, and per-
centiles, as well as to understand the overall distribution and spread of the dataset. To
determine the median (the 50th percentile), one simply locates the point on the vertical
axis that corresponds to half the total frequency and then draws a horizontal line to in-
tersect the ogive. The point where this line meets the curve can be traced vertically down
to the horizontal axis, providing an estimate of the median value. Similarly, any desired
percentile can be estimated by identifying the cumulative frequency that represents that
percentile (for example, 25% for the first quartile or 75% for the third quartile), drawing
the corresponding horizontal line, and reading the value from the horizontal axis where
the line intersects the curve.
IEDA 2540 Homework 2, Page 3 of 3

3. Distances of the Brightest Stars. The distances (in light-years) from Earth for the brightest
stars with distances less than 100 light-years are given by

8.6, 36.7, 42.2, 16.8, 33.7, 77.5, 87.9, 4.4, 25.3, 11.4, 65.1, 25.1, 51.5.

Answer the following:


(a) Compute the sample mean.
(b) Compute the sample median.
(c) Compute the sample mode.
min + max
(d) Compute the sample midrange (defined as 2 ).

4. Fatalities in Passenger Vehicle Crashes. The number of people killed in each state in a
specific year is:

778, 309, 1067, 826, 218, 492, 193, 262, 730, 1185, 305, 123, 69, 451, 155,
450, 414, 981, 214, 130, 1110, 324, 705, 76, 205, 152, 65, 186, 712, 452, 875,
82, 2707, 1279, 390, 948, 343, 602, 951, 104, 985, 2080, 565, 875, 2786, 82,
793, 396, 620, 797.

Answer the following:


(a) Find the range of the data.
(b) Compute the sample variance and sample standard deviation (calculator assistance
is permitted).
(c) Compute the inter-quartile range (IQR).
(d) Compute the sample coefficient of variation (CV).

5. Outlier Detection. Examine each data set for outliers using the 1.5 × IQR rule.
(a) Data: 88, 72, 97, 84, 86, 85, 100.
(b) Data: 145, 119, 122, 118, 125, 116.
(c) Data: 14, 16, 27, 18, 13, 19, 36, 15, 20.

6. Serum Cholesterol Levels. The serum cholesterol levels (in mg/dL) of 30 individuals are:

211, 240, 200, 212, 256, 203, 231, 212, 201, 247, 237, 227, 255, 219, 204,
193, 187, 205, 210, 221, 249, 236, 204, 187, 206, 187, 200, 221, 192, 196.

(a) Compute the sample mean and sample standard deviation.


(b) Count the number of data values that fall within 1, 2, and 3 sample standard deviations
of the mean.
7. Graphical Representations. For the cholesterol data in Question 4, construct a stem-and-
leaf diagram and describe a histogram.

You might also like