Chapter 1: Descriptive Statistics: Example 1: Making Steel Rods
Chapter 1: Descriptive Statistics: Example 1: Making Steel Rods
1.1 Introduction
Consider a machine that makes steel rods for use in optical storage
devices. The specification for the diameter of the rods is 0.450.02
cm. The machine makes 1000 rods per hour (continuous flow
production). The engineer wants to be fairly certain that the
percentage of good rods is at least 90%; otherwise he will shut
down the process for recalibration.
Alloy A Alloy B
404 365
406 452
396 378
392 461
402 344
1
Alloy A
Alloy B
2
Example 4: Fatigue Tests for Aircraft Wheels
A machine is used to roll the wheel under the desired design load.
The times until failure for wheels randomly selected from the
production run (in thousands of km) are obtained. The data are
used to estimate time until failure for all wheels.
Digitization of oil fields: surface rock and soil type, seismic data
(creating shock waves that pass through hidden rock layers and
interpreting the waves that are reflected back to the surface),
satellite images, small core samples obtained by shallow drilling.
Statistical models used to find economically viable fields.
Generalization
Sample Population
3
Inferences - statements about the population based on the sample
data
1.4 Variables
Variables
4
1.5 Displaying Categorical Variables
A vertical bar erected over each category; the height of the bar is
the frequency or the percentage of observations in the category.
Percent
60
40
Females Males
Females
Males
Slices represent categories;
size of each slice corresponds
to the percentage for the
category
5
1.6 Describing Quantitative Variables
Sample Mean
x .
In compact notation: x
i
Sample Mode
7
(b) Measures of Spread
Sample Range
Range ignores all of the information between the largest and the
smallest values.
x x1
x1 x
Observations: x1, x2, , xn
Variance s2 is defined as
( x1 x )2 ( x2 x ) 2 ...( xn x ) 2
s
2
.
n 1
s 2
(x x )
i
2
.
Compact notation: n 1
Standard deviation s: s s
2
8
Properties of s:
Solution:
9
Sample Quantiles
Three useful quantiles are quartiles. The lower (or first) quartile has
p=25, the median (or second) quartile denoted by Q2 has p=50, and
the upper (or third) quartile has p=75.
M=median= Q2
n even n odd
10
Q1 = Median of the Lower Half
Q2 = Overall Median,
Q3 = Median of the Upper Half.
Q1 Q2 Q3
Example 10: Obtain the quartiles and IQR for the sample:
30 30 40 40 48 50 50 60 66 86 94 112
Solution:
11
1.7 Outliers
outlier
Q1 Q2 Q3
1.5 IQR 1.5 IQR
Solution:
12
1.8 Displaying Quantitative Variables
75 79 58 73 82 94
61 77 54 77 65 67
62 61 64 45 58 86
66 83 70 91 48 78
86 66 52 80 59 55
(a) Histograms
( )( )()( )(
13
Frequency Table:
40 50 60 70 80 90 100
14
9/30
7/30
2/30
40 50 60 70 80 90 100
Shapes of histograms
Shapes of histograms
Symmetric Skewed
15
4
Frequency
2
Symmetric
4
Frequency
Skewed Left
4
Frequency
Skewed Right
16
(b) Boxplots
Q3
IQR Q2
R Q1
The smallest observation
within 1.5 IQR from Q1
Skewed right 17
Skewed left Symmetric
For symmetric distribution: Mean=Median=Mode.
Example 13: Obtain the boxplot for the 30 exam scores. Repeat the
exercise with the score 94 replaced by 120.
(c) Scatterplots
Sales
Price
18
(d) Time Series Plots (line charts)
Variable
| | | | |
19
20