MATH30-6-Lecture-3
MATH30-6-Lecture-3
MATH30-6
Probability and Statistics
Objectives
At the end of the lesson, the students are expected to
• Define and differentiate various measures of describing
data;
• Describe a given set of data using various measures;
and
• Interpret values that arise from computation.
Measures of Describing Data
• Measure of Central Tendency
- Also known as Measure of Center, Measure of Central
Location
- Measure of finding the mean, median or mode of the
dataset
• Measure of Position
- Measure of finding the kth element of the distribution
- Also the quantiles or fractiles of distribution
Measures of Describing Data
• Measure of Variation
- Measure of how the data is distributed about the
mean.
• Measure of Shape
- Measure of the degree of symmetry of a distribution.
The Mean
• Most widely used parameter of describing a ratio
data.
• May be classified as
- Arithmetic mean
- Geometric mean
- Harmonic mean
- Trimmed mean
- Quadratic or Root Mean Square (RMS)
Arithmetic Mean
For Discrete Case
Sample mean
Population mean
Arithmetic Mean
Characteristics
• All values are used.
• It is unique.
• The arithmetic mean is the only measure of central
tendency where the sum of the deviations of each
value from the mean is zero.
• It is calculated by summing the values and dividing by
the number of values.
• Every set of interval-level and ratio-level data has a
mean.
• The mean is affected by unusually large or small data
values.
Arithmetic Mean
Arithmetic Mean
For Continuous Case
Arithmetic Mean
Examples:
Determine the average voltage of a signal that follows the
following waveforms.
1. ,
2. ,
3.
Weighted Mean
The weighted mean of a set of numbers with
corresponding weights is computed from the following
formula:
Weighted Mean
Example:
1. The Carter Construction Company pays its hourly
employees $16.50, $19.00, or $25.00 per hour. There
are 26 hourly employees, 14 of which are paid at the
$16.50 rate, 10 at the $19.00 rate, and 2 at $25.00
rate. What is the mean hourly rate paid of the 26
employees?
Geometric Mean
• Used in factors being multiplied to another quantity
• Examples of which are interest and discount rates,
amplification factors, being used.
Geometric Mean
Examples:
1. What is the geometric mean between the numbers 5,
9, 25, and 80?
Sample median
if is odd,
if is even.
The Median
Characteristics
• There is a unique median for each data set.
• It is not affected by extremely large or small values and
is therefore a valuable measure of central tendency
when such values occur.
• It can be computed for ratio-level, interval-level, and
ordinal-level data.
• It can be computed for an open-ended frequency
distribution if the median does not lie in an open-
ended class.
The Median
Example:
1. Find the median of
1.8, 2.1, 1.7, 1.6, 0.9, 2.7, and 1.8
The Mode
• The value of the observation that appears most
frequently
The Mode
Characteristics
• Used when you want to find the most
occurring/frequent score
• A quick approximate of the average
• An inspection average
• The most unreliable among the three measures
because its value is undefined in some observations
• The only measure of central location that can be used
for nominal data
• Usually used in polls
• If a distribution is said to have 2 modes, it is bi-modal,
if three, a tri-modal. Generally, multi-modal.
The Mode
Example:
1. At a certain polls, the following data were recorded:
1 − Yes, 2 – No, 0 – Undecided. What is the modal choice?
()
Measures of Location
• Quantiles (or Fractiles) are points taken at regular
intervals from the cumulative distribution function of a
random variable.
• Dividing ordered data into essentially equal-sized data
subsets is the motivation for -quantiles; the quantiles
are the data values marking the boundaries between
consecutive subsets.
• There are -quantiles, with an integer satisfying .
Measures of Location
Quartiles
• Dividing the dataset into 4 groups.
Deciles
• Dividing the dataset into 10 groups.
Percentiles
• Dividing the dataset into 100 groups.
Quartile
• Any of the three fractiles obtained by dividing the set
of data into four equal parts
Program
STATDISK 4.5 12.5 24.5
Minitab 3.75 12.5 26.25
Excel 5.25 12.5 22.75
TI-83 Plus 4.5 12.5 24.5
Measures of Location
By Interpolation
• Quartile – One fourth
First (1/4), Second (1/2), Third (3/4)
Quartile locator (Lq):
• Decile – One tenth
10%, 20%, …, 90%
Decile locator (Ld):
• Percentile − One hundredth
1%, 2%, …, 99%
Percentile locator (Lp):
Measures of Location
Examples:
1. Consider the observations 11, 14, 17, 23, 27, 32, 40,
49, 54, 59, 71, and 80. What is the 29th percentile?
Hint: Solve by interpolation.
Locator ()
Measures of Location
2. The magazine Forbes publishes annually a list of the
world’s wealthiest individuals. For 2007, the net worth
of the 20 richest individuals, in billion of dollars, in no
particular order, is as follows:
18, 18, 18, 18, 19, 20, 20, 20, 21, 22, 22, 23, 24, 26, 27, 32,
33, 49, 52, 56
Females
Measures of Variation
Example:
where:
standard deviation ( or )
Player ’s record of assists and points in Game 1:
A 7 10 9 1 5 3 4 7 9 4
P 25 25 30 22 23 22 16 35 20
• Kurtosis
- The degree of peakedness exhibited by the distribution
- Computed as the fourth degree moment from the
mean
Skewness
Pearsonian Coefficient of Skewness (Pearson’s Coefficient
of Skewness)
Interpretation of values:
1. Sk < 0, “negatively skewed” or “skewed to the left”
2. Sk = 0, symmetrical
3. Sk > 0, “positively skewed” or “skewed to the right”
Skewness
• A measure of the asymmetry of the frequency distribution
Leptokurtic Platykurtic
Mesokurtic
Kurtosis
Moment Based Coefficient of Kurtosis
Kurtosis
Absolute Kurtosis