Lecture Note (Basic Statistics Acc & Fina)
Lecture Note (Basic Statistics Acc & Fina)
(Stat 2131)
October, 2023
Course Outline
1. Introduction to Statistics
What is Statistics?
Descriptive Versus Inferential Statistics
Types of Variables and Scales of Measurement
Statistics in Business Decision
2. Visual Description of Data
The frequency Distribution and the Histogram
The Stem and Leaf Display and the Dot Plot
Other Methods for Visual Representation of Data
Bar chart, Line Graph, Pictogram, Pie Chart
The scatter Diagram
Tabulation and Contingency Table
3. Statistical Description of Data
Statistical Description: Measures of Central Tendency
Statistical Description: Measures of Dispersion
Descriptive Statistics from Grouped Data
Statistical Measures of Association
7
1.1 Introduction
Definition of Statistics
Plural form
numerical facts and figures collected for a certain purposes
Singular form
systematic collection and interpretation of numerical data to make a decision
8
Classification of Statistics
Descriptive Statistics
Mainly concerned with the methods and techniques used in collection,
The sample shows 40% of year I students have positive attitude toward the
delivery of lectures.
10
Classification of Statistics …
Inferential Statistics
Deals with the method of inferring or drawing conclusion about the
Utilizes sample data to make decision for entire data set based on sample
11
Definition of Some Basic Statistical Terms
Data
a collection of related facts and figures from which conclusions may be
drawn
Population/target population
a totality of things, objects, peoples, etc about which information is being
collected
12
Definition of Some Basic Statistical Terms
Sample
part of a population selected to draw conclusions about the population
Subset of a population
Population
Sample
Census
a complete enumeration of the population. But in most real problems it
cannot be realized, hence we take sample.
13
Definition of Some Basic Statistical Terms
Statistic
A value computed from the sample, used to describe the sample.
Parameter
A descriptive measure (value) computed from the population.
Variable
is a characteristic or attribute that can assume different values.
Sampling frame
A list of people, items or units from which the sample is taken.
14
Stages in Statistical Investigation
15
Stages in Statistical Investigation
1. Data Collection
The processes of measuring, assembling and gathering data
16
Stages in Statistical Investigation …
2. Data Organization
It is a stage where we edit our data
The collected data involve irrelevant figures, incorrect facts, omission and
mistakes
3. Data Presentation
The organized data can now be presented in the form of tables, diagram and
graphs.
17
Stages in Statistical Investigation …
4. Data Analysis
Study the data to draw conclusions about the population parameter
5. Data Interpretation
Draw valid conclusions from the results obtained through data analysis
18
Uses and Limitations of Statistics
Uses of Statistics
Condenses and summarizes complex data
19
Uses and Limitations of Statistics …
Limitations of Statistics
Statistics doesn’t deal with single (individual) values rather it deals with
aggregate values
the subject
20
Applications in Business and Economics
Accounting
Public accounting firms use statistical sampling procedures
when conducting audits for their clients.
Economics
Economists use statistical information in making forecasts
about the future of the economy or some aspect of it.
Marketing
Electronic point-of-sale scanners at retail checkout counters
are used to collect data for a variety of marketing research
applications.
Finance
Financial advisors use price-earnings ratios and dividend yields
to guide their investment recommendations
21
Scales of Measurment
A variable in statistics is any characteristic, which can take on different
values for different elements when data are collected
22
Scales of Measurement
Measurement “is assigning numbers to objects, events, or abstract
concepts according to a known set of rules”
Ordinal Scale
Interval Scale
24
Scales of Measurement …
Interval Scales of Measurement
A measure of order and quantity
25
Sources of Data
Primary data
data measured or collect by the investigator or the user directly from the source
the data you collect is unique to you and your research and, until you publish, no one
The primary sources of data are objects or persons from which we collect the
Secondary data
second-hand information and data or information that was either gathered by
someone else
26
Sources of Data
27
Methods of Data Collection
Planning to data collection requires
Identify source and elements of the data
28
Methods of Data Collection
29
Methods of Data Collection
of standard procedure.
Provides accurate information but it is expensive and inconvenient.
Example: laboratory tests, clinical measurements and physical
examination etc.
30
Methods of Data Collection
Interviewers: are individuals those who are recorded the responses given by the
respondents.
31
Methods of Data Collection
a) Face to Face Interviews (questionnaires in charge of enumerators)
Advantages
The interviewer can help the respondent if he/she has difficulty in understanding
the questions. The difficulty could be due to language, concentration or limited
intellectual capacity.
There is more flexibility in presenting the items; they can range from closed to
open.
Skip patterns means skipping a questions or a group of questions which are not
32
applicable.
Methods of Data Collection
a) Face to Face Interviews (questionnaires in charge of enumerators)
Disadvantages
Advantages
• It is less expensive in time and money compared with face to face interviews.
• The interviewer is able to help the respondent if he/she doesn’t understand the
question (as seen with face to face interview)
• Broad representative samples can be obtained for those who have telephone lines.
Disadvantage
Advantages
Disadvantage
Secondary data may help to clarify or redefine the definition of the problem as part of the exploratory
research process.
Provides a larger database as compared to primary data
Time saving
Does not involve collection of data
It is difficult to get information needed, when records are compiled in unstandardized manner.
37
Methods of Data Presentation/Visualization
The major objectives of data presentation are
To presenting data in visual display and more understandable
Diagrams, and
Graphs
38
Methods of Data Presentation …
Tabular presentation of data
Tables are important to summarize large volume of data in more
understandable way.
Tables can be
Simple (one way table): table which present one characteristics for example age
distribution.
Two way table: it presents two characteristics in columns and rows for example
age versus sex.
A higher order table: table which presents two or more characteristics in one
table.
39
Methods of Data Presentation …
Frequency Distribution
It is the organization of raw data in table form, using classes and frequencies.
40
Methods of Data Presentation …
Categorical Frequency Distribution
The categorical frequency distribution is used for data which can be placed
A B C D
Class Tally Frequency Percent
41
Methods of Data Presentation …
Example: Data on smoking status by gender of a sample of 20 health workers in
Jimma Hospital 1986 E.C was given. Construct categorical frequency
distribution.
Observation 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
Gender M F M M F F F M M M F F F F M F M F M M
Smoking Y N N Y N N Y N N N N N N Y Y Y N N Y Y
status
Characteristics Tally Frequency
Gender
Male //// //// 10
Female //// //// 10
Smoking status
No //// //// // 12
Yes //// /// 8
42
Methods of Data Presentation …
Ungrouped Frequency Distribution
It is the distribution that use individual data values along with their
frequencies.
often constructed for small set of data on discrete variable (when data are
The major components of this type of frequency distributions are class, tally,
43
Methods of Data Presentation …
Example: Age in year of 20 women who attended health education at Jimma
Health center in 1986 are given as follows. Construct ungrouped frequency
distribution
30 25 23 41 39 27 41 24 32 29 29 35 31 36 33 36 42
35 37 41
Age(xj) 23 24 25 27 29 30 31 32 33 35 36 37 39 41 42
Tally / / / / // / / / / // // / / /// /
Frequency(f) 1 1 1 1 2 1 1 1 1 2 2 1 1 3 1
44
Methods of Data Presentation …
the data must be grouped in which each class has more than one unit in
width.
We use when the range of the data is large, and for data from continuous
variable.
45
Methods of Data Presentation …
Guidelines for classes
There should be 5 to 20 classes. Determine using Sturge’s rule
K = 1+ 3.32 log n
Classes should be continuous.
Range R
W= =
Number of classes K
46
Methods of Data Presentation …
Class limit (CL)
It separates one class from another.
have gaps between the upper limits of one class and the lower limit of the next class.
Class boundary(CB)
Separate one class in a grouped frequency distribution from the other.
The boundary has one more decimal place than the raw data.
There is no gap between the upper boundaries of one class and the lower boundaries
47
Methods of Data Presentation …
Unit of measurement (U)
This is the possible difference between successive values. E.g. 1, 0.1, 0.01 …
The class width is also the difference between the lower limit or upper limits of two
consecutive classes.
sum by two.
48
Methods of Data Presentation …
Steps to construct grouped frequency distribution
Find smallest (S) and largest (L) values in your data
Take the smallest value as the first class lower class limit, and add class width to get consecutive
lower class limits
To get upper class limit subtract unit of measurement from second class lower class limit, and add
class width to get remaining upper class limits
Subtract half of unit of measurement from lower class limit to get class boundary, and add half of
unit of measurement to upper class limit to get upper class boundary
Tally data
49
Methods of Data Presentation …
Example: Age in year of 20 women who attended health education at Jimma
Health center in 1986 are given as follows. Construct grouped frequency
distribution
30 25 23 41 39 27 41 24 32 29 29 35 31 36 33 36 42
35 37 41
n=20
k=1+3.322(log20) =1+3.322(1.3010) = 5.196 k=6
w= (42-23)/5=4
The grouped frequency table using Sturges formula
50
Methods of Data Presentation …
Diagrammatic and Graphic presentation of the data
One of the most effective and interesting alternative way in which a statistical data may be
presented is through diagrams and graphs.
There are several ways in which statistical data may be displayed pictorially such as
different types of graphs and diagrams.
Pie chart
Bar chart
Pictogram
Histogram
Line Graph
Dot Plot
pie chart.
Component part
Angle of sec tor = 3600
Total
These angles are made in the circle by mean of a protractor to show different
components.
52
Methods of Data Presentation …
Pie Chart (Example)
The following table gives the details of quarterly sale of a Sport Wear
Month Profit($,000,000)
1st quarter 100
2nd quarter 300
3rd quarter 500
4th quarter 600
Total 1500
53
Methods of Data Presentation …
Pie Chart (Example)
Quarter Angle of sector Percen
Profit($,000,000)
(in degrees) t (%)
1st quarter
7%
2nd quarter
33%
54
Methods of Data Presentation …
Bar Chart
Use vertical or horizontal bins to represent the frequencies of a distribution.
While we draw bar chart, we have to consider the following two points.
Make the units on the axis that are used for the frequency equal in size
55
Methods of Data Presentation …
Simple Bar Chart
Used to represents data involving only one variable classified on spatial,
56
Methods of Data Presentation …
Multiple Bar Chart
When two or more interrelated series of data are depicted by a bar diagram
Example: Suppose we have export and import (in million) figures for a
80
60
40 Export
20 Import
0
2010 2011 2012
57
Methods of Data Presentation …
Stratified/Stacked Bar Chart
used to represent data in which the total magnitude is divided into
different or components.
First make simple bars for each class taking total magnitude in that class
and then divide these simple bars into parts in the ratio of various
components
58
Methods of Data Presentation …
Stratified/Stacked Bar Chart
The table below shows the profit of a company ($ Millions) from different
item sales in 1st quarter of the year. Draw stratified/stacked bar chart
Company Shoe T-shirt Ball Total
X 30 50 40 120
Y 33 16 27 76
Z 37 13 37 87
140 Ball
120 T-shirt
Sales in $,000,000
100 40 Shoe
80
37
60 27
50
40 16 13
20 30 33 37
0
X Y Z
Company 59
Methods of Data Presentation …
Deviation Bar Chart
Used when the data contains both positive and negative values such as data
commodity.
Commodity Net profit
Soap 80
Sugar -95 Net profit
Coffee 125 150
100
50 Soap
0 Sugar
Soap Sugar Coffee
-50 Coffee
-100
-150
60
Methods of Data Presentation …
Pictogram
Is a figure that represents something using an image or illustration.
Pictographs are often used in writing and graphic systems in which the
61
Methods of Data Presentation …
Histogram
Histogram is a special type of bar graph in which the horizontal scale
represents classes of data values and the vertical scale represents frequencies.
The height of the bars correspond to the frequency values, and the drawn
represent frequencies.
62
Methods of Data Presentation …
Histogram
A histogram shows the shape of continuous data, checks for homogeneity, and
To construct a histogram, we split the range of data into equal intervals, “bins,”
63
Methods of Data Presentation …
Line Graph
Compare how several things change over time relative to each other.
Example: Line graph showing sales (in thousands Birr) over six months
64
Methods of Data Presentation …
Stem and Leaf Display
Uses place value to organize data
65
Methods of Data Presentation …
Stem and Leaf Display
How to Draw One:
1. Put the first digits of each piece of data in numerical order down the left-
hand side
2. Go through each piece of data in turn and put the remaining digits in the
proper row
3. Re-draw the diagram putting the pieces of data in the right order
4. Add a key
66
Methods of Data Presentation …
Stem and Leaf Display
Example: the following are scores of students:
67
Methods of Data Presentation …
Dot Plot
Is a simple form of data visualization that consists of data points plotted as
dots on a graph with an x- and y-axis.
These types of charts are used to graphically depict certain data trends or
groupings.
68
Methods of Data Presentation …
Dot Plot
Example: This data set gives pulse rates, in beats per minute, for a group
of 30 students.
68 60 76 68 64 80 72 76 92 68 56 72 68 60 84 72 56 88 76 80
68 80 84 64 80 72 64 68 76 72
69
Methods of Data Presentation …
The scatter Diagram
A scatter graph is a graph using paired data that can be used to find out
paired data is two separate pieces of data referring to the same thing
70
Methods of Data Presentation …
The scatter Diagram
Example: 10 students sat both a Math and a Stat exam, here are their scores:
Subj Stud 1 Stud 2 Stud 3 Stud 4 Stud 5 Stud 6 Stud 7 Stud 8 Stud 9 Stud 10
Math 56 24 67 70 71 42 48 32 52 80
Stat 65 38 71 72 73 51 56 42 57 82
71
3. Statistical Description of Data
72
3.1. Measures of Central Tendency
A measure of central tendency is a descriptive statistic that describes the
average, or typical value of a set of scores.
Typical value
(Center of data)
73
Types of measures of central tendency
it should be defined rigidly which means that it should have a definite value
74
Types of measures of central tendency
Median
Mode
75
The Summation Notation
Also called Sigma notation
Let X is a variable
n ending point/
X
Upper limit of
the summation
i
i =1
Summation
notation
Xi is the index of
summation, each
starting point/
term of the sum
Lower limit of
the summation
(index of the
summation)
76
The Summation Notation..
Properties of summation notation
n
X
i =1
i = X1 + X 2 + + X n
XY
i =1
i i = X 1Y1 + X 2Y2 + + X nYn
i 1 2
X 2
i =1
= X 2
+ X 2
+ X 2
n
n n
CX
i =1
i = C X i = CX 1 + CX 2 + + CX n
i =1
77
The Mean
Mean is the most commonly used measure of central tendency. There are
different types of mean
Arithmetic mean,
Weighted mean,
78
The Arithmetic Mean
It is computed by adding all the values in the data set divided by the number of
observations in it.
X i
X= i =1
fX i i
X= i =1
n
If we have frequency distribution (grouped) mean is given by the formula
n LCB/UCB is lower/upper class boundary
fm i i
LCBi + UCBi
X= i =1
, where mi =
n 2
79
The Arithmetic Mean …
Example 1: The following data is the weight (in Kg) of eight youths:
32,37,41,39,36,43,48 and 36. Calculate the arithmetic mean of their weight.
(Ans:312/8=39 )
81
Properties of Arithmetic Mean …
It can be computed for any set of numerical data, it always exists, and unique.
The sum of deviations of the observations about the mean is zero i.e.
The sum of squares of deviations of all observations about the mean is the minimum
If a constant is added to all observations, the new mean is old mean plus constant
If all observations are multiplied by a constant, the new mean is the multiple of the constant and old
mean
If wrong value is recorded and latter on it is discovered, the new corrected mean is
X corr= X wrong+
(X corr − X wrong )
n
82
Weighted Mean
Weighted mean is calculated when certain values in a data set are more
important than the others.
w x i i
Xw = i =1
k
w
i =1
i
Example: CGPA of a students (each result is weighted by credit of a course) [Ans: 2.88]
83
Geometric Mean
It is defined as the arithmetic mean of the values taken on a log scale.
Solution:
Example 2: The man gets three annual raises in his salary. At the end of first year,
he gets an increase of 4%, at the end of the second year, he gets an increase of 6%
and at the end of the third year, he gets an increase of 9% of his salary. What is the
average percentage increase in the three periods?
Solution:
85
Properties of geometric mean
negative.
zero.
Harmonic Mean
Another important mean is the harmonic mean, which is suitable measure of
central tendency when the data pertains to speed, rates and price.
Note: SHM is used for equal distances, equal costs and equal rates.
87
Harmonic Mean
Example 1: A motorist travels for three days 480 km at each day. On the first day
he travels 10 hours at a rate of 48 km/h, on the second day 12 hours at a rate of 40
km/h, on the third day 15 hours at a rate of 32 km/h. What is the average speed?
Solution: Since the distance covered by the motorist is equal
( ), so we use SHM.
Solution: since the distance covered by the driver is not equal, so we use
WHM by taking the distance as weights (wi).
(480+540+600)
𝑤. ℎ. 𝑚 = 480 540 600 =43.78
+ +
48 45 40
Properties of harmonic mean
It is based on all observation in a distribution.
Used when a situations where small weight is give for larger observation
and larger weight for smaller observation
If all the values in a data set are the same, then all the three means (arithmetic
mean, GM and HM) will be identical.
As the variability in the data increases, the difference among these means also
increases.
Arithmetic mean is always greater than the GM, which in turn is always greater
than the HM.
AM > GM > HM
91
Median
If the sample data are arranged in increasing order, the median is
if n is an odd number, median is middle value
Example: systolic blood pressure of seven persons were given as 113, 124, 124,
132, 146, 151, and 170. what is the median systolic blood pressure? (Ans: 132)
Six men with high cholesterol participated in a study to investigate the effects of diet
on cholesterol level. At the beginning of the study, their cholesterol levels (mg/dL)
were as follows:366, 327, 274, 292, 274 and 230. what is the median cholesterol
level? (Ans:283)
92
Median …
If the data is in ungrouped frequency distribution, median is the class with largest
less than cumulative frequency smaller than or equal to half of the total observation
Example: Forty five students were taken to field and evaluated their performance using 60m
pure speed test. The time is recorded in seconds, and the result is summarized in the table. What
is the median performance of these students. (Ans: 19 secs)
93
Median …
If the data is in grouped frequency distribution, median is
Example: fifty students were taken to field and evaluated their performance using 100 m
pure speed test. The time is recorded in seconds, and the result is summarized in the table.
What is the median performance of these students. (Ans: 20.81 secs)
There can be only one mode-unimodal Eg: 25, 27, 22, 25,18
There can be two mode-bimodal Eg: 25, 27, 22, 27, 25, 18, 20
There can be more than two mode-multimodal Eg: 25, 27, 22, 27, 25, 18, 20, 19, 22, 17
95
Mode…
The most frequent observation (value) in a data
Example: Twenty five amateur cyclists were taken to field and their time is
recorded to complete a given distance. The time is recorded in seconds, and
the result is summarized in the table. What is the modal time to complete the
distance. (Ans: 29.5 secs)
96
2.3 Quantiles
Quartiles are three points which divide an array into four parts in
such a way that each portion contains an equal number of
elements.
First quartile (Q1) 25% of the observations lies below or equal to it
b)
13 23.5 39
98
Quantiles
The ith quartile for grouped frequency distribution is
99
Quantiles …
Deciles are nine points which divide an array into 10 parts in such
a way that each part contains equal number of elements.
The nine deciles are denoted by D1, D2, …, D9
Second decile (D2) 20% of the observations lies below or equal to it etc
100
Quantiles …
101
Quantiles …
Example:- The following frequency distribution is the score of 25 students.
Score Number
of
students Compute the following quantities
25-29 1 ● First quartile (Ans:44.92)
30-34 1
●Ninth decile (Ans:65.75)
35-39 1
●forty fifth percentile (Ans:51.38)
40-44 3
45-49 3 Remark:
50-54 6
Q1 = P25
55-59 4
Q2 = D5 = P50 = Median
60-64 3
65-69 2 Q3 = P75
70-74 1 D1 = P10 ; D2 = P20 ;; D9 = P90
102
3.2. Measures of Dispersion
103
Introduction
Central tendency measures do not reveal the variability present in the data.
all the data are the same and increases as the data become more diverse.
104
Introduction…
Properties of a good measures of dispersion
It should be rigidly defined
105
Introduction…
There are many types of dispersion measures
Range /Relative Range (Coefficient of range)
106
Range (R)
Range is the difference between two extreme values in a data
Denoted by R
R = max − min
107
Relative Range (RR)
Relative range is the ratio of the difference and sum of the two
extreme values in a data
Denoted by RR/CR
max− min
RR =
max+ min
108
Properties of range
= Q3 - Q1
The SIR is often used with skewed data as it is insensitive to the extreme
scores
110
Coefficient of Quartile Deviation
The ratio of the difference to sum of the two extreme quartiles of a
data. Denoted by CQD
Q3 − Q1
CQD =
Q3 + Q1
111
Properties of IQR
x −x i
MAD = i =1
113
Coefficient of Mean Deviation (CMD)
MAD
CMD =
x
All values are used in the calculation.
114
Solution
Step 2 Step 3
4 37.6 -4 4
units.
Therefore standard deviation is more natural since it recovers the original units.
117
In general, the sample variance is computed by:
𝑛 𝑛
𝑖=1(𝑥𝑖 − 𝑥 )2 2
𝑖=1 𝑥𝑖 − 𝑛𝑥
2
= . → 𝑓𝑜𝑟 𝑟𝑎𝑤 𝑑𝑎𝑡𝑎.
𝑛−1 𝑛−1
𝑘 2 𝑘 2 2
𝑖=1 𝑓𝑖 (𝑥𝑖 − 𝑥 ) 𝑖=1 𝑓𝑖 𝑥𝑖 − 𝑛𝑥
𝑠2 = 𝑘 = . → 𝑓𝑜𝑟 𝑢𝑛𝑔𝑟𝑜𝑢𝑝𝑒𝑑 𝑑𝑎𝑡𝑎.
𝑓
𝑖=1 𝑖 − 1 𝑛 − 1
𝑘 2 𝑘 2 2
𝑖=1 𝑓𝑖 (𝑚𝑖 − 𝑥 ) 𝑖=1 𝑓𝑖 𝑚𝑖 − 𝑛𝑥
𝑘 = . → 𝑓𝑜𝑟 𝑔𝑟𝑜𝑢𝑝𝑒𝑑 𝑑𝑎𝑡𝑎.
𝑓
𝑖=1 𝑖 − 1 𝑛 − 1
Standard Deviation
One of the most useful measures of dispersion is the standard deviation.
119
Example 1: Compute the variance for the sample: 5, 14, 2, 2 and
17. 𝑛 𝑛
Solution: 𝑛 =5, 𝑥𝑖 = 40, 𝑥 = 8 , 𝑥𝑖 2 = 518 .
𝑖=1 𝑖=1
𝑛 2
𝑖=1 𝑥𝑖− 𝑛𝑥 2 518 − 5 𝑥 82
𝑠2 = = = 49.5. , 𝑆 = 49.5 = 7.04.
𝑛−1 5−1
2
31376 − 20 𝑥 (39.4)2
𝑥 = 39.4 , 𝑠 = = 17.31. , 𝑆 = 17.31 = 4.16.
19
Properties of Variance
The variance is always non-negative ( 𝑠 2 ≥ 0).
s
CV = 100%
x
All values are used in the calculation.
The actual value of the CV is independent of the unit in which the measurement has been
For comparison between data sets with different units or widely different means, one
122
Coefficient of Variation
Example: Last semester, the students of Biology and Chemistry Departments took
Stat 273 course. At the end of the semester, the following information was
recorded.
Department Biology Chemistry
Mean score 79 64
Standard deviation 23 11
Compare the relative dispersions of the two departments’ scores using the
appropriate way.
Solution: Biology Department Chemistry Department
11 23
CV = 100 = 17.19% CV = 100 = 29.11%
64 79
123
2.5 Standard Score
If X is a measurement from a distribution with mean X and standard
deviation S, then its value in standard units is
X −X
Z=
S
Z gives the deviations from the mean in units of standard deviation
124
Standard Score
Example: Two groups of people were trained to perform a certain task
and tested to find out which group is faster to learn the task. For the two
groups the following information was given:
Value Group one Group two
Mean 10.4 min 11.9 min
Stan.dev. 1.2 min 1.3 min
Relatively speaking:
125
Solution
S1 1.2
Coefficient of variation for group 1: CV = 100% = 100% = 11.54%
x1 10.4
S2 1.3
Coefficient of variation for group 2: CV = 100% = 100% = 10.92%
x2 11.9
x A − x1 9.2 − 10.4
Z-score of Person A: Z= = = −1.00
S1 1.2
xB − x2 9.3 − 11.9
Z-score of Person B: Z = S =
1.3
= −2.00
2
For example, physician says that a patient has a 50-50 chance of surviving a
certain operation.
Another physician may say that she is 95% certain that a patient has a
particular disease
An economist may say that he is 80% certain this year inflation may be
128
Fundamental Concepts
Experiment: Any process that generates well defined outcomes.
Action to be performed
129
Fundamental Concepts
Deterministic Experiment
130
Fundamental Concepts
Non-Deterministic/ probabilistic/Stochastic Experiment
Even exact knowledge of input and action does not allow exact prediction of
outcome.
All the possible outcomes are known prior to conducting the exoeriment.
Example: Rolling a die and observing the number that is rolled is a probability
experiment.
131
Fundamental Concepts
The result of a single trial in a probability experiment is the outcome.
The set of all possible outcomes for an experiment is the sample space.
S={T, H}
S={1, 2, 3, 4, 5, 6}
132
Fundamental Concepts
An event consists of one or more outcomes and is a subset of the sample
space.
Let event A be getting tail: A={T}
133
Classical Probability
Classical (or theoretical) probability is used when each outcome
in a sample space is equally likely to occur. The classical
probability for event E is given by
Number of outcomes in event
P (E ) = .
Total number of outcomes in sample space
134
Empirical Probability
Empirical (or statistical) probability is based on observations obtained from
probability experiments. The empirical frequency of an event E is the relative
frequency of event E.
P (E ) = Frequency of Event E = f
Total frequency n
Example: A company manufactures light bulbs. Out of 100, 000 produced light
bulbs 6 are defective. What is the probability next produced bulb will be
defective.
135
Subjective Probability
Subjective probability results from intuition, educated guesses, and estimates.
May differ from person to person.
Example: A business analyst predicts that the probability of a certain union
going on strike is 0.15.
136
Rules of Probability
Range of Probabilities Rule
The probability of an event E is between 0 and 1, inclusive. That is
0 P(A) 1.
The complement of Event E is the set of all outcomes in the sample space
137
Conditional Probability
A conditional probability is the probability of an event occurring,
given that another event has already occurred.
P (B |A) “Probability of B, given A”
Example: There are 5 red chip, 4 blue chips, and 6 white chips in a basket. Two
chips are randomly selected. Find the probability that the second chip is red
given that the first chip is blue. (Assume that the first chip is not replaced.)
Because the first chip is selected and not replaced, there are only
14 chips remaining.
5
P (selecting a red chip|first chip is blue) = 0.357
14
138
Conditional Probability
Example: 100 college students were surveyed and asked how many hours a
week they spent studying. The results are in the table below. Find the
probability that a student spends more than 10 hours studying given that the
student is a male.
The sample space consists of the 49 male students. Of these 49, 16 spend more than 10
hours a week studying.
16
P (more than 10 hours|male) = 0.327
49
139
Independent Events
Two events are independent if the occurrence of one of the events does not
affect the probability of the other event. Two events A and B are independent if
141
Fundamental Counting Principle
If one event can occur in m ways and a second event can occur in n wa
ys, the number of ways the two events can occur in sequence is m· n.
This rule can be extended for any number of events occurring in a sequ
ence.
Example:
A meal consists of a main dish, a side dish, and a dessert. How many different meals can be
selected if there are 4 main dishes, 2 side dishes and 5 desserts available?
4 2 5 = 40
There are 40 meals available.
Fundamental Counting Principle
Example:
Two coins are flipped. How many different outcomes are there? List the sample
space.
Start
1st Coin
Tossed
Heads Tails 2 ways to flip the coin
2nd Coin
Tossed
Heads Tails Heads Tails 2 ways to flip the coin
The access code to a house's security system consists of 5 digits. Each digit
can be 0 through 9. How many different codes are available if
a.) each digit can be repeated?
b.) each digit can only be used once and not repeated?
a.) Because each digit can be repeated, there are 10 choices for each of the 5 digits.
10 · 10 · 10 · 10 · 10 = 100,000 codes
b.) Because each digit cannot be repeated, there are 10 choices for the first digit, 9 choices
left for the second digit, 8 for the third, 7 for the fourth and 6 for the fifth.
10 · 9 · 8 · 7 · 6 = 30,240 codes
Permutations
A permutation is an ordered arrangement of objects. The number
of different permutations of n distinct objects is n!.
“n factorial”
Example:
How many different surveys are required to cover all possible question
arrangements if there are 7 questions in a survey?
7! = 7 · 6 · 5 · 4 · 3 · 2 · 1 = 5040 surveys
Permutation of n Objects Taken r at a Time
Example:
You are required to read 5 books from a list of 8. In how many different orders c
an you do so?
n Pr = 8 P5 = 8! = 8! = 8 7 6 5 4 3 2 1 = 6720 ways
(8 − 5)! 3! 3 2 1
Distinguishable Permutations
The number of distinguishable permutations of n objects, where n1 a
re one type, n2 are another type, and so on is
n! , where n1 + n2 + n3 + + nk = n.
n1 ! n2 ! n3 ! nk !
Example:
In how many ways can you order the word PSSISSIPIP
10! 10 9 8 7 6 5 4!
=
3!4!3! 3!4!3!
nC r =
n! .
# in the (n − r)! r !
collection
# taken from the
collection
Example:
You are required to read 5 books from a list of 8. In how many d
ifferent ways can you do so if the order doesn’t matter?
C 5 = 8! = 8 7 6 5! = 56 combinations
8
3!5! 3!5!
Application of Counting Principles
Example:
In a state lottery, you must correctly select 6 numbers (in any order)
out of 44 to win the grand prize.
a.) How many ways can 6 numbers be chosen from the 44 numbers?
b.) If you purchase one lottery ticket, what is the probability of
winning the top prize?
44!
a.) C = = 7,059,052 combinations
44 6 6!38!
b.) There is only one winning ticket, therefore,
1
P (win) = 0.00000014
7059052
Random variables
A random variable is a numerical quantity that is generated by a
random experiment.
A random variable is called discrete if it has either a finite or a
countable number of possible values.
Roll two fair dice Sum of the number of dots 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
on the top faces
Flip a fair coin twice Count number of tails 0, 1, 2
Example: Flip a coin three times, let X be the number of heads in three
tosses. Construct a probability distribution for X.
151
Discrete random variables: Probability distribution
Example: Flip a coin three times, let X be the number of heads in three
tosses. Construct a probability distribution for X.
S Rx
X(S)
HHH 3
HHT 2
HTH 2
HTT 1
THH 2
THT 1
TTH 1
TTT 0
X = {0, 1, 2, 3}
152
Discrete random variables: Probability distribution
Example: probability distribution
X 0 1 2 3
P(X=x) 1/8 3/8 3/8 1/8
153
The Mean and Standard Deviation of a Discrete Random Variable
The mean of a random variable may be interpreted as the average of the values
assumed by the random variable in repeated trials of the experiment.
154
The Mean and Standard Deviation of a Discrete Random Variable
The variance, σ2, of a discrete random variable X is the number
σ2
The standard deviation, σ, of a discrete random variable X is the square root of
its variance, hence is given by the formulas
155
The Mean and Standard Deviation of a Discrete Random Variable
Example: A manufacturer receives a certain component from a supplier in
shipments of 100 units. Two units in each shipment are selected at random and
tested. If either one of the units is defective the shipment is rejected. Suppose a
shipment has 5 defective units.
a) Construct the probability distribution for the number X of defective units in such a
sample.
156
The Mean and Standard Deviation of a Discrete Random Variable
Example: A manufacturer receives ….
a) The probability distribution
X 0 1 2
P(X=x) 0.902 0.096 0.002
a) P(shipment is accepted)=0.902
b) Mean =∑XP(X=x)=(0*0.902)+(1*0.096)+(2*0.002)=0.10
and variance=0.094
157
The Binomial probability distribution
Suppose a random experiment has the following characteristics.
There are n identical and independent trials of a common procedure.
There are exactly two possible outcomes for each trial, one termed “success”
Then the discrete random variable X that counts the number of successes
in the n trials is the binomial random variable with parameters n and
p. We also say that X has a binomial distribution with parameters n
and p.
158
The Binomial probability distribution
If X is binomial, then
159
The Binomial probability distribution
Example: A corporation has advertised heavily to try to insure that over half the
adult population recognizes the brand name of its products. In a random sample
of 20 adults, 14 recognized its brand name.
a) What is the probability that 14 or more people in such a sample would recognize its
brand name if the actual proportion p of all adults who recognize the brand name
were only 0.50?
b) What is the mean number of adults who recognized the brand name
c) What is the variance number of adults who recognized the brand name
160
The Binomial probability distribution
Example: A corporation has …
Let X be the number pf adults who recognized brand name
P=0.5
n=20→x~BINOM(20,0.5)
a) What is the probability that 14 or more people in such a sample would recognize its
brand name if the actual proportion p of all adults who recognize the brand name
were only 0.50?
P(X≥14)=0.0577
b) What is the mean number of adults who recognized the brand name
Mean=E(X)=np=10
c) What is the variance number of adults who recognized the brand name
Var(X)=np(1-p)=5
161
The Poisson probability distribution
The Poisson probability distribution provides a good model for the
probability distribution of the number of “rare events” that occur
randomly in time, distance, or space.
162
The Poisson probability distribution
Assumptions of a Poisson probability distribution:
You are counting the number times a particular event occurs in a unit;
and
As the unit gets smaller, the probability that two or more events will
163
The Poisson probability distribution
The random variable X is said to follow the Poisson probability
distribution if it has the probability function:
e − x
P( x) = , for x = 0, 1,2,...
where x!
• P(x) = the probability of x successes over a given period of time or space, given
x = E ( X ) = and x2 = E[( X − ) 2 ] =
164
The Poisson probability distribution
Example: If calls to your cell phone are a Poisson process with a
constant rate =2 calls per hour,
what’s the probability that, if you forget to turn your phone off in a
E(X) = t = 2(1.5) = 3
165
The Poisson probability distribution
Example: A life insurance company insures the lives of 5,000 men
of age 42. If actuarial studies show the probability of any 42-year-
old man dying in a given year to be 0.001.
a) What is the probability that a company will pay 4 claims per year
b) What is the mean number of claims per year the company will pay.
c) What is the probability that a company will pay at least 1 claims per
year
166
The Poisson probability distribution
Example: A life insurance company….
b) What is the mean number of claims per year the company will pay.
Mean=5
c) What is the probability that a company will pay at least 1 claims per
year
𝑃 𝑋 ≥1 =𝑃 𝑋 =1 +𝑃 𝑋 =2 +⋯=1−𝑃 𝑋 =0
𝑒 −5 5−0
=1- 0! =0.993262 167
Continuous Probability Distribution
A continuous random variable is a variable that can assume any value in
an interval
thickness of an item
temperature of a solution
height, in meters
168
Continuous Probability Distribution
A continuous random variable has an infinite number of possible
values that can be represented by an interval on the number line.
0 3 6 9 12 15 18 21 24
169
Normal Probability Distribution
The most important probability distribution in statistics is the normal
distribution. f(x)
Bell Shaped’
Symmetrical
σ
Mean, Median and Mode are Equal
μ x
Location is determined by the mean, μ
170
Normal Probability Distribution
The normal distribution closely approximates the probability distributions of a
wide range of random variables
The normal probability distribution has led to good business decisions for a
number of applications
171
Normal Probability Distribution
For a normal random variable X with mean μ and variance σ2 , i.e., X~N(μ, σ2),
the cumulative distribution function is
F(x 0 ) = P(X x 0 )
f(x)
P(X x 0 )
0 x0 x 172
Normal Probability Distribution
There may be thousands of normal distribution curves, each with a different
mean and a different standard deviation.
Since the shapes are different, the areas under the curves between any two points
are also different.
173
Standard Normal Probability Distribution
The letter z is used to designate the standard normal random variable.
=1
z
0
• Converting to the Standard Normal Distribution requires the use of this formula
Value - Mean x -μ
z= = .
Standard deviation σ
• If X is distributed normally with mean of 100 and standard deviation of 50, the Z
value for X = 200 is 2.0
174
Standard Normal Probability Distribution
a −μ b −μ
P(a X b) = P Z
σ σ
f(x) b −μ a −μ
= F − F z
σ σ
x
a µ b
a −μ b −μ
Z
σ 0 σ
175
Standard Normal Probability Distribution
Properties of the Standard Normal Distribution
1. The cumulative area is close to 0 for z-scores close to z = −3.49.
z = −3.49 z = 3.49
z=0
Area is 0.5000.
z=0
176
Standard Normal Probability Distribution
Example: Find the area that corresponds to a z-score of between 0 and 2.71.
Find the area by finding 2.7 in the left hand column, and then moving across the row to the column under 0.01
178
Standard Normal Probability Distribution
b. Find the maximum number of hours per day that the bottom quartile of
households uses a personal computer for entertainment.
179
Standard Normal Probability Distribution
180
Uniform Probability Distribution
f(x)
Total area under the
uniform probability
density function is 1.0
xmin xmax x
181
Uniform Probability Distribution
1
if a x b
b−a
f(x) =
0 otherwise
where
a = minimum value of x
b = maximum value of x
182
Exponential Probability Distribution
Used to model the length of time between two occurrences of an event (the time
between arrivals)
Examples:
183
Exponential Probability Distribution
The exponential random variable T (t>0) has a probability density
function
f(t) = λ e − λ t for t 0
Where
is the mean number of occurrences per unit time
e = 2.71828
184
Exponential Probability Distribution
The exponential random variable T (t>0) has a probability density
function
f(t) = λ e − λ t for t 0
Where
is the mean number of occurrences per unit time
e = 2.71828
185
Exponential Probability Distribution
Defined by a single parameter, its mean (lambda)
The cumulative distribution function (the probability that an arrival time is less
than some specified time t) is
−λt
F(t) = 1− e
where
186
Exponential Probability Distribution
Example: Customers arrive at the service counter at the rate of 15 per hour.
What is the probability that the arrival time between consecutive customers is
less than three minutes?
187