MMWChapter4 6
MMWChapter4 6
Data Management
Learning Outcomes:
Introduction
4.1 Preliminaries
AREAS OF STATISTICS
There are two main areas of statistics: descriptive and inferential statistics.
Descriptive statistics is the simple collecting, presenting, and analyzing of the data
and its primary purpose is only to describe the characteristics of the population/sample under
investigation.
TYPES OF DATA
There are two basic types of data: qualitative and quantitative data.
Quantitative Data (also termed as numerical data)- is the data that involves quantities
which came from counting measurement. Its value differs in degree.
61
Examples are height, weight, number of employees, salary, etc.
This can also be classified as either discrete or continuous. Discrete data are those
data that can be counted like number of students, number of likes and shares in FB post,
etc. While continuous data are those data that are obtained through measurement like
weight, length of your hair, thickness of your eyeglasses, etc.
Qualitative data (also termed as categorical data) is the data that involves qualities
which cannot be measured. Examples are sex, nationality, color of skin, religion, etc.
Levels of Measurement
Data can be classified according to the levels of measurement. This classification
includes nominal, ordinal, interval and ratio data.
1. Nominal level is the lowest level of data and is used purely for classification and
identification purposes only. Examples of this level are gender, house number,
home ownership, etc.
3. Interval level – it specifies the precise difference between or among the values or
ranks.
4. Ratio level- has the same characteristics as the interval, however, the ratio level
starts from zero. In addition, it has a presence of units of measures.
62
3. Experimentation. This method of data collections involves the researcher’s
intervention on the conditions that may affect the outcome of the variables of interest.
As it has been said, in data gathering, it is usually less expensive when only a
segment of the population, or sample, is considered. Apart from economy reasons, that is,
saving money, time, and effort, gathering data from a sample is easier and at times, more
practical. The following are sampling techniques that may be used.
A. Probability Sampling - each unit in the population has a known probability of selection,
and a random number table or other randomization mechanism is used to choose the
specific units to be included in the sample
- relatively small sample can be used to make inferences about an
arbitrarily large population
1. Simple Random Sampling (SRS). This is the simplest form of probability sampling
wherein all the elements of the population have equal chances of being
selected as sample. This usually serves as the foundation of more complex
sampling design.
- every kth unit thereafter, is chosen to be in the sample. If N/n is an integer, let
k = N/n; otherwise, let k be the next integer after N/n.
- consists of units that are equally spaced in the list
- may not be representative of the population
3. Stratified Random Sampling. In this sampling method, the elements are divided into
subgroups called strata. Then a random sample of units is taken from
each stratum. Elements in the same stratum often tend to be more similar
than randomly selected elements from the whole population, so
stratification often increases precision.
63
4. Cluster Sampling. Here, observation units in the population are aggregated into larger
sampling units, called clusters, and sampling is done on clusters and uses
all members of the cluster as samples.
Note:
Elements in the same stratum often tend to be more similar than randomly selected
elements from the whole population, so stratification often increases precision.
Illustration:
Suppose you want to estimate the average amount of time that professors at
CSU say they spent grading homework in a specific week.
B. Non-Probability Sampling - Not all units in the population has a chance of being selected
as sample
Frequency Distribution. This organizes raw data in table form, using classes and
frequencies or counts. Each raw data value is placed into a quantitative or qualitative
category called a class. The frequency of a class is the number of data values
contained in that specific class.
64
A frequency distribution is used to:
organize the data in a meaningful, intelligible way.
enable the reader to determine the nature or shape of the distribution.
facilitate computational procedures for measures of average and spread.
enable the researcher to draw charts and graphs for the presentation of
data.
enable the reader to make comparisons among different data sets.
1. Categorical Frequency Distributions - used for data that can be placed in specific
categories, such as nominal or ordinal level data.
Example 1:
A B B AB O
O O B AB B
B B O A O
A O O O AB
AB A O B A
2. Ungrouped Frequency Distribution - used for data whose range of values is relatively
small. The single data values are used as classes.
Example 2:
65
The data shown here represent the number of miles per gallon (mpg) that 30
selected four-wheel-drive sports utility vehicles obtained in city driving. Construct a
frequency distribution, and analyze the distribution.
12 17 12 14 16 18
16 18 12 16 17 15
15 16 12 15 16 16
12 14 15 12 15 15
19 13 16 18 16 14
3. Grouped Frequency Distributions - used for data that has a very large range. Data are
grouped into classes that are more than 1 unit in width.
Tabular Form - effective devices of presenting both qualitative and quantitative data.
- make comparisons and draw relationships between and among variables
66
Exploratory/Graphical Method - uses visual representation of data like graphs to describe
the data
Since the class boundaries are used in the graph, the bars in a histogram are
contiguous, unlike those in a bar or column chart.
2. Frequency Polygon - graph that displays the data by using lines that connect points
plotted for the frequencies at the midpoints of the classes. The frequencies
are represented by the heights of the points.
67
To construct a frequency polygon, follow the steps below.
𝐿𝐶𝐵 + 𝑈𝐶𝐵
i. Find the midpoint of each class as ̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅
2
ii. Draw the x and y axes. Label the x axis with the midpoint of each class,
and then use a suitable scale on the y axis for the frequencies.
iii. Using the midpoints for the x values and the frequencies as the y values,
plot the points
iv. Connect adjacent points with line segments. Draw a line back to the x
axis at the beginning and end of the graph, at the same distance that the
previous and next midpoints would be located
The graph below shows the frequency polygon corresponding to the distribution above.
68
The cumulative frequency distribution of the data is as follows.
Cumulative Frequency
less than 99.5 0
less than 104.5 2
less than 109.5 10
less than 114.5 28
less than 119.5 41
less than 124.5 48
less than 129.5 49
less than 134.5 50
Example:
69
5. Time Series Graph - used to represent data that occur over a specific period of time
Example:
70
6. Pie Graph - show relationships of the parts to the whole by visually comparing the sizes
of the sections.
- sections/wedges are divided according to the percentage of frequencies in
each category of the distribution
16% Singapore
Timor-Leste
71
7. Scatter Plot - shows relationship between two variables.
72
4.2 Measures of Central Tendency
The measures of central tendency are used to determine the cluster of the data about
the center. The most common measures of central tendency are the mean, median and
mode.
4.2.1 MEAN
The mean or arithmetic mean (𝒙 ̅) is the average of all the values in the data set. It
can be obtained by getting the sum of all the observations divided by the total number of
observations.
∑ 𝑋𝑖
where: 𝑋𝑖 is the individual observations
𝑋̅ = 𝑖=1
̅̅̅̅̅̅̅ 𝑛 is the total number of observations
𝑛
Find the mean height of the 12 basketball players whose heights (in cm) are
150, 160, 163, 159, 174, 178, 165, 156, 187, 176, 175, 180.
Solution: Let X be the height of the players and n for the total number of players
150 + 160 + 163 + 159 + 174 + 178 + 165 + 156 + 187 + 176 + 175 + 180
𝑋̅ = ̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅
12
2,023
= ̅̅̅̅̅̅̅̅ = 168.6
12
Note:
If the observations are in whole number, the final answer must be in tenth place, while
if the raw data has one decimal place, then its final answer must be in two decimal and so
on.
What is the mean of the set of values: 6.7, 4.6, 5.5, 3.4, 8.2, and 5.8
Solution:
6.7 + 4.6 + 5.6 + 3.4 + 8.2 + 5.9
𝑋̅ = ̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅
6
34.4
= ̅̅̅̅̅̅ = 5.73
6
Twelve students were given an arithmetic test and the times (in minutes) to
complete it were as follows:
73
10, 9, 12, 11, 8, 15, 9, 7, 8, 6, 12, 10
Solution:
117
𝑋̅ = ̅̅̅̅̅̅ = 9.8
12
Therefore, the average time to complete the arithmetic test is 9.8 minutes.
There are some cases when individual values do not have equal importance. A
weighted mean is appropriate to use. The formula in the computation of the weighted
arithmetic mean is:
n
W X i i
W1 X 1 W2 X 2 . .. Wn X n
Xw i 1
n
W1 W2 ... Wn
W
i 1
i
Solution:
W X i i
W1 X 1 W2 X 2 . .. Wn X n
Xw i 1
n
W1 W2 ... Wn
W
i 1
i
74
Properties of the Mean:
1. It is unique value in a data set and always exists.
2. It is affected by the extreme and deviant values.
3. It is used only if the data are interval or ratio and when normally distributed.
4. It is the most reliable measure of central tendency.
4.2.3 MEDIAN
The median is the middlemost value in the data set. It divides the distribution into
two equal parts.
If the number of observation is even, the median is the average of the two middle
values, while if the number of observation is odd, then the median is the middlemost value
in the data set.
𝑛+1
Median (Rank Value) = ̅̅̅̅̅̅̅
2
Example 1:
Find the median height of the 12 basketball players whose heights (in cm)
were as follows:
150, 160, 163, 159, 174, 178, 165, 156, 187, 176, 175, 180.
Solution:
The first step is to arrange the data in an increasing order (from lowest to
highest).
Thus, 150, 156, 159, 160, 163, 165, 174, 175, 176, 178, 180, 187
Since the data set is an even number of observation, we will be getting the
average of the two middle values and following the formula for the median (rank
value)
Since the middle value falls on the 6.5, then we are going to get its 6 th and 7th value.
Therefore,
Example 2:
The daily rates of a sample of 9 employees at GMS Inc. are ₱550, ₱420,
₱650, ₱500, ₱700, ₱480, ₱520, ₱860, and ₱670. Find the median rate.
Solution:
The first step is to arrange the data set in an increasing order. Thus,
₱420, ₱480, ₱500, ₱520, ₱550, ₱650, ₱670, ₱700, ₱860
Since the data set is an odd number, then the median is the middlemost value.
Therefore, the median is the 5th value which is ₱550.
75
1. It is a positional average and it is not influenced by the position of the items.
2. Like the mean, it may also be a unique value in a data set.
3. It is associated with ordinal data.
4.2.4 MODE
The mode is the most frequent observation. It is the observation which occur most
often in the data set.
If there is only one observation having the highest frequency, then the data set is said
to be unimodal. If it has two, then it is bimodal. If it has three observations with the same
highest frequency, it is said to be trimodal. And, if there is no repetition of the individual
values in the data set, no mode exists.
Example 1:
The following are the scores of the students in Mathematics quiz. Determine
the mode of the data set.
40, 27, 20, 40, 26, 24, 25, 29, 30, 31, 27, 33, 39, 36, 22, 36, 28, 27,
27, 26, 20, 21, 30, and 19.
Solution:
The most frequent number that appears in the data set is 27. Since there is only one
observation having the highest frequency, then it is unimodal.
Example 2:
Determine the mode of the grades of 19 engineering students in Mathematics
subject as follows:
2.2, 1.7, 2.1, 2.0, 1.9, 2.3, 2.0, 2.4, 1.9, 2.1, 2.2, 2.4, 2.0, 1.9,
2.1, 2.1, 2.1, 2.0, 2.0
Solution:
Since the grade of 2.0 and 2.1 appeared the most and with the same number of times,
then they are considered as the mode. The type of mode is bimodal.
The measures of dispersion or variability tell about the spread of the data or how the
individual values are dispersed from the mean. The common measures of dispersion are the
range, variance and standard deviation.
The variance is defined as the average of the squared deviations from the mean. The
square root of this variance is known as the standard deviations. The variance for a sample
data is denoted by s2 while the population variance is σ2.
76
The formula for the sample variance is:
∑(𝑥 − 𝑥̅ )2
𝑠2 =
̅̅̅̅̅̅̅̅̅̅̅̅̅̅
𝑛−1
To determine the variance of ungrouped data, let us follow the steps below:
1. Arrange the values in order (i.e. increasing or decreasing) vertically.
2. Calculate the mean of the data set.
3. Subtract the mean from the individual values. Place this on another column.
4. Add another column for the square of the difference of individual values and the
mean.
5. Get the sum of the squared deviations.
6. Divide the sum in step 5 by n-1 for a sample data and N for the population data.
Example 1:
Determine the range, variance and standard deviation of the following data on a
sample of weights of pre-school children: 25.2, 19.5, 20.4, 21.5, 18.2, 16.0, 17.8, 17.6
Solution:
a. Range = highest value – lowest value
= 25.2 – 16.0
= 9.2
𝑋 𝑋 − 𝑋̅ (𝑋 − 𝑋̅)2
16.0 –3 .24 10.50
17.6 –1.64 2.69
17.8 –1.44 2.07
18.2 –1.04 1.08
19.5 0.26 0.07
20.4 1.16 1.34
25.2 5.96 35.52
̅
𝑋= 19.24 (𝑋 − 𝑋̅)2 = 53.27
c. Since the standard deviation is the square root of the variance, then
𝑠 = √8.88 = 2.98
This value of the standard deviations implies that the cluster of observation is in the
range of 2.98 units above and below the mean.
Example 2:
The marks of 10 students of a class is given to be 0, 4, 9, 12, 25, 2, 21, 7, 11
and 12. What is the variance of the data set?
Solution:
Step 1: Organize the marks of the students in a table.
77
Marks (x) 𝑥 − 𝑥̅ (𝑥 − 𝑥̅ )2
25 14.7 216.09
21 10.7 114.49
12 1.7 2.89
12 1.7 2.89
11 0.7 0.49
9 – 1.3 1.69
7 – 3.3 10.89
4 – 6.3 39.69
2 – 8.3 68.89
0 – 10.3 106.09
𝑥̅ = 10.3 (𝑥 − 𝑥̅ )2 = 56.4
Step 3: Get the square root of the variance to obtain the standard deviation.
√𝑠2 = √62.67 = 7.92
The measures of location describe the data in some situations and it would be
beneficial knowing how to interpret the obtained values. Quartiles, percentiles and standard
scores are the most commonly used measures of location.
Quartiles divide the distribution into four equal parts (segments of 25% each). Three
quartiles are defined: Q1, Q2 and Q3.
Percentiles divide the distribution into 100 equal parts denoted by Pk where k is the
percentile rank. Say P50 means 50th percentile, P75 means 75th percentile and so on. P50 and
Q2 is also the same as the median of the distribution. Same with P 25 that is equal to Q1.
𝑛𝑘
𝑄𝑘 = ̅̅̅̅ , where 𝑘 = 1, 2, 3
4
𝑛𝑘
𝑃𝑘 = ̅̅̅̅̅̅ , where 𝑘 is an integer from 1 to 99
100
If the value of 𝑄𝑘 and 𝑃𝑘 is an integer, the kth percentile/quartile is the average of the
value of the obtained percentile/quartile rank and the value preceding it. If the value is not
an integer, then it must be round up.
Percentile rank refers to the percentile ranking of a certain value. This can be
obtained by following the equation below:
78
Example 1:
What is the third quartile (Q3) of the following data set?
20, 40, 50, 65, 70, 75, 80, 100
Solution:
If the data set are not arranged in chronological order, then you must arrange it either
increasing or decreasing order.
Since the given data set are already arranged, then we must compute for Q 3.
Q3 =nk/ 4 = (8)(3)/4 = 24/4 = 6
Since the value of Q3 is an integer then we must get the average of the 6 th and 7th
value in the data set.
Example 2:
For the data set below, which value is in the 75th percentile?
1, 3, 3, 4, 6, 7, 7, 7, 8, 9, 9, 10, 12, 15, 16, 17
Solution:
Since we want to find the P75, and we know that there are 16 values in the data set,
then computing for P75 = nk/100 = (16)(75)/100 =1200/100 = 12
Again since the obtained value is an integer, then we must get the average of the 12 th
and 13th value in the data set. That is, P75 = (10 + 12)/2 = 22/2 = 11
Therefore, the 75th percentile is 11. This implies that 75% in the data set have values
less than 11 and only 25% have values greater than 11.
Example 3:
Find the percentile rank of the value of 6 in Example 2:
Solution:
Using the equation for the percentile rank and substituting the given information,
4 + 0.5
= ̅̅̅̅̅̅̅̅̅ ∙ 100%
16
= 28%
79
4.5 Probabilities and the Normal Distribution
Standard score can be obtained by getting the ratio of the difference of the value and
the mean and the standard deviation. In symbols,
𝑋 − 𝑥̅
𝑍=
𝑠
Note: A positive (+) z-score means that the observed value is above the mean.
A negative (-) z score means that the observed value is below the mean.
A zero (0) z –score means that the observed value is equal to the mean.
Example 1:
In a given distribution, the mean is 85 and the standard deviation is 10. Find
the corresponding standard score of the ff. values:
a. 95 b. 87 c. 68 d. 55
Solution:
95 − 85
z= = 1.0
10
Since the standard score is positive, this implies that the score of 95 is 1
standard deviation above the mean.
This implies that the score of 87 is 0.2 standard deviations above the mean.
Since the standard score is negative, this implies that score of 68 is 1.7
standard deviations below the mean.
80
4. The standard score of 55 is:
55 − 85
z= = −3.0
10
This implies that the score of 55 is 3 standard deviations below the mean.
Two parameters are used to describe the normal curve; the mean and the standard
deviation. Negative standard deviations are located at the left side while the positive standard
deviations are on the right side of the curve.
The area of the normal distribution represents probability. Thus, the larger the area
the greater probability.
Source: Kanbanize
Example: 1.
Solution:
a. the area of z = 1.99 is equal to 0.4767 or 47.67%, since we will find the area to
the left of 1.99 we must add the other 50% on the left side of the normal curve.
Therefore, the total area would be 97.67%.
81
b. the area of z = 2.04 is equal to 0.4793 or 47.93%, since this area pertains to the
left of the normal curve and we are looking for the area to the right, then we must
add the other 50% of the normal curve. Therefore, the total area to the right of -
2.04 would be 97.93%.
c. the area of z = 2.00 is equal to 0.4772 or 47.72% and the area of z = 2.47 is equal
to 0.4932 or 49.32%, since we are looking for the area between these regions,
we must subtract the values. Therefore, 49.32% subtracted by 47.72% is equal
to 1.60%.
d. the area of z= 1.02 is equal to 0.3461 or 34.61% and the area of z = 2.35 is equal
to 0.4906 or 49.06%, since we are looking for the area between these regions,
and it can be noticed that the z scores comes from left and right of the curve, then
we must add the areas in order to get the total area. Therefore, 34.61% added by
49.06% is equal to 83.67%.
Several problems in different fields can be solved with the application of the normal
curve. The only requirement is that the variable be normally or approximately normally
distributed.
Example 1:
A survey found that women spend on average ₱146. 21 on beauty products
during the summer months. Assume that the standard deviation is ₱29.44 and the
variable is normally distributed. Find the percentage of women who spend less than
₱160. 00.
Solution:
Step 1: Draw the normal curve and represent the area.
160.00
𝑥̅ =146.21
₱160.00 − ₱146.21
z = ̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅ = 0.47
₱29.44
82
Step 3: Find the area, using the Table for Areas Under the Normal Curve. Look for
z = 0.47. From the table, the area of z= 0.47 is 0.1808. Since the question is to
look for the percentage of women who spend less than 160.00, so we need to
add the area below the mean which is 50%.
Example 2:
To qualify for a police academy, candidates must score in the top 10% on a
general abilities test. The test has a mean of 200 and a standard deviation of 20. Find
the lowest possible score to qualify. Assume that the test scores are normally
distributed.
Solution:
Since the test scores are normally distributed, the test value x that cuts off the
upper 10% of the area under the normal distribution is desired. (refer to figure below).
The shaded region represents the students who qualify for the test.
𝑥̅ 1.28
Step 1: Subtract 0.1000 from 1.000 to get the area under the normal distribution to the left of
x: 1.000 – 0.1000 = 0.9000
Step 2: Find the z value that corresponds to an area of 0.9000. If the specific value cannot
be found, use the closest value. In this case, 0.8997. The corresponding value is 1.28.
Therefore, a score of 226 should be used as a cut off. Anybody scoring 226 and above
qualifies.
83
two variables. The data can be represented by the ordered pairs (𝑥, 𝑦) where x is the
independent (or explanatory) variable, and y is the dependent (or response) variable.
The correlation coefficient is a measure of the strength and the direction of a linear
relationship between two variables. The symbol r represents the sample correlation
coefficient. The formula for 𝑟 is
𝑛 ∑ 𝑥𝑦 − ∑ 𝑥 ∑ 𝑦
𝑟 = ̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅
√𝑛 ∑ 𝑥 2 − (∑ 𝑥)2 √𝑛 ∑ 𝑦 2 − (∑ 𝑦)2
where:
r = Pearson r correlation coefficient
n = number of observations
∑ 𝑥𝑦 = sum of the products of paired scores
∑ 𝑥 = sum of x scores
∑ 𝑦 = sum of y scores
∑ 𝑥 2 = sum of squared x scores
∑ 𝑦 2 = sum of squared y scores
A value of +1 indicates that there is a perfect positive correlation. This means that if
one variable increases, the other variable also increases. The value of -1 indicates that there
is a negative correlation. This implies that as one variable increases, the other variable
decreases. A value of 0 indicates that there is no correlation between variables. (Tolentino,
et al., 2018) The complete list of values was presented below to further interpret the value of
computed r.
0.0 - no correlation
±1.00 - perfect correlation
±0.01 − ±0.25 - very low correlation
±0.26 − ±0.50 - moderately low correlation
±0.51 − ±0.75 - high correlation
±0.76 − ±0.99 - very high correlation
Determine the correlation between the age and the weight of 10 preschool children at
barangay Marilima as shown in the table below:
84
Solution:
Step 1: To obtain the values, we may construct another table adding columns for specific
values necessary in the computation of the Pearson r.
The result of 0.93 indicates a positive with very high correlation between the age of
pre-school children and their weight. This implies that as age of the pre-school children
increases, their weight also increases.
Example 2:
For the following data set, find the Pearson r and the r2.
85
x 12 2 5 9 11 10 4 1
y 10 3 7 5 9 8 6 3
Solution:
The following steps are helpful for the computation of the correlation coefficient.
X Y XY X2 Y2
12 10 120 144 100
2 3 6 4 9
5 7 35 25 49
9 5 45 81 25
11 9 99 121 81
10 8 80 100 64
4 6 24 16 36
1 3 3 1 9
∑ 𝑥 =54 ∑ 𝑦 =51 ∑ 𝑥𝑦 =412 ∑ 𝑥 2 =492 ∑ 𝑦 2 =373
𝑛 ∑ 𝑥𝑦 − ∑ 𝑥 ∑ 𝑦
𝑟 = ̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅
√𝑛 ∑ 𝑥 2 − (∑ 𝑥)2 √𝑛 ∑ 𝑦 2 − (∑ 𝑦)2
8 (412) − (54)(51)
= ̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅
√[(8)(492) − 542 ] √[(8)(373) − 512 ]
3296 − 2754
= ̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅
√(3936 − 2916) √2984 − 2601
542 542
= ̅̅̅̅̅̅̅̅̅̅̅̅̅ = ̅̅̅̅̅ = 0.09
√390660 625
86
Step 4. Interpret the results.
The value of r = 0.09 indicates that the variables have very low correlation between
variables. This implies that whatever the value of one variable does not affect the values of
the other variable.
Step 5. Computing for r2 where r = 0.09, r2 = 0.09 which implies that only 9% explained the
aforementioned correlation.
Simple linear regression is a statistical method that allows us to summarize and study
relationships between continuous (quantitative) variables. One variable, denoted by x
represents the predictor variable or the independent variable. The other variable, denoted
by y represents the response or the dependent variable.
Simple linear regression is appropriate when the following conditions are satisfied.
The dependent variable Y has a linear relationship to the independent variable X. To
check this, make sure that the XY scatterplot is linear and that the residual plot shows
a random pattern.
For each value of X, the probability distribution of Y has the same standard deviation
σ. When this condition is satisfied, the variability of the residuals will be relatively
constant across all values of X, which is easily checked in a residual plot.
The least square regression equation can be formed from a set of sample data using
the formula:
𝑦̂ = 𝑎 + 𝑏𝑥
The constants a, b in the regression equation are called the regression coefficients.
The values of a and b can be found using the following equations:
∑ 𝑥𝑦 − 𝑛𝑥̅ 𝑦̅
𝑎 = 𝑦̅ − 𝑏𝑥̅ and 𝑏 = ̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅
∑ 𝑥 2 − 𝑛(∑ 𝑥)2
The regression equation can be used to predict the value of one variable when the
value of the other variable is known.
Example 1:
The values of x and their corresponding values of y are shown in the table below
X 0 1 2 3 4
Y 2 3 5 4 6
87
Solution:
Step 1: Organize the listing of the values of x and y. Include the necessary values like x2, y2,
xy and the mean of x and y.
𝑥 𝑦 𝑥𝑦 x2 y2
0 2 0 0 4
1 3 3 1 9
2 5 10 4 25
3 4 12 9 16
4 6 24 16 36
x = 10 y = 20 xy = 49 x = 30 x = 90
2 2
𝑥̅ = 2.0 𝑦̅ = 4.0
∑ 𝑥𝑦 − 𝑛𝑥̅ 𝑦̅ 49 − 5(2.0)(4.0) 9
𝑏 = ̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅
2 2
= ̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅
2
= ̅̅̅̅ = 0.9
∑ 𝑥 − 𝑛(∑ 𝑥) 30 − 5 (2) 10
𝑎 = 𝑦̅ - b𝑥̅ = 4 – (0.9)(2) = 2.2
Example 2:
The table below shows the height, 𝑥, in inches and the pulse rate, 𝑦, per
minute, for 9 people. Find the correlation coefficient and interpret your result.
x 68 72 65 70 62 75 78 64 68
y 90 85 88 100 105 98 70 65 72
Solution:
Step 1:
Height Pulse rate
𝑥𝑦 𝑥2 𝑦2
(𝑥) (𝑦)
68 90 6120 4624 8100
72 85 6120 5184 7225
65 88 5720 4225 7744
70 100 7000 4900 10000
62 105 6510 3844 11025
75 98 7350 5625 9604
78 70 5460 6084 4900
64 65 4160 4096 4225
68 72 4896 4624 5184
∑ 𝑥 =622 𝑦 = 773 𝑥𝑦 = 53336 𝑥 = 43206
2 ∑ 𝑦 2 =68007
𝑥̅ = 69.1 𝑦̅ = 85.9
88
𝑎 = 𝑦̅ − 𝑏𝑥̅ = 85.9 − (−0.37)(69.1) = 111.47
Exercise 4.1
Exercise 4.2
1. Calculate the mean, median and mode of the following sets of data.
a. 14, 16, 8, 8,10, 12, 9, 12, 13, 14, 8, 9, 5, 7, 11
b. 12.5, 15.5, 10.5, 10.2, 11.15, 9.5, 8.2, 18.3, 10.5, 10.5
2. Find the General Weighted Average of a student whose grades for a 3 unit subjects
such as Math, English, Filipino, Science, and Social Studies were 2.0, 1.8, 1.4, 2.2
and 2.5, respectively.
89
3. Heather’s test scores are 81, 93, 74 and 95. What score must she get on the fifth test
in order to get a mean of 85 on all five tests?
Exercise 4.3
1. Determine the range, variance and standard deviations of the following data sets.
a. 7, 8, 4, 3, 2, 3, 6, 5 and 7
b. 3, 6, 15, 18, 24, 12, and 10
c. 2, 8, 11, 17, 12, 6 and 4
2. The result of the college entrance examination of 10 students in a certain university were
as follows:
2.5, 3.4, 5.6, 3.8, 4.2, 2.8, 3.0, 3.0, 3.4, 4.2
Compute for the variance and standard deviation.
3. The newspaper company reported that samples of their weekly sales (in hundred
thousand pesos) are: 345, 452, 254, 137, 483, 515 and 218. Calculate and interpret the
variance and standard deviations.
4. A high school teacher at a small private school assigns trigonometry practice problems
to be worked via the net. Students must use a password to access the problems and the
time of log-in and log-off are automatically recorded for the teacher. At the end of the
week, the teacher examines the amount of time each student spent working the assigned
problems. The data is provided below in minutes.
15 22 22 22
28 43 33 20
25 49 27 39
48 34 25
Find the Range, Standard Deviation, and Variance for the above data.
90
Exercise 4.4
3. Listed are 29 ages for Academy Award-winning best actors in order from smallest to
largest:
18, 21, 22, 25, 26, 27, 29, 30, 31, 33, 36, 37, 41, 42, 47, 52,
55, 57, 58, 62, 64, 67, 69, 71, 72, 73, 74, 76, 77
a. Find the 70th percentile.
b. Find the 83rd percentile.
4. At a high school, it was found that the 30 th percentile of number of hours that students
spend studying per week is seven hours. Interpret the 30 th percentile in the context of this
situation.
5. The wheat production (in Kg) of 20 acres is given as: 1120, 1240, 1320, 1040, 1080, 1200,
1440, 1360, 1680, 1730, 1785, 1342, 1960, 1880, 1755, 1720, 1600, 1470, 1750, and
1885. Find the quartile deviation and coefficient of the quartile deviation.
Exercise 4.5
91
Exercise 4.6
1. A researcher carefully computes the correlation coefficient between two variables and
gets r = 1.23. What does this value mean?
X Y X Y
72 45 80 36
73 38 81 29
75 41 82 34
76 35 83 38
77 31 84 26
78 40 85 32
79 25 86 28
80 32 88 27
a. Sketch a scatterplot.
b. Compute the correlation coefficient, r.
c. Compute the coefficients of the linear regression line, y = b 1x + b0.
d. What is the estimated value for X = 7?
92
Chapter 5
The Mathematics of Finance
Learning Outcomes:
where I is the interest, P is the principal, r is the interest rate, and t is the time period. In the
simple interest formula, the time t is expressed in the same period as the rate. For example, if
the rate is given as an annual interest rate, then the time is measured in years; if the rate is
given as a monthly interest rate, then the time must be expressed in months. Interest rates
are most commonly expressed as annual interest rates. Therefore, unless stated otherwise,
we will assume the interest rate is an annual interest rate. Interest rates are generally given
as percents. Before performing calculations involving an interest rate, write the interest rate
as a decimal.
Example 1. Calculate the simple interest earned in 1 year on a deposit of P1000 if the
interest rate is 5%.
Solution:
Use the simple interest formula. Substitute the following values into the formula:
P = 1000, r = 5%, and t = 1
I = Prt
I = 1000(0.05)(1)
I = 50
93
Solution:
Use the simple interest formula. Substitute the following values into the formula:
P = 2000, r = 6.5% = 0.065. Because the interest rate is an annual rate, the
3 months 3
time must be measure in years: 𝑡 = ̅̅̅̅̅̅̅̅̅̅̅̅̅̅ = ̅̅̅̅
12 months 12
I = Prt
I = 2000(0.065)(3/12)
I = 32.5
Example 3. Calculate the simple interest due on a two-month loan of P500 if the
interest rate is 1.5% per month.
Solution:
Use the simple interest formula. Substitute the following values into the formula:
P = 500, r = 1.5% = 0.015. Because the interest rate is per month, the time
period of the loan is expressed as the number of months 𝑡 = 2
I = Prt
I = 500(0.015)(2)
I = 15
Remember that in the simple interest formula, time t is measured in the same period
as the interest rate. Therefore, if the time period of a loan with an annual interest rate is given
in days, it is necessary to convert the time period of the loan to a fractional part of a year.
There are two methods for converting time from days to years; the exact method and the
ordinary method. Using the exact method, the number of days of the loan is divided by 365,
the number of days in a year.
number of days
Exact method: 𝑡 =
̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅
365
The ordinary method is based on there being an average of 30 days in a month and
12 months in a year (30)(12) = 360. Using this method, the number of days of loan is divided
by 360.
number of days
Ordinary method: 𝑡 =
̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅
360
Example 4. Calculate the simple interest due on a 45-day loan of P3500 if the interest
rate is 8% per month.
Solution:
Use the simple interest formula. Substitute the following values into the formula:
P = 3500, r = 8% = 0.08. Because the interest rate is per month, the time period
45
of the loan is expressed as the number of months 𝑡 =
360
I = Prt
I = 3500(0.08)(45/360)
I = 35
94
The simple interest due is P35.
The simple interest formula can be used to find the interest rate on a loan when the
interest, principal, and time period of the loan are known. An example is given below.
Example 5. The simple interest charged on a six-month loan of P3000 is P150. Find
the simple interest rate.
Solution
Use the simple interest formula. Solve the equation for r.
I = Prt
150 = 3000(r)(6/12)
150 = 1500r
0.10 = r, r = 10%
Exercise 5.1
1 2000 6% 1 year
1. Calculate the simple interest earned in 1 year on a deposit of P1900 if the interest rate
is 8%.
2. Calculate the simple interest earned in 1 year on a deposit of P2300 if the interest rate
is 7%.
3. You deposit P1500 in an account earning 10.4% interest. Calculate the simple interest
earned in 6 months.
4. Calculate the simple interest due on 150-day loan of P4800 if the interest rate is 7.25%.
5. Calculate the simple interest due on a two-month loan of P800 if the interest rate is
1.5% per month.
95
5.2 Future Value and Maturity Value
When you borrow money, the total amount to be repaid to the lender is the sum of the
principal and interest. This sum is calculated using the following future value or maturity value
formula for simple interest.
A = P + I, where A is the amount after the interest, I, has been added to the
principal, P.
This formula can be used for loans or investments. When used for a loan, A is the total
amount to be repaid to the lender; this sum is called the maturity value of the loan.
For an investment, such a deposit in a bank savings account, A is the total amount
on deposit after the interest has been added to the principal. Tis sum is called the future value
of the investment.
Solution
Step 1. Find the interest. Use the simple interest formula
I = Prt
I = 8000(0.0975)(8/12)
I = 520
Step 2. Find the maturity value. Use the maturity value formula.
A=P+I
A = 8000 + 520
A = 8520
The maturity value of the loan is P8520.
Recall that the simple interest formula states that I = Prt. Hence, by substitution
A=P+I
A = P + Prt
A = P (1+rt),
where A is the future value of an investment or the maturity value of a
loan, P is the principal, r is the interest rate, and t is the time period.
96
Day Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec
1 1 32 60 91 121 152 182 213 244 274 305 335
2 2 33 61 92 122 153 183 214 245 275 306 336
3 3 34 62 93 123 154 184 215 246 276 307 337
4 4 35 63 94 124 155 185 216 247 277 308 338
5 5 36 64 95 125 156 186 217 248 278 309 339
6 6 37 65 96 126 157 187 218 249 279 310 340
7 7 38 66 97 127 158 188 219 250 280 311 341
8 8 39 67 98 128 159 189 220 251 281 312 342
9 9 40 68 99 129 160 190 221 252 282 313 343
10 10 41 69 100 130 161 191 222 253 283 314 344
11 11 42 70 101 131 162 192 223 254 284 315 345
12 12 43 71 102 132 163 193 224 255 285 316 346
13 13 44 72 103 133 164 194 225 256 286 317 347
14 14 45 73 104 134 165 195 226 257 287 318 348
15 15 46 74 105 135 166 196 227 258 288 319 349
16 16 47 75 106 136 167 197 228 259 289 320 350
17 17 48 76 107 137 168 198 229 260 290 321 351
18 18 49 77 108 138 169 199 230 261 291 322 352
19 19 50 78 109 139 170 200 231 262 292 323 353
20 20 51 79 110 140 171 201 232 263 293 324 354
21 21 52 80 111 141 172 202 233 264 294 325 355
22 22 53 81 112 142 173 203 234 265 295 326 356
23 23 54 82 113 143 174 204 235 266 296 327 357
24 24 55 83 114 144 175 205 236 267 297 328 358
25 25 56 84 115 145 176 206 237 268 298 329 359
26 26 57 85 116 146 177 207 238 269 299 330 360
27 27 58 86 117 147 178 208 239 270 300 331 361
28 28 59 87 118 148 179 209 240 271 301 332 362
29 29 88 119 149 180 210 241 272 302 333 363
30 30 89 120 150 181 211 242 273 303 334 364
31 31 90 151 212 243 304 365
The day-of-the-year table can be used to determine the number of days from one date
to another date. For example, because May 15 is day 135 and August 23 is day 235, there
are 235 – 135 = 100 days from May 15 to August 23.
The table can also be used to determine the due date of a loan. For example, a 12-0-
day loan made on June 9, which is day 160, is due on day 160 – 120 = day 280, which is
October 7.
Example 3. Calculate the simple interest due on a P5000 loan made on September
20 and repaid on December 9 of the same year. The interest rate is 6%.
Solution:
September 20 is day 263. December 9 is day 343.
343 – 263 = 80. The term of the loan is 80 days. Hence,
I = Prt
I = 5000 (0.06)(80/360)
I = 66.67
The simple interest due is P66.67.
97
Try the following:
Exercise 5.2
A. In items 1 – 5, use the formula A = P (1 + rt) to calculate the maturity value of the
simple interest loan.
1 2000 6% 1 year
6. Calculate the maturity vale of a simple interest, eight-month loan of P7000 if the
interest rate is 8.7%.
7. Calculate the maturity value of a simple interest, 10-month loan of P6,600 if the
interest rate is 9.75%.
8. You deposit P880 in an account paying 9.2% simple interest. Find the future vaue
of the investment after 1 year.
9. You deposit P750 in an account paying 7.3% simple interest. Find the future value
of the investment after 1 year.
10. The simple interest charged on a 6-month loan of P2700. Find the simple interest
interest rate.
98
5.3 Compound Interest
Simple interest is generally used for loans of 1 year or less. For loans of more than 1
year, the interest paid on the money borrowed is called compound interest. Compound
interest is the interest calculated not only on the original principal, but also on any interest
that has already been earned.
To illustrate compound interest, suppose you deposit P1000 in a savings account
earning 5% interest, compounded annually (once a year).
I = Prt
I = 1000(0.05)(1) = 50
At the end of the first year, the total amount in the account is
A=P+I
A = 1000 + 50 = 1050
During the second year, the interest earned is calculated using the amount in the
account at the end of the first year.
I = Prt
I = 1050(0.05)(1) = 52.50
Note that the interest earned during the second year (P52.50) is greater than the
interest earned during the first year (P50). This is because the interest earned during the first
year was added to the original principal, and the interest for the second year wa calculated
using this sum. If the account earned simple interest rather than compound interest, the
interest earned each year would be the same (P50).
At the end of the second year, the total amount in the account is the sum of the amount
in the account at the end of the first year and the interest earned during the second year.
A=P+I
A = 1050 + 2.50 = 1102.50
The interest earned each year keeps increasing. This is the effect of compound
interest.
In the given example, the interest is compounded annually. However, compound
interest can be compounded semi-annually (twice a year), quarterly (four times a year),
monthly or daily. The frequency with which interest is compounded is the compounding
period.
If, in the preceding example, interest is compounded quarterly rather than annually,
3 1
then the first interest payment on the P1000 in the account occurs after 3 months (𝑡 = = ;
12 4
3 months is one quarter of a year). That interest is then added to the account, and the interest
earned for the second quarter is calculated using that sum.
99
A = P + I = 1025.16 + 12.81 = 1037.97
A = P(1 + rt)
A = 500[1 + 0.06(6/12)]
A = 515
Example 1. How much money will Mr. Agustin have at the end of 5 years if he deposits
P1000 at 9% interest compounded semi-annually?
𝑟
Solution: Using the formula, we have 𝐴𝑛 = 𝐴𝑜 (1 + )𝑛𝑠
𝑠
0.09 2(15)
𝐴𝑛 = 1000(1 + )
2
𝐴𝑛 = 𝑃1552.97
0.08 5(4)
𝐴𝑛 = 25000(1 + )
4
𝐴𝑛 = 𝑃37100
100
Try this
Using example no. 2, how much money will Mang Lando have if interest is
compounded
a. Quarterly
b. Monthly
c. Daily
Exercise 5.3
No. Principal (P) Rate (r) mode Time (t) Amount (A)
1. On his 18th birthday, Merly’s parents placed P10,000 in time deposit at 10%
compounded monthly. In 10 years, how much money would be available for her
educational expenses?
2. For the college education of his son, the father of a 7-year old boy wants to put
enough money in time deposit at 14% annual interest compounded yearly. How
much must he deposit ti have P60,000 when his son reaches 18 years old?
101
5.4 Credit Cards
When a customer uses a credit card to make a purchase, the customer is actually
receiving a loan. Therefore, there is frequently added cost to the consumer who purchases on
credit. This added cost may be in the form of an annual fee or interest charges on purchases.
A finance charge is an amount paid in excess of the cash price; it is the cost to the customer
for the use of credit.
Most credit card companies issue monthly bills. The due date on the bill is usually 1
month after the billing date (the date the bill is prepared and sent to the customer). If the bill is
paid in full by the due date, the customer pays no finance charge. If the bill is not paid in ful by
the due date, a finance charge is added to the next bill.
Suppose a credit card billing date is 10th day of each month. If a credit card purchase
is made on April 15, then May 10 is the billing date (the 10th day of the month following April).
The due date is June 10 (one month from the billing date). If the bill is paid in full before June
10, no finance charge is added. However, if the bill is not paid in full, interest charges on the
outstanding balance will start to accrue (be added) on June 10, and any purchase made after
June 10 will immediately start accruing interest.
The most common method of determining finance charges is the average daily
balance method. Interest charges are based on the credit card’s average daily balance,
which is calculated by dividing the sum of the total amounts owed each day of the month by
the number of days in the billing period.
Suppose an unpaid bill for P315 had a due date of April 10. A purchase of P28 was
made on April 12, and P123 was charged on April 24. A payment of P50 was made on April
15. The next billing date is May 10. The interest on the average daily balance is 1.5% per
month. Find the finance charge on the May 10 bill.
To find the finance charge, first prepare a table showing the unpaid balance for each
purchase, the number of days the balance is owed, and the product of these numbers. A
negative sign in the Payments or Purchase column of the table indicates that a payment was
made on that date.
Payments
Balance No. of Days Until Unpaid Balance
Date or
Each Day Balance Charges Times No.of Day
Purchases
April 10-11 315 2 630
April 12-14 28 343 3 1029
April 15-23 -50 293 9 2637
April 24- May 9 123 416 16 6656
Total 10,952
The sum of the total amounts amount owed each day of the month is PhP10,952.
102
Find the average daily balance.
An unpaid bill for P620 had a due date of March 10. A purchase of P214 was
made on March 15, and P67 was charged on March 30. A payment of P200 was made
on March 22. The interest on the average daily balance is 1.5% per month. Find the
finance charge on April 10 bill
Solution:
First, calculate the sum of the total amounts owed each day of the month.
Payments
Balance No. of Days Until Unpaid Balance
Date or
Each Day Balance Charges Times No.of Day
Purchases
Mar 10-14 620 5 3100
Mar 15-21 214 834 7 5838
Mar 22-29 -200 634 8 5072
Mar 30-Apr 9 67 701 11 7711
Total 21,721
The sum of the total amounts owed each day of the month is P21,721.
Try this:
A bill for P1024 was due on July 1. Purchases of P315 were made on July 7,
and P410 was charged on July 22. A payment of P400 was made on July 15.
The interest on the average daily balance is 1.2% per month. Find the finance
charge on the August 1 bill.
103
Exercise 5.4
In items 1-4, calculate the finance charge for a credit card that has the given average daily
balance and interest rate.
5.5.1 Stocks
Stocks, bonds, and mutual funds are investment vehicles, but they differ in nature.
When owners of a company want to raise money, generally to expand their business, they
may decide to sell part of the company to investors. An investor who purchases a part of the
company is said to own stock in the company. Stock is measured in shares; a share of stock
in a company is a certificate that indicates partial ownership in the company. The owners of
the certificates are called stockholders or shareholders. As owners, the stockholders share
in the profits or losses of the corporation.
A stock pays an annual dividend of P0.84 per share. Calculate the dividends
paid to a shareholder who has 200 shares of the company’s stock.
Solution:
(0.84 per share) x (200 shares) = P168
The shareholder receives P168 in dividends.
A dividend yield, which is used to compare companies’ dividends, is the
share amount of the dividend divided by the stock price and is expressed as a
percent. Determining a dividend yield is similar to calculating the simple interest
rate earned, the stock price as the principal, and the yield as the interest rate.
104
Example 2. (Calculate a dividend yield)
A stock pays an annual dividend of P1.75 per share. The stock is trading at
P70. Find the dividend yield.
Solution:
I = Prt
1.75 = 70 r (1), let I = annual dividend, P = stock price
r = 0.025, the dividend yield is 2.5%
5.5.2 Bonds
When a corporation issues stock, it is selling part of the company to the stockholders.
When it issues a bond, the corporation is borrowing money from the bondholders; a
bondholder lends money to a corporation. Corporations and government agencies issue
bonds. These entities need money to operate—for example, to fund the government deficit,
repair roads, or build a new factory—so they borrow money from the public by issuing bonds.
Bonds are usually issued in units of P1000. The price paid for the bond is the face
value. The issuer promises to repay the bondholder on a particular day, called the maturity
date, at a given rate of interest, called the coupon.
Assume that a bond with a P1000 face value has a 5% coupon and a 10-year maturity
date. The bondholder collects interest payments of P50 in each of those 10 years. The
payments are calculated using the simple interest formula, as shown below.
I = Prt
I = 1000(0.05)(1)
I = 50
At the end of the 10-year period, the bondholder receives from the issuer the P1000
face value of the bond.
A bond with a P10,000 face value has a 3% coupon and a 5-year maturity date.
Calculate the total of the interest payments paid to the bondholder.
Solution: Use the simple interest formula to find the annual interest payments.
Substitute the following values into the formula: P = 10,000, r = 3%, and t = 1.
An investment trust is a company whose assets are stocks and bonds. The purpose of
these companies is not to manufacture a product but to purchase stocks and bonds with the
hope that their value will increase. A mutual fund is an example of investment trust.
105
When investors purchase shares in a mutual fund, they are adding their money to a
pool along with many other investors. The investments within a mutual fund are called the
funds’ portfolio. The investors in a mutual fund share the fund’s profits or losses from the
investments in the portfolio.
Because a mutual fund owns many different stocks, each share of the fund owns a
fractional interest in each of the companies. Each day, the value of a share in the fund, called
the net asset of the fund, or NAV, depends on the performance of the stocks in the fund. It is
calculated by the following formula:
Example 4. (Calculate the net asset value of, and the number of, shares purchased
in a mutual fund)
A mutual fund has P600M worth of stock, P5M worth of bonds, and P1M in
cash. The fund’s total liabilities amount to P2M. There are 25 million shares
outstanding. You invest P15,000 in this fund.
Solution:
A − L 606 million − 2 million
a. NAV = ̅̅̅̅̅̅̅̅ = ̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅ = 24.16
N 25 million
The NAV of the fund is P24.16
15,000
b. = 620
̅̅̅̅̅̅̅̅̅̅
24.16
Try these:
1. Annual Dividends. A stock pays an annual dividend of P1.02 per share. Calculate
the dividends paid to a shareholder who has 375 shares of the company’s stock
2. Annual Dividends. A stock pays an annual dividend of P0.58 per share. Calculate
the dividends paid to a shareholder who has 1500 shares of the company’s stock.
3. Dividend Yield. Find the dividend yield for a stock that pays an annual dividend of
P1.24 per share and has a current price of P49.375. Round to the nearest hundredth
of a percent.
106
4. Dividend Yield. A cooperative has declared an annual dividend of P0.50 per share.
The stock is trading at P40 per share. Find the dividend yield.
Initial Expenses. When you purchase a home, you generally make a downpayment
and finance the remainder of the purchase price with a loan obtained through a bank or
savings or loan association. The amount of the downpayment can vary, but it is normally
between 10% and 30% of the selling price. The mortgage is the amount that is borrowed to
buy the real estate. The amount of the mortgage is the difference between the selling price
and the downpayment.
This formula is used to find the amount of the mortgage. For example, suppose you
buy a P240,000 home with a downpayment of 25% of the purchase price.
Then find the mortgage by subtracting the downpayment from the selling price.
The downpayment is generally the largest initial expense in purchasing a home, but
there are other epenses associated with the purchase. These payments at the closing, when
the sale of the house is finalized, and are called closing costs. The bank may charge fees
for attorneys, credit reports, loan processing, and title searches. There may also be a loan
origination fee. This fee is usually expressed in points. One point is equal to 1% of the
mortgage.
Suppose you purchase a home and obtain a loan for P180,000. The bank charges a
fee of 1.5 points. To find the charge for points, multiply the loan amount by 1.5%.
Solution:
Downpayment = 20% of P392,000 = 78,400
The downpayment is P78,400.
107
= 313, 600
Hence, the mortgage is P313,600.
Mortgages
When a bank agrees to provide you with a mortgage, you agree to pay off that loan in
monthly payments. If you fail to make the payments, the bank has the right to foreclose, which
means that the bank takes possession of the property and has the right to sell it. There are
many types of mortgages available to home buyers today, so the terms of mortgages differ
considerably. Some mortgages are adjustable rate mortgages (ARMs). The interest rate
charged on an ARM is adjusted periodically to more closely reflect current interest rates. The
mortgage agreement specifies exactly how often and how much the interest rate can change.
A fixed rate mortgage or conventional mortgage is one in which the interest rate
charged on the loan remains the same throughout the life of the mortgage. For a fixed rate
mortgage, the amount of the monthly payment also remains unchanged throughout the term
for the loan.
The term of a mortgage can vary. Terms of 15, 20, 25, and 30 years are most common.
The monthly payment on a mortgage is the mortgage payment. The amount of the
mortgage payment depends on the amount of the mortgage, the interest rate on the loan, and
the term of the loan. This payment is calculated by using the payment formula for an APR
loan.
Solution:
annual interest rate 0.065
a. 𝑖 = ̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅ = ̅̅̅̅̅̅̅
number of payments per year 12
0.065
12
𝑃𝑀𝑇 = 134,000 (̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅ ) = 846.97
0.065 −360
1 − (1 + )
12
108
b. To determine the total of the payments, multiply the number of payments (360) by
the monthly payment (846.97)
846.97(360) = 304,909.20
c. To determine the amount of interest paid, subtract the mortgage from the total of
the payments
Exercise 5.5
1. You buy a P258,000 home with downpayment of 25%. Find the amount of the
downpayment and the mortgage amount.
2. Mr. Co purchases a home and secures a loan of P250,000. The bank charges a fee of
2.25 points. Find the charge for the points.
3. The purchase price of a home is P309,000. A downpayment of P30% is made. The
bank charges P350 fees plus 3 points. Find the total of the downpayment and the
closing costs.
109
Chapter 6
Codes
Learning Outcomes:
Introduction
Language teaches people how to communicate and socialize with one another. It
enables them to comprehend all the things that occur in the environment. On the other hand,
coding operates in its own unique language. Understanding coding helps us to know how the
technology works around us and makes us much better in utilizing and interacting with it.
Computers exist in almost all of everyday functions of the society such as financing
and accounting, transportation, researches and various professions. Computers would be
useless without the utility of coding. Codes are the set of instructions that computers follow.
Computers operate based from the codes that tells it what to do.
The Decimal Number System is a 10 digit (deci=10) number system that that people
use in daily activities. The digits are 0, 1, 2, 3, 4, 5, 6, 7, 8 and 9. Its base is 10 and each
position is represented by the powers of 10.
4 0 6 2
103 102 101 100
thousands hundreds tens ones
110
Binary Number System
The Binary Number System, on the other hand, is a number system familiar to people
who operates computers. It is a two-digit number system represented by the numbers zero (0)
and one (1). Its base is 2 and the positions are represented by the power of 2.
1 0 0 1
23 22 21 20
eights fours twos ones
Most computers and computerized devices use binary codes as a method to store, send and
receive information. UTF-8 (Unicode Transformation Format) as designed to represent
characters and symbols using an 8-bit code. A bit (short for binary digit) is the computer’s
smallest unit of data. An 8-bit code is called a byte.
http://www.sciencefriday.com/wp-content/uploads/2015/08/UTF8-Table-7802.png
Example 1
Suppose we want to convert the word “INTEGER” in a binary code. The
process is to find the corresponding codes of the letters I, N, T, E, G, E, and R.
Solution:
I 01001001 G 01000111
N 01001110 E 01000101
T 01010100 R 01010011
E 01000101
Thus, the meaning of messages written in binary codes can also be deciphered using
the UTF-8 table.
111
Example 2
Decipher the code:
Solution:
Using the table, we have the message “M A T H I S F U N”
In writing a number, position of the digits is represented by the powers of the base. For
instance, the decimal number “432” can also be written as “4(10 2) + 3(101) + 2(100)”. Binary
numbers, read from right to left, can also be written using the powers of 2. Numbers such as
“10001” and “10101” may be understood as either decimal or binary, so using a subscript is
important to remove the confusion. Hence, 100012 is in binary while 1000110 is in decimal.
To convert binary numbers to decimal numbers, the face value in each position is
multiplied to its corresponding power of 2. This is expressed as:
Example 3:
Solution:
To convert decimal numbers to binary numbers, continuous division by 2’s is used until
the quotient is zero. The resulting remainders represent the binary number. This procedure is
presented below:
112
Example 4
113
Checking may also be done to verify the result by using the process of converting
binary numbers to decimal numbers.
a. 1111001 = 1(26) + 1(25) + 1(24) + 1(23) + 0(22) + 0(21) + 1(20)
= 64 + 32 + 16 + 8 + 0 + 0 + 1
= 121
b. 100101100 = 1(28) + 0(27) + 0(26) + 1(25) + 0(24) + 1(23) + 1(22) + 0(21) + 0(20)
= 256 + 0 + 0 + 32 + 0 + 8 + 4 + 0 + 0
= 300
c. 1111101000 = 1(29) + 1(28) + 1(27) + 1(26) + 1(25) + 0(24) + 1(23) + 0(22) + 0(21) + 0(20)
= 512 + 256 + 128 + 64 + 32 + 0 + 8 + 0 + 0 + 0
= 1000
Just like in decimal numbers, arithmetic operations can also be performed on binary
numbers.
Adding binary numbers follows the process in adding decimal numbers. However, it
should be noted that only 0 and 1 are used in the binary system. Thus, the sum of 0 and 0 is
0; the sum of 0 and 1 is 1, and the sum of 1 and 1 is 10.
Example 5
Solution:
1
1 1 0 1
1 0 1 0
1 0 1 1 1
Solution:
1 1 1 1 1
1 0 0 0 1 1
1 0 1 1 0 1
1 0 1 0 0 0 0
114
c. Find the sum of 1111011 and 1111110.
Solution:
1 1 1 1 1 1
1 1 1 1 0 1 1
1 1 1 1 1 1 0
1 1 1 1 1 0 0 1
To check if the result is correct, convert the addends and sum to decimal numbers.
To subtract binary numbers, the smaller number must be subtracted from the bigger
number. The difference between binary numbers must also be composed of binary digits 0
and 1. The difference of 1 and 1 is 0; the difference of 0 and 0 is 0; 1 subtracted by 0 is 1; and
0 subtracted by 1 is also 0 provided 1 will be borrowed from the next digit of the minuend.
Example 6
a. Subtract 10001 from 11110.
Solution:
115
b. Find the difference of 111000 and 100011.
Solution:
Example 7
a. Find the product of 1101 and 1011.
Solution:
1 1 0 1
1 0 1 1
1 1 1
1 1 0 1
1 1 0 1
0 0 0 0
1 1 0 1
1 0 0 0 1 1 1 1
116
b. Multiply 11011 by 10101.
1 1 0 1 1
1 0 1 0 1
1 1 1 1 1
1 1 0 1 1
0 0 0 0 0
1 1 0 1 1
0 0 0 0 0
1 1 0 1 1
1 0 0 0 1 1 0 1 1 1
In dividing binary numbers, the bit length must always be observed. Furthermore, skill
in adding, subtracting and multiplying binary numbers is essential in this operation.
Example 8
a. Divide 10010 by 11.
Solution:
1 1 0
1 1 1 0 0 1 0
1 1
1 1
1 1
0
0
0
117
Division of binary numbers follows the same process in decimal division. It starts by
dividing the leftmost digit of the dividend to the divisor.
The leftmost digit is 1 but 11 does not divide 1, the same with 10. Therefore, 100 is
used to divided resulting to a quotient of 1.
To check:
10010 = 1(24) + 0(23) + 0(22) + 1(21) + 0(20) = 18
11 = 1(20) + 1(20) = 3
110 1(22) + 1(21) + 0(20) = 6
6.5 Errors
Error is a condition when the output information does not match with the input
information. It usually happens during transmission of information through digital signal. During
transmission, digital signals suffer from noise that can introduce errors in the binary bits
travelling from one system to other. That means 0 bit may change to 1 or a 1 bit may change
to 0.
https://www.tutorialspoint.com/computer_logical_organization/images/error.jpg
118
Types of Errors
0 1 0 0 1 1
0 1 0 1 1 1
0 1 0 0 1 1
0 1 1 0 0 1
c. Burst Error
Burst error happens when two or more bits are changed in the data
transmission and the error burst is calculated from the first bit change to the last bit
change. It usually occurs in serial communication system wherein the noise may
disturb the sequencing or the positioning of multiple bits. This type of error is
difficult to solve and requires multiple and complex error detecting and correcting
codes.
0 1 0 1 0 1
Burst Error
0 1 1 1 0 0
Error detection is a process used to verify the correctness of information from the input
data to the output data. This is done by adding redundant bit to the input data then checking
of possible errors in the output data. These redundant bits are used as error detecting codes
to predict an error in the transmission of data.
119
http://ecetutorials.com/digital-electronics/error-detection-and-correction/
a. Even Parity – Even parity happens when the number of 1’s in a given code is
even.
Example: 10110001, 11100111, 11111111
b. Odd Parity – Odd parity happens when the number of 1’s in a given code is odd.
Example: 10000000, 10001001, 11101111
The parity bit can be either 0 or 1 depending on the required parity. For an even parity,
0 or 1 is used if the entire code has even number of 1’s. For an odd parity, 0 or 1 is used if the
entire code has an odd number of 1’s.
3. Checksum
Each bit of data in a code are separated into sections. This process uses 1’s
aggregate to get the sum of the sections. The sum is aggregated to get the checksum. The
checksum section is then transmitted together with the data sections. The output data is then
checked using 1’s aggregate to get the sum. The result must be zero to determine the data is
correct.
120
6.7 Hamming Codes
Like the previous error-detecting codes, Hamming code utilizes the concept of parity
bits, which are additional bits attached to digital data in the process of transmission from input
to output. Furthermore, multiple parity bits are used not just to detect bit errors but also locate
its position in the given code.
Redundant Bits
Extra binary bits, called redundant bits, are generated and attached to the data to make
sure that no bit were lost in the transmission process. The following formula is used to
determine the number of redundant bits:
𝟐𝒓 ≥ 𝒅 + 𝒓 + 𝟏
where r represents the number of redundant bits and d represents the data bit.
The number of redundant bits is done using trial and error method. Thus, the use of
smaller numbers is advised. For instance, the data bit is 5, the number of redundant bits can
be calculated using the formula above. Try 3 for instance.
𝟐𝟑 ≥ 𝟓 + 𝟑 + 𝟏
𝟖≥𝟗
𝟐𝟒 ≥ 𝟓 + 𝟒 + 𝟏
𝟏𝟔 ≥ 𝟏𝟎
The result satisfies the formula. Therefore, 4 redundant bits are needed in a data bit of 5.
The key in using Hamming code is the use of extra parity bits to identify a single error.
1. All bit positions that are powers of 2 are marked as parity bits (1, 2, 4, 8, … ). Use
the binary form to represent each power (1, 10, 11, 001, … ).
2. All remaining positions are marked as data bits.
3. Each data bit is uniquely used as set of parity bits using its binary form.
a. Parity bit 1 includes all bit positions whose binary form includes 1 in the least
significant position.
b. Parity bit 2 includes all bit positions whose binary form includes 1 in the second
position from the least significant bit.
c. Parity bit 3 includes all bit positions whose binary form includes 1 in the third
position from the least significant bit.
d. Parity bit 4 includes all bit positions whose binary form includes 1 in the fourth
position from the least significant bit.
e. In general, each parity bit covers all bits where the bitwise and of the parity
position and the bit position is non-zero.
4. Even parity is used in determining the value of parity bits. Use a parity bit of 1 if the
total number of 1’s in the positions checked is odd. Otherwise, use a parity bit of 0.
121
Positioning the Redundant Bits
Using the procedure above, if the data bit is 5, then the number of redundancy bits is
4. Thus, the total number of bits is 9.
9 8 7 6 5 4 3 2 1
1001 1000 111 110 101 100 11 10 1
d9 d8 d7 d6 d5 d4 d3 d2 d1
Redundant Bits
Thus, the new bit representation is:
9 8 7 6 5 4 3 2 1
1001 1000 111 110 101 100 11 10 1
d9 r4 d7 d6 d5 r3 d3 r2 r1
Redundant Bits
Suppose the input data is 11001. The position of the redundancy bits are as follows:
9 8 7 6 5 4 3 2 1
1001 1000 111 110 101 100 11 10 1
1 r4 1 0 0 r3 1 r2 r1
9 8 7 6 5 4 3 2 1
1001 1000 111 110 101 100 11 10 1
1 1 0 0 1 r1
The highlighted bits are bit positions included in parity bit 1. Since the total number of
1’s is odd, then it has a parity bit of 1. Therefore, r1 is 1.
2. To find r2, check the bit positions with 1 in the second position from the least significant
bit. Then check for even parity.
9 8 7 6 5 4 3 2 1
1001 1000 111 110 101 100 11 10 1
1 1 0 0 1 r2
The highlighted bits belongs to parity bit 2. Since the total number of 1’s is even, then
it has a parity bit of 0. Therefore, r2 is 0.
3. To find r3, check the bit positions with 1 in the third position from the least significant
bit. Then check for even parity.
9 8 7 6 5 4 3 2 1
1001 1000 111 110 101 100 11 10 1
1 1 0 0 r3 1
Since the total number of 1’s is odd, then it has a parity bit of 1. Therefore, r3 is 1.
4. To find r4, check the bit positions with 1 in the fourth position from the least significant
bit. Then check for even parity.
9 8 7 6 5 4 3 2 1
1001 1000 111 110 101 100 11 10 1
1 r4 1 0 0 1
122
Since the total number of 1’s is odd, then it has a parity bit of 1. Therefore, r4 is 1.
123
Exercise 6.1
b. Decimal to Binary
1. 25 6. 555
2. 49 7. 843
3. 101 8. 1000
4. 220 9. 1555
5. 300 10. 7777
b. Subtraction
1. 11001 – 10101
2. 111001 – 110111
3. 1100001 – 1010101
4. 1100111 – 1010001
5. 10101011 - 10011100
c. Multiplication
1. (1011)(101)
2. (1100)(110)
3. (1010)(111)
4. (1111)(1101)
5. (10101)(10111)
d. Division
1. 110011 ÷ 11
2. 1000000 ÷ 100
3. 1111101 ÷ 101
4. 11110000 ÷ 110
5. 1010111100 ÷ 111
124
C. Determine the parities of the following codes.
1
REFERENCES
Alejan, R. O., et al. (2018). Mathematics in the Modern World. Malabon: Mutya Publishing
House, Inc.
Ariola, M. Q., et al. (2018). Deal with the World of Today through Mathematics. Malabon:
Mutya Publishing House, Inc.
Bluman, Allan G. (2008). Elementary Statistics, A Step by Step Approach. Mc. Graw-Hill
Companies Inc. New York
Error Detection and Correction. (2019). Retrieved from Tutorials Point Web site:
https://www.tutorialspoint.com/computer_logical_organization/error_codes
Error Detection. (2019). Retrieved from Teach Computer Science Web site:
https://teachcomputerscience.com/error-detection/
Navidi, William Cyrus (2006) Statistics for Engineers and Scientists. McGraw-Hill Companies
Inc. New York
Nelson, S. (2018). 5 Reasons why Coding is Important for Young Minds. Retrieved from
Learning Resources - Blog: http://blog.learningresources.com/5reasonskidscoding/
Pandey, H. (2018). Computer Network | Hamming Code. Retrieved from Geeks for Geeks
Web site: https://www.geeksforgeeks.org/computer-network-hamming-code/
Rouse, M. (2010). What is bit (binary digit)? Retrieved from What Is Web site:
https://whatis.techtarget.com/definition/bit-binary-digit
Rouse, M. (2014). Hamming code. Retrieved from WhatIS.com Web site:
https://whatis.techtarget.com/definition/Hamming-code
Sirug, W. S. (2015). Basic Probability and Statistics, Revised Edition. Manila: Minshapers
Co., Inc.
Talents Print. (2018). Number System and Its Types. Retrieved from Talents Print:
https://www.talentsprint.com/blog/2018/01/number-system-i-what-is-number-syste.html
Tutorials Point. (2012). Computer - Number System. Retrieved from Tutorials Point:
https://www.tutorialspoint.com/computer_fundamentals/computer_number_system.htm
Why Is Coding So Important? (2018). Retrieved from Forbes Web site:
https://www.forbes.com/sites/quora/2018/08/16/why-is-coding-so-important/#
17690e5b2 adc
Zorilla, Roland S., Esller, Beda H., Mendoza, Violeta C., Cabrera, Milna K. (2013) Statistics:
Basic Concepts and Applications, Mutya Publishing House, Inc.
http://ecetutorials.com/digital-electronics/error-detection-and-correction/
http://www.sciencefriday.com/wp-content/uploads/2015/08/UTF8-Table-7802.png
https://users.cs.fiu.edu/~downeyt/cop3402/hamming.html
https://www.analyzemath.com/statistics/linear_regression.html
https://www.techopedia.com/definition/5422/data-management
https://stattrek.com/regression/linear-regression.aspx
http://faculty.webster.edu/woolflm/variability.html