ADDB - Week 1
ADDB - Week 1
• Time series: data collected over several time periods focusing on certain
groups of people, specific events, or objects. Time series can include hourly,
daily, weekly, monthly, quarterly, or annual observations.
Types of data (Data format)
• Structured data: generally refers to data that has a well-defined length and
format. Structured data reside in a predefined row-column format.
Structured data generally consist of numerical information that is objective.
• Which of the following variables are qualitative and which are quantitative?
If the variable is quantitative, then specify whether the variable is discrete or
continuous.
• Points scored in a football game.
• Colors of cars in a mall parking lot.
• Heights of 15-year-olds.
Types of measurement scales
• Interval: assured that the differences between scale values are meaningful.
Thus, the arithmetic operations of addition and subtraction are meaningful. the
value of zero is unspecified.
• Example: Temperature
• Ratio: Ratio data have all the characteristics of interval data as well as a true
zero point, which allows us to interpret the ratios of values. A ratio scale is used
to measure many types of data in business analysis.
• Ex: Sales, profits, weights
Exercise
• Frequency distribution groups data into categories and records the number
of observations that fall into each category
• A brief outlook on automotive brands sold in February 2021 is given below:
Toyota Toyota Toyota Toyota Honda Nissan Toyota
Toyota Toyota Nissan Toyota Toyota Toyota Nissan
Toyota Toyota Toyota Nissan Toyota Nissan Toyota
Nissan Toyota Toyota Toyota Nissan Toyota Toyota
Summarizing qualitative data
• A local restaurant is committed to providing its patrons with the best dining experience possible. On a
recent survey, the restaurant asked patrons to rate the quality of their entrées. The responses ranged
from 1 to 5, where 1 indicated a disappointing entrée and 5 indicated an exceptional entrée. The
results of the survey are as follows:
a. Construct frequency and relative frequency distributions that summarize the survey’s results.
b. Are patrons generally satisfied with the quality of their entrées? Explain.
Summarizing quantitative data
• The number of classes usually ranges from 5 to 20. This is a guideline, not an
absolute rule.
• Approximating the class width:
What is the price range?
For quantitative data, a frequency distribution groups data into intervals
called classes and
records the number of observations that falls into each class.
300 up to 400 4 4
400 up to 500 11 4 + 11 = 15
500 up to 600 14 4 + 11 + 14 = 29
600 up to 700 5 4 + 11 + 14 + 5 = 34
700 up to 800 2 4 + 11 + 14 + 5 + 2 = 36
Total 36
Relative & cumulative relative frequency
distribution
a relative frequency distribution identifies the proportion (or the
fraction) of observations that falls into each class—that is,
700 up to 800 2 2/36 = 0.06 0.11 + 0.31 + 0.39 + 0.14 + 0.06 ≈ 1.0
Total 36 1.0
Histograms
(students are asked to tell the difference between two provided histograms)
• An ogive connects a series of neighboring points where each point represents the
upper limit of a particular class and its associated cumulative frequency or
cumulative relative frequency.
• An ogive differs from a polygon in that the use of the upper limit of each class as the
x-coordinate is evident and the cumulative frequency or cumulative relative
frequency of the corresponding class as the coordinate. To close the ogive, we need
to use the lower end by intersecting the x-axis at the lower limit of the first class.
Example of Ogive
Stem & Leaf Diagram