Lecture 1 Data and Statistics
Lecture 1 Data and Statistics
Data
Data Sources
Descriptive Statistics
Statistical Inference
Slide 2
Applications in
Business and Economics
Accounting
Public accounting firms use statistical sampling
procedures when conducting audits for their
clients.
Finance
Financial advisors use a variety of statistical
information, including price-earnings ratios and
dividend yields, to guide their investment
recommendations.
Marketing
Electronic point-of-sale scanners at retail
checkout counters are being used to collect data
for a variety of marketing research applications.
Slide 3
Applications in
Business and Economics
Production
A variety of statistical quality control charts are
used to monitor the output of a production
process.
Economics
Economists use statistical information in
making forecasts about the future of the
economy or some aspect of it.
Slide 4
Data
Scales of Measurement
Slide 5
Data and Data Sets
Data are the facts and figures that are
interpreted.
Slide 6
Elements, Variables, and
Observations
The elements are the entities on which data are
collected.
A variable is a characteristic of interest for the
elements.
The set of measurements collected for a
particular element is called an observation.
The total number of data values in a data set is
the number of elements multiplied by the
number of variables.
Slide 7
Data, Data Sets, Elements,
Variables, and Observations
Variables
Stock Annual Earn/
Company Exchange Sales($M) Sh.($)
Dataram AMEX 73.10 0.86
EnergySouth OTC 74.00 1.67
Keystone NYSE 365.70 0.86
LandCare NYSE 111.40 0.33
Psychemedics AMEX 17.60 0.13
Slide 8
Scales of Measurement
Scales of measurement include:
Nominal
Ordinal
Interval
Ratio
The scale determines the amount of information
contained in the data.
The scale indicates the data summarization and
statistical analyses that are most appropriate.
Slide 9
Scales of Measurement
Nominal
may be used.
Slide 10
Scales of Measurement
Nominal
Example:
Students of a university are classified by
the school in which they are enrolled using
a nonnumeric label such as Business,
Humanities, Education, and so on.
Alternatively, a numeric code could be used
for the school variable (e.g. 1 denotes
Business, 2 denotes Humanities, 3 denotes
Education, and so on).
Slide 11
Scales of Measurement
Ordinal
Slide 12
Scales of Measurement
Ordinal
Example:
Students of a university are classified by
their class standing using a nonnumeric
label such as Freshman, Sophomore, Junior,
or Senior.
Slide 13
Scales of Measurement
Interval
Slide 14
Scales of Measurement
Interval
Example:
Melissa has an SAT score of 1205, while
Kevin has an SAT score of 1090. Melissa
scored 115 points more than Kevin.
Slide 15
Scales of Measurement
Ratio
The data have all the properties of
interval data and the ratio of two values
is meaningful.
Variables such as distance, height,
weight, and time use the ratio scale.
This scale must contain a zero value that
indicates that nothing exists for the
variable at the zero point.
Slide 16
Scales of Measurement
Ratio
Example:
Slide 17
Qualitative and Quantitative Data
Data can be further classified as being
qualitative or quantitative.
Slide 18
Qualitative Data
Qualitative data are labels or names used to
identify an attribute of each element.
Slide 19
Quantitative Data
Quantitative data indicate either how many or
how much.
Quantitative data that measure how many
are discrete.
Quantitative data that measure how much
are continuous because there is no separation
between the possible values for the data..
Quantitative data are always numeric.
Ordinary arithmetic operations are meaningful
only with quantitative data.
Slide 20
Cross-Sectional and Time Series
Data
Cross-sectional data are collected at the same
or approximately the same point in time.
Example: data detailing the number of
building permits issued in June 2000 in each
of the counties of Texas
Time series data are collected over several time
periods.
Example: data detailing the number of
building permits issued in Travis County,
Texas in each of the last 36 months
Slide 21
Data Sources
Existing Sources
Data needed for a particular application
might already exist within a firm.
Detailed information is often kept on
customers, suppliers, and employees for
example.
Substantial amounts of business and
economic data are available from
organizations that specialize in
collecting and maintaining data.
Slide 22
Data Sources
Existing Sources
Slide 23
Data Sources
Internet
The Internet has become an important source
of data.
Most government agencies, like the Bureau of
the Census (www.census.gov), make their data
available through a web site.
More and more companies are creating web
sites and providing public access to them.
A number of companies now specialize in
making information available over the
Internet.
Slide 24
Data Sources
Statistical Studies
Statistical studies can be classified as either
experimental or observational.
In experimental studies the variables of
interest are first identified. Then one or more
factors are controlled so that data can be
obtained about how the factors influence the
variables.
In observational (nonexperimental) studies no
attempt is made to control or influence the
variables of interest.
•A survey is perhaps the most common type
of observational study.
Slide 25
Data Acquisition Considerations
Time Requirement
Searching for information can be time
consuming.
Information might no longer be useful by the
time it is available.
Cost of Acquisition
Organizations often charge for information
even when it is not their primary business
activity.
Data Errors
Using any data that happens to be available
or that were acquired with little care can lead
to poor and misleading information.
Slide 26
Descriptive Statistics
Slide 27
Example: Hudson Auto Repair
The manager of Hudson Auto would like to
have a better understanding of the cost of
parts used in the engine tune-ups
performed in the shop. She examines 50
customer invoices for tune-ups. The costs
of parts, rounded to the nearest dollar, are
listed below.
91 78 93 57 75 52 99 80 97 62
71 69 72 89 66 75 79 75 72 76
104 74 62 68 97 105 77 65 80 109
85 97 88 68 83 68 71 69 67 74
62 82 98 101 79 105 79 69 62 73
Slide 28
Example: Hudson Auto Repair
Tabular Summary (Frequencies and Percent
Frequencies)
Parts Percent
Cost ($) Frequency Frequency
50-59 2 4
60-69 13 26
70-79 16 32
80-89 7 14
90-99 7 14
100-109 5 10
Total 50 100
Slide 29
Example: Hudson Auto Repair
Graphical Summary (Histogram)
18
16
14
Frequency
12
10
8
6
4
2
Parts
50 60 70 80 90 100 110 Cost ($)
Slide 30
Example: Hudson Auto Repair
Numerical Descriptive Statistics
Slide 31
Statistical Inference
Statistical inference is the process of using data
Slide 32
Example: Hudson Auto Repair
Process of Statistical Inference
1. Population
consists of all 2. A sample of 50
tune-ups. Average engine tune-ups
cost of parts is is examined.
unknown.
Slide 33