Business Statistics
Lecturer
Dr. Nguyen Thi Xuan Mai
Faculty of Statistics, National Economics University
Address: Room No. 801, Building A1, NEU
Handphone: 0983.608.295
Email: [email protected]
Website: www.khoathongke.neu.edu.vn
8/11/2022
Objectives
By the end of the course, students should be able to:
• Offer appropriate and effective descriptions of sets of data
• Describe data with graphical, tabular, and quantitative summaries
• Calculate and apply measures of central location and measures of dispersion
• Calculate and interpret confidence intervals for population means and
proportions based on sample data
• Form and test well‐defined hypotheses about a population’s mean or proportion
• Conduct and interpret the results of a simple regression analysis
• Use time series analysis and forecasting models to make better forecasts
• Calculate and interpret index numbers
In addition, you will learn some of the basic skills for using SPSS to present and
analyze data.
Content
• Chapter 1: Introduction to Statistics
• Chapter 2: Presenting Data in Tables and Charts
• Chapter 3: Numerical Descriptive Measures
• Chapter 4: Sampling surveys
• Chapter 5: Simple Linear Regression
• Chapter 6: Time-series and Forecasting
• Chapter 7: Index numbers
Textbook
Statistics for Business and Economics, Thirteenth Edition
David R. Anderson, Dennis J. Sweeney, Thomas A. Williams, Jeffrey D.
Camm, James J. Cochran
South‐Western, Cengage Learning, 2017
Assessment & Grading Policy
• Attending class: 10%
• Mid‐term exams (open books, open notes): 40%
• Final exam (open books, open notes): 50%
Chapter 1. Introduction to Statistics
Learning objectives
This chapter will help you learn:
How statistics is used in economics and business
Why learn Statistics?
Everyday decisions are based on incomplete information, i.e., we must deal with
uncertainty.
Consider:
• Will the job market be strong when I graduate?
• Will the price of Vinamilk stock be higher in six months than it is now?
• Will interest rates remain low for the rest of the year if the state budget deficit is
as high as predicted?
What is Statistics?
Statistical Applications in Economics and Business
Production: Statistical quality control charts are used to monitor the output of
a production process
Economics: We estimate and test economic models and their predictions, and use
empirical models for prediction, forecasting, and policy analysis
…
Some basic concepts and terminologies
• Populations & Samples
• Parameters & Statistics
• Variables & Data
• Elements & Observations
Populations & Samples
• A population is the entire set of observations under study
• E.g.: A population of all NEU students
        A population of all enterprises located in Vietnam
• A sample is a subset of a population
• E.g.: A sample of 100 NEU students
        A sample of 500 enterprises located in Vietnam
[Figure: a population of elements a to z, with the sample b, c, g, i, n, o, r, u, y drawn from it]
Populations & Samples
• Example:
In a recent survey, 250 students at NEU were asked if they smoked cigarettes
regularly, 35 of the students said yes.
Identify the population and the sample.
Population: responses of all students at NEU
Sample: responses of the 250 students in the survey
Parameters & Statistics
Parameter: a numerical measure that describes a population
Statistic: a numerical measure that describes a sample
Note: A sample statistic can differ from sample to sample, whereas the
population parameter is constant.
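The note above can be illustrated with a small simulation. This is only a sketch: the population of weekly incomes is simulated, and the figures (mean 325, spread 60, population size 5,000, sample size 450) are assumptions chosen for illustration, not real data.

```python
import random
import statistics

random.seed(42)  # fixed seed so the illustration is reproducible

# Hypothetical population: weekly incomes (in $) of 5,000 students (simulated).
population = [random.gauss(325, 60) for _ in range(5000)]

# Parameter: computed from the whole population, so it is one fixed number.
population_mean = statistics.mean(population)

# Statistic: computed from a sample, so it changes with the sample drawn.
sample_means = [
    statistics.mean(random.sample(population, 450)) for _ in range(3)
]

print(f"Population mean (parameter): {population_mean:.2f}")
for i, m in enumerate(sample_means, start=1):
    print(f"Sample {i} mean (statistic): {m:.2f}")
```

Each run of `random.sample` draws a different set of 450 students, so the three sample means differ slightly while the population mean stays the same.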
Parameters & Statistics
• Example:
Decide whether the numerical value describes a population parameter or a sample
statistic.
a. A recent survey of a sample of 450 college students reported that the average
weekly income for students is $325.
Parameters & Statistics
A politician who is running for the office of mayor of a city with 25,000 registered
voters commissions a survey. In the survey, 48% of the 200 registered voters
interviewed say they plan to vote for her.
a. What is the population of interest?
b. What is the sample?
c. Is the value 48% a parameter or a statistic? Explain
Variables & Data
A variable is a characteristic of an item or individual
Eg: Height of female students
Skin colour of international students in class A
Data is simply a “scientific” term for facts, figures, information and
measurements
→ Data are the different values associated with a variable
Eg: Height of 10 female students: 1.6, 1.7, 1.55, 1.59, 1.5, 1.58, 1.64,
1.67, 1.58, 1.55
Skin colour of 6 international students in class A: black, white, white,
yellow, brown, yellow
The data collected in a particular study are referred to as the data set.
Elements & Observations
• The elements are the entities on which data are collected.
→ A variable is a characteristic of interest for the elements.
• The set of measurements collected for a particular element is called an
observation.
• The total number of data values in a data set is the number of elements
multiplied by the number of variables.
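A minimal sketch of these definitions, using a made-up data set: the company names and all values below are hypothetical, and the variable names (`stock_exchange`, `annual_sales_musd`, `earn_per_share_usd`) are chosen only for illustration.

```python
# Hypothetical data set: keys are the elements (companies),
# and each inner dictionary is the observation for that element.
data_set = {
    "Company A": {"stock_exchange": "HOSE", "annual_sales_musd": 120.5, "earn_per_share_usd": 1.20},
    "Company B": {"stock_exchange": "HNX", "annual_sales_musd": 75.0, "earn_per_share_usd": 0.85},
    "Company C": {"stock_exchange": "HOSE", "annual_sales_musd": 310.2, "earn_per_share_usd": 2.40},
    "Company D": {"stock_exchange": "HNX", "annual_sales_musd": 42.8, "earn_per_share_usd": 0.30},
}

elements = list(data_set)                         # entities on which data are collected
variables = list(next(iter(data_set.values())))   # characteristics of interest
observation = data_set["Company A"]               # all measurements for one element

# Number of data values = number of elements x number of variables.
n_values = len(elements) * len(variables)
counted = sum(len(obs) for obs in data_set.values())  # direct count, as a check

print(f"{len(elements)} elements x {len(variables)} variables = {n_values} data values")
```

With 4 elements and 3 variables, the data set holds 12 data values, and the direct count agrees with the product.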
Summary Table
Element names: Company
Variables: Stock Exchange, Annual Sales ($M), Earn/Share ($)
Data set: the values of all variables for all elements
[Table rows not recovered]
Types of Data
Data
• Categorical (Qualitative)
• Numerical (Quantitative)
   • Discrete
   • Continuous
Categorical (qualitative) data
• Consists of attributes, labels, or nonnumerical entries.
Numerical (quantitative) data
• Consists of numerical measurements or counts.
Note
• The appropriate statistical analysis depends on whether the data for the variable
are qualitative or quantitative.
• There are more options for statistical analysis when the data are quantitative.
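To make the note concrete, the sketch below summarizes one categorical and one numerical variable. The occupations and salaries are invented for illustration, and the unit (million VND per month) is an assumption.

```python
from collections import Counter
import statistics

# Hypothetical survey answers from six graduates.
occupations = ["teacher", "engineer", "teacher", "accountant", "engineer", "teacher"]
salaries = [8.5, 12.0, 9.0, 10.5, 13.0, 8.0]  # assumed unit: million VND per month

# Categorical data: we can count categories and report the most frequent one.
counts = Counter(occupations)
mode_occupation = counts.most_common(1)[0][0]

# Numerical data: arithmetic is meaningful, so more summaries are available.
mean_salary = statistics.mean(salaries)
median_salary = statistics.median(salaries)
stdev_salary = statistics.stdev(salaries)

print("Occupation counts:", dict(counts))
print("Most frequent occupation:", mode_occupation)
print(f"Mean salary: {mean_salary:.2f}, median: {median_salary:.2f}, "
      f"standard deviation: {stdev_salary:.2f}")
```

An average of the occupation labels would be meaningless, which is why the categorical variable is summarized only by counts and its most frequent value.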
Types of Data
For each of the following examples of data, determine the type:
i. The number of kilometers joggers run per week
ii. The cities/provinces in Vietnam
iii. The starting salaries of graduates of NEU
iv. The months in which a firm’s employees choose to take their vacations
v. The occupation of graduates of NEU
Levels of Measurement (Measurement Scales)
• The level of measurement determines which statistical calculations
are meaningful.
• The four scales of measurement are: nominal, ordinal, interval, and
ratio.
Levels of measurement, from lowest to highest:
Nominal → Ordinal → Interval → Ratio
Nominal Scale
• Data are labels or names used to identify an attribute of the element.
• Eg. Gender, occupation, marital status
Skin colours
Names of students in your class
Textbooks you are using this semester
• Data at the nominal scale are qualitative only.
• No mathematical computations can be made at this level.
Ordinal Scale
• The data have the properties of nominal data and the order or rank of the data is
meaningful.
• Eg. Students of a university are classified by their class standing using a
nonnumeric label such as: freshman, sophomore, junior, senior
Levels of satisfaction with life (dissatisfied, slightly dissatisfied, neutral,
slightly satisfied, satisfied)
Top 50 songs played on the radio
• Data at the ordinal scale are qualitative or quantitative.
Interval Scale
• The data have the properties of ordinal data, and the interval between
observations is expressed in terms of a fixed unit of measure.
• Data at the interval scale are quantitative only.
• Eg. Temperatures; Scores …
• A zero entry simply represents a position on a scale; the entry is not an inherent
zero, i.e., there is no natural starting point.
• The interval differences are meaningful, but we cannot defend ratio relationships.
• Eg. The difference between 10 and 20 degrees is the same as between 80 and
90 degrees, but we cannot say that 80 degrees is twice as hot as 40 degrees
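A short numerical check of this point, assuming the temperatures in the example are in degrees Celsius. Converting them to Fahrenheit keeps equal differences equal but changes the ratio, which shows that the "twice as hot" claim depends on the arbitrary zero of the scale.

```python
def celsius_to_fahrenheit(c):
    """Convert a temperature from degrees Celsius to degrees Fahrenheit."""
    return c * 9 / 5 + 32

# Ratios: 80 vs 40 degrees (assumed to be Celsius).
ratio_c = 80 / 40
ratio_f = celsius_to_fahrenheit(80) / celsius_to_fahrenheit(40)

# Differences: 10 to 20 degrees vs 80 to 90 degrees.
diff_low_f = celsius_to_fahrenheit(20) - celsius_to_fahrenheit(10)
diff_high_f = celsius_to_fahrenheit(90) - celsius_to_fahrenheit(80)

print(f"Ratio in Celsius:    {ratio_c:.3f}")
print(f"Ratio in Fahrenheit: {ratio_f:.3f}")
print(f"Differences in Fahrenheit: {diff_low_f} and {diff_high_f}")
```

The two differences remain equal after the change of unit, while the ratio does not survive it. On a ratio scale such as height, changing the unit (metres to centimetres) leaves ratios unchanged.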
Ratio Scale
• The data have all the properties of interval data and the ratio of two values is
meaningful.
• This scale must contain a zero value (a natural starting point) that indicates that
nothing exists for the variable at the zero point.
• Data at the ratio scale are quantitative only.
• Eg. Variables such as distance, height, weight, and time…
Summary of Levels of Measurement
Level of     | Put data in | Arrange data | Subtract data values | Determine if one data value is a multiple
measurement  | categories  | in order     | (differences)        | of another (a natural starting point)
Nominal      | Yes         | No           | No                   | No
Ordinal      | Yes         | Yes          | No                   | No
Interval     | Yes         | Yes          | Yes                  | No
Ratio        | Yes         | Yes          | Yes                  | Yes
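The table is cumulative: each level allows everything the level below it allows, plus one more operation. A minimal sketch of that structure follows; the function name `meaningful_operations` and the operation labels are hypothetical, introduced only for this illustration.

```python
# Levels ordered from lowest to highest, and the operation each level adds.
LEVELS = ["nominal", "ordinal", "interval", "ratio"]
OPERATIONS = ["categorize", "order", "subtract", "ratio"]

def meaningful_operations(level):
    """Return the operations that are meaningful at the given level of measurement."""
    rank = LEVELS.index(level.lower())  # raises ValueError for an unknown level
    return OPERATIONS[: rank + 1]

for level in LEVELS:
    print(f"{level:<9} {meaningful_operations(level)}")
```

For example, `meaningful_operations("interval")` returns the first three operations, matching the "Yes, Yes, Yes, No" row of the table.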
What kind of data? What kind of scale?
The placement office at a university regularly surveys the graduates 1 year after graduation
and asks for the following information. For each, determine the type of data.
a. What is your occupation?
b. What is your income?
c. What is your marital status?
d. What is the amount of your student loan?
e. How would you rate the quality of instruction? (excellent, very good, good, fair, poor)
What kind of data? What kind of scale?
• PCI questionnaire
• ..\SFBE\6.ENG_public awareness.docx
Types of Data
Data
• Cross‐sectional data
• Time‐series data
• Pooled data
Cross‐sectional Data
• Cross‐sectional data are collected at the same or approximately the same point in time.
Time‐series Data
• Time‐series data are collected over several time periods.
• They are usually collected at fixed intervals, such as daily, weekly, monthly,
quarterly, annually, etc.
• E.g. Price of stocks
GDP of Vietnam over 20 years
• Time‐series data require different analysis techniques than cross‐sectional data.
Pooled Data
• Pooled data is a mixture of time‐series data and cross‐sectional data.
• E.g. GDP per capita of all Asian countries over ten years
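The relationship between the three kinds of data can be sketched with a small pooled data set. The countries are placeholders and every figure is invented for illustration.

```python
# Hypothetical pooled data: GDP per capita (USD) by (country, year).
pooled = {
    ("Country A", 2019): 3400, ("Country A", 2020): 3500, ("Country A", 2021): 3700,
    ("Country B", 2019): 7800, ("Country B", 2020): 7200, ("Country B", 2021): 7600,
    ("Country C", 2019): 1500, ("Country C", 2020): 1550, ("Country C", 2021): 1600,
}

# Cross-sectional slice: many countries, one point in time.
cross_section_2020 = {
    country: value for (country, year), value in pooled.items() if year == 2020
}

# Time-series slice: one country, several time periods.
time_series_a = {
    year: value for (country, year), value in pooled.items() if country == "Country A"
}

print("Cross-section (2020):", cross_section_2020)
print("Time series (Country A):", time_series_a)
```

Fixing the year gives a cross-section, fixing the country gives a time series, and the full set of 3 countries over 3 years is the pooled data.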
What kind of data?
Data sources
• Based on the place of collecting information:
• Based on the method of collecting information:
Sources of secondary data
1. Internet research
2. Government data and official publications
3. Internal and by-product data
Internet research
Search the Vietcombank website (www.vietcombank.com.vn) to find the
exchange rate
Search through Google to gather information about the performance of private
firms in Vietnam since ‘Doi moi’
Government data and official publications
Internal and by‐product data
Data collected from different departments in an organisation and used together
Data from the Sales Department
Data from the Human Resources Department
Customer records
Sales reports
Inventory orders …
=> To support decision making
Sources of secondary data
For each of the following examples of data sources, determine the type:
i. An article on poverty reduction in Vietnam
ii. A report from the Department of Marketing
iii. Data from the Production Department
iv. The consumer price index (CPI)
v. Information about customers of Vin Commercial
Sources of primary data
1. Experimental study
2. Survey
3. Observational study
Experimental study
Survey
A survey is an investigation of one or more characteristics of a population.
• A census is a measurement of an entire population (collecting data for a population)
   • Ask the preference of all customers of Vietcombank
   • The 2019 Census on Population and Housing of Vietnam (all Vietnamese citizens)
• A sample survey is a measurement of part of a population (collecting data for a sample)
   • Ask the preference of some customers of Vietcombank
   • Vietnam Household Living Standard Survey 2020 (some households)
Observational study
Two branches of Statistics
Descriptive Statistics: collecting and describing data
• Collect data
• Present data
• Summarize data
Inferential Statistics: making decisions based on sample data
• Estimation
• Hypothesis testing
Descriptive Statistics
• Collect data
   • e.g., Survey
• Present data
   • e.g., Tables and graphs
• Summarize data
   • e.g., Sample mean: $\bar{X} = \frac{\sum_{i=1}^{n} X_i}{n}$
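As a small worked example of the "summarize" step, the sketch below computes the sample mean of the ten student heights listed earlier under Variables & Data. The heights are taken to be in metres, which is an assumption, since the slide does not state the unit.

```python
import statistics

# Heights of 10 female students, as listed under "Variables & Data"
# (unit assumed to be metres).
heights = [1.6, 1.7, 1.55, 1.59, 1.5, 1.58, 1.64, 1.67, 1.58, 1.55]

n = len(heights)
sample_mean = sum(heights) / n  # sum of the values divided by the sample size

print(f"n = {n}")
print(f"Sample mean height = {sample_mean:.3f}")
print(f"Smallest = {min(heights)}, largest = {max(heights)}")

# The standard library gives the same result.
assert abs(sample_mean - statistics.mean(heights)) < 1e-9
```

The ten values sum to 15.96, so the sample mean is 1.596.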
Descriptive Statistics, Example
• According to the Bureau of the Census, there are 2.2 million U.S. households
with a single father and one or more children younger than 18.
• There have been 82 confirmed or suspected suicides among active‐duty service
personnel this year, compared to 51 for the same period in 2018.
• The number of mutual funds peaked at 8305 in 2001, but the combination
of bear markets and mergers and acquisitions has driven the number of funds
down to 8011.
• Since March 4, 2009, there have been 190,000 mortgage modifications through
President Obama’s relief plan; 396,724 homes in payment default; and 607,974
homes in either foreclosure or auction proceedings.
Inferential Statistics
• Inferential Statistics uses data that have been collected from a small group
(sample) to draw conclusions about a larger group (population).
• Because a sample is typically only a part of the whole population, sample data
provide only limited information about the population. As a result, sample
statistics are generally imperfect representatives of the corresponding population
parameters.
Inferential Statistics
• Estimation
   • e.g., Estimate the population mean weight using the sample mean weight
• Hypothesis testing
   • e.g., Test the claim that the population mean weight is 70 kg
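Both ideas can be sketched on a simulated sample. This is an illustration under stated assumptions, not a prescribed method: the sample of 100 weights is simulated, the 95% confidence level and 5% significance level are chosen for the example, and the normal (z) approximation is used because the sample is large. For small samples a t distribution would be the usual choice.

```python
import math
import random
import statistics
from statistics import NormalDist

random.seed(7)  # fixed seed so the illustration is reproducible

# Hypothetical sample: weights (kg) of 100 people, simulated for illustration.
sample = [random.gauss(71, 8) for _ in range(100)]

n = len(sample)
x_bar = statistics.mean(sample)          # point estimate of the population mean
s = statistics.stdev(sample)             # sample standard deviation
standard_error = s / math.sqrt(n)

# Estimation: approximate 95% confidence interval for the population mean.
z_crit = NormalDist().inv_cdf(0.975)
ci_low = x_bar - z_crit * standard_error
ci_high = x_bar + z_crit * standard_error

# Hypothesis testing: H0: mu = 70 against H1: mu != 70 (two-sided).
mu_0 = 70
z_stat = (x_bar - mu_0) / standard_error
p_value = 2 * (1 - NormalDist().cdf(abs(z_stat)))
reject_h0 = p_value < 0.05

print(f"Sample mean: {x_bar:.2f} kg")
print(f"95% confidence interval: ({ci_low:.2f}, {ci_high:.2f})")
print(f"z = {z_stat:.2f}, p-value = {p_value:.3f}, reject H0: {reject_h0}")
```

The two results are linked: at these levels, the claim of 70 kg is rejected exactly when 70 falls outside the 95% confidence interval.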
Inferential Statistics, Example
• In observing a sample of nurses and other healthcare workers who were likely infected
with the swine flu, researchers found that only half routinely wore gloves when dealing
with patients.
• In a Zagat survey of diners, Outback Steakhouse had the top‐rated steaks in the full‐
service restaurant category.
• Survey results revealed that 26% of thirsty golfers order a sports drink when they finish
their round and head for the clubhouse.
• In a survey of U.S. motorists, 33% said their favorite American roadside store was South
of the Border, in South Carolina.
Descriptive statistics or inferential statistics?
• Example:
In a recent study, volunteers who had less than 6 hours of sleep were four times
more likely to answer incorrectly on a science test than were participants who had
at least 8 hours of sleep. Decide which part is the descriptive statistic and what
conclusion might be drawn using inferential statistics.
Descriptive statistics or inferential statistics?
A recent study examined the math and verbal SAT scores of high school seniors
across the country. Which of the following statements are descriptive in nature
and which are inferential?
• The mean math SAT score was 492.
• The mean verbal SAT score was 475.
• Students in the Northeast scored higher in math but lower in verbal.
• 80% of all students taking the exam were headed for college.
• 32% of the students scored above 610 on the verbal SAT.
• The math SAT scores are higher than they were 10 years ago.
Designing a Statistical Study
GUIDELINES
1. Identify the variable(s) of interest (the focus) and the population of the study.
2. Develop a detailed plan for collecting data. If you use a sample, make sure the
sample is representative of the population.
3. Collect the data.
4. Describe the data.
5. Interpret the data and make decisions about the population using inferential
statistics.
6. Identify any possible errors.
Data analysis using SPSS
• SPSS stands for “Statistical Package for the Social Sciences” and was first launched in
1968.
• Since SPSS was acquired by IBM in 2009, it's officially known as IBM SPSS
Statistics but most users still just refer to it as “SPSS”.
• SPSS is software for editing and analyzing all sorts of data.
• SPSS is used by market researchers, health researchers, survey companies,
government entities, education researchers, marketing organizations, data
miners, and many more for the processing and analyzing of survey data.
SPSS window
• Data View: Used to display data
   • Columns represent variables
   • Rows represent individual units or groups of units that share common values of variables
• Variable View: Used to display information on variables in the dataset
• Output View: Displays results of analyses and graphs
Enter data in SPSS directly
FILE/OPEN/DATA
Set File name
Files of type: SPSS Statistics (*.sav)
Data View
Under Data View:
• Columns: variables
• Rows: cases
Enter variables
1. Click this window
2. Type the variable name
3. Type: numeric or string…
4. Description of the variable
NOTE: The first character of the variable name must be alphabetic.
Variable names must be unique and at most 64 characters long.
Spaces are NOT allowed.
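Before typing variable names into SPSS, the naming rules in the note can be checked with a short script. This is a sketch, not part of SPSS: the function `check_variable_name` is hypothetical, the length limit is taken as at most 64 characters (adjust `max_length` if your version differs), and uniqueness is assumed to ignore upper and lower case.

```python
def check_variable_name(name, existing_names, max_length=64):
    """Return a list of rule violations; an empty list means the name passes."""
    problems = []
    if not name or not name[0].isalpha():
        problems.append("first character must be alphabetic")
    if " " in name:
        problems.append("spaces are not allowed")
    if len(name) > max_length:
        problems.append(f"name is longer than {max_length} characters")
    # Assumption: names that differ only by case count as duplicates.
    if name.lower() in {existing.lower() for existing in existing_names}:
        problems.append("name must be unique")
    return problems

already_defined = ["age", "gender"]
for candidate in ["income", "1st_score", "monthly income", "Age"]:
    issues = check_variable_name(candidate, already_defined)
    print(f"{candidate!r}: {'OK' if not issues else '; '.join(issues)}")
```

Running the check against the code book before data entry catches names that SPSS would refuse, such as names starting with a digit or containing spaces.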
Enter variable
Based on your code book!
Enter cases
Import data from Excel
FILE/OPEN/DATA
Files of type: Excel
Select the file you want to import
Open Excel files in SPSS
Save this file as SPSS data
Summary
Understand what Statistics is
Distinguish population and sample
Describe variables and data
Distinguish types of data
   Categorical data
   Numerical data
Distinguish scales of measurement
Understand different sources of data
Distinguish two branches of statistics