STATISTICS 8096 BBMM Trimester II

L5 Organizing data I
Organizing categorical and numerical variables

Dr. AHMAD M KHALID


This is a copyrighted material owned by Dr. Ahmad Mohd
Khalid. Do not share and post.
Organizing Data
• Data is organised so that it can be visualised and analysed
• The way we organise depends on the type of variable: categorical
and numerical
• Tools for organising categorical variables – summary table &
contingency table
• Tools for organising numerical variables – frequency distribution,
cumulative distribution

Categorical Data
• For a single categorical variable a summary table is used
• For organising data from 2 or more categorical variables, a
contingency table is used

Summary Table
A summary table presents tallied responses as frequencies or percentages
for each category.

https://www.ibef.org/industry/india-automobiles/infographic

https://www.idc.com/getdoc.jsp?containerId=prAP52129424
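
A summary table can be sketched in a few lines of Python. This is only an illustration: the `responses` list and the `summary_table` helper are hypothetical and do not come from the lecture or the linked infographics.

```python
from collections import Counter

# Hypothetical responses for a single categorical variable (illustration only).
responses = ["Car", "Bike", "Car", "Bus", "Car", "Bike", "Bus", "Car"]

def summary_table(values):
    """Return {category: (frequency, percentage)} for a list of categories."""
    counts = Counter(values)          # tally the responses per category
    total = len(values)
    return {cat: (n, 100 * n / total) for cat, n in counts.items()}

table = summary_table(responses)
for cat, (n, pct) in sorted(table.items()):
    print(f"{cat:<6}{n:>4}{pct:>8.1f}%")
```

Each row holds one category with its frequency and its share of all responses, which is the layout a summary table uses.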

Contingency Table
A contingency table allows you to study patterns that may exist between the responses
of two or more categorical variables

https://datatab.net/tutorial/cross-table
https://statisticsbyjim.com/probability/contingency-tables-probabilities/
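
As a rough sketch, a contingency table counts how often each combination of two categories occurs and usually reports row and column totals as well. The `pairs` data and the `contingency_table` helper below are assumptions made for illustration, not material from the lecture.

```python
from collections import Counter

# Hypothetical (region, owns_car) responses; not taken from the lecture.
pairs = [
    ("Urban", "Yes"), ("Urban", "No"), ("Rural", "Yes"),
    ("Urban", "Yes"), ("Rural", "No"), ("Rural", "No"),
]

def contingency_table(observations):
    """Count each (row, column) combination and add row and column totals."""
    counts = Counter(observations)
    rows = sorted({r for r, _ in observations})
    cols = sorted({c for _, c in observations})
    table = {r: {c: counts[(r, c)] for c in cols} for r in rows}
    row_totals = {r: sum(table[r].values()) for r in rows}
    col_totals = {c: sum(table[r][c] for r in rows) for c in cols}
    return table, row_totals, col_totals

table, row_totals, col_totals = contingency_table(pairs)
for row, cells in table.items():
    print(row, cells, "total:", row_totals[row])
print("column totals:", col_totals)
```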

Numerical Data
• Numerical data is organized by creating ordered arrays or
distributions
• Examples include frequency distribution and cumulative
distribution

Frequency Distribution
A frequency distribution summarizes numerical values by tallying them into a set of numerically
ordered classes

Steps to construct a frequency distribution

1. Find the Range
2. Decide on the Number of Groups (Classes)
3. Calculate the Class Width
4. Create the Groups (Classes)
5. Count the Frequencies
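
The five steps above can be sketched in Python as follows. The data values, the choice of four classes, and the `frequency_distribution` helper are all assumptions for illustration. Rounding the class width up to a whole number is one common convention, not necessarily the one used in the lecture.

```python
import math

# Hypothetical numerical data; the number of classes is an assumed choice.
data = [22, 35, 41, 48, 53, 27, 39, 44, 58, 61, 33, 46]
num_classes = 4                                    # step 2

def frequency_distribution(values, k):
    """Return (classes, frequencies) for k equal-width classes [lower, upper)."""
    lo, hi = min(values), max(values)
    value_range = hi - lo                          # step 1: range
    width = max(1, math.ceil(value_range / k))     # step 3: class width, rounded up
    # step 4: build the class boundaries
    classes = [(lo + i * width, lo + (i + 1) * width) for i in range(k)]
    freqs = [0] * k
    for v in values:                               # step 5: count the frequencies
        # Clamp the index so the maximum value always lands in the last class.
        idx = min((v - lo) // width, k - 1)
        freqs[idx] += 1
    return classes, freqs

classes, freqs = frequency_distribution(data, num_classes)
for (lower, upper), f in zip(classes, freqs):
    print(f"{lower} to under {upper}: {f}")
```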

Relative Frequency and Percentage Distribution
In case of two or more groups, the proportion or percentage in each class is sometimes more meaningful than the raw frequencies.

Steps to calculate relative frequency

1. Add up the frequencies of all classes to get the total frequency
2. Divide the frequency of each class by the total frequency

Steps to calculate percentage distribution

1. Express each relative frequency as a decimal
2. Multiply the decimal values by 100 to get percentages

Problem Illustration
Frequency Distributions of the Cost per Meal for 50 City Restaurants and 50 Suburban Restaurants

Source: Berenson et al. Chapter 2. Basic Business Statistics: Concepts and Applications.
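
A minimal sketch of both calculations is given below. The class frequencies are invented for illustration; only the group sizes of 50 match the problem illustration, and the counts are not the textbook's values. The helper names are hypothetical too.

```python
# Invented class frequencies for two groups of 50 (not the textbook's data).
city = [3, 12, 20, 10, 5]
suburban = [8, 17, 15, 7, 3]

def relative_frequencies(freqs):
    """Divide each class frequency by the total frequency."""
    total = sum(freqs)
    return [f / total for f in freqs]

def percentages(freqs):
    """Express each class frequency as a percentage of the total."""
    total = sum(freqs)
    return [100 * f / total for f in freqs]

city_rel = relative_frequencies(city)
city_pct = percentages(city)
suburban_pct = percentages(suburban)

for c, s in zip(city_pct, suburban_pct):
    print(f"city {c:5.1f}%   suburban {s:5.1f}%")
```

Putting the two percentage columns side by side makes the groups directly comparable class by class.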

Cumulative Percentage Distribution
The cumulative percentage distribution provides a way of presenting information about the
percentage of values that are less than a specific amount
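
One way to sketch this: accumulate the class frequencies and report, for each class boundary, the percentage of values that fall below it. The boundaries, the frequencies, and the `cumulative_percentages` helper are hypothetical and chosen only for illustration.

```python
from itertools import accumulate

# Hypothetical class boundaries: class i covers boundaries[i] up to (not including) boundaries[i + 1].
boundaries = [20, 30, 40, 50, 60, 70]
freqs = [3, 12, 20, 10, 5]          # invented frequencies, one per class

def cumulative_percentages(class_freqs):
    """Percentage of values below each boundary, starting at 0% for the lowest."""
    total = sum(class_freqs)
    running = accumulate(class_freqs)            # running totals of the frequencies
    return [0.0] + [100 * c / total for c in running]

cum_pct = cumulative_percentages(freqs)
for bound, pct in zip(boundaries, cum_pct):
    print(f"less than {bound}: {pct:.1f}%")
```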

Source: Berenson et al. Chapter 2. Basic Business Statistics: Concepts and Applications.
Cross Tabulation of Data using two or more categorical variables
• Used to analyze the relationship between two or more categorical variables by organizing data into a
contingency table
• Helps to identify patterns, trends, and associations that may not be apparent from the raw data

Problem Illustration
Mr. Charan opened a sports and fitness center and conducted a survey of his 148 regular customers to assess their satisfaction with his products and services. He focused on gender and satisfaction level.

Create a cross-tabulation table to visualize the relationship between gender and customer satisfaction.

• Total satisfied customers: 86 (58% of respondents)
• Total unsatisfied customers: 30 (20% of respondents)
• Total unsure customers: 32 (22% of respondents)

Problem Illustration

Respondent ID   Exercise Frequency   Dietary Preference
1               Daily                Vegetarian
2               Occasionally         Non-Vegetarian
3               Rarely               Vegetarian
4               Daily                Non-Vegetarian
5               Occasionally         Non-Vegetarian
6               Rarely               Vegetarian

Q: Use cross-tabulation to explain the dietary preference and exercise pattern of respondents
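
One possible way to tabulate the six responses above is sketched below. The data is copied from the table. The variable names and the row and column ordering are assumptions made for illustration.

```python
from collections import Counter

# (exercise frequency, dietary preference) for respondents 1 to 6, from the table above.
survey = [
    ("Daily", "Vegetarian"),
    ("Occasionally", "Non-Vegetarian"),
    ("Rarely", "Vegetarian"),
    ("Daily", "Non-Vegetarian"),
    ("Occasionally", "Non-Vegetarian"),
    ("Rarely", "Vegetarian"),
]

rows = ["Daily", "Occasionally", "Rarely"]       # exercise frequency
cols = ["Vegetarian", "Non-Vegetarian"]          # dietary preference

counts = Counter(survey)
crosstab = {r: {c: counts[(r, c)] for c in cols} for r in rows}

print(f"{'':<14}{cols[0]:>12}{cols[1]:>16}")
for r in rows:
    print(f"{r:<14}{crosstab[r][cols[0]]:>12}{crosstab[r][cols[1]]:>16}")
```

In this sample, both respondents who rarely exercise are vegetarian, both who exercise occasionally are non-vegetarian, and the daily exercisers are split evenly. With only six respondents, this is a description of the sample rather than evidence of a general pattern.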
