QT Summary Document 1
QT Summary Document 1
Techniques
Diving into Statistics
Session Data and
Statistics
Segment Introduction to Data
and Statistics
HOW DATA HAS EVOLVED - A TIMELINE
Ancient
1800s 1990s
World
Ishango bone – tally stick Herman Hollerith’s punch Sir Tim Berners Lee
for numerical records. card machine used to creates hyperlink for
Writing and libraries process US census data . data sharing.
evolve. Internet invented.
Google search launched.
What is Statistics?
01
Collection, analysis,
interpretation and presentation
of numerical data
How do we classify Statistics?
02
Population – large collection of
persons, objects or items
Viewed shows/movies
Language Artists
Location Genre
Viewing
History
TYPES OF STATISTICAL ANALYSIS
Descriptive Inferential
Car Manufacturing
01 Calculating the time required to finish
manufacturing a car based on extensive factory
floor data.
Cricket
02 Concluding the “Man of the Match” post-game
after studying the performance of all players
Marathons
03 Concluding the “Man of the Match” post-game
after studying the performance of all players
Healthcare
04 Deducing the average time taken by a
participant to complete a marathon after all
participants have crossed the finish line
Inferential Statistics
Inferential statistics refers to the process of
selecting a representative sample to draw out
inference for the whole population.
The conclusions or decisions are made using a
portion of the population from the universe.
INFERENTIAL ANALYSIS IN EVERYDAY LIFE
Car Manufacturing
01 Estimating the time taken to manufacture based
on a sample of 50 cars from a total of 5000 cars
Cricket
02 Guessing the “Man of the Match” in the game
of cricket based on the past performance of
the players
Marathons
03 Predicting the average time to finish a
Marathon based on the finish times of 30
random runners
Healthcare
04 Estimating the effectiveness of immunisation
by studying a random sample of 100 children
for a state
APPLICATIONS OF STATISTICS
Column/ Variable
Row/
Observation
Data
TYPES OF DATA
Data
Quantitative Qualitative
Quantitative Qualitative
Examples:
Number of iPads sold
2 Number of complaints at call center
Number of employees
3
QUANTITATIVE - CONTINUOUS
Examples:
Percentage return on a stock
2 Glucose quantity in blood
Infant mortality rate
QUALITATIVE - NOMINAL
Examples:
Gender
2 Colour
City
QUALITATIVE - ORDINAL
Examples:
Education levels (BS, MS, PhD)
2 Customer rating of item (Poor, Avg, Good)
Mutual fund risk ratings
DICRETE, CONTINUOUS, NOMINAL AND ORDINAL DATA
First Level
1 Nominal Level of Measurement
Second Level
2 Ordinal Level of Measurement
Third Level
3 Interval Level of Measurement
Fourth Level
4 Ratio Level of Measurement
LEVELS OF DATA MEASUREMENT
Nominal Measurement
Purpose
To classify, name or categorise data
Meaning
It does not denote any order of classification. Used only as a label
to differentiate one from another
Example
Which of the following employment classifications best
describes your area of work?
⚫ 1. Educator 2. Construction worker 3. Manufacturing
worker 4. Lawyer 5. Doctor 6. Other
⚫ We can only interpret the name
⚫ Name: Profession
LEVELS OF DATA MEASUREMENT
Ordinal Measurement
Purpose
To rank or order data
Meaning
It denotes the superiority or inferiority among data but does not
denote the amount of the same.
Example
This online training is:
⚫ 1 - not helpful
⚫ 2 - somewhat helpful
⚫ 3 - moderately helpful
⚫ 4 - very helpful
We can interpret the name and rank
Name: Whether or not helpful; Rank: Ascending order from not
helpful to very helpful
LEVELS OF DATA MEASUREMENT
Interval Measurement
Purpose
To categorise and rank order data with equal intervals
Meaning
Gives meaning to distances between consecutive numbers
Example
Numbers on a temperature scale : 10 Degree Celsius
We can interpret the name, rank and distance between two values
Name: Degree Celsius
Rank: 100o is hotter than 10o
Distance: Equal interval between 2-4 degrees and 8-10 degrees.
LEVELS OF DATA MEASUREMENT
Ratio Measurement
Purpose
To derive relation between different parts of data
Meaning
The ratio of two numbers is meaningful and denotes the relation
Example
Weight: 80 pounds
We can interpret the name, rank, distance and it makes sense as a ratio.
Name: Pound
Rank: 80 pounds is heavier than 40 pounds
Distance: Intervals between 20-40 pounds and 60-80 pounds is equal
Absolute zero: Ratios of two weights is meaningful
Segment Visualising Data
AN ANALYSIS OF DIGITAL USAGE IN INDIA
0.8
0.0
Population Mobile Connections
01 02 03
The value halfway across The proportion of the total The running total of
the class interval and can be frequency that is in any frequencies through the
calculated as the average of given class interval in a classes of a frequency
the two class endpoints. frequency distribution. distribution.
The representative value for The individual class Frequency for present class
each class in most group frequency divided by the interval added to the
statistics calculations. total frequency. preceding cumulative total.
80
Two-dimensional graph plot of pairs of
1 points from two numerical variables
60
40
Shows the extent of relation between
2 variables
20
0
0 1 2 3 4 5
India Pakistan
10 20
20 30
70 70
40 50
GRAPHICAL REPRESENTATION
Pie Charts
India
Circular depiction of data where the area of
1 the whole pie represents 100% of the data
and slices of the pie represent a percentage
breakdown of the sublevels.
1980
0 25 50 75 100
Australia Pakistan India
India Pakistan Australia
1980 10 20 30
1990 20 30 90
2000 70 70 50
2010 40 50 60
GRAPHICAL REPRESENTATION
Histogram