0% found this document useful (0 votes)

107 views73 pages

Statistics Introduction: Dr. Sudeep Mallick

Statistics can help analyze and make sense of data to support decision making. Descriptive statistics summarize and describe data through tables, charts, and summary calculations. Inferential statistics are used to predict unknown population parameters, test hypotheses, and generalize samples to populations. The document discusses using statistics to design an attractive cell phone plan for students, including collecting call data, analyzing descriptive statistics, testing hypotheses, and predicting behavior through techniques like regression and ANOVA. It also outlines applying statistics in business functions like marketing, finance, HR, and operations.

Uploaded by

VishalRathore

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

107 views73 pages

Statistics Introduction: Dr. Sudeep Mallick

Uploaded by

VishalRathore

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 73

Statistics Introduction

Dr. Sudeep Mallick

Why statistics?
Decision making is often based on
analysis of data.
Statistics helps you to make sense of the
data by using tools that summarize,
present and analyze the data
Decision maker can also ascertain the
confidence in the decisions.

Types of Decisions
Analyze: effect of the variable
Predict: relationship between two
variables
Predict: likely outcome
Observe: trend
Generalize: about population at large

Cell Phone Scheme

How to make an attractive plan for IMI, Cal students
What data to collect
Number of calls made
Duration of calls made
Time of call
Amount of data usage

How does the data look and feel (descriptive statistics)

Can the IMI, Cal student data be used for predicting behavior of other
colleges? (estimation and hypothesis testing)
What would be the confidence in the prediction (probability)
What is the use of existing database?
Can the previous years data hold for this year too? (hypothesis testing)
Is the call rate for students of IMI Cal and IMI Delhi similar (ANOVA)
Does the call rate distribution follow a normal distribution (chi-square)
Can we use the age of the user group to predict the average call duration
during peak hours of students (regression and correlation)

Use of Statistics in Business Functions

Finance

HR metrics
visualization
HR Policy
effectiveness
analysis
Comparison
of attrition
rates with
industry
averages

Statistical
models of
portfolio
management
(BlackScholes
model uses
normal
distribution)
Financial
modelling
using
probabilities

Marketing
Marketing
research
CRM and
analytics

Operations
Six sigma
SPC
Quality
Management

Examples
How many newspapers should the vendor stock
to maximize revenue?
Depends on the probability distribution of demand and
expected profit

Are two or more market segments significantly

different?
Hypothesis testing

What proportion of people are happy with the

Sixth-pay commission report?
Parameter estimation

Business Research Methods

Statistics course lays the foundation
Business Research Methods of Research
Methodology courses

Subdivision within Statistics

Descriptive Statistics
Collect
Organize
Summarize
Display
Analyze

Inferential Statistics
Predict and forecast
values of population
parameters
Test hypotheses about
values of population
parameters
Make decisions

Descriptive Statistics
Graphical statistics / Visualization
pictures
Picture is worth a thousand words

Summary statistics numbers

Simplify information
Use single number to describe characteristics
of a data set

Visualization

Types of Data Variables

Variable - A variable is any measured characteristic or attribute that differs for

different subjects e.g. height of a building, eye colour.
Qualitative (or categorical) Descriptive variable measuring a particular
characteristic (e.g. eye colour) or the variable can be ranked (e.g. finished
first, fourth etc.)
Quantitative A numerical variable measured on two scales (interval/ratio)
Nominal Assigning items to categories e.g. number of people with blue
eyes. Frequency distributions are usually used to tabulate and analyse
problems involving nominal data.
Ordinal A set of data is said to be ordinal if the values belonging to it can be
ranked
Interval - An interval scale is a scale of measurement where the distance
between any two adjacent units of measurement (or intervals) is the same
but the zero point is arbitrary
Ratio - Ratio data are continuous data where both differences and ratios are
interpretable and have a natural zero

Measurement Scale Examples

Measurement
Scale
Nominal data

Ordinal data

Interval data

Ratio data

Recognising a measure scale

1. Classification data e.g. male or female, red or
black car.
2. Arbitrary labels e.g. m or f, r or b, 0 or 1.
3. No ordering e.g. it makes no sense to state that
r > b.
1. Ordered list e.g. student satisfaction scale of 1,
2, 3, 4, and 5.
2. Differences between values are not important
e.g. political parties can be given labels: far
left, left, mid, right, far right etc. and student
satisfaction scale of 1, 2, 3, 4, and 5.
1. Ordered, constant scale, with no natural zero
e.g. temperature, dates, psychological scales,
etc.
2. One unit on the scale represents same
magnitude across the whole range of the scale
3. Differences make sense, but ratios do not e.g.
temperature difference
1. Ordered, constant scale, and a natural zero e.g.

Math or Statistics allowed for

various scales
Nominal frequency distribution (f.d.) mode
Ordinal f.d., median, mode
Interval f.d., mean, median, mode, SD,
variance
Ratio all as for interval scale in addition to
geometric mean, harmonic mean, coefficient of
variation and many other statistical measures
involving ratios

the number of pound lost during a six-week diet

ratio
the proportion of weight lost during a six-week diet
ratio
the heart rate of the participant
ratio
the percent shift in heart rate over baseline during an emotionally demanding task
ratio
the percent of errors made on a classification task
ratio
the number of false alarm responses in a monitoring task
ratio
the types of gramatical errors made in a writing sample
nominal
one's ice cream preference
nominal
how quickly a person gives up on an impossible task that looks like it should be possibl
ratio
a student's SAT score
Interval
the religious group that one affiliates with
nominal

the percentile rank from an achievement test

ordinal
the type of categorization errors in a sorting task
nominal
the age at which one went on his or her first date
ratio
the number of children in your family
ratio
the score on an anxiety sensitivity scale
interval
whether one has a pet (yes/no)
Nominal
Whether one has a pet (0 for none, 1 for non-zero)
Ordinal
Number of pets
Ratio
the rank of a person's salary within the company
Ordinal
the square footage of each participant's house or apartment
Ratio
the number of frustrated comments made during a project
assignment
ratio

Frequency Distribution

Example - Frequency Distribution

The following are the departure delay in minutes of 52 flights
selected at random from a particular airport.
10

Grouped Frequency Distribution

When there is a wider variety of data
points
Usually create 5 12 classes in the
grouped frequency distribution
Class width = LCB(k) LCB(k+1) =
UCB(k) UCB(k+1)

(Largest value 1) - smallest value

Class width
Number of Classes

Frequency distribution
Delay in
minutes

Frequency

Relative
frequency

015

0.286

15 - 30

0.190

30 45

0.143

45 60

0.333

0.048

60 or more
Total

Graphical Representation of
Data
The next stage of analysis
after the data has been
tabulated is to graph the
data using a variety of
methods to provide a
suitable graph. In this
section we will explore:
1.
2.
3.
4.
5.
6.

Bar charts
Pie charts
Histograms
Frequency polygons
Scatter plots
Time series plots

The type of graph you will use to graph the

data depends upon the type of variable you
are dealing with within your data set e.g.
category (or nominal), ordinal, or interval (or
ratio) data as follows:
Data type
Which graph to use?
Category Bar chart, pie chart, cross tab
or
tables (or contingency tables)
nominal
Ordinal
Bar chart, pie chart, scatter
plots.
Interval or Histogram, frequency polygon,
ratio
histogram.
Cumulative frequency curve (or
ogive), scatter plots, time series
plots.

Histogram
A graph of the data in a frequency distribution is called a
histogram. The area of each bar is a measure of the
frequency of occurrence (number of values) within each
category. If the bar widths are the same (constant) then
the height of the bar is directly related to the frequency
and this information can then be used to construct the
histogram.

Frequency distribution- histogram

Frequency Histogram
16
14
12
10
Frequency - absolute numbers

8
6
4
2
0

0-15

15-30

30-45
Delay in Minutes

45-60

60 or more

Relative frequency Histogram

Relative frequency histogram
0.35
0.3
0.25
0.2
Relative frequency - fraction/percent
0.15
0.1
0.05
0

0-15

15-30

30-45
Delay in Minutes

45-60

60 or more

Bar Chart
Party

Frequency

Proposed voting behaviour

Frequency

600

Conservative

400

500

Labour

510

300

Democrat

Green

Other

400
200

Frequency

100
0

Party

Horizontal Bar Chart

Month
January
February
March
April
May
June

Pink
5200
4100
6000
6900
6050
7000

Blue
2100
1050
2950
5000
6300
5200

M
o
n
t
h

Half yearly car sales

June
May
April

Blue
Pink

March
February
January
0

2000

4000

6000

8000 10000 12000 14000

Number of cars

Pie Chart

Frequency Polygon
A frequency polygon is formed from a histogram by
joining the mid-points of the tops of the rectangles by
straight lines. The mid-points of the first and last class
are joined to the x-axis to either side at a distance equal
to (1/2)th the class interval of the first and last class.

Note on Class Boundary Styles

Class (inclusive) Frequency

0 to 10

0 - 10

11 to 20

11 - 20

If the next data item is 10 it goes to the first class, if it is 11 it goes

to the next class
The above structure is EXACTLY SAME as the one below
Class (UCB
Frequency
Class (UCB
excluded)
excluded)

Frequency

0 to less than 11

0 - 11

11 to less than 21

11 - 21

Both the class structures are equivalent, none is better than the other.
It is just a matter and style and taste which one to adopt.
Now less than 11 implies either 10, or 10.50 or 10.9 or 10.99 or 10.999
depending upon the nature of data

Note on Class Boundary Styles

Class (UCB
excluded)

Frequency

0 to less than 11

Class (UCB
excluded)

Frequency

11 to less than 21

0 - 11

Class (UCB
included)

Frequency

11 - 21

0 up to 11

11 up to 21

Problem with this structure is that it is not

immediately clear if the overlapping boundary is
included in the upper or the lower class. A
convention has to be followed.
Often the convention is that the UCB is not
included in the class. That is it means 0 to less
than 11, 11 to less than 21, etc.
This provides an advantage for cases of decimal
data.
Example for a data point such as 10.97 we know
that it lies in the class (0 - 11)

Note on Class Boundary Styles

Class (inclusive) Frequency
0 - 10

11 - 20

2
Problem with this structure is that in case of
decimal data we would need to modify
boundary so that there are no gaps.
Example for a data point such as 10.33 we
would need to modify boundary such that it
has precision of 2 decimal places

Class (inclusive) Frequency

0 10.50

10.51 - 20

Ogive
Cumulative frequency distribution
Less than
More than

Cumulative Frequency
Ungrouped Data
X

More than
(X)

c.f.

Less
than (X)

c.f

14-6=8

0+6=6

8-0=8

6+0=6

8-1=7

6+1=7

7-4=3

7+4=11

Total = 14 7

3-3=0

11+3=14

Cumulative Frequency
Grouped Data
X

More than
(X)

c.f.

Less
than (X)

c.f

1-10

11-20

14-6=8

0+6=6

21-30

8-0=8

6+0=6

31-40

8-1=7

6+1=7

41-50

7-4=3

7+4=11

Total = 14 50

3-3=0

11+3=14

(Extra class needed here)

Ogive Example

Cumulative Frequency
Distribution
Helps answer less than, more than type questions
with ease
Helps create cumulative probability distribution which
answers cut-off probability questions

Exercise
Analysing class marks
Working with EXCEL/SPSS
Choosing appropriate class boundaries
Experimenting with class boundaries

Cross tabulation

A joint frequency distribution of two variables (e.g. nature of airline, delay in

minutes)

Scatter Plot
Shows relationship between two variables

More
Pivot Tables of EXCEL
Visualization software such as Tableau

Visualization

Descriptive statistics Summary Statistics

Summary Statistics
Measure of central tendency
Measure of dispersion
Measure of shape

Summary Statistics

Measures of Central Tendency

Arithmetic Mean
Median
Mode
Percentiles
Quartiles

Arithmetic mean
The mean of a data set is the average
of all the data values.
xi
x
n
xi

Sample mean

Population mean

Mean example
Average delay in flight departure

Pros:

1354/42 = 32.2381 minutes

Makes use of full data

Cons:
Affected by extreme values
Good for only symmetrical distribution
Excel Function Method
Mean = Cell E12 Formula:=AVERAGE(B4:B16)=56.4615

Mean
General formula

f X
X
f

For grouped data, X is the class mid-point

Class mid-point = LCB + (class-width/2)

Weighted Average
Example - Calculation of CGPA

Median
It is the middle item in a data set that is
arranged in ascending/descending order
If there are n observations then the
Median = (n+1)/2 th observation.
computation rule
if n is odd then (n+1)/2 is an integer

if n is even then use average of n/2 and n/2 +1 th

observation
Excel Function Method
Mean = Cell E13 Formula:=MEDIAN(B4:B16)=53

Example
Sorted 42
observations
median is average of
21st and 22nd
observation
= (34+38)/2
= 36

Median for Grouped Data

Compute Cumulative frequency
Find median class holding the median element using
(n+1)/2 formula
Use formula:
(Levin)
(Davis Pecar)
L = LCB of median class
C = median class width
F = cumulative frequency before median class
f = frequency within median class

Median
Not affected by extreme values
Does not use full data
Good measure of central tendency for
non-symmetrical data distribution
(skewed)

Mode
Mode is the highest occurring observation
mode in the example is 0
The greatest frequency can occur at two or more
different values.
If the data have exactly two modes, the data are
bimodal.
If the data have more than two modes, the data are
multimodal.
Excel Function Method
Mode = Cell E14 Formula:=MODE(A5:A17)=52

Mode for Grouped Data

L = LCB of the modal class

f0 = frequency of the class below the modal class
f1 = frequency of the modal class
f2 = frequency of the class above the modal class
C = modal class width

Percentiles and Quartiles

Given any set of ordered numerical

observations

nth percentile means n percent of data are equal

or below that value.
Quartiles divide the data into 4 parts (so there are
3 quartiles)

Position of percentile = (n+1)P/100

EXCEL may give you slightly different
values than manual calculation

Quartiles

Quartiles are special names to percentiles

Q1 = 25th percentile
Q2 = 50th percentile = median
Q3 = 75th percentile

Percentile and Quartile

Grouped Data
Percentile P value

L = LCB of percentile class

C = percentile class width
F = cumulative frequency before percentile class
f = frequency within percentile class
P = nth position in 100

Percentile and Quartile

Excel Function Method
25th Percentile = Cell E15 Formula:=PERCENTILE.INC(B4:B16,0.25)=48
First Quartile = Cell E16 Formula:=QUARTILE.INC(B4:B16,1)=48
Second Quartile = Cell E17 Formula:=QUARTILE.INC(B4:B16,2)=53
Third Quartile = Cell E18 Formula:=QUARTILE.INC(B4:B16,3)=60

Measures of Variability

Range
Interquartile Range
Variance
Standard Deviation
Coefficient of Variation

Range
The range of a data set is the difference between the
largest and smallest data values.
It is the simplest measure of variability.
It is very sensitive to the smallest and largest data
values.
Example from airline delay data
Range = 95 0 = 95 minutes

Excel Function Method

Range = Cell F13 Formula:=MAX(B4:B16)-MIN(B4:B16)=71

Interquartile range
The interquartile range of a data set is the
difference between the third quartile and the first
quartile.
It is the range for the middle 50% of the data.
It overcomes the sensitivity to extreme data
values.
Excel Function Method
Q1 = Cell F14 Formula:=QUARTILE.INC (B4:B16,1)
Q3 = Cell F16 Formula:= QUARTILE.INC(B4:B16,3)
QR = Cell F17 Formula:= F16-F14
SIQR = Cell F18 Formula:=(F16-F14)/2

Variance
The variance is a measure of variability
that utilizes all the data.
It is based on the difference between the
value of each observation (xi) and the
mean (x for a sample, for a population).
2
2 ( xi )

N

< - Population variance

Sample variance - >

2
(
x

x
)

i
s2
n 1

Variance

X X

Variance
f

Variance

Excel Function Method

varp = Cell F20 Formula:=VAR.P(B4:B16)
sdp= Cell F21 Formula:=STDEV.P(B4:B16)

2
X

( X )2

Variance
For frequency distribution use a slightly
different formula:
Variance

2
f
X

( X )2

For grouped data use the class midpoint

as the value of X

Sample Variance

Standard deviation
The standard deviation of a data set is the positive
square root of the variance.
It is measured in the same units as the data, making it
more easily comparable, than the variance, to the mean.
If the data set is a sample, the standard deviation is
denoted s.
If the data set is a population, the standard deviation is
denoted (sigma).

SD Var

Use of EXCEL

Coefficient of Variation
The coefficient of variation indicates how large the
standard deviation is in relation to the mean.
If the data set is a sample, the coefficient of variation
is computed as follows:

s s (100)
(100)
xx

If the data set is a population, the coefficient of

variation is computed as follows:

(100)

Measure of Shape - Skewness

Skewness
Skewness - is a measure of the degree of
asymmetry of a distribution
Pearsons coefficient of skewness
PCS =

Excel uses Fishers measure of skewness

FS =
Critical 2

6
N

Excel Function Method

Fishers skew = Cell E7 Formula:=SKEW(B4:B16) = 0.4410

Measure of Shape - Kurtosis

Kurtosis
Kurtosis is a measure of whether the data are peaked or
flat relative to a normal distribution.
Mesokurtic (bell shaped) (ZERO)
Leptokurtic (peaked) (POSITIVE)
Platykurtic (flat) (NEGATIVE)

Fishers Kurtosis
FS =
Excel Function Method
Fishers kurtosis = Cell E10 Formula:=KURT(B4:B16)= - 0.4253

Cri 2

24
N

Data Management: Bryan S. Ambre
100% (2)
Data Management: Bryan S. Ambre
104 pages
Hot Topics in Machine Learning For Research and Thesis
No ratings yet
Hot Topics in Machine Learning For Research and Thesis
10 pages
Introduction To Statistics
No ratings yet
Introduction To Statistics
34 pages
Automatic Radar Plotting Aid (ARPA) : Vasile Radu Adrian ET32
100% (1)
Automatic Radar Plotting Aid (ARPA) : Vasile Radu Adrian ET32
4 pages
2.3 Depletion and Diffusion Capacitances
No ratings yet
2.3 Depletion and Diffusion Capacitances
19 pages
Introduction To Statistics and SPSS
100% (1)
Introduction To Statistics and SPSS
110 pages
Parts of Speech
No ratings yet
Parts of Speech
18 pages
Aaac PDF
No ratings yet
Aaac PDF
1 page
Lecture_4
No ratings yet
Lecture_4
61 pages
1.ungrouped Data Mean, Median&Mode
No ratings yet
1.ungrouped Data Mean, Median&Mode
39 pages
REVISION TEST PAPER -Time
No ratings yet
REVISION TEST PAPER -Time
2 pages
Analytical Techniques Lec 1
No ratings yet
Analytical Techniques Lec 1
42 pages
Wa0001
No ratings yet
Wa0001
24 pages
17th International Conference on Applications of Graph Theory in Wireless Ad hoc Networks and Sensor Networks (GRAPH-HOC 2025)
No ratings yet
17th International Conference on Applications of Graph Theory in Wireless Ad hoc Networks and Sensor Networks (GRAPH-HOC 2025)
2 pages
FINAL_SYCS_ANDROID_APPLICATION_DEVELOPMENT_PRACTICAL_MANUAL[2]
No ratings yet
FINAL_SYCS_ANDROID_APPLICATION_DEVELOPMENT_PRACTICAL_MANUAL[2]
114 pages
MMW
No ratings yet
MMW
7 pages
Data Analitics For Business: Descriptive Statistics
No ratings yet
Data Analitics For Business: Descriptive Statistics
66 pages
L1 Introduction-Displaying Data
No ratings yet
L1 Introduction-Displaying Data
8 pages
2 - JEE Main 2021 Question Paper With Solutions 24 February Evening
No ratings yet
2 - JEE Main 2021 Question Paper With Solutions 24 February Evening
191 pages
01. INTRODUCTION TO BIOSTATISTICS
No ratings yet
01. INTRODUCTION TO BIOSTATISTICS
59 pages
Mamw100 Problem Solving and Reasoning
No ratings yet
Mamw100 Problem Solving and Reasoning
69 pages
BBA - 240 - Lecture - 1 - Introduction - To - Statistics Slides
No ratings yet
BBA - 240 - Lecture - 1 - Introduction - To - Statistics Slides
27 pages
MFC Question Part
No ratings yet
MFC Question Part
28 pages
Lecture2-Part1 Introduction To Measurements
No ratings yet
Lecture2-Part1 Introduction To Measurements
26 pages
Group Discussion Tips - 1
No ratings yet
Group Discussion Tips - 1
9 pages
Unit-2 MFAI
No ratings yet
Unit-2 MFAI
118 pages
Media and Children Literature Survey
No ratings yet
Media and Children Literature Survey
10 pages
Chap 1 - 2: Business Statistics
No ratings yet
Chap 1 - 2: Business Statistics
38 pages
History: E-Commerce Is A Type of Industry Where The Buying and Selling of Products or
No ratings yet
History: E-Commerce Is A Type of Industry Where The Buying and Selling of Products or
12 pages
Arguments Vs Non Arguments
100% (2)
Arguments Vs Non Arguments
21 pages
Midterm Reviewer
No ratings yet
Midterm Reviewer
8 pages
Employess As Customers
No ratings yet
Employess As Customers
11 pages
Statistics
No ratings yet
Statistics
46 pages
PROBABILITY Lecture 1 - 2 - 3
No ratings yet
PROBABILITY Lecture 1 - 2 - 3
63 pages
Elementary Probability Theory For CS648A
No ratings yet
Elementary Probability Theory For CS648A
19 pages
2008 Monaghan Plausibility of GPS Guided Planes Into Towers On 911
No ratings yet
2008 Monaghan Plausibility of GPS Guided Planes Into Towers On 911
11 pages
slides_week1
No ratings yet
slides_week1
46 pages
Catatan Statisktik FIX
No ratings yet
Catatan Statisktik FIX
59 pages
01_Introduction to Statistics
No ratings yet
01_Introduction to Statistics
24 pages
Chapter 19 Financing Infrastructure Projects
No ratings yet
Chapter 19 Financing Infrastructure Projects
22 pages
Lecture 1
No ratings yet
Lecture 1
27 pages
Concepts of Fields
No ratings yet
Concepts of Fields
21 pages
Statistical Foundations - Intro 64zlf
100% (2)
Statistical Foundations - Intro 64zlf
86 pages
Statistics Lec 1
No ratings yet
Statistics Lec 1
28 pages
Advanced Morden Solid State Physics 2
No ratings yet
Advanced Morden Solid State Physics 2
59 pages
01 - Introduction To Statistics
No ratings yet
01 - Introduction To Statistics
38 pages
Kualitas Daun Binahong (Anredera Cordifolia) Pada Suhu Pengeringan Berbeda
No ratings yet
Kualitas Daun Binahong (Anredera Cordifolia) Pada Suhu Pengeringan Berbeda
10 pages
Part II Heat Capacity and Calorimetry
No ratings yet
Part II Heat Capacity and Calorimetry
47 pages
3rd-qtr-stats-reviewer
No ratings yet
3rd-qtr-stats-reviewer
24 pages
Sample Lab Report For Student Use - 1
No ratings yet
Sample Lab Report For Student Use - 1
3 pages
Week_2
No ratings yet
Week_2
15 pages
Statistics- slide 2
No ratings yet
Statistics- slide 2
15 pages
FMIII
No ratings yet
FMIII
144 pages
2. presenting of data_١١١٠٥٩
No ratings yet
2. presenting of data_١١١٠٥٩
39 pages
1 NUMERICALS Module 01 Properties and Fundamental Operations On Matrices 1
No ratings yet
1 NUMERICALS Module 01 Properties and Fundamental Operations On Matrices 1
9 pages
Module 2 - Statistical Foundations
No ratings yet
Module 2 - Statistical Foundations
108 pages
PHP JSON Functions
No ratings yet
PHP JSON Functions
3 pages
1 Stats Intro 14022024 105127am
No ratings yet
1 Stats Intro 14022024 105127am
26 pages
QT Module-2
No ratings yet
QT Module-2
45 pages
SMA 140 Lectures Notes 2024 Sep
No ratings yet
SMA 140 Lectures Notes 2024 Sep
87 pages
Nahavandi Chapter 11 Proof
No ratings yet
Nahavandi Chapter 11 Proof
32 pages
Assessments Chemical Engineering
No ratings yet
Assessments Chemical Engineering
17 pages
Cybersecurity Lab Maual
No ratings yet
Cybersecurity Lab Maual
66 pages
Statistics 101: Introduction To Data Management
No ratings yet
Statistics 101: Introduction To Data Management
37 pages
Valuation of Ashok Leyland Fiev Report
No ratings yet
Valuation of Ashok Leyland Fiev Report
31 pages
Basic-Statistical-Concepts-_-Measures-of-Location.docx
No ratings yet
Basic-Statistical-Concepts-_-Measures-of-Location.docx
14 pages
ADDB - Week 1
No ratings yet
ADDB - Week 1
44 pages
Unit 5 NTMT
No ratings yet
Unit 5 NTMT
22 pages
Lecture 01 Introduction to Statistics Ppt 06022025 095924am
No ratings yet
Lecture 01 Introduction to Statistics Ppt 06022025 095924am
40 pages
Introduction To Statistics
No ratings yet
Introduction To Statistics
39 pages
Intro To Statistics Lecture
No ratings yet
Intro To Statistics Lecture
41 pages
01 Data & Statistics
No ratings yet
01 Data & Statistics
35 pages
STATISTICS (Tanya) PG 1 - 28
No ratings yet
STATISTICS (Tanya) PG 1 - 28
35 pages
Descriptive Statistics, Tables and Graphs 20
No ratings yet
Descriptive Statistics, Tables and Graphs 20
34 pages
Lesson 5 - Quantitative Analysis and Interpretation of Data
No ratings yet
Lesson 5 - Quantitative Analysis and Interpretation of Data
78 pages
Topic 3
No ratings yet
Topic 3
22 pages
Statistics 1232445944520487 1
No ratings yet
Statistics 1232445944520487 1
101 pages
What Is Statistics ? and Describing Data: Frequency Distributio N
No ratings yet
What Is Statistics ? and Describing Data: Frequency Distributio N
17 pages
Stats For PGDM
No ratings yet
Stats For PGDM
52 pages
Part1 141104090445 Conversion Gate01
No ratings yet
Part1 141104090445 Conversion Gate01
27 pages
Nielsen Numeric & WTD Distribution
No ratings yet
Nielsen Numeric & WTD Distribution
38 pages
Intro To Statistics
No ratings yet
Intro To Statistics
35 pages
GE Type 3 Technical Instructions 07-30-2012 - Isolated Operation
No ratings yet
GE Type 3 Technical Instructions 07-30-2012 - Isolated Operation
224 pages
BADB1014 Quantitative Methods - Lesson 3
No ratings yet
BADB1014 Quantitative Methods - Lesson 3
23 pages
1st Mid
No ratings yet
1st Mid
19 pages
4CBT2R, 3R 4CBTK4R, 4cbtyk4r Ce676 PDF
No ratings yet
4CBT2R, 3R 4CBTK4R, 4cbtyk4r Ce676 PDF
393 pages
Statistics A Review
No ratings yet
Statistics A Review
47 pages
Introduction To Statistics: "There Are Three Kinds of Lies: Lies, Damned Lies, and Statistics." (B.Disraeli)
No ratings yet
Introduction To Statistics: "There Are Three Kinds of Lies: Lies, Damned Lies, and Statistics." (B.Disraeli)
32 pages
Intro of Statistics - Ogive
No ratings yet
Intro of Statistics - Ogive
35 pages
Introduction Book 1
No ratings yet
Introduction Book 1
41 pages
ECON 230 - Statistics and Data Analysis - Lecture 1
No ratings yet
ECON 230 - Statistics and Data Analysis - Lecture 1
90 pages
Introduction To Stati Stics: There Are Three Kinds of Lies: Lies, Damned Lies, A ND Statistics." (B.Disraeli)
No ratings yet
Introduction To Stati Stics: There Are Three Kinds of Lies: Lies, Damned Lies, A ND Statistics." (B.Disraeli)
39 pages
Lecture 1-Statistics Introduction-Defining, Displaying and Summarizing Data
No ratings yet
Lecture 1-Statistics Introduction-Defining, Displaying and Summarizing Data
53 pages
RVO-STATISTICS - Statistics - Introduction To Statistics IBBI
No ratings yet
RVO-STATISTICS - Statistics - Introduction To Statistics IBBI
93 pages
CD9211 Computer Applications in Design Jan-10 PDF
No ratings yet
CD9211 Computer Applications in Design Jan-10 PDF
3 pages
Vtu Syllabus Civil
No ratings yet
Vtu Syllabus Civil
49 pages
Thinking Statistically
From Everand
Thinking Statistically
Anthony Banfield
5/5 (1)
Introduction To Business Statistics Through R Software: Software
From Everand
Introduction To Business Statistics Through R Software: Software
Editor IJSMI
No ratings yet

Statistics Introduction: Dr. Sudeep Mallick

Uploaded by

Statistics Introduction: Dr. Sudeep Mallick

Uploaded by

Statistics Introduction

Dr. Sudeep Mallick

Cell Phone Scheme

How does the data look and feel (descriptive statistics)

Use of Statistics in Business Functions

Are two or more market segments significantly

What proportion of people are happy with the

Business Research Methods

Subdivision within Statistics

Summary statistics numbers

Types of Data Variables

Variable - A variable is any measured characteristic or attribute that differs for

Measurement Scale Examples

Recognising a measure scale

Math or Statistics allowed for

the number of pound lost during a six-week diet

the percentile rank from an achievement test

Example - Frequency Distribution

Grouped Frequency Distribution

(Largest value 1) - smallest value

The type of graph you will use to graph the

Frequency distribution- histogram

Relative frequency Histogram

Proposed voting behaviour

Horizontal Bar Chart

Half yearly car sales

8000 10000 12000 14000

Note on Class Boundary Styles

Class (inclusive) Frequency

If the next data item is 10 it goes to the first class, if it is 11 it goes

Note on Class Boundary Styles

Problem with this structure is that it is not

Note on Class Boundary Styles

Class (inclusive) Frequency

(Extra class needed here)

A joint frequency distribution of two variables (e.g. nature of airline, delay in

Descriptive statistics Summary Statistics

Measures of Central Tendency

1354/42 = 32.2381 minutes

Makes use of full data

For grouped data, X is the class mid-point

if n is even then use average of n/2 and n/2 +1 th

Median for Grouped Data

Mode for Grouped Data

L = LCB of the modal class

Percentiles and Quartiles

Given any set of ordered numerical

nth percentile means n percent of data are equal

Position of percentile = (n+1)P/100

Quartiles are special names to percentiles

Percentile and Quartile

L = LCB of percentile class

Percentile and Quartile

Excel Function Method

< - Population variance

Excel Function Method

For grouped data use the class midpoint

If the data set is a population, the coefficient of

Measure of Shape - Skewness

Excel uses Fishers measure of skewness

Excel Function Method

Measure of Shape - Kurtosis

You might also like