0% found this document useful (0 votes)
7 views

1 Stats Intro 13092024 113537pm

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
7 views

1 Stats Intro 13092024 113537pm

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
You are on page 1/ 15

LESSON -1

Basics of Statistics
Definition: Science of collection, presentation, analysis, and reasonable
interpretation of data.

Statistics presents a rigorous scientific method for gaining insight into data. For
example, suppose we measure the weight of 100 patients in a study. With so
many measurements, simply looking at the data fails to provide an informative
account. However statistics can give an instant overall picture of data based
on graphical presentation or numerical summarization irrespective to the
number of data points. Besides data summarization, another important task of
statistics is to make inference and predict relations of variables.
Types of Statistics
 Descriptive Statistics – Methods of organizing,
summarizing, and presenting data in an informative
way
 Inferential Statistics – The methods used to
determine something about a population on the
basis of a sample
 Population –The entire set of individuals or
objects of interest or the measurements
obtained from all individuals or objects of
interest
 Sample – A portion, or part, of the population of
interest
Descriptive Statistics
 Collect data
 e.g., Survey..

 Present data
 e.g., Tables and graphs..

 Summarize data
 e.g., Sample mean etc…
Inferential Statistics
 Estimation
 e.g., Estimate the population mean weight
using the sample mean weight
 Hypothesis testing
 e.g., Test the claim that the population mean
weight is 70 kg

Inference is the process of drawing conclusions or making


decisions about a population based on sample results
Statistical Description of Data
 Statistics describes a numeric set of
data by its
 Center
 Variability
 Shape
 Statistics describes a categorical set
of data by
 Frequency, percentage or proportion of
each category
Statistical data
 The collection of data that are relevant to the problem
being studied is commonly the most difficult, expensive,
and time-consuming part of the entire research project.
 Statistical data are usually obtained by counting or
measuring items.
 Primary data are collected specifically for the analysis
desired
 Secondary data have already been compiled and are
available for statistical analysis
 A variable is an item of interest that can take on many
different numerical values.
 A constant has a fixed numerical value.
Data
Statistical data are usually obtained by counting or
measuring items. Most data can be put into the
following categories:

 Qualitative - data are measurements that each fall


into one of several categories. (hair color, ethnic
groups and other attributes of the population)
 Quantitative - data are observations that are
measured on a numerical scale (distance traveled
to college, number of children in a family, etc.)
Qualitative data
Qualitative data are generally described by words or
letters. They are not as widely used as quantitative data
because many numerical techniques do not apply to the
qualitative data. For example, it does not make sense to
find an average hair color or blood type.
Qualitative data can be separated into two subgroups:
 dichotomic (if it takes the form of a word with two
options (gender - male or female)
 polynomic (if it takes the form of a word with more than
two options (education - primary school, secondary school
and university).
Quantitative data
Quantitative data are always numbers and are the
result of counting or measuring attributes of a
population.
Quantitative data can be separated into two
subgroups:
 discrete (if it is the result of counting (the number
of students of a given ethnic group in a class, the
number of books on a shelf, ...)
 continuous (if it is the result of measuring
(distance traveled, weight of luggage, …)
Frequency Distribution
Consider a data set of 26 children of ages 1-6 years. Then the
frequency distribution of variable ‘age’ can be tabulated as
follows:
Frequency Distribution of Age

Age 1 2 3 4 5 6
Frequency 5 3 7 5 4 2
Grouped Frequency Distribution of Age:
Age Group 1-2 3-4 5-6

Frequency 8 12 6
Frequency Distribution
It is a statistical table shows arrangement of data according to
magnitude, size etc. Also called a group data as the data is
presented in the form of groups.

Suppose marks of 25 students:


2638 52 55 43 46 46 51 43 40 43 42 46
3543 42 29 36 40 52 51 42 31 32 39
The above is a raw data, can be arranged in the form of array in
ascending order…

After having data in array of ascending order, gives information


about min and max values (marks) called range..
Frequency Distribution

Range= max value – min value


= 55 – 26
= 29
The ideal number of classes is between 5 to 15, we also have a
sturges method which gives K = 1 + 3.33 logn
K is tentative number of classes
N is the total number of observations
Log is a common log of base 10

For previous example: K = 1 + 3.33 log25


= 1 + 3.33 (1.3979)
= 5.655
Frequency Distribution
Class width: h = (max val – min val)/no. of classes
= 29/6
= 5 app

26-30, 31-35, 36-40, 41-45, 46-50, 51-55


Are the class intervals. Smaller is the lower and
larger is the upper class intervals.

While 25.5- 30.5, 30.5-35.5,…..are class boundaries


i.e. lower and upper class boundaries.…
Cumulative Frequency
Cumulative frequency of data in previous page
Age 1 2 3 4 5 6

Frequency 5 3 7 5 4 2

Cumulative Frequency 5 8 15 20 24 26

Age Group 1-2 3-4 5-6


Frequency 8 12 6

Cumulative Frequency 8 20 26

You might also like