0% found this document useful (0 votes)
145 views18 pages

Origin and Growth of Statistics

The document discusses the origins and growth of statistics as a field of study. It notes that statistics originally referred to the analysis of state data by governments but has since expanded to many other domains, including business, science, and medicine. The key characteristics of statistics are outlined, such as the focus on aggregate data rather than individual cases and the numerical expression of data. Common terms used in statistics like frequency, range, and measures of central tendency are defined. Examples of how statistics is used in different fields like business, agriculture, education, and medicine are also provided.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
145 views18 pages

Origin and Growth of Statistics

The document discusses the origins and growth of statistics as a field of study. It notes that statistics originally referred to the analysis of state data by governments but has since expanded to many other domains, including business, science, and medicine. The key characteristics of statistics are outlined, such as the focus on aggregate data rather than individual cases and the numerical expression of data. Common terms used in statistics like frequency, range, and measures of central tendency are defined. Examples of how statistics is used in different fields like business, agriculture, education, and medicine are also provided.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 18

ORIGIN AND GROWTH OF STATISTICS

(history)
The word ‘statistics’ and ‘statistical’ are
derived from the Latin word status, means
political state.
 Ø The German statistics, first
introduced by Gottfried ache wall (1749),
originally designated the data analysis of
state.
 Ø It was used by the British mainly for
administrative and governmental bodies.
 Ø In particular census provides regular
information about the population.
 Ø Today however statistics had
broadened Far beyond the service of a state
or government, it includes areas such as
 Ø Business
 Ø Natural and social sciences and Ø
Medicines
 Ø Before 3000B.C. the Babylonians used
small clay tablets to record tabulations of
agricultural yields and of commodities
bartered or sold
 Ø The Egyptians analysed the
population and material wealth of their
kingdoms.
 The Roman Empire was the first
government to gather extensive data about
population, area and wealth of territories
that they controlled. FUNDAMENTAL
CHARACTERSTICS OF STAISTICS
 Ø They are related to each other and are
comparable.
 Ø They are aggregate of facts and not a
single observation. Statistics do not take
into account individual cases
 Ø Statistics data are numerically
expressed. Statistics are collection of data
in a systematic manner. Statistics are
collected for a predetermined purpose.
 Ø Statistics deals with group and
doesn’t study individually.
 Ø Statistics laws are not exact, they are
true only on averages.
 Ø the data collected by someone else,
other than the investigator, are known as
secondary data.
 Ø the data obtained in the original form
are called ungrouped data or raw data.
 Ø An arrangement of raw numerical
data in ascending or descending order of
magnitude is called array.

Sub Topics
Some Related Definitions
These fig. are in the ascending order
4,5,8,18,28,29,29,31,40,40,43,43,46,46,46,47,
47,50,50,55,55,70,71,75,75,80,90

Insert table here…….


The above data in the ascending order is
called an ‘arrayed’ and the way of
arrangement is called an ‘array’.
The way of arrangement of data in the
table is known is known as ‘frequency
distribution’.
Marks are called “variates” the no. of
students who secured a particular no. of
marks are frequency of variates is called
“frequency of the variate”.
The number of times a number has been
repeated is called the “frequency of the
variate”.
CONTINOUS: Quantities which can take
all numerical values within a certain
interval .
DISCONTINOUS: Quantities or variable
which can take only  a finite set of values.
Each groups into which a raw data is
condensed is called a “class”. The size of
class is known as class interval.
For ex. 10 is the class interval of class “0-
10”.
Each class is bounded by 2 fig. which are
called the “lower limits “and “20”is the
“upper limit”.
The difference between the upper limit of
the class and the lower limit of class is
called as the “class size”.
The value which lies midway between lower
and upper limits of a class is known as its
“mid value or class mark”.
Class mark = upper limit + lower limit
2
The difference between the two extreme
observations in an arranged data i.e. the
difference between the maximum and
minimum values of observations is known
as the “Range”.
Three measures of central tendency are
 Ø Mean
 Ø Mode
 Ø Median
MEAN
Median of groped data :if x1 ,x 2, x3 ,
……….x nare variables of a variable x ,
then the arithmetic mean or simply mean
of these values is denoted by X and is
defined as X= x1 +x 2+x 3+……x n or X=/

Algorithm: Step I= Prepare the


frequency table in such a way that its first
column consists of the values of the variate
and the second column the ∑
Step II=multiply the frequency of each row
with the corresponding values of variable
to obtain third column containing fix;
Step III= Find the sum of all entries in
column III to obtain ∑f x
Step IV= Find the sum of all the
frequencies in column II to obtain ∑f = N
Step V= Use the formula: X =
For Ex= Find the missing frequencies in the
following frequency distribution if it is
known that the mean of the distribution is
1.46 No. of accidents (x): 0 1 2 3 4
5                                                          
Total Frequency    (f): 46 ? ? 25 10
5                                                     200
Computation of Arithmetic Mean
Insert table here………..
∑f = N = 86+f 1+f2 ∑f 1x 1=140+f 1+2f2
N= 200
 Ø 200= 86 +f 1+f 2
 Ø f 1+f2 =114+f 2+f 2
 Ø f 1+f 2=114
Also, Mean=1.46
--------------------------------------------------------
---1
 Ø 1.46= ∑f ixiN
 Ø 1.46= 140+f1 +2f 2/ 200
 Ø 292=140+f 1+2f 2
 Ø F 1+2f 2=150
--------------------------------------------------------
-------2

Solving 1,2
 Ø F =76 and f =38
Step Deviation Method
Step I- Obtain the frequency
distribution and prepare the frequency
table in such a way that its first column
consists of the values of the variable and
the second column corresponding
frequencies.
Step II- Choose a number ‘A’(generally
known as the assumed mean) and take
deviations d =x - A about A. Write these
deviations against the corresponding 
di’s in the IV column.
Step IV- Multiply the frequencies in
second column with the corresponding u
’s in IV column to prepare V column of f
u . Step V- find the sum of all entries in
V column to obtain (∑ f x ) and the sum
of all frequencies in  column to obtain
N=(∑ f ). Use formula:X=A+h{ / f x }
MEDIAN
The median is the middle value of a
distribution is the value of the variable
which divides it into two equal parts.
Step I-Arrange the observation x ,x ,
………..x in ascending or descending
order of magnitude.
Step II- Determine the total no. of
observation, say, n.
Step III- If n is odd , then median is the
value of (n+⅟ ) observation. For ex.-
Calculate the median from the following
distribution:
Class:                5-10         10-15          15-
20      20-25           25-30
Frequency:         5                6                
15           10                  5 30-35            35-
40                 40-45 4                   
2                          2
SOLUTION: First cumulative table to
complete median.

Insert table her…….


SO,
N=49 & N/ 2=24.5→The cumulative
frequency just greater than N/ is 26 and
corresponding class is 15-20
(Median class) L=15, f=15, F=11, h=5 :.
Median = ( / -F)/ = / *5 =19.5
MODE
The mode or modal value of a
distribution is that value of the variable
for which the frequency is maximum. In
order to compute the mode of a series of
individual observations. We first
convert it into a discrete series
frequency distribution by preparing a
frequency table. From the frequency
table, we identify the value having
maximum frequency. The value of
variable to obtain is the mode or modal
value.
FOR EX.
Obtain the value of the following:
l→ lower limit
h → width
f x→ frequency
f 1→ frequency of the class preceding
f 2→ frequency of the class following.
MODE= l+ / * h
Relationship among mean, median and
mode
MODE= 3median – 2mean
Or MEDIAN = mode+⅔ (mean – mode)
Or MEAN = mode + / (median – mode)
PIE-CHART
A pie – chart displays data as a
percentage as a percentage of the whole
Each pie has a label and percentage . A
total data no. is included . These have a
circle divided into parts or sectors of
different sizes to show different amounts
of data.
Advantages
 Ø Visually appealing
 Ø Shows % of total for each category
Disdvantages
 Ø No exact numerical data.
 Ø Hard to compare two data sets.
 Ø “Other” category can be a problem.
 Ø Total unknown unless specified.
 Ø Best for 3 to 7 only.
BAR - GRAPH
A bar graph display data in separate
columns. It data is on a continous scale,
such as height, the bars touch each
other. The  bars can be vertical or
horizontal.
Advantages
 Ø Visually strong.
 Ø Can compare 2 or 3 data sets.
Disdvantages
 Ø Graph categories can be recorded to
emphasis certain effects.
LINE GRAPH
A line graph plots continuous data as
points and then joins them with a line.
Multiple data sets can be grouped
together, but a key must be used.
Advantages
 Ø Can compare multiple continuous
data sets easily.
Ø Interim data can be inferred from
graph line Disdvantages
 Ø Use only with continuous data.

Run Rate Graph


LINE GRAPH
HISTOGRAM
A histograph displays continuous data
in order column. Categories are of
continuous measures such as time,
inches, temperature, etc.
Advantages
 Ø Visually strong
 Ø Can compare to normal curve.
 Ø Usually vertical axis is a frequency
count of items falling into each category.

Disadvantages
 Ø Cannot read exact values because
data is grouped into categories.
 Ø More difficult to compare two data
sets
 Ø Use only with continuous data.
HISTOGRAM
Uses and applications of statistics
Statistics and its studies have been used
to answer questions such as:-
Industries and Business
 Ø Report of early sales & comparison
others.
 Ø It shows where the factory or its sales
lack and where they are good
Agriculture
 Ø What amount of crops are grown this
year in comparison to previous year or in
comparison to required amount of crop for
the country
 Ø Quality and size of grains grown due
to use of different fertilizer.
Forestry
 Ø How much growth has been occurred
in area under forest or how much forest
has been depleted in last 5 years?
 Ø How much different species of flora
and fauna have increased or decreased in
last 5 years?
Education
 Ø Money spend on girls education in
comparison to boys education?
 Ø Increase in no. of girl students who
seated in who seated for different exams?
 Ø Comparison for result for last 10
years.

Ecological Studies
 Ø Comparison of increasing impact of
pollution on global warming?
 Ø Increasing effect of nuclear reactors
on environment? Medical Studies
 Ø No. of new diseases grown in last 10
year.
 Ø Increase in no. of patients for a
particular disease. Sports
 Ø Used to compare run rates of to
different teams.
 Ø Used to compare to different players.
Conclusion
Statistics has been a great learning
experience and is a very interesting
experience and an important topic that
is very-very helpful for people of all ages
and for teacher to clear their concept
and increase their intelligence level.

You might also like