Introduction of Statistics
Introduction of Statistics
STATISTICS
Prepared By
Dr Umme Habibah Rahman
Introduction of Statistics
Definition of Statistics:
The word ‘Statistics’ is probably derived from the Latin word ‘status’ or the Italian word
‘Statista’. The word ‘Statistics’ is used in singular as well as in plural sense. As a plural,
statistics may be defined as the numerical data relating to an aggregate of individuals and as a
singular it is defined as the science of collection, organization, presentation, analysis and
interpretation of numerical data. According to Sir R.A. Fisher “The science of Statistics is
essentially a branch of applied mathematics and may be regarded as mathematics applied to
observational data”. Fisher’s definition is the most exact in the sense that it covers all aspects
and fields of Statistics who is known as “Father of Modern Statistics”.
Scope of Statistics:
During last few decades statistics has penetrated into almost all sciences like agriculture,
biology, business, social, engineering, medical, etc. Statistical methods are commonly used for
analysing and interpreting experimental data. Also, wide and varied applications have led to
the growth of many new branches of statistics such as Industrial Statistics, Biometrics,
Biostatistics, Agricultural Statistics and the most recently developed Statistical Bioinformatics.
In brief we may summarize the scope of statistics as follows:
(a) Statistics has great significance in the field of physical and natural sciences. It is used in
propounding and verifying scientific laws.
(b) Statistics is often used in agricultural and biological research for efficient planning of
experiments and for interpreting experimental data.
(c) Statistical techniques are used to study the various economic phenomena such as wages,
price analysis, analysis of time series, demand analysis etc.
(d) Successful business executives make use of statistical techniques for studying the needs
and future prospects of their products.
(e) In industry, the statistical tools are very helpful in the quality control and assessment.
Limitations of Statistics:
i) Statistical methods are best applicable to quantitative data.
ii) Statistical decisions are subject to certain degree of error.
iii)Statistical laws do not deal with individual observations but with a group of observations.
iv) Statistical conclusions are true on an average.
v) Statistics is liable to be misused. The misuse of statistics may arise because of the use of
statistical tools by inexperienced and untrained persons.
Types of Data
The data collected by an investigator which have not been organized numerically and used by
anybody else is known as raw data. An arrangement of raw numerical data in ascending or
descending order of magnitude. The data can also be classified into two types----
(a) Primary data
(b) Secondary data.
Primary data: The data collected directly from the original source is called the primary data
i.e. the data collected for the first time. The primary data may be collected by:
Secondary Data: The secondary data is one which has already been collected by a source other
than the present investigator. Data which are obtained from published or unpublished sources
are known as secondary data.
The chief sources of Secondary data can be classified into two groups-----
1. Published Sources
2. Unpublished Sources
Published Sources: There are certain national, international or local sources which published
statistical data on a regular basis. These sources can be summed up as follows----
(i) National/ International Publications: Indian and Foreign governments and international
agencies published regular and occasional reports on various topics. These are perennial
sources of information.
(ii) Newspapers and Magazines: There are millions of newspapers and magazines who
maintained their research bureaus and publish original data on important problems.
(iii)Individuals research scholars: The individual research scholars of universities and other
allied agencies also supply a rich material on matters of importance.
Unpublished Sources: There are various sources of unpublished data such as the records
maintained by the various Government and private offices, studies made by the research
scholars in the universities and other research institutions etc.
Struge’s formula: A numerical formula as suggested by H.A. Struge may be used for
determining approximately the class size and the number of classes. According to this
formula the number of classes (k) is given
k = 1+ 3.322 log10 N, where N is the number of observations. Then class size is determined
as
Class width (h)=Largest value−smallest test value/Number of Classes.