8614 Assignment - Copy Final
8614 Assignment - Copy Final
ISLAMABAD
According to Boddington:
“Statistics is the science of estimates and probabilities.”
According to Lovitt:
“Statistics is the science which deals with the collection, classification
and tabulation of numerical facts as the basis for explanation, description
and comparison of phenomena.”
Tate, M.N. is statistics in Education (1953) has very beautifully summed up
the concept of statistics as It’s all perfectly clear; you complete statistics
(means, medians, modes etc.) from statistics (numerical facts) by
statistics (statistics as a science or methodology).
2
Characteristics of Statistics
3
Data should be numerically expressed, enumerated of
estimated
2. Individual Comparison:
Statistics helps in the individual comparison of students differing in respect
of their ages, abilities and intelligence levels. It is statistics which tells us
why thus students who are similar in every other respect yet do not show
similar achievement is one particular subject.
4
vocational guidance so that they make the best use of these abilities and
the process of guidance is based upon statistics only.
Question # 02
Discuss different types of data. Also elaborate
differences between primary and secondary data.
Primary Data Definition
Primary data is the data that is collected for the first time through personal
experiences or evidence, particularly for research. It is also described as
raw data or first-hand information. The mode of assembling the
information is costly, as the analysis is done by an agency or an external
organization, and needs human resources and investment. The investigator
supervises and controls the data collection process directly.
The data is mostly collected through observations, physical testing, mailed
questionnaires, surveys, personal interviews, telephonic interviews, case
5
studies, and focus groups, etc.
Pros
Cons
6
assembled is for some other purpose and may not meet the present
research purpose or may not be accurate.
Pros
Cons
7
The differences between the primary and secondary data are represented
in a comparison format as follows:
Primary Data Secondary Data
Definition Primary data are those Secondary data refer
that are collected for to those data that have
the first time. already been collected
by some other person.
Originality These are original These are not original
because these are because someone else
collected by the has collected these for
investigator for the his own purpose.
first time.
Nature of Data These are in the form These are in the
of raw materials finished form.
Reliability and These are more These are less reliable
Suitability reliable and suitable and less suitable as
for the enquiry someone else has
because these are collected the data
collected for a which may not
particular purpose. perfectly match our
purpose.
Time and Money Collecting primary data Secondary data
8
is quite expensive both requires less time and
in the terms of time money; hence it is
and money. economical.
Precaution and No particular Both precaution and
Editing precaution or editing is editing are essential as
required while using secondary data were
the primary data as collected by someone
these were collected else for his own
with a definite purpose.
purpose.
Question # 03
Explain ‘pictogram’ as a technique to explore/explain data.
What is a pictogram?
Pictograms are types of charts and graphs that use icons and images to
represent data. Also known as “pictographs”, “icon charts”, “picture charts”,
and “pictorial unit charts”, pictograms use a series of repeated icons to
visualize simple data. The icons are arranged in a single line or a grid, with
each icon representing a certain number of units (usually 1, 10, or 100). A
pictogram is one of the simplest and most popular forms of data
visualization out there. Besides making your data look nice, pictograms
can make your data more memorable.
9
percentage, you can use pictograms to add visual impact to simple data.
How To Make a Pictogram?
The first step in creating a pictograph is to gather the relevant data that we
want to show. Once the data has been collected, arrange it in a table or list.
The data categories must also be correctly specified.
Select visual representations for the data, such as symbols or images. The
most relevant symbols or visual representations of the provided data must
be used. Fractions can be drawn for the symbols if the frequency is not
a whole number.
Assign a key
Use a key to indicate the value of each image when displaying the data
using images because one image cannot adequately depict the data if the
frequency of the data is too high. As a result, the “key” numerical value,
which must also be written with the pictograph, is used.
Use columns
When making a pictogram, use columns to indicate the category and data.
Create a pictogram that represents the frequency of data using symbols
and/or images.
Once the pictograph has been created, check that the categories or
labelling and the images must accurately reflect the data.
Consider the avocado, coconut, lemon, mango, orange, and pear trees
found on a farm. Let us create a pictogram that displays how many trees
10
are on a farm.
The data gathered are shown in the table below. The count and tree types
are presented.
20 35 15 25 10 30
Assign a key
Since the given data is a multiple of 5, we will represent each symbol equal
Use columns
In constructing the pictograph for the data collected, we will use two
columns. The labels in the first column are the types of trees. The second
column indicates the number of trees.
11
Avocado
Coconut
Lemon
Mango
Orange
= 5 trees
Regarding the data gathered, each row must accurately reflect the total
number of trees in each category. There should be four trees as there are a
total of 20 avocado trees. Seven trees will be used to represent the coconut,
three for the lemons, five for the mango, two for the orange, and six for the
pear.
Avocado
Coconut
12
Lemon
Mango
Orange
Pear
= 5 trees
Avocado 5 + 5 + 5 + 5 = 20
Coconut 5 + 5 +5 + 5 + 5 + 5 + 5 = 35
Lemon 5 + 5 + 5 =15
Mango 5 + 5 + 5 + 5 + 5 = 25
Orange 5 + 5 = 10
13
Pear 5 + 5 + 5 + 5 + 5 + 5 =30
Pictograms are used for a variety of reasons, some of which are described
below.
Question # 04
Pie Chart is a common way to depict data. Discuss its
usage and drawbacks
What is a Pie Chart?
The “pie chart” is also known as a “circle chart”, dividing the circular
statistical graphic into sectors or sections to illustrate the numerical
problems. Each sector denotes a proportionate part of the whole. To find
out the composition of something, Pie-chart works the best at that time. In
most cases, pie charts replace other graphs like the bar graph, line plots,
histograms, etc.
Formula
The pie chart is an important type of data representation. It contains
different segments and sectors in which each segment and sector of a pie
chart forms a specific portion of the total(percentage). The sum of all the
data is equal to 360°.
The total value of the pie is always 100%.
15
To work out with the percentage for a pie chart, follow the steps given
below:
16
If you create a pie chart showing product line performance, for example,
and you have two lines that each account for 50 percent of turnover, your
pie chart will simply have two halves. If there were a 75/25 split, the bigger
line would take up three quarters of the chart, leaving one quarter for the
other. Effects such as three-dimensional charting, slice pivoting and
dragging slices out of the circle adds emphasis to individual data sets and
makes charts more visually appealing.
Advantages
17
Provides a data comparison for the audience at a glance to give an
immediate analysis or to quickly understand information
Disadvantages
It becomes less effective if there are too many pieces of data to use
If there are too many pieces of data. Even if you add data labels and
numbers may not help here, they themselves may become crowded
and hard to read
As this chart only represents one data set, you need a series to
compare multiple sets
This may make it more difficult for readers when it comes to analyze
and assimilate information quickly
Question # 05
What do you understand by ‘measure of dispersion’? Also
briefly discuss some common measures of dispersion.
What is Dispersion in Statistics?
Dispersion is the state of getting dispersed or spread. Statistical dispersion
means the extent to which numerical data is likely to vary about an average
value. In other words, dispersion helps to understand the distribution of the
data.
18
Measures of Dispersion
A measure of dispersion indicates the scattering of data. It explains the
disparity of data from one another, delivering a precise view of their
distribution. The measure of dispersion displays and gives us an idea about
the variation and the central value of an individual item.
19
An absolute measure of dispersion contains the same unit as the original
data set. The absolute dispersion method expresses the variations in terms
of the average of deviations of observations like standard or means
deviations. It includes range, standard deviation, quartile deviation, etc.
The types of absolute measures of dispersion are:
1. Co-efficient of Range
2. Co-efficient of Variation
3. Co-efficient of Standard Deviation
4. Co-efficient of Quartile Deviation
5. Co-efficient of Mean Deviation
20
Co-efficient of Dispersion
The coefficients of dispersion are calculated (along with the measure of
dispersion) when two series are compared, that differ widely in their
averages. The dispersion coefficient is also used when two series with
different measurement units are compared. It is denoted as C.D.
The common coefficients of dispersion are:
21