Presentation of Data
Presentation of Data
com
PRESENTATION OF DATA
1.2 INTRODUCTION
Any observation collected in respect of any characteristic or event is called data. The data convey
little meaning when considered alone. However, when it is minimized, processed/analyzed and then
presented systematically it is converted to information. The information is then used for evaluation,
planning and decision making
Whenever data is presented pictorially or graphically it gives a better visual impression. Statisticians
must organize, explore and summarize data for easy interpretation. Tables, Pictograms, charts, and
graphs are used to present data for various purposes.
Presentation of Data
Tabular Graphical
Simple table Frequency distribution For quantitative data For qualitative data
table
1. Histogram 1. Bar chart
2. Frequency polygon 2. Pictogram
3. Normal distribution curve 3. Pie chart
4. Line graph 4. Map diagram
5. Cumulative distribution curve
6. Lorenz Curve
7. Scatter diagram
LectureNotes@[email protected]
1.3TABLES
A table is a layout of information or data in form of columns and rows. They are the simplest form
of data display. Some of the possible reasons for which a table may be constructed are to:
The principles to construct a table so that it achieves its objective in the best way possible are as
follows:
Measurements of single set are presented using simple tables. When characteristics with values are
presented in the form of table, it is known as simple table
Example 1
The workers in a certain company were appraised and given either of the ratings: good, very good,
fair, and extremely good. The table shows the number of workers in each rating.
Example 2
Table 1.1 Infant mortality rates of selected countries in 2004
Cases where one variable appears more than one times, frequency distribution tables are used. These
tables show the items and the number of times each item appears. A frequency distribution table can
be constructed running horizontally or vertically. The data is arranged in order of size either
increasing or decreasing.
Example 3
In a certain Institute, tutors are paid an hourly rate and the amount depends on their experience.
The table shows how many tutors are paid each rate.
LectureNotes@[email protected]
In most case when dealing with a lot of data, the data is first split up into convenient groups (class
interval) and the number of items (frequency) which occur in each group is shown in adjacent
columns. Hence it is a table showing the frequency with which the values are distributed in different
groups or classes with some defined characteristics. In this case, the following guidelines should be
followed:
Example 4
In a simple bar chart, a separate bar for each class is drawn to a height proportional to the class
frequency. You can identify the figure that each bar represents at the base of the bar or you can use
a key to show that a color or shade indicates a particular item. One of the most important points to
remember when drawing Simple bar charts is that you must start the scale from zero.
When there are many categories on X-axis and they have further subcategories, then to
accommodate the categories, the bars may be divided into parts, each part representing a certain
item and proportional to the magnitude of that particular item
LectureNotes@[email protected]
1.5 HISTOGRAMS
Used for Quantitative, Continuous Variables i.e variables which have no gaps e.g age, weight, height,
blood pressure, blood sugar etc.
LectureNotes@[email protected]
Frequency polygons can always be used in place of histogram, but are particularly useful when there
are many classes in the distribution; or if two or more frequency distributions need to be compared
LectureNotes@[email protected]
A cumulative frequency curve also known a Ogive is drawn with the cumulative frequency plotted
against the upper boundaries of the relevant interval. This kind of curve allows you to read off
numbers below (or less than) a specified value.
To draw a cumulative frequency, curve the points are joined with a smooth curve. These curves can
be used for estimation purposes. Example : The frequency table below shows marks of candidates in
an exam
A Lorenz curve is composed of a cumulative percentage frequency curve of the less than type. A
Lorenz curve plots cumulative percentage frequency against cumulative percentage class totals. A
class total for any frequency distribution class is the total value of all items belonging to the class.
Class totals are estimated using: Class total=class mid-point x class frequency
LectureNotes@[email protected]
A Lorenz curve has a horizontal axis that begins at 0% and ends at 100% and a vertical axis that
begins at 0% and ends at 100%.
The purpose of such a curve is to demonstrate relative proportions of values represented along the
horizontal versus values represented along the vertical axes.
Adrian $ 90.00
Bob 15.00
Cathy 70.00
Derek 200.00
Eddie 125.00
A Lorenz curve may be obtained as follows:
1 Arrange in ascending order: 3Find % age income 4 Find % age Cum income
Bob $15.00
3% 3%
Cathy 70.00
14% 17%
Adrian 90.00
18% 35%
Eddie 125.00
25% 60%
Derek 200.00
40% 100%
Lorenz Curve
L LED
120
Cumm Percent of Income
100
80
60
40 Lorenz
20
curve
0
0 20 40 60 80 100 120
Cumm Percent of Households
LectureNotes@[email protected]
The aim of a Lorenz curve is to show how the total value of the measurement of some economic
variable is shared out among the subjects or items involved. The aim is realized by comparing a
Lorenz curve with the line of equal distribution (LED). The further away the Lorenz curve is from
the LED, the less equally the commodity involved is distributed. Standard situations where Lorenz
curves are used (and often quoted) a’e distributions of incomes (both before and after tax) personal
wealth, turnover of companies, GNP of countries and similar monetary data
They are used to show inequalities in connection with matters, such as:
Frequency polygons may take many different shapes, but many naturally occurring
phenomena are approximately distributed according to the symmetrical, bell-shaped normal
or Gaussian distribution.
In normal distribution curve, the three measures of central tendency are identical
approximately 68% of the distributions falls within ±1 standard deviation of the mean
approximately 95% of the distributions falls within ± 2 standard deviation of the mean
approximately 99.7% of the distributions falls within ±3 standard deviation of the mean men
and women (each gender forms its own distribution around a different midpoint).
Asymmetrical distribution are called skewed distributions. The three measures of central
tendency differ. Mode is highest point on curve, the mean is pulled up or down by the
influence of a relatively small number of very high or very low scores and the median lies
between the two.
n Positively (or right) skewed distributions and negatively (or left) skewed distributions can
be identified by the location of the tail of the curve:
-Positively skewed distributions have a relatively large number of low scores and a small
number of very high scores.
-Negatively skewed distributions have relatively large number of high scores and a small
number of low scores.
LectureNotes@[email protected]
A line graph represents the data concerning one variable on the horizontal and other variable on the
vertical axis. It uses points and lines to show change over time. It is plotted from a set of points and
then joined by a line. Different data sets can be plotted on the same graph but a key must be used to
identify each data set.
Line diagrams are used to show the trend of events with the passage of time.
The line graphs below shows trend of profit for a certain company for six years and unemployment
rates for the last 10 years respectively
LectureNotes@[email protected]
This is a plot of points on the Cartesian plane that shows the general shape and trend as either being
a straight line or curve or none at all. It is used to show the existence of association or relationship
among variables
LectureNotes@[email protected]
Pie chat is Most common way of presenting qualitative data The value of each category is divided
by the total values and then multiplied by 360 and then each category is allocated the respective
angle to present the proportion it has.
It is often necessary to indicate percentages in the segment as it may not be sometimes very easy
virtually, to compare the areas of segments.
LectureNotes@[email protected]
1.13 PICTOGRAMS
This is a popular method of presenting data to those who cannot understand orthodox charts. Small
pictures or symbols are used to present the data,e.g a picture of a doctor to represent the population
of physician. Fraction of the picture can be used to represent numbers smaller than the value of
whole symbol
When statistical data refers to geographic or administrative areas, it is presented either as statistical
map or dot map. The shaded maps are used to present data of varying size. The areas are shaded
with different colours or different intensities of the same colour, which is indicated in the key.
LectureNotes@[email protected]