L5 - Presentation of Data
L5 - Presentation of Data
and
Frequency Distribution
Presentation of Data
Presentation of data is of utter importance nowadays. Afterall
everything that’s pleasing to our eyes never fails to grab our attention.
Presentation of data refers to an exhibition or putting up data in an
attractive and useful manner such that it can be easily interpreted.
Data presentation is defined as the process of using various graphical
formats to visually represent the relationship between two or more data
sets so that an informed decision can be made based on them.
Data can be presented in one of the three forms: text, tables, and/or
graphs. The selection of the method of presentation depends on the
type of data, method of analysis, and type of information sought from
the data.
Textual Form Presentation of Data
The discussion about the presentation of data starts off with
it’s most raw and vague form which is the textual presentation.
In such form of presentation, data is simply mentioned as
mere text, that is generally in a paragraph. This is commonly
used when the data is not very large.
Textualpresentation of data means presenting data in the
form of words, sentences and paragraphs.
Textual Form Presentation of Data
For example, “the 2002 earthquake proved to be a mass murderer
of humans. As many as 10,000 citizens have been reported dead”. The
textual representation of data simply requires some intensive reading.
This is because the quantitative statement just serves as an evidence
of the qualitative statements and one has to go through the entire text
before concluding anything.
Tabular Form Presentation of Data
A table facilitates representation of even large amounts of data in an
attractive, easy to read and organized manner. The data is organized in
rows and columns. This is one of the most widely used forms of
presentation of data since data tables are easy to construct and read.
In a tabular presentation, data are presented in rows and columns. The
most important advantage of tabulation is that it organizes data for further
statistical treatment and decision making. Classification used in tabulation
is of four kinds:
Qualitative
Quantitative
Temporal and
Spatial
Tabular Form
Presentation of Data
Qualitative classification
When classification is done
according to attributes, such as
social status, physical status,
nationality, etc.
Tabular Form
Presentation of Data
Quantitative classification
the data are classified on the
basis of characteristics which
are quantitative in nature, these
characteristics can be
measured quantitatively. For
example, age, height,
production, income, etc.
Tabular Form
Presentation of Data
Temporal classification
Inthis classification time
becomes the classifying
variable and data are
categorised according to time.
Time may be in hours, days,
weeks, months, years, etc
Tabular Form
Presentation of Data
Spatial classification
When classification is done on
the basis of place, it is called
spatial classification. The place
may be a village/town, block,
district, state, country, etc.
Tabular Form Presentation of Data
Components of Data Tables
Table Number: Each table should have a specific table number for
ease of access and locating. This number can be readily mentioned
anywhere which serves as a reference and leads us directly to the data
mentioned in that particular table.
Title: A table must contain a title that clearly tells the readers about the
data it contains, time period of study, place of study and the nature
of classification of data.
Headnotes: A headnote further aids in the purpose of a title and
displays more information about the table. Generally, headnotes
present the units of data in brackets at the end of a table title.
Tabular Form Presentation of Data
• Stubs or Row Headings: These are titles of the rows in a table. Thus
a stub display information about the data contained in a particular row.
• Caption or Column Headings: A caption is the title of a column in the
data table. In fact, it is a counterpart if a stub and indicates the
information contained in a column.
• Body or field: The body of a table is the content of a table in its
entirety. Each item in a body is known as a ‘cell’.
• Footnotes: Footnotes are rarely used. In effect, they supplement the
title of a table if required.
• Source: When using data obtained from a secondary source,
this source has to be mentioned below.
Tabular Form
Presentation
of Data
Array Form Presentation of Data
Inmathematics, an array is an arrangement of numbers or symbols in
rows and columns. In statistics it is a group of numbers in rows and
columns with the smallest at the beginning and the rest in order of size
up to the largest at the end.
For example:
1, 1, 2, 2, 2, 3, 3, 3, 3, 4, 4, 4, 4, 4, 5, 5, 5, 5, 5, 5, 6, 6, 6, 6, 6, 7, 7, 7,
7, 8, 8, 8, 9, 9
Stem Leaf Presentation
Stem
and leaf plots are used to display quantitative data (i.e.
numbers) for small data sets.
Theydisplay the exact values of each data point. These plots
are useful for visualizing frequency distributions in data sets.
A stem and leaf plot looks something like a bar graph. Each
number in the data is broken down into a stem and a leaf,
thus the name. The stem of the number includes all but the
last digit. The leaf of the number will always be a single digit.
Stem Leaf Presentation
A good stem and leaf plot
• shows the first digits of the number (thousands, hundreds or
tens) as the stem and shows the last digit (ones) as the leaf.
• usually uses whole numbers. Anything that has a decimal
point is rounded to the nearest whole number. For example,
test results, speeds, heights, weights, etc.
• looks like a bar graph when it is turned on its side.
• shows how the data are spread—that is, highest number,
lowest number, most common number and outliers (a
number that lies outside the main group of numbers).
Stem Leaf Presentation
Example: A teacher asked 10 of her students how many books they had read in
the last 12 months. Their answers were as follows: 12, 23, 19, 6, 10, 7, 15, 25, 21,
12
Books read in a year by 10 students :
stem 0 represents the class interval 0 to 9;
stem 1 represents the class interval 10 to 19; and
stem 2 represents the class interval 20 to 29.
The number 6 can be written as 06, which means that it has a stem of 0 and a leaf of 6.
Usually, a stem and leaf plot is ordered, which
simply means that the leaves are arranged in
ascending order from left to right.
Stem Leaf Presentation
Example – Splitting stems using decimal values
The weights (to the nearest tenth of a kilogram) of 30 students were measured and recorded as
follows: 59.2, 61.5, 62.3, 61.4, 60.9, 59.8, 60.5, 59.0, 61.1, 60.7, 61.6, 56.3, 61.9, 65.7, 60.4, 58.9,
59.0, 61.2, 62.1, 61.4, 58.4, 60.8, 60.2, 62.7, 60.0, 59.3, 61.9, 61.7, 58.4, 62.2
Weights of 30 students :
Frequency
Distribution
Table
Frequency
The frequency of a particular data value is the number of times the
data value occurs.
Forexample, if four students have a score of 80 in mathematics, and
then the score of 80 is said to have a frequency of 4. The frequency of
a data value is often represented by f.
Frequency table/chart
A frequency table/chart is constructed by arranging
collected data values in ascending order of magnitude
with their corresponding frequencies
Example:
We had 183 students fill out a questionnaire. One of the
questions was which study major they're following. The
screenshot below shows part of these data.
Frequency Distribution Table - The resulting table (below)
shows how frequencies are distributed over values -study majors
in this example- and hence is a frequency distribution. Note that the
frequencies add up to our sample size of 183 students. This is
always the case unless a variable contains missing values:
respondents can sometimes skip a question or answer “no answer”
or something similar.
Relative Frequencies
Entry Table –
Example multiple mode : 8, 9, 10, 10, 10, 11, 11, 11, 12, 13
The mode is the number repeated most often. This list has two values
that are repeated three times; namely, 10 and 11, each repeated
three times. Therefore the mode is 10 and 11