2 LESSON 2 Freq Graphs FQ

This document discusses frequency distributions and principles for constructing them. It defines frequency distributions and explains univariate and bivariate distributions. For univariate distributions, it describes series of individual observations, discrete frequency distributions which use tally marks, and continuous frequency distributions which classify data into groups. The key principles for constructing distributions are that classes should be clearly defined, exhaustive and mutually exclusive. The optimal number of classes balances losing information with irregular distributions, and generally ranges between 5-20 classes. The class interval size depends on the number of classes.

Uploaded by

exams_sbs

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

167 views

2 LESSON 2 Freq Graphs FQ

Uploaded by

exams_sbs

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

You are on page 1/ 21

2 LESSON 2

Presenting Data in Tables and Charts

2.0 Learning Objectives

To develop tables and charts for categorical data

To develop tables and charts for numerical data

The principles of properly presenting graphs

Fig1

FREQUENCY DISTRIBUTION
AND GRAPHICAL PRESENTATION
2.1What is frequency distribution
Collected and classified data are presented in a form of frequency distribution. Frequency
distribution is simply a table in which the data are grouped into classes on the basis of common
characteristics and the number of cases which fall in each class are recorded. It shows the
frequency of occurrence of different values of a single variable. A frequency distribution is
constructed to satisfy three objectives :
(i) to facilitate the analysis of data,
(ii) to estimate frequencies of the unknown population distribution from the distribution of
sample data, and
(iii) to facilitate the computation of various statistical measures.
Frequency distribution can be of two types :
1. Univariate Frequency Distribution.
2. Bivariate Frequency Distribution.
In this lesson, we shall understand the Univariate frequency distribution. Univariate distribution
incorporates different values of one variable only whereas the Bivariate frequency distribution
incorporates the values of two variables. The Univariate frequency distribution is further
classified into three categories :
(i) Series of individual observations,
(ii) Discrete frequency distribution, and
(iii) Continuous frequency distribution.
Series of individual observations is a simple listing of items of each observation. If marks of 14
students in statistics of a class are given individually, it will form a series of individual
observations.
Marks obtained in Statistics :
Roll Nos. 1 2 3 4 5 6 7 8 9 10 11 12 13 14
Marks : 60 71 80 41 81 41 85 35 98 52 50 91 30 88
Marks in Ascending Order Marks in Descending Order
30 98
35 91
41 88
41 85
50 81
52 80
60 71
71 60
80 52
81 50
85 41
88 41
91 35
98 30
Discrete Frequency Distribution: In a discrete series, the data are presented in such a way that
exact measurements of units are indicated. In a discrete frequency distribution, we count the
number of times each value of the variable in data given to you. This is facilitated through the
technique of tally bars.
In the first column, we write all values of the variable. In the second column, a vertical bar called
tally bar against the variable, we write a particular value has occurred four times, for the fifth
occurrence, we put a cross tally mark ( / ) on the four tally bars to make a block of 5. The
technique of putting cross tally bars at every fifth repetition facilitates the counting of the
number of occurrences of the value. After putting tally bars for all the values in the data; we
count the number of times each value is repeated and write it against the corresponding value of
the variable in the third column entitled frequency. This type of representation of the data is
called discrete frequency distribution.
We are given marks of 42 students:
55 51 57 40 26 43 46 41 46 48 33 40 26 40 40 41
43 53 45 53 33 50 40 33 40 26 53 59 33 39 55 48
15 26 43 59 51 39 15 45 26 15
We can construct a discrete frequency distribution from the above given marks.
Marks of 42 Students
Marks Tally Bars Frequency
15 3
26 5
33 4
39 2
40 5
41 2
43 3
45 2
46 2
48 2
50 1
51 2
53 3
55 3
57 1
59 2
Total 42
The presentation of the data in the form of a discrete frequency distribution is better than
arranging but it does not condense the data as needed and is quite difficult to grasp and
comprehend. This distribution is quite simple in case the values of the variable are repeated
otherwise there will be hardly any condensation.
Continuous Frequency Distribution: If the identity of the units about a particular information
collected, is neither relevant nor is the order in which the observations occur, then the first step
of condensation is to classify the data into different classes by dividing the entire group of values
of the variable into a suitable number of groups and then recording the number of observations in
each group. Thus, we divide the total range of values of the variable (marks of 42 students) i.e.
59_15 = 44 into groups of 10 each, then we shall get (42/10) 5 groups and the distribution of
marks is displayed by the following frequency distribution:
Marks of 42 Students
Marks (×) Tally Bars Number of Students (f)
15—25 3
25—35 9
35—45 12
45—55 12
55—65 6
Total 42
The various groups into which the values of a variable are classified are known classes, the
length of the class interval (10) is called the width of the class. Two values, specifying the class,
are called the class limits. The presentation of the data into continuous classes with the
corresponding frequencies is known as continuous frequency distribution. There are two methods
of classifying the data according to class intervals :
(i) exclusive method, and
(ii) inclusive method
In an exclusive method, the class intervals are fixed in such a manner that upper limit of one
class becomes the lower limit of the following class. Moreover, an item equal to the upper limit
of a class would be excluded from that class and included in the next class. The following data
are classified on this basis.
Income. No.ofPersons
(Rs.)
200—250 50
250—300 100
300—350 70
350—400 130
400—450 50
450—500 100
Total 500
It is clear from the example that the exclusive method ensures continuity of the data in as much
as the upper limit of one class is the lower limit of the next class. Therefore, 50 persons have
their incomes between 200 to 249.99 and a person whose income is 250 shall be included in the
next class of 250—300.
According to the inclusive method, an item equal to upper limit of a class is included in that class
itself. The following table demonstrates this method.
Income. No.ofPersons
(Rs.)
200—249 50
250—299 100
300—349 70
350—399 130
400—449 50
450—499 100
Total 500
Hence in the class 200—249, we include persons whose income is between Rs. 200 and Rs. 249.
2.2 Principles for Constructing Frequency Distributions
Inspite of the great importance of classification in statistical analysis, no hard and fast rules are
laid down for it. A statistician uses his discretion for classifying a frequency distribution and
sound experience, wisdom, skill and aptness for an appropriate classification of the data.
However, the following guidelines must be considered to construct a frequency distribution:
1. Type of classes: The classes should be clearly defined and should not lead to any ambiguity.
They should be exhaustive and mutually exclusive so that any value of variable corresponds to
only class.
2. Number of classes: The choice about the number of classes in which a given frequency
distribution should be divided depends upon the following things;
(i) The total frequency which means the total number of observations in the distribution.
(ii) The nature of the data which means the size or magnitude of the values of the variable.
(iii) The desired accuracy.
(iv) The convenience regarding computation of the various descriptive measures of the frequency
distribution such as means, variance etc.
The number of classes should not be too small or too large. If the classes are few, the
classification becomes very broad and rough which might obscure some important features and
characteristics of the data. The accuracy of the results decreases as the number of classes
becomes smaller. On the other hand, too many classes will result in a few frequencies in each
class. This will give an irregular pattern of frequencies in different classes thus makes the
frequency distribution irregular. Moreover a large number of classes will render the distribution
too unwieldy to handle. The computational work for further processing of the data will become
quite tedious and time consuming without any proportionate gain in the accuracy of the results.
Hence a balance should be maintained between the loss of information in the first case and
irregularity of frequency distribution in the second case, to arrive at a suitable number of classes.
Normally, the number of classes should not be less than 5 and more than 20. Prof. Sturges has
given a formula :
k = 1+ 3.322 log n
where k refers to the number of classes and n refers to total frequencies or number of
observations. The value of k is rounded to the next higher integer :
If n = l00 k = 1 + 3.322 1og l00 = 1 + 6.644 = 8
If n =10,000 k = 1 + 3.22 log 10,000 = 1 + 13.288 = 14
However, this rule should be applied when the number of observations are not very small.
Further, the number or class intervals should be such that they give uniform and unimodal
distribution which means that the frequencies in the given classes increase and decrease steadily
and there are no sudden jumps. The number of classes should be an integer preferably 5 or
multiples of 5, 10, 15, 20, 25 etc. which are convenient for numerical computations.
3. Size of Class Intervals : Because the size of the class interval is inversely proportional to the
number of classes in a given distribution, the choice about the size of the class interval will
depend upon the sound subjective judgment of the statistician. An approximate value of the
magnitude of the class interval say i can be calculated with the help of Sturge's Rule :
where i stands for class magnitude or interval, Range refers to the difference between the largest
and smallest value of the distribution, and n refers to total number of observations.
If we are given the following information; n = 400, Largest item = 1300 and Smallest item =
340.then,

Another rule to determine the size of class interval is that the length of the class interval should
not be greater than 1/4th of the estimated population standard deviation. If 6 is the estimate of
population standard deviation then the length of class interval is given by: i £ 6/4.
The size of class intervals should be taken as 5 or multiples of 5, 10, 15 or 20 for easy
computations of various statistical measures of the frequency distribution, class intervals should
be so fixed that each class has a convenient mid-point around which all the observations in that
class cluster. It means that the entire frequency of the class is concentrated at the mid value of the
class. It is always desirable to take the class intervals of equal or uniform magnitude throughout
the frequency distribution.
4. Class Boundaries: If in a grouped frequency distribution there are gaps between the upper
limit of any class and lower limit of the succeeding class (as in case of inclusive type of
classification), there is a need to convert the data into a continuous distribution by applying a
correction factor for continuity for determining new classes of exclusive type. The lower and
upper class limits of new exclusive type classes are called class boundaries.
If d is the gap between the upper limit of any class and lower limit of succeeding class, the class
boundaries for any class are given by:
d/2 is called the correction factor.
Let us consider the following example to understand :
Marks Class Boundaries
20—24 (20—0.5, 24 + 0.5) i.e., 19.5—24.5
25—29 (25—0.5, 29 + 0.5) i.e., 24.5—29.5
30—34 (30—0.5, 34 + 0.5) i.e., 29.5—34.5
35—39 (35—0.5, 39 + 0.5) i.e., 34.5—39.5
40—44 (40—0.5, 44 + 0.5) i.e., 39.5—44.5
5. Mid-value or Class Mark: The mid value or class mark is the value of a variable which is
exactly at the middle of the class. The mid-value of any class is obtained by dividing the sum of
the upper and lower class limits by 2.
Mid value of a class = [Lower class limit + Upper class limit]

The class limits should be selected in such a manner that the observations in any class are evenly
distributed throughout the class interval so that the actual average of the observations in any
class is very close to the mid-value of the class.
6. Open End Classes : The classification is termed as open end classification if the lower limit of
the first class or the upper limit of the last class or both are not specified and such classes in
which one of the limits is missing are called open end classes. For example, the classes like the
marks less than 20 or age above 60 years. As far as possible open end classes should be avoided
because in such classes the mid-value cannot be accurately obtained. But if the open end classes
are inevitable then it is customary to estimate the class mark or mid-value for the first class with
reference to the succeeding class. In other words, we assume that the magnitude of the first class
is same as that of the second class.
Example: Construct a frequency distribution from the following data by inclusive method taking
4 as the class interval:
10 17 15 22 11 16 19 24 29 18
25 26 32 14 17 20 23 27 30 12
15 18 24 36 18 15 21 28 33 38
34 13 10 16 20 22 29 19 23 31
Solution: Because the minimum value of the variable is 10 which is a very convenient figure for
taking the lower limit of the first class and the magnitude of the class interval is given to be 4,
the classes for preparing frequency distribution by the Inclusive method will be 10—13, 14—17,
18—21, 22—25, ..................... 38—41.
Frequency Distribution
Class Interval Tally Bars Frequency (f)
10—13 5
14—17 8
18—21 8
22—25 7
26—29 5
30—33 4
34—37 2
38—41 1
Example: Prepare a statistical table from the following :
Weekly wages (Rs.) of 100 workers of Factory A
88 23 27 28 86 96 94 93 86 99
82 24 24 55 88 99 55 86 82 36
96 39 26 54 87 100 56 84 83 46
102 48 27 26 29 100 59 83 84 48
104 46 30 29 40 101 60 89 46 49
106 33 36 30 40 103 70 90 49 50
104 36 37 40 40 106 72 94 50 60
24 39 49 46 66 107 76 96 46 67
26 78 50 44 43 46 79 99 36 68
29 67 56 99 93 48 80 102 32 51
Solution: The lowest value is 23 and the highest 106. The difference between the lowest and
highest value is 83. If we take a class interval of 10, nine classes would be made. The first class
should be taken as 20—30 instead of 23—33 as per the guidelines of classification.
Frequency Distribution of the Wages of 100 Workers
Wages (Rs.) Tally Bars Frequency (f)
20—30 13
30—40 11
40—50 18
50—60 10
60—70 6
70—80 5
80—90 14
90—100 12
100—110 11
Total 100
2.3 Graphs of Frequency Distributions
The guiding principles for the graphic representation of the frequency distributions are same as
for the diagrammatic and graphic representation of other types of data. The information
contained in a frequency distribution can be shown in graphs which reveals the important
characteristics and relationships that are not easily discernible on a simple examination of the
frequency tables. The most commonly used graphs for charting a frequency distribution are :
1. Histogram
2. Frequency polygon
3. Smoothed frequency curves
4. Ogives or cumulative frequency curves.
2.3.1. Histogram
The term `histogram' must not be confused with the term `historigram' which relates to time
charts. Histogram is the best way of presenting graphically a simple frequency distribution. The
statistical meaning of histogram is that it is a graph that represents the class frequencies in a
frequency distribution by vertical adjacent rectangles.
While constructing histogram the variable is always taken on the X-axis and the corresponding
frequencies on the Y-axis. Each class is then represented by a distance on the scale that is
proportional to its class-interval. The distance for each rectangle on the X-axis shall remain the
same in case the class-intervals are uniform throughout; if they are different the width of the
rectangles shall also change proportionately. TheY-axis represents the frequencies of each class
which constitute the height of its rectangle. We get a series of rectangles each having a class
interval distance as its width and the frequency distance as its height. The area of the histogram
represents the total frequency.
The histogram should be clearly distinguished from a bar diagram. A bar diagram is one-
dimensional where the length of the bar is important and not the width, a histogram is two-
dimensional, where both the length and the width are important. However, a histogram can be
misleading if the distribution has unequal class intervals and suitable adjustments in frequencies
are not made.
The technique of constructing histogram is explained for :
(i) distributions having equal class-intervals, and
(ii) distributions having unequal class-intervals.
When class-intervals are equal, take frequency on the Y-axis, the variable on the X-axis and
construct rectangles. In such a case the heights of the rectangles will be proportional to the
frequencies.

Histograms
It is often useful to look at the distribution of the data, or the frequency with which certain values
fall between pre-set bins of specified sizes. The selection of these bins is up to you, but
remember that they should be selected in order to illuminate your data, not obfuscate it.
A histogram is similar to a bar chart. However histograms are used for continuous (as opposed to
discrete or qualitative) data. The defining property of a histogram is:
The area of each bar is proportional to the frequency.
If each bin has an equal width, then this can be easily done by plotting frequency on the
vertical axis. However histograms can also be drawn with unequal bin sizes, for which one
can plot frequency density.
To produce a histogram with equal bin sizes:

 Select a minimum, a maximum, and a bin size. All three of these are up to you. In the
histogram data used above the minimum is 1, the maximum is 110, and the bin size is 10.
 Calculate your bins and how many values fall into each of them. For the histogram data
the bins are:
 1 ≤ x < 10, 16 values.
 10 ≤ x < 20, 4 values.
 20 ≤ x < 30, 4 values.
 30 ≤ x < 40, 2 values.
 40 ≤ x < 50, 2 values.
 50 ≤ x < 60, 1 values.
 60 ≤ x < 70, 0 values.
 70 ≤ x < 80, 0 values.
 80 ≤ x < 90, 0 values.
 90 ≤ x < 100, 0 value.
 100 ≤ x < 110, 0 value.
 110 ≤ x < 120, 1 value.
 Plot the counts you figured out above. Do this using a standard bar plot.

Worked Problem
Let's say you are an avid roleplayer who loves to play Mechwarrior, a d6 (6 sided die) based
game. You have just purchased a new 6 sided die and would like to see whether it is biased
(in combination with you when you roll it).
What We Expect
So before we look at what we get from rolling the die, let's look at what we would expect.
First, if a die is unbiased it means that the odds of rolling a six are exactly the same as the
odds of rolling a 1--there wouldn't be any favoritism towards certain values. Using the
standard equation for the arithmetic mean find that μ = 3.5. We would also expect the
histogram to be roughly even all of the way across--though it will almost never be perfect
simply because we are dealing with an element of random chance.
What We Get
Here are the numbers that you collect:
15641355641566451436

13642416422434116355

43534225654353315445

12516543242133346113

66146665315634555244

Analysis
Referring back to what we would expect for an unbiased die, this is pretty close to what
we would expect. So let's create a histogram to see if there is any significant difference in
the distribution.
The only logical way to divide up dice rolls into bins is by what's showing on the die
face:
1 23 4 5 6
16 9 17 21 20 17

If we are good at visualizing information, we can simple use a table, such as in the one
above, to see what might be happening. Often, however, it is useful to have a visual
representation. As the amount of variety of data we want to display increases, the need
for graphs instead of a simple table increases.

Looking at the above figure, we clearly see that sides 1, 3, and 6 are almost exactly what
we would expect by chance. Sides 4 and 5 are slightly greater, but not too much so, and
side 2 is a lot less. This could be the result of chance, or it could represent an actual
anomaly in the data and it is something to take note of keep in mind. We'll address this
issue again in later chapters.

Frequency Density
Another way of drawing a histogram is to work out the Frequency Density.
Frequency Density
The Frequency Density is the frequency divided by the class width.

Q1 Draw a histogram from the following data :

Classes Frequency
0—10 5
10—20 11
20—30 19
30—40 21
40—50 16
50—60 10
60—70 8
70—80 6
80—90 3
90—100 1
When class-intervals are unequal the frequencies must be adjusted before constructing a
histogram. We take that class which has the lowest class-interval and adjust the frequencies of
other classes accordingly. If one class interval is twice as wide as the one having the lowest
class-interval we divide the height of its rectangle by two, if it is three times more we divide it by
three etc., the heights will be proportional to the ratios of the frequencies to the width of the
classes.
Q2 Represent the following data on a histogram.
Average monthly income of 1035 employees in a construction industry is given below:
Monthly Income (Rs.) No. of Workers
600—700 25
700—800 l00
800—900 150
900—1000 200
1000—1200 240
1200—1400 160
1400—1500 50
1500—1800 90
1800 or more 20
When mid point are given, we ascertain the upper and lower limits of each class and then
construct the histogram in the same manner.
Q3. togram of the following distribution :
Life of Electric Lamps Frequency
(hours) Firm A FirmB
1010 10 287
1030 130 105
1050 482 26
1070 360 230
1090 18 352
Solution: Since we are given the mid points, we should ascertain the class limits. To calculate
the class limits of various classes, take difference of two consecutive mid-points and divide the
difference by 2, then add and subtract the value obtained from each mid-point to calculate lower
and higher class-limits.
Life of Electric Lamps Frequency
(hours) Firm A FirmB
1000—1020 10 287
1020—1040 130 105
1040—1060 482 76
1060—1080 360 230
1080—1100 18 352

2.3.2. Frequency Polygon

This is a graph of frequency distribution which has more than four sides. It is particularly
effective in comparing two or more frequency distributions. There are two ways of constructing a
frequency polygon.
(i) We may draw a histogram of the given data and then join by straight line the mid-points of the
upper horizontal side of each rectangle with the adjacent ones. The figure so formed shall be
frequency polygon. Both the ends of the polygon should be extended to the base line in order to
make the area under frequency polygons equal to the area under Histogram.
(ii) Another method of constructing frequency polygon is to take the mid-points of the various
class-intervals and then plot the frequency corresponding to each point and join all these points
by straight lines. The figure obtained by both the methods would be identical.

Frequency Polygon
This is a histogram with an overlaid frequency polygon.
Midpoints of the interval of corresponding rectangle in a histogram are joined together by
straight lines. It gives a polygon i.e. a figure with many angles.
It is used when two or more sets of data are to be illustrated on the same diagram such as death
rates in smokers and non smokers, birth and death rates of a population etc.
One way to form a frequency polygon is to connect the midpoints at the top of the bars of a
histogram with line segments (or a smooth curve). Of course the midpoints themselves could
easily be plotted without the histogram and be joined by line segments. Sometimes it is
beneficial to show the histogram and frequency polygon together.But sometimes, the frequency
polygon is much more accurate than the histogram because you can evaluate which is the low
point and the high point.
Unlike histograms, frequency polygons can be superimposed so as to compare several frequency
distributions.
Frequency polygons were created in the 9th century as a way of not only storing data, but
making it easily accessible for people who are illiterate
Frequency polygon has an advantage over the histogram. The frequency polygons of several
distributions can be drawn on the same axis, which makes comparisons possible whereas
histogram cannot be used in the same way. To compare histograms we need to draw them on
separate graphs.
2.3.3. Smoothed Frequency Curve
A smoothed frequency curve can be drawn through the various points of the polygon. The curve
is drawn by free hand in such a manner that the area included under the curve is approximately
the same as that of the polygon. The object of drawing a smoothed curve is to eliminate all
accidental variations which exists in the original data, while smoothening, the top of the curve
would overtop the highest point of polygon particularly when the magnitude of the class interval
is large. The curve should look as regular as possible and all sudden turns should be avoided. The
extent of smoothening would depend upon the nature of the data. For drawing smoothed
frequency curve it is necessary to first draw the polygon and then smoothen it. We must keep in
mind the following points to smoothen a frequency graph:
(i) Only frequency distribution based on samples should be smoothened.
(ii) Only continuous series should be smoothened.
(iii) The total area under the curve should be equal to the area under the histogram or polygon.
2.3.4. Cumulative Frequency Curves or Ogives
We have discussed the charting of simple distributions where each frequency refers to the
measurement of the class-interval against which it is placed. Sometimes it becomes necessary to
know the number of items whose values are greater or less than a certain amount. We may, for
example, be interested in knowing the number of students whose weight is less than 65 lbs. or
more than say 15.5 lbs. To get this information, it is necessary to change the form of frequency
distribution from a simple to a cumulative distribution. In a cumulative frequency distribution,
the frequency of each class is made to include the frequencies of all the lower or all the upper
classes depending upon the manner in which cumulation is done. The graph of such a
distribution is called a cumulative frequency curve or an Ogive. There are two method of
constructing ogives, namely:
(i) less than method, and
(ii) more than method.
In less than method, we start with the upper limit of each class and go on adding the frequencies.
When these frequencies are plotted we get a rising curve.
In more than method, we start with the lower limit of each class and we subtract the frequency of
each class from total frequencies. When these frequencies are plotted, we get a declining curve.
This example would illustrate both types of ogives.
Q4: Draw ogives by both the methods from the following data.
Distribution of weights of the students of a college (lbs.)
Weights No. of Students
90.5—100.5 5
100.5—110.5 34
110.5—120.5 139
120.5—130.5 300
130.5—140.5 367
140.5—150.5 319
150.5—160.5 205
160.5—170.5 76
170.5—180.5 43
180.5—190.5 16
190.5—200.5 3
200.5—210.5 4
210.5—220.5 3
220.5—230.5 1
Less than (Weights) Cumulative Frequency
100.5 5
110.5 39
120.5 178
130.5 478
140.5 845
150.5 1164
160.5 1369
170.5 1445
180.5 1488
190.5 1504
200.5 1507
210.5 1511
220.5 1514
230.5 1515
Plot these frequencies and weights on a graph paper. The curve formed is called an Ogive.
Although the graphs are a powerful and effective method of presenting statistical data, they are
not under all circumstances and for all purposes complete substitutes for tabular and other forms
of presentation. The specialist in this field is one who recognizes not only the advantages but also
the limitations of these techniques. He knows when to use and when not to use these methods
and from his experience and expertise is able to select the most appropriate method for every
purpose.
Q5. The following distribution is with regard to weight in grams of mangoes of a given variety.
If mangoes of weight less than 443 grams be considered unsuitable for foreign market, what is
the percentage of total mangoes suitable for it? Assume the given frequency distribution to be
typical of the variety:
Weight in gms. No. of mangoes Weight in gms. No. of mangoes
410—419 10 450—459 45
420—429 20 460—469 18
430—439 42 470—479 7
440—449 54
Draw an ogive of `more than' type of the above data and deduce how many mangoes will be
more than 443 grams.

2.3.5 Pie Charts

A pie chart showing the racial make-up of the US in 2000.

Pie chart of populations of English language-speaking people

A Pie-Chart/Diagram is a graphical device - a circular shape broken into sub-divisions. The sub-
divisions are called "sectors", whose areas are proportional to the various parts into which the
whole quantity is divided. The sectors may be coloured differently to show the relationship of
parts to the whole. A pie diagram is an alternative of the sub-divided bar diagram.
To construct a pie-chart, first we draw a circle of any suitable radius then the whole quantity
which is to be divided is equated to 360 degrees. The different parts of the circle in terms of
angles are calculated by the following formula.

Component Value / Whole Quantity * 360

The component parts i.e. sectors have been cut beginning from top in clockwise order.
Note that the percentages in a list may not add up to exactly 100% due to rounding. For example
if a person spends a third of their time on each of three activities: 33%, 33% and 33% sums to
99%.
Warning: Pie charts are a poor way of communicating information. The eye is good at judging
linear measures and bad at judging relative areas. A bar chart or dot chart is a preferable way of
displaying this type of data.
Cleveland (1985), page 264: "Data that can be shown by pie charts always can be shown by a dot
chart. This means that judgments of position along a common scale can be made instead of the
less accurate angle judgments." This statement is based on the empirical investigations of
Cleveland and McGill as well as investigations by perceptual psychologists.
Three-dimensional (3d) pie charts compound perceptual misinterpretation of statistical
information by altering the relative angle of pie slices to create the impression of depth into
a vanishing point. Angles and areas at the bottom of the chart must be exaggerated and the angles
and areas at the top of the chart reduced in order to create the dimensional effect; a specifically
false depiction of the data.

2.3.6 Comparative Pie Charts[

A pie chart showing preference of colors by two groups.

The comparative pie charts are very difficult to read and compare if the ratio of the pie chart is not
given.
Examine our example of color preference for two different groups. How much work does it take to
see that it is quite challenging to work out who ate the pie? First, we have to find Fingerprints on
either pie, and then remember how many sensirivity vectors it has. If we did not include the share for
blue in the label, then we would probably be approximating the comparison. So, if we use multiple
pie charts, we have to expect that comparisions between charts would only be approximate.
What is the most popular color in the left graph? Red. But note, that you have to look at all of the
colors and read the label to see which it might be. Also, this author was kind when creating these
two graphs because he used the same color for the same object. Imagine the confusion if one had
made the most important color get Red in the right-hand chart?
If two shares of data should not be compared via the comparative pie chart, what kind of graph
would be preferred? The stacked bar chart is probably the most appropriate for sharing of the total
comparisons. Again, exact comparisons cannot be done with graphs and therefore a table may
supplement the graph with detailed information.
2.4 QUESTIONS:

1) Explain a) Plot b) Chart c) Orgive d) histgram with examples

2. What is dot plot, discuss in detail.
3. Explain the advantages and stem an leaf display
4. In which situation we use Pie chart, simple Bar chart and multiple bar charts?
5. What is Frequency? What are the steps for making frequency distribution?
6. For the data given below, we can make class boundaries easily.
Data:
Class Limits Class Boundaries
3.5 to 4.4 ---
3.45 to 4.45 ---
4.5 to 5.4 ---
4.45 to 5.45 ---
5.5 to 6.4 ---
5.45 to 6.45 ---
7.Find the difference between 4.4 (upper class limit of first class) and 4.5 (lower class limit of second
class), i.e. 4.5-4.4=0.1 Now divide the difference by 2 i.e. 0.1/2=0.05 Subtract this resulting value of 0.05
from 3.5 (lower class limit of first class); we will get 3.45. Add this resulting value of 0.05 in 4.4 (upper
class limit of first class); we will get 4.45. For all classes continue this subtraction in lower class limit and
addition in upper class limit of each class.
8. What is the difference between cumulative frequency distribution and Cumulative Frequency
Polygon? What are their uses?

Process Analysis by Statistical Methods D. Himmelblau
100% (4)
Process Analysis by Statistical Methods D. Himmelblau
474 pages
SM 1
No ratings yet
SM 1
125 pages
Frequency Distribution and Charts and Graphs
No ratings yet
Frequency Distribution and Charts and Graphs
61 pages
Frequency Distribution Lecture 2 3
No ratings yet
Frequency Distribution Lecture 2 3
11 pages
Statistics in Education - Made Simple
100% (1)
Statistics in Education - Made Simple
26 pages
09042020212858practical Statistical Methods 2019-20
No ratings yet
09042020212858practical Statistical Methods 2019-20
91 pages
Statistics-1 - LESSON 2 CONSTRUCTION OF FREQUENCY DISTRIBUTION AND GRAPHICA155953 PDF
No ratings yet
Statistics-1 - LESSON 2 CONSTRUCTION OF FREQUENCY DISTRIBUTION AND GRAPHICA155953 PDF
16 pages
Unit 4 Frequency Distribution and Graphical Presentation: Structure
No ratings yet
Unit 4 Frequency Distribution and Graphical Presentation: Structure
21 pages
Statistics in Psychology IGNOU Unit 4
No ratings yet
Statistics in Psychology IGNOU Unit 4
21 pages
EPISODE 2
No ratings yet
EPISODE 2
11 pages
Frequency
100% (1)
Frequency
36 pages
Frequency Distribution: A Frequency Distribution Is Constructed For Three Main Reasons
No ratings yet
Frequency Distribution: A Frequency Distribution Is Constructed For Three Main Reasons
15 pages
ThinkPad P1 Platform Specifications
No ratings yet
ThinkPad P1 Platform Specifications
204 pages
Graphs
No ratings yet
Graphs
20 pages
Unit 2 Statistics Analytics
No ratings yet
Unit 2 Statistics Analytics
10 pages
Chapter1 Statistics
No ratings yet
Chapter1 Statistics
12 pages
Lec 01 - Frequency Distribution_Stat_1
No ratings yet
Lec 01 - Frequency Distribution_Stat_1
4 pages
STA 111 - Topic One - Lecture 2
No ratings yet
STA 111 - Topic One - Lecture 2
20 pages
18bge14a U2
No ratings yet
18bge14a U2
27 pages
Chapter 2
No ratings yet
Chapter 2
46 pages
DATA PRESENTATION
No ratings yet
DATA PRESENTATION
19 pages
Statistics and Probability
100% (7)
Statistics and Probability
141 pages
Frequency Distribution Stat
No ratings yet
Frequency Distribution Stat
2 pages
Graphs-and-Tables-for-BBA-Class-note
No ratings yet
Graphs-and-Tables-for-BBA-Class-note
4 pages
Chapter 1 INTRODUCTION TO DATA
No ratings yet
Chapter 1 INTRODUCTION TO DATA
9 pages
Classification
No ratings yet
Classification
7 pages
Variables and Attributes
No ratings yet
Variables and Attributes
4 pages
Collecting Presenting
No ratings yet
Collecting Presenting
18 pages
Paper-4 (Business Statistics)
100% (1)
Paper-4 (Business Statistics)
246 pages
Statistics 2025
No ratings yet
Statistics 2025
160 pages
organisation_of_datayeeeeeeeeeeeeeeeeeeeeee
No ratings yet
organisation_of_datayeeeeeeeeeeeeeeeeeeeeee
61 pages
Lesson 2 Frequency Distributions
No ratings yet
Lesson 2 Frequency Distributions
8 pages
3. ORGANISATION OF DATA 4
No ratings yet
3. ORGANISATION OF DATA 4
48 pages
AE-134-Handouts-Part-3
No ratings yet
AE-134-Handouts-Part-3
8 pages
Technical Terms Used in Formulation Frequency Distribution
100% (1)
Technical Terms Used in Formulation Frequency Distribution
22 pages
Wordpress Documentation
No ratings yet
Wordpress Documentation
24 pages
UNIT 2 Presentation of Data
100% (1)
UNIT 2 Presentation of Data
28 pages
Chapter 2 - Organizing Data
No ratings yet
Chapter 2 - Organizing Data
33 pages
Classification of Data PDF
No ratings yet
Classification of Data PDF
23 pages
Unit 7 Lecture Note
No ratings yet
Unit 7 Lecture Note
25 pages
Frequency Distribution and Graphical Representation of Data
No ratings yet
Frequency Distribution and Graphical Representation of Data
20 pages
work(s)(4)
No ratings yet
work(s)(4)
4 pages
18BST5EL-U2
No ratings yet
18BST5EL-U2
21 pages
Statistics: Class Mark Cumulative Frequency Histogram Frequency Polygon Mean Median Mode
No ratings yet
Statistics: Class Mark Cumulative Frequency Histogram Frequency Polygon Mean Median Mode
27 pages
Statistics, mg4
No ratings yet
Statistics, mg4
58 pages
Tabulation
No ratings yet
Tabulation
10 pages
Statistics (Kind of Statistics, Classification)
No ratings yet
Statistics (Kind of Statistics, Classification)
2 pages
Lesson 3. Frequency Distribution
No ratings yet
Lesson 3. Frequency Distribution
6 pages
Data Presentation
No ratings yet
Data Presentation
16 pages
Lecture_04_frequency & Frequency Distribution
No ratings yet
Lecture_04_frequency & Frequency Distribution
23 pages
STA112 Week 2 Class Note
No ratings yet
STA112 Week 2 Class Note
102 pages
Tables, Charts & Graphs
No ratings yet
Tables, Charts & Graphs
13 pages
10th Class Maths Notes 2024 Ch 6
No ratings yet
10th Class Maths Notes 2024 Ch 6
33 pages
STUDY94@817302
No ratings yet
STUDY94@817302
18 pages
Chapter 1 QM (PC)
No ratings yet
Chapter 1 QM (PC)
17 pages
Statistics I Essentials
From Everand
Statistics I Essentials
Emil G. Milewski
No ratings yet
Multi-dimensional Monte Carlo Integrations Utilizing Mathematica
From Everand
Multi-dimensional Monte Carlo Integrations Utilizing Mathematica
SUJAUL CHOWDHURY
No ratings yet
De-Mystifying Math and Stats for Machine Learning: Mastering the Fundamentals of Mathematics and Statistics for Machine Learning
From Everand
De-Mystifying Math and Stats for Machine Learning: Mastering the Fundamentals of Mathematics and Statistics for Machine Learning
Seaport AI Madhavan
No ratings yet
Business Statistics I Essentials
From Everand
Business Statistics I Essentials
Louise Clark
5/5 (5)
K Nearest Neighbor Algorithm: Fundamentals and Applications
From Everand
K Nearest Neighbor Algorithm: Fundamentals and Applications
Fouad Sabry
No ratings yet
Statistical Foundations for Psychology
From Everand
Statistical Foundations for Psychology
James C. Ware
No ratings yet
B.B.A. DEGREE EXAMINATION December 2014
No ratings yet
B.B.A. DEGREE EXAMINATION December 2014
3 pages
Unit-5 Core - Prof Adwait Oak
No ratings yet
Unit-5 Core - Prof Adwait Oak
6 pages
M.B.A. Degree Examination, 2013: 273. Production Planning, Control and Maintenance
No ratings yet
M.B.A. Degree Examination, 2013: 273. Production Planning, Control and Maintenance
3 pages
Principles of Management: K.Balasri Prasad
No ratings yet
Principles of Management: K.Balasri Prasad
43 pages
A Few More Past Exam Questions (Multiple Choice)
No ratings yet
A Few More Past Exam Questions (Multiple Choice)
6 pages
Table 1-1 Comparing Manufacturing and Service Operations: Just in Time & Kanban, Lean
No ratings yet
Table 1-1 Comparing Manufacturing and Service Operations: Just in Time & Kanban, Lean
5 pages
Reviewing Balance Scorecard BSC Applicat PDF
No ratings yet
Reviewing Balance Scorecard BSC Applicat PDF
4 pages
Six Sigma Green Belt Training For Emerson Industries. Wha The DMAIC Method of Six Sigma How Is Six Sigma Different?
No ratings yet
Six Sigma Green Belt Training For Emerson Industries. Wha The DMAIC Method of Six Sigma How Is Six Sigma Different?
1 page
Unit 1 Introduction To Managerial Economics: Objectives
No ratings yet
Unit 1 Introduction To Managerial Economics: Objectives
13 pages
Statistics Dispersion
No ratings yet
Statistics Dispersion
34 pages
02 Internationaltradepolicy Chap2 - 150218235618
No ratings yet
02 Internationaltradepolicy Chap2 - 150218235618
33 pages
Basic Concept of Macro Economics: Dr.S.Balasubramaniam
No ratings yet
Basic Concept of Macro Economics: Dr.S.Balasubramaniam
31 pages
What Is Business Analysis? Lesson Overview
No ratings yet
What Is Business Analysis? Lesson Overview
15 pages
MBA Linear Programming
No ratings yet
MBA Linear Programming
61 pages
01 Intro International Economics
No ratings yet
01 Intro International Economics
17 pages
MBA Linear Programming
No ratings yet
MBA Linear Programming
61 pages
Competition Law in Digital Markets in TH
No ratings yet
Competition Law in Digital Markets in TH
19 pages
Sampling Distributions:: N X X X X
No ratings yet
Sampling Distributions:: N X X X X
3 pages
L09 Measurement Uncertainty in Microbiological Examinations of Foods Technique For Determination of Pathogens - Hilde Skår Norli
No ratings yet
L09 Measurement Uncertainty in Microbiological Examinations of Foods Technique For Determination of Pathogens - Hilde Skår Norli
20 pages
15 QMB11e
100% (1)
15 QMB11e
11 pages
Statistics and Probability2021 - Quarter 3 2
No ratings yet
Statistics and Probability2021 - Quarter 3 2
38 pages
Full Practice Questions Sample
No ratings yet
Full Practice Questions Sample
13 pages
Lecture Notes
100% (1)
Lecture Notes
324 pages
Introduction To The Scenario Approach Marco C Campi Simone Garatti download
No ratings yet
Introduction To The Scenario Approach Marco C Campi Simone Garatti download
43 pages
Er. Santosh Kumar Baral Er. Suraj Basant Tulachan: Communications and Knowledge Engineering
No ratings yet
Er. Santosh Kumar Baral Er. Suraj Basant Tulachan: Communications and Knowledge Engineering
44 pages
Determination of Variable Sampling Plans For Non-Normal Process Through Skewness and Kurtosis
No ratings yet
Determination of Variable Sampling Plans For Non-Normal Process Through Skewness and Kurtosis
6 pages
1 Descriptive Statistics
No ratings yet
1 Descriptive Statistics
20 pages
Operating Characteristic (OC) Curves
No ratings yet
Operating Characteristic (OC) Curves
18 pages
06 The Continuous Uniform Distribution
No ratings yet
06 The Continuous Uniform Distribution
10 pages
Poisson Distribution
0% (1)
Poisson Distribution
30 pages
Random Variables
No ratings yet
Random Variables
15 pages
STATISTIC
No ratings yet
STATISTIC
100 pages
Work Sampling
100% (1)
Work Sampling
25 pages
Chapter 6 Prob Stat (Random Variables)
No ratings yet
Chapter 6 Prob Stat (Random Variables)
49 pages
Hypergeometric Distribution
No ratings yet
Hypergeometric Distribution
9 pages
Topic19 8p6 Galvin
No ratings yet
Topic19 8p6 Galvin
29 pages
Bbs14e PPT ch07
No ratings yet
Bbs14e PPT ch07
31 pages
Introductory Statistics
No ratings yet
Introductory Statistics
742 pages
Stats 1 Chapter 06 Statistical Distributions Booklet
No ratings yet
Stats 1 Chapter 06 Statistical Distributions Booklet
12 pages
Business Statistics May Module
No ratings yet
Business Statistics May Module
72 pages
(Ebook) Concise encyclopedia of biostatistics for medical professionals by Abhaya Indrayan, Martin Patrick Holt ISBN 9781482243871, 9781482243888, 1482243873, 1482243881 download
100% (2)
(Ebook) Concise encyclopedia of biostatistics for medical professionals by Abhaya Indrayan, Martin Patrick Holt ISBN 9781482243871, 9781482243888, 1482243873, 1482243881 download
54 pages
SM025 CHAPTER 9 2018
No ratings yet
SM025 CHAPTER 9 2018
35 pages
Summer 578 Assignment 2 Solutions
100% (1)
Summer 578 Assignment 2 Solutions
13 pages
TG4 Acc115
No ratings yet
TG4 Acc115
7 pages
MCE321 Lecture Note 4
No ratings yet
MCE321 Lecture Note 4
43 pages
Cfa L1 Full Mock Solutions 17.11.2018 - Am Session - Correction
No ratings yet
Cfa L1 Full Mock Solutions 17.11.2018 - Am Session - Correction
32 pages