Due To Rounding, Some Totals May Not Correspond With The Sum of The Separate Figures
Due To Rounding, Some Totals May Not Correspond With The Sum of The Separate Figures
When performing statistical data analyses, quality professionals are always challenged
to maintain data integrity. When should you round up the answer; when should you
round down? How many significant figures are appropriate for the data set that has
been taken?
Below are a set of simple rules that should help you traverse the perils of statistical data
analysis.
General recommendations
Due to rounding, some totals may not correspond with the sum of the separate figures.
Specific recommendations
The specific recommendations are based on three increasing levels of approximation:
while detailed figures from data sources (e.g. Eurobase) should not be changed when
preparing graphs and maps (level 0), they should be partially rounded for compiling
tables (level 1) and rounded even further when writing texts (level 2).
Summary
Level 0: Graphs and maps should be built using unrounded figures from the
original dataset(s).
Examples: 12.34 % and 56.789 %, 1 234 and 56 789 persons
Level 1: For tables with percentages, the general rule is to round to one decimal.
For tables with absolute numbers, identify the smallest number, decide how
many digits to keep for this number, and then round all other entries to those
digits.
Examples: 12.3 % and 56.8 %, 1 200 and 56 800 persons
Level 2: In text two significant (non-zero) digits are in general sufficient.
Examples: 12 % and 57 %, 1 200 and 57 000 persons
Details
The assessment whether a target has been met should be done on the bases of
unrounded figures (and properly reflecting the accuracy of the data).
Level 1: Numbers and percentages shown in tables should be the result of a first
level of rounding (this recommendation concerns the text tables, not the detailed
tables that appear in annexes).
When compiling a table with absolute numbers, identify the “shortest” number (in
terms of number of digits) and decide which significant digits to keep (two are in general
sufficient), then round all other entries to those digits. In this way the rounded total will
be consistent with the sum of the rounded addends (except for small rounding
differences).
If several indicators are presented in the same table, try to keep the same number of
significant digits at one side for all absolute figures, and at the other side for all relative
quantities.
Level 2: Numbers and percentages shown in text should present a further level
of rounding compared to tables.
In general it is not necessary to report detailed numbers and percentages in texts when
presenting analysis (about tables, maps and/or graphs). If the reader could benefit from
such detailed numbers and percentages, consider the inclusion of an additional table in
the publication.
When precision is not strictly needed, envisage grouping countries (or other statistical
units) and describe their characteristics with wording such as “at least”, ”about”, “less
than”, etc. In order to do so, do the regrouping by using unrounded numbers and then
do the rounding. It may be necessary to reconsider the text after the rounding as close
numbers may end up in the same rounded number.
When analysing absolute numbers, keeping the same number of significant digits for
countries of different size may eventually result in keeping different significant
positions. These inconsistencies are not of concern in texts as their primary function is
the communication of concepts.
When reporting totals, do not add up rounded addends, but add unrounded numbers or
percentages and round the total.
As a consequence of the general rule, the advice for relative quantities such as
percentages and proportions is to report in text one decimal for percentages below 10 %,
no decimals for figures above 20 %, and for percentages between 10 and 20 % the choice
between one or no decimal depends on the precision of the indicator.
Tables
Consider the following population data extracted from Eurobase:
Luxembourg shows the smallest figure; keeping two significant digits implies rounding
to the “ten-thousands” position, i.e. to 340 000. By rounding all other numbers to this
position we obtain the table:
The unrounded total is 137 836 166 which, according to the rounding scheme applied in
the table, rounds to 137 840 000. This is exactly the sum of the rounded figures in the
table!
Even though the proposed rounding scheme reduces the possibility of inconsistencies
between the sum of rounded numbers and the rounded sum, small differences could still
occur. In this case, it is important to report the disclaimer described above (general
recommendations number 6).
Rounding Fractions
Rounding fractions works exactly the same way as rounding whole numbers. The
only difference is that instead of rounding to tens, hundreds, thousands, and so on, you
round to tenths, hundredths, thousandths, and so on.
7.8199 rounded to the nearest tenth is 7.8
1.0621 rounded to the nearest hundredth is 1.06
3.8792 rounded to the nearest thousandth is 3.879
Here's a tip: to avoid getting confused in rounding long decimals, look only at the
number in the place you are rounding to and the number that follows it. For
example, to round 5.3824791401 to the nearest hundredth, just look at the number in
the hundredths place—8—and the number that follows it—2. Then you can easily round
it to 5.38.
Variation Data Round-off
Rule: Round off the answer to one more significant figure than present in the original
data.
Rule: This rule is only valid for final results, not intermediate values.
Example: Cycle-time data for an application receipt to account opening process is as
follows (in days): 4, 4, 3, 5, 1, 5. The mean of these values is 3.66666666…, and should
be rounded to 3.7. Because the original data were whole numbers, we round the answer
to the nearest tenth.
Probability Data Round-off
Rule: Either provide the exact fraction or decimal of the probability or round-off the
final result to three significant digits.
Example: The probability of rolling a ‘4’ with a single die is 1/6 or 0.16666666…, which
would be rounded-off to 0.167. The probability of a coin landing on ‘tails’ is 1/2 or 0.5 –
because 0.5 is an exact figure, it is not necessary to express it as 0.500.
Sample Size Round-off
Rule: When the calculated sample size is not a whole number, it should be rounded up
to the next higher whole number.
Rule: Rounding up a sample size calculation for conservativeness ensures that your
sample size will always be representative of the population.
Example: A sample size calculation determined that 2006.083 data points were
necessary to represent the population. In this case, 2007 data points samples should be
taken.
When there are leading zeros (such as 0.006), don't count them because they are only
there to show how small the number is:
0.0165 rounded to 2 significant digits is 0.017
as the next digit (5) is 5 or more
The Rounding Rule for the Mean, Variance, and Standard Deviation for a
Probability Distribution
The rounding rule for the mean, variance, and standard deviation for variables of a
probability distribution is this: The mean, variance, and standard deviation should be
rounded to one more decimal than the outcome X. When fractions are used, they should
be reduced to lowest terms.