Introduction To Descriptive Statistics I: Sanju Rusara Seneviratne Mbpss
Introduction To Descriptive Statistics I: Sanju Rusara Seneviratne Mbpss
Descriptive Statistics I
Sanju Rusara Seneviratne MBPsS
Overview of Intro to Descriptive Statistics I
2. Measures of spread:
These are ways of summarizing a group of data by
describing how spread out the scores are.
Measures of spread help us to summarize how spread out data
are. To describe this spread, a number of statistics are available
us, including the range, quartiles, absolute deviation, variance
and standard deviation.
Summarizing Descriptive Statistics
• Mode - The mode is the number that occurs most often within a set
of numbers.
• Range - The range is the difference between the highest and lowest
values within a set of numbers. To calculate range, subtract the
smallest number from the largest number in the set.
Skewness and Kurtosis
• Skewness - a measure of symmetry, or more precisely,
the lack of symmetry. A distribution, or data set, is
symmetric if it looks the same to the left and right of the
center point.
• The four quarters that divide a data set into quartiles are:
The lowest 25% of numbers.
The next lowest 25% of numbers (up to the median).
The second highest 25% of numbers (above the median).
The highest 25% of numbers.
Percentiles
• The most common definition of a percentile is a number where a certain
percentage of scores fall below that number.
The 25th percentile is also called the first quartile.
The 50th percentile is generally the median (if you’re using the third definition—
see below).
The 75th percentile is also called the third quartile.
The difference between the third and first quartiles is the interquartile range.
• Percentile Rank:
The nth percentile is the lowest score that is greater than a certain
percentage (“n”) of the scores.
The nth percentile is the smallest score that is greater than or equal to a
certain percentage of the scores. To rephrase this, it’s the percentage of
data that falls at or below a certain observation.
Bar Charts
Pie Charts
Histograms
Box-Plots
Scatter Plots
Bar Charts
• Pie charts are useful for displaying data that are classified
into nominal or ordinal categories.
Nominal data are categorised according to descriptive or
qualitative information such as county of birth or type of
pet owned.
Ordinal data are similar but the different categories can
also be ranked, for example in a survey people may be
asked to say whether they classed something as very poor,
poor, fair, good, very good.
Pie Charts
Thinking Point:
What can you infer about the
normal distribution of this data
from this chart?
Box-Plots
• It can tell you about your outliers and what their values
are.
Articles:
• Bickel, P. J., & Lehmann, E. L. (1975). Descriptive Statistics
for Nonparametric Models I. Introduction. The Annals of
Statistics, 3(5), 1038-1044. doi:10.1214/aos/1176343239 |
https://link.springer.com/content/pdf/10.1007/978-1-
4614-1412-4_42.pdf