Untitled document 4
Untitled document 4
Key word(s):
central tendency
mean
median
mode
range
What is the central tendency? When you study data, one of the most common things to
look at is an average. The most common measures of central tendency are mean,
median, and mode. Another statistic that many are interested in when analyzing data is
the range.
Mean:
The average number - add all numbers, divide by the number of numbers
Median:
The middle number - find the number in the middle, if even amount find the number
between the middle two with .5 at the end
Mode:
The most common number - find the number that there is the most of
Range:
Largest number minus smallest number - Subtract the largest number from the smallest
number
Example:
Student 1 - 2 - 3 - 4 - 5 - 6 - 7 - 8 - 9 - 10 - 11 - 12
Height (cm) 160 - 165 - 164 - 162 - 161 - 162 - 160 - 159 - 170 - 162 - 160 - 161
159, 160, 160, 160, 161, 161, 162, 162, 162, 164, 165, 170,
Mean: 162.2
Median: 165.5
Range: 11
Key word(s):
frequency table
line plot
histogram
data
The first step to analyzing data is displaying the data in an organized way. Tables of
information are the first step to doing so. It provides a way to organize the information
so that someone else can understand it. Sometimes the data tables can be difficult to
understand when there is a lot of information. So frequency tables, line plots, and
histograms are a couple ways to make the data more manageable.
Frequency Tables
A frequency table is usually made from a tally table. A frequency table is much easier to
read and interpret.
Example:
Favorite Dinosaur
Name Tally
Stegosaurus II
A frequency table uses numbers to record data.
Favorite Dinosaur
Name Number
Tyrannosaurus Rex 4
Stegosaurus 2
Line Plots
This line plot tells you how many students spent certain numbers of hours studying in a
week:
Histograms
There are some differences between bar and histogram. Notice there are number
ranges along the bottom instead of categories.
Box-and-whiskers plots
Key word(s):
box-and-whiskers plot
box plot
Quartzite
Interquartile range
When we have a set of data, the measures of central tendency, such as the mean,
median and the mode
Mean
Mean is calculated by dividing the sum of the data points by the total number of data
points. The mean is also known as the average and gives us one value that represents
the entire set of data.
Mode
Mode is the value in the data set that occurs most often. You can have no mode, one
mode, or more than one mode in a data set. The mode tells us the data point that is
most frequent. Mode is often used when there is a voting situation.
Median
The median is the number that is in the middle of a set of data after the data is placed in
order. The median splits data into two equal parts and shows the middle of these two
parts.
Quartiles are the values that divide a list of numbers into quarters. To separate values
into quartiles, begin by putting the list of numbers in order. Then cut the list into four
equal parts.
Example: 6, 8, 4, 4, 7, 3, 5
Q1 = lower quartile
Q3 = upper quartile
Quartile 1 (Q1) = 4
Quartile 2 (Q2) = 5
Quartile 3 (Q3) = 7
Quartile 3 was 7 and Quartile 1 was 4; thus, the IQR is the difference between the two,
or 3.
A box-and-whisker graph looks like this. Note that the maximum, minimum, and quartile
values are all indicated on the graph. You can see why it is sometimes called a
box-and-whisker graph--because it looks like a box with whiskers.
Circle Graphs
Key word(s):
circle graph
pie chart
sector
Circle graphs are used to picture fractional parts of a whole and allow for quick
understanding of the distribution of data. They are also called pie charts.
The graph is divided into pieces called sectors. Each sector represents a percent and all
sectors must add up to equal 100%
Example:
You can also find out how many data points are in each sector if you do not know the
actual number but know the percent and the total number represented in the circle
graph.
Here are the steps needed to transform data into your own circle graph.
Step 1: Find the whole. What is the total value for the items on your graph?
Step 2: Find the parts. Each item to be graphed represents a part of the whole. To
complete the circle graph, you must find exactly what fraction or percent each item
represents. The easiest way to do this is to divide the part by the whole and then
convert the result to a percent
Step 3: Find the degrees for each part. Every circle is made up of 360 degrees. To find
the angle measure for each item, use this formula:
angle measure for an item = percent the item represents × 360 degrees.
Step 4: Draw a circle and a radius. Use a protractor to draw each angle. Each new
angle should be measured from the previously drawn line segment. Draw the angles
from largest to smallest in a clockwise direction.
Outliers
Key word(s):
outlier
Student Time
Zac 0:58
Ben 0:56
Kenny 1:05
Jared 1:03
Simon 1:34
Ron 1:04
Bobby 1:02
Kareem 0:59
Notice most of the boys' times range from 56 seconds to a minute and 4
seconds--except for Simon. Simon's time is 1 minute and 34 seconds. This is an
example of an outlier. It lies above the majority of the data points in the table.
Student number Height jumped
1 14
2 18
3 28
4 28
5 29
6 29
7 29
8 30
9 30
10 31
11 33
12 34
13 35
14 36
15 42
Did you notice after looking at the data that one student jumped much higher than the
rest of the students? There are many possible reasons for that.
*NOTE*
It is important to pay attention to these outliers because they can affect your measures
of central tendency calculations.
Examining Samples
Key words:
sample
random
generalization
population
To gather data that reflects characteristics of the population, you need a random
sample. When a random sample is being chosen, all members of a population have
equal chances of being selected.
Simple random sampling guarantees that the sample chosen represents the population
and is unbiased. The sample is chosen randomly and completely by chance and each
member of the population has the same probability of being chosen. This sampling
method ensures that the results are valid.
Systematic Sampling
In the systematic sampling every nth person is taken from a population. For example, if
you take a directory of people, maybe your sample will consist of every 19th person in
the directory.
Convenience Sampling
For example, suppose you are conducting a study on what kind of movies people like to
see. You start with creating a survey for people to complete that allows you to gather
data. Take a look at the following two questions:
Question:
Do you notice any difference in the two questions? Is one more biased than another?
Answer:
The first question shows no bias because a person is free to pick any of the types of
movies without being pointed in any direction. On the other hand, notice that the second
question calls romance movies "boring" while calling action and adventure movies
"exciting." This may lead a person to choose one type of movie over another. This is not
good when collecting data.
Key words:
Trends
data sets
A scatter plot is a graph of plotted points that compares and shows a relationship
between two sets of data.
For example, let's say we are comparing height and weight of the level of the glaciers
over some period of time. From the scatter plot you can see if there is a correlation or
relationship between the data.
Example:
Suppose you collected data about the relationship of students' height and shoe size by
looking at two different classrooms. Now you have data from two different rooms, and
you need to organize and analyze the data.
You could put the data on one scatter plot, assigning each room a different color and
then compare the data not only for your classroom, but compare the two classrooms.
You can also graph the data on separate scatter plots.