Unit 1 Intro and Theory I
INFORMATION
VISUALIZATION
BMI 6340: Health Information Visualization and
Visual Analytics
Todd R. Johnson, PhD
ANSCOMBE’S
QUARTET (1973)
Four data sets with nearly identical summary statistics:
Mean
Variance
Correlation
Regression line
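The claim can be checked numerically. The following is a minimal sketch in plain Python, assuming the standard published values of Anscombe's four data sets (typed in here, not taken from the slides); the helper name `summarize` is illustrative only.

```python
from math import sqrt

# Anscombe's quartet: sets I-III share the same x values.
x123 = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
quartet = {
    "I": (x123, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    "II": (x123, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    "III": (x123, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    "IV": ([8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8],
           [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}

def summarize(xs, ys):
    """Return mean, sample variance, correlation, and least-squares line."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return {
        "mean_x": mean_x,
        "var_x": sxx / (n - 1),
        "mean_y": mean_y,
        "var_y": syy / (n - 1),
        "corr": sxy / sqrt(sxx * syy),
        "slope": slope,
        "intercept": mean_y - slope * mean_x,
    }

summaries = {name: summarize(xs, ys) for name, (xs, ys) in quartet.items()}
for name, stats in summaries.items():
    print(name, {key: round(value, 2) for key, value in stats.items()})
```

The four summaries come out almost the same, yet the scatterplots look completely different, which is the point of the quartet: summary statistics alone can hide the structure of the data.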
WHAT IS INFORMATION
VISUALIZATION?
Abstract data (no natural physical form) vs. image of a real object
Static image?
Zoomable Satellite view?
Information
Visualization?
Static image?
Zoomable view plus click on icons to get more details
Information
Visualization?
[Image: Sky, Shady side of pyramid, Sunny side of pyramid]
WHAT IS DATA
VISUALIZATION?
“The use of visual representation to explore, make sense of, and communicate data.”
Stephen Few, author, scientist
Syllabus Review
BMI 6340 Health Information
Visualization and Visual Analytics
3 Semester Credit Hours
Secondary
Houston, TX 77030
Office Hours
Homeworks: 40%
Term Project Proposal: 10%
Midterm Project: 10%
20%
Total: 100%
Late Assignment Policy
25% penalty for each day after the due date.
Example:
Homeworks
Open book, notes, and web. Just do your own work so that you learn the material.
Online in Canvas
Dashboards
Time-Series Data
Barchart
An unsorted barchart does not work well
Sorted from low to high also works
Sorting from low to high supports rank comparisons
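To make the sorting point concrete, here is a small sketch that draws a text barchart with and without sorting. The category labels and values are hypothetical, and `text_barchart` is an illustrative helper, not part of any library.

```python
# Hypothetical values, for illustration only.
values = {"A": 42, "B": 17, "C": 58, "D": 29}

def text_barchart(data, sort=False):
    """Return one text line per bar; optionally sort bars from low to high."""
    items = list(data.items())
    if sort:
        items.sort(key=lambda pair: pair[1])  # low to high
    return [f"{label} {'#' * (value // 2)} {value}" for label, value in items]

unsorted_chart = text_barchart(values)
sorted_chart = text_barchart(values, sort=True)

print("\n".join(unsorted_chart))
print()
print("\n".join(sorted_chart))

# In the sorted chart, rank can be read straight off the bar order.
ranked_labels = [line.split()[0] for line in sorted_chart]
```

In the sorted version the position of each bar encodes its rank, so the reader does not have to compare bar lengths pairwise.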
Part to Whole
Relations
What proportion does one value contribute to a whole?
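A part-to-whole question reduces to dividing each value by the total. The sketch below assumes hypothetical visit counts for two clinics, chosen so that the resulting shares happen to match the values 23.611 and 76.389 listed for 2009 elsewhere in these slides; the counts themselves are not from the source.

```python
def proportions(counts):
    """Return each part as a percentage of the whole."""
    total = sum(counts.values())
    return {name: 100 * value / total for name, value in counts.items()}

# Hypothetical counts (assumption, not from the slides).
visits_2009 = {"Clinic 1": 17, "Clinic 2": 55}
shares = proportions(visits_2009)

for name, share in shares.items():
    print(f"{name}: {share:.3f}%")
```

Because the shares always sum to 100, a part-to-whole display only makes sense when the categories are exhaustive and do not overlap.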
What is the distribution of Birthweight by Race?
Correlations
Countries: population, health spending per person, GDP, etc.
Questions
Point events
Questions
(Encoding)
Visual Representation
Perception + Knowledge
Information
Data
Country   Region               Income per Person   Life Expectancy   Population
China     Asia                 9502                75                1.35 B
Congo     Sub-Saharan Africa   403                 50                70 M
Size X
Mapping
(Encoding)
Visual Representation
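A mapping (encoding) can be thought of as a table that assigns each visual property to a data field. The sketch below uses the two data rows from the slide; the two mappings, the field names, and the `encode` helper are hypothetical illustrations, not the mappings shown in the lecture figures.

```python
# Rows from the slide's data table; populations written as plain numbers.
rows = [
    {"country": "China", "region": "Asia", "income": 9502,
     "life_expectancy": 75, "population": 1.35e9},
    {"country": "Congo", "region": "Sub-Saharan Africa", "income": 403,
     "life_expectancy": 50, "population": 70e6},
]

def encode(rows, mapping):
    """Apply a mapping (visual property -> data field) to every row."""
    return [{prop: row[field] for prop, field in mapping.items()} for row in rows]

# Hypothetical mapping A: a bubble chart.
bubble = {"x": "income", "y": "life_expectancy",
          "size": "population", "color": "region"}

# Hypothetical mapping B: a bar chart of the same data.
bars = {"bar_label": "country", "bar_height": "life_expectancy",
        "bar_color": "region"}

bubble_marks = encode(rows, bubble)
bar_marks = encode(rows, bars)
print(bubble_marks[0])
print(bar_marks[0])
```

The data never change between the two calls; only the mapping does, and that alone produces a different set of marks to draw.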
Changing the mapping changes the visual representation
Data
Country   Region               Income per Person   Life Expectancy   Population
China     Asia                 9502                75                1.35 B
Congo     Sub-Saharan Africa   403                 50                70 M
Mapping
(Encoding)
Visual Representation
What is the mapping?
All Kinds of Mappings are Possible, But Not All Are Good
What’s the Mapping?
Not all mappings are good!
Bar Height
Bar Color
Bars ordered
Data
Mapping (Encoding)
An effective mapping depends on
Characteristics of the data
How we perceive visual objects and relationships
Let’s look at these in detail
Data (clinic, year, proportion of patient visits):
1   2009   23.611
2   2009   76.389
Mapping: Row, Column, Hindu-Arabic Numeral
[Table: columns 2008, 2009, 2010, 2011, 2012]
Very precise
Which clinic has a higher proportion of patient visits in 2012?
[Table: Clinic 1, years 2008 to 2012]
Much harder
Multiple comparisons
Hardest yet?
If the trends continue, approximately what proportion of patient visits will Clinic 1 have in 2013? Around 70?
Very difficult
Calculate growth
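With a table of numerals, answering the 2013 question means calculating the growth by hand. The sketch below shows one way to do that calculation, a least-squares linear trend. The yearly proportions for Clinic 1 are hypothetical (the slides' actual values are not available here), and `linear_trend` is an illustrative helper.

```python
def linear_trend(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

years = [2008, 2009, 2010, 2011, 2012]
# Hypothetical proportions of patient visits for Clinic 1 (percent).
clinic1_share = [22.0, 31.0, 39.0, 52.0, 61.0]

slope, intercept = linear_trend(years, clinic1_share)
forecast_2013 = slope * 2013 + intercept
print(f"growth per year: {slope:.1f} points, 2013 forecast: {forecast_2013:.1f}%")
```

On a line chart the same estimate can be made by eye, simply by extending the line, which is why the numeral table is rated "very difficult" for this particular question.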
WHICH IS EASIEST?
Which clinic has a higher proportion of patient visits in 2012?
Is the proportion of patients visiting Clinic 1 generally increasing over time?
From 2008 to 2011 does one clinic consistently see more patients?
Between which two years does one clinic overtake the other in terms of proportion of patients seen?
If the trends continue, approximately what proportion of patient visits will Clinic 1 have in 2013? Around 70?
Key Points
The best visualization (mapping) depends on the data and the information need(s)
To pick the best graph you need to know your data, the users’ information need(s), and the users’ background knowledge
You need to understand variables, measurement scales, and visual perception
Variables and Measurement Scales
Quantitative
Stevens’s Scale Types
Formal Properties                          Nominal   Ordinal   Interval   Ratio
                                           (=, ≠)    (<, >)    (-)        (÷)
Category (equality)                        X         X         X          X
Magnitude (greater or less)                          X         X          X
Equal Interval (equality of differences)                       X          X
Absolute Zero                                                             X
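The scale types are cumulative: each type keeps the properties of the types before it and adds one more. The sketch below encodes that idea; the identifiers are illustrative, and the fourth property (an absolute zero, held only by ratio scales) follows Stevens's standard account.

```python
# Scale types and formal properties, both ordered from weakest to strongest.
SCALES = ["nominal", "ordinal", "interval", "ratio"]
PROPERTIES = ["category", "magnitude", "equal_interval", "absolute_zero"]

# Comparison that each property makes meaningful.
OPERATIONS = {
    "category": "= and ≠",
    "magnitude": "< and >",
    "equal_interval": "differences (-)",
    "absolute_zero": "ratios (÷)",
}

def has_property(scale, prop):
    """A scale has a property if the property is at or below the scale's level."""
    return PROPERTIES.index(prop) <= SCALES.index(scale)

def meaningful_comparisons(scale):
    """List the comparisons that are meaningful for values on this scale."""
    return [OPERATIONS[p] for p in PROPERTIES if has_property(scale, p)]

for scale in SCALES:
    print(f"{scale:8s} {meaningful_comparisons(scale)}")
```

For example, `meaningful_comparisons("interval")` omits ratios, which is why "twice as hot" is not a meaningful statement about Celsius temperatures.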
Nominal
Values are non-numeric with no meaningful order OR values are numbers used as names or labels, where the value of the number is meaningless
10 °C = 283.15 K
20 °C = 293.15 K
0 °C
20 °C is just 1.035 times as much heat as 10 °C: 293.15/283.15 = 1.035
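The arithmetic above can be reproduced in a few lines. This sketch contrasts the naive ratio of the Celsius readings with the ratio on the Kelvin scale, which has an absolute zero; the function name is illustrative.

```python
def celsius_to_kelvin(celsius):
    """Convert a Celsius temperature to kelvins."""
    return celsius + 273.15

# Celsius is an interval scale, so this ratio is not meaningful.
naive_ratio = 20 / 10

# Kelvin is a ratio scale, so this ratio is meaningful.
kelvin_ratio = celsius_to_kelvin(20) / celsius_to_kelvin(10)

print(f"naive ratio of Celsius readings: {naive_ratio:.3f}")
print(f"ratio on the Kelvin scale:       {kelvin_ratio:.3f}")
```

The Celsius readings suggest a doubling, while the Kelvin values show an increase of only about 3.5 percent.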
Scales and Mappings
mapping?
mappings are mismatched
Ratio
Nominal
Bar Height
Ratio
Bar Color
Countries (Nominal) to Distinct Labeled Bars (Nominal property of
What’s wrong with this graph?
Y axis is no longer uniform
Looks like ratio, but isn’t
An Accurate Mapping is Necessary But Not Sufficient for an Effective Mapping
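One way to read "accurate" is that the scale type of the data matches the scale type that the visual property conveys. The sketch below checks that condition. The assignment of scale types to visual properties is an assumption made for illustration (the slides name bar height as ratio and distinct labeled bars as nominal; the other entries are guesses), and `check_mapping` is a hypothetical helper.

```python
LEVELS = {"nominal": 0, "ordinal": 1, "interval": 2, "ratio": 3}

# Assumed scale type conveyed by each visual property (illustrative only).
CHANNEL_SCALE = {
    "bar_height": "ratio",
    "bar_order": "ordinal",
    "bar_color_hue": "nominal",
    "bar_label": "nominal",
}

def check_mapping(data_scale, channel):
    """Compare the data's scale type with the scale type the channel conveys."""
    difference = LEVELS[CHANNEL_SCALE[channel]] - LEVELS[data_scale]
    if difference == 0:
        return "match"
    if difference > 0:
        return "mismatch: visual property implies more than the data supports"
    return "mismatch: visual property cannot show all of the data's properties"

print(check_mapping("nominal", "bar_label"))      # countries as labeled bars
print(check_mapping("ratio", "bar_height"))       # a ratio value as bar height
print(check_mapping("nominal", "bar_height"))     # categories shown as heights
print(check_mapping("ratio", "bar_color_hue"))    # a ratio value shown as hue
```

A "match" here only establishes accuracy. Whether the mapping is also effective still depends on how well people perceive the visual property and on the information need.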
Summary
Information visualization is the use of computer-supported, interactive visualizations of abstract data to amplify cognition
A data visualization
Nominal (categorical)
Ordinal
Interval
Ratio