VISU2
VISU2
Introduction
Marco Winckler
Université Côte d’Azur (Polytech Nice) | I3S | SPARKS team | bureau 453
[email protected]
http://www.i3s.unice.fr/~winckler/
Data, information and knowledge
2
‘Misleading’ with data visualization
3
‘Misleading’ with data visualization
• Example of omitting data, where (a) shows the original data source,
and (b) shows half of the data of (a), creating a trend that cannot be
seen in (a) (graphs created in excel)
4
• A final example is the correlation – causation issue. Correlation does
not imply causation. Nevertheless, a correlation is often seen as a
causation, for example by internet articles with headers such as
“People drinking beer live longer; drinking beer is healthy!”.
5
“Graphics reveal data”
• John Snow’s map of
water wells in
London (1854)
8.14
Basics
9.14 10.0
8.0
7.46
7.0
6.77
5.0
8.0
4.82
8.0
5.68
6.58
7.0
5.76
5.0
7.26
4.74mean
mean
7.0 5.0
5.0 9.0
5.0
6.428.0
5.73
9.0 13.0
7.5
5.68
5.68
8.07.58
7.5
5.0
5.0
8.06.95
6.89
9.0
9.0 13.0 7.5
4.74
8.0 4.74
7.91 5.0
8.145.0
7.58.749.0
8.0 5.73
9.0
13.0 7.5
5.73
7.5
8.0
6.77 8.0
8.0
9.0
12.74 9.0
8.0
6.89
6.89
5.76
7.5
7.71
7.5
8.74 mean
13.0 12.74
9.0 8.0
7.5 7.71
9.0 7.5 var.
var.9.0 10.0
10.0 9.0
7.5 3.75 8.81
9.0 10.0
3.75 9.0
7.5 3.75
10.0 8.77
3.75 10.0 9.0
10.0 3.75 7.11 8.0
3.75 10.0
10.0 3.75 8.84
3.75
8.77 var.9.0 10.0 7.11 3.758.0 10.08.84 3.75corr.10.0
corr. 11.0
3.75
0.816
0.816 10.0 8.33 11.0
3.750.816
0.816 9.26 11.00.816
0.816 7.81 8.0 8.47
0.816
0.816
9.26 corr.
11.0 7.81
0.816 8.0 8.47
0.816 0.816 14.0 9.96
0.816 14.0 8.10 14.0 8.84 8.0 7.04
8.10 14.0 8.84 8.0 7.04 6.0 7.24 6.0 6.13 6.0 6.08 8.0 5.25
6.13 6.0 6.08 8.0 5.25 4.0 4.26 4.0 3.10 4.0 5.39 19.0 12.50
3.10 4.0 5.39 19.0 12.50 12.0 10.84 12.0 9.13 12.0 8.15 8.0 5.56
7.0 4.82 7.0 7.26 7.0 6.42 8.0 7.91
9.13 12.0 8.15 8.0 5.56
5.0 5.68 5.0 4.74 5.0 5.73 8.0 6.89
7.26 7.0 6.42 8.0 7.91 mean 9.0 7.5 9.0 7.5 9.0 7.5 9.0 7.5
4.74 5.0 5.73 8.0 6.89 var. 10.0 3.75 10.0 3.75 10.0 3.75 10.0 3.75
7.5 9.0 7.5 9.0 7.5 corr. 0.816 0.816 0.816 0.816
3.75 10.0 3.75 10.0 3.75
0.816 0.816 0.816
8
Presentation of temporal data; facts and evidences
Communicating data
https://www.youtube.com/watch?v=jbkSRLYSojo
9
References
• Tamara Munzner. Visualization Analysis and
Design. AK Peters Visualization Series, CRC Press
(2014).
• Colin Ware. Information Visualization, Third
Edition: Perception for Design (Interactive
Technologies). Morgan Kaufmann. 536 pages
(2012)
• Scott Murray. Interactive Data Visualization for
the Web. O'Reilly Media. 273 pages (2013)
• Edward Tufte. The Visual Display of Quantitative
Information. 1983
10
Objectives:
• The goal is to present information visualization techniques and apply them to
solve problems related to the interaction with large datasets
• Understand the principles of information visualization
• Know the Schneiderman’s mantra of information visualization
• Know the main information visualization techniques
• Know the tools for dealing with information visualization
• Know the data structures used to visualize data
• Implement a pipeline for information visualization
• Interact and use different information visualization techniques
• Be able to reuse information visualization techniques available
• Be able to program basic information visualization techniques
11
Planning
• Contents:
• Introduction to information visualization
• Information visualization pipeline
• Information visualization techniques (ex. graphs, hierachies,
multidimensional data, …)
• Data processing
• Programming of information visualization techniques
12
Information visualization
x
Computer Graphics
13
The domain
Graphics pipeline
14
The domain
Graphics pipeline
(2)
(1) Ward, M., Grinstein, G e Keim, D. Interative Data (2) Ware, Colin. information Visualization – Perception for
Visualization – Foundations, Techniques and Design. San Francisco, CA: Morgan Kaufmann, 2000 16
Applications. Wellesley, MA: A K Peters, 2010
The domain
User interaction
17
Principles of data visualization
• Characterizing data
Scientific data: information
• User perception associates with
positions/regions of a space
• Users task and interaction (implicit or explicit geometry)
18
Trajectories
Characterizing data
• Many classifications…
grids
Types of Type of data
fields positions
dataset attributes
ordered category
items,
geometry
dynamic attributes
quantitative ordinal (qualitative)
clusters,
static lists, items
sets attributes
20
Munzner, Tamara Visualization Analysis and Design. Boca Raton, FL:Taylor & Francis, 2014
Characterization of data
Attributes
Type of data
• Basic types
ordered category
• Item (node)
• Attributes
• Edge/line quantitative ordinal (qualitative)
• Position
• Grid Semantic of data
Key/value
Independent/dependent
Value scalar, value vectorial, …
Temporal/non-temporal
spatial/non-spatial
Continuous/discrete
Munzner, Tamara Visualization Analysis and Design. Boca Raton, FL:Taylor & Francis, 2014
Exercise
Type of data
tables items
attributes
grids
Types of Type of data
fields positions
dataset attributes
ordered category
items,
geometry
dynamic attributes
quantitative ordinal (qualitative)
clusters,
static lists, items Numeric, quantitative or ordered
sets attributes Categories (items in a enumeration)
24
Munzner, Tamara Visualization Analysis and Design. Boca Raton, FL:Taylor & Francis, 2014
Principles of data visualization
• Characterizing data
Process by which we
• User perception interpret what is around
• Users task and interaction us to create a mental
representation of
insight evidences
Data recognize, organiza
and interpret
sensorial
mapping mapping-1 information
Visualization
25
Model of
Human processing
• Perceptive (every sense)
• Cognitive (memory + processing)
• Motor (movements, ex. Fitts’ law)
P Non
Visual memory recognized
T = 100 ms graphics
27
Motor system
• Movements are not continuous pas a repetition of discreet micro-
movements
• The movement corresponds to user interaction with physical devices
• Time of micro-movement : 70 ms (cycle base of motor system)
• Time of selection of a graphical element: T = I.log 2D/L with D :
distance, W : size of the target, I = 0,1 sec. (Fitts’ law 1954)
L
• • • •
X0 X1 X2 target
Paul M. Fitts (1954). The information capacity of the human motor system in
controlling the amplitude of movement. Journal of Experimental Psychology,
volume 47, number 6, June 1954, pp. 381–391. (Reprinted in Journal of
Experimental Psychology: General, 121(3):262–269, 1992) 28
Example
29
facts about knowledge of perception
31
Perception
32
Perception
33
Gestalt principles of perception
34
The principle of emergence
• addresses the process where humans usually first identify the whole
and then the parts
35
The principle of reification
36
The principle of multi-stability
Example of Multi-stability,
where the cube can be
seen in two ways (source:
geoff-hart.com)
37
The principle of invariance
• addresses the fact that similar and different objects can be identified
independent of the scale, rotation or translation.
38
Laws of the gestalt psychology in relation to
information visualization
• Law of Similarity: Items that are similar
are grouped together by the brain
39
Laws of the gestalt psychology in relation to
information visualization (cont.)
• Law of Continuity: lines are seen
as following the smoothest path.
40
Laws of the gestalt psychology in relation to
information visualization (cont.)
• Law of common regions: Elements that
are located in the same closed region are
perceived as part of a group.
• Position
• Shape
• Size
• Bright
• Color
• Orientation
• Texture
• Movement
42
Perception: visual variables
• Position
• Shape
• Size
• Bright
• Color
• Orientation
• Texture
• Movement
43
Perception: visual variables
• Position
• Shape
• Size
• Bright
• Color
• Orientation
• Texture
• Movement
44
Principles of data visualization
45
Interaction gulfs (according to D. Norman, 1986)
goal
goal
goal
Execution gulf is the effort
Semantic
Intention comparison
distance
Semantic
Semantic
Intention
Intention comparison
comparison
distance
distance
ofof
of
inputs
outputs
inputs
an intention in terms of
inputs
outputs
outputs
distance
Semantic
meaning of meaning of
distance
distance
Semantic
meaning
meaning of meaning
meaningof commands or instructions
Semantic
of of
Evaluation
ofof
inputs outputs
Evaluationgulf
gulf
of
inputs
inputs outputs
outputs
Executiongulf
detailed
detailed interpretation
action plan interpretation
interpretation the system are meaningful or
action
action plan
plan
understandable by the users,
Articulatory
distance
Articulatory
Articulatory
way to and in accordance with their
distance
distance
gulf
how to way
way to
to
present
ofof
how
how to
to present
present goals
of
inputs
execute
outputs
results
inputs
inputs
execute
outputs
execute
outputs
results
results
Articulatory
Articulatory
Articulatory
distance
distance
ofof
distance
of
execution perception
execution
execution perception
perception
46
system
system
system
User tasks (for visualization)
48
Exemple of techniques
49
Data characterization
Standard visualization model
User interaction
task
Raw data
• to explore ofData
data sets Data of models evaluation
warehouse interests
• to confirm
Data
hypothesis
selection, filtering, Data mining
cleanning grouping, ...
• to present data
task
DATA
task
http://www.jasondavies.com/parallel-sets/
Data characterization
• Many classifications…
Keller, P. e Keller, M. Visual Cues: Practical Data Nominal data: type of rocks
Visualization. IEEE Computer Society Press, 1994
Data characterization
• Many classifications…
Keller, P. e Keller, M. Visual Cues: Practical Data Vectorial fields of speed (color)
Visualization. IEEE Computer Society Press, 1994 http://web.cs.wpi.edu/~matt/courses/cs563/talks/flowvis/flowvis.html
Data characterization
Shneiderman (1996)
• Unidimensional
• Bidirectional maps
• Tridimensional world
• Temporal
• Multidimensional
• Trees
• Networks
• Unidimensional
• Bidirectional maps
• Tridimensional world
• Temporal
• Multidimensional
• Trees
• Networks
• Unidimensional
• Bidirectional maps
• Tridimensional world
• Temporal
• Multidimensional
• Trees
• Networks
Keim (2002)
• Unidimensional
• Dimensional
• Multidimensional
• Text and hypertext
• Hierarchy and graphs
• Algorithms and software Seesoft
• Classes of data
• Type of data
• Nature of the domain
• Dimension of the domain
Classes of data
• https://syntagmatic.github.io/parallel-coordinates/
Classes of data
Material density of oil well perforation in the sea (Binotto et al., 2003)
Classes of data
(VTk)
Stream ribbons:
3D representation showing
the “form” of the flow
Geovis-UFRGS (1998)
• Data = relationship
• link
• hierarquia
http://www.smartmoney.com/marketmap/
https://finviz.com/map.ashx
http://www.bewitched.com/marketmap.html
http://bost.ocks.org/mike/miserables
Characterization of data
• Classes of data
• Type of data data
• Nature of the domain
• Dimension of the domain
Type of dados
• Quantitative
• Categories
• Ordered
• Relations
Type of data x classes of data data
• Characteristics
• Numeric, quantitative or
ordered
• Categories (items in a
enumeration)
Campo escalar
(UFRGS – Pinto e Freitas, 2006)
Type of data (scalar)
atmospheric pressure, temperature and magnitude of wind speed
(UFRGS – Pinto e Freitas, 2007)
• Classes of data
• Type of data
• Nature of domain
• Dimension of domain
Nature of domain
http://archive.ics.uci.edu/ml/datasets/Iris
Nature of domain
• Continous
• Example
• weather
http://tweetping.net/#
Nature of domain
• Continous-discret
Characterization of data
• Classes of data
• Types of data
• Nature of domain
• Dimension of domain
Dimension of domain
• 1D
• Temporal informaiton (ex. Historical facts)
• 2D
• Data associated to flat regions (ex. maps)
• 3D
• Data associated with region in the space (volumes)
• Multidimensional
• Tabular data
Dados 1D
• Text
• Temporam imformation
• Example:
• Seesoft
• Temporal information
indicated by color
Dados 2D
• Information associated with
positions, regions in a map
Representation of the space (2D ou 3D)
cartesian regular
curbe
rectangular points unstructured
Dados 2D/3D
regular
VolView (www.kitware.com)
Representation of space (2D ou 3D)
points
Geovis-UFRGS (1998)
Representation of space (2D ou 3D)
curbe
Flow simulation
(VTk)
Dimension of domain ?
• Hierarchy
• Ex. file system
• Network
• Ex. Web site, communication network, social networks
• Collection of documents
• Ex. digital libraries
• Collection of entities in a database
• Ex; archive of people’s file
Data nD
• Table
• Attributes associated to isolated entinties
• Trees
• Entities with hierarquical relationship
• Graps
• Entities with multiple relationships
Table: each attribute is a dimension
Parallel coordinates
(Inselberg)
trees
97
graphs
98
Approach “what-why-how”
• It is a way to analyze visualization
techniques using three questions:
What? • What (data)
• Which data are represented
• Why (user task)
• Why users are using the visualization
Why? technique
• How (infovis technique)
• Which are the visual codification and
which are the interaction techniques
How? implemented
Munzner, Tamara Visualization Analysis and Design. Boca Raton, FL:Taylor & Francis, 2014