IN4089 - Lecture 01 - Intro - What Why How
[*xkcd]
Projects
• 2 Projects:
  • Information Visualization (Open, recommendation: JavaScript / D3)
  • See D3Intro on Brightspace to get started
  • Volume Visualization (C++) - 50% group, 50% individual
• Groups of 3
• Same group for both projects.
• Make groups now: Group Finding channel on Brightspace
Labs
• No mandatory attendance
• Time to work on projects with access to support
• We will be available for feedback/discussion during the lab hours
• Work on and bring up issues early
• Final Mark
  • Written exam (30%)
  • Each component has to get the mark of 5 or higher
• Exam:
  • Lecture material: You should show you understand and can apply what was discussed.
Design Study Methodology
Flipped Classroom
29th Nov. MCV, Interaction, Animation
30th Nov. InfoVis
10th Jan. Indirect Volume Rendering
11th Jan. VolVis
17th Jan. Vector Field Visualization
18th Jan. VolVis
31st Jan. Exam
lecture / lab / exam
VolVis Project Deadline: 03.02.2023
Questions!
Visualization is the process that transforms
(abstract) data into interactive graphical representations
for the purpose of
exploration, confirmation, or presentation.
[Table: datasets A, B, C, D, each with x and y columns]
Three Types of Goals
Present
• “Everything” known about the data
➫ Visualization used for Communication of results
Analyze
• There are hypotheses
➫ Visualization used for Verification or Falsification
Explore
• Nothing is known
➫ Visualization used for data Exploration
Visualization for Data Exploration I
• Can’t machines do (learn) that?
• Machine learning is great for
  • Well defined problems
  • Verifying Hypothesis
• Not so great for
  • Finding the unknown
  • Fuzzy problems
  • Hypothesis generation
Visualization for Data Exploration II
• Interactive Visualization / Visual Analytics
  • great at pattern recognition
  • initial/early exploration
• Sometimes statistics only tell half the story
Numbers do not tell the whole story…
Anscombe’s Quartet
      A             B             C             D
    x      y      x      y      x      y      x      y
 10.0   8.04   10.0   9.14   10.0   7.46    8.0   6.58
  8.0   6.95    8.0   8.14    8.0   6.77    8.0   5.76
 13.0   7.58   13.0   8.74   13.0  12.74    8.0   7.71
  9.0   8.81    9.0   8.77    9.0   7.11    8.0   8.84
 11.0   8.33   11.0   9.26   11.0   7.81    8.0   8.47
 14.0   9.96   14.0   8.10   14.0   8.84    8.0   7.04
  6.0   7.24    6.0   6.13    6.0   6.08    8.0   5.25
  4.0   4.26    4.0   3.10    4.0   5.39   19.0  12.50
 12.0  10.84   12.0   9.13   12.0   8.15    8.0   5.56
  7.0   4.82    7.0   7.26    7.0   6.42    8.0   7.91
  5.0   5.68    5.0   4.74    5.0   5.73    8.0   6.89
Statistics:
Mean(y): A=B=C=D=7.50 (2 decimals)
Variance(y): A=B=C=D=4.125 (+/- 0.003)
Correlation(x,y): A=B=C=D=0.816 (3 decimals)
[Figure: scatter plots of A, B, C, D with fitted lines; R2 = 0.6665 (A), R2 = 0.6662 (B), R2 = 0.6663 (C), R2 = 0.6667 (D)]
http://blog.revolutionanalytics.com/2017/05/the-datasaurus-dozen.html
we are great at pattern recognition
find the red dot
Data Visualization
visualization pictures
History stock market
https://www.flickr.com/photos/watz/2962012261/
The purpose of computing is insight, not numbers.
http://www.eea.europa.eu/data-and-maps/daviz/learn-more/chart-dos-and-donts
http://www.valuewalk.com/2014/06/bubbles-history-shows-us-bonds-stocks/
The two parts of this course
InfoVis
• Abstract Data
• No/limited spatial reference
• N-dimensional
• Heterogeneous
• Numerical, text, images, multimedia
SciVis
• Spatial Field Data
• Mostly 2 or 3-dimensional
• Scientific, engineering, biomedical
Questions!
Visualization Design
A lot of options – Design space is huge
[Figure: example charts of Series 1 over Category 1 to Category 4]
+Interaction
When does Visualization make sense?
Sedlmair et al., IEEE TVCG, 2012
How to build a visualization system?
Sedlmair et al., IEEE TVCG, 2012
Analysis framework: Four levels, three questions
• domain situation
  • who are the target users? Goals and questions?
• abstraction
  • translate from specifics of domain to vocabulary of vis
  • what is shown? data abstraction
  • why is the user looking at it? task abstraction
• idiom
  • how is it shown?
  • visual encoding idiom: how to draw
  • interaction idiom: how to manipulate
• algorithm
  • efficient computation
Munzner. IEEE TVCG, 2009 / Brehmer and Munzner. IEEE TVCG, 2013
Why is validation difficult?
• different ways to get it wrong at each level
• Domain situation: You misunderstood their needs
• Visual encoding/interaction idiom: The way you show it doesn’t work
• Algorithm: Your code is too slow
[Figure: validation methods at each level, including "Observe target users after deployment", "Measure adoption", and, for the visual encoding/interaction idiom, "Justify design with respect to alternatives"; related fields shown include anthropology/ethnography and design]
What? Why? How?
[Figure: overview with labels "Value in cell", "Geometry (Spatial)", "Position"]
Dataset and data types
Attribute types
Attributes
Further reading
• A Multi-Level Typology of Abstract Visualization Tasks. Brehmer and Munzner. IEEE Trans. Visualization and Computer Graphics (Proc. InfoVis) 19:12 (2013), 2376–2385.
• Low-Level Components of Analytic Activity in Information Visualization. Amar, Eagan, and Stasko. Proc. IEEE InfoVis 2005, p 111–117.
• A taxonomy of tools that support the fluent and flexible use of visualizations. Heer and Shneiderman. Communications of the ACM 55:4 (2012), 45–54.
• Rethinking Visualization: A High-Level Taxonomy. Tory and Möller. Proc. IEEE InfoVis 2004, p 151–158.
• Visualization of Time-Oriented Data. Aigner, Miksch, Schumann, and Tominski. Springer, 2011.
Questions!
Why?
Munzner Book - Chapter 3
Actions
• Enjoy
  • newcomer, aka casual, social
• Produce: Annotate (tag), Record, Derive
• Search
  • Location known: Lookup (target known), Browse (target unknown)
  • Location unknown: Locate (target known), Explore (target unknown)
• Query: Identify, Compare, Summarize
Targets
• Attributes (One: Distribution, Extremes; Many: Dependency, Correlation, Similarity), Paths, Spatial Data (Shape)
Combining actions and targets:
• compare trends
• locate outliers
• browse topology
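The four search actions are distinguished by two yes/no questions: is the target known, and is its location known? As a minimal sketch of that two-by-two classification (the function name `search_action` and its parameters are illustrative assumptions, not terminology from the slides):

```python
def search_action(target_known: bool, location_known: bool) -> str:
    """Name the search action for a given state of user knowledge."""
    if location_known:
        # The user knows where to look.
        return "lookup" if target_known else "browse"
    # The user does not know where to look.
    return "locate" if target_known else "explore"


# Enumerate the full two-by-two grid.
for target_known in (True, False):
    for location_known in (True, False):
        action = search_action(target_known, location_known)
        print(f"target known={target_known}, location known={location_known}: {action}")
```

For example, finding a specific city on a map whose layout is unfamiliar is a locate action: the target is known, its location is not.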
Derive
• one of the four major strategies for handling complexity
• Original Data: exports, imports
• Derived Data: trade balance = exports − imports
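The trade balance example of Derive can be sketched in code: a new attribute is computed from existing ones and then becomes available for visual encoding like any original attribute. This is a minimal illustration only; the class `TradeRecord`, the `region` field, and all numbers are invented for the example and do not come from the lecture.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TradeRecord:
    """One item of the original data (hypothetical fields and values)."""
    region: str
    exports: float
    imports: float

    @property
    def trade_balance(self) -> float:
        """Derived attribute: trade balance = exports - imports."""
        return self.exports - self.imports


# Invented sample data, for illustration only.
records = [
    TradeRecord("North", exports=120.0, imports=95.0),
    TradeRecord("South", exports=80.0, imports=110.0),
]

# The derived attribute can now be shown directly, for example as a
# diverging bar chart, instead of asking the viewer to subtract two bars.
balances = {r.region: r.trade_balance for r in records}
print(balances)
```

Deriving the attribute moves the subtraction from the viewer's head into the data, which is why Derive helps with complexity.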
Targets
• Attributes
  • One: Distribution, Extremes
  • Many: Dependency, Correlation, Similarity
• Network Data: Topology, Paths
• Spatial Data: Shape
Questions!
Slides (partially) based on lectures by
Anna Vilanova (TU Eindhoven)
Tamara Munzner (University of British Columbia)