0% found this document useful (0 votes)
9 views

IN4089 - Lecture 01 - Intro - What Why How-Pdfjam

Uploaded by

frankxlebrun
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
9 views

IN4089 - Lecture 01 - Intro - What Why How-Pdfjam

Uploaded by

frankxlebrun
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 16

Practical Information

Data Visualization Lecturers Teaching Assistants

IN4089 Data Visualization


Thomas Höllt - Computer Graphics & Visualization
Nafie
* Nicolas
Thomas Höllt Elmar Eisemann Martin Skrodzki
[email protected] Soumyadeep
Casper

Contact: Brightspace forum for organization, answers.ewi.tudelft.nl for content

[*xkcd] 2

Practical Information Lectures


• Lectures: On Campus lecture hall Boole • No mandatory attendance, but strongly advised
• Labs: • No streaming/recording
• answers.ewi.tudelft.nl for questions (use/follow tag in4089) • Previous years lectures are on Collegerama
• Hands on session Wednesdays 8:45-12:30 in PC Hall 2 (+3), Drebbelweg (no guarantee on completeness wrt this years course)
• Exams: hand-written, on campus" Register no later than 2 weeks before" • Book (extra information)
• Brightspace • Visualization Analysis and Design, Tamara Munzner
• Project assignments ebook: TU Delft Library

• Lecture slides, Papers, etc.

3 4
Projects Labs
• 2 Projects: • No mandatory attendance
• Information Visualization (Open, recommendation: javascript / D3) • Time to work on projects with access to support
• See D3Intro on Brightspace to get started • We will be available in for feedback/discussion during the lab hours
• Volume Visualization (C++) - 50% group, 50% individual

• Groups of 3
• Same group for both projects.
• Work on and bring up issues early"
• Make groups now" Group Finding channel on Brightspace

5 6

Assessment Tentative Schedule


Tuesday Wednesday

• This is IN4089 and gives 5 Credits" 8:45 – 10:30 8:45 – 12:30

Introduction Color Perception


15th Nov. 16th Nov. (11:00)

• Final Mark
Design Study Methodology Flipped Classroom

22th Nov. Visualization Idioms 23th Nov. InfoVis

• Written exam (30%) 29th Nov. MCV, Interaction, Animation 30th Nov. InfoVis

• 2 Projects (70%, 35% each)


Graphs
6th Dec. 7th Dec. InfoVis
Dimensionality Reduction

Course is passed if the final combined grade is 6 or higher


13th Dec. Virtual Reality 14th Dec. InfoVis

20th Dec. Direct Volume Rendering 21th Dec. InfoVis/VolVis InfoVis Project Deadline: 06.01.2023

• Each component has to get the mark of 5 or higher 10th Jan. Indirect Volume Rendering 11th Jan. VolVis

• Exam: 17th Jan. Vector Field Visualization 18th Jan. VolVis lecture
lab

• Lecture material" You should show you understand and can apply what was discussed. 31th Jan. Exam exam
VolVis Project Deadline: 03.02.2023

7 8
"
Questions!
Visualization is the process that transforms
(abstract) data into interactive graphical representations
for the purpose of
exploration, confirmation, or presentation.

10

A B C D A B C D
x y x y x y x y x y x y x y x y
1 8. 1 9. 1 7. 8. 6. 1 8. 1 9. 1 7. 8. 6.
0.
8. 04
6. 0.
8. 1
8. 0.
8. 46
6. 0
8. 58
5. 0.
8. 04
6. 0.
8. 1
8. 0.
8. 46
6. 0
8. 58
5.
0
1 95
7. 0
1 1
8. 0
1 77
12 0
8. 76
7. 0
1 95
7. 0
1 1
8. 0
1 77
12 0
8. 76
7.
3.
9. 58
8. 3.
9. 7
8. 3.
9. .7
7. 0
8. 71
8. 3.
9. 58
8. 3.
9. 7
8. 3.
9. .7
7. 0
8. 71
8.
0
1 81
8. 0
1 7
9. 0
1 11
7. 0
8. 84
8. 0
1 81
8. 0
1 7
9. 0
1 11
7. 0
8. 84
8.
1.
1 33
9. 1.
1 2
8. 1.
1 81
8. 0
8. 47
7. 1.
1 33
9. 1.
1 2
8. 1.
1 81
8. 0
8. 47
7.
4.
6. 96
7. 4.
6. 1
6. 4.
6. 84
6. 0
8. 04
5. 4.
6. 96
7. 4.
6. 1
6. 4.
6. 84
6. 0
8. 04
5.
0
4. 24
4. 0
4. 1
3. 0
4. 08
5. 0
1 25
12 0
4. 24
4. 0
4. 1
3. 0
4. 08
5. 0
1 25
12
0
1 26
10 0
1 1
9. 0
1 39
8. 9.
8. .5
5. 0
1 26
10 0
1 1
9. 0
1 39
8. 9.
8. .5
5.
2.
7. .8
4. 2.
7. 1
7. 2.
7. 15
6. 0
8. 56
7. 2.
7. .8
4. 2.
7. 1
7. 2.
7. 15
6. 0
8. 56
7.
0
5. 82
5. 0
5. 2
4. 0
5. 42
5. 0
8. 91
6. 0
5. 82
5. 0
5. 2
4. 0
5. 42
5. 0
8. 91
6.
0 68 0 7 0 73 0 89 0 68 0 7 0 73 0 89

Visualization is the process that transforms Visualization is the process that transforms
(abstract) data into interactive graphical representations (abstract) data into interactive graphical representations
for the purpose of for the purpose of
exploration, confirmation, or presentation. exploration, confirmation, or presentation.

10 10
Three Types of Goals Visualization for Data Exploration I
Present • Can’t machines do (learn) that#
• “Everything” known about the data,
• Machine learning is great for
➫ Visualization used for Communication of results
• Well defined problems
Analyze
• Verifying Hypothesis
• There are hypotheses
➫ Visualization used Verification or Falsification • Not so great for
Explore • Finding the unknown
• Nothing is known • Fuzzy problems
➫ Visualization used for data Exploration • Hypothesis generation

11 12

Visualization for Data Exploration II Numbers do not tell the whole story…
• Interactive Visualization / Visual Analytics
A B C D

x y x y x y x y

• Human visual system


10.0 8.04 10.0 9.14 10.0 7.46 8.0 6.58

8.0 6.95 8.0 8.14 8.0 6.77 8.0 5.76

great at pattern recognition 13.0 7.58 13.0 8.74 13.0 12.74 8.0 7.71

Show the data for


9.0 8.81 9.0 8.77 9.0 7.11 8.0 8.84

• 11.0 8.33 11.0 9.26 11.0 7.81 8.0 8.47

initial/early exploration 14.0 9.96 14.0 8.10 14.0 8.84 8.0 7.04

6.0 7.24 6.0 6.13 6.0 6.08 8.0 5.25

• Sometimes statistics only tell half the story 4.0 4.26 4.0 3.10 4.0 5.39 19.0 12.50

12.0 10.84 12.0 9.13 12.0 8.15 8.0 5.56

7.0 4.82 7.0 7.26 7.0 6.42 8.0 7.91

5.0 5.68 5.0 4.74 5.0 5.73 8.0 6.89

Anscombe’s Quartet
13 14
Numbers do not tell the whole story… Numbers do not tell the whole story…
15 15
A B C D
Statistics: A B C D R2 = 0.6665 R2 = 0.6662
12 12
x y x y x y x y x y x y x y x y

10.0 8.04 10.0 9.14 10.0 7.46 8.0 6.58


Mean(x): A=B=C=D=9 exact 10.0 8.04 10.0 9.14 10.0 7.46 8.0 6.58 9 9

8.0 6.95 8.0 8.14 8.0 6.77 8.0 5.76


Variance(x): A=B=C=D=11 exact 8.0 6.95 8.0 8.14 8.0 6.77 8.0 5.76
6 6

13.0 7.58 13.0 8.74 13.0 12.74 8.0 7.71 13.0 7.58 13.0 8.74 13.0 12.74 8.0 7.71 A B
Mean(y): A=B=C=D=7.50 (2 decimals) 3
3 7 11 15 19
3
3 7 11 15 19
9.0 8.81 9.0 8.77 9.0 7.11 8.0 8.84 9.0 8.81 9.0 8.77 9.0 7.11 8.0 8.84

11.0 8.33 11.0 9.26 11.0 7.81 8.0 8.47 Variance(y): A=B=C=D=4.125 (+/- 0.003) 11.0 8.33 11.0 9.26 11.0 7.81 8.0 8.47

14.0 9.96 14.0 8.10 14.0 8.84 8.0 7.04 14.0 9.96 14.0 8.10 14.0 8.84 8.0 7.04
Correlation(x,y): A=B=C=D=816 (3 decimals) 15 15
6.0 7.24 6.0 6.13 6.0 6.08 8.0 5.25 6.0 7.24 6.0 6.13 6.0 6.08 8.0 5.25 R2 = 0.6663 R2 = 0.6667
12 12
4.0 4.26 4.0 3.10 4.0 5.39 19.0 12.50 4.0 4.26 4.0 3.10 4.0 5.39 19.0 12.50

12.0 10.84 12.0 9.13 12.0 8.15 8.0 5.56 12.0 10.84 12.0 9.13 12.0 8.15 8.0 5.56 9 9

7.0 4.82 7.0 7.26 7.0 6.42 8.0 7.91 7.0 4.82 7.0 7.26 7.0 6.42 8.0 7.91
6 6

5.0 5.68 5.0 4.74 5.0 5.73 8.0 6.89 5.0 5.68 5.0 4.74 5.0 5.73 8.0 6.89 C D
3 3
3 7 11 15 19 3 7 11 15 19

Anscombe’s Quartet Anscombe’s Quartet


14 15

http://blog.revolutionanalytics.com/2017/05/the-datasaurus-dozen.html
16 http://blog.revolutionanalytics.com/2017/05/the-datasaurus-dozen.html
16
we are great at pattern recognition find the red dot

Data Visualization

visualization pictures
! !
History stock market
https://www.flickr.com/photos/watz/2962012261/ The purpose of computing is insight, not numbers.
http://www.eea.europa.eu/data-and-maps/daviz/learn-more/chart-dos-and-donts

- Richard Wesley Hamming


19 - Card, Mackinlay, Shneiderman
Data Visualization Two Main Flavors of Visualization
Non-Spatial Data Spatial Field Data
Information Visualization Scientific Visualization

History stock market


https://www.flickr.com/photos/watz/2962012261/

http://www.eea.europa.eu/data-and-maps/daviz/learn-more/chart-dos-and-donts

http://www.valuewalk.com/2014/06/bubbles-history-shows-us-bonds-stocks/

21 22

Two Main Flavors of Visualization

"
InfoVis SciVis
Abstract Data
Spatial Field Data
No/limited spatial reference
N-dimensional
Mostly 2 or 3-dimensional
Heterogeneous
Numerical, text, Scientific, engineering,
images, multimedia biomedical
Questions!
The two parts of this course

23
A lot of options – Design space is huge
5

3.8

2.5

1.3

Visualization Design 0
Category 1 Category 2 Category 3 Category 4
5

Series 1
3.8

2.5

1.3

0
Category 1 Category 2 Category 3 Category 4

26

A lot of options – Design space is huge When does Visualization make sense#
5

3.8

2.5

1.3

+Interaction !
Category 1 Category 2 Category 3 Category 4
5

Series 1
3.8

2.5

1.3

0
Category 1 Category 2 Category 3 Category 4

26 Sedlmair et al., IEEE TVCG, 2012 27


How to build a visualization system# How to build a visualization system#

Sedlmair et al., IEEE TVCG, 2012 28 Sedlmair et al., IEEE TVCG, 2012 28

How to build a visualization system# How to build a visualization system#

Sedlmair et al., IEEE TVCG, 2012 28 Sedlmair et al., IEEE TVCG, 2012 28
How to build a visualization system# Analysis framework: Four levels, three questions
• domain situation
• who are the target users# Goals and questions#

Sedlmair et al., IEEE TVCG, 2012 28 Munzner. IEEE TVCG, 2009 / Brehmer and Munzner. IEEE TVCG, 2013 29

Analysis framework: Four levels, three questions Analysis framework: Four levels, three questions
• domain situation • domain situation
• who are the target users# Goals and questions# • who are the target users# Goals and questions#
• abstraction • abstraction
• translate from specifics of domain to vocabulary of vis • translate from specifics of domain to vocabulary of vis
• what is shown# data abstraction • what is shown# data abstraction
• why is the user looking at it# task abstraction • why is the user looking at it# task abstraction
• idiom
• how is it shown#
• visual encoding idiom: how to draw
• interaction idiom: how to manipulate

Munzner. IEEE TVCG, 2009 / Brehmer and Munzner. IEEE TVCG, 2013 29 Munzner. IEEE TVCG, 2009 / Brehmer and Munzner. IEEE TVCG, 2013 29
Analysis framework: Four levels, three questions Why is validation difficult#
• domain situation different ways to get it wrong at each level
• who are the target users# Goals and questions#
• abstraction Domain situation
• translate from specifics of domain to vocabulary of vis You misunderstood their needs

• what is shown# data abstraction Data/task abstraction


• why is the user looking at it# task abstraction You’re showing them the wrong thing

• idiom
Visual encoding/interaction idiom
• how is it shown# The way you show it doesn’t work
• visual encoding idiom: how to draw Algorithm
• interaction idiom: how to manipulate Your code is too slow

• algorithm
• efficient computation
Munzner. IEEE TVCG, 2009 / Brehmer and Munzner. IEEE TVCG, 2013 29 30

Why is validation difficult# Why is validation difficult#


solution: use methods from different fields at each level solution: use methods from different fields at each level
Domain situation Domain situation
Observe target users using existing tools Observe target users using existing tools

Data/task abstraction Data/task abstraction

Visual encoding/interaction idiom Visual encoding/interaction idiom


Justify design with respect to alternatives Justify design with respect to alternatives

Algorithm computer Algorithm technique-


Measure system time/memory science Measure system time/memory driven work
Analyze computational complexity Analyze computational complexity
Analyze results qualitatively Analyze results qualitatively
Measure human time with lab experiment (lab study) Measure human time with lab experiment (lab study)

Observe target users after deployment ( ) Observe target users after deployment ( )
Measure adoption Measure adoption

Munzner. IEEE TVCG, 2009 31 Munzner. IEEE TVCG, 2009 31


Why is validation difficult# Why is validation difficult#
solution: use methods from different fields at each level solution: use methods from different fields at each level
Domain situation
Observe target users using existing tools
anthropology/ Domain situation
Observe target users using existing tools
ethnography
Data/task abstraction Data/task abstraction

design Visual encoding/interaction idiom design Visual encoding/interaction idiom


Justify design with respect to alternatives Justify design with respect to alternatives

computer Algorithm technique- computer Algorithm technique-


science Measure system time/memory driven work science Measure system time/memory driven work
Analyze computational complexity Analyze computational complexity
cognitive Analyze results qualitatively cognitive Analyze results qualitatively
psychology Measure human time with lab experiment (lab study) psychology Measure human time with lab experiment (lab study)

anthropology/
Observe target users after deployment ( ) Observe target users after deployment ( )
Measure adoption ethnography Measure adoption

Munzner. IEEE TVCG, 2009 31 Munzner. IEEE TVCG, 2009 31

Why is validation difficult#


solution: use methods from different fields at each level
problem-driven work
anthropology/ Domain situation
Observe target users using existing tools (design study)
ethnography
Data/task abstraction

design
What" Why" How"
Visual encoding/interaction idiom
Justify design with respect to alternatives

computer Algorithm technique-


science Measure system time/memory driven work
Analyze computational complexity
cognitive Analyze results qualitatively
psychology Measure human time with lab experiment (lab study)

anthropology/
Observe target users after deployment ( )

ethnography Measure adoption

Munzner. IEEE TVCG, 2009 31


Three major datatypes
What?
Datasets Attributes

What? Data Types Attribute Types


Items Attributes Links Positions Grids Categorical Dataset
Dataset
Dataset
Dataset Types
Types
Types
Types Dataset Types
Why? Data and Dataset Types TablesTypes
Dataset
Tables
Tables
Tables Networks
Networks
Tables
Networks
Networks FieldsFields
Networks
Fields
Fields
Spatial
(Continuous)
(Continuous)
(Continuous)
Geometry
(Continuous) Geometry
Fields
Geometry
Geometry (Spatial)
(Spatial)(Spatial)
(Continuous)(Spatial)
Tables Networks & Fields Geometry Clusters, Ordered
Trees Sets, Lists Ordinal Grid of positions
Attributes (columns) Grid of Grid
positions
How? Items Items (nodes) Grids Items Items Tables
Attributes (columns)
Attributes
Attributes (columns)
(columns) Networks
Attributes (columns) Fields Gridof
ofpositions
positions
(Continuous) Geometry
Grid of positions(Spatial)
Attributes Links Positions Positions
Items Items Link Link
Attributes Attributes Quantitative
Items
Items Items Link
Link Cell Cell Link
Grid of positions
(rows)(rows) Attributes (columns) Cell
Cell Cell Position Position
(rows)
(rows) (rows) Node Node Position
Position
Dataset Types Node
Node Node
Ordering Direction Items (item)Link(item)
Cell containing
Cell containing value value (item)
(item) Cell (columns)
Attributes (item)
Attributes (columns)
Tables Networks Fields (Continuous) (rows) Cell containing value Attributes
Attributes(columns)
(columns) Attributes (columns) Position
Sequential Cell
Cellcontaining
containingvalue
value
Attributes (columns) Grid of positions Node
Items Link (item) Value in cell Value in cell
(rows)
Cell
Diverging Cell containing value Attributes
Valueinin(columns)
Value cell
cell Value in cell
Node
Multidimensional Table Table
Multidimensional Trees Trees
Trees
Trees
Multidimensional Table Trees
(item)
Cell containing value Attributes (columns)
Multidimensional
MultidimensionalTable
Table
Value in cell
Cyclic Value in cell
Multidimensional Table Trees
Multidimensional Table Trees

Value in cell
Value in cell
Value in cell
Value
Valueinincell
cell Value in cell
Geometry (Spatial)
Value in cell

Position

Munzner Book - Chapter 2 34


What?

Dataset and data types Dataset and data types


Dataset Availability
Static Dynamic Why?

How?

Data Types Data Types


Items Attributes Links Positions Grids Items Attributes Links Positions Grids

Data and Dataset Types


Tables Networks & Fields Geometry Clusters,
Trees Sets, Lists
Items Items (nodes) Grids Items Items
Attributes Links Positions Positions
Attributes Attributes

35 35
Dataset and data types Attribute types Attributes

Data Types Attribute Types


Items Attributes Links Positions Grids Categorical Ordered
Ordinal Quantitative
Data and Dataset Types
Tables Networks & Fields Geometry Clusters,
Trees Sets, Lists
Items Items (nodes) Grids Items Items
Attributes Links Positions Positions Ordering Direction
Attributes Attributes
Sequential Diverging Cyclic

35 36

Further reading

"
• A Multi-Level Typology of Abstract Visualization Tasks. Brehmer and Munzner. IEEE Trans.
Visualization and Computer Graphics (Proc. InfoVis) 19:12 (2013), 2376–2385.
• Low-Level Components of Analytic Activity in Information Visualization. Amar, Eagan, and
Stasko. Proc. IEEE InfoVis 2005, p 111–117.
• A taxonomy of tools that support the fluent and flexible use of visualizations. Heer and
Shneiderman. Communications of the ACM 55:4 (2012), 45–54.
• Rethinking Visualization: A High-Level Taxonomy. Tory and Möller. Proc. IEEE InfoVis 2004, p Questions!
151–158.
• Visualization of Time-Oriented Data. Aigner, Miksch, Schumann, and Tominski. Springer, 2011.

37
Why? Why?
Actions Targets Actions Targets

Analyze All Data Analyze All Data


Consume Trends Outliers Features Consume Trends Outliers Features
Discover Present Enjoy Discover Present Enjoy

Attributes Attributes
Produce Produce
Annotate Record Derive One Many Annotate Record Derive One Many
tag Distribution Dependency Correlation Similarity tag Distribution Dependency Correlation Similarity

Extremes Extremes
Search Search

{action, target} pairs


Target known Target unknown Target known Target unknown
Location Location
Network Data
–discover Network Data
Lookup Browse Lookup Browse
known known
Location
unknown
Locate Explore Topology distribution Location
unknown
Locate Explore Topology

–compare trends
Query Paths –locate outliers Query Paths
Identify Compare Summarize Identify Compare Summarize
What? –browse topology What?
Spatial Data Spatial Data
Why? Why?
Shape Shape
How? How?
Munzner Book - Chapter 3 Munzner Book - Chapter 3

Actions: Analyze Actions Actions: Analyze Actions

• consume Analyze • consume Analyze


• discover vs present Consume • discover vs present Consume
• aka explore vs explain Discover Present Enjoy
• aka explore vs explain Discover Present Enjoy

• enjoy • enjoy
• newcomer • newcomer
aka casual, social aka casual, social
Produce Produce
• •
Annotate Record Derive Annotate Record Derive
• produce tag
• produce tag

• annotate, record, derive • annotate, record, derive


• crucial design choice • crucial design choice
Search Search

Target known Target unknown41 Target known Target unknown41

Location Location
Lookup Browse Lookup Browse
known known

Location Location
Locate Explore Locate Explore
unknown unknown

Query Query
Attributes
Produce
Annotate Record Derive One Many
tag Distribution Dependen

Why? Extremes

Derive Why: Targets


Search
Actions Targets
Target known Target unknown
• don’t just draw what you’re given" Analyze All Data Location
Lookup Browse Network Data
known
• decide what the right thing to show is Consume Trends Outliers
Location Features
Locate Explore Topology
Discover Present Enjoy unknown
• create it with a series of transformations from the original dataset

• one of the four major strategies for handling complexity Query Paths
Attributes Identify Compare Summarize
Produce
exports Annotate Record Derive One Many
Distribution Dependency Correlation Similarity Spatial Data
imports tag

trade Shape
balance
Extremes
Search
trade balance =Target
exports −imports
known Target unknown
LocationDerived Data
Original Data known
Lookup Browse 42 Network Data 43

"
Location Locate Explore Topology
unknown

Query Paths
Identify Compare Summarize
What?
Spatial Data
Why?
Shape
How?

Questions!
Slides (partially) based on lectures by
Anna Vilanova (TU Eindhoven)
Tamara Munzner (University of British Columbia)

You might also like