0% found this document useful (0 votes)

14 views

Lecture 12 - Evaluation

The document discusses evaluation methods like usability testing and experiments. It explains that usability testing involves observing typical users perform typical tasks with a product in a controlled setting. Data is collected through video recordings and interaction logs to measure performance times, errors, and user satisfaction. Experiments test hypotheses about the relationship between variables, while usability testing aims to check that a system is usable. The document outlines factors to consider for usability testing like representative users and tasks, controlled conditions, and collecting data on metrics like completion times and error rates. It recommends testing with 5-10 participants.

Uploaded by

Red Armani

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views

Lecture 12 - Evaluation

Uploaded by

Red Armani

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 45

Lecture 12 (Part A)

EVALUATION
The aims
• Explain the key concepts and terms used in evaluation
• Introduce different types of evaluation methods.
• Show how different evaluation methods are used for
different purposes at different stages of the design
process and in different contexts of use.
• Show how evaluators mix and modify methods to meet
the demands of evaluating novel systems.
• Discuss some of the challenges that evaluators have
to consider when doing evaluation.
• Illustrate how methods discussed in Chapters 7 and 8
are used in evaluation and describe some methods
that are specific to evaluation.
www.id-book.com 2
Why, what, where and when to
evaluate
Iterative design & evaluation is a continuous
process that examines:
• Why: to check users’ requirements and that they
can use the product and they like it.
• What: a conceptual model, early prototypes of a
new system and later, more complete prototypes.
• Where: in natural and laboratory settings.
• When: throughout design; finished products can be
evaluated to collect information to inform new
products.
www.id-book.com 3
Bruce Tognazzini tells you why you need to
evaluate
“Iterative design, with its repeating cycle of
design and testing, is the only validated
methodology in existence that will consistently
produce successful results. If you don’t have
user-testing as an integral part of your design
process you are going to throw buckets of
money down the drain.”

See AskTog.com for topical discussions about

design and evaluation.

www.id-book.com 4
Types of evaluation
• Controlled settings involving users, eg
usability testing & experiments in
laboratories and living labs.
• Natural settings involving users, eg field
studies and in the wild studies to see
how the product is used in the real world.
• Settings not involving users, e.g. to
predict, analyze & model aspects of the
interface analytics.
www.id-book.com 5
Living labs
• People’s use of technology in their everyday
lives can be evaluated in living labs.
• Such evaluations are too difficult to do in a
usability lab.
• Eg the Aware Home was embedded with a
complex network of sensors and audio/video
recording devices (Abowd et al., 2000).

www.id-book.com 6
Usability testing & field studies can
compliment

www.id-book.com 7
Evaluation case studies
• Experiment to investigate a computer game

• In the wild field study of skiers

• Crowdsourcing

www.id-book.com 8
Challenge & engagement in a
collaborative immersive game
• Physiological measures
were used.
• Players were more engaged when playing
against another person than when playing
against a computer.
• What precautionary measures did the evaluators
take?

www.id-book.com 9
Challenge & engagement in a
collaborative immersive game

www.id-book.com 10
What does this data tell you?

www.id-book.com 11
Why study skiers in the wild ?

www.id-book.com 12
e-skiing system components

www.id-book.com 13
What did we learn from the case
studies?
• How to observe users in natural settings.
• Unexpected findings resulting from in the wild
studies.
• Having to develop different data collection and
analysis techniques to evaluate user experience
goals such as challenge and engagement.
• The ability to run experiments on the Internet that
are quick and inexpensive using crowdsourcing.
• How to recruit a large number of participants using
Mechanical Turk.Test text

www.id-book.com 14
Evaluation methods
Method Controlled Natural Without users
settings settings

Observing x x

Asking users x x

Asking x x
experts
Testing x
Modeling x

www.id-book.com 15
The language of evaluation
Analytics Informed consent form
Analytical evaluation In the wild evaluation
Living laboratory
Biases
Predictive evaluation
Controlled experiment Reliability
Crowdsourcing Scope
Ecological validity Summative evaluation
Expert review or crit Usability laboratory
User studies
Field study
Usability testing
Formative evaluation Users or participants
Heuristic evaluation Validity
www.id-book.com 16
Participants’ rights and getting their
consent
• Participants need to be told why the
evaluation is being done, what they will be
asked to do and their rights.
• Informed consent forms provide this
information.
• The design of the informed consent form, the
evaluation process, data analysis and data
storage methods are typically approved by a
high authority, eg. Institutional Review Board.
www.id-book.com 17
Things to consider when
interpreting data
• Reliability: does the method produce the
same results on separate occasions?
• Validity: does the method measure what it is
intended to measure?
• Ecological validity: does the environment of
the evaluation distort the results?
• Biases: Are there biases that distort the
results?
• Scope: How generalizable are the results?

www.id-book.com 18
Key points
• Evaluation and design are very closely integrated.
• Some of the same data gathering methods are used in
evaluation as for establishing requirements and
identifying users’ needs, e.g. observation, interviews,
and questionnaires.
• Evaluations can be done in controlled settings such as
laboratories, less controlled field settings, or where
users are not present.
• Usability testing and experiments enable the evaluator
to have a high level of control over what gets tested,
whereas evaluators typically impose little or no control
on participants in field studies.

www.id-book.com 19
Lecture 12 (Part B)
EVALUATION
The aims:
• Explain how to do usability testing

• Outline the basics of experimental

design

• Describe how to do field studies

www.id-book.com 21
Usability testing
• Involves recording performance of typical users
doing typical tasks.
• Controlled settings.
• Users are observed and timed.
• Data is recorded on video & key presses are
logged.
• The data is used to calculate performance times,
and to identify & explain errors.
• User satisfaction is evaluated using
questionnaires & interviews.
• Field observations may be used to provide
contextual understanding.
www.id-book.com 22
Experiments & usability testing

• Experiments test hypotheses to discover new

knowledge by investigating the relationship
between two or more variables.

• Usability testing is applied experimentation.

• Developers check that the system is usable by the

intended user population for their tasks.

www.id-book.com 23
Usability testing & research
Usability testing Experiments for research

• Improve products • Discover knowledge

• Few participants • Many participants
• Results inform design • Results validated
• Usually not completely statistically
replicable • Must be replicable
• Conditions controlled as • Strongly controlled
much as possible conditions
• Procedure planned • Experimental design
• Results reported to • Scientific report to
developers scientific community

www.id-book.com 24
Usability testing
• Goals & questions focus on how well users
perform tasks with the product.

• Comparison of products or prototypes is

common.

• Focus is on time to complete task & number &

type of errors.

• Data collected by video & interaction logging.

• Testing is central.

• User satisfaction questionnaires & interviews

provide data about users’ opinions.

www.id-book.com 25
Testing conditions
• Usability lab or other controlled space.
• Emphasis on:
– selecting representative users;
– developing representative tasks.
• 5-10 users typically selected.
• Tasks usually around 30 minutes
• Test conditions are the same for every
participant.
• Informed consent form explains procedures and
deals with ethical issues.
www.id-book.com 26
Types of data
 Time to complete a task.

 Time to complete a task after a specified time away

from the product.

 Number and type of errors per task.

 Number of errors per unit of time.

 Number of times online help and manuals accessed.

 Number of users making an error.

 Number of users successfully completing a task.

www.id-book.com 27
How many participants is enough
for user testing?
• The number is a practical issue.
• Depends on:
– schedule for testing;
– availability of participants;
– cost of running tests.
• Typically 5-10 participants.
• Some experts argue that testing should
continue until no new insights are gained.
www.id-book.com 28
Usability lab with observers
watching a user & assistant

www.id-book.com 29
Portable equipment for use in the
field

www.id-book.com 30
Portable equipment for use in the
field

www.id-book.com 31
Mobile head-mounted eye tracker

www.id-book.com 32
Usability testing the iPad
• 7 participants with 3+ months experience with iPhones
• Signed an informed consent form explaining:
– what the participant would be asked to do;
– the length of time needed for the study;
– the compensation that would be offered for participating;
– participants’ right to withdraw from the study at any time;
– a promise that the person’s identity would not be disclosed; and
– an agreement that the data collected would be confidential and
would be available to only the evaluators
• Then they were asked to explore the iPad
• Next they were asked to perform randomly assigned specified
tasks

www.id-book.com 33
Examples of the tasks

www.id-book.com 34
Example of the equipment

www.id-book.com 35
Problems and actions
• Problems detected:
– Accessing the Web was difficult
– Lack of affordance and feedback
– Getting lost
– Knowing where to tap
• Actions by evaluators:
– Reported to developers
– Made available to public on nngroup.com
• Accessibility for all users important

www.id-book.com 36
Experiments
• Test hypothesis
• Predict the relationship between two or
more variables.
• Independent variable is manipulated by the
researcher.
• Dependent variable influenced by the
independent variable.
• Typical experimental designs have one or
two independent variables.
• Validated statistically & replicable.

www.id-book.com 37
Experimental designs
• Different participants - single group of
participants is allocated randomly to the
experimental conditions.
• Same participants - all participants appear
in both conditions.
• Matched participants - participants are
matched in pairs, e.g., based on expertise,
gender, etc.
www.id-book.com 38
Different, same, matched
participant design
Design Advantages Disadvantages

Different No order effects Many subjects &

individual differences a
problem
Same Few individuals, no Counter-balancing
individual differences needed because of
ordering effects
Matched Same as different Cannot be sure of
participants but perfect matching on all
individual differences differences
reduced

www.id-book.com 39
Field studies
• Field studies are done in natural settings.
• “In the wild” is a term for prototypes being used
freely in natural settings.
• Aim to understand what users do naturally and
how technology impacts them.
• Field studies are used in product design to:
– identify opportunities for new technology;
– determine design requirements;
– decide how best to introduce new technology;
– evaluate technology in use.
www.id-book.com 40
Technology for context-aware field
data collection

www.id-book.com 41
An in the wild study:
UbiFit Garden

www.id-book.com 42
Data collection & analysis

• Observation & interviews

– Notes, pictures, recordings
– Video
– Logging
• Analyzes
– Categorized
– Categories can be provided by theory
• Grounded theory
• Activity theory

www.id-book.com 43
Data presentation
• The aim is to show how the products are
being appropriated and integrated into
their surroundings.
• Typical presentation forms include:
– Vignettes,
– Excerpts,
– Critical incidents,
– Patterns, and narratives.
www.id-book.com 44
Key points
• Usability testing takes place in controlled usability labs or temporary labs.

• Usability testing focuses on performance measures, eg. how long and how many errors
are made when completing a set of predefined tasks. Indirect observation (video and
keystroke logging), user satisfaction questionnaires and interviews are also collected.

• Affordable, remote testing systems are more portable than usability labs. Many also
contain mobile eye-tracking and other devices.

• Experiments test a hypothesis by manipulating certain variables while keeping others

constant.

• The experimenter controls independent variable(s) in order to measure dependent

variable(s).

• Field studies are evaluation studies that are carried out in natural settings to discover
how people interact with technology in the real world.

• Field studies that involve the deployment of prototypes or technologies in natural settings
may also be referred to as ‘in the wild’.

• Sometimes the findings of a field study are unexpected, especially for in the wild studies
in which explore how novel technologies are used by participants in their own homes,
places of work, or outside.
www.id-book.com 45

SDLC Assignment 1 BKC18400
67% (3)
SDLC Assignment 1 BKC18400
24 pages
Plano Mudulo Puertas Q21-1050
No ratings yet
Plano Mudulo Puertas Q21-1050
2 pages
Practical Guide To Usability Testing PDF
No ratings yet
Practical Guide To Usability Testing PDF
24 pages
A-10-Lock On Guide Quick
100% (1)
A-10-Lock On Guide Quick
40 pages
Chapter 12 - Introducing Evaluation
No ratings yet
Chapter 12 - Introducing Evaluation
20 pages
Automated Software Testing Interview Questions You'll Most Likely Be Asked
From Everand
Automated Software Testing Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet
Research Data Strategy
No ratings yet
Research Data Strategy
9 pages
07-Evaluation_week15
No ratings yet
07-Evaluation_week15
61 pages
Chapter14 - Evaluation Studies From Controlled To Natural Settings - 2019
No ratings yet
Chapter14 - Evaluation Studies From Controlled To Natural Settings - 2019
19 pages
08 UI Testing
No ratings yet
08 UI Testing
48 pages
W13 - Evaluation Studies - From Controlled To Natural Settings
No ratings yet
W13 - Evaluation Studies - From Controlled To Natural Settings
40 pages
Chapter 15 5e
No ratings yet
Chapter 15 5e
23 pages
Chapter 14 5e
No ratings yet
Chapter 14 5e
18 pages
LU5 Usability Evaluation Methods
No ratings yet
LU5 Usability Evaluation Methods
53 pages
Chapter 14 5e
No ratings yet
Chapter 14 5e
22 pages
Chapter 15 5e
No ratings yet
Chapter 15 5e
26 pages
ch15 Hci
No ratings yet
ch15 Hci
28 pages
HCI Sem1 202021 LU5
No ratings yet
HCI Sem1 202021 LU5
57 pages
Week - 10 Evaluation
No ratings yet
Week - 10 Evaluation
30 pages
Lec11 ch8
No ratings yet
Lec11 ch8
36 pages
Review Ii: Course: COMP6176 / Human - Computer Interaction Year: 2015
No ratings yet
Review Ii: Course: COMP6176 / Human - Computer Interaction Year: 2015
19 pages
UNIT - 3 - Evaluation and User Experience
No ratings yet
UNIT - 3 - Evaluation and User Experience
25 pages
Evaluating Interface Designs: Designing The User Interface: Strategies For Effective Human-Computer Interaction
No ratings yet
Evaluating Interface Designs: Designing The User Interface: Strategies For Effective Human-Computer Interaction
24 pages
Evaluation and The User Experience: Designing The User Interface: Strategies For Effective Human-Computer Interaction
No ratings yet
Evaluation and The User Experience: Designing The User Interface: Strategies For Effective Human-Computer Interaction
26 pages
Usability Testing
100% (2)
Usability Testing
27 pages
Session 13 - 14 - IsYS6596 - Techniques For Designing UX Evaluation
No ratings yet
Session 13 - 14 - IsYS6596 - Techniques For Designing UX Evaluation
47 pages
Chapter 7-Evaluation Techniques and Universal Design
No ratings yet
Chapter 7-Evaluation Techniques and Universal Design
22 pages
Usability Testing - Part 1
No ratings yet
Usability Testing - Part 1
43 pages
Human Computer Interaction
No ratings yet
Human Computer Interaction
56 pages
Introducing Evaluation: ISE 217-Human Computer Interaction
No ratings yet
Introducing Evaluation: ISE 217-Human Computer Interaction
46 pages
HCI - Evaluation Techniques
No ratings yet
HCI - Evaluation Techniques
43 pages
Testing Your Designs
No ratings yet
Testing Your Designs
64 pages
Techno - Module 7
No ratings yet
Techno - Module 7
4 pages
DTUI6 Chap05 accessiblePPT
No ratings yet
DTUI6 Chap05 accessiblePPT
28 pages
CH 4 Evaluating Interface Designs (Reference Book 01)
No ratings yet
CH 4 Evaluating Interface Designs (Reference Book 01)
30 pages
HCI Module 6 and 7
No ratings yet
HCI Module 6 and 7
44 pages
Evaluation in HCI
No ratings yet
Evaluation in HCI
41 pages
Chapter 2 HCI
No ratings yet
Chapter 2 HCI
32 pages
2023 Slide Deck ITHIA1-33 - Week 2 - Lesson 2
No ratings yet
2023 Slide Deck ITHIA1-33 - Week 2 - Lesson 2
11 pages
Week 09 - Data Collection Techniques
No ratings yet
Week 09 - Data Collection Techniques
25 pages
Human-Computer Interaction: Evaluation - Part I
No ratings yet
Human-Computer Interaction: Evaluation - Part I
18 pages
DT unit 6
No ratings yet
DT unit 6
50 pages
Chap7 Evaluation
No ratings yet
Chap7 Evaluation
20 pages
Topic 4. Usability and User EXperience (UX)
No ratings yet
Topic 4. Usability and User EXperience (UX)
63 pages
An Introduction To Usability Testing
No ratings yet
An Introduction To Usability Testing
8 pages
LECTURE_-_10_-_Usability_testing[1]
No ratings yet
LECTURE_-_10_-_Usability_testing[1]
23 pages
Chapter9 HCI Evaluation Techniques
No ratings yet
Chapter9 HCI Evaluation Techniques
6 pages
How To Plan A Usability Test
No ratings yet
How To Plan A Usability Test
7 pages
Student-Run Usability Testing
No ratings yet
Student-Run Usability Testing
9 pages
User Interfaces Evaluation: DCO10104: User-Centered Design and Testing
No ratings yet
User Interfaces Evaluation: DCO10104: User-Centered Design and Testing
25 pages
chp 5
No ratings yet
chp 5
71 pages
LECTURE_-_7-_Evaluation_Techniques[1]
No ratings yet
LECTURE_-_7-_Evaluation_Techniques[1]
34 pages
Chapter 12
No ratings yet
Chapter 12
28 pages
Slides-Usability Testing
No ratings yet
Slides-Usability Testing
21 pages
Hci - 16
No ratings yet
Hci - 16
36 pages
Usability Design of Software Applications-gzgCderghzzdhz-Usability Testingzdhzdhazhdzdh
No ratings yet
Usability Design of Software Applications-gzgCderghzzdhz-Usability Testingzdhzdhazhdzdh
9 pages
M2s1-Supplementary 2
No ratings yet
M2s1-Supplementary 2
2 pages
Chapter Six: Evaluation Techniques and Universal Design
No ratings yet
Chapter Six: Evaluation Techniques and Universal Design
27 pages
Chapter 16 5e
No ratings yet
Chapter 16 5e
24 pages
Human-Computer Interaction (HCI) : Evaluation Techniques
No ratings yet
Human-Computer Interaction (HCI) : Evaluation Techniques
6 pages
Chapter 7
No ratings yet
Chapter 7
59 pages
18mit13c U5
No ratings yet
18mit13c U5
14 pages
Chapter 16 5e
No ratings yet
Chapter 16 5e
27 pages
W02-DesignHeuristics_UsabilityTesting-01
No ratings yet
W02-DesignHeuristics_UsabilityTesting-01
27 pages
The Vertical Cut
No ratings yet
The Vertical Cut
5 pages
r Gupta.pdf Compressed
No ratings yet
r Gupta.pdf Compressed
201 pages
Unit - 1 Architecture of Distributed Systems
No ratings yet
Unit - 1 Architecture of Distributed Systems
22 pages
00157500_B MAAP-X Quick Reference Guide
No ratings yet
00157500_B MAAP-X Quick Reference Guide
38 pages
Plant Breeding Tools - Software For Plant Breeders PDF
67% (3)
Plant Breeding Tools - Software For Plant Breeders PDF
40 pages
VUMTdeluxe Manual
No ratings yet
VUMTdeluxe Manual
17 pages
Data Security Considerations - (Backups, Archival Storage and Disposal of Data)
No ratings yet
Data Security Considerations - (Backups, Archival Storage and Disposal of Data)
3 pages
Csps
No ratings yet
Csps
52 pages
CN module 3
No ratings yet
CN module 3
55 pages
CloudBased TaskManagement For VVorkFromHomeCulture
No ratings yet
CloudBased TaskManagement For VVorkFromHomeCulture
4 pages
ESF7 FAQs Based On The Orientation As of 23october2023 1
No ratings yet
ESF7 FAQs Based On The Orientation As of 23october2023 1
14 pages
NEW GL Euro Conversion of The Company Code Currency
No ratings yet
NEW GL Euro Conversion of The Company Code Currency
9 pages
Est
No ratings yet
Est
39 pages
Effects of E-Procurement On Supply Chain Management in The Modern Era
No ratings yet
Effects of E-Procurement On Supply Chain Management in The Modern Era
16 pages
Job Portal
No ratings yet
Job Portal
8 pages
CS411-Visual Programming UPdated MIDTERM MCQS Solved by Arslan Arshad (Zain Nasar) With Refrence
No ratings yet
CS411-Visual Programming UPdated MIDTERM MCQS Solved by Arslan Arshad (Zain Nasar) With Refrence
76 pages
Data 1.1 Presentation
No ratings yet
Data 1.1 Presentation
21 pages
Metadata 5
No ratings yet
Metadata 5
3 pages
Irrigation Project Report
No ratings yet
Irrigation Project Report
69 pages
Lesson 4
No ratings yet
Lesson 4
6 pages
Lab 5 - Implement Etherchannel
No ratings yet
Lab 5 - Implement Etherchannel
2 pages
(Computer Supported Cooperative Work) R. Harper, L. Palen, A. Taylor - The Inside Text - Social, Cultural and Design Perspectives On SMS-Springer (2005)
No ratings yet
(Computer Supported Cooperative Work) R. Harper, L. Palen, A. Taylor - The Inside Text - Social, Cultural and Design Perspectives On SMS-Springer (2005)
331 pages
Interface Python With SQL Database
No ratings yet
Interface Python With SQL Database
7 pages
Predictive PDF
100% (1)
Predictive PDF
5 pages
PAKAI Ini Ya
No ratings yet
PAKAI Ini Ya
2 pages
WeScale - CloudRadar - CloudNative - Partie1
No ratings yet
WeScale - CloudRadar - CloudNative - Partie1
29 pages

Lecture 12 - Evaluation

Uploaded by

Lecture 12 - Evaluation

Uploaded by

Lecture 12 (Part A)

See AskTog.com for topical discussions about

• In the wild field study of skiers

• Outline the basics of experimental

• Describe how to do field studies

• Experiments test hypotheses to discover new

• Usability testing is applied experimentation.

• Developers check that the system is usable by the

• Improve products • Discover knowledge

• Comparison of products or prototypes is

• Focus is on time to complete task & number &

• Data collected by video & interaction logging.

• User satisfaction questionnaires & interviews

 Time to complete a task after a specified time away

 Number and type of errors per task.

 Number of errors per unit of time.

 Number of times online help and manuals accessed.

 Number of users making an error.

 Number of users successfully completing a task.

Different No order effects Many subjects &

• Observation & interviews

• Experiments test a hypothesis by manipulating certain variables while keeping others

• The experimenter controls independent variable(s) in order to measure dependent

You might also like