HCI Module 6
HCI Module 6
MODULE 6
Validation
• Validation in HCI refers to the process of assessing
whether a design meets its intended goals and
satisfies the needs of its users.
• It is a critical phase in the design process aimed at
ensuring that the developed system or interface is
effective, efficient, and user-friendly.
• Involves gathering feedback from users or
stakeholders through various methods such as
usability testing, user interviews, surveys, and
expert evaluations.
Validation Techniques
• Usability Testing
• User Interviews
• Surveys
• Expert Evaluations
• Heuristic Evaluation
• Cognitive Walkthrough
• A/B Testing
• Prototype Testing
• Field Studies
Usability Testing
• 5) Find solutions.
Usability Testing Methods
• In-person testing
• Remote testing
• Guerilla testing
Usability Testing Process
1. Plan –
a. Define what you want to test.
• Ask yourself questions about your design/product.
• What aspect/s of it do you want to test?
• You can make a hypothesis from each answer.
• With a clear hypothesis, you’ll have the exact aspect you
want to test.
b. Decide how to conduct your test – e.g., remotely.
• Define the scope of what to test (e.g., navigation) and stick to
it throughout the test.
• When you test aspects individually, you’ll eventually build a
broader view of how well your design works overall.
Usability Testing Process
2) Set user tasks –
Cognitive Walkthrough
Heuristic Evaluation
Review-based evaluation
Cognitive Walkthrough
• Example heuristics
• system behaviour is predictable
• system behaviour is consistent
• feedback is provided
• Model-based evaluation
• Disadvantages:
• lack of context
• difficult to observe several users cooperating
• Appropriate
• if system location is dangerous or impractical for constrained single user
systems to allow controlled manipulation of use
Field Studies
• Advantages:
• natural environment
• context retained (though observation may alter it)
• longitudinal studies possible
• Disadvantages:
• distractions
• noise
• Appropriate
• where context is crucial for longitudinal studies
Evaluating Implementations
Requires an artefact:
simulation, prototype, full
implementation
Experimental evaluation
• Null hypothesis:
• states no difference between conditions
• aim is to disprove this
• Type of data
• discrete - finite number of values
• continuous - any value
Analysis - types of test
• Parametric
• assume normal distribution
• robust
• powerful
• Non-parametric
• do not assume normal distribution
• less powerful
• more reliable
• Contingency Table
• classify data by discrete attributes
• count number of data items in each group
Analysis of data (cont.)
• What information is required?
• is there a difference?
• how big is the difference?
• how accurate is the estimate?
Problems with:
• subject groups
• choice of task
• data gathering
• analysis
Subject groups
problems:
• synchronisation
• sheer volume!
one solution:
• record from each perspective
Analysis
N.B. vast variation between groups
solutions:
• within groups experiments
• micro-analysis (e.g., gaps in speech)
• anecdotal and qualitative analysis
Contrast:
psychology – controlled experiment
sociology and anthropology – open study and rich data
Observational Methods
Think Aloud
Cooperative evaluation
Protocol analysis
Automated analysis
Post-task walkthroughs
Think Aloud
• user observed performing task
• user asked to describe what he is doing and why, what he thinks
is happening etc.
• Advantages
• simplicity - requires little expertise
• can provide useful insight
• can show how system is actually use
• Disadvantages
• subjective
• selective
• act of describing may alter task performance
Cooperative evaluation
• variation on think aloud
• user collaborates in evaluation
• both user and evaluator can ask each other questions throughout
• Additional advantages
• less constrained and easier to use
• user is encouraged to criticize system
• clarification possible
Protocol analysis
• paper and pencil – cheap, limited to writing speed
• audio – good for think aloud, difficult to match with other protocols
• video – accurate and realistic, needs special equipment, obtrusive
• computer logging – automatic and unobtrusive, large amounts of data difficult to
analyze
• user notebooks – coarse and subjective, useful insights, good for longitudinal studies
Interviews
Questionnaires
Interviews
• analyst questions user on one-to -one basis
usually based on prepared questions
• informal, subjective and relatively cheap
• Advantages
• can be varied to suit context
• issues can be explored more fully
• can elicit user views and identify unanticipated problems
• Disadvantages
• very subjective
• time consuming
Questionnaires
• Set of fixed questions given to users
• Advantages
• quick and reaches large user group
• can be analyzed more rigorously
• Disadvantages
• less flexible
• less probing
Questionnaires (ctd)
• Need careful design
• what information is required?
• how are answers to be analyzed?
• Styles of question
• general
• open-ended
• scalar
• multi-choice
• ranked
Physiological methods
Eye tracking
Physiological measurement
Eye tracking
• head or desk mounted equipment tracks the position of the eye
• measurements include
• fixations: eye maintains stable position. Number and duration indicate level of
difficulty with display
• saccades: rapid eye movement from one point of interest to another
• scan paths: moving straight to a target with a short fixation at the target is
optimal
Physiological measurements
• emotional response linked to physical changes
• these may help determine a user’s reaction to an
interface
• measurements include:
• heart activity, including blood pressure, volume and pulse.
• activity of sweat glands: Galvanic Skin Response (GSR)
• electrical activity in muscle: electromyogram (EMG)
• electrical activity in brain: electroencephalogram (EEG)
• some difficulty in interpreting these physiological
responses - more research needed
Choosing an Evaluation Method
when in process: design vs. implementation
style of evaluation: laboratory vs. field
how objective: subjective vs. objective
type of measures: qualitative vs. quantitative
level of information: high level vs. low level
level of interference: obtrusive vs. unobtrusive
resources available: time, subjects, equipment,
expertise