Untitled document (5)
Untitled document (5)
5
Dr Rox notices that her students always
seem more alert following a cup of coffee
and decides to carry out an experiment
to investigate the effect of caffeine on reaction time. She gives one group of students a 300 ml cup of coffee
containing 100 mg of caffeine and tests their reaction time immediately
after they have drunk it. She compares their results
with another group of students who are given a 300 ml glass of water (and no
coffee). She predicts that the average reaction time for the caffeine group will
be significantly faster than the average reaction time in the no-caffeine group.
» The independent variable is A
whether the participants are given
caffeine
or water (this is what
Dr Rox manipulated).
» The dependent variable is participants'
170
reaction times measured in milliseconds on the reaction time test.
When Dr Rox analyses the results, she finds her prediction was correct, but can you think of any reasons why
Dr Rox should not conclude that it was the caffeine that caused the faster reaction times?
Laboratory experiments
Location
The term laboratory refers to the setting (location) ni which the experiment takes place. Alaboratory is any
setting over which the experimenter has a high level of control - for example, they can control the temperature,
lighting and noise level. In Dr Rox's caffeine study, the students would be brought to a specific room at the
university where they are given the coffee or water. Next, they would complete a specially designed
reaction-time test on a computer - for example, touching the screen as quickly as they can every time they see
the number 2on screen. The time taken to press the button wil be measured ni milliseconds.
Controls
Controlled variables are kept the
same between
all levels (groups or conditions) of the independent variable, to ensure that only the independent variable is
causing any effect on
the dependent variable. Working in a laboratory setting allows researchers to control many variables. This
means the researcher can be much more confident that the independent
variable really did cause the changes ni the dependent variable. In Dr Rox's study, fi the room was rather hot
on the day the coffee group took part but colder on the day the water
group took part, the difference ni temperature might affect reaction times. Therefore, Dr Rox should control the
temperature of the room and the temperature of the water and the coffee. She would also need to control
noise levels within the room, ensuring that both
groups have the same quiet environment and do not become distracted.
Evaluating laboratory experiments
Validity
Another strength si that, because of the high levels of control, researchers are able to infer cause and effect
because they have isolated the variable ot be manipulated and the variable that si being measured. For
example, ni an experiment on the effects of caffeine on reaction time, if the researchers found a significant
difference in the results between the caffeine
and the control group, they could say that caffeine had an effect on reaction time.
0:003
IV: the drink (coffee or water)
DV: time taken to press
the button (milliseconds)
Figure 5.4 Dr Rox's experiment to investigate the effect of caffeine on
reaction time.
IV, independent variable; DV, dependent variable
Aweakness of some laboratory experiments is that they can be low in ecological validity because the tasks
participants are asked to perform sometimes do not reflect what they would do ni real life. For example, ni
Pozzulo et al's (line-ups) experiment, participants
were asked to watch some videos, then identify cartoon and (real) human characters from the videos ni photo
line-ups. This si not the same as witnessing a crime ni real life, where people are unlikely to be expecting the
event, may feel more intense emotions and wil be viewing the situation ni real life not on a screen. These
factors are likely to affect their recall for faces in a different way to a calm, controlled setting. This means that
the results of laboratory experiments should only be generalised to everyday life with caution as when people
are in a real-life setting, they may behave very differently.
Another weakness is that validity can also be affected by demand characteristics, which
are features of the environment that reveal the aim of the study. fI participants are able to work out the aim of
the study, they may act in a way they feel the researchers want them to,
rather than behaving how they normally would.
Reliability
One strength of laboratory experiments is that the setting means the procedure can be
highly standardised. This means that al participants within each group/condition have the exact same
experience as each other. This is important because it means other researchers
should be able to replicate the study to test the findings for reliability (i.e. consistency).
Ethics
Astrength is that laboratory experiments should be more ethical than field experiments (see below) because
participants know they are taking part in research (even if they have not been told the exact hypothesis). They
will have given their consent to take part and
wil be made aware of their right to withdraw. Unlike field experiments, researchers are generally able to debrief
their participants following any deception. This is important as upholding ethical guidelines helps to ensure that
participants are treated with respect and that they leave the research setting ni the same psychological state
as they were ni when they arrived.
Field experiment
Location
When psychologists talk about working in 'the field', they mean conducting research in real-world settings.
Away from the controlled world of the laboratory, field experiments take place in natural environments, where
one might expect to see the behaviours of interest, such as the street, supermarket or workplace. One or more
independent variables are still
manipulated by the experiment ni order ot observe the effect on the dependent variable(5). However, the
natural environment means that ni comparison with laboratory experiments,
there may be uncontrolled variables both within and between groups/conditions. These are factors that are not
being measured or controlled by the researcher and can have an
unwanted effect on the dependent variable. These could be things such as the temperature of the room or
noise levels.
How could Dr Rox redesign her study as a field experiment? Where would it take place and how could she
measure reaction times in a more naturalistic way?
Controls
When researchers conduct field experiments, they try to maintain control over variables that could affect the
dependent variable to allow cause-and-effect conclusions to be drawn, but this is much more difficult than in a
laboratory environment. In natural environments, like
public places (e.g. Piliavin et al., subway Samaritans), it si not possible to control how many people board the
train or how close people are standing to each other, for example. These uncontrolled, situational variables
(see page 201) make it hard to draw firm conclusions.
171
5.1 Research methods
5 RESEARCH METHODOLOGY
172
Can you think of any variables that would be difficult to control fi Dr Rox decided to conduct her study in a
coffee shop? Imagine she has ten students in each group, would it be possible to ensure that they all have the
exact same experience as each other?
Evaluating field experiments
Validity
Astrength of field experiments is that they generally have higher ecological validity than
a laboratory experiment, meaning that participants' behaviour si more likely to be natural as they are not in an
unfamiliar laboratory setting and often unaware that they are even
taking part in the research. This means their results should be generalisable to other similar situations. Another
strength of field experiments is that participants are also much less likely to show demand characteristics fi
they do not know they are taking part ni research. This means that they are unlikely to try to work out what the
aim of the research si and act in the way that they
think the researchers want them to.
Field experiments take place in real-world settings. The behaviours of interest are observed/measured ni
everyday situations. For this reason, they tend to have higher ecological validity. Also, participants are less
likely ot be affected by demand characteristics, especially fi they do not even know they are in a study. The
researcher
still manipulates an aspect of the situation (e.g. Piliavin et al. staged an emergency on the subway, see page
154) and so the setting si not completely natural.
» Do you think it si possible for a laboratory experiment to have ecological validity? » Do you think ti si possible
for a field experiment to lack ecological validity?
Reliability
Aweakness si that researchers have much less control ni a field experiment. For example, ni
the Piliavin et al. experiment (see page 153), researchers could not control who was sitting in the train carriage
or how crowded it was when the experiment took place. Neither could
they control whether the passengers had already seen the confederate collapse in a previous
trial, leading them to become suspicious and potentially affecting their response. Field experiments tend to be
more difficult to replicate due to the limited ability to standardise
the procedure. This makes ti difficult to test the reliability of the findings.
Ethics
Aweakness of field experiments si that participants often do not know that they are taking part ni research, so
consent cannot be sought, which also removes the right to withdraw.
When the deception si revealed, ti may lead participants ot doubt psychologists ni the future. Deception should
only be used when the research aims cannot be met without it taking place
and only if the participants are unlikely to feel discomfort on finding out about the deception. The research
design should protect the dignity of the participants.
Control groups and/or conditions
Experiments involve making comparisons between data obtained in two or more groups or conditions.
Astrength of control groups/conditions is that the data provided in this part of the experiment provides a
baseline to which the data from the experimental group or condition can be compared. Control
groups/conditions allow the researchers to confirm that the results of their study are due to the manipulation of
the independent variable. In
Dr Rox's study, the water group provides the reaction speed baseline to which the coffee group's average
speed can be compared.
Can you think of a different control group that Dr Rox could have used to provide the baseline in
her study?
RESEARCH METHODOLOGY
174
KEY TERMS random allocation
experimental design
experimental group control group
independent measures baseline
repeated measures control condition
demand characteristics
order effects (practice and fatigue]
counterbalancing experimental condition
matched pairs
Think!
How would you rank the three experimental designs in terms of their practicality and validity? Put the
experimental designs in order from least to most practical and least to
most valid. If possible, discuss this with a classmate to see whether you agree.
In Dr Rox's caffeine experiment (see page 170), the students in the experimental group consumed caffeine
and their average reaction time was compared with the average reaction time of the participants in the control
group, who did not consume caffeine
the water group).
» Which experimental design was used ni this study? » What were the problems of using this design?
» How could Dr Rox have improved her experiment?
Independent measures design
In an independent measures design, participants are put into two or more different
groups, relating to the levels of the independent variable. These groups are sometimes
called the experimental group and the control group. Data from the control group provides a baseline to which
the researcher can compare the data collected in the experimental group. For example, in the study by
Andrade (doodling), participants in the experimental group
were asked to shade circles and squares on an A4 piece of paper, while participants in the control group were
not allowed to shade or doodle. The average number of words recalled by
the two different groups of participants was then compared to see fi there was a difference between them.
An independent measures design uses different participants for each condition of the experiment. In what way
might using different participants for different conditions
affect the validity of the results, and how might this be avoided?
Random allocation and participant variables
Another key feature of laboratory experiments is that the researchers are able to randomly
allocate participants into either the experimental or the control group. This is often done
by assigning each participant a number and then using a random number generator to select participants for
the two or more groups.
Without random allocation, it is possible that members of one group all share some
common characteristic that makes them different from the people ni the other group. These characteristics are
called participant variables, and may be things like gender, age or level of intelligence or empathy. When this
happens, ti is impossible to know whether differences in the dependent variable result from the manipulation of
the independent variable or the
participant variables. The purpose of random allocation, therefore, si to increase validity (see
below).
Dr Rox realises that there might be some
important participant variables affecting the students' results in her study on the effects of caffeine on reaction
times
(see page 170). An important participant variable in her study might be how much
coffee (and other caffeinated drinks/ snacks) the students usually consume. If all
the heavy-caffeine users end up in the
coffee group and all the light-caffeine
users, by chance, end up in the water
group, ti would be unclear whether any difference in average reaction times results
from the drink consumed before the test or
differences ni the students' regular caffeine consumption. How could Dr Rox reduce the
effect of participant variables ni her study?
A Figure 5.5
176
In Dr Rox's experiment on the effects of caffeine on reaction time (see page 170),
the participants might perform better the second time
they do the reaction time test because they have already done it once
and had the chance to practise. Similarly, participants may perform worse in the
Counterbalancing
second condition as they have already been concentrating on the screen in the
first condition (fatigue effect). They might also
need the toilet fi they have consumed both a cup of coffee and a glass of water! How could
you set up the experiment to avoid order effects?
Counterbalancing is a technique that can be used ni a repeated measure design so that order effects do not
compromise the validity of the findings. Participants are put into one of two
groups: group one does the experimental condition first and then the control condition, and group two does the
control condition first and then the experimental condition. This
way, although practice effects wil still occur, their impact on the findings si reduced.
Matched pairs design
In a matched pairs design, each participant ni the experimental group wil be matched with a participant in the
control group on a number of relevant factors. This means for every participant in the experimental group, the
control group will contain a person who is very
similar to them on key variables that might have otherwise affected the dependent variable of the study. For
example, participants could be matched on handedness (preference for using the right or left hand), gender,
socioeconomic status or age. Amatched pairs design overcomes the problem of participants' variables and
order effects, but in practice, they can be difficult to set up as careful screening of the participants is necessary
in order to match the groups. It can also be difficult to recruit people who have the necessary characteristics.
Dr Rox is still concerned that participant
variables might affect the validity of her conclusions about the effect of caffeine on
reaction times. She decides to use a matched
pairs design where she will match the participants
ni each condition on age, sex, handedness and experience in playing video games, all variables
which may affect the validity of her conclusions. What would she need to do to set up her matched groups of
participants?
• Figure 5.7 In matched pairs designs,
participants are matched across the experimental and control conditions
LEARNING LINK
Experimental
group: Soraya, female,
aged 22, left-handed
Control group:
Elina, female, aged 22,
left-handed
The research by Hölzel et al. (mindfulness and brain scans) uses a combination of both independent
measures design and repeated measures design (see page 29).
Participants in the experimental group took part in an eight-week Mindfulness-Based Stress Reduction
(MBSR) programme. MRI brain scans were conducted both before
and after the intervention. Data from the scans was compared to see fi the MBSR
programme had changed grey matter concentration.
• Which part of Hölzel's study is independent measures and which part is repeated measures?
• Why was it important in this study to also have a control group?
5 RESEARCH METHODOLOGY
5 RESEARCH METHODOLOGY
178
One special type of experiment is called a randomised control trial (RCT). Randomised control trials (RCTs)
are a special type of experiment. They use various techniques to
increase validity and they are often referred to as the 'gold standard' in clinical and health research, such as
drug testing. For example, participants might be randomly assigned to either an experimental group (who are
given the new drug) or a control group. The control groups ni RCTs sometimes receive no treatment or they
receive a placebo.
The placebo effect
Sometimes it is hard to know whether the positive effect of a new drug is due to the drug itself or the fact that
participants expect their symptoms to improve once they begin the new course of treatment. In order to isolate
how much of any improvement is due to the
drug and how much si due to expectancy effects, researchers often use placebo treatments. Here participants
ni the control group receive a treatment that is administered ni exactly the same way as the real treatments
(e.g. in the case of orally administered
tablets, the tablet will look, smell and taste just like the real drug) but contains no active ingredients.
Importantly, the participants do not know whether they have received the real drug or the placebo. This si
called a single-blind design. Changes ni symptoms (before and after
treatment) are calculated for both groups. For example, ni the study by Perry et al. (personal space),
participants were asked to self-administer either oxytocin or a saline solution (placebo), before taking part in
experiments on personal space. They were unaware whether they had received the drug or the placebo (the
saline solution) to avoid this knowledge
affecting their behaviour.
Often, placebo groups experience significant improvements over time (the placebo effect) but any significant
difference ni the amount of improvement between the experimental and control groups must be caused by the
independent variable - that is, the presence of the active ingredient in the real drug.
RCTs are also used in studies investigating the efficacy of psychological treatments such as cognitive
behavioural therapy (CBT) and other interventions to improve health and wellbeing. Here the placebo
treatment is something that is similar to the genuine
treatment/intervention ni many ways (e.g. weekly appointments with a counsellor), but without the CBT
element.
Double-blind design
As noted above, participants do not know whether they have received the placebo or the
real drug but often the researcher who measures the severity of the participants' symptoms is also unaware
whether the participants are in the placebo group or treatment group. This is
called a double-blind design. In some studies, the person analysing the data is also unaware whether data
points were contributed by people ni the placebo or intervention group and this is known as a triple-blind
design!
Evaluating RCTs
Validity
Astrength of the double-blind design is that it reduces experimenter bias as well as reducing
demand characteristics. For example, fi the experimenter knew who was ni the placebo group they might
expect those participants to have experienced less improvement over the course of the study. The
experimenter's expectations could affect the data provided by the
participant, reducing the validity of the findings. Adouble-blind design makes sure this cannot happen, ensuring
the experimenter's assessment of the participant remains objective.
Reliability
Another strength is that the double-blind design allows researchers to test interventions (e.g. treatments and
therapies) in a controlled way. As none of the experimenters who come
into contact with the participants are aware which groups they are in, al participants are treated in exactly the
same way by the experimenters, meaning that their assessments can be replicated and tested for inter-rater
reliability.
Ethics
Aweakness is that valid, informed consent is not possible. Double-blind RCTs rely not
only on the researchers being unaware which group (placebo or genuine treatment) the participant is in, but
also the participant, meaning they are not giving fully informed consent. This is important, however, as ti
preserves the validity of the findings and arguably the benefits of the research outweigh the cost to
participants, especially given that the participants have consented to the possibility that they will receive a
placebo.
Aweakness of RCTs is that it could mean that participants randomly assigned to the control group do not have
access to treatments that could significantly improve their quality of
life. In order to still conduct important RCTs on treatments for mental health disorders, the
control groups are often offered the treatment they missed out on as soon as the study has ended. The start of
treatment, however, would still be delayed, potentially causing unnecessary psychological harm. They are
often referred to as a wait-list control group.
In Dr Rox's study about the effect of caffeine on reaction times (see page 170), the control group drank a cup
of water. Now that you have a better understanding of expectancy effects, you should be able to explain why
the coffee group might have experienced faster reaction times than the water group for psychological rather
than
biological reasons.
» How could Dr Rox have turned her study into a RCT with a placebo group and a
double-blind design?
LEARNING LINK
Lewin et al. (1992) used a randomised control trial to test the effectiveness of a home-
based exercise programme to help improve fitness after a heart attack (see page 617). Participants were
randomly assigned to receive either the self-help rehabilitation
intervention or standard care plus a placebo package of information and informal counselling.
T E S TYOURSELF
1 Explain one strength and one weakness of using randomised control trials. [4]
Suggest one ethical issue which may arise from the use of randomised control
trials, using any example. [2]
3 Dr Phil would like to test a new drug he has developed to support individuals with
gambling disorders. Suggest how Dr Phil might use a randomised control trial to
test whether the new drug is more effective than a placebo. [2]
4 Hajra would like to find out whether yoga has a positive impact on people's subjective wellbeing, as previous
research has suggested. Suggest one reason why a
randomised control trial may not be a suitable method to investigate this. [2]
5
Malcolm is investigating the effectiveness of electro-convulsive therapy (ECT) as a treatment for
schizophrenia. He plans to use a randomised control trial
where the control group think they have been given ECT but have not.
a Suggest two ways that Malcolm can ensure that his data is objective.
b Outline one difference and one similarity between the experimental and
[4]
control groups in Malcolm's study. [4]
6 Jolene has had pain in her shoulder for months following a car accident. She is participating in a randomised
control trial to test the effectiveness of acupuncture.
Explain one ethical issue relating to randomised control trials. You must refer to Jolene in your answer.
[2]
179
0.........dood
5 RESEARCH METHODOLOGY
5
KEY TERMS
open question closed question
self-report
questionnaire interview
rich (as in data)
target population
5.1.4 Self-reports: questionnaires
Think!
What is the difference between an open question and a closed question?
Think!
Imagine you are investigating the effect of stress on the content of people's dreams.
Which type of question (open or closed) would be best to find out about people's stress levels and their dream
content?
Think!
Why do you think some researchers prefer rating scales with an even number of choices,
such as six-point scales from 0-5, and others prefer to use scales with an odd number, such as seven-point
scales from 0-6?
Self-reports are a method of research where the participant responds to either open or closed questions. Data
may be gathered about their beliefs and opinions as well how they believe they would behave in certain
situations. Self-reports include both questionnaires and interviews and may collect both quantitative and
qualitative data.
Questionnaires
Questionnaires can be completed in a number of ways. Printed questionnaires may be completed by hand, but
nowadays most questionnaires are distributed digitally and participants complete them online. This saves
printing and postage costs and data can be collated much quickly and easily. This said, online questionnaires
can only be completed
by participants who have access to the internet and understand the technology, meaning traditional 'pen and
paper' surveys are still used and may be more convenient for many types of research.
Closed questions
Closed questions offer a fixed choice of answers. They may use some form of scale such as a Likert scale, in
which the participant can circle or tick the response to show the extent to
which they agree with a statement. An example of a Likert scale might be:
I do not like people invading my personal space, especially fi I do not know them.
180
123
45
Please circle the response that best fits your view: 1= Completely disagree, 2= Disagree, 3= Neither agree nor
disagree, 4= Agree, 5 = Completely agree.
Here si another example of a closed question is: D' o you find mindfulness meditation relaxing? Yes/No. This
type of closed question may not provide enough options to reflect
the participants' range of experiences with mindfulness meditation. For example, it si not possible to
communicate that you occasionally find mindfulness meditation relaxing but
often find ti ineffective. In this example, a rating scale may allow participants to provide more valid responses.
For example, To what extent do you agree: Mindfulness meditation helps me to relax 0=Never found it helpful
to 5=Aways very helpful.
Although dichotomous questions (with only two answers) are often not very useful, they may be helpful fi you
wish to provide contingency questions. These allow the participant to ignore questions that are not relevant to
them. For example:
Have you ever tried mindfulness meditation as a way of relaxing? Yes/No. If yes, please answer questions 2-5.
If no, please continue to question 6.
5
2 Ambiguous (unclear) questions: it is extremely important that participants can
understand what the question si actually asking. If not, it wil affect the validity of their responses. Pilot studies
are useful to 'trial' drafts of newly developed questionnaires.
Technical terms: avoid using technical language unless your target population wil understand it. For example, fi
you were only going to be using doctors as your target population, ti would be acceptable to use medical
terminology, but not fi your target population had no medical training.
Emotive language: you should avoid using words that are likely to offend or upset your participants for ethical
reasons but also emotive language may be persuasive and lead participants to give an answer that they would
not have given if the question was worded differently.
STUDY TIP
fI you are creating a questionnaire, ti is best to undertake a small pilot study on a few
participants before distributing ti to a larger number of people. This will help you to ensure that participants
understand all of the questions and are able to answer as many as
possible. fI the pilot study highlights any problems, these can be resolved before collecting any further data.
Evaluating questionnaires
• Table 5.2 Advantages and disadvantages of questionnaires
182
Advantages
Relatively quick and easy to obtain the views
of a large number of participants; large sample size means results should be more
representative, although this also depends on the sampling technique (see page 199).
Participants may be less likely to give socially
desirable answers than during interviews
because they are not answering face-to-
face and their responses can be provided anonymously.
STUDY TIP
Disadvantages
Participants may be less willing to elaborate fuly on answers to open questions when writing, therefore limiting
the depth of data collected.
Participants may still give socially desirable answers. These are answers that make
them look good or acceptable within their society but do not actually reflect their own viewpoint.
Although researcher bias and subjectivity can be an issue in the analysis of qualitative data, ti does not mean
that al researchers wil show researcher bias. When evaluating methods and studies, it is best to use words
such as 'could' and 'might' to show that bias is a possibility rather than a certainty.
Psychologist Hans Eysenck created a well-known personality test called the Eysenck Personality Inventory
(EPI). It includes nine questions to assess whether a participant is likely to have given socially desirable
answers or not (the lie scale) (Eysenck and Eysenck,
1991). Anyone scoring five or above was probably lying. This allows researchers to tell fi the participant is
likely to be giving truthful answers about the rest of the personality questions. For example, one question on
the EPI si I'f you say you will do something do you always keep your promise, no matter how inconvenient it
might be to do so?' If the participant responds 'yes' to this question, it is likely that they are lying and giving a
socially desirable answer as most people at some point break a promise. Do you think it is valid to assume that
participants lied on other questions based on their answers to the
lie scale questions?
5 RESEARCH METHODOLOGY
5
KEY TERMS structured interview unstructured interview
semi-structured interview
in-depth
5.1.5 Self-reports: interviews
Think!
What are the key features of a semi-structured interview?
Think!
Imagine you are investigating unusual phobias (e.g. buttons, flowers). You are interested to know whether
people can remember any traumatic events that might have become
associated with their feared object. Why might an interview be more effective than a questionnaire for
collecting data about this topic?
Think!
There are many factors that can affect the validity of the data obtained in an interview, from the experience and
appearance of the interviewer to the location of the interview. What advice might you give to a researcher to
ensure their interview data si valid?
Interviews
Interviews differ from questionnaires ni that the researcher speaks directly to individual participants, asks them
questions and records their responses. Interviews are generally done on smaller numbers of participants as
they are far more time-consuming to conduct.
Interviews can be
conducted face-to-face or by telephone.
Just like questionnaires, interviewers can ask both open and closed questions. However, interviews are likely
to have more open questions as people are often willing to speak ni greater depth about a topic than they are
to write about ti ni a questionnaire.
•
Table 5.4 Types of interview
184
Interview format Structured interviews
Unstructured interviews
Semi-structured interviews
Key features
The researcher has a pre-determined, fixed list of questions; al participants are asked the same questions, in
the same order.
The main topics are pre-determined.
There will be a research question that steers the direction of the
interview; the interviewer does not follow a set list of questions.
Questions are based on whatever the participant chooses to talk about.
The researcher has a number of key questions that guide the
interview, but they can be asked in any order and the interviewer may introduce new questions based on what
the participant
says.
The interviewer is also able to ask follow-up questions to clarify anything that the participant has said and to
probe more deeply into interesting or unexpected areas.
Some of the questions will be the same for each participant but not all.
5 RESEARCH METHODOLOGY
186
Psychometric tests
The word psychometric literally means 'measuring the mind. Psychometric tests are carefully designed to
provide valid and reliable measures of individual differences between
people relating to personality, aptitude, knowledge or skills. The tests are standardised
on large random samples in order to establish norms or average scores. This means that an
individual's scores can be compared to the norm to reveal whether they score higher or lower than average,
and if so, how much higher or lower than other people who are comparable to
them (e.g. the same age).
Floor and ceiling effects
fI tested on a large enough random sample, a valid psychometric test will provide scores which range from
very low to very high. Most people should score somewhere in the middle.
This is called a normal distribution. Some tests do not differentiate between people very well - for example, the
range of scores is low. If the majority of the scores are very high, we
get a ceiling effect - people are unable to achieve any higher (they are bumping their heads on the ceiling!).
Alternatively, fi everyone receives a very low score, this si called a floor effect.
Again, the test is not valid as it does not allow us to rank the participants from those with
the highest scores to those with the lowest scores. An intelligence or QI test, for example,
must not be too hard (to avoid floor effects) or too easy (to avoid ceiling effects). In a clinical setting,
researchers must also ensure that the tests cover a range of scores - for example, a test of depression must
be able to show scores from not depressed at al to severely depressed.
•
Table 5.6 Evaluating psychometric tests
Advantages
Psychometric tests are objective as the tests produce numerical data, which does not require the subjective
interpretation of
psychologists.
Psychometric tests are a fast, cost-effective way of shortlisting job applicants before interviewing, which may
allow employers to recruit more suitable candidates.
LEARNING LINK
Disadvantages
Participants may respond to demand characteristics and answer in a way they think
is expected of them, reducing the validity of the data collected.
If a participant feels under stress, ti may affect how they respond on a psychometric
test, meaning that it is less likely to be valid or reliable (if they were tested again, when
less stressed, they may respond differently).
•
Whichever options you are studying, you will find psychological research using
psychometric tests ni al the applied areas, including Clinical, Consumer, Health and Organisational
Psychology. For example, a visual analogue scale (VAS) is a psychometric scale for measuring attitudes,
feelings or characteristics that cannot be directly
measured, like subjective perception of pain, along a continuum of values (see example ni Figure 5.8). Can
you think of any reason why a scale like this might not provide valid data? Do you think there are any groups of
people in particular that might not understand
this scale very well? What implications might this have? 23456789
Nagging,
Intense. Mild, annoying
uncomfo
Distressing,
rtable,
No pain
miserable
dreadful, pain
troublesome
pain
horrible pain pain
Figure 5.8 An example of a visual analogue scale for measuring pain
10
Worst possible, unbearable, excruciating
pain
5 RESEARCH METHODOLOGY
188
•
Table 5.7 Advantages and disadvantages of case studies
Advantages
Case studies are detailed investigations of individual people or small groups (units) of people, such as people
from the same family or a single shop or health clinic. They tend to be used to study unusual or rare cases. For
example, in the Saavedra and Silverman (button phobia) study, a nine-year-old Hispanic-American boy's
unusual phobia was investigated.
Akey feature of the case study as a research method si that data si gathered using a number of different
techniques, such as interviews, observations and psychometric tests. This is known as triangulation. Data is
often gathered over an extended period of time; therefore,
most case studies are also longitudinal studies. The data gathered may be a combination
of both qualitative and quantitative data. As well as gathering information from the participants themselves,
information may also be gathered from family, friends, school records, psychiatric records, the workplace or
any other relevant sources of information. When psychologists write up case studies for publication, they often
begin with a case history, which provides rich and detailed background information, which often helps to put
current behaviour into context.
Evaluating case studies Disadvantages
Researchers can gather rich and detailed information about an individual or small group of individuals. This
gives them a
unique insight into their participants' lives in order to gain a good understanding of whatever aspect of
behaviour they are studying.
As case studies use more than one method of gathering information, researchers can triangulate the data to
ensure its
For case studies, the researchers often work very closely with a participant, usually over an extended period of
time, so they get to know the participant very wel. This could result
in the researcher being less objective when analysing the data because they may be influenced by how they
feel about the participant, reducing the validity of the research.
It is very hard to replicate a case study, simply because ti is a detailed analysis of one unique individual, or a
small group of individuals. It might be that their specific experience leads them to act ni a certain way that is
not representative of others, meaning that findings may not be generalisable to others.
validity.
Case study evidence is sometimes criticised for being unreliable because it can be hard to replicate. However,
some case studies include quantitative data collected using standardised procedures and/or psychometric
tests. These measures can be checked for reliability. Can you think of any ways of checking the reliability of
qualitative data, from
interviews, for example?
LEARNING LINK
Saavedra and Silverman's case study (button phobia) (see page 114) collected both qualitative and
quantitative data using a variety of methods, such as structured interview, observation and the use of a
psychometric scale to measure subjective distress. The study was longitudinal as the boy's behaviour was
studied before, during and after treatment and he also attended follow-up sessions at 6 and 12 months after
treatment. How might the researchers redesign the study to increase generalisability?
STUDY TIP
While case studies only focus on one, or a small group of, unique individual(s), making it
difficult to generalise from the findings, it is important to consider the significance of the topic being
investigated. Consider whether the participant/s of the case study is/are truly
unique, or representative of a certain group of individuals. While the nine-year-old boy
in the study by Saavedra and Silverman (button phobia) had a rare phobia of buttons, the
researchers aimed to establish an effective treatment for disgust-based phobias, which could be more widely
applied to others.
Advantages
Unstructured observations can be used as part of a pilot study
to give researchers a good understanding and overview of the
range of behaviours that they might observe. The researchers would then choose a limited number of these to
observe for
their main research.
An advantage of overt observations is that it is more ethical
because participants know they are being observed so it avoids having to deceive participants.
An advantage of covert observations is that it avoids any potential demand characteristics because the
participants do not know they are being observed.
An advantage of participant observations is that the
observer is not viewing from a distance and therefore may gain a
greater understanding of the participants' behaviour.
An advantage of non-participant observations is that the
observer si likely to be more objective ni their observations as they are not personally getting involved in the
study.
An advantage of naturalistic observations is that behaviour is likely to be normal as the participants are in their
own natural
settings and the researchers do not interfere ni any way. Therefore it is likely to be high in ecological validity.
An advantage of controlled observations is that they can be
more easily replicated by other researchers as they can use the same behaviour schedule and it is easier to
standardise
the situation for all participants. This makes controlled observations more reliable.
Disadvantages
In unstructured observations, it can be difficult to record al behaviour and some of the subtler (but interesting)
behaviours
might be missed.
Adisadvantage of overt observations is that participants
may show demand characteristics because they know they are being watched and therefore may not act in a
natural way
Adisadvantage of covert observations is that they are less ethical than overt observations because participants
do not know they are being observed and studied.
Adisadvantage of
participant observations si that the observer may become too involved with the people that they are observing,
and become less objective ni their observations.
Adisadvantage of non-participant observations is that
because the observer si watching from a distance some behaviours may be missed.
Adisadvantage of naturalistic observations is that it is
much harder to control for variables that might affect the participants' behaviour.
Adisadvantage of controlled observations is that behaviour may be less natural if participants are aware that
they are in a
controlled setting. Researchers might also miss key behaviours fi they have a very rigid behaviour schedule.
5
The children in Figure 5.11 have been told
that the researcher will be present at
lunchtimes this week, although they have not been told exactly what she is
researching. She keeps her distance and does not interact with the children or the
play-time supervisors.
» What are the four features of this
observation?
» How could each feature be changed and what impact would this have on the
validity and reliability of the data? » How can the psychologist ensure
ethical guidelines are followed in this observation?
LEARNING LINK
• Figure 5.11 These children are being observed by a psychologist who is recording acts of physical and verbal
aggression, using a behavioural checklist
In Fagen et al. (elephant learning), researchers observed whether elephants could be
taught to complete a trunk wash. It was a structured observation, as the researchers were observing the
elephants training sessions using a carefully operationalised
behavioural checklist. Imagine you are observing elephants in the wild and want to investigate interactions
between family groups. Would your observation be naturalistic or controlled? Would you choose to use a
structured or unstructured observation and why?
191
5.1 Research methods
Do you think time spent on video games improves reaction speed or do people with good reaction speeds
play more video games?
Think back to Dr Rox's study about the effects of caffeine on reaction time (see
page 170). The research method was an experiment as Dr Rox manipulated whether the
participants drank coffee or water before taking the reaction time test. She now wants to conduct a
correlational study on another variable that may or may not be related to
reaction time. She decides to investigate the relationship between time spent playing video games each week
and reaction speed.
» Do you think she will predict a positive or a negative correlation? How would this
correlation look on a scatter graph?
» How could she measure time spent on video games in a way that is valid, reliable and
ethical?
» Why is Dr Rox unable to conclude that playing video games improves reaction speed?
Correlational studies look at the relationship between two variables. These are called co-
variables. For each participant, researchers will gather two sets of data which they can plot on a scatter graph
to see whether there is a correlation between the two measures.
Operational definitions for co-variables
Just as it is important to operationalise your VI and DV in an experiment, it is also
important to operationalise your co-variables ni a correlation. This means stating exactly what they are and
how they will be measured. For example, Co-variable 1: average time
spent playing video games per week, measured in hours. co-variable 2: reaction speed on an app measured in
minutes, seconds and milliseconds. It is always important to operationalise your hypotheses for correlations.
Correlations are often used by researchers to investigate new areas of psychology before conducting
experimental research, or in cases where ti is not practical or ethical to manipulate variables.
Various methods may be used to collect data for correlations, as long as the data is
quantitative. In order to analyse the data, the pairs of scores are plotted against one another on a scatter
graph (see Figure 5.12) to see whether there is a correlation or not.
A scatter graph is a helpful way of quickly seeing whether two variables appear to be correlated or not. It
should also be possible to infer the direction and strength of the relationship.
Positive, negative and no correlation
When there is a positive correlation, the two variables being measured increase together - as
one value increases so does the other. For example, ni Dement and Kleitman (sleep and dreams), the longer
the duration of REM sleep (in minutes) the more words participants used to describe their dream. When there
is a negative correlation, as one value increases, the other decreases.
In Baron-Cohen et al. (eyes test), the higher the Eyes Test score the lower the Autism Spectrum Quotient
score. Sometimes there is no consistent relationship between the scores. In Hölzel
et al. (mindfulness and brain scans), there was no correlation between time spent on
mindfulness activities between the weekly MBSR sessions and grey matter concentration. Figure 5.12 shows
how each of these outcomes might look on a scatter graph.
Positive correlation Negative correlation No correlation A Figure 5.12 Types of correlation on scatter graphs
193
5.1 Research methods
5 RESEARCH METHODOLOGY
5
The strength of the relationship between the two variables can be represented by a correlation coefficient. In
general, the closer to 1 (for a positive correlation) or 1-
(for a negative correlation) that the coefficient is, the stronger the correlation is. Acorrelation of +0.75/-0.75 si
considered to be a 'strong correlation'. This means that the two variables being studied are very strongly
related.
Evaluating correlations
• Table 5.9 Advantages and disadvantages of correlations
194
Advantages
Correlations allow psychologists to investigate new areas of research to see whether they are worth
investigating further with experimental methods.
Correlations alow researchers to investigate areas where ti si not practical or ethical ot manipulate variables.
LEARNING LINK
Disadvantages
Correlations can only show the strength of the relationship between two variables and cause and effect cannot
be inferred. They also do not tell us why that relationship has occurred.
The presence of a third variable cannot be
ruled out ni correlations. Athird variable that is not being measured may have influenced
the relationship.
Baron-Cohen et al. (eyes test) (see page 60] found a negative (inverse) correlation between
performance on the Revised Eyes Test (where participants have to look at photographs of
people's eyes and decide which word best describes the mood of the person in the photo]
and the Autism Spectrum Quotient (a self-report questionnaire where a high score indicates responses
consistent with traits of autism]. The correlation was -0.53. What does this tell
us about the relationship between the Autism Spectrum Quotient and the Eyes Test?
STUDY TIP
It is important to remember that a correlation is not an experiment. It is critical to refer to your two variables as
'co-variables when planning a correlational study. There is no independent or dependent variable in a
correlation.
TESTYOURSELF
1 Explain what is meant by the term lack of causality' as a disadvantage of correlations in psychological
research.
[2]
2 Outline what is meant fi a study says that there was no correlation. [1]
3.
Dement and Kleitman (sleep and dreams) found a positive correlation between
length of REM and number of words in a dream report. Explain what is meant
by the term positive correlation, using this study as an example. [2]
4 Baron-Cohen et al. (eyes test) used both experiments and correlations in their research. State one similarity
and one difference between experiments and
[4]
[2]
[2] [2]
Trixie found a correlation between aggressive playground behaviour (scored
• Tradnig coer to eaetn i real payeoru
using a structured observation, where the higher the score, the more aggressive the child) and amount of time
spent playing violent video games. Explain why
6
Trixie cannot conclude that playing violent games causes childhood aggression.
Rebecca finds a negative correlation between empathy (measured on a self-report
questionnaire, where the higher the score, the more empathetic the person) and time taken to offer help to a
confederate in a staged emergency situation (in seconds).
a
Explain what the term 'negative correlation' means, using an example from Rebecca's study.
b
Suggest one weakness of Rebecca using a correlation in her study.
5 RESEARCH METHODOLOGY
196
Lots of people had left Mis Steps dance class. Why do you think that might be? What does this example tell
you about
attrition ni longitudinal studies?
• Figure 5.13 Posy was pleased with
her progress, but was Miss Steps, the teacher, really all that good?
LEARNING LINK
Hölzel et al. (mindfulness and brain scans) was an experiment with a longitudinal design into the effects of
mindfulness on the brain, as data was collected over an eight-week period. Al participants provided two sets of
data, once at the beginning and once at the end of the eight-week period.
STUDY TIP
Longitudinal designs can be a research method in their own right or they can be part of
an experiment with a longitudinal design, for example. An experiment with a longitudinal design will have
independent and dependent variables, with the effect of the independent
variable on the dependent variable being measured over time, whereas a longitudinal study will not have an
independent variable and will just involve repeatedly gathering data
from the participants over several weeks, months or even years.
TESTYOURSELF
1
Give one strength of experiments with longitudinal designs.
[2]
[2]
study as an example. [2] Explain one reason why a psychologist might conduct a longitudinal study. Use
an example from Saavedra and Silverman (button phobia) in your answer. [3]
Rick and his puppy, Basil, have been attending Dr Barker's puppy training classes every week for six months.
Basil has done brilliantly and is now much
more obedient and Rick cannot thank Dr Barker enough. At the end of the
programme, many of the original puppies and owners have dropped out. Outline
one reason why Rick should be cautious when making generalisations about the effectiveness of Dr Barker's
classes. [2]
6
Shawna has a son who has been diagnosed with autism spectrum disorder. She and her son are part of a
longitudinal study tracking children's social and emotional development. Shawna sometimes feels sad after the
monthly assessments as her
son is not making very good progress. She also has four other children and struggles to find the time to attend.
Explain two ethical guidelines that must be considered
[4]
2
Explain one disadvantage of longitudinal studies relating to validity.
3
The study by Hölzel et al. (mindfulness and brain scans) was an experiment with a longitudinal design. Outline
what is meant by a longitudinal design', using this
4
5
when conducting longitudinal studies. You must refer to Shawna in your answer.
198
Do you remember Dr Rox and her experiment about the effects of caffeine on reaction times? She's back and
she needs your help. Initially, she was going to measure reaction times using a computer program, where the
participant had to touch the screen every
time a number 2 appeared. The computer would record the participants' average reaction time across a series
of trials
in milliseconds.
» Before making a decision, Dr Rox wants to know fi you can think of any other ways she could measure
reaction time?
» See fi you can think of three alternatives. Remember, fully operationalised definitions include units.
The term experimental hypothesis is used for studies where this is an independent variable (e.g. laboratory
and field experiments), whereas the term alternative hypothesis is used in non-experimental studies, such as
correlational studies. Hypotheses can be directional (also
known as one-tailed) or non-directional (or two-tailed).
Directional hypotheses
With a directional hypothesis, the researcher will predict whether the experimental group wil perform
significantly better or worse than the control group (or ni a correlation,
whether ti wil be a positive or a negative correlation). Directional hypotheses tend to be used where previous
research has been done in the area, so the researcher has a good idea of
what results to expect.
Dr Rox decides to use a directional hypothesis for her study on the effects of caffeine on reaction times. She
quickly scribbles her hypothesis down and asks you to operationalise
it for her. What does she mean and how would you do this?
Hypothesis: People who drink a cup of coffee wil have quicker reaction times than people who drink a cup of
water.
Tip: You need to clarify the size of the cup and the exact amount of caffeine, as well as stating exactly how
reaction time will be measured and in what units.
Non-directional hypotheses
For a non-directional hypothesis, the researcher states that there will be a significant difference between the
experimental and control groups but does not state in which direction
that difference will be - for example, whether the experimental group will be faster or slower. Similarly, with a
correlation, the researcher will state that there wil be a significant correlation between two variables, but not
whether ti will be a positive or negative correlation.
Researchers tend to use non-directional hypotheses when there has not been previous research conducted in
the area that they are researching or when the research is contradictory regarding what the possible outcome
might be. Non-directional hypotheses
tend to start with the words 'There wil be a significant difference/correlation...
Dr Rox has changed her mind. She now wants you to rewrite her hypothesis but this time it should be
non-directional. Look back at your fully operationalised directional hypothesis above. What do you need to
change to make it non-directional? What wil stay the same?
Null hypothesis
The null hypothesis is the statement that the independent variable will have no effect
on the dependent variable, or that two variables are not correlated with each other. It is important for the null
hypothesis to also be operationalised. Nul hypotheses normally start
with 'There wil be no difference/correlation.., and they usually finish by saying that 'any difference that does
arise will be due to chance.
5 RESEARCH METHODOLOGY
You are looking forward to collecting some data now and maybe even having a cup of coffee of your own, but
there is one last thing to do. Dr Rox reminds you that you need to write a null hypothesis before you do
anything else.
» What will be different about the null hypothesis and what will stay the same?
Data analysis
Psychologists analyse quantitative data using descriptive statistics, as explained on
page 217. These help to summarise the data and make it easier to see whether there are any obvious patterns
or trends. You are not likely to encounter this ni your AS studies, but sometimes ti is hard to know whether
these trends are significant and so psychologists use a branch of mathematics called inferential statistics,
which allows them to calculate the
probability (chance) that their result would have arisen fi the null hypothesis was true. This is called a p value.
Having completed their analysis, the researchers will decide whether
to accept or reject their null hypothesis. The nul hypothesis si rejected fi the p value (the probability of a
significant result having occurred due to chance) is less than 0.05 1( in 20)
and ti is accepted fi ti si more than 0.05 1( ni 20). Inferential statistics are beyond the requirements of the
syllabus.
LEARNING LINK
In the experiment by Andrade (doodling), the independent variable was whether the
participants were allowed to doodle or not while they listened to the phone message.
Andrade had three dependent variables, which were operationalised as follows:
1 Monitoring accuracy: the number of correct names (out of eight) recorded while
listening to the tape
2 Memory for monitored information: number of correct names recalled (out of eight)
3 Memory for incidental information: number of correct places recalled (out of eight).
What are the main advantages of the way in which Andrade operationalised these variables? Can you see any
disadvantages?
STUDY TIP
Many of the core studies have more than one independent and dependent variable, which
means the researchers are testing multiple different hypotheses. For example, in the
study by Dement and Kleitman (sleep and dreams), there were three main experimental. hypotheses, each
with their own independent variables.
TESTYOURSELF
1
Explain what is meant by the term 'null hypothesis', using any example. [2]
2
Explain the difference between an independent and dependent variable, using
any examples. [4)
3
Describe how obedience was operationalised in the study by Milgram (obedience). [2]
4
Baron-Cohen et al. (eyes test) hypothesised that there would be a negative correlation between score on the
Eyes Test and score on the Autism Spectrum
Quotient. Identify whether this is a directional (one-tailed) or non-directional (two-tailed) hypothesis, including a
reason in your answer. []
5
Tristan would like to investigate whether people are happier in the presence of
babies. Suggest how you might operationalise the dependent variable in this study. [2]
6
Nung wants to see whether meditation affects people's subjective level of
wellbeing. Write an operationalised, non-directional hypothesis for Nung's study. [2]
199
5.2 Methodological concepts
5
KEY TERMS standardisation
control
participant variables situational variables
uncontrolled
5.2.2 Controlling of variables
Think!
Why is standardisation so important in psychological research?
Think!
Imagine you have to write a research proposal for an experiment on the impact of
online/remote working on people's attitudes towards their colleagues. You wil organise either an online or a
face-to-face interview between each participant and a confederate, who they are told is a new member of staff
at their workplace. You will then ask your
participants to rate the confederate on a number of personality traits. How wil you control participant variables
and situational variables?
Think!
Twenty people have their memory tested individually ni a quiet room. Ten complete the study ni total silence
and ten complete the study with quiet classical music. During the study, someone has a distracting coughing fit
in the next room. What sort of variable is this? How could this affect the findings of this study?
In order for Dr Rox to be sure about the effect of caffeine on reaction times (see
page 170), she had to control a wide range of variables. How many can you think of? Try for five. fI this is too
easy, try for seven! Go back over your list and note down why each
variable needs to be controlled - that is, how might ti have changed the results fi ti was uncontrolled?
When conducting experiments, researchers attempt to control all variables other
than the one being manipulated (the independent variable). Controlled variables are 'held constant', or
standardised, between the two or more conditions. This allows the researcher to state that any changes in the
dependent variable were caused by the independent variable.
Standardisation of a procedure
In order to make research replicable so that it can be checked for reliability, it is essential that the procedure si
standardised for al participants. In an experiment, the only difference should be what the researcher si
manipulating (the independent variable). However, al research, whether it si experimental or not, should follow
standardised procedures. This
includes participants all receiving the same information and instructions (unless this is something that is being
manipulated as part of the independent variable).
Participant variables
Participant variables are aspects of a person's background, personality, cognitive abilities, health and
wellbeing, etc., that affect how they behave or respond ni a study. Individual differences between people are of
course inevitable and this si why psychologists work with averages calculated from larger groups of people.
However, participant variables can
cause problems in experimental designs fi some aspect of the participants (e.g. empathy, aggression or
intelligence) varies systematically between the two or more experimental and control groups. For example, if al
passengers ni Piliavin et al's 'cane' condition
200
5 RESEARCH METHODOLOGY
5
STUDY TIP
It is important to remember the difference between a standardised procedure and
controls. Standardisation refers to features of the procedure being kept the same, while control variables are
features of the study that are kept the same between each level of the
independent variable. Standardisation is essential for replicating the study in the future, while controls are
essential for ensuring that the independent variable was the only factor affecting the dependent variable.
202
KEY TERMS
quantitative data qualitative data subjective data objective data
Imagine you have been asked to conduct some research on the effects of social media on teenagers' mental
health. Would you rather collect quantitative data or qualitative data? What are your reasons for this answer?
Your supervisor says that she would rather
you collect the opposite type of data. How wil you convince her ot come round ot your point of view?
Think!
People sometimes think that quantitative data cannot be subjective and qualitative data cannot be objective.
Can you think of an example that challenges these views?
Dr Rox collected her reaction time data ni milliseconds. What type of data si this?
Quantitative and qualitative data
Quantitative data refers to any data that is numerical. For example, closed questions on questionnaires
produce quantitative data.
TESTYOURSELF
Explain what is meant by 'situational variables', using any example. [3] Define 'control variables'
and 'uncontrolled variables', using any examples.
[6] Describe one control used in the study by Perry et al. (personal space). [2]
The study by Andrade (doodling) used a standardised procedure. Explain why it
was important to use a standardised procedure in this study. [2]
Buck is investigating the effect of background music on memory for faces. The
experimental group looks at a set of 50 faces while listening to quiet classical
music. The control group looks at the same 50 faces in silence. Describe one situational variable and one
participant variable that Buck must control to
ensure the validity of his findings. [4)
6
Georgia's teacher tells her she cannot start collecting data for her project as her standardised procedure is
unclear. Georgia ignores this advice and goes ahead
12
34
5
anyway. Explain why a standardised procedure is important for a study.
5.2.3 Types of data
Think!
What do people mean when they say that data is subjective?
Think!
[2]
5 RESEARCH METHODOLOGY
5
T E S TYOURSELF
204
KEY TERMS
generalisation sample representative sampling techniques population opportunity sampling
random sampling volunteer (self-selecting) sampling
You see an advert on social media asking for participants for a memory study. The advert says that the
research could have far-reaching benefits in terms of the early detection
of Alzheimer's disease. Would you respond to the advert? What factors would affect your decision? How else
do you think the researcher could recruit participants ni order to get the largest and most diverse sample
possible?
Think!
People often think the larger the sample the more representative ti si and, therefore, the more generalisable
the findings will be. Why is this not necessarily the case? Why is it still important to have a large sample?
Sample and population
Sampling refers to how researchers obtain participants for their research. Researchers will have a target
population of people to whom they wish to generalise their findings. For example, the target population might
be the global community of Cambridge International A Level psychology students. It would be practically
impossible to test the entire population and instead the researcher selects a sample from the target population.
Collectively, researchers refer to their participants as the sample.
Sampling techniques
Opportunity sampling
There are a number of ways that researchers can select their sample. Opportunity sampling is a technique in
which researchers select participants who are readily available. For example,
1
Define subjectivity using any example. [2]
2
Explain why data collected using an observation as a research method may be subjective. [2]
3
Perry et al. (personal space) collected quantitative data in their study.
a Define quantitative data using an example from this study. [2]
[2]
data from this study. [2] Jagdeep and his cousin Nishit are planning a study on mental health and therapy
in India. Jagdeep thinks the findings will be more valid fi they collect qualitative
data, but Nishit thinks quantitative data will provide more valid findings. Explain whether you agree more with
Jagdeep or Nishit. [6]
6
Sampson is listening to a podcast where the presenter is discussing a new piece of research on teenagers
attitudes towards climate change. The presenter says
b Suggest one advantage of using quantitative data in this study.
4
Milgram (obedience) collected data on participants' responses to orders from an authority figure. Identify one
piece of quantitative data and one piece of qualitative
5
the findings were very subjective. Outline what the presenter means with reference to the study on attitudes
towards climate change.
5.2.4 Sampling of participants
Think!
When we make generalisations ni psychology, what does this actually mean?
Think!
[2]
5 RESEARCH METHODOLOGY
5
LEARNING LINK
The study by Milgram (obedience) used a volunteer sample as he advertised for
participants in the local New Haven newspaper and by directly mailing potential
participants.
• Why do you think this type of sample was particularly problematic for a study on obedience?
•
How might the results have been different fi Milgram had used random or opportunity sampling?
STUDY TIP
Volunteer sampling can sometimes be confused with opportunity sampling. Remember that in volunteer
sampling participants are self-selecting themselves to take part in the
research, while in opportunity sampling they have been directly approached. Volunteer
samples, therefore, tend to consist of more motivated people who may have a particular interest in the
research, while an opportunity sample simply happened to be in a certain
place at a certain time and did not refuse!
TESTYOURSELF
1 Define opportunity sampling using any example. [2]
206
KEY TERMS valid consent
ethical guidelines informed consent
right to withdraw harm
privacy
lack of deception
debriefing confidentiality
2
Suggest one advantage and one disadvantage of random sampling.
[4]
3
Milgram (obedience) used volunteer sampling in his research.
a Explain how he obtained his sample. [3]
b Suggest one disadvantage of using volunteer sampling in this study. [2]
4
Piliavin et al. (subway Samaritans) used opportunity sampling in their study investigating helping behaviour on
a New York subway. Explain one reason why it may be difficult to make generalisations from this sample.
5
Puangchompoo would like to interview people on their meditation habits.
a Explain one sampling technique Puangchompoo might use to obtain her
sample for this study. [2]
b Suggest one advantage of using this sampling technique to obtain participants
for this study. [2]
6
Nancibelle wants to conduct research using a questionnaire to find out what types of gifts people like. She
uses opportunity sampling to obtain a sample
from outside a large department store in Malaysia. Suggest one disadvantage of Nancibelle using this
sampling technique in her study. [2]
5.2.5 Ethics and human participants
Think!
Why si ti sometimes ethically justifiable to use deception ni psychological research?
Think!
Imagine you were a participant ni Milgram's study (obedience). You administered a shock of 450 V, were
debriefed and met with the learner in the reconciliation phase. How do
you think you would have felt when you met the learner? Do you think Milgram's actions were justified by the
possible benefits of his findings?
[2]
Think!
Why do you think it might be difficult to gain truly valid consent from people who are detained, such as people
who are waiting for asylum requests to be processed?
Does your country have its own ethical guidelines? You could compare them to the British Psychological
Society Code of Human Research Ethics (2021), or the guidelines of
the American Psychological Association, or the Chinese Psychological Society. What do the guidelines have in
common and what differences can you find?
Ethics are a critical component of all psychological research, which is why wherever in the world there is
psychology, there are also ethical guidelines. Although local cultural norms
and values may be reflected, many of the core principles are the same, including the need for
informed consent and the right to withdraw. While ethical guidelines serve to protect the participants and
ensure they do not come to any harm, they also help to ensure that research that benefits wider society can be
pursued, as long as certain safeguards are in place.
fI you carry out any research yourself, it is critical that you obtain ethical approval from your teacher, before
collecting data from participants. Al ethical guidelines must be upheld when planning and conducting your
research.
Ethical guidelines in psychological research
Valid consent
This means that participants know exactly what will be happening during the research before
not considered valid unless it is informed; participants must know what they have signed up for and, therefore,
should be given detailed, age-appropriate information before being asked for consent. Researchers must be
able to provide evidence that consent was obtained; this
could be by ticking a box online, signing their name on a consent form or even giving audio- recorded verbal
consent.
Consent should be given by all participants, wherever possible, including children. However,
ni the case of some vulnerable groups, parents, family members or caregivers should also give consent. For
example, consent from a close relative may be necessary when working
with adults whose cognitive abilities and ability to communicate are significantly impaired, such as people with
severe dementia. Parental consent must be obtained when working with
children, although age restrictions may vary from country to country (e.g. under 16 versus under
18). Children should also be asked for their assent.
Right to withdraw
Although participants give their consent, they must also be made aware that they can withdraw from the
research at any time without any pressure to continue. They must also be informed
that their data wil be destroyed. If there si a point in the research when it si no longer possible to withdraw their
data (for example, they are completing an anonymous online questionnaire and they click 'submit'), then
participants must be made aware of this in advance. Researchers working with very young children or adults
with significantly impaired communication ability should look out for signals (both verbal and non-verbal) that
the participant si no longer happy to participate - for example, a young child that gets up and wanders round
the room. If a participant shows any sign of distress, the researcher must end the trial.
Minimising harm and maximising benefit
Researchers must aim to minimise harm to participants' psychological wellbeing, personal values, privacy or
dignity and mental health. The risk of harm should be no greater than the
participant might expect to experience in their everyday lives. Researchers should aim to
maximise the benefits of their research at all stages of the research process from developing the initial theories
and hypotheses, to publishing their findings and applying them to help people in their everyday lives.
5
207
5.2 Methodological concepts
208
Lack of deception
By obtaining fully informed consent, participants will be very clear about what they will be doing and the aim
and purpose of the study. Lack of deception means that the participants wil not be deceived about anything to
do with the research. In reality, ti can be difficult
to conduct research without any deception as simply knowing the true aim of the research could change
participants' behaviour. Researchers must explore al possible ways of conducting their research before
deciding to use deception. This is because it can put
participants at greater risk of psychological harm (e.g. feeling upset, angry, embarrassed or ashamed). When
deception si involved, full debriefing si essential.
Confidentiality
An individual's results should remain confidential before and after publication and an individual's data should
not be identifiable. This is particularly important fi the participants have provided qualitative data, which is more
likely to be personal and thus more identifiable.
It is important that any data that the researchers keep while conducting the research and analysing the data si
stored confidentially. Participants' names should not be included with
their data. If the participants need to return to complete another part of the research at
a later point, then participants should be assigned a code number so that the data can be paired up. Al data
should be stored in password-protected files or in locked cupboards.
Privacy
Privacy is generally discussed in the context of observational research. Participants should
only be observed in public situations where they might expect to be observed by other people, such as in a
supermarket or train station. It is not acceptable to observe people in their own homes without their consent.
Researchers who conduct research in public places
need to gain consent from whoever owns the space - for example, the station manager, or the local
government if it si an outdoor public space like a park.
Debriefing
Debriefing happens after the participant has completed the research. The purpose is to
explain any deception that has taken place, to answer any questions the participant may have and to ensure
they leave the research in the same state in which they entered it. For
example, fi the research has left the participant ni a negative state of mind, ti si important that the researcher
ensures that they leave
feeling more
positive.
LEARNING LINK
Think about the research by Hölzel et al. (mindfulness and brain scans; see page 29).
Participants were given brain scans, then underwent an eight-week programme of
Mindfulness-Based Stress Reduction, before returning for another brain scan. It would have been important for
Hölzel et al. to have been able to match up the participants'
scores for analysis at the end. However, they also needed to ensure the data was kept confidential. Can you
think of any of the other core studies that presented ethical issues?
STUDY TIP
It can be easy to confuse privacy and confidentiality. Maintaining confidentiality means
taking steps to ensure personal information is not disclosed to anyone outside of the
immediate research team (for example, using initials to refer to participants in case
studies) whereas privacy refers to each participant's right not to be interrupted, observed or disturbed and this
can extend to public spaces such as restaurants and shops. Agood
way to ensure that you fully understand each of the ethical guidelines and how they differ from each other is to
form your own ethics committee with people in your class or an
online study group. Whenever any practical work is undertaken, it must first be formally proposed to the
committee and discussed. The committee should provide feedback and suggestions for improvements or
areas that need more careful consideration. Your teacher should be head of the committee but remember,
ethics si everyone's responsibility.
5 RESEARCH METHODOLOGY
5
Ethical guidelines for non-human animals
Minimise harm and maximise benefit
As with human research, psychologists must seek to minimise harm, discomfort and suffering to the animals
that they are studying and maximise the benefit of the research in applying their findings to helping either other
animals of the same species or helping humans.
Replacement
The British Psychological Society guidelines recommend that, where possible, the use of animals is replaced
with video footage from prior research or computer simulations. This is
particularly important fi the animals are to be used for training or teaching purposes.
Species
It is important that researchers use species of animals that are both scientifically and ethically suitable for the
research. The researchers must have a good understanding of the animal's natural history and its level of
sentience (its ability to feel). The researchers should
know whether the animals have been bred ni captivity and whether or not they have been involved in previous
research. The researchers must choose the species that is likely to
suffer the least while still meeting the research objectives. Researchers should discuss these choices with
colleagues who are also knowledgeable about the species that they plan to
use. The use of non-human primates should be avoided where possible because of their high level of
sentience.
Numbers
According to the Animals (Scientific Procedures) Act 1986, researchers should use the smallest number of
animals possible to meet the research aims. However, this must not mean
that those animals chosen are used over an extended period of time, which could prolong any suffering.
Reducing the number of animals used in an experiment can be achieved by appropriate pilot studies, reliable
measures of behaviour, good experimental design and the appropriate use of statistical tests. Pilot studies
should be used with a very small number
of animals to ensure that the design is appropriate and testing what the researchers want
to investigate, before conducting the research on larger numbers of animals. Researchers should consider
using measures of behaviour that have been used in previous research and have been demonstrated to be
reliable. Good experimental design ensures the minimum use of animals. It is recommended that researchers
have a good understanding of statistics so that appropriate tests can be used to minimise the number of
animals used.
Procedures: pain and distress
The researchers should avoid causing death, disease, psychological or physical discomfort to any animals.
Where possible they should look at enriching the animals' environment rather than depriving them or using
observational techniques in situations where naturally occurring instances of adverse conditions are
happening. It is important that researchers
carefully consider the balance between the harm or potential harm caused to the animals and the benefits that
would be gained from the research. Pain can only be ethically justified
in psychological research when there si no alternative research method available and the research would have
significant scientific or educational value. Any surgical procedures should be conducted using anaesthesia to
reduce any pain and prevent infection. fI an animal's life needs to be terminated at the end of the research for
any reason, then this should be done swiftly.
Housing
The social and natural behaviour of the species being studied should be considered when housing the
animals. Isolating animals who would normally live ni social groups could cause stress. Similarly, overcrowding
could also cause stress and aggression. Using the same
cage to both house and test the animals could reduce the negative impact and should be considered
210
5
T E S TYOURSELF
1 Describe the ethical guidelines of 'species' and replacement in relation to
animals in research, using any examples. [6]
2
State what is meant by the ethical guideline of 'numbers' ni relation to animals.
[2] In the study by Hassett et al. (monkey toy preferences), explain how two ethical guidelines relating to
animals were followed. [4)
4 In the study by Fagen et al. (elephant learning), explain how the ethical guideline
212
you to give their study a score from 1-10 for ecological validity (where 1si very poor Whtaies your answer eul
uoy about vaet ywdi bychniginat aesard uoy change ?ti:
Think!
Is it possible to have a study that is high in reliability and also high in ecological validity?
Validity refers to the extent to which the researcher is measuring what they think they are measuring and the
extent to which the findings are useful and meaningful.
Ahistorical example of a psychometric test that lacked validity is the Army Alpha and Beta
QI tests. The term 'Id' stands for 'intelligence quotient'. nI 1917, a psychologist called Yerkes worked with
colleagues to create QI tests for army recruits ni the United States
of America, to see which section of the army would be most appropriate for them, based on their QI levels.
Many of the recruits were immigrants who had not lived in the country
very long. However, the I'Q' tests contained questions such as W' hat si Crisco?' and Who si Christy
Mathewson?'. These questions were impossible to answer unless the recruit had a good knowledge of
American culture. This led to recruits from certain nations experiencing prejudice as they (understandably)
received low QI scores. This si a good example of a test that lacks validity - it was testing cultural knowledge,
not IQ.
Ecological validity
Ecological validity si the extent to which behaviour that participants demonstrate during research relates to
how they would behave ni their real lives. Research conducted ni normal, real-life settings, as ni field
experiments, si likely to be (but not always) higher ni ecological
KEY TERMS
ecological validity validity
subjectivity
objectivity
demand characteristics
generalisability temporal validity
ethical guidelines Yoko must follow when conducting this research.
5.2.7 Validity
Think!
of procedures le.g. housing, reward, deprivation and aversive stimuli] was followed.
[3]
[2]
13J
5
Mirabel is observing two groups of gorillas in the wild, a large group of 65 and a small group of 20. She is
recording the number of close social relationships each gorilla has with other gorillas in the group. Outline why
Mirabel must have a strong knowledge of the 'species' before being allowed to conduct this work.
6
Yoko is conducting an experiment on mice. The mice have been genetically engineered to make them more
anxious than normal. Yoko will then test a new drug on the mice to see whether she can reverse their anxiety.
Describe three
When psychologists say that a study lacks ecological validity, what do they actually mean?
Think!
Imagine that you are a participant ni Perry et al's study (personal space). Can you remember what you had to
do? If not, check page 144 for a summary. Perry has asked
5 RESEARCH METHODOLOGY
5 RESEARCH METHODOLOGY
214
Demand characteristics
In research studies, participants might try to guess the aim of the research and change
their behaviour based on what they think the researcher is looking for. This
would reduce
the validity of the study. Any features of the research environment which may give away the study's aim are
called demand characteristics. This is why some studies use deception to avoid the participants' behaviour
being affected. In Milgram (obedience), participants were deceived in a variety of ways to attempt to hide the
study's true purpose. However, critics
have argued that participants did pick up on demand characteristics and knew that the learner could not
possibly come to any harm. For example, the experimenter's total lack of concern for the learner may have
cued at least some of the participants to realise that the shocks were not real (Orne and Holland, 1968).
One of the strengths of using animals as participants is that they do not actively try to work out what the study
is about. Therefore, demand characteristics are reduced and validity
increased. This said, animals may unconsciously pick up on subtle cues from researchers, which may affect
their behaviour.
Generalisability
Findings should only be generalised to the target population fi the sample was sufficiently
representative. This wil depend upon the sampling technique, for example random samples are typically more
representative than opportunity or volunteer samples. The extent to
which findings can be generalised beyond the sample si sometimes called 'population validity. For example,
the study by Piliavin et al. (subway Samaritans) looked at helping behaviour on a New York subway carriage. It
may not be appropriate to generalise their findings to all cultures because helping behaviour may differ in other
countries, based on
cultural values. Whether the results from a study can be generalised to real-life situations or not is called
ecological validity. For example, as the study by Piliavin et al. (subway Samaritans) took place on a normal
subway journey, as part of people's real lives, it is
possible to generalise the findings to similar real-life contexts.
Temporal validity
Temporal validity refers to whether the results of a study can be generalised to a different
time period. For example, Bandura's study was conducted in the 1960s and the results may
not be the same today as modern children have access to personal technology such as phone
and tablets and gender roles have also changed significantly. This means the findings may be era-bound and
said to lack temporal validity.
Other studies, such as Hassett's study of sex differences and play in monkeys, are less likely to be affected by
temporal validity since they are based on a generally stable biological
functions. Social psychological research may be most susceptible to issues with temporal I validity, as cultural
and societal norms change over time.
STUDY TIP
It is important to remember the difference between demand characteristics and socially desirable answers -
demand characteristics are features of the environment that give
away a study's aim, while socially desirable answers are those given by the participant to ensure they are
viewed ni a more positive way. Both of these, however, can affect the validity of data collected.
5
Standardisation
Standardisation of the procedure and instructions provided to the participants is an important way of making a
study replicable. It also means that all aspects of the research
are consistent for every participant (except for the manipulation of the independent variable ni experimental
studies). For example, ni the study by Bandura et al. (aggression), the model always displayed the same
sequence of behaviours, in the same order and for the same length of time. The toys were laid out in exactly
the same way in each of the rooms and
the children were always given the exact same standardised instructions. The observer also scored the
children's aggressive and non-aggressive behaviour in the same way according
to the categories on the behavioural checklist. This high degree of standardisation means
Bandura's study is easy to replicate and, therefore, his findings could be checked for reliability.
Dr Rox needs to ensure the replicability of her study on the effects of caffeine on reaction times (see page
170), so that her study can be checked for reliability.
» What can she do to make sure every participant has the exact same experience apart
from whether they drink coffee or water?
» How would you ensure that everyone understands the task before they start?
» How could you ensure that everyone begins the reaction time task at exactly the right time, after finishing the
drink?
» When writing up the procedure for publication, how can Dr Rox make sure that ti si replicable?
T a b l e 5.13 Ways of increasing reliability
216
Technique
Test-retest reliability
Inter-rater reliability
Inter-observer reliability
Description
Participants repeat a test or questionnaire a few weeks or months later to see whether their scores are
consistent. If they are, the researcher can say that results are reliable.
Inter-rater reliability refers to the extent to which two researchers agree in their scoring of a questionnaire or
test. fI the two raters plot their scores
for each participant against each other on a scatter graph, they should see
a positive correlation, suggesting that they are both applying the scoring criteria consistently (in the same
way).
Inter-observer reliability refers to the extent to which two researchers agree ni their rating or coding of
behaviours ni an observation.
LEARNING LINK
In the learning approach, you studied the experiment by Bandura et al. (aggression). This study provides an
excellent example of a highly standardised procedure, which
increases replicability and the ability to check for reliability. It also included examples
of both inter-rater and inter-observer reliability. Be sure to cross-reference your notes
on techniques to increase reliability with this study. What level of agreement was found between the
experimenter and
the nursery teacher regarding
the children's pre-existing aggression? What level of agreement was found between the two observers who
assessed the children's behaviour in the third and final room? Tip: These are numbers between 0 and 1- the
closer to 1.0, the stronger the agreement. What do these figures tell us about the reliability of Bandura et al.'s
study?
5 RESEARCH METHODOLOGY