Research Methods For Education, 2nd Edition PDF
Research Methods For Education, 2nd Edition PDF
Peter Newby is an affable and welcoming guide, but don’t let that fool you; his introduction rep-
resents the crystallization of careful, sophisticated practical, technical, and conceptual thinking,
combined with a sure-footedness in and around the folk-ways of educational research. This is a
book with a considerable span of interests, permeated with a strong sense of why such work, and a
commitment to doing it well, really matters. That Newby has done all this without compromising
the complexities and challenges that make all this work important is an achievement that makes this
an especially useful and enjoyable book for beginning and experienced researchers alike.
Peter Freebody, The University of Sydney
Crafting an appropriate and effective research design is a challenging task for many students,
novice and experienced researchers. Users of this comprehensive text will find it very helpful in
designing suitable research tools, seeking consistency in theoretical underpinnings and making
critical research decisions. The text is intelligently grounded – it provides useful insight into real-
life research situations and examples. It is a very accessible text, easy to read and navigate through.
I would have no hesitation in recommending it to students embarking on educational research
and to lecturers about to teach a course in research methodology.
Marc Shäfer, Rhodes University, South Africa
An excellent text for students studying at all There are few things more important than
levels from undergraduate to doctoral quali- good research into education, and in this
fications. The structure of the book leads the book Peter Newby makes sure his readers can
reader through the complete research pro- meet this challenge. He is a reliable, thorough
cess, highlighting the many ambiguities and and confident guide for anyone setting out on
challenges faced during the research. Clear their research journey. The text is particularly
language makes the text accessible and helps helpful for researchers developing action or
to clarify some of the more difficult issues policy in this field.
without minimising their complexity. This
James Wisdom, Visiting Professor of Educational
book will be a great asset to many first time
Development, Middlesex University
as well as experienced researchers.
Sheine Peart, Nottingham Trent University
This is an excellent, up-to-date and accessible methods text which will greatly appeal to students
grappling with the research process. The style of the book is clear and user-friendly, whilst the con-
tent anticipates many of the problems which students are likely to encounter during their research
in education. Comprehensive and good value for money.
Samantha Punch, University of Stirling
A refreshing approach to basic research issues, in a comprehensive research text that should stand
the test with students who find some issues difficult to grasp. Its combination of theory and practi-
cal illustrations guides the reader through all aspects of the research process, the management of
quantitative methodology and analysis a particular strength. The relaxed style of writing and presen-
tation, and online features, will be appreciated by staff and students alike.
Molly Cumming, University of Strathclyde (retd)
One of the most thorough and comprehensive research texts available. The author offers a thorough
presentation of all aspects of the research process, draws on a wide range of real examples from prac-
tice and offers particular support to those students who might struggle when presenting quantitative
data in their research process.
Liz Keeley-Browne, Oxford Brookes University
I am impressed by Newby’s concrete and structured way of guiding the student through the entire
research process. His descriptions of complex theories and procedures is conveyed in an interesting
and accessible way. Students will also enjoy the writing style and pedagogical organization of the
book.
Carina Rönnqvist, Umeå School of Education, Sweden
Peter Newby provides a lucid and accessible guide to research methods for education. His approach,
which sees such methods as a means to an end, is a much needed reminder that the main aim of
research is to answer difficult questions and to break new theoretical and empirical ground.
Richard Andrews, Institute of Education, University of London
Research Methods for
Education
Research Methods for Education, Second Edition takes the student by the hand and
guides them through the complex subject of research methods in an engaging,
witty and clear way. The book covers the philosophical approaches and epistemol-
ogy, as well as the practical aspects of research, such as designing questionnaires
and presenting conclusions.
Each chapter is split into ‘Context’ and ‘Practice’ and both sections are packed with
exercises, examples and comparative international material from other educational
contexts. Peter Newby’s book is the student-friendly text which demystifies the
research process with clarity and verve.
Key features:
t written in a clear and friendly manner to help students feel more confident deal-
ing with the complexities of research and particularly useful for those new to
research or less confident with numbers
t a mixed methods approach, which doesn’t simply prioritise quantitative or
qualitative methods, allowing for greatest possible coverage
t contains guidance on analytic procedures that require more advanced tools
such as SPSS and Minitab
t many excellent international examples and case studies specifically from educa-
tion, which break away from a parochial focus on UK education system.
Additional support such as activities, multiple choice questions, data-sets, exam-
ples of good and bad research tools and help with mathematics is available on the
website www.routledge.com/cw/newby.
Peter Newby headed up educational development at Middlesex University for
ten years. After this he set up an education research and development centre
where the focus of the work was the exploration of learning processes and frame-
works that could deliver prosperity and greater social equality to communities.
Peter is now Emeritus Professor of Higher Education at Middlesex University.
This page intentionally left bank
Research
Methods
for Education
SECOND EDITION
Peter Newby
Emeritus Professor of Higher Education,
Middlesex University
This edition published 2014 by Routledge
2 Park Square, Milton Park, Abingdon, Oxon OX14 4RN
and by Routledge
711 Third Avenue, New York, NY 10017
Routledge is an imprint of the Taylor & Francis Group, an informa business
© 2014 Routledge
The right of Peter Newby to be identified as author of this work has been asserted by him in accordance
with sections 77 and 78 of the Copyright, Designs and Patents Act 1988.
All rights reserved. No part of this book may be reprinted or reproduced or utilised in any form or by
any electronic, mechanical, or other means, now known or hereafter invented, including photocopying
and recording, or in any information storage or retrieval system, without permission in writing from the
publishers.
Trademark notice: Product or corporate names may be trademarks or registered trademarks, and are used
only for identification and explanation without intent to infringe.
List of figures xv
List of tables xviii
List of case studies xxi
List of activities xxiii
About the author xxv
Preface xxvi
Acknowledgements xxix
Peter Newby
Preface
I would like to begin by explaining to students who are using this text why you are
expected to follow a course in research methods and, almost certainly, why you are ex-
pected to produce some research of your own. When students join a university, they are,
invariably, presented with details of the course they will follow. These show what will be
studied at each stage of the course, where they have choices and how assessment is con-
ducted. In many institutions, certainly in the UK, students will also be told what learning
outcomes are expected. However, what is often lost in this detail is a bigger picture of the
principles that underpin a university education. Perhaps surprisingly, these principles are
common to many disciplines, from the sciences, through the social sciences to the arts.
They almost certainly inform undergraduate programmes in education. And from my
perspective as author of this text, research plays a major part in giving these principles
substance.
An undergraduate programme develops students so that they can, with integrity and a
sense of having earned the honour, call themselves ‘graduates’. Generally there are three
steps in this process. Each builds an important element in the infrastructure of becom-
ing a graduate. The first is understanding. You demonstrate your ability to understand a
subject by reproducing arguments, perspectives and evidence in your own words. So, if
you ever wondered why you were given essays, term papers and examinations, this is the
reason. As you progress in your course, the need to demonstrate understanding becomes
intertwined with another principle on which graduateness is based, critical capability. This
is concerned with how well we exercise our judgement. We develop critical capability by
using our understanding to assess and judge subject material and we demonstrate our
critical capability through the quality of the arguments we construct. The final stage of
becoming a graduate is to develop the ability to create knowledge. This can only happen
when we exercise critical judgement, when we see where there are gaps in our knowledge
or where our understanding is flawed.
It should be clear now where a course in research methods fits in. It is the key to the fi-
nal stage of becoming a graduate. As a graduate, you will be expected, wherever you work,
to be capable of influencing development based on a cogently argued case for change.
You will not be able to do this without sound research to back up your argument. And the
same argument applies if your research journey begins at the postgraduate level.
I am aware that not every student finds a course in research methods as appealing
as one on educational policy or special needs or citizenship but you should remember
PREFACE xxvii
that a university education is designed to give you the intellectual and technical skills
to shape the world. I have written this text to help you develop the ability to do this. I
recognise, however, that learning about research methods and how to use them can seem
challenging so, while writing, I have had in mind a student who is a little daunted by the
thought not only of passing a course in research methods but also of having to produce
a piece of research work. I have tried to see the expression on that student’s face and in
the eyes, so that I can see whether my ideas are understood. I have tried, also, to write the
text so that it is approachable and readable. Nothing is more daunting than reading an
author who likes to show that he or she knows more than the person studying the text.
There is, of course, technical vocabulary in this text, but it is explained. I have, wherever
possible, sought to give context to what I have included, with background on some of the
people associated with the techniques and approaches. You will learn that the history of
social and educational research has the drama of professional rivalries and the corrupt-
ing influence of personal ambition as well as the knowledge generated by the efforts of
researchers. I have tried also to show the utility of the methods with examples of how
and where they have been used. The learning model is explanatory but I hope that the
inclusion of activities, case studies and the Web materials developed and provided by
Mike Radford, moves the text away from being overly didactic. Certainly my object was
to engage students with both the excitement of research and the sense that they could
do it themselves.
I have said that I had a student in mind when I was writing, but who are you? As I
imagine you, you are just beginning a research journey. You may have done research
elsewhere, perhaps at school, but that was not designed to enable you to produce re-
search that could inform and shape the world. Typically you can begin this research jour-
ney at one of three points. First, you may be an undergraduate taking a course in educa-
tion. Second, you could be a trainee teacher or a newly qualified teacher who has moved
into education from a specialist subject field and are taking a course of professional
development over and above a postgraduate teaching qualification. Third, you might be
following a postgraduate programme at masters or doctorate level, often after a period
in teaching and as a means of advancing your career. While these three starting points
inevitably imply that people will have different amounts of contextual knowledge, it is
unlikely that you will have the appreciation of how to go about research in ways that
other education and social researchers will find acceptable and convincing. For this rea-
son I have assumed little prior knowledge and, in mathematical terms, only the ability
to add up, subtract, multiply and divide. The challenges in research methods (as in most
other courses) are met by thinking logically and creatively. There is some mathematical
formulation to enable you to make the leap to more advanced texts and academic papers
where statements about statistical tests are an integral part of communication.
So what has my approach been? It is founded on the belief that something that is well
explained is better understood. It is designed to develop the confidence to undertake
research. This is not a text that sets out rules and recipes for how to carry out research.
I believe that my role is to build understanding and a self-belief that you are capable of
research so that when you apply your learning to tackling a research problem, you under-
stand the opportunities open to you, the freedom you have to select methods and that
you are able to justify your research practice. I have used the opportunity of a second edi-
tion to reshape some of the material and introduce up-to-date research and ideas about
research. In terms of structure, I have chosen to make explicit frameworks that were im-
plicit in the first edition in order to encourage students and teachers to use the text more
flexibly. While there is clearly a linear narrative to the whole, the changes to each chapter
that I have introduced should make it easier to use the text in ways that suit how a course
xxviii PREFACE
is taught. There no need to start at the beginning and continue through to the end. Each
chapter and section of a chapter can be read independently. Part 1 gives the context for
research in education. It deals with the implications of philosophy and terminology and
concepts used by research methodologists to make the research process comprehensible.
Weaving through these sections are discussions of approaches to education research.
Some of these ways of looking at research (the qualitative/quantitative in particular) are
deeply rooted but things may change as young researchers grasp the opportunity to at-
tack the research question without feeling the need to be bound by research convention.
Part 2 examines the process of data collection and Part 3 data analysis. Each chapter is
now divided into two sections; one, research practice, as it suggests, adopts a ‘how to’
approach and the other, context, gives guidance on issues researchers should consider
before embarking on the research practice.
There are some key messages within the text that I would like to highlight here.
First, it will introduce you to a range of research positions, methods and approaches.
I should make it clear that I do not advocate any one over others. My position is that
you should use your view of the world and the issue you intend to research to determine
what you do and how you do it.
Second, I believe that all researchers should have a broad appreciation of research
approaches so that they can develop a research strategy and design that is appropriate to
the circumstances.
Third, I want readers to feel confident about their research approach. It is for this
reason that I have highlighted context and practice in each chapter. The context sections
and Part 1 in particular provide the intellectual arguments for justifying research practice.
Fourth, many beginning researchers are needlessly concerned about their mathemati-
cal competence and therefore their ability to use quantitative approaches. All this text
asks of you is that you understand (i) straightforward arithmetic, (ii) the logic that un-
derpins statistical tests and (iii) the idea that mathematics has a language of its own
whose basic vocabulary can be picked up quickly. There are now so many utilities on the
Internet that actual statistical calculation can be done at the click of a button.
Finally, perhaps the most important message: research is not just enjoyable (hon-
estly!), it is also liberating. It will open your eyes to new ideas, possibilities and ways of
viewing the world, it will show you what you can achieve and it will give you the confi-
dence to set your sights high and achieve even more.
How I have approached writing this text is the product of many years’ teaching
students research methods. As the character of the student population has changed,
so has my approach. I hope what I have written meets the needs of present day stu-
dents. While the words and the framework are mine, I have had valuable help from a
number of people. In particular, for the first edition, I should mention Dr Liz Browne
(Oxford Brookes University), Helen Channon (University of Cumbria), Molly Cum-
ming (University of Strathclyde), Dr Lisa Lucas (University of Bristol) and Dr Steve
Strand (University of Warwick). To the anonymous referees of both the first and second
editions and those who found ways of contacting me directly, I send you my thanks
too. I would also like to record my thanks first to the team at Pearson who produced
the first edition. To Catherine Yates who chanced on a throwaway remark that I knew
something about research methods and to my editors, Stuart Pearce whose protesta-
tions that he ‘didn’t quite understand’ revealed, instead, significant knowledge about
research methods. Second, to Natalie Larkin, Victoria Bate and Vicky Parting who man-
aged the transfer to Taylor and Francis and the production of this second edition so
effectively. All of you have helped improve Research Methods for Education. Any errors
that remain are mine alone.
Acknowledgements
Figures
Figure 2.3 after Getting Better All The Time: making benchmarking work, The Audit
Commission (2000); Figure 4.3 from http://www.barnet.gov.uk/online-services-maps,
(c) the London Borough of Barnet; Figure 9.2 from Lesson Observation Notes and
Prompts’ document from: http://education.exeter.ac.uk/pages.php?id=527; Figure 9.3
from Problem Basing Learning: An Exploration of the Method and Evaluation of its Effec-
tiveness in a Continuing Nursing Education Programme, Middlesex University (Newman,
M. 2004), (c) Mark Newman; Figures 10.3, 10.19a, 10.19b, 11.3, 11.4 from Screenshot
from IBM’s ‘Many Eyes’ website, ibm.com/manyeyes. Reprint Courtesy of International
Business Machines Corporation, © International Business Machines Corporation;
Figure 11.2 from Constant Companion Method: A Kaleidoscope of Data, The Qualitative
Report, 4 (1/2) by Dye, J., Schatz, I., Rosenberg, B. and Coleman, S. (2000), http://www
.nova.edu/ssss/QR/QR4-1/dye.html; Figure 11.5 after Screenshot from SmartDraw.com’s
‘SmartDraw’ software; Figure 11.7 from Stakeholders’ meanings of effective school leader-
ship: A case study in a New Zealand primary school, Unpublished doctoral thesis, Griffith
University, Australia (Brooker, B. 2005) htttp://www4.gu.edu.au:8080/adt-root/public/
adt-QGU20061023.15153
Screenshots
Screenshot 2.1 from http://www.eric.ed.gov, Reproduced with permission; Screenshot 4.4
from http://www.ons.gov.uk/ons/index.html, Source: Office for National Statistics licensed
under the Open Government Licence v.1.0; Screenshot 4.5a from http://www.education
.gov.uk/researchandstatistics/statistics/statistics-by-topic/performance;Screenshot4.5bfrom
http://www.education.gov.uk/researchandstatistics/datasets; Screenshot 4.9 from http://
stats.uis.unesco.org/unesco/ReportFolders/ReportFolders.aspx?IF_ActivePath=P,50&IF_
Language=eng, UNESCO Institute for Statistics (UIS), http://stats.uis.unesco.org; Screen-
shot 4.10a from http://www.oecd.org/education/educationataglanceindicatorsrawdata.
xxx ACKNOWLEDGEMENTS
Tables
Table 4.1 from http://www.caci.co.uk/ACORN/, Acorn (c) CACI Limited; Table 5.1 from
http://www.texasldcenter.org/htmlpages/debates/taylorresponse.html, Reprinted by
permission of SAGE Publications; Table 8.6 after Mind the Gap: Are Students Prepared
for Higher Education?, Journal of Further and Higher Education, 27 (1), 53–76 (Lowe. H.
and Cook, I. 2003), Reprinted by permission of the publisher (Taylor & Francis Ltd,
http://www.tandf.co.uk/journals); Table 9.7 from Toward Self-Directed Learning in Sec-
ondary Schools: What Do Teachers Do?, Teaching and Teacher Education, 17 (7), 837–55
(Bolhuis, S. and Voeten, M 2001), With permission from Elsevier; Table 10.2 after http://
www.hesa.ac.uk/performanceindicators/0506/sp6_0506.xls, Hesa cannot accept respon-
sibility for any inferences or conclusions derived from the data by third parties; Table
10.5a from DCSF: School Destinations of Secondary School Pupils Resident in London
Boroughs, 2007 http://www.education.gov.uk/researchandstatistics/statistics/allstatis-
tics/a00195564/dcsf-school-destinations-of-secondary-school-pupil; Table 10.10 from
Child and youth well-being in the United States, 1975–98: Some findings and a new
index, Social Indicators Research, 56 (3), 241–318 (Land, K., Lamb, V. and Mustillo, S.
2001), With kind permission from Springer Science and Business Media; Table 10.11
from Pisa 2006: Data/Données Volume 2, Table 6.2c; Table 10.13 from ‘Summary of
academic staff (excluding atypical) in all UK institutions 2006/07’ at http://www.hesa.
ac.uk/dox/dataTables/staff/download/staff0607.xls?v=1.0; Table 11.4 from Table 9.1
from National Standardised Assessment, INCA, http://www.inca.org.uk/pdf/table9.pdf;
Table 11.6 from http://www.transana.org/images/TransanaShortcuts.pdf, Jeffersonian
Transcription Noation is described in detail in G. Jefferson, Transcription Notation, in
J. Atkinson and J. Heritage (eds), Structures of Social Interaction, New York: Cambridge
University Press, 1984.; Table 12.1 from The Importance of Socio Economic Status and
Individual Characteristics on the Prevalence of Head Lice in School Children, European
Journal of Dermatology 15 (5) 387–92 (Willems, S., Lapeere, H., Haedens, N., Pastels, I.,
Naegert, J.-M. and De Maeseneer, J. 2005); Table 12.11 from Birth Statistics: Review of
the Registrar General on Births and Patterns of Family Building in England and Wales,
2006, Office for National Statistics, Table 2.1 (2006), Source: Office for National Statistics
licensed under the Open Government Licence v.1.0.; Table 13.15 after Making it Work:
Low-Income Working Mothers’ Involvement in Their Children’s Education, American
Educational Research Journal 40 (4), 879–901, Table 3 (Weiss, H. et al 2003), Reprinted
by permission of SAGE Publications.
Text
Quote in Chapter 2 from Morality for Beautiful Girls, Abacus (McCall Smith, A. 2003); Ac-
tivity 2.2 from British Educational Research Association (BERA, 1992), Reproduced with
permission of BERA. www.bera.ac.uk; Case Study 9.2 from Taken from a study undertak-
en by Hockey, J., Robinson, V. and Meah, A., at the Dept. of Comparative & Applied Social
Sciences, University of Hull, Reproduced by kind permission of the authors; Activity 10.3
from PISA 2006 survey results, Table 5.4; Case Study 11.2 after Rusby, J., Estes, A., and
ACKNOWLEDGEMENTS xxxi
Dishion, J. (1991) The Interpersonal Process Code (IPC), Oregon Social Learning Centre,
Eugene, Oregon. Available at http://www.oslc.org/resources/codemanuals/interpersonal-
processcode.pdf; Activity 13.4 from Reading Achievement & Social Selection in Indepen-
dent Schools in Sweden: Results from IEA PIRLS 2001 Reading Test, Scandinavian Journal
of Educational Research 50(2) 185–205 (Myrberg, E. and Rosen, M. 2006), Reprinted by
permission of the publisher (Taylor & Francis Ltd, http://www.tandf.co.uk/journals).
In some instances we have been unable to trace the owners of copyright material, and we
would appreciate any information that would enable us to do so.
Data collection Data analysis
T he c o n t e x t f o r y o u r re s e a rc h
Chapter 2
Philosophy Paradigms Chapter 3
Quantitative research
Qualitative research
Chapter 2 Chapter 2
The research process
Theories and models Ethics
Part 1
This first part of the text sets the scene for re- techniques should be used only in specific
search in education. Its purpose is to provide circumstances.
you with confidence in handling the concepts
and ideas that condition research practice. 2. If this is your first introduction to research
These are important if you want your research methods, the discussion and explanations
to be taken seriously. While some of the ma- are still within your grasp, especially if they
terial can seem challenging, it is discussed in are supported by the guidance of a tutor
a straightforward way. Because of the need and class discussions.
to explain concepts and ideas to students who Chapter 1 sets the scene for education research.
may not have met them before, there has had
to be some simplification as well as selection.
Chapters 2 and 3 explore influences on the re-
search process that we undertake. These con-
Simplifying things in order to help people stitute the context for our research and they
understand them can lead to concerns being are shown in the preceding figure.
ignored or glossed over. There is, however,
guidance on further reading that will take
In Chapter 2 we will look at the influence of
you more deeply into issues. Nonetheless,
philosophy, methodology, research paradigms
there are two points of advice that I offer in
and research questions. The figure shows that
respect of this section.
these are highly connected issues. In fact para-
1. It is possible to omit much of Part 1 and digms, broad approaches to research activity,
go on to Parts 2 and 3, which deal with the are so important in giving shape to our under-
more technical aspects of research data standing of the research process that they are
collection and analysis. But do note, there considered in depth in Chapter 3. In Chapter 2
will be times when it will be necessary to as well we will look at how theories and mod-
return to Part 1 in order to understand fully els fit into the research process and how our
why certain things should be done in a cer- research should rest on ethical considerations
tain way and why some approaches and of how others are affected by our work.
Chapter contents
Learning themes 5
Summary 29
Further reading 30
References 30
Chapter 1
Learning themes
t Learn how this text can meet your needs, By the end of this chapter you will:
whatever stage you are at in your research
t Understand how this text meets your
development.
needs and how you can use it.
t Why people undertake education
t Appreciate the character of educational
research.
research, its broad goals and objectives
t The character of the research community and be able to use these to stimulate your
in education. own research thinking.
t How to be successful in education t Be able to think in advance of what you
research. should do to minimise the risk of your
research programme being derailed.
t Educational research is highly contested.
stimulating and second, that doing your first be ‘solved’ at various levels of expertise and
piece of research is engrossing and one of the this chapter will help you understand which
most enjoyable and satisfying things that you level you, as a reader, are at. You will also ap-
will ever do – believe me! preciate that education research has not one
heart but three – one whose beat provides the
Think of this text as if it were a game or a puz- life-support for academics as ‘searchers after
zle, for example a jigsaw. This introductory truth’, one for practitioners and the third for
chapter is the box that contains the puzzle. It policy makers. It will help you understand that
gives some instructions and guidance on how solving the research puzzle involves decisions
to put it together. One thing you need to ap- and judgements on your part. You will see
preciate is that the research puzzle is unusual that the interest in education research does
because, unlike a conventional jigsaw, it can not end at the school gates. Education is seen
be put together in an infinite number of ways. as a way of achieving a wide range of social,
To this extent, it is more like a computer game economic and political goals, so it concerns a
in which the object is to get to the end but far wider community than just education pro-
because of the random nature of events there fessionals. Most important though, this chap-
are a great many routes for reaching the goal. ter will give you guidance on the standards
that you have to maintain if you want your
What does this chapter tell you about the research to be taken seriously.
puzzle? It will give you a perspective on re-
search in education. As a puzzle, research can Welcome to the research world.
There is no absolutely right or wrong way of putting together the first section of any text
but there is always a way that makes sense in terms of the message that the author wants
to convey. In this instance, rather than getting into the intricacies of the research process,
we are going to look at who this text is for and why it will be useful at all stages in your
research career. You are going to see why we do education research and just who does it,
because it is important you understand this when you read education research. And you
will also learn what the key objectives of any research are as well as some rules of thumb
that you should know before you become a researcher.
In order to write this text, I first identified the people who might use it so that they could
see how it could be valuable to them. See if you recognise yourself in Table 1.1. For every-
one in the world of research this text, and this chapter particularly, has two key messages.
t First, real world research is not necessarily clear cut and well structured. It is impor-
tant that you develop the skill of knowing when, where and how to compromise
with what theory and accepted practice says that you should do. You can choose to
Chapter 1 RESEARCH: A MESSY BUSINESS 7
work within one of the traditional research approaches and abide by their rules (we
cover these in Chapter 3). But you do not have to; you can mix approaches and styles
of research to give you the information you need to solve your research problem.
If you do work within a set of rules, you should recognise that you are letting go of
your freedom to take decisions and to construct your research programme in ways
that seem best for you. The argument of this text is that you need to understand the
consequences of doing or not doing something, so that you are able to make good
decisions according to the circumstances.
t This leads on to the second key message. Once you have this understanding and you
are confident in your judgements, then you will have the intellectual command of
your subject to persuade others that the choices you have made and the way you have
done things are appropriate. This word ‘appropriate’ is important. It does not mean
‘best’ necessarily, but it does mean ‘justifiable’ and ‘acceptable in delivering results
that are fit for purpose’. Once you can do this, then you will meet the prime, per-
haps the only, requirement of delivering research output, namely that it is acceptable
to the audience you select.
But what of the specific value to each of the groups?
Let’s begin by looking at the most important question, why bother to research education
at all? The answers will begin the process of building a framework that will help you un-
derstand how the research process works. At one level the answer to the question is quite
simple but when you start to look at the reality of research it is a little more complex than
you might think. There are three broad reasons for doing research in education.
What might be occurring to you now is that these three reasons for doing research
could come together in the way of a poorly fitting jigsaw. Figure 1.1 shows how that
happens. We can use this model of education research to say something about who
Policy drives
I·eseal·ch
Policy
Changing judging
practice Finding
out
Praxis
.Answering Policy
drives
questions shaping
research
Policy
Exploring testing
Improving
issues
practice Policy
improving
Government
Science & HE FE/ Schools Children & Community Creative Sport Crime, crime
Innovation lifelong families cohesion industries prevention and
learning Early years anti-social behaviour
Ofqual
Research HEFCE Skills Standards Teaching National Education Local Youth Justice
Councils Funding and Testing Agency College Funding Safeguarding Board
Agency Agency for School Agency Children Boards
Leadership
Office for Education Local Authority
Fair Access Authorities Children’s Services
Ofsted Early
Intervention
Foundation
second, if you are involved in any aspect of policy assessment, you need to understand
the range of departmental interests and ensure that you appreciate all inter-departmental
linkages. The complexity of these is hinted at in Figure 1.2, which shows the Government
Departments in England that have an interest in education.
The complexity shown in Figure 1.2 is replicated in most developed states. What is more
it is a complexity that all education researchers should be aware of because the situation
for education can change as governments change. This happened in 2010 in England. The
Department for Children, Schools and Families (DCSF) which had overall responsibility
for education from pre-school education, play and care, through primary and secondary
compulsory education to 16–19 education was replaced by the Department for Education
(DfE). The Department for Innovation, Universities and Skills (DIUS) was replaced by
the Department for Business, Innovation and Skills which has overall responsibility for
post-19 education and training. Government departments and their responsibilities are
shown in black in Figure 1.2. The organisations that deliver education and training services
are nursery schools, playgroups, childminders (for early years) and individual schools, col-
leges and universities. These are managed by agencies and councils (in blue in Figure 1.2).
Not all agencies are shown. Agencies are public bodies that operate at arm’s length from
Government. They are managed by a Chief Executive and work within a framework of
policy set by the Government. All service deliverers work within funding streams, targets
and, where appropriate, local priorities and preferences. Early years provision is a complex
mix of public and private sector, corporate and individual provision. Those who register
with Ofsted (The Office for Standards in Education) are assessed by Ofsted. Registration
is mandatory except when a child’s close relatives are involved, or the child is cared for
two hours or less each day, or no money changes hands (the actual regulations are more
complex than this, but this is the gist of them, September 2012).
Local authorities still play an important role in shaping service provision. The Early
Intervention Foundation is newly established. Its role is to determine what works in early
Chapter 1 RESEARCH: A MESSY BUSINESS 13
intervention and to advise local commissioners. At school level some schools are still di-
rectly funded through a local educational authority. An increasing number are choosing
to be funded directly through a new Government agency, the Education Funding Agency.
For this they receive an increased allowance. Thus the very act of service delivery repre-
sents policy, so how an organisation chooses to work and the relative priority it gives to
tasks can effectively shape policy.
Other agencies operate independently of service deliverers to monitor and maintain
standards. The Office for Standards in Education (Ofsted) is the quality assurance agency
for schools and early years providers and the Office of the Qualifications and Examina-
tions Regulator (Ofqual) regulates qualifications and examinations. Three other agencies
were established when the Government changed. They took over work previously done
by other bodies and became more integrated into the Department for Education. The
Standards and Testing Agency is responsible for all statutory assessments up to the end
of Key Stage 3. The Teaching Agency is responsible for the flow of recruits to the teach-
ing profession, staff retention and sustaining quality in teaching. The National College
for School Leadership, which was a ‘non-departmental public body’, is now an executive
agency of the Department. Its responsibilities have evolved but are fundamentally con-
cerned with leadership good practice and leadership training. From a policy perspective
bringing this role into the Department gives Ministers greater control over its activities.
The responsibility of the DfE extends beyond teaching, teachers, pupils and students
to include, for example, health and safety, buildings and finance. Other Government
departments’ work impacts on young people and the education service. The Department
for Business, Innovation and Skills (BIS) is responsible for higher education, lifelong learn-
ing (especially the National Apprenticeship Service) and, through the Research Councils,
policy on science. Other departments implement cross-cutting actions. Rather than see
how they intersect with education and schooling, it is easier to understand how schools
link with those departments to address deep-rooted social problems. These revolve around
issues of school exclusion, school absence, youth culture, behaviour and the youth justice
system, community development and youth employment and arise from a key piece of leg-
islation, The Children Act 2004. These areas interact locally in Local Safeguarding Children
Boards. The recent past suggests that this organisational framework and the names of the
bodies may well change whenever there is a change of government.
The situation in Sweden and the Netherlands is as complex as it is in the UK
with a network of agencies and authorities managing the educational programme
(see Figure 1.3). Note that it does not include the social and economic areas of interest
to educational researchers that were set out in Figure 1.2. In Sweden the management
of the system is structured according to educational level with national agencies respon-
sible for schools, vocational training institutions and higher education. While those for
vocational and higher education deal directly with the institutions that they fund, the
agency for schools works with municipalities and it is these that have ‘hands on’ respon-
sibility for dealing with the schools. The School Inspectorate is an independent agency
responsible for monitoring standards. In the Netherlands the situation is similar. In
both Sweden and the Netherlands several of the areas that in England are dealt with
by separate ministries are integrated into the Education Ministry, for example, higher
education and research, further education, youth policy and gender equality in Sweden,
and higher and vocational education, culture and media in the Netherlands. Theoreti-
cally this should make action across these boundaries by education providers more
effective. Day-to-day administration in the Netherlands occurs at an institutional level
in vocational institutions and universities and by municipalities for schools. Provinces
have some regulatory responsibility as well. The Netherlands, as well, has a number
14 Research Methods for Education Part 1 The Context for Your Research
Sweden Netherlands
Ministry of Education and Research Ministry of Education, Culture and Science
Schools Schools
Figure 1.3 Structure of the education systems in Sweden and the Netherlands
of agencies. The Central Funding Agency disburses money. The Education Inspectorate
manages an inspection regime and the Educational Council and Science and Technology
Advisory Council provide general and specific advice to government.
The implication of this breadth across departments and policy areas is that educa-
tional researchers have to be outward as well as inward looking. The complexity of edu-
cation as a research area means that educational researchers not only have to be aware of
policy over a wide range, they need also to have a breadth of capability, knowledge and
understanding that will enable them to cope with the complex and consequently messy
world of education.
What we have learnt so far is something about the nature of educational research and
the character (and breadth) of the research community. You should now be able to see
where you fit into the research community (and, by implication, how you can build a
career around research). For everyone in this research community, there are two funda-
mental objectives, each of which must be achieved with every piece of research. These
objectives are the foundations of successful research, whether it is an undergraduate
project, a piece of personal research to improve practice or an investigation into policy
effectiveness. You need to understand these objectives because they will test you in every
project and at every stage in your research career.
field. This not only gives us background out of which we can begin to shape our own
views and ideas, it also throws up whether someone has already done the work that
is beginning to interest us. We can start looking from either a theoretical or practice
perspective. A theoretical perspective means that we read what others have written
about how the elements we are investigating should relate to each other, or about how
they do relate to each other and about what should happen in order for situations to
improve. If we get into a topic through our own work or practice, we will find ques-
tions such as ‘How can this be improved?’ and ‘Why did that work well?’ focusing
our enquiry. Often we will read what is written in academic journals or what govern-
ment publishes in reports and, sometimes, what journalists write in newspapers and
magazines. We need to get a sense of the different perspectives people have brought
to an issue. Texts and review papers are good for this. We have to appreciate the range
of research on an issue, where the balance of research effort has been and how it has
changed over time. This enables us to understand how issues emerge and perspectives
evolve. Databases of articles that we can search by topic and by period are good for this
(see Activity 1.1). Reading is important throughout the research process. It helps us
We can now access many research articles via the t British Education Index (http://www.leeds.ac.uk/
Internet. Some are available free of charge on the bei/) is a growing database of British research held
Web, others are available through subscription or on at Leeds University.
a pay-per-view basis. Many educational institutions t Australian Education Index (http://www.acer.edu.
and organisations subscribe to on-line journals. You au/library/catalogues/aei.html) is operated by the
will need to practise accessing research information Australian Council for Education Research.
via the Internet. There are two ways of going about
it. Both can be time-consuming but with practice you
will get better. Activity 1
Look up the name of one (or more) of your tutors
t If your institution subscribes to e-journals you can (a) on-line via your library search facilities and (b) in
type in your institution’s password for each journal each of the Web search options listed. Make a record
you want to search and then access the contents of what each approach gives you. Now look up the
of each year or volume, much as you would if you tutor’s home page and assess how much of the listed
were picking volumes up off the shelf of a library. research output is given by each search option.
As yet, it is not as quick just to search the Internet.
t The second way is to search a database of articles
taken from a range of journals. Many databases Activity 2
are now on-line and available through institutional Use the on-line search options to see how many refer-
subscription. In addition, here are some public ences each gives you for (a) phonics and (b) synthetic
databases: phonics (both are approaches to teaching reading).
By looking at the references for the first two pages
There are, as well, Web search options.
only, assess what proportion of the publications come
t Google Scholar (http://scholar.google.com/) is from the USA, UK and Australia.
available on the Web. Whichever search engine
you use, there is usually a hyperlink to take you to What you should learn from this is that:
the interface of the journal in which it is published. 1. we need to look at more than one database in
t Google Books (http://books.google.com/) offers a order to get a rounded picture, and
similar facility for books. 2. differences in searches on different databases can
t ERIC (http://www.eric.ed.gov/) is a major repository be a good indication of the significance of the is-
of education research, principally from the USA. sue in the research and policy communities that
Each entry is usually accompanied by a full précis. the databases serve.
16 Research Methods for Education Part 1 The Context for Your Research
identify our topic (see Chapter 5) and then refine it. As we start to crystallise our own
ideas and focus on a particular area, we need to put this at the centre of our reading and
work outwards, getting a more detailed picture of the research that has taken place. At
this point, we are beginning to move into the ‘literature review’ that is considered in
more detail later in the text (see Chapter 5).
However, we do not always have to begin by knowing a lot before we start. Some
research starts by trying to make sense of data. Some researchers wait for major statistical
series to be published, a census of population would be a good example. We could look
at the ethnic and socio-economic mix in an area and see if this is reflected in the com-
munity’s schools, or we could identify where the population is growing most in order to
judge whether the provision of school places has kept pace. Other researchers just come
across data that makes them think and then starts them on a long journey of enquiry.
A lecturer looking for a data set for students to work on can stumble on something that
is intriguing, where some results seem to be obeying one rule and others another. For
example, the educational attainment of children looked after by local authorities is very
poor. But what if you found an authority where the young people’s attainment was far
higher than usual? Questions follow and answers are sought by looking at other data sets
or through a field investigation.
All of this can be summarised as ‘make sure that we understand our issue and where
it is located in the field of education before we start’. If we want to find something
out, we have to maintain this focus throughout. We have to ensure that the data and
information we collect are directly linked to the issue we are investigating (or as directly
as we can make them). It is very easy when we are reading to follow up on references
and then discover that we have strayed a long way from our topic. And finally, before
we start, we have to have a good idea of what techniques we are going to use to extract
the information and meaning from our data. So, if we want to be sure that we will find
something out, we have to make sure that we have a research plan in place before we
start. This should specify the focus of our research, identify our data needs, how we will
obtain our data, how we will process them and place the whole programme in a time
frame. Remember though, it can always be changed if conditions and circumstances
alter.
To summarise, if we want to be sure that we will find something out that is new and
which adds to the body of knowledge we have about education, then:
(a) Validity
Validity is, in many ways, a problematic concept because it is affected by the research
procedure, the researcher’s position and the world that receives the research results. With
this situation we could be forgiven for thinking that the research world is a difficult one.
Actually it is not that bad, just that in a few situations it can be a hard life. In general, we
are likely to find it more accommodating than we might expect. If you are submitting a
thesis for a research degree and, in the thesis, challenge some established positions, the
likelihood of the examiner failing you is low even if they hold a different point of view.
One of the benefits of academic freedom is the ability to disagree with others – but only
when you can support your argument! Of course, there are situations when expressing a
different point of view could be the equivalent of committing academic suicide. In the
past, when academia was much smaller and some subject associations were dominated
by a cabal that controlled access to journals, it could be difficult for academics to get con-
trary views published. This situation may exist when not only reputations are at stake but
also personal and research income. The British Medical Journal makes it clear that people
reviewing or commenting on papers should reveal ‘competing interest’, which it explains
as a situation where ‘professional judgment concerning a primary interest (such as patients’
welfare or the validity of research) may be influenced by a secondary interest (such as
financial gain or personal rivalry)’. This is clearly an issue of ethics in research, an issue
that we shall look at in more detail shortly (see Chapter 2). When researchers move out-
side established traditions they may well encounter criticism and hostility. This might
apply, for example, to sociologists who look to explain behaviour in terms of the physical
18 Research Methods for Education Part 1 The Context for Your Research
(b) Acceptability
The issue of the acceptability of our research is a complex one because it has to take ac-
count of the quality of our research work and the acceptability to our audience of our
conclusions.
If our results are not valid, then they will not be accepted. Valid results are well
grounded in the evidence we present and are justifiable in the circumstances in which
we collect and analyse our data. If our conceptualisation and execution of our research
is poor, our results will not be accepted. This is a situation faced periodically by all
departments in higher education in the UK that would like to receive funding to carry
out research. The Higher Education Research Council for England (it is similar in other
parts of the UK) judges research and will not fund departments whose research output,
as assessed by peers, is poor.
But even if our methods and methodology are appropriate, there is still the issue
of the acceptability of our message. If our conclusions run counter to expectations,
prevailing wisdom or the direction an organisation would like to go in, then they may
not be accepted. In his 1999 address to the British Educational Research Association,
Peter Mortimore pointed out that performance-related pay for teachers was not an ini-
tiative that could be supported by research evidence and that there was specific research
that drew attention to why it would be a retrograde step (Mortimore, 1999). But it was
still introduced. He drew attention to the way in which other education research has,
over time, run counter to government initiatives. The implication of this is that political
values are more important than good research practice when it comes to the acceptability
of education research.
If our conclusions are not accepted, the appropriateness of our procedures is likely
to be questioned. As researchers we should always remember that established
orthodoxies (and organisations wanting evidence to support a decision) do not like to
be challenged. If we are challenging orthodoxy, we should make sure our procedures
are robust.
You should, by now, have picked up some messages on the position that this text takes
on research. It is that:
t research throws up problems;
t problems have to be solved; and
t there is always more than one solution.
20 Research Methods for Education Part 1 The Context for Your Research
The research community is large, fragmented and has different interests, purposes and
goals. Despite this fragmentation, there are objectives that are common and determine
how acceptable the research world will find the research we do. Surprisingly the research
world of researchers, policy makers, consultants and those who commission research is
a fairly forgiving place if researchers have to compromise with what research theory says
should happen. Why is this? Think about it; it is obvious why researchers take a relaxed
view about research procedure; if they did not, it would be very difficult for them to un-
dertake much research themselves. The guidelines that follow are not meant to provide
you with authority to break any rule in the research rule book but they are designed to
give you the confidence to argue your case and to give you an idea of what you have to
do when you research in the real world.
and more than anything else, this means that we have to compromise with perfection.
It certainly is a good idea to pre-test data collection procedures to make sure that they
deliver what you want them to – but what can you do if time does not allow it? It would
be good practice to test the significance of a dozen possible influences on the behav-
iour of a population but what if your research design, with you as the only researcher,
requires a sample size in excess of 30,000 in order to do this? When we start designing
our research programme we must audit the resources available to us, note when they
are available and estimate the call on our resources of various procedures. We should
make a list of the resources we have and those we will need. In general, we are likely
to require resources for data collection (we may be able to call on friends, colleagues
or students), for specialist advice (on data processing, for example). We will require
access to information sources (both traditional and on-line), phones, faxes and post
(and unless someone else is picking up the bill, we will have to pay for these). We will
also need to know how much time we have available to pull it all together. We should
now see that developing a research programme is not something we do and then imple-
ment. We have to make sure that the procedures we are using will give us the informa-
tion we need and that we can afford to use them. If the equation does not balance, then
it’s back to the drawing board. That’s the trouble with the real world, it keeps getting in
the way of what is best!
The columns in Table 1.2 describe the impact of things happening. Impact is a
measure of cost and consequence. If our hard drive crashes, taking what we hoped was
almost the final version of our thesis with it, that is quite an impact. The consequence
is that we have to start writing all over again and the time it takes is either a real or
an opportunity cost. Impact is expressed in terms of the recoverability of the research
programme. Again there are three states: devastating, that is the programme is non-
recoverable; moderate, that is the programme is recoverable with some effort; and
low, when the programme is recoverable with little extra effort. We have to determine
the limits to each category of recoverability. For example if our research is scheduled
to last three months, up to two weeks to recover to where we were might be ‘low’, be-
tween two weeks and one month may be ‘moderate’ and anything over one month is
non-recoverable. For other lengths of research programme we would choose different
boundaries to our categories.
Table 1.2 also shows the acceptability of something happening without our taking
any action to avoid it. Acceptability is determined by a combination of likelihood and
impact. How we label each cell in the matrix is a matter of judgement. Table 1.2 is not
definitive, it is just my view. For those cells labelled ‘Negligible’, I would not plan any
mitigation measures at all. For those labelled ‘Unacceptable’, I would ensure that I would
take action to minimise the risk, while for those labelled ‘Acceptable’, I would for some
or if the timescale for the research was short. This is the theory. What we have to do is
put this into effect. Experience, and making mistakes, are great teachers. However, we can
build experience without actually researching by analysing other people’s research. Start
this process by looking at Activity 1.2.
Read any journal article that has researched an edu- impact they would have had on the research (think
cational issue through the collection and analysis of in terms of time); (c) what contingency you would
information (that is, it should not be a conceptual build in to accommodate them. Think in terms of
or philosophical paper). It does not matter how the the following as a start:
information was collected as long as it was, at least t Sources of information: think about alternatives
in part, from ‘primary sources’, for example, straight to the ones used, how many there are, their
from the horse’s mouth or by direct observation. availability when the researcher needs them in
Data collected by questionnaire would be suitable, an emergency, the cost and effort of setting up
as would interviews, written accounts, pictures and a second programme to generate information
watching people and recording their actions. that might never be used.
t The potential for obtaining information that is
t Are any problems identified by the author? If so, variable in quality: if you are employing people to
at what stage did they occur in the research pro- collect data, what is the likelihood of their using
gramme and could they have been foreseen? the same procedure? What is the cost of training
t Look at the research process in terms of the follow- them? What is the cost of monitoring their work?
ing stages: t The possibility that the resources you thought
t collection of data would be there suddenly disappear.
t storage of data t Whether the quality of the data supports the
t processing of data type of analysis. Do you actually feel convinced?
t analysis t Loss of data and loss of files: how can this happen?
t writing and reporting. Are there times when it is less serious? Is there a
t For each stage, consider (a) what problems might point when it becomes catastrophic? What could
have occurred; even if they did not, (b) what you do?
Chapter 1 RESEARCH: A MESSY BUSINESS 23
issues we have investigated at the centre. If we do not do this, then our work will not
have a good storyline and will read like a series of loosely connected passages. Well-
written research should be like a good novel; it should take the reader along with it, the
bits of the story falling into place in a convincing and compelling way and, not least, it
should be a good and interesting read.
The guidance to take away from this is that we should not be afraid to revisit our deci-
sions and that we should make sure we tie up all the loose ends before we embark on the
next stage of implementing our research plan.
This first chapter is all about giving us context as education researchers. In this last part
we will get a perspective on the character of educational research.
The popular image of research comes from science. We could be forgiven for thinking
that some scientists wear white coats just to reinforce the popular image of science.
Chapter 1 RESEARCH: A MESSY BUSINESS 25
The reality, of course, is that IT analysis and modelling now takes the place of much
laboratory work – and you do not need a white coat to work with a computer. The
image, however, remains and it generates an interpretation that research is something
that is set apart from everyday experience. When we look at educational research we
have a bit of a problem with this perception because the object of study in education
research is everyday experience. This places the education researcher in a very different
position from the scientific researcher. While education researchers are still experts, they
are experts in an area where a significant proportion of the population can have strong
and valid opinions. It is important that we, as education researchers, understand this.
So, in the public mind, while the scientist is doing good work manipulating the DNA of
plants to generate medicines, the education researcher could be part of the problem with
‘education today’. This is not to say that the researcher is doing poor research, just that
its accessibility as an issue on which many people have a view means that educational
research is a highly contested area.
Use the Web to identify educational issues of con- debates’ and ‘educational problems’) and then see
cern to more than one country. Use national sites what the World Bank, UNESCO and the OECD have
in the first instance (try searches on ‘educational to contribute.
the toes of the criminal justice system, the educational welfare system and the social
services system all the way up to the Government. Working in a crowded field across
boundaries is interesting – but there are more people to offend. Be prepared to take
criticism and stand your ground.
How do we adopt this sceptical perspective? We need to ask the right questions of
every stage of the research that we are reading. Why was the research topic chosen? What
is the context for the research? The possibility of there being a political dimension to
research designed to improve practice may be low but if the research is designed to assess
a classroom methodology developed by the researcher, then at least one person stands
to benefit if it is shown to be effective. Look to see how an author explores and reviews
the research issue when defining the research question. Do we think it is balanced? How
far does the language introduce a sense of concern (problems unresolved or not even
recognised over a long period) or define the issue within a political context, for example,
testing a policy proposal of a political party? What evidence do we have of the stance
taken by the author in their non-academic workings such as newspaper articles?
It is not just what is chosen to be researched that we need to look at but also at
how the research was carried out. If we are reading something published in an academic
journal, we should check to see if it has been judged by other academics as being of an
appropriate standard, a process called ‘refereeing’. Most have been. This is one of the
guarantees of academic quality. Nonetheless, when we read it we need to assure our-
selves that the way in which the evidence has been attained is robust and that the conclu-
sions are warranted. You will be able to do this effectively by the time you have reached
the end of this text.
Finally, we have to read very closely the words used by the author. Enthusiasm is
acceptable but it can tip over to a level of commitment that begins to undermine the neu-
trality of the conclusions. ‘The failure of particular groups of children to perform at the
level expected of them’ is acceptable language; a ‘shocking waste of talent’ says much the
same thing but has an emotional content that lays bare the values of the author. When
you read, look for words that are laden with values; pay attention to the author’s style
and see if there is a shift, especially in the conclusion, from studied neutrality towards a
more committed tone.
If doing all of this sounds a tall order, it is not meant to be. All researchers should
develop the skill of reading behind the words to detect other messages but we should
not imagine that there is a radical manifesto behind each publication. Most work is
honestly attempted and often the most serious criticism that can be made is that it is
pedestrian. But then, in an army of researchers, there are bound to be more foot soldiers
than anyone else.
will enable it to go forward with what it wants to do. If our research results suggest that
there is genuinely a better alternative, then we have a duty of care to our client to make
the organisation aware of it. With all of these issues we begin to realise that collecting the
information is often the relatively straightforward part of research.
you to higher levels of understanding and insight, qualities that are inherently transfer-
able. You will, as well, get enormous satisfaction when you present your research as a
dissertation or thesis or as a research paper. When you see it typed up and bound, you
will feel a sense of satisfaction knowing that it represents your ideas, your toil and your
resolution in bringing the project to a conclusion. Your first piece of research is a rite of
passage. You will never look back.
The purpose of this chapter is to give new researchers an insight into the context in
which educational research sits. The key message here and throughout is the need for
researchers to be flexible and to allow the requirements of the research issue to influ-
ence the character of the research programme. If you have already been introduced to
research methods in education, either in an introductory course, or through another
text or through reading research articles, you might have been led to a different conclu-
sion. There are many texts and, almost certainly, many courses that present research as a
highly structured, procedurally ordered process. The expectation is that the researcher fits
a template over the research problem and then follows the rules. However, the templates
are not always appropriate; you as a researcher will have to make many decisions and
the research world is riven by intellectual and political positioning. Is research a messy
business? Yes, it is. It is messy because:
t the real world is not as clearly defined as research templates require;
t you will rarely have the luxury of an abundance of resources to draw on, so you will
have to make real choices and compromises;
t the transition from fact to opinion is not always clear cut, especially when the
research is on attitudes and beliefs;
t researchers mark out their territories and defend them;
t people use research to advance themselves.
You have to stay safe. Alienating others is acceptable only when you are absolutely
sure of your ground. To be a successful researcher, you have to know enough in order to
make choices. For every research issue you tackle, you must draw on a broad understand-
ing of research principles, models and methods. Without this, your choice is constrained.
This text will give you the understanding you need. Because it is driven by the need
to find answers rather than follow procedures, it will give you the confidence to put
together research procedures that are fit for purpose. Now let’s get started.
Summary
t The purpose of education research is to answer questions about educational issues.
Often the research aims to inform either policy or practice. In the case of policy, it
may be concerned with its development or effectiveness or impact. In the case of
practice, its goal is either to improve practice or to understand whether professional
practice (in the classroom or in administration and management) is effective.
▲
30 Research Methods for Education Part 1 The Context for Your Research
Further reading
Brew, A. (2001) The Nature of Research, Routledge Falmer. in different directions. In this book she explores the idea
The idea that the research process is uncontroversial of research traditions, the difference between research
should be nailed firmly on the head. All research, as and scholarship, how we create new knowledge, ways
Angela Brew points out, is conducted in social, political of looking at research – as a commodity, as a learning
and intellectual contexts which can pull the researcher process, and the link between research and teaching.
References
Mortimore, P. (1999) Does Educational Research Mat- in 40 Countries, TIMSS & PIRLS International Study
ter? Presidential address to the British Educational Centre, Lynch School of Education, Boston College.
Research Association Annual Conference, available Rose, J. (2006) Independent Review of the Teaching of Early
from Education-Line, http://www.leeds.ac.uk/educol/ Reading, Final Report, Department for Education and
documents/00001206.htm, accessed January 2008. Skills.
Mullis, I., Martin, M., Kennedy, A. and Foy, P. (2007) Scott Murray, M.T., Clermont, Y. and Werquin, P. (2005)
PIRLS 2006 International Report: IEA’s Progress in Learning and Living: First Results of the Adult Literacy
International Reading Literacy Study in Primary Schools and Life Skills Survey, Statistics Canada and OECD.
This page intentionally left bank
Chapter contents
Learning themes 33
Introduction 33
Summary 89
Further reading 90
References 90
Chapter 2
Learning themes
Getting to grips with the research puzzle: and how to use terminology appropriately to
produce an effective research proposal.
t You do not just start research. You do not By the end of this section you will:
get up one day having decided to research
an issue and go out to collect the informa- t Appreciate the significance of research
tion. There are influences that you have to philosophy and researcher values to edu-
take account of and decisions you have to cational and social research.
make.
t Understand the significance of ethics to
t We will see how research philosophy, educational research.
research paradigms and ethical principles
affect how we go about research.
t Understand the difference between
research methodology and research
t We will look at four important methodol- methods.
ogies: case studies, ethnography, evalua-
tion and action research.
t Understand the implication of how a
research question is phrased.
We will outline rules and principles that
govern how you put your research together
t Be able to begin the process of identifying
a research topic.
Introduction
If, after reading Chapter 1, you are feeling that can affect your research output. As you
getting into education research is like being go through the section, you will find that
sent into battle without adequate arms, then it will:
take heart. This section will begin the process
t demystify research terms;
of building your understanding of research
so that you can acquit yourself well. You will
t show you the confusion that can arise
when people use terms in different ways;
learn about things that influence the way you
go about your research and the procedures t begin the process of putting the research
that you adopt. You will understand about issue at the centre of the research enquiry;
the broad goals of research and how they and
34 Research Methods for Education Part 1 The Context for Your Research
explore wh at sorts of questions research in have to make choices about all of these. Making
education can answer. these choices ls not an arbitrary process. There
ls a whole series of factors that can influence
Research ls a very creative process, You have
your decision. These too are shown in Figure 2.1.
to bring together ideas about how to proceed,
This is what we shall examine here. In the first
resolve questions about what you want to
part ofthis section we shalllook ftrst at ideas that
achieve, select the research tools you want to
influence the research process, then at ideas that
use, identify the issues you want to study and
influence the outcomes you want your research
even the context in which you would Iike to
to achieve and finally ideas that influence the
study them. Figure 2.1 shows three principal
research lssue and the research question. In the
components of any research process: the goals
second part of the section, you will be guided on
and the outcomes you can achieve, the processes
the first stages of developing a research project.
you can use and the issues you can research. You
Research
Methodology Methods Model
question
Research Research
agenda problem
Research issue
Research
question
Research is not always a matter of collecting information and answering the question. Some
people might think that you are not asking the right question, others that you have not
used the right approach to obtain your answer. Others might question whether you have
the right information to answer the question. Unless you know the decisions you have to
make about the way in which you are going to do your research and unless you know on
what basis to make those decisions, your work could be seriously flawed and you could
be walking into controversy and an unfavourable critique of your research. In this section,
you will learn about the following concepts and how they can influence the way you work:
t Philosophy
t Paradigms
t Research principles
t Research methodology
t Research methods
t Research theory.
very different from a child whose parents work abroad and send her to boarding school.
We might, as researchers, set out to map out their life chances, but how easy would it be
for us to set aside emotion, sentiment and judgements of right, wrong, good and bad?
In other words, our view of what should be researched and our interpretation of the
research results is mediated by our own values and view of the world. Or is it? Can we
strive to be objective, neutral with respect to values, judging research data as facts to be
assembled into an understanding, and accepting what we find as reality? If we do, then
we can call ourselves realists, if not, then we are relativists, because if we interpret evi-
dence through our moral, political, economic and cultural perspectives, then we have to
accept that others with different perspectives can have a different interpretation (though
we can still draw the line in accepting the validity of their interpretations).
These differences in how we as researchers view the world will impact on what we will
accept as evidence for knowing the world. This takes us into a new area of philosophy:
epistemology. At its heart from a research perspective, epistemology asks the question
‘how can we be sure?’ How can we be sure that someone else’s interpretation is correct?
The answer is only by seeing the evidence and learning how the evidence was collected.
But if I were a realist, would I necessarily accept the evidence of a relativist? If my real-
ism were deeply engrained, then it is not likely that I would accept it as evidence at all
because the assumptions under which it was gathered conflicted with my own view of
the world as an objective reality. But then neither would the relativist accept my evidence
either and, probably, neither of us would accept that the other’s research questions were
the right ones to ask in any case. So the view we take of the world affects the questions
we ask and what we will accept as evidence in order to help us make sense of the world,
ontology and epistemology linked. The consequence of this is that research as an activ-
ity has developed schools of practice, each with its own philosophical standpoint that
crosses subject and disciplinary boundaries. These philosophical positions can influence
not just how the research is conducted but more importantly what is researched and
how the evidence is interpreted. If you thought that all you had to do was accurately col-
lect some data and honestly report your findings, then think again. If you do, there are
researchers who, from their own philosophical standpoint, will castigate your research
naivety. If you are to defend yourself, then it is important to understand what you are
defending yourself from.
Research philosophy is a complex field and it is difficult to put together a simple
framework that incorporates the main philosophical positions and standpoints. Re-
search philosophy is multi-dimensional; that is, it comprises approaches and beliefs that
are not mutually exclusive. If we look at a philosophy one way, we can use one set of
terminology and if we look at it another, then we use another. We could divide research-
ers into those who believe that a real world exists and those who believe it is a personal
construction (that is, reality exists only in our minds), between those who believe it is
their role to understand and explain and those who believe that out of understanding
should come change, between those who believe that social action is the product of our
exercising individual free will and those who believe that there are powerful structures in
society that constrain choice. In other words, the world of research philosophy is com-
plex and all this text can do is open a few windows to let you glimpse some of the main
domains of the research landscape. The areas that we shall look at are:
t Scientism and Positivism
t Humanism, Phenomenology and Existentialism
t Critical theory
t Postmodernism.
Chapter 2 UNDERSTANDING THE RESEARCH PROCESS 37
that two people should have the same view of what the ‘real world’ is (because the real
world on which they base their decisions and actions is inside their heads). In fact, for
humanist researchers, it is important to understand divergences in views. Humanism is
marked by a range of traditions but common to all is a belief in the value of the human
experience and particularly its significance in creating what is meaningful. If I, for example,
go to Auschwitz I can be sickened by the scale of the slaughter and the images I will see but
my reactions are the product only of my belief in right and wrong and my sense of human-
ity. If my colleagues, whose relations were killed there were to go, the experience will have
a different meaning for them. It will go far beyond what I feel, perhaps even to a sense of
guilt that they and their parents survived when so many did not. It is this difference that
has implications not just for the way in which research is conducted but also for what is
actually researched and, by further implication, what actually constitutes an appropriate
research question and an answer.
Humanistic approaches to research evolved very differently from the scientific. The
positivist approach would even question whether humanistic styles of enquiry con-
stitute research at all, largely because the phenomena under investigation (emotions,
viewpoints, perceptions, understandings) may not exist independently out there in
the ‘real world’. Not surprisingly, those who conducted investigations from a human-
istic perspective in literature, the arts and the social sciences, did not like the conclu-
sion that what scientists did was research, while their efforts were merely studies.
From their perspective their approach had formal procedures that gave worthwhile
results. It was, in other words, fit for purpose. Imagine we are studying what effective
leadership is in terms of schools. We could look at effectiveness in terms of student
outcomes but these could be the result of effective teaching and nothing to do with
leadership. We could measure leadership in terms of the turnover of staff in a school
(on the basis that teachers want to work with good leaders) but our results could
reflect a downturn in the job market. Our best option is to go and talk to teachers
and support staff and then to the head teacher to see if we can pull together themes
that seem to represent the data we have collected. It was with research like this that
another research tradition became established in a range of subjects (including educa-
tion and even in mathematics) and this tradition has become associated with a set of
research tools that go under the umbrella description of ‘qualitative’ (again, we will
look at this in more detail in Chapter 3).
The stimulus for the humanistic tradition in research can be found in French philoso-
phy of the 1930s but it was not until after 1945 that a new research approach began to
emerge. In many ways the development of a new approach reflected many of the ideas
of Thomas Kuhn (an American philosopher of science who worked in the middle of
the twentieth century) on how disciplines evolve and change. In essence, he argued that
ways of working are matched to the challenges of a time and that as time moves on, so
ways of working fail to find answers to all of the questions we want to ask. In the case of
educational research there was a sense that positivism could not provide all the answers
to the questions that educational researchers were asking and a belief that behaviour was
affected by how people viewed the world. Educational researchers were influenced by
what was happening in sociology in the 1960s and 1970s. In sociology the research focus
moved from one of looking for order (typified by Ernest Burgess, Professor of Sociology
at the University of Chicago, whose work on the social structure of the modern American
city influenced two generations of sociological research) to understanding variety and
change in social systems. What was recognised in sociology (and other social sciences)
was that there were issues worth exploring that existing positivist approaches could not
manage. From the 1960s, themes of social justice and equality emerge within research
Chapter 2 UNDERSTANDING THE RESEARCH PROCESS 39
programmes. While positivism could tell us how many people were poor, it could not
help us to understand what it was like to be poor. This is the essence of humanistic re-
search traditions.
The division between the scientific and humanistic research traditions has often been
bitter. The situation in Sweden is typical of many other countries. Englund’s review of
Swedish education research (Englund, 2006) identifies the decline of the quantitative tra-
dition from the 1970s onwards and its replacement by what we can loosely call a human-
istic approach. The sorts of issue that have been important for education when investigated
through a humanistic perspective include the emotional state of children, understanding
the learning process, interpersonal relationships, managing change, experiencing educa-
tion as a student, teacher and parent, leadership, self-representation and consumption and
status. Humanist researchers would not see their main purpose as measuring these but
capturing the experiences that help us understand what we might do to change, man-
age or reproduce those experiences. All of these are areas where people’s views, feelings,
emotions and concerns are treated as valid influences in understanding how they interact
with the world. The research environment, from a humanistic perspective, is not black
and white but a rainbow of all colours where the hues gradually merge into each other.
One area where humanistic approaches are gathering momentum at the present time is in
medical and nursing research (including research into education, learning and training),
again areas that have long been dominated by the scientific tradition. Can you think why
this might be? If you want to improve your understanding of humanistic research, look at
Activity 2.1 in which you will explore what researchers do by accessing ERIC, one of the
research databases introduced in Chapter 1.
The focus of humanistic enquiry is the world we inhabit, the world we create and the
world we experience. These are all worlds within our minds. There are two important
schools of thought that we should be aware of: phenomenology and existentialism.
Phenomenology
Phenomenology, very simply, is concerned with how we experience the world rather
than ideas and concepts about how the world really is. Implied in this is a concern with
the meanings we give to the things we experience. If you are a phenomenologist, then
the core of what you study is individual and collective experiences of what many refer to
as the ‘lifeworld’. The world that is explored by research is a perceived and experienced
world. The methods phenomenologists use are, amongst others, description, observa-
tion, reporting and reflection.
Dave Trotman, a lecturer at Newman University College, worked within a phe-
nomenological framework for his doctoral research (Trotman, 2006). His objectives
were to understand what it was like to work in primary schools (an investigation of
the teachers’ lifeworlds) and to consider how such understanding could be used in the
evaluation of educational practice. His approach was to work with a small group of
teachers. Initial discussions explored the role of the teacher and out of this emerged a
pupil focus, specifically a concern with developing the imaginative and creative capa-
bilities of children. He went on to delve into the differences between the teachers and
to seek to understand how far their approaches to developing creativity requires them
to enter the children’s lifeworlds. Trotman’s is a well-argued case for phenomenologi-
cal research but there are also many other examples. Phenomenology has been used
as a research approach to study employability and career development, the experience
of learning, group dynamics and inclusion, the value of special education settings and
understanding the processes through which we assume roles. Case Study 2.1 gives an
example.
40 Research Methods for Education Part 1 The Context for Your Research
references and see if you can find any that fit these t papers that deal with the application of humanis-
categories: tic ideals to the curriculum and learning process.
t papers that deal with identity and affective charac- For the first 50 papers accessed, count how many are
teristics (that is, those concerned with emotion and in each of the above categories. Then analyse what
feelings); topics are dealt with in each of the categories. Write
t papers that use the term ‘humanistic’ to describe a a report of no more than 200 words on the areas that
therapy or treatment; humanistic research deals with.
Existentialism
Existentialism is more difficult to pin down because of the many people who have
claimed to work within its assumptions and beliefs. Its focus is to understand the nature
of human existence. It is, in many ways, a self-centred view of the world with its emphasis
on freedom of choice and a belief that individuals must be responsible for creating value
in their own lives. It generates a research environment in which we seek understanding
by looking at the world from a personal perspective, subject to emotional pressures and
driven by conviction and desire. The novelist (and academic) Alexander McCall Smith
Chapter 2 UNDERSTANDING THE RESEARCH PROCESS 41
gets to the heart of the philosophy through the main character, a woman detective, in one
of his books. He writes:
Mma Ramotswe had listened to a World Service broadcast on her radio one day
which had simply taken her breath away. It was about philosophers who called
themselves existentialists and who, as far as Mma Ramotswe could ascertain, lived
in France. These French people said that you should live in a way which made you
feel real, and that the real thing to do was the right thing too. Mma Ramotswe had
listened in astonishment. You did not have to go to France to meet existentialists,
she reflected; there were many existentialists right here in Botswana. Note Mokoti,
for example. She had been married to an existentialist herself, without even know-
ing it. Note, that selfish man who never once put himself out for another – not even
for his wife – would have approved of existentialists, and they of him. It was very
existentialist, perhaps, to go out to bars every night while your pregnant wife stayed
at home, and even more existentialist to go off with girls – young existentialist girls –
you met in bars. It was a good life being an existentialist, although not too good for
all the other, nonexistentialist people around one.
While existentialism could be a rationale for living life in a particular way, it has also
been influential in generating representations of the world. As an artistic representation,
for example in literature or film, it could (and has been) a framework for understanding
society and, as such, has been investigated by researchers. The implication of this is that
what we might use as data in an existential investigation suddenly expands. We can ask
our subjects to represent their understanding and views of their world through creative
activities such as drawing and writing. If we ask them to shoot a video or take photos, we
see the world through their eyes. We can also work in metaphor, for example, if we were
to ask a group of young people to use a T V programme to describe their lives, we might
get better insights than by using a questionnaire. Themes that an existential approach to
research could deal with might be identity and the freedom to behave in a particular way
(look at Note Mokoti above. Such a theme could lead to an exploration of youth cul-
ture and deviance). Two areas of research in the USA seem to be particularly associated
with an existential perspective: the nature of curriculum and adolescence. Existentialism
is applied to curriculum as a design philosophy (I can find no work that uses it as an
investigative tool). An existential curriculum is essentially one of self-discovery. With
adolescence, it is used as a lens through which to find and look at research evidence. Case
Study 2.1 provides another example of existentialism in education.
In summary, while philosophical considerations can predispose us to collect and ana-
lyse data in a particular way (many will predispose us to use some type of qualitative
data analysis), their real significance is how they expect us to look at the issue we want
to research. We should think of them as a mindset we adopt when we approach our
research and which influences the type of issue we want to investigate and the nature of
the research question we ask.
principally from the UK. The method is very different in that the premise, that the claim
‘education services the capitalist economy’, serves as the basis for assembling the evidence.
(d) Postmodernism
The final philosophical perspective we shall consider is postmodernism. Postmodernism
rejects the assumption of a single explanation of things (such as science seeks to develop)
and recognises that the world is a complex place that is full of contradictions. How then does
this differ from modernism? Table 2.1 sets out some of the differences between them from a
researcher’s perspective. Modernism is marked by the search for order as epitomised by the
derivation of theory (for example, Marxism) and big themes through which to understand
events (for example, the idea that we should have theories of education that would serve
the needs of society and individuals is, itself a modernist ‘big idea’). Postmodernists believe
that all explanation involves assumptions, that the world is multi-layered, that people and
organisations can play several, sometimes conflicting, roles and that all understanding of
action is affected by the context in which the action occurs. A school where all the students
meet the national standard for assessment for their age may not be newsworthy, until we
find out that the majority of students do not have English as their mother tongue. Locality
and identity are strong postmodernist themes while modernism is more concerned with
‘grand designs’ and a focus on the general. In education this might be the introduction of a
national curriculum rather than giving schools the freedom to construct a curriculum that
meets the needs of their community. Postmodernists are critical of the way in which the
research process gives the researcher authority over those who are researched because this
replicates power structures found in the world outside research. In her book The Graffiti
Subculture, Nancy Macdonald argues that involving ‘the researched’ in the analysis of the
research data ensures that ‘interpretational authority’ is shared but she also adds that when
‘we collect accounts . . . we reserve the right to decree what people are really saying or really
doing’ (Macdonald, 2001, p. 20). Postmodernists seek ways to emancipate those on whom
research is done and find ways of giving voice to those whom modernists dispossess (by
virtue of their supposed neutrality in research and consequent non-involvement of those
who provide data). In the modernist world, knowledge is presented in cause and effect
models. Postmodernists assemble information in order to describe and, through description,
understand realities, that is, what actually exists rather than what exists in an ideal situation.
‘Modern’ societies are viewed as being ordered in terms of class relationships and family
structures; postmodern societies are seen as being fragmented, with new forms of family
(single parent) and partnership (same gender). Education policy and practice is not immune
Modernism Postmodernism
Understanding comes through reason and analysis. All analysis and explanation is based on assumptions.
Understanding is demonstrated in general laws Laws cannot predict individual behaviour.
about behaviour. Understanding requires context.
Understanding is timeless. Interest in the particular.
Focus on the general. Knowledge creates privilege and should be shared
Authority with the researcher. with those who might benefit from it.
Knowledge is presented in cause–effect models. Knowledge is assembled to reflect local conditions.
Chapter 2 UNDERSTANDING THE RESEARCH PROCESS 45
(ii) Paradigms
The next idea that may influence the way we look at or conduct our research is a para-
digm. This is a way of thinking about a subject and proceeding with research that is
accepted by people working in that area. This implies that both purpose and process
are widely agreed within a discipline or part of a discipline and that what is delivered
through research enables researchers to understand the world more effectively. The im-
plication of this is that if you work within a paradigm, you work with a set of rules that
determine your research procedures and that may even shape your research question,
and that there will always be a group of people sympathetic to your approach.
The idea of a paradigm was popularised by the philosopher of science, Thomas Kuhn,
to explain why science operated so successfully for long periods within a body of accepted
theory and procedure, only to change radically and move in a new direction governed
by a new set of challenges, goals, theories and procedures. He published his book, The
Structure of Scientific Revolutions, in 1962 and it was this that popularised the term ‘para-
digm shift’ to describe the move from one way of explaining and researching to another.
While Kuhn’s ideas about the development of science are now widely challenged because
the detailed history of what was happening rarely added up to a widespread ‘eureka’ mo-
ment, there is still an appeal in using the concept to understand big shifts in direction.
However, the paradigm concept is also used at a much smaller scale. In business
analysis, for instance, James Belohlav, a management professor at De Paul University
in the USA identified different business strategies over a 30-year period which he called
paradigms (Belohlav, 1996). The equivalent in education might be teaching and learn-
ing strategies that held sway over a period. For example, from the mid-1960s, compre-
hensive education was introduced to replace a system based on selection at the end
of primary education. While this shift was the result of a political policy (in much the
Chapter 2 UNDERSTANDING THE RESEARCH PROCESS 47
same way that changes from the 1980s onwards have been politically driven), how the
schools organised their teaching was a matter for the senior management team. Through-
out this period there have been debates between competing paradigms, such as those
who believed in whole class teaching and those who believed in some sort of separation
according to ability. In terms of education research, some approaches to research might
also be thought of as paradigms because they constitute ways of working which research-
ers believe produce valid and worthwhile results. Each approach has its own rulebook
and sets of procedures that have become accepted as canons of good practice. The differ-
ence between the way in which ‘paradigm’ is used here and the way in which Kuhn used
it is that paradigms can co-exist, whereas Kuhn argued that they could only co-exist for a
short period before one succeeded another to become the dominant mode of thinking.
What sort of research approaches might we consider as paradigms? This is not as easy
to answer as it might seem. The reason for this is that, as social scientists, we have been
rather loose in our use of terminology. In particular, we have not distinguished between
a paradigm and research methodology. Look at the definitions below, taken from the
Oxford English Dictionary:
t Paradigm: a conceptual model underlying the theories and practice of a scientific subject.
t Methodology: a system of methods used in a particular field.
From the definitions it should be clear that the concept of ‘paradigm’ functions at a
higher level than methodology; it ties the way a researcher works to ideas about what it
is appropriate to investigate and on what basis the research output should be considered
to be a truth. In other words, it links research philosophy and the practice of research.
We shall look at some research frameworks such as ethnography and case study research,
when we discuss methodology. We should be aware, though, that many authors call
these (and other methodologies) paradigms. In this text I refer to them as methodologies
because they are more concerned with the processes of collecting and analysing data and
not higher level conceptual thinking involving assumptions, concepts and values. We
now look at quantitative, qualitative and mixed approaches as paradigms. We shall consider
the implications of approaching research from these three perspectives in greater detail
in the next section of the text (see Chapter 3). Now all we have to understand is why they
can be thought of as paradigms. The issues are summarised in Table 2.2.
process; they are neutral technocrats who have no commitment to the issue under inves-
tigation other than to seek the truth (that is, the researcher is not swayed by emotional
attachment to an issue or commitment to a cause). Quantitative approaches have logi-
cally consistent procedures through which to pass the evidence and reach a conclusion.
The values are those of science, the assumption is that of an objective reality and the
concepts are concerned with the nature of what is evidence and what constitutes proof.
The pragmatism of the mixed methods approach has potential benefits for many
education researchers. This is because we are often concerned with an issue at both a
significant scale (for instance, teenage pregnancy, in which the UK has very high rates
in comparison with other countries) as well as the conditions that give rise to it and
the experiences of those affected by it. These are different levels of enquiry and a mixed
methods approach is, essentially, complimentary, with quantitative approaches dealing
with the issue of scale and qualitative with the issue of experience.
(iii) Principles
Paradigms are concerned with rules. For the researcher working within a paradigm, freedom
to act in a particular way may not exist. As we will see in the next section (see Chapter 3),
each approach has its own rulebook and sets of procedures that have become accepted
as canons of good practice. However, even with rules it is possible for two researchers to
work in different ways. There is, therefore, an area where researchers have freedom to take
decisions and their decisions are constrained not by rules but by principles. As a researcher,
we have to identify our principles and then ensure that we comply with them throughout.
Principles reflect the moral position that you have, so, in this sense, there is a link to
philosophy. There are situations in which you can legitimately work outside of a prin-
ciple but only if the moral good outweighs the moral loss. If we are convinced that
learning by finding out is the most beneficial approach for students over the long term
(because it is learning that stays and is not forgotten), then it may well be worthwhile
helping one group to learn by discovery and sacrificing another by teaching the same
material in a didactic way just to test whether your way is genuinely better. The disad-
vantage to the second group is (we believe and hope) outweighed by the advantage to all
other students in the future. If, however, we ever think of breaching a principle for the
sake of expediency, then we must hope that we are never found out!
How do we determine what our principles are when it comes to research? Here are
some pointers:
1. We have to consider our position with respect to honesty in relation to the data we col-
lect and present and the results we publish. It is very easy to ignore data that does not
fit in with the interpretation and explanation that we are working towards. We might
also be tempted to ‘construct’ data that fits the ideas we are testing. From a pragmatic
point of view, it is not sensible for a researcher to falsify data because if (or, more likely,
when) someone else tries to replicate our results and finds that it cannot be done, it
is our reputation that has been ruined. As researchers we have a moral duty to other
50 Research Methods for Education Part 1 The Context for Your Research
researchers to be honest in how we work. The best way of demonstrating our honesty
is to be transparent with our methods of data collection, analysis and interpretation.
2. We have to decide how far our responsibility extends to others that we involve in our
research. Do we name and shame people if they are doing or have done something
wrong? Do we maintain the confidentiality of people who give us sensitive informa-
tion? Do we allow readers the ability to identify organisations that have given us
access as case studies? These are difficult areas and we need to determine where we
stand. One moral principle that may help us is the notion that people or organisa-
tions with whom we work should not be harmed by how we work or what we do with
the outcomes of our research. However, principles can conflict with other principles
and we have to determine which ones are overriding. For instance, if our research into
school management suggests that there is financial corruption which, if made public,
will inevitably affect the reputation of the school, should we ignore it or is the pos-
sibility of there having been criminal activity a higher moral principle?
3. We should think about whether it is appropriate to share the benefits of undertak-
ing the research and publishing the results and, if so, with whom. Is it acceptable to
get information from an organisation, publish the results or present a thesis and not
tell the people we contacted what we found out? If we do this, then our research is
rather like a smash and grab raid. It could be labelled ‘intellectual imperialism’ where
the researcher stands aloof from the subjects and treats them as disposable suppli-
ers of data. If our research generates benefits, who shares in these? If we are examin-
ing the educational consequences of disadvantage, have we an obligation to help our
respondents understand the consequences or even to help them overcome them? We
need as well to take a position on who owns the intellectual property embodied in
our analysis and conclusions? If you are a student and you have been guided by a
supervisor, what rights does the supervisor have to your conclusions? If your work is
published, whose names are on the publication – and whose is first? Not surprisingly
this is a contentious area. Of what significance are the discussions that led to the actual
activity of data collection and analysis? At the heart of the issue is whether the research
is the product of one person or the result of advice, guidance and even collaboration.
4. How far should those involved in our research be free to withdraw? This question
suggests that, at least, they ought to know that they are participating. If we ask ques-
tions either directly or in a questionnaire, then at least they will know that some-
thing out of the ordinary is occurring. If we are testing computer assisted learning
against classroom learning then participants may not know that they are part of an
experiment. Should they? And if our participants are children, should they consent
or should we just obtain the agreement of parents and, if necessary, teachers? This is,
essentially, a question of when children become responsible beings and when their
rights are superior to those of adults who notionally have responsibility for them. A
research group from the University of Huddersfield led by Rachel Balen has explored
just this issue. They use the phrase ‘active beings’ to describe children who are mature
enough to make informed decisions and the phrase ‘human becomings’ to describe
them at points before this (Balen et al., 2006). They point out that children’s perspec-
tives may be different from adults’ but are equally valid, even if they do not wholly
appreciate the nature of the research. This is an interesting argument that suggests
that children can assent to participating in a research endeavour when they appreciate
that the outcome can be beneficial to others but without understanding the exact na-
ture of the benefit. The research team gave an interesting example of this – obtaining
data from the children of drug addicts.
Chapter 2 UNDERSTANDING THE RESEARCH PROCESS 51
The guidelines below are from the British Educational permission should be obtained from the school,
Research Association (BERA, 1992). Look at the BERA and if they so suggest, the parents.
website http://www.bera.ac.uk/publications/ethical- 9. Honesty and openness should characterise the
guidelines and identify the ways in which the latest relationship between researchers, participants
set of guidelines differ. and institutional representatives.
10. Participants have the right to withdraw from a
1. The British Educational Research Association study at any time.
believes that all educational research should be 11. Researchers have a responsibility to be mindful
conducted within an ethic of respect for persons, of cultural, religious, gendered, and other sig-
respect for knowledge, respect for democratic nificant differences within the research popula-
values, and respect for the quality of education- tion in the planning, conducting, and reporting
al research. of their research.
(iv) Methodology
The next concept that could influence the way you do your research is methodology. But
before we look at it in detail, there is something that you should be aware of. As you read
more literature about conducting research, you will realise that authors fall into one of
three groups:
t those who use the terms research methods and research methodology very precisely and
with different meanings;
t those who see little distinction between the two terms and use predominantly one or
the other to mean what the first group refers to as research methods; and
t those who are ‘flexible’ in their use of the terms and use them interchangeably.
As you will appreciate by now, one of the themes of this text is flexibility in developing a
solution to a research problem. However, that flexibility should not really extend to the way
in which researchers apply concepts or apply meanings to terms. If we do not use words with
precision and with constancy (and I too have been at fault in not following this advice) then
there is limited basis for effective communication and academic debate. Research methods
and research methodology are quite distinct ideas and should be used in different ways.
Research methodology is concerned with the assembly of research tools and the applica-
tion of appropriate research rules. Research methods are the research tools themselves, for
example, questionnaires, observation, statistical analysis. At its simplest, for the practical
researcher, methodology is how the toolkit of research methods is brought together to
crack an individual and specific research problem. In some disciplines it is still possible
to talk of a dominant methodology but this is not the case in education where there are
many competing methodologies, each with a distinctive combination of principles, pro-
cedure and practice. What we will appreciate is that certain methods are closely linked
with specific methodologies and, by virtue of this, specific philosophies. It is impossible,
in this book, to cover every methodology used in education research but it is important
to say something about the principal ones.
and found that a very large proportion of those who had a reading age of seven or
less had not experienced any nursery education while those with a reading age of 10
or above had, we would (a) have found a pattern and (b) a possible process that had
influenced the pattern. These processes constitute the general actors at work.
t Second, we are also interested in variations from the expected. Why do these pupils
from this school who have not had access to nursery education have a reading age of
10 or more? What is happening locally to disrupt the expected pattern? The question is
whether a study of the individual case can tell us anything about the general situation.
The answer is that it can. For example, we might ask what a study of an individual school
can tell us about schools in general. We could use this information in the following ways:
t We could look at statistics that place one school in the context of others, either na-
tionally or locally, and then see the school as being characteristic of where it is, for
instance, average, above average, well below average etc.
t We could look at one school in the context of case studies of other schools to see what
common features emerge.
t We can also generalise about something like group behaviour within a case. If our
case study involves surveying the attitudes or behaviour of a large group (such as all
the teachers in a group), then assuming that they are not all in one category (male,
young, in post a long time), we are likely to generate data that is reasonably charac-
teristic of a typical group of teachers.
t Finally, we might have an expectation of what should be the situation in our case
study; we might determine this on the basis of an established theory or it might be
based on our assumption about what would constitute reasonable behaviour in the
circumstances. The case study becomes a test of our expectations, and, while it is a
limited test, it is, nonetheless a test. If you think about it, this is the principal way of
using the case study in medicine. The medical intervention is based on the theory of
what should happen and on the outcomes of previous interventions.
A case study can make use of a wide range of methods, documentary sources, statis-
tics, external reports and evaluations and so on as well as data collected by interview,
questionnaire and observation. Clearly these methods can be used by other methodolo-
gies as well and, sometimes, the differences between one methodology and another can
be difficult to determine. For example, a case study of pupil behaviour in a school could
be little different in terms of the methods used to collect data from an ethnographic
study in a school where the focus is on classroom behaviour.
The question of how case studies can be used in research requires us to look at them in
two ways, first, in terms of what they can be used for and, second, how they can be used
in the overall research approach. Let us look at each of these in turn.
t What can research issues be used for? Table 2.3 presents a matrix showing how we can rep-
resent case studies. The principal ways of using them is shown by the rows of the matrix.
These are the purposes for which we are using them. There are three: exploration, explana-
tion and description. We can use a case study to find out what is going on, to throw light
on something that we have never met before or do not understand. This is exploration.
We start by not knowing and we use the case study to establish understanding. With expla-
nation, however, we start either with a question, ‘Why?’ or with an assumption that this
is what will happen. In this instance our goal is to explain some sort of outcome, such as,
‘Why did the teachers take the students out when the weather forecast was bad?’ Finally,
we can use the case study to describe or record a situation. Given all that has been said,
Chapter 2 UNDERSTANDING THE RESEARCH PROCESS 55
Exploration How children manage When does a child Staff pass a motion The effectiveness of
with a new approach to bully others? of ‘no confidence’ procedures through
teaching numeracy. in college manage- which children are
ment. allocated places at
secondary schools.
Explanation How can counselling The growth in reports The circumstances Why some parents
help in resolving behav- of bullying behaviour. that led up to the feel procedures
iour problems? incident. to allocate school
places are inappro-
priate.
Description A day in the life of a Managing a parent’s How many hours a Leisure activities of a
typical primary head complaint about day staff spend on cohort born in 1996.
teacher. bullying. college activities.
even in this book, about the search for generalisation, the fact that description is identified
as a purpose at all may come as a surprise. If so, it will be an even bigger surprise to learn
that this is one of the principal uses of the case study. How come? Well, we only have to
think about the significance of practice improvement to the education system to begin to
appreciate why. Case studies of good or effective practice abound on teaching, classroom
management, community engagement, school organisation, institutional leadership and
many, many more. The columns of the matrix in Table 2.3 show what can be investigated.
Depending on the issue that we are investigating, we can look at it in the context of what
is typical or what is unusual or atypical. The former will give us a picture of what usually
happens and the latter what happens in more extreme circumstances. Good practice could
be an example of this, as could poor performance. We can also use a case study to inves-
tigate an issue or a process, such as the admissions process to higher education and how
it treats students from ethnic minorities or, as another example, the interaction between
schools, support agencies, the police and the youth justice system who are all involved if a
young person becomes involved with anti-social or criminal behaviour. Another example
of the use of the case study is to investigate critical incidents and their consequences. Such
studies might include the death of teachers and pupils while on a school trip, or identify-
ing the critical decisions that led to collective management failure, or when a teaching ses-
sion went particularly well or badly. Finally, there is a particular instance of a case study,
usually undertaken with government resources, that is particularly important – long-term
cohort study. This follows a group over an extended period of time. It is used in medicine
to track survival rates of cancer patients, especially those who have been involved in a new
treatment. The National Children’s Study in the USA records the relationship between
environment and health for 100,000 children up to the age of 21. The Education Longi-
tudinal Study (2002) is following a representative cohort of 2002 high school students
in the USA through to employment. In the UK, the Government commissioned an eight-
year-long study (ending in 2009) on citizenship education and its effects. There have been
long-term studies looking at the outcomes of full-time education in Sweden (Evaluation
Through Follow Up, which began in 1961), and in the Netherlands and France (educa-
tional pathways in secondary education). A good summary of some long-term studies was
prepared for the German Government as part of its policy making process, to determine
whether it should instate similar studies (Kristen et al., 2005).
56 Research Methods for Education Part 1 The Context for Your Research
t How can case studies be used? This is a question that relates to the overall objective of
the research. What is it that we are trying to achieve? Is it to find out or to answer a
question? If a question, is it generic or specific to the case? Are we trying to under-
stand operational details within an organisation or the character of the organisation?
Broadly speaking there are four ways of using a case study. A single case study deals
with one instance only. In this instance, the case has to be representative of some-
thing, for example the typical, the extreme, the modern, the old fashioned and so on.
We could, for instance, make a case study of a failing school. Multiple case studies can
be designed to be repetitive (that is, to test the findings of earlier cases, to compare
a failing school at a later point in time) or comparative (in which case the character-
istics of the cases are deliberately and knowingly varied in order to assess the signifi-
cance of the differences). It is also possible to differentiate between the holistic case
study (a whole organisation study designed not only to focus on the research issue but
also to appreciate the organisational culture) and an embedded study where details of
the organisation (for example, a department) are considered in detail.
Case studies can be excellent at giving researchers a rich understanding of a situation
but they are not unproblematic.
t There may be a problem of getting at the truth. If the actions we are concerned with
took place a long time ago people may forget. More usually they may reconstruct
their interpretation to position themselves in a better light. They may be economical
with the truth and not actually tell you a lie but not reveal the whole circumstance
either. These issues become significant if we interview those who were responsible for
decisions. Few people are so self-confident that they will reveal their failings. Most
senior people are concerned with how their legacy will be read. If people have lost
their jobs, they are likely to place blame for failure elsewhere. This is a difficult issue
that is resolved only by digging for more information and by giving an individual’s
response to others involved at the time and presenting the individual with their re-
sponses. Ultimately, however, we have to make a decision about whom to believe.
t The second problem with a case study is that we can get it horribly wrong. If we start
off on a track seeking to show something, it can lead us to wrong conclusions. The re-
search world will tell us when we are wrong but it is best to cultivate a neutral attitude.
t What are the limits of a case study? In the real world, one thing leads to another. At
what point are we dealing with another issue? The answers to this problem are either
to set the limits at the beginning (time period, to whom we should speak, conse-
quential issues to be covered) or to keep going back to the main problem we want to
understand each time a new dimension shows itself and to ask whether finding out
more really benefits our analysis.
t Lastly, case studies that emphasise uniqueness are not really all that helpful. What is
valuable about a case study is what can be transferred to other situations such as other
organisations that face or might face similar problems to those in the case study or to
those performing a similar role in an organisation. The case study should always seek
to identify the learning that can be transferred.
(b) Evaluation
Evaluation is concerned with whether a process or activity delivered the outputs or out-
comes that were expected of it. It is extensively used, from the large scale (such as the
environmental impact of construction projects for example, the Channel Tunnel), to the
Chapter 2 UNDERSTANDING THE RESEARCH PROCESS 57
Goals
Focus
Objectives Processes Outcomes Sustainability
Types
Goal orientated
Summative
Decision orientated
Formative
Conflict orientated
small scale (the adoption of a new teaching approach). It is used in many sectors includ-
ing infrastructure development, finance, regeneration and development as well as educa-
tion. The one common element to all is the degree of focus of the research. The concern
of evaluation is to look at something specific such as project outcomes, how decisions
were made, or a failure in procedure. While many evaluations are concerned with a pro-
ject, this is not necessarily the case. The easiest way to understand evaluation is not to
look at the areas where it has been applied but to understand the types of goal that can be
set for it. There are four (see Figure 2.2), and they are not necessarily mutually exclusive.
The first level of decision that we have to take when setting up an evaluation is to
determine what our goal is.
t At the most general level an evaluation can be conducted to understand what is hap-
pening. Why did things turn out differently from what we expected? Why did we
not foresee these outcomes and consequences? Why does this system actually work?
The issue to be resolved is ‘not knowing’. Answering these questions could involve
anything from a detailed description of processes and decisions (such as you would
get in a case study) to an experiment conducted under scientific principles and any
combination of methods in between.
t The second task is quite basic; it is to ensure compliance. Organisations have to com-
ply with regulations and implement policy directives that arise from higher levels of
authority. Educational institutions in most countries are inspected to ensure compli-
ance with the requirements of state funding. Projects are evaluated to reassure those
58 Research Methods for Education Part 1 The Context for Your Research
who finance them that the claimed outputs actually exist and the money has been
spent appropriately. The evaluation process in these cases seeks to produce incontro-
vertible evidence that things are as they should be. Evaluation in this instance is an
exercise in accountability.
t The third role that evaluation can perform is to improve organisational management
and practice. In this type of evaluation, the evaluator looks at the understanding and
appreciation of what is happening in context. This context might be anything the
evaluator knows about from organisational and management theory to examples
and instances that constitute experience in a field. In some instances the evaluation is
comparative, when a process within an organisation is compared with what happens in
other organisations. This approach is used by the UK Audit Commission and is referred
to as benchmarking. (Note, the Government announced plans to contract out the audit
work to other bodies from October 2012, leaving the Audit Commission to manage the
contract process.) Figure 2.3 sets out the procedure followed by the Commission. The
key stage is that of identifying ‘partners’. The Audit Commission calls them partners
because of the context of its evaluations, which is to compare, as far as is possible,
like with like. What it has done is create groups of organisations (such as education
authorities) that are similar in terms of their socio-economic and demographic
characteristics. With this model, inner urban areas are compared with inner urban areas,
outer ones with outer, rural areas with rural areas and so on. Statistics are collected for
every aspect of performance. At a higher operational level there are regional partnerships
that benchmark practice in managing special educational needs (SEN) in the UK. In
the Netherlands, the Government has set up a benchmark project to compare aspects
of the Dutch education system with what happens in a select group of other countries,
Germany, Flanders in Belgium, the UK, Sweden, Denmark, Finland and Norway.
t Finally, evaluation can be a tool for policy formulation and shaping. When used to do
this it operates at two levels, first to translate experience from a test bed to a regional
or national scale and second, to learn from what happens in other countries. Gov-
ernments will try out new ways of doing things on a limited project basis in order to
test whether they work. If they are not successful, they can be modified to improve
performance but if this does not work, then failure is not expensive. Learning from
other countries is almost the same as benchmarking. The difference, and it is rela-
tively minor, lies in the fact that benchmarking is a continuous activity, while evalu-
ation of practice elsewhere in order to determine policy actions is more of a one-off
activity. This process has been extensive in education in recent years. For example,
quality processes developed in the UK are now being developed by countries across
the world. The Australian experiment in students paying for tuition has been adopted
in New Zealand and the UK and will inevitably sweep through the developed world.
Degree level education throughout Europe is being reformulated on a three-year
undergraduate/two-year postgraduate framework.
The second decision that we take in an evaluation concerns its focus. Again there are
four and, again, they are not mutually exclusive.
Chapter 2 UNDERSTANDING THE RESEARCH PROCESS 59
t First, we could be concerned with the objectives or goals of the project or organisation
or process we are investigating. If we are concerned with objectives, then the under-
pinning issue is one of relevance and we seek to determine whether the outcomes are
relevant to the current situation and, by implication, whether the objectives continue
to be meaningful. After an influx of refugees a school might, for example, employ
support teachers to help non-native speakers cope with lessons. Each year it could
evaluate the benefits of its investment and, if it were to find that students’ capability
in the language of tuition had improved, it might conclude that the benefits of its
programme were outweighed by the costs.
t Second, we might be concerned with processes, in which case the underlying issues
are ones of efficiency. Are the processes producing either the quantity or quality of
outcomes that are warranted in terms of the inputs? Is this organisation efficient? Is
this school, set in a middle class catchment area, offering sufficient value added to its
pupils in terms of their leaving qualifications? Is the investment in this new system
of private finance for public services giving us value for money? Or, in the case of our
example of employing support teachers, are they producing the benefits expected of
them and, if not, what might be changed?
t Third, the focus might be on outcomes, in which case we are asking questions about
effectiveness. Are the outcomes, both expected and unforeseen, delivering what we
wanted to happen? Are the anti-bullying policy and associated measures actually cre-
ating a safer environment? Is the action of placing police in schools leading to a
reduction in anti-social behaviour?
t Finally, our focus might be on sustainability. Here there is a range of questions that
might be asked. Is the initiative likely to continue beyond a period of start up fund-
ing? Is there a champion who will support it? Does it continue to meet the needs of
the community? Does it have a continuing utility? Will we know when it ceases to be
useful?
The final decision that we have to take concerns the type of evaluation we wish to
set up. There are two ways of looking at this and they can be combined. First, an evalu-
ation can be summative or formative. Summative evaluations occur at the end of some-
thing, such as a project, and are concerned with whether what was supposed to happen
actually did happen. Formative evaluations occur at one or more stages during a project
or programme and have the objective either of ensuring that the project is keeping to a
planned schedule or of improving performance if it does not meet expectations. Second,
an evaluation can be goal orientated, decision orientated or conflict orientated.
t Goal orientated evaluation is concerned with outcomes but, because outcomes are a
consequence of the whole implementation process, from the goals that were estab-
lished, through the resource inputs that were applied and the decisions that were
taken to the management actions that implemented those decisions, a goal orien-
tation must consider the whole project or programme. It has to consider not only
whether goals were met (and if not, why not) but also whether the right goals were
specified (and if not, why not). We could, for example, evaluate a school’s attempt
to stamp out disruptive classroom behaviour with a strong discipline policy. We can
collect data to judge how effective it was but we ought also to consider why students
were disruptive and whether curriculum change might not have been a better strategy.
t A decision making focus looks at how decisions were made and the context in which
they were made, from the resources available to be used to the management actions.
A decision making focus might also investigate whether decisions were reviewed and
60 Research Methods for Education Part 1 The Context for Your Research
what the consequences of review were. In our example we would want to understand
why the senior management in a school chose discipline as the means of managing
behaviour.
t A conflict orientation investigates how conflicts were resolved within a project, from
decisions about goals, objectives and priorities to the decision making embedded in
project delivery. How far one takes the analysis through into the delivery phase is
dependent upon whether conflicts at earlier stages were resolved or whether points
of view were so well established in the management framework that conflicts contin-
ued. A conflict analysis is usually undertaken when a project has failed or is under-
performing. The governors of our school with the discipline problems might review
the implementation of the programme if behaviour did not improve and exclusions
increased. An evaluation might find that the senior management team had been di-
vided over which direction to take to solve their problem and that half were vehe-
mently opposed to using policing and punishment strategy for improving behaviour.
What issues do we need to be aware of if we are considering using an evaluation meth-
odology? Three stand out.
t First, evaluation is presented as being detached from delivery, systematic in its work,
objective in its outlook with the evaluator independent of what is being evaluated.
It clearly believes that it draws on the principles of scientific research for its inspira-
tion. The reality, however, is that evaluation is a bit like constructing a case for the
prosecution or the defence in law. You assemble the evidence and present it in a way
that argues your case in the most cogent and persuasive way possible. The scientific
approach lets the evidence speak for itself. An evaluator brings the evidence together
and creates a perspective based on perceived insights that reconfigure the evidence to
confirm those insights. The insights need not be wrong but, equally, they need not
be correct. If the evaluation is internal to an activity, then there is potential if not for
bias, then at least accentuation of the positive.
t Second, evaluation should take account of multiple perspectives, those who might be
funding a project, stakeholders who might be involved in the delivery of a project, the
beneficiaries of the project outcomes and the community in general. The balance of
these can affect the character of the evaluation. To return to our example of a school’s
decision to institute a strict disciplinary regime, evaluation might show that pupils
may not give it wholehearted approval, educational authorities may not either if it
results in more pupils being excluded from schools but teachers and the community
may think it an excellent idea.
t Finally, the boundaries of an evaluation must be clearly established at the outset. Be-
cause evaluations can follow what seem to be interesting lines of enquiry, it is quite
easy for them to stray outside their remit.
(c) Ethnography
Ethnography comes from two Greek words that mean ‘people’ or ‘nation’ and ‘writing’,
so ethnography is writing about people. In an educational context we could use an
ethnographic approach to ‘write about’ the culture of a school staff room and the
relationships within it. We can observe what happens there and even ask questions, both
informally and formally through interviews. The writing element describes a process that
leads to analysis, synthesis and conclusions. It may seem descriptive but the writing leads
to the identification of structures and processes. In our staff room we may find that one
Chapter 2 UNDERSTANDING THE RESEARCH PROCESS 61
Research issue
Data collection
(Chapters 8 and 9)
Conclusion
place is recognised as being that of a particular teacher or that the chemistry department
always sits together but the English department never does. The evidence is assembled
to create an argued interpretation. Speculation can play an element in constructing a
possible explanation and further research can seek the evidence on which to accept or
reject it. The people involved in an ethnographic study can be any group from a formal
structure such as an organisation to an informal group such as a gang. Ethnography is
well established as a method in education research and is used to understand such things
as classroom behaviour, the learning process, group values and behaviour, organisational
management and change and so on.
The stages in an ethnographic study are shown in Figure 2.4. Once the issue to be
researched has been identified, the first stage is to find where we can study it. In the case
of our school staff room, we have to find a head teacher and a group of staff that will
agree to our being with them. The selection of the study environment must be done with
care. It should be representative of both the issue and the context in which the issue is
normally found. As we shall see in Chapter 6, identifying our study location is a matter of
sampling and the right location is important if the results of the investigation are to have
any wider implications. If the context is atypical, it could still be used for an investigation
but the researcher has to then make a clear judgement of those conclusions that might
and those that might not be applicable to other circumstances. The process of data col-
lection uses standard methods of observation, conversation, interviews, questionnaires
and others that we will meet in Chapters 8 and 9. What makes a study ethnographic is
that the researcher spends a long and intensive period in the study environment. It is this
that produces the insight to make sense of the data. Analysis consists of the assembly of
evidence to identify themes. At the heart of this process is the categorisation of research
data and the linking up of the categories by the researcher. At this stage the researcher
often makes use of background knowledge and understanding to inform the selection of
62 Research Methods for Education Part 1 The Context for Your Research
themes. But not everything may be resolved at this stage. Analysis may raise new ques-
tions that will require further data collection, so there can be a significant feedback loop
at this stage. This should be built into our research schedule. The final stage is to write up
a report setting out and arguing for our conclusions.
The centrality of the researcher to the process is emphasised in the sifting and sorting
of the data to identify how it best fits together. There are techniques that can be used,
for example classification and coding (see Chapter 11), but the structures that these
represent arise from the researcher’s assessment of the evidence. So the researcher is the
point of synthesis and the synthesis is influenced not just by the data that is collected but
also by what the researcher knows from other similar or related studies. In addition, of
course, any researcher is likely to be influenced by the existence of:
t social and family structures;
t concepts such as respect, dominance and equality that frame relationships; and
t ideas and interpretations developed by other researchers, for example territoriality
(the attachment to a location or the defence of a location) and identity (individual or
group).
Within the ethnographic approach the objective is not to change the behaviour of
individuals or groups but to act as an observer of that behaviour. This is more difficult
than it may seem. If our observer status is kept hidden from the study group, then our
position becomes problematic if we do not behave in ways that the group expects. Often,
the option of being a ‘hidden’ researcher is not open to us. For example, as a 20-year-old
it would be difficult not to stand out in a staff room. Being a ‘hidden’ researcher is likely
to be a better option in studies of organisations because we can take on a formal role and
do a normal job. Many other researchers use the term ‘participant observer’ to describe a
‘hidden researcher’. It matters little which term we use, though ‘participant’ does have the
sense of being fully involved in the interactions that are being studied.
If we are not a ‘hidden’ researcher, then we are visible to the group and our researcher
status known to the members. (The visible researcher is also described as a ‘non-
participant observer’.) In this situation, the researcher has to gain the group’s agreement
for the research to take place and for the group to be observed. This can affect how some
group members behave. This may be a particular issue at the start of the research pro-
gramme but it is likely to wane as the observer becomes just part of the wallpaper. There
is another problem, as well, in being visible – you can be asked to intervene at an indi-
vidual or collective level. Imagine that you are observing a class of infants. At that age they
certainly will not understand your role as a researcher, you are just another teacher. What
do you do when a pupil asks you for help (because once you have helped, there is no
going back)? What is the answer? I suspect that most people would help. Does it matter?
If we adopt an ethnographic approach we should be aware of some potential pitfalls.
t The issue of influencing the behaviour of those you are studying has already been
mentioned. Once people become aware that they are being observed, they may
change how they behave and how they represent themselves to the world. A bully
might stop bullying, teachers might express agreement with the changes being intro-
duced by the head teacher, young people might act with more bravado or insolence
than would be normal. If we suspect this might be the case, what can be done to
improve the situation? The answer is to stay longer, become part of the environment
and hope that your subjects’ normal behaviour reasserts itself.
t Second, there is a real issue of researcher misinterpretation. The reflective approach is
so central to the analysis that one wrong inference can lead to erroneous conclusions.
Chapter 2 UNDERSTANDING THE RESEARCH PROCESS 63
This is difficult to guard against. That is why research is published, so others can see
how conclusions are reached and, if necessary, criticise them.
t Third, the position of the researcher as an insider can raise ethical issues. What hap-
pens if we spot something that we think is wrong? If we suspect child abuse, the
chances are that we will act. If we suspect that a teacher is altering the grades that
children are given in tests that reflect their attainment over the past year, what should
we do then? Remember that it is our work that will be affected! With an ethnographic
approach we may find that we get hold of privileged information. Can we use this in
our research – and publish it? Read section 2.1.(iii) on principles again.
t Fourth, there is a significant potential for researchers to identify with their subjects in
some way. For example, if we are studying why a school has low attainment standards
and much of our research has been looking at how teachers cope with a significant
annual intake of students whose first language is not the language of tuition, we
might be tempted to write a report that was sympathetic to the teacher’s situation
rather than one that highlighted a collective failure in identifying methods used suc-
cessfully elsewhere to achieve better results. The researcher is so important to ethnog-
raphy that critical distance is vital if the results are to be credible.
t Finally, ethnographic research can become just a description rather than the insight
into something of a more general nature. As a research approach it seeks to provide
the research community with generalisations – ideas and concepts that are applicable
to other situations. If we adopt an ethnographic approach, we should make sure that
our study talks to a wider set of circumstances. A study of a breakdown in leadership
in one college should have something to say about preventative measures for most
other colleges, and probably schools as well. With an ethnographic approach, the
researcher is immersed in the detail of the study but always finishes by looking at an
issue with a wider perspective.
Reviewand
reflect
Action
in that the knowledge they produce is usually passed out of the research environment
to someone else to consider and act upon. With action research, the action is part of the
research process. This gives rise to a cyclical procedure in which research generates action
and action becomes the evidence base for further research. This is shown in Figure 2.5.
Let us start with the issue that is at the centre of the investigation. This is placed at
the centre of Figure 2.5. In an attempt to resolve the issue (which might be that very few
students choose to study a statistics option in a research methods course), those involved
in the action research programme develop an action plan. It would be appropriate in the
first cycle to put a process of assessment, analysis and reflection between the issue and
the action plan because it is out of this initial consideration that the first action plan
arises. In our case, this could be a meeting of the course team members to discuss the
problem and decide what they can do about it. The implementation of the plan gives
rise to the next stage, the actions themselves. These actions are different from what went
on before and the difference constitutes the driver for change and the resolution of the
issue. If the course team believe that students do not take the option because they are not
confident in their numeracy, they might develop numerical problem solving activities to
give them confidence. The actions are monitored and observed through data collection
procedures that were determined at the initial planning stage. In reality, of course, obser-
vation does not follow action, it runs in parallel with it (or just behind it – if it is too far
behind, then memory can play funny tricks with recall). In our case, we will have student
test data and assessments of self-confidence with numbers. Finally the data is brought
together and assessed in the stage called review and reflection. The formal processes of
data analysis and reporting are determined at the planning stage. These analyses, plus the
experiences of those involved, become the evidence to judge what has been achieved. A
crucial part of this process is to ask questions: Why did this work well and that not? Why
was there a hiccup at that stage? Was everyone pulling their weight? How far have we re-
solved the issue? It is this question that triggers the next cycle. What can we do to resolve
the issue or get closer to our goal next time? This is the second plan.
The issues that action research can address are shown in Figure 2.6. Three purposes or
types of task are identified. Each of these is particularly well suited to a target environ-
ment.
t Organisational change may require a change in goals, practices or procedures. Achieving
this, however, may require deeper change in outlooks, aspirations and attitudes. Because
of action research’s power in achieving organisational change, it is used extensively in
Chapter 2 UNDERSTANDING THE RESEARCH PROCESS 65
Purpose
Distinctiveness Researcher focus Learn from the Question focus Change focus
particular
Purpose Understand, Explain, explore, Understand, test Change, improve,
explain describe compliance, improve, build, develop
inform
Researcher Hidden or visible External analyst Outside the process Inside the process
status participant but may be internal to and the project
project
Methods Observation, Information assembly, Any Reflection
conversation interview
Limitations t Influencing t Getting at the truth t Not detached t Lack of rigour
and issues behaviour can be difficult t Limited perspectives t Not detached
t Misinterpretation t Wrong interpretation t Setting boundary t Group decisions
of evidence t Can it be generalised? to study about actions can
t Ethical issues be compromises
t Not detached
t Lack of
generalisation
You should not, however, imagine that these methodologies are your only options.
You will meet other approaches including experimentation in Chapter 3 (considered
there because they are better understood in terms of the issues discussed in the chap-
ter). Researching is a dynamic field that changes as researchers address new condi-
tions and circumstances. For example, we are beginning to consider the principles
that might underpin Web-based research (there is some further reading on this at the
end of the text). It is possible for us to create new frameworks that combine meth-
ods and develop new ones and which will lead us via a new path through to an end
point. We should bear in mind, as well, that there is a growing acceptance of mixed
methodologies (see Chapter 3). Pragmatism in research is becoming acceptable. What
we have to do, as researchers, is present a strong case for tackling the issue in the way
that we have.
Table 2.5 The link between research question and research approach
research (in the sense that we have no options but to do it only one way), just that it
predisposes us to look in certain directions at particular methodologies.
Table 2.5 summarises the types of research question that can be generated and the
research approach that often follows. The intention of Table 2.5 is not only to show the
association between our research question and the way we do our research but also that
in every case there is more than one approach that we can adopt. Each of the research
approaches has a theory of how it should be applied.
policy, then there is little to choose between evaluation and policy research. One thing
that should distinguish evaluation from other approaches is the fact that it should iden-
tify unintended outcomes as well as intended, unforeseen as well as foreseen and nega-
tive consequences as well as positive. (We shall look at evaluation again in Chapter 14.)
Questions about impact can, however, have a narrower focus than the ones we have just
considered. What is the effect of food type at lunchtime on pupil behaviour and perfor-
mance in the afternoon? What is the effect of class size on pupil performance? What is the ef-
fect of ambient temperature on pupil performance? Such questions as these may be capable
of being answered by a causal analysis using an experimental or quasi-experimental design
(see Chapter 3). With this the subjects are allocated to groups experiencing or not experi-
encing an intervention either in ways that seek to minimise the differences or in ways that
randomise factors that might interfere with a direct causal relationship. This approach seeks
to replicate the type of design found, for example, in medical research. It is, however, dif-
ficult to accomplish because the number of variables that can influence the research result is
very large and the more that are uncontrolled the greater are the problems of interpretation.
In section 2.1 we explored issues that we have to take a stand on and which influence
how we work. In this section we shall look at different issues on which we have to take
decisions and which will influence what goals we set for our research and what outcomes
we seek to attain. Figure 2.7 shows the two principal issues that we have to consider, edu-
cation theory and models. Both of these can influence our research goals.
Education theory
Goals and
Processes Issues
outcomes
Models
theory deals with the subject matter of education – child development, learning, curricu-
lum design and so on. Research theory, which may influence the process we adopt, is a
specification of procedure. Education theory derives from conjecture about what might
(or should) be the case or from evidence drawn from the real world. Research theory
is based on either a set of premises and consequences derived logically from them (the
quantitative paradigm) or what researchers accept (the qualitative paradigm). Education
theory is (or aims to be) generalisable, whereas research theory is specific to a particular
type of problem or approach. Education theory can be tested, while research theory can
only be benchmarked against the principles on which it is based. In summary, education
theory shapes our understanding and gives us choice when making educational deci-
sions. Research theory is a rule book whose legitimacy stems from principles accepted by
the academic community and whose coherence owes as much to custom and practice as
to any overarching theorisation.
Now let us look in more detail about how theory is used in the field of education. Just
what is a theory? It is a generalisation that should apply in all or most cases (which is
how it tends to be used in science) or in some cases or particular circumstances (which
is how it is often used in the social sciences). A theory seeks to explain things in terms
of cause and effect and with varying degrees of certainty. A good theory is predictive.
Education theory falls into two types (there are other classifications with more than two
categories but remember this is a book on research procedure and there is no time to go
into detail on this issue).
quality of the word (semantic) map that each of us has and his theory argues that the
semantic map is both the product of learning and the means of enabling more learning
to take place.
Human capital theory is another explanatory theory. It sees investment in intellectual
and practical skill development as the basis of economic growth and economic suc-
cess. It is the theory that underpins investment in numeracy and literacy at pre-school
and primary levels, in the development of generic skills (for instance, problem solving
and group work) at secondary and higher education and investment in lifelong learn-
ing to enable people to retrain and find employment with the skills needed by modern
economies. We look for evidence of the effectiveness of human capital development in
international surveys of student performance.
Education theory performs a vital role; it is a framework for advancing our under-
standing of the subject. It shapes our thinking about the desirability of outcomes; it
enables us to understand the processes at work and it helps us assess the way in which
the real world works. Education theory can be constructed at a broad social scale or in the
context of a specific idea. Professor Dickson Mungazi (who died in 2008) has reviewed
the evolution of education theory in the USA, from the period of the Founding Fathers
to the late twentieth century (Mungazi, 1999). He shows how the first influences were
philosophical, including John Locke’s opposition to authoritarianism, the rationalism
and liberalism of the Enlightenment and the differing religious convictions of the early
settlers. These shaped a new country that espoused a democratic ideal and separated
church and state. Later he points out the role of the education system in meeting national
goals, such as the development of a more equal society. Mungazi’s thesis is that educa-
tional theory is intimately associated with the evolution of society, both as a driver and
reflector of change. As he says, ‘Any change in society and the values of society itself must
be reflected in the educational process’ (Mungazi, 1999, p. 210).
At a specific level, education theory seeks to frame our understanding of aspects of
education. Educationalists draw on psychological theory to understand how education
should match a child’s development. Psychology too underpins our attempts to create
learning theory. The theory of intelligence rests on psychology, though our understand-
ing of multiple skills and capability draws on a broader appreciation of personality. We
have theories of curriculum and behaviour management, theory that seeks to link peda-
gogy with specific educational outcomes, theories that define a learning process (such
as experiential learning), theories of instructional design that guide authors of on-line
learning materials.
So how can education theory affect research? In two ways, it can either be the goal of
research or we can seek to develop theory through research.
(2006) Learning Styles Helper’s Guide and their Learning Styles Questionnaire) and give
it to the children with whom we are concerned. We should be aware though that the
results would require some considerable insight on our part to assess that the theory was
transferable outside its usual domain of application (the boundaries of what might be
considered normal). It would, however, be an interesting and perhaps significant piece
of research. Look at Case Study 2.2 for an example of theory testing that has actually
taken place.
another, such a study would require a significant amount of resource (and educational
research is not especially well endowed). So most research aiming to build theory pro-
ceeds step by step, considering one influence and then another. At each stage a little bit
of understanding is gained and pieced together with what others have found. In this way
a theory takes shape. Theory building in education research is, more often than not, a
collective process in which researchers in different institutions and different countries
pool their research findings in research literature. This is one of the reasons why a litera-
ture review is such an important part of any research project. Case Study 2.3 looks at the
topic of teacher burnout and shows how our understanding changes over time.
based on theoretical intuition without empirical evidence, in which case we call them
normative models. They can also be constructed from empirical evidence. Summarising
the concepts of theory and model:
t both can be developed from logical deduction and both can be created from empiri-
cal study;
t both relate reality to abstract thinking about reality.
In view of this summary you might well ask, ‘Why do we need them both?’ Theory
is valuable because it can be the starting point of an investigation and it organises our
understanding of a situation. This need not be an empirical situation, though in the
case of education, it usually is. Models are valuable because they can abstract from and
visually represent our understanding of complex situations, key players and processes in
education activity and they can be a basis for testing our understanding of reality, for ex-
ample, do the relationships that were found in one situation reappear in another? Mark
Warford used a model as a template to research the diffusion of an educational innova-
tion (Warford, 2005). Innovation diffusion is a well understood process but not in edu-
cation. People who adopt new ideas can be categorised into innovators, early adopters,
early majority, late majority and laggards. The numbers in each category are small at first,
rise to a peak in early and late majority and fall off thereafter. Warford used this model
to see if it applied to an educational innovation, new guidelines on foreign language
teaching. His findings are interesting in that they indicate how an innovation diffusion
process can stall. Two points stand out. First, although this was an educational policy
initiative, it was not taken up. Second, those who had adopted the innovation reported
that local education administrations were barriers to change. This is interesting research
given the pressures facing teachers to change to meet new demands.
Models usually show a link between one event and another. The implication is that
the link is causal in nature. You might, for example, develop a model to show what in-
fluences children’s selection of meals in the school or college cafeteria. Your model can
take the form of a statement or a diagram or it can be expressed in symbolic terms (see
Figure 2.8). Models can be used in all research approaches, in conjunction with all meth-
odologies and at various stages in the research process.
t We can use them to organise our understanding of previous studies and to highlight
what we consider to be the key relationships. This type of model helps us shape our
research programme and it may be something we seek to verify through our research.
t We can develop one to shape and represent our thoughts without reference to previ-
ous studies (much as we did for Figure 2.8). We can move directly to find the evidence
for our model or we could use it as a benchmark for other studies.
t Finally we can use a model to represent our research findings.
If we are beginning to see some value models in research, our next question, perhaps,
ought to be, ‘How do I go about building one?’ There are whole books on the proced-
ures such as simulation, soft-systems modelling, probabilistic modelling and so on, so
what we say here only touches on the topic. In broad terms we can represent reality as
mathematical symbols or relationships, or as formal objects (such as people and or-
ganisations) and their relationships in a diagram or in words (look at Figure 2.8 again).
Models can be representations of reality (the school meal example in Figure 2.8) or they
can be as mental constructs, representations of what people think the world is like. These
are called mental models and many psychologists accept that they determine the way in
which we interact with the ‘real’ world. We can move into mathematical modelling using
78 Research Methods for Education Part 1 The Context for Your Research
Exposure to
healthyeating Children's
Socio-economic preferences
status Type of food
eaten at home
Amount
Children's food
available to
selection
spend
Constraints Constraints
Awareness of
placed on placed on choice
healthyeating
choice by parents by school
some of the statistical processes we shall meet in Chapters 12 and 13. Chapter 7 deals,
in part, with what we call ‘variables’, which can be represented as formal objects and, if
capable of measurement, can be translated into a mathematical model. Once we start
identifying the variables that exist in our research problem and think about how they
might interact in the real or imagined world, we have started the process of modelling.
Chapter 2 UNDERSTANDING THE RESEARCH PROCESS 79
Research agenda, research issue, research problem and research question – you could
be forgiven for thinking that these terms are all alternatives for saying the same thing.
Many writers use them interchangeably, indeed I suspect that I have been a little too
‘flexible’ on occasion in the way I have used them in this text! However, there are dif-
ferences between them that relate to different stages in your thinking about research.
The research agenda is what the research community (including those who sponsor or
pay for research) think are the issues that it is worthwhile investigating. Since funding
is important for so many researchers these days, it is important that they are aware of
other people’s priorities. Activity 2.4 is an opportunity to become aware of some research
priorities worldwide. We can, of course, have our own research agenda which indicates
what we will research and when. In all likelihood, however, it will fit in with topics being
investigated by others and which will be part of a bigger picture of research.
A research issue is a theme from the research agenda. It is a topic or area where there
is potential for research and, usually, benefit in undertaking the research. To all intents
and purposes the term can be used interchangeably with ‘research problem’ and this
term itself is frequently used as an alternative to ‘research question’. It is, however, more
accurate to see a research problem as having a tighter, more focused specification than a re-
search issue. Thus disaffection might be a research issue and the research problem could
be to define the nature of any link between youth culture and motivation for education.
Research Research
agenda problem
Research
question
While the idea of a research problem can ‘shade’ from one term to the other, there is
more of a difference between research issue and research question. The former is more gen-
eral and descriptive of a field and the latter is specific, identifying the particular question
that the research process is meant to answer. To follow through the disaffection example,
a research question might be, ‘Can computer gaming be used to attract boys back into
learning?’ Sometimes it is difficult to focus the research on a particular question and that
is here we can usefully use the term ‘research problem’.
No one is likely to be too critical if, on occasion, you are a little loose with your ter-
minology in this area. However, it should be apparent that the terms ideally relate to a
progressive focusing of our research from the identification of a broad area of interest or
concern to a question that can be answered.
Now that these terms are a little clearer, how do they relate to each other so that we
finish up with a focus and direction for our research? Figure 2.9 has some of the answers.
Research takes off when the research question is sharply defined both as text and in the
researcher’s mind. At this point we may not know how we are going to go about the
research but we are clear on the dimensions that need to be investigated and the factors
that can influence the outcome. Getting to this point is not necessarily easy and can take
time. Indeed, it may even be necessary to undertake some exploratory data collection and
analysis to make sure that there is an issue to be investigated. The purpose of this next sec-
tion is to show how research themes arise and what can influence their selection. Our task
is to specify a research question. Most people get to this point by moving from the general
to the specific, from a general understanding of a topic to a specific research question.
ideas now circulate internationally very quickly and we may get a better understanding of
the up and coming agenda. Activity 2.4 will help us build up a picture of principal issues in
several countries. Staying abreast of the international research agenda is something that we
should do on a regular basis. Remember, the articles that are published in journals may relate
to research that was completed 18 months or so ago and that was started usually at least a year
before that. In other words, journals reflect an agenda that is two or three years old.
1. Identify your interests worked and what didn’t and ask if you know
why.
t Create a table to show the following types
t Think about the teachers you have had at
of educational issue: education policy, educa-
school or college, what ‘according to you’ dif-
tion management and change, whole school
ferentiated the best from the worst?
issues, curriculum issues, classroom practice,
t Generate another scenario where you can
society and education. If you want to use your
exercise your judgement of your experiences.
own list, go ahead.
t For each category, put in three examples and
rank them from most interesting to least 3. Take up an issue
interesting.
t Go to a daily newspaper and mark up every
t Compare the ‘most interesting’ in the first
article or report concerned with education.
category with the second, judge which one
Identify any contentious statements and ask
you would prefer to work in and then con-
yourself, ‘Would I be interested in researching
tinue with all the other categories.
it?’
t Could you find something interesting to do
t Go to a weekly newspaper for teachers (for
with the last one?
example, The Times Educational Supplement
or The Times Higher Education Supplement in
the UK or Education Week, Teacher Magazine
2. Review your experience or The Chronicle of Higher Education in the
t Look back over the last year of your higher USA) and make a list of issues that are con-
education. List the topics that captured your tested by groups with different viewpoints or
imagination and those that bored you. problems and concerns that are reported. Do
t If you have worked in education, for example, a paired comparison in order to identify the
as a teacher or administrator, identify what one that interests you most.
research ideas? People will have their own ideas about what to do, which will be just
as valid as what is suggested here. What is important is that the ball starts to roll and a
momentum is created that leads inexorably to the statement of the research question.
Three approaches that could start the process are: exploring our interests, reviewing our
experiences and latching onto issues (see Activity 2.5). Even if you have identified a re-
search field, do the start-up activities to see what other areas emerge. You never know,
you might change, or you could generate topics for a later project.
value to you. Scoping will help with this. It uses many of the same approaches as literature
review but its purpose is different. It should answer questions that will help us decide wheth-
er to go forward with research and, if so, how best to do so. One thing we should remember
is that answering the questions that will take us forward is no simple matter. Once we have
found some information that seems to give us an answer to a particular question, we may
have to go back and look again at an earlier question that we thought we had answered.
There are two dimensions to feasibility that we have to consider, first whether some-
thing is worth doing and, second, whether it can be done. The first task in scoping the
research field is to find out if the research that is beginning to take shape in our mind has
ever been done before. If it has, if the results are generally accepted and non-contentious
(which may be a lot to ask in education) then there is probably not a lot of point in repeat-
ing the study. There are, however, exceptions to this. If a previous study was selective in
some way then another may be useful. For example, if it was concerned only with selective
schools, there is likely to be some value in replicating it in non-selective or if the research
was limited to an area, there would be merit in a confirmatory study based on another. We
can, with some advantage, turn this process round and look to see what research has been
done in another country such as the USA or Australia. The USA in particular has led with
many education innovations and changes and the question, ‘Will it work/has it worked
here?’ is one that is likely to be fruitful. Even if someone appears to have tackled the issue
in which we are interested before, all may not be lost. We need to look at the research and
ask whether every possible factor that could influence the results has been covered. We
shall look at these factors again in Chapter 7, when we will call them variables. As a start,
we could ask whether published research has taken account of the influence of ethnicity,
of status, of gender, of age, of family background. These are the obvious factors; there
may well be others that are relevant to our field. Another fruitful avenue to follow that
may help us identify a research area is a subject from which ideas can be ‘borrowed’ and
applied to education. We should not consider this as in any way being of doubtful legiti-
macy. Education is an open intellectual area and cross-fertilisation with other areas such as
psychology, business, sociology and, with access to educational opportunity now being so
important in political terms across the world, geography too. Even subfields of education
can be a source for knowledge transfer. For instance, how we think about curriculum de-
sign is probably more sophisticated in e-learning than it is in any other area of pedagogy.
At the heart of the scoping process is access to information that we then use to answer
questions and identify opportunities. If we have not gone into the literature with a
research idea at the back of our mind, just the process of distilling the information may
help us find an answer to the question, ‘What avenues are left unexplored?’ More impor-
tant for the busy researcher is, ‘Where do we find this information?’ Increasingly govern-
ments are funding ‘observatories’ that scope the field for us. Observatories are usually
publicly funded organisations that search and become repositories for publications on
particular issues or themes and are an alternative to the need to search reports, books
and journal articles ourselves. They are useful because they will often present digests
that will help us appreciate the field. Of course, not every field of education is covered.
Table 2.6 indicates areas of education that are covered and gives examples. To access the
websites, just put the terms into a search engine to get to the correct website in one or
two clicks. Most of the observatories (as opposed to databases of research) are located
in EU countries and the observatory concept appears to have been stimulated by EU
funding to cover many areas of research, development and public policy. Originally a
mechanism for collecting, storing and making available digests of information, many are
evolving into ‘think-tanks’ with a view on policy directions. Not every area of interest to
educationalists is covered but diligent use of the observatories can throw up information
84 Research Methods for Education Part 1 The Context for Your Research
Area Examples
classroom but a master’s student should recognise that there is a range of influences
on both teacher and learner and a doctoral student should seek to grapple with at
least several of these.
t Does it allow me to demonstrate my grasp of the field and my intellectual command
of the subject?
t What impact will my research have? If we are replicating someone else’s work, the
impact is low. If we are extending someone’s work then there is greater impact but if
our work is new and innovative we will have the potential to make a real impact.
At the end of the scoping stage we will have taken a decision on whether or not to
go ahead. The chances are that we will have decided to proceed with our research rather
than go back and start again because there is still plenty left to research in education.
Much of the research published in education lacks potential for synergy with other re-
search because it is either individualistic or anachronistic. To be really effective, research
has to link in with what has gone before and highlight what should come after. This is
the type of research approach for a whole subject area that has built success in the natural
and applied sciences and even in areas of social science. It is an approach that gives rise
to synthesis as an accepted research strategy, an approach that is just beginning to appear
in education. For us, however, the next stage is to frame our research question. This is the
topic for the next section.
As we have undoubtedly appreciated from these first two chapters, preparing to get involved
in research is a long process. It only becomes shorter as our experience increases. We will
know that we have arrived on the research scene when other people begin quoting us and
our work is beginning to shape the whole research agenda. Until we reach this stage, our
best course of action is to follow a structured framework out of which our research proposal
and plan will arise. This is what this last part deals with. It is an activity which draws to-
gether key themes from Chapters 1 and 2. We will test our understanding of the following:
t our purpose in conducting research (this is not the research field or the research ques-
tion but what you want to use the research for);
t research terminology;
t research planning, including risk assessment and management;
t ethical issues in research.
The significance of these issues to the research process only becomes apparent when
we start to work with them. It is for this reason that this part has been introduced. It will
help us explain our research ideas to ourselves and it will begin the process of building a
knowledge base to support our research.
our research approach (which may restrict the number of research approaches we use)
and how capable we are in generating data that throws light on the issue (which may
expand the number of research approaches we use). Our judgement is to get the balance
right and choose a topic that gives us opportunity. In making our decision we should
recognise that a decision downstream in the research process is influencing a decision
right at the headwaters of the whole process. This is just another example of the need to
know something before we can fully understand what we have to do.
Do we have an area that we are interested in researching? If not we should plan a
programme of activities that will:
t increase our knowledge of educational issues, especially those that are in the public
arena;
t enhance our contacts with other students and researchers; and
t improve our critical and evaluative capability.
If we are just stuck for some initial ideas, Activity 2.5 could help. In the end the crucial
elements in developing research ideas are a wide knowledge base and a thoughtful and
critical perspective. This is what you have to work at developing.
research army? How many others are researching in this area – and is there room for
one more? We should remember that research is a stepping-stone to authority in the
academic world.
t Does the topic have a political dimension? Is the area one that is contested by educa-
tionalists or political commentators or politicians? If our answer is ‘yes’ to any of these,
what will our position as a researcher be? Do our own values and beliefs predispose
us to one position rather than another? Will this influence our approach, for example,
will we work within a framework for change and, if we do, how will we create a level
of neutrality in our work that ensures that it is acceptable to a wide range of researchers
and that it is capable of influencing people who have not yet taken a position?
t How much do we know about the area? How do we know that our knowledge and
understanding is sufficient to construct a research proposal that takes understanding
further forward rather than replicate existing findings? If we are unsure how to go
about this, we should read section 2.3.(iv) on scoping research again. Having started
to build a picture of the area with references to other people’s research, how should
we organise it? Activity 2.6 will help us. It gives us a framework for analysing mate-
rial to pick out the research perspectives and the research findings. As the volume of
references increases, we may find it easier to use this approach as a first sift and to
connect key themes via a ‘spider diagram’. (If you are unfamiliar with these as a way
of expressing association, just look the term up using a search engine. You will find
that there are even computer programmes to help you.)
The process of identifying the principal themes of t How far does it theorise about the area?
published material is central to research. This activ- t Does it present a model of relationships? Are
ity will help you appreciate the insights you need to these relationships specified before the re-
develop. search or are they an outcome of the research?
1. Make a list of at least ten publications associated This gives you a sign whether the researcher
with your research area that you have read. For is approaching the issues from the perspective
each one, identify the following: of ‘finding patterns’ (an inductive approach) or
testing pre-determined relationships (a deduc-
t The philosophical position it adopts. It could tive approach).
be just one (positivist, humanist, critical theo- t What is the focus of the research that is pre-
rist, postmodern) or it could be an amalgam. sented? For example, what aspect of the topic
t The researcher’s position, whether an out- is put under the microscope? How central do
sider or an insider, neutral or committed. Do you think this is to the area overall?
you get any sense of the researcher’s values or t What are the conclusions? Can you create cat-
their political position? egories to show how the conclusions of the re-
t The type of research question (see Table 2.5). search can be grouped, such as for and against,
t What ethical issues are identified? Can you the significance of social relationships or indi-
suggest others that you would have consid- vidual character.
ered?
t How well does it communicate with the read- 3. Finally, write a review of the research you have
er? Is it accessible to the general public or do analysed, identifying the research approaches
you have to know a lot about the topic in and research issues. Bring out conflicts in inter-
order to understand what it is saying? pretation, preferences for research approach and
ensure that your judgements on these are clearly
2. Now look at the substantive research presented apparent.
in each publication.
88 Research Methods for Education Part 1 The Context for Your Research
We will then be able to say something about what we think the impact of our re-
search will be.
t Specify our research question. Explain why we chose to express the issue in this way.
Why did we reject other formulations of the question? The answers to this might be
related to the process of data collection. After we have thought about this, we may
need to go back and look again at the research question.
Summary
t Research terms can be confusing. While authors have sometimes confused them,
there is a progressive focus in the sequence of research agenda, research issue and
research question.
t How we go about our research is influenced and affected by philosophy, methodol-
ogy, research paradigms, theories, models principles and ethics.
t Research philosophy is something we have to understand in order to (a) appreciate
where other researchers are coming from and (b) guide our own research approach.
The principal philosophies that we should understand are scientism (and the positiv-
ist research approach), phenomenological and existential philosophy (which under-
pin much qualitative research), critical theory (with its concern for repression and
inequality) and postmodernism (which refocuses our attention from big theories to
local circumstances).
t We should know where our research might lead, either (a) to a general understand-
ing and the development of a theory or model (b) to a deeper understanding of a
▲
90 Research Methods for Education Part 1 The Context for Your Research
specific situation, the identification of key influences and even the development of
a model. We should understand the part played by normative theory and explana-
tory theory in our research. We can test both, even if we are new to research, but
any generalisations from our research are likely to be in the form of models and not
theories.
t Ethical considerations should run through our work and not be just something we
deal with at the beginning. We should be guided by principles and it may be a good
idea to state what these are.
t You should now be able to identify a research area, demonstrate your understand-
ing of research in that area and show how your research theme fits in. You are now
in a position to begin to prepare a research proposal. As we go on to look at details
of research methods, you should continue reading in your field in order to deepen
your understanding of the issues.
Further reading
Research and philosophy Oliver, P. (2004) The Student’s Guide to Research Ethics,
Bridges, D. and Smith, R. (2007) Philosophy, Methodology Open University Press.
and Educational Research, Blackwell, Malden, Mass. These two books take different approaches to appreciat-
This edited collection of papers gives substance to many ing the ethical position in research. Israel and Hay are
of the issue considered in this book. Chapter 2 exposes analytical, drawing a distinction between regulatory
misunderstandings about positivism but, in doing so, compliance and ethical conduct. They explore ethical
raises doubts as to its appropriateness. David Bridges’ principles: informed consent, confidentiality, avoiding
own chapter argues that we are in a period of fluidity and harm, doing good, integrity and care. Paul Oliver’s
change as we seek to establish education as a discipline. book is more of a guide, with three useful chapters on
Pring, R. (2004) Philosophy of Educational Research, ethics before research begins, during data collection
Continuum, London. and afterwards.
Richard Pring’s book is an excellent introduction to the On-line research
fractured world of research philosophy. His argument
Exploring On-Line Research, http://www.geog.le.ac.uk/
that the polarisation between positivism and interpre-
ORM/index.htm.
tivism and between quantitative and qualitative posi-
This website is a sound introduction to an evolving
tions fails to recognise the actual variety and complex-
way of researching. It gives advice on ways of collect-
ity of research on the ground is echoed in this book.
ing data, a link to resources that can inform on-line
Ethics in research research and an excellent self-study programme on
Israel, M. and Hay, I. (2006) Research Ethics for Social ethics in on-line research.
Scientists, Sage.
References
Audit Commission (2000) Getting better All the Time: Health And Social Research – “Human Becomings”
Making benchmarking work, The Audit Commission, or “Active Beings”?’ Childhood, 13(1): 29–48.
London. Bartlett, T. and Smallwood, S. (2004) ‘Four Academic
Balen, R., Blyth E., Calabretto, H., Fraser, C., Horrocks, Plagiarists You’ve Never Heard Of: How Many More
C. and Manby, M. (2006) ‘Involving Children In Are Out There’, The Chronicle, 17-12-2004.
Chapter 2 UNDERSTANDING THE RESEARCH PROCESS 91
Belohlav, J.A. (1996) ‘The Evolving Competitive Grant, R. (2005) A Phenomenological Investigation into
Paradigm: Corporate Strategy of the Future is Lecturers’ Understanding of Themselves as Assessors at
Re-inventing Business Itself in Search of Competitive Rhodes University, thesis submitted for the degree of
Essence’, Business Horizons, 39(2), 11–19. PhD, available at http://eprints.ru.ac.za/300/
BERA (1992) ‘BERA Ethical Guidelines’, Research (accessed January 2009).
Intelligence, 43, 19–21. Greaves, N. M., Hill, D. and Maisuria, A. (2007)
Bonawitz, E.B., Griffiths, T. L. and Schulz, L. (2006) ‘Embourgeoisment, Immiseration, Commodification
‘Modelling Cross-Domain Causal Learning in Marxism Revisited: a Critique of Education in
Preschoolers as Bayesian Inference’, Proceedings Capitalist Systems’, Journal for Critical Education
of the Twenty-Eighth Annual Conference of the Policy Studies, 5(1), available at http://www.jceps.co
Cognitive Science Society, Vancouver, available at m/?pageID=article&articleID=83 (accessed January
http://web.mit.edu/liz_b/www/ (accessed February 2009).
2009). Hastings, R. P. and Bham, M.S.C (2003) ‘The Relationship
Brouwers, A. and Tomic, W. (2000) ‘A Longitudinal between Student Behaviour Patterns and Teacher
Study of Teacher Burnout and Perceived Self-efficacy Burnout’, School Psychology International, 24(1),
in Classroom Management’, Teaching and Teacher 115–127.
Education, 16, 239–253. Honey, P. and Mumford, A. (2006) The Learning Styles
Byrne, B.M. (1994) ‘Burnout: Testing for the Validity, Helper’s Guide, Peter Honey Publications Limited,
Replication, and Invariance of Causal Structure Across Maidenhead.
Elementary, Intermediate, and Secondary Teachers’, Horkheimer, Max (1982) Kritische Theorie: Critical Theory:
American Educational Research Journal, 31(3), Selected Essays, translated by Matthew J. O’Connell
645–673. et al., Continuum, New York.
Cunningham, W. G. (1983) ‘Teacher Burnout – Iwanicki, E. F. (1983) ‘Towards Understanding and
Solutions for the 1980s: A Review of the Literature’, Alleviating Teacher Burnout’, Theory into Practice,
The Urban Review, 15(1), 37–51. 22(1), 27–32.
Englund, T. (2006) ‘New Trends in Swedish Educational Kristen, C., Römmer, A., Muller, W. and Kalter, F.
Research’, Scandinavian Journal of Educational Research, (2005) ‘Longitudinal Studies for Education Reports:
September 2006, 50(4), 383–396. European and North American Examples’, Education
Fore, C. (2002) ‘Teacher Burnout in Special Education: Reform, 10, Federal Ministry of Education and
The Causes and the Recommended Solutions’, The Research (BMBF), Bonn and Berlin.
High School Journal, 86(1), 36–44. Macdonald, N. (2001) The Graffiti Subculture: Youth,
Gino, F. and Ariely, D. (2012) ‘The Dark Side Masculinity and Identity in London and New York,
of Creativity: Original Thinkers Can Be More Palgrave, Basingstoke.
Dishonest’, Journal of Personality and Social Psychology, Mallick, M. and McGowan, B. (2007) ‘Issues in Practice
102(3), 445–459. Based Learning in Nursing in the United Kingdom
Goddard, R., O’Brien, P. and Goddard, M. (2006) and the Republic of Ireland: Results from a Multi
‘Work Environment Predictors of Beginning Teacher Professional Scoping Exercise’, Nurse Education Today,
Burnout’, British Educational Research Journal, 32(6), 27, 52–59.
857–874. Mungazi, D. A. (1999) The Evolution of Educational
Gopnik, A., Glymour, C., Sobel, D.M. and Schulz, L. Theory in the United States, Praeger, Westport,
E. (2001) ‘Causal Learning Mechanisms in Very Conn.
Young Children: Two-, Three-, and Four-Year-Olds O’Farrell, C. (1999) ‘Postmodernism for the Uninitiated’,
Infer Causal Relations From Patterns of Variation in Meamore, D., Burnett, B. and O’Brien, P.
and Covariation’, Developmental Psychology, 37(5), (eds), Understanding Education: Contexts and
620–629. Agendas for the New Millennium, Prentice Hall,
Gopnik, A., Glymour, C., Sobel, D. M., Schulz, L. E., Sydney, 11–17.
Kushnir, T. and Danks, D. (2004) ‘A Theory of Causal Peled, E. and Leichtentritt, R. (2002) ‘The Ethics of
Learning in Children: Causal Maps and Bayes Nets’, Qualitative Social Work Research’, Qualitative Social
Psychological Review, 111(1), 3–32. Work, 1(2), 145–169.
92 Research Methods for Education Part 1 The Context for Your Research
Russell, H. (2007) ‘Learning for Being: An Ontological Warford, M. (2005) ‘Testing a Diffusion of Innovations
and Existential Approach’, International Journal of in Education Model (DIEM)’, The Innovation Journal,
Lifelong Education, 26(4), 363–384. 10(3), Article 32, available from http://www.
Schulz, L.E. and Gopnik, A. (2004) ‘Causal Learning innovation.cc/ (accessed January 2009).
Across Domains’, Developmental Psychology, 40(2), West, A., Barham, E. and Hind, A. (2009) Secondary School
162–176. Admissions in England: Policy and Practice, Research and
Trotman, D. (2006) ‘Interpreting Imaginative Lifeworlds: Information on State Education Trust, London.
Phenomenological Approaches in Imagination and Whitaker, K.S. (1966) ‘Exploring Causes of Principal
the Evaluation of Educational Practice’, Qualitative Burnout’, Journal of Educational Administration, 34(1),
Research, 6(2), 245–265. 60–71.
This page intentionally left bank
Chapter contents
Learning themes 95
Introduction 96
Summary 139
Further reading 140
References 141
Chapter 3
Learning themes
t This chapter deals with the three broad It was developed as an approach to resolve
approaches that we can adopt for research research problems that were insoluble
in education – quantitative, qualitative by either of the other two approaches by
and mixed methods. themselves.
t The first part of the chapter outlines the t We shall also be introduced to two more
character of these approaches, the second methodologies, survey and analysis and
part more practical questions and issues experimentation.
concerned with implementation.
t This chapter is a bridge to the processes of
t Quantitative research looks for numerical data collection in Part 2 and data analysis
evidence on which to reach conclusions. in Part 3.
It makes many assumptions on data
By the end of this section you will:
collection and analysis, which may
constrain some investigations. t Understand the philosophical roots of
quantitative, qualitative and mixed
t Quantitative research sets up questions
methods research.
as testable hypotheses and assesses these
terms of probabilities. t Know how quantitative and qualitative
research approaches use the concepts
t Qualitative research uses behaviour, words
of theory, proof and hypothesis.
and images as the evidence on which to
base its conclusions. Its object is to under- t Be able to construct a hypothesis and
stand how people experience their lives as know on what basis you can reject it.
a means of providing rich and deep insights
into why things happen as they do.
t Be able to apply ideas on what can go
wrong in research to your own research
t Mixed methods research combines design.
quantitative and qualitative approaches.
96 Research Methods for Education Part 1 The Context for Your Research
Introduction
This section will help us by showing how There are a number of ways of putting a
we can put our research design together. research design together. Some of them have
Chapter 2 introduced us to some key influ- conventions that govern procedures. Some
ences that should shape our research. Now we researchers regard these as rules not to be
shall look at approaches to research that we broken. Many others come from a more flex-
can draw on. The chapter presents an over- ible tradition and have, as a guiding principle,
view of these approaches, a sort of ’road map’ ’What works?’ In getting to grips with ap-
to making them work for us. It sets out the proaches to research, we should bear in mind
principles on which they are based, the prin- this division amongst researchers. We should
cipal elements of the procedures and what seek to understand the approaches and their
sort of things might derail our research. The implications for our research field. We will
remaining chapters are then concerned with need to decide whether our preference for
the methods and techniques that you can slot one is based on its conceptual and procedural
into these approaches. superiority, or whether it is best suited to the
circumstances we face.
students worked harder than others and what their motivation was or, in the case
of school choice, what made parents think that one school was better than another.
Clearly, the nature of qualitative enquiry is different from quantitative and this differ-
ence is reflected in the character of the data (a concern with feelings and values) and
also the methods used to analyse such data.
t Mixed methods research seeks to combine both quantitative and qualitative traditions
on the basis that research issues in education are often so complex that the insights of
both approaches are required if we are to gain a good understanding. An emotional
response to a situation (such as anger) may be qualitative but it could be equally im-
portant to know how many people share it. As we have expressed it, this is essentially
a pragmatic argument for giving mixed methods the same sort of status as quantita-
tive and qualitative research.
Whatever framework we choose to work within, our task is to uncover some truth about
the issue we are researching. Truth is central to the notion of research. As a researcher we
need to convince others that our results are not false and that our conclusions are valid. We
cannot do this simply by expressing our views forcefully or through the use of rhetoric. The
term ’epistemology’ (see Chapter 2) is often used in research literature to describe research
designs, especially designs that conform to a generally accepted template. More appropri-
ately, however, epistemology, as a philosophical idea, is the study of knowledge and, by
implication, how we know what we know. It is the word ’how’ that principally concerns us
here because it implies that there are some ways of gaining knowledge that are more valid
and some that are less and, as a result, there is some knowledge that is ’true’ and some that,
if not actually false, cannot be shown to be true. The issue of ’how’ is important because
the process by which knowledge is developed, determined and acquired is, ultimately, the
basis for determining whether it is true, not true or false. However, the issue of validity is,
in the end, not always absolute but often a question of what someone believes to be valid.
The ’truth’ that results from belief in a faith is very different from what I believe to be true
from the perspective of a social scientist and both are likely to be different from what a
physicist would accept as truth. The standards we apply to determining truth depend on
where we are coming from, what we are using the truth for and, we should not ignore this,
how many people agree with us. As educational researchers all we can do is, first of all,
convince others that the processes we used to collect our data and the analysis we used to
convert them into evidence are reasonable and, then, that the argument we construct to
reach a conclusion is logical. These are the steps that our research design should specify, so
it is, by implication, fundamental to the truth of our research findings.
In the remainder of this chapter, we will look at the character and principles that de-
fine these three styles of research: quantitative, qualitative and mixed methods. However,
we should remember that:
And there is one final important point that we should make. While our research ques-
tion guides our research, we should not be blind to what else our research data may be
telling us. Case Study 3.1 makes the same point but does so in a far more memorable way.
98 Research Methods for Education Part 1 The Context for Your Research
Figure 3.1
Finding out
how parents They will
Replicate in the
choose a school: choose the
same area
closest school
a quantitative
approach
that the goal is the construction of theory, we must appreciate that the development of
theory advances step by small step. An idea is tested in one location and then another. It
is modified according to this circumstance and then another. And then everything has to
be replicated to ensure that there are no spurious claims as to relationships and predic-
tions. Figure 3.1 shows how this might work for the problem ’How do parents choose a
school for their child?’ The first researcher might explore whether they choose the school
closest to where the family lives. Subsequently researchers would test this again in the
same area in order to replicate the study and validate the original results. Others would
explore whether the same results were found in other areas. Duplication of the research
is a test of the extent to which individual conclusions are more generally valid. Another
researcher might question whether parents choose a school on the basis of the school’s
reputation. The research would again be replicated and duplicated. Yet other researchers
would look to see if parents choose a school on the basis of faith or according to what
aspirations they thought the school had for their child and again the research would
be both duplicated and replicated. In this way the research community would build up
a picture of how significant the various potential influences actually are. If we were to
think, though, that there are co-ordinated teams of researchers doing this, we are likely to
be wrong. Most educational researchers work alone or in small groups and the most that
they can do is to identify pattern and order over a small part of the educational terrain.
The researcher who brings things together to specify a theory comes along infrequently.
amounts of time with a teacher. How do we know whether the pattern that we think we
see really exists? How do we prove to others that the pattern exists? The concept of proof
lies at the heart of quantitative research.
What do we mean by proof? It all depends on the circumstances. When we fly in an
aircraft we want to be sure that the theoretical relationship between thrust, wing surface
and lift that the engineer is using is not a Friday afternoon theory knocked out so that
the scientist can get a good start to the weekend. In science ’round about’ is not good
enough. Prediction here means just that, if something happens then the result that is
predicted will also happen. In other words, there is a defined and stable relationship.
In a criminal court, in many countries, proof is ’beyond reasonable doubt’. There are
many researchers who would accept that this is a perfectly reasonable specification and,
in terms of qualitative research, it probably comes closest to what is practised. The aca-
demic equivalent of a jury is that the research is published to the community of scholars
who either accept it, or reject it. But for quantitative research there are ways of convincing
others that our findings constitute a proof.
In science we understand that a theory is proved when there is certainty in a re-
lationship. The problem for education is that it deals with people and the best that
can be said of people is that there is a likelihood of something happening. Let us look
again at the question of the factors that might influence parents’ choice of school,
the example that we first met in Figure 3.1. Our research on travel time might show
that, as travel time increases, the proportion of children going to a particular school
decreases. We might even find that some children will pass by a school in order to
attend a school that is further away. Because we know the numbers of children in an
area, we can say that 30% of them will go to school A, 60% to school B and 10% to
school C. We can even express this in a different way and say that for any child there
is a 30% chance of going to school A, a 60% chance of going to school B and a 10%
chance of going to school C. In other words, we can use probability to say something
about behaviour.
Now imagine that we wanted to go further in our research and wanted to understand
in greater detail the factors influencing parental choice of a school.
t First, we have to specify the factors that might influence demand. We have already
identified travel time as a possible factor as well as the reputation of the school
(which is the parents’ belief about how good or bad it is), whether the school is faith
based and what its aspirations for its pupils are. We could add as factors the attain-
ment of students, how many progress into higher education and the student/teacher
ratio. When these factors are combined we might have a means of predicting demand
for places in any one school.
t Second, we have to present these factors in a particular way. As they stand, they
effectively constitute a theory of what is likely to happen. We can express this theory
as a proposition, ’demand for places at a school is influenced by (a) the travel time
of potential pupils plus (b) the school’s reputation plus (c) whether it offers a faith-
based education plus (d) the character of the school’s aspirations for its pupils plus
(e) student attainment plus (f) student progression plus (g) the staff/student ratio’.
Such a proposition is called a hypothesis. We may have been guided by other peo-
ple’s research in our selection of factors but their inclusion represents an assump-
tion on our part that this combination of factors is causative and therefore predictive
of demand. We might even go so far and say that shorter travel times will be pref-
erable to longer, a high reputation preferable to a poor one, high grades in public
examinations preferable to low ones, a high proportion of students progressing to
102 Research Methods for Education Part 1 The Context for Your Research
higher education preferable to a low proportion and a low ratio of students to staff
preferable to a high ratio. A faith-based school may appeal particularly to parents
actively practising a faith, and a school’s aspirations for its pupils (which might range
from a traditional academic education through to a commitment to community and
public service) will strike different chords with different parents and, probably, at
different times too. In other words, we can be really quite specific about how the
causation works to influence parental choice.
Can we prove our hypothesis? The answer to this is ’maybe’ but it is not straight-
forward. First we have to go back to the notion of proof and what can or cannot be
proved. At this point we do have to concern ourselves with philosophy and the process
of logical deduction. In terms of logic, the only truth that we can establish is whether
or not a conclusion is false and not whether a conclusion is true (this is because we
cannot possibly test every situation for truth). The implication of this is that we have to
phrase our hypothesis in a particular way, as an inverse of what we are trying to prove.
If our hypothesis is that A is related to B, the inverse that we have to test is that there is
no relationship between A and B. This is called the null hypothesis. If you want to under-
stand the reasoning behind this in more detail, read Case Study 3.3. We will return to
the process of establishing proof in the next section. We will meet the null hypothesis
again in section 13.1.
While theorising and a concern with proof shape the character of quantitative re-
search, we should not think that they limit the ways we can research. There are two
broad approaches to undertaking a quantitative study: either collecting our data (often
through a survey) and analysing them using statistical and numerical procedures or
collecting our data within an experimental setting and framework. Much, though not
all, of their procedure is common. We shall look at these in the second part of this
chapter.
Explanatory
Qualitative
Quantities Meaning Humanistic Empiricist
research
Factor Holistic
Subjective Interpretive Positivist
control Organic
descriptive and concerned with how and why things happen as they do. Quantitative
approaches are objective, experimental and, within the positivist tradition, value the
empirical observation of cause and effect. Qualitative approaches, in contrast, are held
to stem from a humanistic tradition. They draw on insight and interpretation, and allow
researchers to draw on their subjective responses to evidence. With quantitative research,
numerical evidence is the basis for drawing conclusions. An issue is structured as influ-
ences (causes) and effects and the research process is presented as being strait-jacketed.
But with qualitative research the emphasis is on wholeness and detailed connections
between our social worlds, emotional and cognitive processes and economic circum-
stances, all which have to be understood in order to recreate the meaning that we give
to our lives.
Let us now examine the character of qualitative research and see what its distinctive
contribution is to research.
However, qualitative researchers should be confident in their response for two reasons.
t First, individual methodologies, for example, grounded theory, which we shall
meet later (see Chapter 11), have established their own procedures that allow other
researchers to see clearly what a researcher’s decisions have been.
t Second, if research were a process where events and outcomes fell easily into neat
relationships, the world would be a simpler place to live in. But the real world and
the research world are not like this. In the discussion of quantitative research we
noted that we could deal in probabilities. For example, young people from the poor-
est backgrounds may have only a 10% chance of going to university while those from
affluent backgrounds have a 70% chance. However, probabilities do not tell us what
we have to do to change this situation. If there is a goal of overcoming social disad-
vantage through access to higher education, it is necessary to understand the motives
and aspirations of those who already go as well as those who do not. Understanding
people does mean that you have to deal with people’s feelings, values and emotions
as well as their behaviours, their attachments to place and people, their fears, hopes
and motivations as well as their perceptions of the world, the organisations with
which they have contact and their relationships with them. What formal research
procedure is there that can put all of this together? It requires sorting, evaluating, jux-
taposing, contrasting and rejecting before accepting. Some of this can undoubtedly
be accomplished by formal analysis but the insights that create the integration that
lead to understanding are made by researchers themselves.
the public good or are some working to ensure that a rival does not gain an advantage in
promotion to higher office? Does every consumer go to the nearest supermarket? How do
we account for those who choose to shop at one that is further way? If the nearest super-
market is expensive and we receive social security, it is not irrational to shop for cheaper
goods, even if it means travelling further. In other words, these examples demonstrate that
there is no single rationality that determines the way people behave.
Chapter 3 PUTTING YOUR RESEARCH DESIGN TOGETHER 107
Deduction Induction
Theory testing Theory building
The task of the qualitative researcher is not to look at how people behave as an out-
sider but to understand how individuals see the world.
Quantitative Qualitative
The level of motivation in Group A The level of motivation and the motivational
is higher than in Group B. drivers are different in the two groups.
Which becomes
There is no significant difference in the
levels of motivation in the two groups.
apparent. The quantitative test specifies the nature of the difference in the research
hypothesis (motivation in group A is higher than group B) and the hypothesis that
is tested is the null hypothesis. What we would hope from this is that our statistical
test shows whether the levels of motivation are so different that the difference is sta-
tistically significant. The qualitative hypothesis, on the other hand, explores not only
whether the levels of motivation are different but also why they are different. The edu-
cational psychologist, Alan McLean, argues that there are four drivers to motivation
in respect of education: the student has to engage with the learning process (often,
in the first instance, with the teacher), structure (an understanding of goals and the
student tutor relationship), relevance (essentially an acceptance of curriculum values
and content) and feedback (so the students can gauge their progress) (McLean, 2003).
A qualitative research project would seek to see if these issues played a part differen-
tially between ethnic groups but, if not, the cultural dimension rather than the level
of motivation could be the prime influence on attainment and this would have to be
explored further.
At this point, a quantitative approach would feed the data into a statistical
procedure in order to determine proof. This clearly cannot happen with qualitative
research, so we have to adjust our idea of what constitutes proof. In a qualitative
approach all we have are the judgement and the argument of the researcher. It is help-
ful at this point to think in terms of an analogy. In a court of law, placing a suspect
at the scene of a crime with fingerprints, DNA or corroborated sightings as evidence
would convince most juries. But even if this evidence were not available, circumstan-
tial evidence might still be sufficient for a jury to convict. Circumstantial evidence is
evidence from which a reasonable person can infer a conclusion. If some money went
missing from an office safe at the same time as I, on my limited means, bought a new
and expensive car, I think I would convict myself if I were on the jury. So from law, we
have a working model of proof. Proof is whether our professional colleagues or tutors
accept our conclusions.
where interviewees had a free response to questions (Arnon and Reichel, 2009).
Scottish researchers, Edward Sosu, Angus McWilliam and Donald Grey also used mixed
methods in their study of teachers’ commitment to environmental education (Sosu
et al., 2008). They collected data from teachers using attitude scales and analysed this
using sophisticated statistical techniques to test a model of commitment that they had
devised. They also collected data through interviews to explore how organisational,
political and personal factors influenced individual teacher’s commitment to environ-
mental education.
It has, however, proved controversial to mix quantitative and qualitative approaches
in a single research design, more so in some subjects than in others. In education and
medicine the debate appears to have been particularly loud. Why should there be con-
troversy? What is so controversial about a research design that follows a survey with a
case study? Put like this, the combination seems eminently reasonable. The survey gives
the general picture and the case study gives a richer understanding. However, for some
researchers on either side of the quantitative/qualitative divide, the incorporation of a
procedure from ’the other side’ is tantamount to supping with the devil. Where does this
attitude come from? A number of reasons can be identified.
t Some researchers so identify with an approach that any transgression against its prin-
ciples is seen as an attack by vandals on the citadel of civilised research.
t Some subjects have a research tradition that is dominated by one approach to re-
search. If the other approach is little known or valued, then the opportunities for
combining methods are restricted and even if a researcher were to, it is likely that
any resulting research papers would find difficulty in being published. Psychol-
ogy, dominated for so long by a scientific methodology, possibly falls into this
category.
t Some researchers regard epistemology not as a description of how research is con-
ducted but as a rule book that specifies how all research should be conducted. If
so, they greatly misunderstand research and look for answers to research problems
in categorical rules rather than appreciating the need for the researcher to exercise
judgement.
t Some researchers follow the implicit or even explicit guidance of those who fund
research. The American academic Madhabi Chatterji (Chatterji, 2004) drew attention
to this in the USA. Read Case Study 3.5 for more on this.
t Some traditions are given more weight when students are taught about research
methods. In the UK, the Commission on the Social Sciences (an independent review
of the work of the social sciences conducted under the auspices of the Academy of
Learnt Societies for the Social Sciences) expressed concern over the ’deficit of quanti-
tative skills’ (Commission on the Social Sciences, 2003, p. 57). For Professor Stephen
Gorard and Dr Chris Taylor, both British educational researchers, the small num-
ber of quantitative studies in education was a consequence of this over-emphasis
on qualitative research approaches in research methods courses (Gorard and Taylor,
2004, p. 9). There is, however, another dimension to this, which is the attitude of
students to numeracy and their capability in mathematics. The implication of this is
that the root causes are to be found at an earlier stage in a person’s education. The
outcome of all of this is that students are taught more about qualitative approaches
than about quantitative and are less able or inclined to use quantitative. Hopefully
this book will help redress the imbalance. Quantitative approaches are illuminating
and need not be difficult.
110 Research Methods for Education Part 1 The Context for Your Research
What does research have to look like before it can be said to possess a distinct tradi-
tion? Very simply, it should have rules of procedure. This is the case for both quantitative
and qualitative approaches, though it is fair to say that those for qualitative research are
less prescriptive than for quantitative and apply at the level of methodology rather than
epistemology. In other words, qualitative rules tell us how to carry out a case study, for
example, while quantitative analysis rests on the principle of randomness. Some order
is emerging in the ways in which methods can be combined, though these are either
conceptual typologies or operational guidance based on what seems to work. It is dif-
ficult at this stage to recognise a claim for a distinctive methodology, though the need to
be flexible in the way we use research methods to resolve specific research issues is now
much more widely accepted.
(c) The argument that mixed methods are not a new approach
The absence of an accepted conceptual infrastructure for mixed methods certainly supports
the argument against the approach being a new tradition, though it is not a watertight case.
Purists on both sides of the qualitative/quantitative divide will attack the combination of
approaches, arguing that chalk and cheese are just not compatible. What we might care
to think about, however, is that as the world evolves, frameworks that do not change will
gradually lose touch with what they seek to manage. Innovation is a necessity in a changing
environment, in research as in anything else. Mixed methods may not be the right innova-
tion but the approach should not be rejected on the basis that it does not fit with what
worked in the past. In research, as in life, the past is another country.
112 Research Methods for Education Part 1 The Context for Your Research
Now we are going to look at some of the more practical issues concerned with the con-
cepts, ideas and issues that we considered in the first part of this chapter. Our framework
will again be quantitative, qualitative and mixed methods approaches to conducting
research. This structure will not only make it straightforward to link research in practice
with what we looked at in terms of the context for these frameworks but it will also make
comparison between the approaches easier.
School choice is related to travel time. School choice is not related to travel time.
or
There is no relationship between school choice and
travel time.
As travel time increases, the number of parents There is no relationship between travel time to school
choosing a school will fall. and parents’ choice of school.
The number of children attending a school is There is no relationship between the number of children
inversely related to the travel time to the school. attending a school and the travel time to the school.
114 Research Methods for Education Part 1 The Context for Your Research
Case study 3.6 Calculating the possible number of samples of a given size
To keep the numbers a manageable size, let us assume With our study the total number of refugee families (n)
that we wanted to know about the decision making is 20; r, the number of families we wish to get informa-
of 20 refugee families that had moved into the study tion from, is 8; and (n 2 r) is, therefore, 12. The answer
area in the last three years and that 8 of them would be to the question ’How many combinations of 8 families
selected to give their information. The GENERAL ex- are there out of a group size of 20?’ is given by:
pression for calculating the number of combinations
of a given size (r 5 8) out of a total group size of n is: 20! (20 3 19 3 18 3 17 . . .)
5
8! 3 12! (8 3 7 36 . . . 3 12 3 11 310 . . .)
n!
Combination(n,r) 5 2432903008176640000
r!(n2r!) 5 5 126,037
40320 3 479001600
The mathematic symbol! represents a calculation called
a factorial. This is very simply a number multiplied The answer is about 126,000, so the chance of hav-
by one less than the number and then two less until ing any sample of 8 is 1 in 126,000. If our concern is
the final multiplication is by one. So, factorial 4 is 4 3 choosing people who are not typical, then this figure
3 3 2 3 1. As a general expression, this is written: should reassure us. When we do this for larger sam-
ples, the numbers get very large indeed! (And that is
(n ∙ (n 2 1) ∙ (n 2 2) ∙ (n 2 3) . . . an exclamation mark, not a factorial.)
Chapter 3 PUTTING YOUR RESEARCH DESIGN TOGETHER 115
We shall return to this issue of the number of combinations of a certain size when we
look at sampling in Chapter 6. For now, we shall assume that the research has been con-
ducted and the results processed using one of the test techniques outlined in Chapter 13.
All of these techniques yield a result (a numerical value) that can be assessed in terms of
the likelihood that it occurred by chance. In broad terms the values that result from the
statistical test are influenced by the data that are collected and by the size of the sample,
that is, the number of bits of data that have been collected. As our sample size increases,
the number of bits of data in our analysis increases and this, in turn, is likely to have an
effect on the size of the value that our statistical test produces. The reason for this is quite
obvious. Subtracting one number from another (often the basis of a statistical test) is a
simple measure of difference. If we do this calculation 20 times and sum the resulting
differences, the result is likely to be larger than if we do it five times. It is clear then that
we need to be aware of how large our sample is because this determines how many bits
of data there are in our calculation.
Assuming that we have allowed for the size of the sample (and how we do this is con-
sidered in Chapter 13, when we look at something called degrees of freedom), we judge
whether the probability that it has occurred by chance is more or less than a threshold
that we will accept as being proved or not proved. This threshold is referred to as the
significance level. Finding an answer to the question ’What is an appropriate level of
significance?’ could take us round in circles. What may be significant for a researcher
wishing to prove a point may not be significant for someone more sceptical, such as a
researcher whose conclusions are about to be challenged. Fortunately common sense
and convention offer us an answer. If there is a 50% probability that research results were
obtained by chance, then our understanding of the issue being researched has not been
advanced. If the probability is 10%, then many researchers are likely to be convinced; at
5%, chance is not a big player in the game and at 1%, this constitutes proof that most
people would accept. The next level, 0.1%, has only a one in a thousand chance that
chance itself had been the influencing factor.
The basic model is that proof is more convincing when chance is less likely to be an
influence. Conventionally the 10% significance level is a weak proof, 5% a solid proof and
1% convincing proof. The level of probability associated with the value resulting from the
statistical test then has to be applied to the null hypothesis (not the research hypothesis
because it is impossible in social research to have all the data to demonstrate that it is
true). At one level this is a straightforward process (which is outlined in more detail in
Chapter 13), but there are occasions on which it can call for some mental agility. There
is another more appropriate way of expressing our research result in relation to the null
hypothesis and it relates specifically to concepts introduced in Chapter 13. Instead of say-
ing that there is a 1 in 20 chance that our research result has occurred by chance, we could
say that in only 1 out of 20 occasions when we get this result will we be wrong in rejecting
the null hypothesis. This does not seem unreasonable, given that people do not always
do what is logical (or even sensible) and that research is a messy business. The issues in
educational research, especially when the numbers we are dealing with are small, can be
fuzzy and indistinct and a 95% probability that the results are not due to chance seems
pretty good to me.
the potential for things to go wrong is quite high. This section highlights some of the
fundamentals that have to be correct.
Poor data
The process of obtaining data can be fraught with difficulty. If we intend that the conclu-
sions we draw from the analysis of our research data will apply to other situations (that
is, we want to be able to generalise our conclusions), then we have to pay particular at-
tention to how we select our data. Chapter 6 which deals with obtaining data is crucial
here and we need to pay special attention to sampling. Only a limited number of sam-
pling procedures are appropriate if we want to extend our conclusions from the data we
have analysed to a wider world.
Even if our procedure for choosing the sample is appropriate, we can still get things
wrong. In general, the smaller the quantity of data that we have, the greater the pos-
sibility that we will select data that is atypical. For example, suppose that we have
assessed the IQ of 100 children and were going to select a sample to represent the
group. If we selected only one, then we have as much chance of choosing the highest
or lowest in the group as any other. Using this one to represent the group is not par-
ticularly appropriate.
However, even with an appropriate sample size, we can still finish up with a sample
that is not balanced. Let us assume that we are seeking to obtain sensitive information,
for example, we might want to know from parents what the household income is, or,
from students, whether their parents give them any sex education. In these circumstances
we should be aware that:
t Some people may inflate their income – would you expect this to be the same rate for
everybody or to be more in evidence in certain income groups?
t Some people may refuse to tell you their incomes or whether they receive sex educa-
tion from their parents. Again, would you expect this to be a random effect or con-
centrated in certain groups?
How to ask questions in the right way is considered in Chapter 8.
Failure to obtain information can produce bias and our research should show that our
data is not biased. We do this by examining the pattern of non-response. For example,
if a school were to send a questionnaire for parents to complete, it should check on the
geographical distribution of the replies by comparing the postcodes of respondents with
the postcodes of the children in the school. Any marked variation would constitute bias
and imply that the voice of a particular group, such as an ethnic minority, was not being
heard.
To summarise, we need to:
t take care about sample size
t think about whether we are being given truthful information
t examine non-response for evidence of bias.
the data that can be analysed. To make sure that we do not use the wrong test we should
understand the following.
t The assumptions on which the test is based.
t What our purpose is in conducting a test, for example to test whether two things are
different (boys’ and girls’ non-attendance at school) or to see whether things are associ-
ated (assessment score at age ten with the number of years of pre-school education).
t How the way in which we measure our data influences the type of test we use.
(Chapter 4 explains about data measurement.)
Finally, when we put all of these things together and we are still faced with a number
of possibilities that seem appropriate, we should appreciate that two different tests can
(but not will) produce different levels of significance with the same research data.
0 0
50 1.6989
250 2.3979
1,000 3.0000
2,500 3.3979
5,000 3.6989
10,000 4.0000
25,000 4.3979
For example, student debt is a continuous series. Someone could have a debt of £49, some-
one else £4861 and a third person a debt of £18,324. If we are dealing with a large number
of people, it is often easier to compress the data into classes (such as £1 to £500, £501
to £1,000, etc.). But how do we choose the classes? This is an absolutely crucial question
because selection of different class boundaries can produce totally different statistical
results. Look at Case Study 3.7 for an illustration of this and an indication of the problems
that we might face. However, Chapter 10 gives us some answers to choosing the number of
classes with which to represent data and the identification of class boundaries.
ecological fallacy problem. The only way we can minimise the problem without complex
data manipulation (which may not work anyway) is to use very small-sized areal units
(such as postcodes).
The ecological fallacy has much in common with the class boundary problem that
we looked at earlier. It has been shown that different combinations of spatial units will
generate different statistical test outcomes. This should not surprise us. The manipula-
tion of boundaries to gain electoral advantage, a process known as gerrymandering, has
long been practised by corrupt administrations. It is an issue that should not be ignored
and even new researchers should show that they understand it.
Summary
While the quantitative approach has strong procedures to guide us, we should never
under-estimate the number of times we have to take a decision about what to do.
Each decision has to be justified so that it is acceptable to our peers. We have to plan
our strategy from start to finish to ensure that the various stages and components fit
together and then, almost certainly, we will have to review and revise it as we take
account of the things that can cause us to be wrong in our conclusions. Table 3.4 sum-
marises the stages. Having identified a research area, we focus on an issue and phrase a
research question. We unpick the question to identify what we think may be the factors
at work and present them as a model to be tested. For each factor we state a research
and null hypothesis and identify what data we will need to test the hypotheses and
where we will get the data from. At this stage we also need to specify the statistical test
we will use to ensure that the way we obtain the data is compatible with what the test
requires. If not, it is back to the beginning to start all over again. We can appreciate that
this procedure necessitates considerable pre-planning, so much so that it often takes
up at least a quarter and often much more of the time available for the whole project.
The model is fundamentally the same for the experimental approach that we shall look
at in the next section.
Parental
Teacher
support
Learning
Time of day Reading ability
envi ronment
Attention Reading
span of pupil approach
any change in reading ability must be due to the influence of that factor. This approach
of allowing one thing to vary while others are held constant is described as ceteris paribus,
which is usually translated as ’other things being equal’. Just how this is done we shall
see in the next section.
Experimental
group
Testing performance
Control group
Testing performance
without the factor
(a)
Subjects Random
participating in allocation to
the experiment (a) or (b)
(b)
GroupA Group A
Testing performance Testing performance
with the factor without the factor
Group B Group B
Testing performance Testing performance
with the factor without the factor
Experimental
group
Testing performance
Allocate to groups
for convenience
Control group
Testing performance
without the factor
(c)
Subjects
participating in
the experiment
(d)
Experimental
group
Allocate to groups to
ensure that groups Testing performance
match in terms of
key characteristics
(eg, ethnicity,
Control group
age, gender)
Testing performance
without the factor
not necessarily long lasting. It would be appropriate, for example, if we wanted to assess
the effect of ambient temperature or noise in the classroom on children’s quality and
quantity of work.
Factorial designs
The experimental approaches so far have been concerned with isolating one factor. A factor-
ial design allows us to test two factors (or even more) together but to assess their influence
separately. However, a little tweak to the matched group design takes us into a much more
powerful experimental model in which we can explore this effect of a number of influences
at the same time. Let us imagine that we want to assess the effect of two factors, the special
reading programme and parental support for both the standard and the special reading
programmes. Our design manages to ensure that the same teacher will take all classes,
thus removing another source of possible influence. With this specification we have set up
a factorial design. What would make it robust as an experimental approach would be to
allocate pupils to our groups randomly and if we could match individuals it would be a
Rolls Royce of a design. The logic of a simple factorial design is shown in Case Study 3.9. It
shows how, given the right conditions, we can generate powerful conclusions.
t individual subject experiments, where one or more subjects are subject to experimen-
tal intervention;
t sequential designs in which results of an intervention are collected on a continuous
basis and the experiment is halted once the significance (or insignificance) of the
intervention is established.
Chapter 3 PUTTING YOUR RESEARCH DESIGN TOGETHER 127
Sample variation
What happens when the subjects we thought were going to take part do not? What if
our experimental procedure requires a matched design and, on the day of the research,
a virus takes out a quarter of our control group? What if the procedure extends over a
period of time and the control and experimental groups lose members? We have to have
answers to these questions. In some cases we will drop people or add others to create a
matched design. In some cases we will go from a matched to an unmatched design. In
some cases the best solution might be to start again with a different design. Whatever
our solution, we have to recognise that we may be deviating from the ideal and this may
128 Research Methods for Education Part 1 The Context for Your Research
affect the validity of our results. As ever, it is how we deal with problems and issues that
will determine our credibility in the eyes of many researchers.
Validity Credibility
Reliability Dependability
Objectivity Confirmability
flexibility, preparedness to break conventions, call it what you will, that gives qualita-
tive research its identity.
First
information source
Wh at you learn
Second
Wh at you learn What you conclude
information source
Wh at you learn
Third
information source
were the case, we would have gained a new piece of data, either that two people had been
mistaken or had conspired. In either case, the question would be ’Why?’
How can triangulation be implemented? The following are examples:
t Repeat data collection by another researcher as a test of reliability.
t Obtaining information about the same process from two or more different people
involved with the process.
t Corroborating interview evidence with documentary evidence or vice versa.
While there may be differences in detail or nuance these are acceptable if the thrust
from both sources is the same. If not, then we have to become an investigative researcher
in order to expose and account for the differences.
Summary
Qualitative research is a powerful research approach with the potential to achieve even
more than it has to date. It is enormously flexible but to realise its potential it has to be
in the hands of the right type of researcher. To make full use of the qualitative approach
we have to:
t believe in its value to resolve research issues;
t understand the implications of using a particular approach;
t demonstrate how we ensure quality in our research, and
t be open in all we do.
Table 3.6 provides guidance on the sections of this book that will support a qualitative
research programme.
They also include the conversion of data from one format to another (that is, from
qualitative to quantitative).
t Mixed model design, in which qualitative and quantitative approaches are integrated
throughout the study. The trigger to a mixed model design is the research question,
which should take the research in directions that require both approaches.
t Multi level design, in which one approach dominates at one level and the other at
another. In terms of level, Tashakkori and Teddlie give child and family as an exam-
ple. Quantitative data may be collected on the educational attainment of children in
single parent families (the higher level) and qualitative data on the life experiences of
children (the lower level) in order to understand how this contributes to educational
outcomes. Class group and pupil group, year group and class group, and national
and local are also appropriate level differentiators.
While it is useful to have a representation of options, it does not take us far in understand-
ing the nature and value of a mixed methods approach. This is what we shall consider next.
4. To test the accuracy of evidence. This is called triangulation and it allows us to use
different approaches to verify our evidence base. With the flexibility that mixed meth-
ods offers there is an increased potential for new ways of triangulating evidence (see
Figure 3.6). Gorard and Taylor (2004, pp. 44–46) make the interesting point that the
introduction of two perspectives gives us a better fix not just on the measurement of
an issue but also on our appreciation of it. They point out that the use of quantita-
tive and qualitative approaches together allow researchers to compare results in a
complementary way. They cannot validate each other with any precision but they can
reinforce each other. This idea of complementarity can operate in different ways in
the research process. It can be used to investigate critical areas or incidents; it can also
work at a higher level to provide rich detail and to improve understanding, for exam-
ple, when a large scale survey is supported by a case study or a number of case studies.
In summary, triangulation takes on a different mantle when qualitative and quantita-
tive approaches are brought to bear on an issue. It is less concerned with the accuracy
of measurement than it is with the correctness of the insight and the legitimacy of the
interpretation. Read the short Case Study 3.11 and you will understand this.
5. To ask contrasting but related questions. Rather than leave economic questions to
economists, psychological questions to psychologists, social questions to sociologists
and geographical questions to geographers, there is good reason for specialists to
work together as a team, or, if the issue is educational, for an educational researcher
to research disciplinary questions from a disciplinary perspective. Educational issues
situation, we will need a robust link between question and method. By robust, we mean
that we have to be sure that the method will generate data appropriate to the question and,
with a mixed method approach, it is unlikely that our research question will be just a sen-
tence; it is more likely to be a short paragraph outlining the complexity of the issue to be
explored, followed by a series of questions that deal with the dimensions of the question.
Going off-piste
Exploring virgin territory is adventurous and exciting; it is also dangerous. Yet without
people doing new things, the boundaries of mixed methods research will not be pushed
out. Doing things in the way that the research question wants them done is what a mixed
methods approach should be all about. We should not be afraid to fail (and my view is
that students should not be penalised for adventurous failure) but it is something that
might happen. To use an analogy, how do you ski off-piste and still stay safe?
The answer is by looking at guidelines put forward by other researchers, judging whether
you should work within them and, if you do not, then make clear why you have not and
explain how you determined your way forward. What guidelines can we identify?
Summary
Mixed methods in research are growing in importance. There is now a journal dedicated
to the approach (Journal of Mixed Methods Research, Sage). Combining methods can be
powerful. They are cost-effective ways of dealing with complex issues. However, they do
require us to take great care at the stage of specifying the research issue. While they re-
quire competence over a wide range of methods, this should not put us off. The methods
that we are most likely to be combining are really quite robust and will take some com-
promise with perfection. The real danger is of not knowing when we have done things
wrongly. An important member of any research project is a critical friend. Who is yours?
Table 3.7 shows the issues we have to consider in mixed methods research and where
they are dealt with in this text.
At the end of Chapter 2, you were asked to specify your research questions. Now you are
nearly at the end of this chapter, you are in a position to move on and to sketch out your
research strategy.
t Does your core research question fit into any of t Now that you have a better understanding of
the types that were considered in Chapter 2 (see quantitative, qualitative and mixed methods ap-
Table 2.2)? If the question requires data or causal proaches, would you like to revisit your research
analysis, you may be looking at a quantitative ap- question? In particular, does your research ques-
proach. Other types of question could be either tion need to be unpicked and presented as a series
qualitative or mixed methods, though evaluation, of linked questions? If so, can these be answered
policy analysis and action research may predispose at the same time or do some need to be answered
you to mixed methods. before others? What sort of research approach do
t Are you concerned with outcomes or causal re- these questions imply?
lationships? In which case, look towards a quan- t Take a draft of your research questions to some-
titative approach. If your research is more about one whose judgement you trust. Explain why your
understanding processes, a qualitative or mixed questions take the form they do. What other for-
methods approach may be more suitable. mulations did you consider and reject – and why?
Chapter 3 PUTTING YOUR RESEARCH DESIGN TOGETHER 139
Summary
t Quantitative, qualitative and mixed approaches to research are all appropriate for
education research. Each has its own character and way of working. Quantitative
research is more constraining in its requirements than either qualitative or mixed
methods.
t The approach we select should be based on the research issue as the starting point,
not on an abstract commitment to a particular style of research approach.
▲
140 Research Methods for Education Part 1 The Context for Your Research
t Quantitative research can analyse data collected from surveys or from experimental
situations. Both approaches require rigorous conditions in data collection.
t Qualitative research focuses on understanding people’s life experience. This requires
researchers to look in depth at their subjects. Qualitative research is, therefore, usu-
ally smaller in scale than quantitative.
t Mixed methods research combines both qualitative and quantitative methods
of data collection and analysis. The combination of the two approaches produces
understanding with deep insight over a large population. This can make it a very
cost-effective research strategy.
By the end of this first part of the text you should have:
Further reading
References
Arnon, S. and Reichel, N. (2009) ’Closed and Open- Mason, J. (2006) ‘Six Strategies for Mixing Methods
Ended Question Tools in a Telephone Survey About and Linking Data in Social Science Research’, ESRC
“The Good Teacher”: an Example of a Mixed Method National Centre for Research Methods NCRM Work-
Study’, Journal of Mixed Methods Research, 3(2), ing Paper Series 4 06.
172–196. McLean, A. (2003) The Motivated School, Paul Chapman
Chatterji, M. (2004) ’Evidence on “What Works”: An Publishing, London.
Argument for Extended-Term Mixed-Method (ETTM) Petter, S. and Gallivan, M.J. (2004) Proceedings of the
Evaluation Designs’, Educational Researcher, 33(9), 37th Hawaii International Conference on Systems
3–13. Sciences, IEEE Computer Society Press, Washington.
Commission on the Social Sciences (2003) Great Robinson, W. (1950) ’Ecological Correlations and the
Expectations: the Social Sciences in Britain, Academy of Behaviour of Individuals’, American Sociological
Learnt Societies for the Social Sciences, London. Review, 15(3), 351–357.
Doyal, L. and Harris, R. (1986) Empiricism, Explanation, Simons, D.J. and Chabris, C. F. (1999) ’Gorillas in our
and Rationality: An Introduction to the Philosophy of Midst: Sustained Inattentional Blindness for Dynamic
the Social Sciences, Routledge and Kegan Paul, Events’, Perception, 28, 1059–1074.
London. Skolverket (2005) Educational Inspection 2004: A
Finlay, L. (2006) “‘Rigour’, ’ethical integrity’ or ’artistry’? Summary of Inspection Results in the Swedish Repart 26b,
Reflexively reviewing criteria for evaluating qualita- Swedish National Agency for Education (Skolverket),
tive research”, British Journal of Occupational Therapy Stockholm.
69(7), 319–326. Sosu, E., McWilliam, A. and Grey, D. (2008) ’The Com-
Giddings, L.S. (2006) ’Mixed-Methods Research – plexities of Teachers’ Commitment to Environmental
Positivism Dressed in Drag?’ Journal of Research in Education’, Journal of Mixed Methods Research, 2(2),
Nursing, 11(3), 195–203. 169–189.
Gorard, S. and Taylor, C. (2004) Combining Methods in Tashakkori, A. and Teddlie, C. (2003) Handbook of
Educational and Social Research, Open University Press, Mixed Methods in Social and Behavioural Research,
Maidenhead. Sage, Thousand Oaks.
Johnson, R.B. and Onwegbuzie, A.J. (2004) ’Mixed Whyte, W.F. (1943) Street Corner Society: The Social
Methods Research: a Research Paradigm whose Time Structure of an Italian Slum, University of Chicago
has Come’, Educational Researcher, 33(7), 14–26. Press.
Data collection Data analysis
Modelling data
needs
Chapter 7
Literature as Obta ining data
data questionnaries
Chapter 5 Chapter 8
Chapter 9
Data quality
Primary data
sampling
Chapter 4
Chapter 6
Part 2
In Part Two we look in detail at the process of that are now available to us are so large.
data collection. What we will learn are that Chapter 5 considers literature as data and
there are different sources for our data and examines how we should make use of what
many ways of collecting them. The research- others have written throughout our research
er’s skill lies in combining these in order to en- programme. In Chapter 6, we return to pri-
sure a sufficient quantity and quality of data mary data and how we can ensure the qual-
to answer the research question. One conse- ity of our data when we deal with samples
quence of combining methods in data collec- rather than the whole population. Chapter 7
tion is that research often moves towards a introduces a way of looking at the research
mixed methods approach. process that helps us identify the data we
need to resolve our research issues by analys-
We shall approach the data collection in the ing the research question and representing
following way. First we shall give an over- our analysis as a model. Textbook authors
view of data in Chapter 4. This covers such do not always introduce this approach but it
things as ways in which we can categorise does sharpen our analytical skills. Finally in
data and ways in which we can measure Chapters 8 and 9 we look at ways of collect-
them. We shall look in detail at secondary ing data through questionnaires, interview
data, because the sources of secondary data and observation.
Chapter contents
Introduction 145
Review 187
Summary 188
Further reading 189
References 189
Chapter 4
Learning themes
This chapter is all about understanding and t Issues we have to be aware of when
practising. The objective is to help you feel accessing secondary data.
comfortable exploring data sources and using
new data in your research. t The last part of this chapter is about
practice and introduces the riches of the
t The first part of this chapter is about Internet and the many sources of data
context. It looks at data from different for education research that exist. There
perspectives in order to understand its are example activities to push out the
character and characteristics. Specifically boundaries of our understanding of
we are concerned with: research.
t The difference between data and By the end of this chapter you will:
information.
t The difference between qualitative and t Appreciate what we mean by data and
quantitative data. how they can be measured.
t How the informational content of data t Understand the difference between
can be measured. primary and secondary data.
t Where we can obtain our data. t Be confident that you can source the
t The difference between primary and secondary data you need.
secondary data. t Be aware of issues that can compromise
the value of secondary data.
Introduction
One of the surprising things about texts on into the basic elements of the process and
education research methods is that they say style of cuisine. In most cases this concerns
so little about data. Other texts that tell us ingredients and the guidance ranges from
how to do something also usually inform what to look for in order to identify quality,
us about the fundamentals of the things to information about ingredients that are
that we will use. Take cookery books, for not part of a cultural background. But texts
example. They usually give some insight on research methods are, at best, limited or
146 Research Methods for Education Part 2 The Process of Data Collection
often silent when it comes to data. This is examine what we can use as data and how
surprising because the volume of data that we can measure them, look at data from
is publicly available is growing by huge leaps several perspectives (each with their own
each year and many people are unaware of method of classification) and finally review
the vast riches of data that are a click or two the wealth of data that are now available on
away on the Internet. In this chapter we will websites worldwide.
Why do we need to consider data as a separate issue? The answer is that data are
the material out of which we construct our research argument. We should, therefore,
understand what constitutes data and what characteristics it can have. Once we know
what we can use as data, we can then consider how it might affect the way in which
we phrase our research questions or how we go about structuring the research process.
It is important that we know this because certain types of data lead to particular
types of analysis, which themselves require certain procedures to be followed in data
collection. It is not good enough to collect the data and then think about how we are
going to process it to extract the information we need for our research. So, in summary,
we need to consider data as part of the whole process of planning a research enquiry
and, to do this, we should know what data are available and what the implications
of using it are.
There is, as well, another reason why we should look at data as a research issue –
and this is, possibly, the most important reason. We are, all of us, conditioned by our
education and our experiences; we are limited by our knowledge, our understanding
and by our imagination. As researchers, we should constantly challenge these limits by
seeing how others conceive of and create data. We should venture outside our comfort
zone of experience and knowledge to see what other subjects use as data. We need
to be aware of what other disciplines that have an interest in our research field are
doing so that their findings can become our data. For instance, if we are exploring
behavioural problems in the classroom and are not aware that food type and food
additives might be factors in understanding a child’s ability to concentrate, then we
might find ourselves in the situation of promoting solutions without understanding
the cause. If we are researching child development, biochemistry and genetics may well
be fields that we should get to know, given the rapid advances that have been made in
our understanding about the significance of these areas for our lives (and deaths). And,
if our interests as educational researchers spread this far, who is to say that we should
not see blood and urine tests as part of our battery of techniques, in much the same way
as we accept the use of psychometric tests? We might also ask, if it is reasonable to try
to understand a child’s emotional state as a factor influencing learning behaviour, what
is to stop an imaginative researcher going further and drawing on reported dreams for
other insights?
Chapter 4 DATA: ASSEMBLING THE RESEARCH TOOLKIT 147
This is not an argument for weird research in education; most research is unlikely
to stray beyond the bounds of conventional data. It is, however, a plea that researchers
should feel able to break the bounds of convention and should use whatever data provide
insight for their analyses. We should be encouraged to be imaginative in our approach
to research and be creative in our appreciation of what data can be used. Just because it
has not been used before does not mean that there should be a ban on using a new type
of data in the future. Case Study 4.1 shows how an innovative approach to research and
data solved a problem for some economics researchers.
The simple fact is that the data do not directly produce our answers. Data is raw stuff.
Think of it as being clay in the hands of the potter. Only with the potter’s skill does the
clay become something that is attractive or useful. Data are neutral; they just sit there
waiting to be discovered and waiting for their ‘message’ (their informational content)
to be extracted. The role of a researcher is to identify the right data and make them give
up their message. In other words, researchers have to extract from data the information
that is useful to their research, information that will help them answer their research
question.
But that is not the end of the process; researchers also have to convert their informa-
tion into evidence. In research it is not enough to assemble our data and then present
them to our audiences. We have to interpret it so that people reading our work can see
what conclusions we are drawing and then judge whether they would do the same. If,
before we begin our research, we understand how our argument might logically progress,
then we could use this insight to influence the type of the data we assemble. Supposing
we were studying classroom behaviour; it might be enough to assemble data on two
groups of children, those reported by teachers as being disruptive and those who were
not. However, it would also be valuable to combine this with observations on parenting
behaviour, family lifestyles, and classroom behaviour through the day. If similar data
had been collected by researchers at some point in the past and archived, we could use it
to give us a temporal perspective. If we are not aware of what data exist and what we can
use, then we will always be disadvantaged.
The purpose of this section is to give a sense of what data are available to us as educa-
tion researchers.
This division between qualitative and quantitative data is well established, extensively
used and deeply rooted in all areas of social research. And yet, it is often misunderstood
in terms of its implications. Describing data as qualitative and quantitative may give the
impression that what we do with one, we cannot do with the other. More simply put,
some people believe that qualitative characteristics are not capable of being measured.
This is wrong. Qualitative data are an attribute of a person or object. It can be an opinion
or the colour scheme used in a school. It is perfectly feasible (and reasonable) to count
the number of people or objects who share a characteristic. We would, for example,
find it valuable to know how many mathematics teachers believe that they would get
better assessment results if they taught their students in ability sets. Similarly it would
be interesting to know how many parents, whose children were in low ability sets,
agreed with streaming. The measurement of qualitative characteristics is important for
education research. It is not just the strength of a view or opinion that is significant but
the number of people who hold it.
Qualitative data are inherently more complex than quantitative because they can take
more forms. We shall examine these in the next section.
problem about asking people about usual (or frequent) behaviours (such as the usual
school day or the usual sports lesson) is that individual events of particular significance
may not be reported.
An American psychologist, Norbert Schwarz, provides a useful framework for
understanding reports of past behaviour. He suggests that there are three dimensions to
consider:
1. The duration of the period.
2. The length of time since the period or event.
3. The significance of the period or event (Schwarz, 2007).
Each of these affects the quality of what we are told. Longer periods of an experience
or activity (Schwarz was interested in the experience of pain) are more memorable
than short. In general, the passage of time blurs events, so that only those that have
particular significance at the time or subsequently are remembered with any clarity.
Evidence from elsewhere indicates that memory for detail declines rapidly the further
we go back in time, with inaccuracy appearing when people are asked to reflect
on what happened last week. For anything over one month, inaccuracy increases
significantly. Two other psychologists, Lin Chiat Chang and Jon Krosnick (2003)
looked at reported behaviour in relation to actual behaviour. This is an important
issue for social researchers because questions about typical behaviours are often
asked in large scale and longitudinal surveys. Chang and Krosnick asked questions
about TV viewing and newspaper reading. They found that those who reported the
‘typical week’ over-estimated their interaction with media as compared with their
actual behaviour.
These findings should be a warning to us. If we ask about what people usually do or
what typical activities are, we are likely to obtain poor quality data. Typical behaviour
and intended activity presents many traps for researchers.
Source: http://www.caci.co.uk/ACORN/
Reviewing so far
Let us take stock of what we have learnt about data in section 4.1. We have explored some
of the basic characteristics of data, and how the strength of those characteristics can be meas-
ured and expressed. It should now be fairly obvious to us that while quantitative data can
be expressed in any measurement scale (though conventionally interval and ratio scales are
154 Research Methods for Education Part 2 The Process of Data Collection
most commonly used), qualitative data are measured frequently only on an ordinal or nom-
inal scale. It is important to understand the difference between these levels of measurement
because they affect the type of statistical analysis and test that can be used. (We shall learn
more about these in Chapter 13.) However, so far we have thought about data in a rather
abstract way. Let us now become more concrete and ask, ‘Where can we get our data from?
This section introduces another perspective on data that introduces another way of clas-
sifying them. Identifying sources of data could take up a whole text – and a very boring
one it would be as well! What this section will do is introduce us to an enormous field
and, hopefully, give us the interest and confidence to go out and explore it ourselves.
Very crudely, data can come from people or they can come from places. Generally data
from people are collected personally and data from places already exist, and there is a feeling
amongst some researchers (and not just new ones) that data collected personally are worth
more than data that already exist. In part this has to be correct because the data that we
collect directly should be closely configured to the requirements of our research question. If
the research question requires data on job moves by teachers plus contextual information on
career aspirations, stage reached in the family life cycle and the values that individuals hold
dear, then this information is so precise that there is only one way to get it and that is to go
directly to teachers and ask them. Many studies, however, seem to collect data that already
exist, for example, going to individual schools to find out the number of pupil exclusions
when this information is already collected at an administrative level. The reality is that there
is often a better way of getting the data that we require. All we need is a different mindset and
the ability to combine robust data sets that already exist. If we were exploring the links between
the educational performance of pupils in schools with the social conditions of the areas in
which they lived, the data are ready made for us in the UK, with school performance data
available from the Department for Education and national indices of multiple deprivation
(that take account of social and economic circumstances, family structure, housing and so
on) available from the Department for Communities and Local Government. But if we did
not know this, then we will have to spend a great deal of time collecting the data and this is
likely to limit the number of people from whom we can collect it.
The data we collect ourselves are called primary data, because we are the first people to
make them available to the research community. Primary data can be both quantitative
and qualitative. We shall look at how we can collect primary data in Chapters 8 and 9.
Data that already exist are called secondary data. These too can be both quantitative and
qualitative. We will say a lot more about secondary than primary data in this section
because it is a vast domain of potentially useful information that few, especially young
researchers, are aware of.
The purpose of this activity is to get you to think administration that they have to do, which includes
about how different types of primary data can be everything from recruiting the students to the
used in your research. You can either try this out in management of their assessment and curriculum.
relation to your own enquiry or in relation to the
investigation set out below. t You know that group meetings can give a
distorted perspective; some people may not want
You have been approached by a newly-appointed to put their heads above the parapet and say what
Vice Chancellor of a university who has said that he they really think while others with strong opinions
wants to find out ‘what makes it tick’. In your initial might hog the occasion. What would you do to
discussion you have found out that she is perplexed ensure that people spoke with their authentic
that the performance of some departments is voice? How would you get people to share their
lacklustre while others can hold their heads up in perspectives with others?
any academic gathering. She wants you to give her t What behaviours would you observe to give your
a sense of the mood in the place and to understand insights into collegiality, commitment, imagination?
why some areas seem dynamic and others do not. You t You want to understand how people feel about in-
have decided to focus initially on people’s enthusiasm dividual and institutional worth. You will ask them
for the job, their teaching, research and the necessary but what can you also do that is non-intrusive?
t stress levels, as measured by cortisol levels, in- Source: Legendre, A. (2003) ‘Environmental Features Influenc-
creased in group situations (that is, most of the ing Toddlers’ Bioemotional Reactions in Day Care Centre’s,
Environment & Behaviour, 35(4), 523–49.
day) when they should have fallen;
We can include ideas that can be taken for granted as primary data sources, where our
comments are, as far as we know, original. If our research were into learning technology,
it would be reasonable for us to talk about the ideal e-learning platform or to compare
the use of a whiteboard with an overhead projector. These are things that we can
reasonably expect people to know, so, equally, we can expect them to understand and
appreciate our ideas. Abstract notions too can be a primary source. We can comment
on national identity and multi-culturalism from a value perspective or regeneration or
social disadvantage from the perspective of experience, ethics or economics. We can have
a view on climate change, genetic manipulation and abortion without summarising what
other people have written because we can reasonably expect our educated readers to have
some understanding and knowledge of the issues.
At the heart of using ideas as a primary source is our own creativity. The more thought-
ful we are about issues and the more imaginative we are about asking the right questions,
the more we will be able to link ideas and solve problems. We are our own resource and
we should aim to develop this capability. To this end, we should be aware of issues being
discussed in the academic, professional and popular press so that we can see links that
others have missed. For this we need to be widely read. We should develop an analytical
and questioning approach that we can use to think through issues. Activity 4.3 will show
you that you are really quite good at this and that all you need is practice.
These are exercises you can try by yourself. It is, In at least 800 words, present your leadership analysis
however, useful if you can compare your responses and consider whether different types of leader are
with other people’s just to see how they approached appropriate in different stages of any institution or
them. For all of the exercises you can draw on your organisation’s development.
experience of being a student, your awareness of the
wider world and the insights you can gain from think-
ing and then questioning yourself. (3) Student experience
Draw on your own experience as a student at what-
(1) Curriculum principles ever level and, if appropriate, as a teacher to identify
the character of a good student experience in learn-
Think about the choices of subject you made at
ing. Set this within the context of any appropriate
school and at university. Identify what subjects you
theory and your own beliefs about the purpose of
had to study. Why was your curriculum organised like
education. Convert your notes into text that could be
this? If you are working in an educational setting,
used in a research paper.
do the same analysis and ask the same questions.
Now approach the issue from a ‘what if’ perspective.
How would you organise a senior (11–16) curriculum
and/or an undergraduate degree in a subject? How
Learning lessons
would a postgraduate masters’ degree be different? What you should learn from this is (a) that your
Organise your thoughts and write between 200 and own experience can be an evidence base for you
300 words identifying principles of curriculum design. to question and examine, and (b) once you have
identified the issue and isolated the right question
to solve a problem, you will get an answer. It may
(2) Leadership not be the best answer but it will set you on a path
Using the same approach of drawing on your own of thinking that will probably lead you to a better
experience and, thinking of examples you have come one. One of the things you quickly learn in research
across, of situations that have to be confronted and is that finding the right question to ask is often more
the characteristics required to do the job: difficult than getting the answer!
The wealth of data that falls into this category of secondary data is truly enormous.
Until 15 years ago, accessing this data could be difficult. It involved going to reference
libraries to look up (and then copy down) statistical series or going to a specialist library,
such as the UK’s National Newspaper Library to access, for example, newspapers of the
1970s to see how educational issues were presented. Academic journal articles had to
be consulted at an institutional library or obtained by post from another library and
securing reports meant a trip to local or national education offices. It was not surprising
that many researchers found it easier to collect primary data. The Internet, however, has
changed all of this.
Across the world, organisations, particularly governments and those in the public
sector, have a policy of making much information accessible. However it is not just
availability that has made secondary data widely accessible, it is also the fact that there
are now understandable frameworks, including some comprehensive Web portals, for
accessing the data. The map of data and how to access it is becoming clearer and clearer
year by year. Most academic researchers, though, including educational researchers,
are not making use of this data bonanza. In part this is likely to be because they
are creatures of their pasts, when they needed to create primary data because of the
difficulty of accessing secondary. Those days, however, have passed and it is up to
the new generation of researchers to understand the riches that can now be accessed
through the Internet.
We shall look at some key sources and the data they contain in section 4.3 when we
look at practice but, first, it is important to appreciate some of the issues that have to be
managed when we use some secondary sources.
There are five principal issues that we should be aware of if we use secondary data in
our research. They are:
1. the process of data collection;
2. the scale of data presentation;
3. the scale at which we use the data;
4. data comparison;
5. data quality.
160 Research Methods for Education Part 2 The Process of Data Collection
Each of these can affect our analysis and conclusions. It is important that we understand
their impact before we use secondary data so that we can use our understanding to select
the most appropriate data sets.
A data archive is a depository for research data. The Denmark, Finland, the Netherlands, Norway and
controllers of an archive establish standards for data Sweden). Many of the national archives are searchable
deposition. Much of the data is usually public but in English and have reports in English. For example,
private researchers can place their own data in the the Finnish archive identified almost 600 resources on
archive. The UK Data Archive (www.data-archive.ac.uk) education. The CESSDA website also provides access
holds both official series of data and data collected for to national sites in other parts of the world and to
specific research projects. It has the following categories some non-profit making sites.
for education:
Other resources include:
General Research
t Inter-University Consortium for Political and
Higher and Further School Leaving
Social Research
Literacy Teaching Profession www.icpsr.umich.edu
Primary, Pre-Primary t Netherlands Historical Data Archive
and Secondary www.dans.knaw.nl
t National Digital Archive of Data Sets (UK)
These generate well over 200 data sets. Older data sets
www.ndad.nationalarchives.gov.uk
enable researchers to develop baseline data for their
t The Norwegian Historical Data Centre
own research.
www.rhd.uit.no
The Council of Europe has established a web portal t National Data Archive on Child Abuse And Neglect
(www.cessda.org) which gives access to 21 national (USA)
archives in Europe (including archives in Belgium, www.ndacan.cornell.edu
certainly investigate the procedures used. The same goes for reports. Minutes of official
meetings can normally be relied upon to be accurate.
Many countries have data archives. These are databases where governments and indi-
vidual researchers can deposit survey results and other forms of data (see Case Study 4.3).
Data that are deposited in a data archive are usually of a high quality. Such data can come
from official and public sources but even when they are deposited by research organisa-
tions and individuals they are reliable because data archives filter accessions through a
quality control process. There are, as well, thematic and private data archives (for more
information, see Case Study 4.3). For any data archive, it is important to check accession
procedures as an assurance of standards.
Corporate statistics are also normally of a high quality because they are usually
collected under a regulatory framework such as Stock Exchange regulations and
Government regulations for the disclosure of financial accounts. We might wonder why
educationalists might need to access such information. The answer is to be found in the
number of commercial organisations that are entering the mainstream education sector
and running schools, colleges and universities. These organisations, both for profit and
not for profit, are bound by regulations concerning disclosure of information. In only
rare circumstances is this inaccurate.
Other corporate data are, though, less reliable. They take the form of press releases,
headline statistics and reports and profiles. Most of these are designed to give a strong
corporate message that accentuates a positive interpretation. If we use such data, we
should read them with care and triangulate the interpretation with other sources. If we
find that the data are slanted towards a particular interpretation, then the existence of
the bias itself becomes a source of information. Comparing school and college press
releases with official quality assessment reports can produce interesting results, which
162 Research Methods for Education Part 2 The Process of Data Collection
reveal that public sector organisations are increasingly having to behave like their private
sector counterparts.
We should retain the same level of scepticism in respect of data put into the public
domain by groups and individuals. This is called self-referenced data in Table 4.2. For
example, if we search for ‘citizenship education’ on the Web, we find websites that aim to
support teachers of citizenship (www.citizenshipfoundation.org.uk) ones where teachers
can share what they do (www.citized.info) and ones where people come together to argue
the case for citizenship education and to offer mutual support (www.teachingcitizenship.
org.uk). The data on these sites can be robust but there is no guarantee that the selection
of content in any but the research sites is not influenced by attitude and values. If we
are dealing with viewpoints and opinion forming in our research or with individual
classroom practice, then they are a valuable source but otherwise it is probably best to
find something in which our audience will have more confidence.
North
East
North
West
Yorkshire
and the
Humber
East
Midlands
West
Midlands East of
England
London
South East
South West
from school at a regional level unless there is some policy responsibility at this level.
It is far more appropriate to seek data at the local administrative level (where there is
likely to be responsibility for managing the school system) or at the school level (where
responsibility for managing the pupils lies). We might think that it would be even less
appropriate to look at the data at a national level but, in fact, we would be wrong. At
this level an analysis represents a test of how well a government is implementing policy.
It may seem a challenge to have to consider the spatial scale of data sets. However, it is
usually pretty obvious at which level we should extract our data.
(c) The scale at which we use the data (the level of analysis)
If we thought that we faced problems in terms of the scale at which we access secondary
data, then we face even more when we change spatial scale to actually analyse data or
present them as evidence in our argument. There are three reasons for this.
t First, every aggregation changes the informational content of data. In broad terms
every data set is composed of two elements – the message and noise. The message
is represented by order and pattern in the data set, which we can seek to explain.
Noise is represented by random variation, people whose behaviour is so individual
that it would require research into their personalities and psychological make-up
to understand it. At different levels of data aggregation the balance of message and
164 Research Methods for Education Part 2 The Process of Data Collection
% of pupils achieving
noise changes. In general, the process of aggregating data strengthens the message
but only up to a certain point. After that messages can be lost or changed. An exam-
ple will show this more clearly. School pupils in England are tested against national
standards at various ages. The last test, at the end of compulsory schooling, is a
national, externally assessed examination called GCSE. Table 4.3 shows the stand-
ards attained by pupils in one area of London. The Level 1 standard is five or more
grades in the public examinations at grades A* to G and the Level 2 standard is
five or more grades at grades A* to C. Level 2 defines a narrower range of attain-
ment. The table shows the results from nine schools. Two schools had a 100% pass
rate at Level 2. Three schools had a pass rate of below 50% of pupils meeting the
standard. This divergence in attainment should immediately start us thinking
about factors at work that could produce this degree of divergence. These might
include the quality of the teachers, the management and organisation of the
school and the socio-economic background and family circumstances of the pu-
pils. The table also shows the average for the local authority (47.1%). This value
incorporates all of the nine schools but it cannot be used to help us understand why
the variations occur within the authority. It is only of interest in comparison with
the results from other local authorities. At this level, school character and processes,
the most localised influences, are more random in nature. Socio-economic circum-
stances become influential in creating a pattern and these interact as potential influ-
ences with the type and degree of support that the local authority offers its schools.
In summary, as the scale of analysis changes, so does the informational content that
drives the explanation.
t The second reason why spatial scale is an issue at the stage of analysis is that
inappropriate aggregation can blur messages. Figure 4.2 represents a schematic
section of a city. Assume that the sectors AB, CD and EF are administrative units
and that each part sector (A,B,C,D,E, and F) represents an areal unit for data
Chapter 4 DATA: ASSEMBLING THE RESEARCH TOOLKIT 165
B
Affluent
area D
New heusing
estates
A
Peer
area C
Mixed
heusing
F
Affluent
area
E
Peer
area
Peer area
Affluent area
collection. If data are presented by administrative unit (that is AB, CD and EF)
then, unless we know something about the social geography of cities, we will
lose information and possibly reach an incorrect conclusion. If we think about
this, we will identify the reason immediately. Some of the poorer areas tend to
be close to the city centre, so by combining the outer areas with the inner, we
blur the messages that both areas separately can give us. So it possibly would
be better to combine A with C and E, and B, with D and F so as to combine like
with like. If this seems far-fetched, look at Figure 4.3. This shows the electoral
districts (called wards) of the London Borough of Barnet. These wards have
been grouped together as clusters (shown by the lines drawn on the map). If we
were exploring the association between ethnicity and pupil attainment for our
ward clusters, we would generate results that, at best, would have little meaning
and, at worst, would be erroneous. Why? Because, West Hendon, with 59% of
its population from an ethnic minority is in the same cluster as Finchley where
41% are non-white. And because Coppetts, with a 42% ethnic minority population
is joined together with Oakleigh and East Barnet, both of which have less than a
30% ethnic minority population. The principle is that we should construct groups
on the basis of similarities and not just geographical proximity or administrative
convenience.
t Finally we should remember what we read about the ecological fallacy in Chapter 3
(see Case Study 3.5). Methodological studies quite clearly show that the level of data
aggregation used in analysis can affect the results. Look at Case Study 4.4, which
gives an insight into ‘Simpson’s Paradox’. This shows how aggregation can produce
answers that put us on the wrong track.
166 Research Methods for Education Part 2 The Process of Data Collection
High Barnet
East
Underhill Barnet
Oa
kle
igh
Totteridge Brunswick
Hale
Park
Edgware
house
Mill Hill
Coppetts
ey
Fin est
Wood
chl
W
Burnt
Oak
Finchley
Colind East
Church End
Finchley
Hendon
ale
Garden
en
West Hendon re Suburb
sG
d er
Gol
Childs
Hill
Rotorua Nelson
The moral of this is that you should take great care Source: Westbrooke, I. (1997) Simpson’s Paradox: An Example
when combining data to ensure that you do not gener- in a New Zealand Survey of Jury Composition, Statistics
New Zealand.
ate spurious results.
what can we do? There are a number of numerical manipulations that we can use
(see Case Study 4.5). If we are making an international study of qualifications at the
end of full-time study, how do we equate GCSEs in England at age 16 with the more
complex pre-university, senior general and pre-vocational system in the Netherlands
where students complete at age 16, 17 and 18? In this case we probably do not. The
only direct points of comparison between the two systems are age 18 or, turning the
measure around, and looking at the number of students who do not progress in full-
time education or training after 16.
t Second, we must check that the way in which data are presented do not change
over time. This is particularly important in terms of spatial units. Data, for instance,
are often presented according to electoral areas. These, however, change to reflect
changes in population distribution and even though the name may remain the
168 Research Methods for Education Part 2 The Process of Data Collection
1991 2005
In this comparison, there is only one common data organisations such as the National Health Service. This
class: local authority rented. In order to compare the sit- should be treated as analogous to local authority rented.
uation over time, we have to make assumptions. Regis- Since, in 2005, owner occupied and privately rented are
tered social landlords will include housing associations, combined, the same should be done for 1991. This is
so these can be treated as being in the same group. Other less than satisfactory but all we can do with this dataset.
public housing includes accommodation provided by The revised comparison is shown below.
1992 2005
(2) Apportioning classes schools in Alnwick at some point in the future. For
example, those aged three will enter in two years’ time.
Imagine that we want to project demand for primary This is quite a crude estimate because we are not tak-
education provision for Alnwick in Northumberland ing account of the month in which they were born.
and that we need to test our predictions against his- Our argument, however, is that if we test our model
torical data. For this we would need to know the num- against historical data, we will begin to see how reli-
ber of children in each year age group. This is available able our estimate is. However, when we go to the data
from the 2001 Census. Our model is that the number for 1991, we find that it only gives a per cent of chil-
of children in a pre-school age group will enter the dren aged 0–4. How do we resolve this problem?
▲
Chapter 4 DATA: ASSEMBLING THE RESEARCH TOOLKIT 169
The first stage is to convert the percentage aged 0–4 to an actual number:
1991 2001
The total population in 1991 is 30,081; 5.7% of bers of children will increase by 2.5% each year from
this population are aged 0–4. This is 1,715 chil- age 0–1 to age 3–4. We can express this weighting
dren. We then apportion this total to the four age as 1,000 for 0–1, 1,025 for 1–2, 1,050 for 2–3 and
groups. The simplest way is to put an equal number 1,075 for 3–4. We use four figures just to get rid of
(428 4 429) in each. However, this is a very crude the decimal points. This will produce the numbers
estimate. A better estimate might be to assume the for each age group shown in the table below. How-
birth rate in each of 1991, 1990, 1989 and 1988. To ever, this is an approximation because 2.5% a year
do this, we subtract the number of children aged 0–4 over 10 years compounds itself, so we have under-
in 2001 (1,274) from the number in 1991 (1,715). estimated the population in our base year of 1991.
The difference is 440. We express this difference as We could also improve our prediction by applying
a percentage of the 1991 population. The answer is infant mortality rates and migration effects to our
25.7%. Over 10 years, this is about 2.5% a year. The totals and adjust them accordingly but this is much
implication of this is that the weighting for the num- more complex.
1991 2001
Equal allocation Weighted allocation Actual numbers
Summary
Manipulating data is a skill all researchers should develop. It usually requires nothing more than a little logical
thought and some simple arithmetic manipulation.
same, boundaries may have shifted quite markedly. What can we do in this case?
If the boundary change over the period we are investigating represents only a small
proportion of the population and the groups included and excluded do not change
markedly the overall balance of the population, we are probably safe in turning
a blind eye to the problem and using the data as found. If not, then we have two
choices, either to construct our own estimate of what would be the case if the
boundaries had not changed, or to see if the data are presented in another spatial
format that is stable over time. This is an issue to which solutions are now being
developed. In England and Wales, for example, some data are now being presented
170 Research Methods for Education Part 2 The Process of Data Collection
in a format called Super Output Areas that are fixed over time, thus removing the
problem of boundaries that change. They have another advantage, as well. Electoral
divisions vary in size. The smallest, for example, can have as few as 100 residents
or as many as 30,000. This range can make comparison difficult. However, there is
another problem that researchers face with districts with the smallest populations –
the most recent data are not available because individuals could be identified. This
is unacceptable.
It is the advent of modern data management techniques that has enabled Government
geo-statisticians to create Super Output Areas. Three layers of Super Output Area are
being developed:
t a lower layer with a lower population of 1,000 and an average population of 1,500;
t a middle layer with a lower population of 5,000 and an average population of 7,200;
t an upper layer (that is not yet developed) with a lower population expected to be
about 25,000.
Scotland and Northern Ireland have equivalent structures called Data Zones and
Super Output Areas respectively, though in each case the population limits are different
from those in England and Wales. As an educational researcher, particularly if you are at
the beginning of your career, you should remain aware of developments in the spatial
organisation of data and ways of accessing such data.
that we look to see what others who might take a different stance have to say. If there
is something that underpins the data that we have access to (for example, a report
that has given rise to a press release), we would be advised to get hold of the report.
t Do we have to accept the arguments, explicit or implied, in secondary output? We
would be foolish to accept anything uncritically. Even academic papers that have
been peer reviewed before publication should not be accepted as being a definitive
statement. Remember what we learnt in Chapter 2 about influences on research
outlook. We would not expect a structuralist to interpret data in the same way as
a liberal educationalist. Sentiment, values, beliefs about what should be and self-
interest, all colour how people present their views.
t If processed data are potentially so corrupted, is there any value in looking at them
in the first place? Oh yes! We can use this sort of data to understand how decisions
are reached, how people are influenced and how opinions are formed. We can
extract information that helps us produce a timeline of how a situation evolved. The
very fact that what is made public is a partial view gives us potential evidence for
demonstrating attitudes, objectives, interests, values.
Secondary processed data are a rich resource; we just have to use them appropriately.
172 Research Methods for Education Part 2 The Process of Data Collection
Now we are going to look at examples of the actual secondary data that are available.
We shall look at the collection of primary data in Chapter 8. The numbers of data
sources that we can access via the Internet are truly breathtaking for the educational
researcher. We shall review, first of all, on-line resources relating to data within specific
countries. It is impossible, however, to deal with every country and we would be over-
whelmed if we were to try to deal with every data source of potential value to education
researchers. There are just too many to list, let alone describe. The implication of this is
that this and the following sections should be read in a particular way. Do not expect
to find definitive specifications. Do not expect to find answers to this sort of question:
‘What is the Web address for data on school enrolments in Denmark?’ There are several
reasons for this.
t First, national sites do not always have the same data. Just because the UK has data
on exclusions from school does not mean the same is true of every country.
t Second, websites change as governments restructure their executive frameworks. One
thing that seems to be happening is that government, at whatever level, is consolidat-
ing its data resources so that they can be accessed through one or just a few Web por-
tals. Different countries are at different points along this road and what exists today
is likely to be different in a few years’ time.
t And the last reason why we should not expect to find specific answers to ‘where is’
type questions is that I do not know the answers myself. If I need to know, I start
searching, and that is what every researcher has to do. The data we want is likely to be
stored somewhere.
So what can we expect from this and subsequent sections? The intention is to
help us appreciate the possibilities with Internet access to secondary data for educat-
ional researchers and the potential that they can realise. In this section we shall look at
data availability in the UK, Denmark, Finland, the Netherlands, Sweden and the USA.
(i) The UK
Devolution of power to Scotland, Wales and Northern Ireland has meant that while
some data are available for the country as a whole, other data are only available through
sub-national sites for England, Scotland, Wales and Northern Ireland. This is important
to know because education is a devolved responsibility to sub-national administrations.
In general, however, there is a common pattern to the data.
profiles for population, housing, the labour market and thematic analyses drawing out
regional comparisons for demography, education and training, labour market, housing,
health, lifestyle, crime, transport, environment and industry) and (c) maps. Not all of the
data can be accessed via the Web.
The website is, as might be expected, complex. Others might disagree and say that if
we want to understand what complexity is, we should look at the EU website. Certainly
the National Statistics site is nothing in comparison with the EU’s but it can be difficult
to find the data we want. There are four routes we can follow (see Figure 4.4).
t First, we can browse the site by theme. Amongst things of interest to educationalists
we can look at crime and justice, the economy, health care and social care, the labour
market, people and places and population as well as children, education and skills.
Each theme takes us to a sub-classification and at the next level there is a topic guide
with access to the data and links to other sources, topics and text materials.
t Second, we can access data through their location by following the ‘Regional’ route.
This gives access to a mix of geographical and topical sources, of which ‘Neighbour-
hood Statististics’ will be amongst the most useful. This is an outstanding route and
resource for educational researchers looking at issues in a local context. It gives us a
choice to look at a range of data for a locality or specific data for a locality. We can
choose the type of area for which we want the data, for example local authority or
ward or one of the new super output areas (areas that are defined so that they contain
174 Research Methods for Education Part 2 The Process of Data Collection
broadly the same numbers of people). The first option gives information for ‘Lower
Layer Super Output Areas’ (LLSO, the smallest spatial units). For the LLSO I live in
there are 304 separate tables. The population of this LLSO was 1,586 in 2001. This
gives some indication of the detailed level of analysis that we can undertake. Data can
also be outputted for middle layer super output areas, education authority, primary
care organisation, parliamentary constituency and parish. Not all information is yet
available for all areas. For one education authority, there were 339 data sets. The
range of data available to us is truly staggering.
t Third, we can search for data. A search for ‘pupil numbers’ produced 165 results. We
can narrow our search using the filters on the left. Filtering by local authority reduced
it to fourteen data sets.
t Fourth, and reflecting the fact that public data are not yet completely linked, we can
search via an external Web engine. A search for ‘pupil numbers gov uk’ via Google
brought up a range of key resources from the first hit.
The only way through, to appreciate what data are available, is to begin to use it.
Activity 4.4 begins this process.
Go to the home page of the Neighbourhood Stat- Deprivation. You do not have to understand (yet!)
istics website l (http://www.neighbourhood.statistics. how these indices are constructed. To guide you, look
gov.uk). Enter the search term ‘Derby’ and identify at the first one in the table, the average score for all
it as a local authority. Derby is in the East Midlands areas in the Borough.
of England. Go to the section for data on Education, There are, naturally, differences between the Boroughs.
Skills and Training and open the page for Key Figures. In 2010, 26% of Newham’s 16–24-year-olds were re-
Start by looking at the percentages of students at ceiving the Job Seeker’s Allowance and 32% of
Key Stage 4 with 5 or more GCSEs (the national Derby’s, while Derby had a teenage conception rate
qualifications at age 16) at A* to C. Now do the of 55.6 births per 1,000 in the age group and Newham
same for Newham Education Authority. The London one of 45.0 per 1,000. Overall Derby is the 88th most
Borough of Newham is right next to Tower Hamlets. deprived area in England and Newham the third. Go
What are the differences in the data? What do they back to the data on education and training. If you
suggest to you? Now look to see how different the investigate in more detail, you will see that Newham
two authorities are. Go to the Economic Deprivation is performing rather better than might be expected.
data set and look at the Benefits Data (per cent of You have now identified an issue. The question is,
age group 16–24 receiving the Allowance), and the ‘Why is this?’ Your task as a researcher is to answer it.
under 18 conception rates in the Healthcare data set
and the Local Authority summaries for the Indices of
Chapter 4 DATA: ASSEMBLING THE RESEARCH TOOLKIT 175
3. By publication type: this is reached via the ‘Data’ tab and will give us all of the publica-
tions of a particular type. Figure 4.5(b) shows that the principal classification is into
‘Data’, which gives a description of the data set and ‘Statistics’, which gives access
to the detail in the data sets. Note the tags on the left of the page which can be used
to select data relating to that topic. Clicking on the ‘Statistics’ tab takes us to a page
which allows us to filter data sets by category. The categories include publication
type, publication status, date and topic tags. There is an enormous wealth of data
available here to researchers, even down to the level of individual schools, which
should not be overlooked.
Organisation Website
data archives, see Case Study 4.3), has responsibility for managing longitudinal surveys
such as the National Child Development Survey. The Question Bank is another database
that stores surveys, not survey results. The Treasury has produced a superb review of
evaluation methodology in the Magenta Book. (The title was given to distinguish it from
the Green Book, which is the Treasury’s set of guidelines for the economic appraisal of
policy and projects.)
(ii) Denmark
The statistics Denmark website (www.dst.dk) is published, in part, in English. Data on a
range of subjects are available on-line and more detailed tables are available after regis-
tration (www.statbank.dk). Figure 4.6 shows some of the tables available. The means of
extracting the data is clear and straightforward. The extraction system allows us to pivot
tables (a technique for manipulating rows and columns to get different data analyses) as
well as outputting the data graphically. The range of data available without registration
is laid out in Table 4.5. Data on social conditions includes child maintenance, housing
and others that can be used to describe the social context for education.
Theme Tables
(iii) Finland
The following websites provide information about statistical data, and other informa-
tion about education in Finland:
1. Statistics Finland (www.tilastokeskus.fi/index_en.html or www.stat.fi/index_en. html).
This site describes the types of data held. For education follow link under Topics.
The site gives data held on schools, the autumn audit of schools and pupils, educa-
tional finance, student employment, teachers and staff and, usefully, there is an email
contact.
2. Ministry of Education (www.minedu.fi). This site describes educational policy and
summarises the character of the educational system.
education financing, school data, flows through the educational system and voca-
tional training), publications, methods and FAQs.
t By searching the Statline database (Figure 4.7(b), access via the figures link). This
route enables us to search by area of interest using keywords (a search on young
people’s leisure identified two tables) and by area.
(v) Sweden
The Statistics Sweden website (www.scb.se) is also available in English as well as
Swedish. The routes into the data are again by theme (in this case called subject areas)
and by searching the statistical database. The website currently gives access to more data
sets in English than the Dutch site. There are over 60 data sets for education and research
covering education of the population, financial aid, research, higher education and the
school system and day care. The search route takes us either to a listing of detailed data
sets or to a free word search. Access without registration limits data output to 1,000 data
cells. To get access to more data we have to register. Registration is free.
Figure 4.8(a) Data tools on the National Centre for Education website
(c) California
(www.ed-data.k12.ca.us)
California is presented as an illustration of data availability at state level. On this site
we can:
t Find information on any school, school district or administrative county. The search
can be by name or by zipcode. The data output includes a profile of the school,
performance monitoring, pupil performance and pupil withdrawal. There is a facility
to compare schools.
t Find profiles for the state, county, district and school.
182 Research Methods for Education Part 2 The Process of Data Collection
(d) Eurydice
Eurydice is an information network supported by the EU that makes available compar-
able data and information on education systems and policies in member states plus
Chapter 4 DATA: ASSEMBLING THE RESEARCH TOOLKIT 185
t data series: with comparative analyses of, for example, quality assessment, school ad-
ministration and governance, participation in education, educational funding, teacher
training, workload and teacher numbers in Key Data on Education in Europe (published
annually), plus publications on language teaching, higher education and ICT.
(e) Eurostat
Eurostat is the principal source for a wide range of social, economic and environm-
ental data (http://epp.eurostat.ec.europa.eu/portal/page/portal/eurostat/home). Edu-
cation and training data are found under population and social conditions. The web-
site has 14 data areas which function as indicators on education system performance
(see Figure 4.11).
(there are 21 for education) or examine data by topic (42 data sets for education, see
Figure 4.12). The Research tab takes us to a Data and Research page with a drop-down
menu. Selecting data by country takes us to an overview page and then we follow the
links to obtain data on the country or countries we are interested in. The World Bank
is introducing a new databank search framework. There is a tutorial to help you get
the most out of the system.
t The World Bank also has a webpage devoted purely to education data, EdStats (Home/
Topics, Education/Data and Statistics). This gives access to data on enrolments,
completion, learning outcomes, expenditure, staffing, and education stages. There is
even a Iphone/Ipad App to help you find data. What researchers will find particularly
useful is the mapping tool and bubblechart tool (which allows the comparative
analysis of three variables) to visualise data.
Review
The enormous scale of secondary data available to us over the Internet should, by now,
be apparent. We can use these data in the following ways:
t to set the context for an issue we are going to investigate at a detailed level using
primary data we correct ourselves;
t as a large data set to be analysed using statistical analysis to identify patterns and issues;
t to compare one area with another in order to identify and explain similarities and
differences.
188 Research Methods for Education Part 2 The Process of Data Collection
To use them, however, we have to feel confident – confident in our ability to find out
what is available; confident in being able to access the data; and confident in being able
to pick up the ‘messages’ of the data. Confidence comes with familiarity. Activity 4.5 will
help with this.
1. Select any national data set. Follow through the 3. See if you can answer the following questions:
links from the home page. In your own words de- t In which country, Australia, Denmark, the
scribe the structure of the website. Pay particular Netherlands, Sweden and the UK, have gradu-
attention to the way in which data can be accessed ate numbers increased most in absolute and
and the levels (school, education district/authority, percentage terms since 2000?
region/state, nation) at which data are presented. t Has the growth been more amongst men or
Is the same data presented at each level? women?
2. Search for data on the following topics in any t The phrase school life expectancy describes
national data set: how many years children and young people
t Performance of pupils at key points in their ed- can expect to spend in education based on
ucation. Is there any differentiation by gender, current trends. Use the UNESCO website to
school type, ethnicity? identify the school life expectancy for boys
t Absence/truanting from school. and girls from primary to secondary in the fol-
t People with qualifications. Can you differenti- lowing countries: Jordan, Yemen, China, Hong
ate between those with academic and those Kong, Sweden, the UK, India and Pakistan. Can
with vocational qualifications? you suggest explanations for any differences?
t Data on pupils with special educational needs. How has school life expectancy changed over
t Data on pre-school learning/early-years educa- time? How would you go about determining
tion and childcare; does this data give you any whether your explanations are valid?
appreciation of trends in provision over time?
t One other topic in which you have a particular
interest.
Summary
The purpose of this section was to make you more adventurous in your use of data.
However, one of its main messages is that you should treat your data with scepticism
until you are convinced that it is giving you legitimate answers.
Key points to take away from this section:
Further reading
O’Dochartaigh, N. and Sleeman, P. (2007) Internet on how to drill down to the information you need.
Research Skills, Sage, London. Niall O’Dochartaigh and Patricia Sleeman’s book
Stacey, A. and Stacey, A. (2004) Effective Information is also worth dipping into. Any book that begins
Retrieval from the Internet, Chandos, Oxford. with, ‘Why not just search Google?’ knows how to
The Web contains a wealth of data for the educational grab our attention. It is written by people who not
researcher. Alison and Adrian Stacey’s book will take only know the Web but also understand the research
you through the processes of how to find informa- process from personal experience. There is a very
tion and how to assess its quality and the quality of useful section on accessing discussion groups, news
the website that hosts it. They also offer useful tips and blogs.
References
Chang, L.C. and Krosnick, J.A. (2003) ‘Measuring the Schwarz, N. (2007) ‘Retrospective and Concurrent Self-
Frequency of Regular Behaviours: Comparing the Reports: The Rationale for Real-Time Data Capture’,
“Typical Week” to the “Past Week”’, Sociological in Stone, A.A., Schiffman, S.S., Atienza, A. and
Methodology, 33, 55–80. Nemling, L. (eds), The Science of Real-Time Data
Fisman, R. and Miguel, E. (2006) ‘Cultures of Capture: Self-Reports in Health Research, Oxford
Corruption: Evidence from Diplomatic Parking University Press, New York.
Tickets’, Working Paper 2312, National Bureau of Westbrooker, I. (1997) Simpson’s Paradox: An Example
Economic Research, Cambridge, MA. in a New Zealand Survey of Jury Composition, Statistics
Legendre, A. (2003) ‘Environmental Features Influencing New Zealand.
Toddlers’ Bioemotional Reactions in Day Care
Centres’, Environment & Behaviour, 35(4), 523–49.
Chapter contents
Introduction 191
Summary 221
Further reading 221
References 221
Chapter 5
Learning themes
t How to use other people’s research in your By the end of this section you will be able to:
work.
t Understand that literature research is an
t How other people’s research can be mis- integral part of your research programme
used. from beginning to end.
t How to set up a strategy for a literature t Appreciate ways in which literature can be
search. used in your research.
t Writing a literature review. t Be able to undertake a literature search
and write a literature review.
Introduction
There is often an assumption that the data Many texts on research methods restrict their
we assemble and collect ourselves is what re- guidance to what is called ‘the literature
search is all about. Well, it is certainly impor- review’. This section is much broader. A
tant, no one would deny that, but the work literature review is, without question, a key
that other people have done on and around part of anyone’s research. But to do justice to
our topic is important too. Here we shall look our research we need to embed it in the wider
at how we use other people’s research and research environment and to achieve this we
what we shall find is that staying in touch need to use other people’s work throughout
with the wider research field will occupy us as a foil to our own ideas and to highlight the
throughout the research process. significance of our approach and findings.
192 Research Methods for Education Part 2 The Process of Data Collection
ideas, new insights and new research directions. Being aware of what is being discussed
in education research should be part of every researcher’s ongoing activity because it is
an important input in identifying the research field.
Case Study 5.1 looks at stimulus from a different perspective. The study shows how
research findings can be used to improve educational practice, one of the goals of car-
rying out the research in the first instance. When you have read it, look at Activity 5.1.
194 Research Methods for Education Part 2 The Process of Data Collection
Work on this activity after you have read Case t How many times had you accessed journals elec-
Study 5.1. This shows how the results of educational tronically?
research were used to construct tools to measure t How many of the research seminars were you
how far children’s environments deliver the features aware of?
that the educational research says are important. We t Did you know that there are Internet discussion
are going to do something like this for your research lists?
environment. t If a good performance is to look through five jour-
nals, go on-line five times, be aware of 25% of the
meetings organised by the education department
Stage 1: The audit and to know that academic discussion lists exist,
t Go to your library and make a list of (a) all the edu- then judge your performance as excellent, good,
cation journals that are currently taken, (b) how fair or poor.
many education journals the institution subscribes
to on-line and (c) other journals that can provide
insights to education researchers (look especially Stage 3: The action plan
at journals in sociology, psychology, social work, If your personal assessment is that you did not do par-
criminology, urban studies). ticularly well, then you are relying on other people to
t Find out how many research seminars the educa- create a stimulating environment for you to work in.
tion department organised last year (or is organis- If this is the case, you need to take more responsibility.
ing for the current year).
t Search the Internet to find education discussion t Identify five journals (library and Internet) that
lists (these are services that enable people with you will browse every three or four months.
similar interests to communicate with each oth- t Identify one seminar or meeting to attend each term
er). As a start, in the UK, look at JISCmail (www. or semester. It does not have to be in education.
jiscmail.ac.uk). Stop when you reach 50, it will not t Join at least five discussion lists.
take long! At least 10% of your results should be
lists based in other countries.
more deeply into the research literature. Our focus is the overall approach and method-
ology as well as the detail of the methods. The question we need to ask is whether the
data are a reliable description of the situation. We need to look critically at what other
researchers have written. Just because something has been published does not mean that
it is beyond criticism. We should examine closely what approaches to data collection
and analysis other researchers have used for two reasons. First, to convince ourselves
that their conclusions are valid. If the methodology is inappropriate, the conclusions are
unwarranted. If the methods of data collection (for more on this, see Chapters 6, 8 and
9) are not robust, then the conclusions should be hedged with caveats. If the analysis
does not probe the complexity of an issue, then the level of understanding revealed is
likely to be simplistic. We should not skip over the section that deals with methodology,
otherwise we may be badly misled by the research.
The second reason why we should look closely at what researchers say about their re-
search approach is to see if particular methodologies are preferred (and why). We should
look to see what works where. We do not need to break new ground if other people have
done some of the work for us already.
(a) Ignorance
The first way in which people can mislead other researchers is to ignore (or be genuinely
unaware of) relevant work. Leaving information out can distort analysis as much as put-
ting false information in and if it is done to benefit one’s own work, then it is fraudulent
and unethical. Being unaware of research findings is certainly not fraudulent but it is
an issue for every researcher. The more interests someone has, the more of a problem
it becomes. How can we stay abreast of a rapidly expanding field, especially now with
the growth in the number of research journals and the fact that the Internet has made
published work so much more readily available? There are strategies that we can adopt.
r Each time we start a research project we should ensure that we scope the field in a
methodical way. A process for doing this is outlined in section 5.3.
r We can identify the key journals in a field and make sure that we read contents pages
of each edition. It is even easier now that we can sign up for email alerts from pub-
lishers on nominated topics.
r Another option is to read a review journal, that is, one whose purpose is to synthe-
sise research in a field rather than to publish the outcomes of a research project. The
Chapter 5 USING LITERATURE IN RESEARCH 197
Review of Educational Research and the Review of Research in Education both do this. The
Education Review (USA) is devoted to book reviews.
r Finally, when the number of publications we have to read becomes overwhelming,
we have to specialise.
Despite these actions, we may still miss important contributions to our field. All we
can do is have them drawn to our attention. If we are part of a research group, this is
more likely to happen.
(b) Misunderstanding
The second way we can mislead others is to misrepresent research because we misunder-
stand it. Let us look at an instance of this that has been brought to light. There is real con-
cern in both North America and Europe about the number of teachers teaching subjects for
which they are not qualified. There are many worries: that the teachers might misunder-
stand a concept or explanation and teach their students something that is factually wrong;
that the teachers might rely too much on teaching aids like textbooks so that lessons be-
come predictable and dull; that the teachers lack the background to communicate the exci-
tement of a subject. This concern is particularly important, for example, in mathematics,
sciences and modern languages where there are often shortages of subject specialists. Some
researchers have argued that being taught by a non-specialist in the sciences leads to lower
levels of attainment, a fall in the numbers of students choosing these subjects at university
and a decline in the skill base that underpins innovation. An American researcher, Ingersoll
(1999) looked at the issue in the USA. His research showed that being taught by a non-
specialist was widespread. At the time of his study a third of all secondary mathematics
teachers had no background in mathematics, a quarter of secondary English teachers had
no background in the subject and about 20% of secondary science teachers lacked an appro-
priate background in the subject they were teaching. What is significant for us though is that
he showed that the assumptions people made about causes were largely incorrect. In other
words, other researchers had misunderstood the real nature of the problem. He was able to
show that the assumptions that the situation was due to (a) teacher unions who sought to
ensure that established long-serving teachers retained their jobs even if their professional
profiles did not meet local needs and (b) teacher shortages (where there are not enough
teachers in particular subjects) were incorrect. His analysis suggested that it would be more
fruitful to explore the organisation and management of schools and the staffing decisions
taken by principals as being more significant in explaining the prevalence of the problem.
Misunderstanding can occur when some of the participants in an exchange of views
are not experts. We might, for example, be likely to have misunderstandings in relation
to discussions of ethnicity, religion and multi-culturalism where some people argue on
the basis of research evidence and others on the basis of personal experience. We might
also expect to find misunderstanding in comparative education research, especially when
the comparisons are across national boundaries, when people fail to appreciate the nu-
ances of culture or the realities of the local political and organisational structure. Some-
times people claim that others misunderstand a situation when the reality is that each
has a different understanding that is the result of a different philosophical standpoint.
Another American educationalist, Stephen Friedman (2000), took a contrary view to
Ingersoll and argued that knowing how to teach was more important than having a good
knowledge of the subject. Ingersoll replied to Friedman, (Ingersoll, 2001), pointing out
how he felt Friedman had misunderstood not just Ingersoll’s argument, but the nature
of the issue itself. Wherever we stand on this, both viewpoints owe as much to belief
198 Research Methods for Education Part 2 The Process of Data Collection
and commitment to a point of view as they do to evidence. This highlights the need for
researchers to be aware of and to understand the philosophical and personal perspectives
that can create these ‘misunderstandings’; they can be important.
(d) Misrepresentation
The final way in which work can be misused is to misrepresent the findings or the process.
This is a far more serious issue in the world of professional and academic research than
misunderstanding the implications of someone else’s research or being ignorant of it.
Misrepresentation of someone else’s work is far beyond being what the academic
community would accept as being a legitimate critique and when someone is accused
of misrepresentation, it is a serious matter. Because, for some people, a philosophical
position is important in determining what is ‘right’ and what is ‘wrong’ in education, the
purpose of an argument is to stack up the evidence to support the ‘right’ conclusion. In
these circumstances, truth can become a casualty.
Chapter 5 USING LITERATURE IN RESEARCH 199
The purpose of this activity is to show you how to 6. Search the literature to see who else has pub-
follow up reference (citation) linkages and to assess lished in the area. Have any significant papers (es-
how far the references represent a balanced picture pecially those by authors in other countries) not
of research. been referenced?
1. Identify an author who has written more than Note: there are suggestions on ways of searching the
three journal articles and select any one article literature in a field in section 5.4.
appropriate to your research or to your studies. After you have done this, ask if there is any evidence
List all the references. of mutual citation (where authors cite each other)
2. Identify other articles written by the same author. and, if there is, is it warranted? Look to see if any
List the references and note mutual citations of work has been ignored. If so, ask why. Did it reach
other authors. a different conclusion? How far would you say that
3. Identify papers written by authors who have there is a group of researchers who are working to-
been referenced more than once. Look at the ref- gether? If there is evidence, does it represent a ma-
erences in these papers and identify if any cite (a) lign influence pushing a particular perspective, or
the first author or (b) other authors cited by the merely that the pool of researchers is small and know
first author. (and value) each other’s work?
4. It may help if you set up a matrix to show the
number of co-citations.
5. Follow the linkages between authors as far as you
can and judge the extent to which authors co-cite
each other.
Author#2
Author#3
Author#4
Author#5
A case of misrepresentation was alleged in the USA recently. The background is the
genuine concern, in most economically advanced countries, about the competence of
children from lower socio-economic backgrounds in the basic skills that enable people
to function adequately in society. In particular, there has been concern over reading
skills. Not surprisingly educational research that seeks to assess the effectiveness of strat-
egies for teaching reading is of more than academic interest. Because of this, proponents
of different approaches to teaching can engage in furious and at times acrimonious ex-
changes. Nowhere was this more true than in the USA where, as in other parts of the
world, the lines of conflict were drawn between those who argued for phonics as an
approach to teaching, versus others. In one exchange between authors, things came to
a head in 2000 with the publication of a paper titled ‘Misrepresentation of Research by
Other Researchers’.
While the issue erupted in 2000, it had clearly been simmering for some time before. On
the one hand was Barbara Taylor, Director of the Minnesota Center for Reading Research,
and others who take the view that teaching reading requires an integrated strategy that pays
regard to the context of the student as well as the teaching strategy. On the other hand was
Barbara Foorman, Director of the Center for Academic and Reading Skills at the University
200 Research Methods for Education Part 2 The Process of Data Collection
of Texas Health Science Center, Houston and others who published an academic paper in
1998 that showed the importance of phonics in an integrated approach to teaching read-
ing. The accusation of misrepresentation was made by Foorman et al. (2000) in response
to a paper in the journal Educational Researcher by Taylor et al. (2000). Foorman et al.’s
paper is hard-hitting. The language is certainly strong with claims, for example, of the dis-
tortion of facts. Table 5.1 sets out some of the claims of misrepresentation that Foorman
et al. make. The evidence presented in Table 5.1 does not do full justice to the claims of
misrepresentation by Foorman, nor does it adequately represent the views of Taylor or any
of the others who are criticised by Foorman. What it does do, however, is highlight some
of the issues that Foorman and her co-authors claim constitute misrepresentation. In the
Taylor et al. criticize us for a ‘simple solution to the We have never maintained that a single solution
complex problem of raising the literacy of young exists to this problem, let alone a ‘simple solution’.
children in high-poverty neighbourhoods’.
Researchers investigating beginning reading should We agree with this, which is why our discussion sec-
exercise extra caution to delimit findings from their tion cited nine specific limitations of the study.
own studies.
Taylor et al. criticize us for presenting results prior Their critique was posted on a public website 10
to publication. months before publication. We also note that some
members of B. Taylor et al. discussed their critique
with a reporter long before we were aware of its
existence. The reporter believed a scandal had been
unearthed, but eventually decided there was nothing
to write about. In fact, the results of the Foorman,
Francis, et al. (1998) study were presented at two
invited symposia prior to publication. In retrospect,
the order of events may appear unfortunate. But the
reality is that in virtually all areas of research, includ-
ing education, presentation prior to publication is the
norm.
In contrast to the implications of B. Taylor et al., We have responded in writing and by personal com-
(this can be inferred to mean that Foorman et al. munication to numerous misrepresentations of our
did not respond to criticisms of their work). research by the media, by policy-orientated organiza-
tions, and by reading professionals. For example,…
we sent Gerry Coles a disk containing our data. Coles
proceeded to selectively drop classrooms in our data-
set so that he could write his essay (Coles, 1999). (See
also Coles, 2000.)
When the authors of this widely publicised study This article is a critique of commercial reading pro-
use their results as the basis for promoting specific grams and the focus of policymakers on these types of
commercial programs…, they contribute to the programs.
impression that students’ reading problems will be
solved if a school simply buys the right program.
B. Taylor et al. imply that the Foorman, Francis, There is considerable evidence that explicit teaching
et al. (1998) study is to blame for 67 phonics bills: of the alphabetic principle is a necessary (but not suf-
‘Since 1996, when information about the Foorman, ficient) component of reading instruction for children.
Francis, et al. study was first publicized, 67 bills to If Foorman, Francis, et al. (1998) never existed, the
make phonics the law have been proposed in states pendulum in the reading wars would still be swinging
around the country.’ towards the phonics side.
Chapter 5 USING LITERATURE IN RESEARCH 201
left column is the claim of misrepresentation paraphrased or quoted from Taylor et al.’s
paper and in the right column Foorman et al.’s rebuttal of that claim. Some of the issues
are methodological, some a matter of principle, some a matter of interpretation and some
a matter of research practice. However, the most serious assertion by Foorman et al. is that
their opponents ‘have taken part of the puzzle and blown it out of proportion to fit their
story’ (p. 33). This gets to the heart of the issue, and is a claim that Taylor et al. have written
a paper that is more concerned with defending their position than anything else.
This is, as might be expected, a complex issue but there is no getting away from the
fact that, tied up with the dispute, are philosophies of education, reputations and, in all
likelihood, access to opportunities. People come to evaluation and criticism of other
people’s research with their own ‘stories’. In this particular case, we must all make our
own minds up on whether the claims of misrepresentation are justified (and the refer-
ences are there to be read) but whenever there are debates in the pages of an academic
journal, we should look to see how far criticism is balanced and fair minded and how far
it is designed to support a position.
202 Research Methods for Education Part 2 The Process of Data Collection
The first parts of this section have looked at the big picture in getting to grips with research
literature; first how we can use this literature in our own work and, second, how to read
research literature to make sure that we can make the most important critical judgement
– whether it is accurate, honest and truthful in its representation of the field. In this next
part, we move on to the process of gathering the literature together. Broadly there are two
approaches to this. On the one hand, there is following our noses and benefiting (or not)
from serendipity. On the other hand, we can take a more structured approach, which
downplays the element of luck. We will follow this approach! The elements in this struc-
tured approach are to:
r map the issue;
r use questions to frame and direct the search process;
r know the sources;
r select search options;
r follow leads;
r review the outcomes.
guide our research but a framework to test the existence of research literature that deals
with the dimensions we have identified. Case Study 5.2 gives an example of this initial
brainstorming process.
Accommodation
Accommodation
bureau
Student
Assessment Language capability
support
Student
Loneliness induction
Classroom Coping
experience academically
Academic
support
Disciplinary
Racism
action
Mapping the issue, however, can move beyond producing a list of factors and influ-
ences. We can set out how we think these factors relate to each other, which one is a
direct influence and which one a secondary influence, what happens first in a process
and what happens later. It is at this point that we move beyond guiding our search for
appropriate literature to shaping a model that can explain a situation. We then use this
to help direct our literature search.
Figure 5.1 illustrates this and takes our example of researching the experience of
overseas students (Case Study 5.2) a little further. This diagram looks specifically at
some issues that a foreign student in higher education might have. In the centre of
the diagram are some concerns or situations that might be experienced (for instance,
racism or accommodation problems). On the right are areas of university action and
responsibility and on the left the tutor’s responsibility for teaching and learning. What
we are trying to do here is understand how student issues are recognised and managed.
Thus the university accommodation bureau is there to allocate accommodation and
resolve problems. The bureau implements university policy – is it better to house all
foreign students together or to mix them with home students? The former allows bet-
ter provision of support services but can create a ‘ghetto’. In either case, there may be
incidents of racism. Is there a procedure for dealing with this? The university might
recognise the role of an induction programme for foreign students in helping them es-
tablish friendships but what if a tutor sees that a student is isolated in a class? We need
to see if the link between the academic and support side of the university is strong. The
same link needs to exist if a student is not coping well academically. If it is a matter of
just understanding the way teaching and learning works, this could be dealt with in in-
duction but if it is language, language support should be provided. We could produce
another map, this time with the tutor in the centre of the diagram. Its focus might be
Chapter 5 USING LITERATURE IN RESEARCH 205
a tutor who is unwilling to change to meet the needs of foreign students. What linkage
map can you construct for this situation?
(b) Databases
The second way of accessing literature is via databases (see Table 5.3). These bring us
closer to the documentary and other sources we want. Some databases are public (for
example, the British Library’s Inside and the US Government’s ERIC), while others are
Chapter 5 USING LITERATURE IN RESEARCH 207
private (for example, Ingenta and Jstor). Some are free, some are subscription and a few
have a free trial. In general, the public and publicly supported databases (that is, those
deriving the bulk of their funding from state finance) provide access to a wide range of
literature while the private databases (that is, those that are privately financed) only con-
tain material from journals produced by specific publishers. Those that are subscription
services may be available through an institutional subscription (check with your own
library or information service).
For some researchers it may be worthwhile to purchase a copy of an on-line
bibliographic search engine. These can speed up the research process quite significantly.
They are designed to search many of the databases listed in Table 5.3 (and more besides)
as well as major library sites (such as the Library of Congress in the USA). They have the
functionality to import references, receive alerts from data sources and create a library
208 Research Methods for Education Part 2 The Process of Data Collection
Biblioscope www.biblioscope.com
Bookends Plus www.sonnysoftware.com
End note www.endnote.com
Procite www.adeptscience.co.uk
Refworks www.refworks.com
of resources downloaded on-line or from scans. They also enable users to write notes on
the record. Their value is not limited to accessing and storing references. When it comes
to writing the literature review (or any other text), they can set the reference into format
required. Accurate citation is particularly important and this software will save many
hours searching for missing commas, full stops and extra spaces. Table 5.4 lists some of
the principal software.
(c) Reviews
The third way of getting to the literature we want is through pre-existing reviews. If these
exist, we are fortunate but it does not mean that we should not continue to look for other
published material; there is no guarantee, after all, that the review is comprehensive and,
certainly, there may well be other work published afterwards. What existing reviews do
provide, however, is a picture of the main research avenues. These might give structure
to our own search, analysis and review. The following are useful sources for educational
researchers.
r Review journals: these are journals devoted to reviewing, summarising and com-
menting on areas of research. They do not publish original research studies. While
they can identify new directions for research, their principal function is to present an
overview so that academics and research professionals can take stock and make judge-
ments about future progress. In looking at review articles, we need to ask ourselves
these sorts of question: Does what we know add up to a convincing explanation?
Where are the gaps in our understanding? Does this method work? Can we define
good practice? Is this approach producing the results we expected? The main review
journals in education are both American and produced by the American Education
Research Association. Both tend to reference research published in North America
more than research from other areas. This is something that we should be aware of
and rectify in our own investigations. The Review of Educational Research is published
quarterly. Its articles are often quite narrowly focused. In June 2007, for example,
Peter Stevens published an article reviewing the literature on race, ethnicity and in-
equality in English secondary schools (Stevens, 2007). The articles frequently have a
pronounced critical dimension that reflects the values of the author. Schutz’s article
‘Home Is a Prison in the Global City: The Tragic Failure of School-Based Community
Engagement Strategies’ (Schutz, 2006) clearly falls into this category. Because of this
critical dimension, there can be subsequent commentaries on articles from people
with a different perspective. One instance of this concerns schema theory. The con-
cept of schema can be traced back to Piaget but has subsequently been developed
Chapter 5 USING LITERATURE IN RESEARCH 209
r UK: BERA publishes the British Educational Research Journal (BERJ) plus Research Intelli-
gence, which has some short methodological articles, and has an Occasional Paper series.
r Netherlands: The Netherlands and Flemish Educational Research Associations pub-
lish Pedagogische Studiën.
r Sweden, Denmark, Norway, Finland: The countries of Scandinavia publish the Scan-
dinavian Journal of Educational Research.
r USA: The American Educational Research Association (AERA) publishes four jour-
nals. The American Educational Research Journal publishes papers of social, institution-
al and pedagogic interest. Educational Evaluation and Policy Analysis publishes papers
on research for policy and research about policy. Educational Researcher contains some
Chapter 5 USING LITERATURE IN RESEARCH 211
general papers but because it reports news about the association it has a ‘newsletter’
feel. The Journal of Educational and Behavioural Statistics is strongly quantitative in its
methodology.
r Europe: The European Educational Research Journal has a focus of the developing ‘Eu-
ropeanisation’ of educational policies and practice as a result of EU policy initiatives
agreed by member governments.
As well as these journals, there are many others devoted to a particular topic, such as
early years, mathematics education, work-based learning and so on. They are too numer-
ous to mention here. It is also worth noting that there are a growing number of free-
to-view open access on-line journals. The American Educational Research Association
publishes a list at http://www.ergobservatory.info/ejdirectory.html.
r We can select a number range in our search by separating the limiting numbers
with two dots. Thus ‘foreign students’ near problems 2000..2007 will only select doc-
uments including articles published within the year range 2000 to 2007. Using
‘near’ means that the phrase ‘foreign students’ and the word ‘problems’ only have
to be near each other, not next to each other.
r We can suppress a word by putting – between two words (the latter is not includ-
ed), ‘foreign students’ – fees will omit references where fees are linked with foreign
students. We can also include words by putting a sign in between words, so
‘foreign students’ fees will include references to papers including the two.
r Putting and, or, and not between two words will find paragraphs of text on web-
sites containing the requested combinations.
r The phrase allintitle: will find titles of articles, books and anything else held in
the database (such as reports) containing the requested word or phrase; allintitle:
foreign students problems will find titles containing the phrase ‘foreign students’
plus problems, issues, concerns, worries, etc.
r Alternative searches: the types of search considered so far are called Boolean searches
(named after George Boole, a self-taught nineteenth century mathematician who de-
veloped a mathematical system for expressing logical expressions, a system that is
now referred to as Boolean Algebra). Some websites (such as the US National Crimi-
nal Justice Service, https://www.ncjrs.gov/) permit other forms of searching such as
pattern and concept searching. These simplify the process of including some of the
operators identified above. A pattern search will look for words with similar spelling
and a concept search for associated concepts.
Searching the Web is a skill that all researchers should develop if they want to make
use of the rich resources that are available. More guidance is usually available on the
‘help’ pages of a search engine’s website.
Citation chain
A3 C
author self-cites a paper that is cited in paper A2 by the first author. You should now be
able to work out how the third author fits in. This sort of diagram is important in picking
up the relationships between authors.
Following links in these ways can be very productive. We have, however, to be strict
with ourselves and recognise when we are straying outside the field of greatest relevance
to us. This type of search can easily spiral out of control. A further point to note is
that different search engines are more appropriate for different academic disciplines and
themes. Any search engine is only as good as its database and Google Scholar, for in-
stance, like many, has a strong core of scientific work.
r Are the references in well-regarded publications and have they been anonymously
refereed?
r Is the topic thought to be important by other researchers?
Negative answers to these questions do not necessarily mean that we should change
our research topic but they should cause us to reflect on whether other people are seeing
a blind alley where we see a research project. However, negative evaluations might also
mean that we have a search strategy that is flawed and that we need to revise it or to start
again. In either case, it is better to find out sooner rather than later.
Having begun to identify relevant literature, the next step is to start to shape it into a lit-
erature review appropriate to the needs of the research. A literature review is the means by
which we tie our research intentions and outcomes to what other people working in the
field have done. It is central to most research projects, from an undergraduate dissertation
to a paper submitted to an academic journal. Often, by those just starting out on their re-
search careers, the literature search is poorly done and poorly written up. At one extreme,
researchers set their sights too low and just record what is written in textbooks. While it
is acceptable to use these as a source, they are usually several years (at best) behind the
research and may not deal with the material in a way that is appropriate to the research
topic. At the other extreme some people think that it has to be a record of virtually every-
thing written on the topic. Unless everything that is written is relevant, this shows a lack
of judgement in selecting work and themes to highlight. Some people think that only
material produced in their own country is relevant and many seem to believe that it is a
free-standing section in the research report rather than an integral part of the argument.
One function of the literature search is to demonstrate the need for the research.
Now that we know what it should not be, we need to find out how to do it properly
and to use it to not only improve the quality of our research but also our standing as
researchers.
… research has found substantial evidence of sig- Establishes what we know and justifies claims with
nificant variations in students’ achievement scores 11 references.
across primary schools and classes…as well as across
secondary schools and classes… (pp. 419–420)
On the other hand, only a few follow-up studies Establishes a weakness in the research position.
have been undertaken concerning the continuing
effects of schools and classes (p. 420)
Such research is important because it could… shed Makes a claim for research. Following text establishes
new light on educational practice and educational that conclusions on the long-term effect are mixed
impact (p. 420) and, by implication, establishes a need for more work.
This article addresses the effect of Flemish primary Outlines the research agenda.
schools and classes/teachers upon language and
mathematics achievement at the end of primary
schooling (short-term effects) and during the first
two years of secondary education (long-term
effect). (p. 421)
The authors concluded that while differences between primary schools and classes have a short-term effect, this
is eroded when pupils transfer to secondary school.
situation, or they can show how effective the outcomes of interventions are. It is the
ability to link these levels of analysis and types of literature that shows not only com-
mand of the field but also the ability to develop and sustain an argument.
r Believing that all academic prose is good: Academics write in order to bring their
ideas to a wider audience but some also want to demonstrate to that audience that
they are leaders in the field. One way of doing this is to write in such a way that read-
ers, finding the ideas complex and difficult to understand, believe that the author
must be better than they are. If we do this in our research then we run the risk of
producing text that is unintelligible and/or pretentious and of substituting jargon
for prose. What is more, if we copy the style without understanding the meaning, we
can make complete fools of ourselves. Academic writing should always be accessible.
Of course, the intended audience shapes the accessibility but if an academic finds it
difficult to understand another academic’s writing, then there are only two reasons
for it, either the reader is stupid or the author is pretentious. For anyone, even those
carrying out their first research enquiry, the academic literature should be accessible.
Look at Case Study 5.4 for examples of what to avoid.
It thus relatavises discourse not just to form – that Much of the writing that has been criticised comes
familiar perversion of the modernist; nor to authorial from the word processors of academics exploring
invention – that conceit of the romantics; nor to a their subjects through the concepts and frameworks
foundational world beyond discourse – that desperate of critical theory. In 2003 the critical theorists hit
grasping for a separate reality of the mystic and back with the publication of Just Being Difficult? (Cul-
scientist alike; nor even to history and ideology – len and Lamb) and claimed the need for a specialist
those refuges of the hermeneuticist; nor even less language through which to communicate their analy-
to language – that hypostecised abstraction of the ses, arguing that other subjects, such as computer sci-
linguist; nor, ultimately, even to discourse – that ence and medicine, were not stigmatised by the use of
Nietzchean playground of world-lost signifiers of such language. However, there is a difference between
the structuralist and grammatologist, but to all or a technical vocabulary and an obfuscatory language
none of these, for it is anarchic, though not for the that excludes those who do not know or understand
sake of anarchy but because it refuses to become a the codes. Our research findings should be accessible
fetishised object among objects – to be dismantled, and inclusive!
confidence in their claims. Authors may make assertions without supporting evi-
dence. We should also be watchful for authors who support their own methodology
by quoting others who have used the same methodology rather than justifying the
research approach in terms of research principles. Unless there are clear principles
guiding the approach, all this does is create a self-referencing structure without any
foundations or substance. If several studies come to different conclusions we should
seek to explain why. Criticism, however, is not only negative. We should acknowl-
edge those who break new ground and create new agendas. We should highlight
those whose methodology is beyond reproach (and show how it has influenced our
own approach). In order to show our judgements we should raise our game in terms
of how we express our ideas. ‘Good’ and ‘bad’ are hackneyed and carry no weight in
a sentence. We should use words like ‘influential’; ‘outstanding’; ‘limited’; ‘frankly
disappointing’; ‘interesting but… flawed, limited, etc.’. It is our range of vocabulary
and flexibility and strength of expression that convey colour, provide richness and
allow subtlety of view to come through. The words we use represent not just our
insights but also our interest and excitement within the topic.
or resolution of the issue? Following through these questions in the literature allows
us to represent stakeholder perspectives.
our project, we will almost certainly have to revise it for the finished report in order to bring
out the connections. The review section should build on the identification of the issue in
the introduction to the research report and tease out themes that lead to the specification of
research questions. Sometimes these themes identify no need for further research but some
will have to support the research questions we pose. The review, in particular, should hold
the interest of readers and propel them to the same conclusions that we reach. A good piece
of advice is not to fill their hearts with foreboding by calling the section ‘Literature Review’.
This sounds dull to start with. A report of research should be interesting, so give it a title.
Summary
t The literature review should inform all parts of a thesis, dissertation or research
report.
t Literature survey is an ongoing process throughout the research.
t We should be sceptical about published work, even work published in refereed jour-
nals, and we should assure ourselves that it represents an honest statement of the topic.
t The literature survey is more effective if guided by a pre-established strategy.
t On-line resources ease the process of identifying references.
t The literature review is the means by which we establish the need for the research.
t Literature review is more than a description of the literature.
t The books, articles and reports we review constitute the evidence we use to make a
case for our research.
t The review should be focused, not a compendium.
t We should write the review so that it reads like literature and not a piece of dull
academic prose.
Further reading
Fink, A. (2005) Conducting Research Literature Reviews: Arlene Fink’s book may fool you into thinking that it is
From the Internet to Paper, Sage, Thousand Oaks, Calif. just a slimmed-down version of Chris Hart’s or just
Hart, C. (2004) Doing a Literature Search: A Comprehen- another book on research methods (it has sections on
sive Guide for the Social Sciences, Sage, London. sampling, variables and measurement scales) but it is
Chris Hart gives sound advice on managing searches and quite different. Its focus is on reviewing literature and
how to find materials. Though the passage of time everything in the book is introduced to help us reach
makes some of his sources look outdated, he still gives judgements about the quality of what we read.
a clear introduction to a wide range.
References
Coles, G. (1999) ‘No End to the Literacy Debate’ Cullen, J. and Lamb, K. (eds) (2003) Just Being Difficult?
[commentary], Education Week, 27 January, 55. Academic Writing in the Public Arena, Stanford Univer-
Coles, G. (2000) Misreading Reading: The Bad Science sity Press.
That Hurts Children, Heinemann, Portsmouth, NH. Foorman, B.R., Francis, D.J., Fletcher, J.M., Schatschneider,
C. and Mehta, P. (1998) ‘The Role of Instruction
222 Research Methods for Education Part 2 The Process of Data Collection
in Learning to Read: Preventing Reading Failure in McVee, M.B., Dunsmore, K. and Gavelek, J.R. (2005)
At-risk Children’, Journal of Educational Psychology, 90, ‘Schema Theory Revisited’, Review of Educational
37–55. Research, 75(4), 531–566.
Foorman, B.R., Fletcher, J.M., Francis, D.J. and Schatsch- McVee, M.B., Dunsmore, K. and Gavelek, J.R. (2007)
neider, C. (2000) ‘Response: Misrepresentation ‘Considerations of the Social, Individual, and
of Research by Other Researchers’, Educational Embodied: A Response to Comments on “Schema
Researcher, 29(6), 27–37. Theory Revisited”’, Review of Educational Research,
Friedman, S.J. (2000) ‘Research News and Comment: 77(2), 245–248.
How Much of a Problem? A Reply to Ingersoll’s “The Pustjens, H., Van de Gaier, E., Van Damme, J., Onghena,
Problem of Underqualified Teachers in American P. and Van Landeghem, G. (2007) ‘The Short-term
Secondary Schools”’, Educational Researcher, 29(5), and the Long-term Effect of Primary Schools and
18–20. Classes on Mathematics and Language Achievement
Gredler, M.E. (2007) ‘Of Cabbages and Kings: Con- Scores’, British Educational Research Journal, 33(3),
cepts and Inferences Curiously Attributed to Lev 419–440.
Vygotsky’ [commentary on McVee, Dunsmore and Schutz, A. (2006) ‘Home is a Prison in the Global City:
Gavelek, 2005], Review of Educational Research, 77(2), The Tragic Failure of School-based Community
233–238. Engagement Strategies’, Review of Educational Research,
Ingersoll, R.M. (1999) ‘The Problem of Underqualified 76(4), 691–743.
Teachers in American Secondary Schools’, Educational Stevens, P.A.J. (2007) ‘Researching Race/Ethnicity and
Researcher, 28(2), 26–37. Educational Inequality in English Secondary Schools:
Ingersoll, R.M. (2001) ‘Rejoinder: Misunderstanding A Critical Review of the Research Literature Between
the Problem of Out-of-field Teaching’, Education 1980 and 2005’, Review of Educational Research, 77(2),
Researcher, 30(1), 21–22. 47–185.
Krasny, K.A., Sadoski, M. and Paivio, A. (2007) ‘Unwar- Taylor, B.M., Anderson, R.C., Au, K.H. and Raphael, T.E.
ranted Return: A Response to McVee, Dunsmore and (2000) ‘Discretion in the Translation of Research to
Gavelek’s (2005) “Schema Theory Revisited”’, Review Policy: A Case from Beginning Reading’, Educational
of Educational Research, 77(2), 239–244. Researcher, 29(6), 16–26.
Limlingan, M.C. (2011) ‘On the Right Track: Measuring
Early Childhood Development Program Quality In-
ternationally’, Current Issues in Comparative Education,
14(1), 38–47.
This page intentionally left bank
Chapter contents
Introducton 225
Summary 266
Further reading 266
References 266
Chapter 6
Learning themes
t The right data sources and the right t Modification of sampling procedures to
sample lay the foundations for good meet project needs.
research.
By the end of this section you will be able to:
t Choosing primary data sources.
t Determine the quality of your data sources.
t Why we sample. t Understand the principles of sampling and
t The differences between different know when and how to use a sampling
sampling approaches and how to apply procedure.
them. t Determine sample size.
t How to determine sample size. t Take action to prevent potential problems
t What can go wrong with a sample. with sampling your research design.
Introduction
Chapter 4 has given us a better understanding someone has already collected it for us (and this
of what education researchers mean by data. is called secondary data). Our job as researchers
The focus there was on secondary data; here is to make data answer our research question.
we shall be concerned with primary data To do this we have to extract and exploit the
and understanding more about what we call information that is contained in it. There are
information. To draw a distinction between four steps to this:
data and information may seem a little like nit-
t First, we have to be sure that we get data
picking but it is an appropriate one to make.
from the right source. This is what we shall
While it probably matters little in terms of
consider in this section.
actually doing research, it is more significant in
terms of understanding the process of doing t Second, we have to be sure that we get
research. good quality information from our sources.
As we saw in Chapter 4, we either collect We shall look at this in Chapters 8 and 9
data ourselves (this is called primary data) or when we consider survey techniques.
226 Research Methods for Education Part 2 The Process of Data Collection
t Third, we have to process the data to ex- t Fourth, we have to use the information to
tract the information from it that we want. create the evidence that we use to argue
This is the content of Chapters 10 to 13. our case. We will look at this in Chapter 13.
The foundation of any research is to get data from the right source and to ensure the
quality of the data obtained. A simple example will illustrate the point. We can have the
best questionnaire in the world on the effect of bullying on a child’s life at home and
out of school but if the parents who receive it do not have any children who are being
bullied, we will not get the data we need. The converse is equally true. If we give a poor
questionnaire to the right people, our data will be poor. In each of these two cases, the
informational content in the data is poor and no matter what processes we use to extract
and amplify the information, it will always be poor. In this section we shall identify
criteria against which we can judge the appropriateness of our data. We have to recognise
though that delivering quality comes with costs attached. The implication of this is that
we have to make a judgement about the trade-off between data quality and cost. We have
therefore to look at the resources available to us so that our decisions about research
procedure are made not in a theoretical but a practical, real-world context. However,
central to the whole process of getting good quality data is sampling. This is what will
concern us most of all. To see how important sampling is, read Case Study 6.1.
Knowledge
Think, for a moment, of how you acquire knowledge; you will probably come up with
a list like this:
from direct experience;
from others (either by being told or by reading); or
through the media.
A source that has direct experience of something is more valuable to us as researchers than
someone who has been told about it or read about it in a newspaper, so this immediately
establishes a hierarchy of quality in sources that we should look for. Direct experience
can take the form of a personal written account or a report but if our investigation is
into something that took place in the past and personal records are not available, then
indirect reports or media accounts are what we have to work with. We can place what
we learn on a continuum that ranges from absolute certainty, through not being sure to
utter disbelief. The ideal source is one about whom or which we have absolute certainty.
We can be pretty sure about the grade results of a school or college or when an education
authority tells us that the schools in its area are short of five mathematics teachers. We
can equally be sure that we do not want to use as a source something that we know to be
wrong. Look at Case Study 6.2 and make up your own mind. It highlights an important
point about what can influence the transformation of data into information.
Most of the sources we use are likely to be ones on which we can rely. There may
be some about whom we are not sure but this is where triangulation comes into its
own (see section 3.2.(iii)). However, there may be circumstances when a knowledgeable
source offers us information that is or may be questionable. As researchers we have to
have our antennae tuned to detect this possibility. Imagine a situation in which we are
interviewing head teachers about the policies they have introduced and the practices
they implement to raise pupil achievement. One head teacher tells us that the results
at another school are the consequence of the school implementing a selection policy
under the counter, something that is not permitted, and not by improving teaching and
learning methods. How do we react to this as information? It falls into a broad category
of rumour and opinion that generally should not be used as evidence unless it can be
substantiated. A comment to the same effect from another head teacher makes it more
plausible but only a dissection of the school enrolment process and an analysis of its
outcomes over several years will give us information that we can totally trust.
228 Research Methods for Education Part 2 The Process of Data Collection
‘… There aren’t separate sets of “evidences” for Ultimately, as AIG says, it is how you use the data that
evolution and creation – we all deal with the same is important; ‘“facts” don’t speak for themselves, but
evidence (we all live on the same earth, have the must be interpreted’. This is the role we all take on
same fossils, observe the same animals, etc.). when we undertake educational research – and that
The difference lies in how we interpret what we is why we should always know the background of the
study.’ person making the interpretation.
Credibility
Credibility is the second element in being an authoritative source. Establishing the
credibility of a source of data means that we have to take account of the experience, role
and position of the source in relation to our research issue. If we wanted to quantify
the need for early years provision in an area, then the Director of Social Services, the
Director of Housing and the Director of Education in the area are likely to be able to
provide us with information. However, if we wanted to know what works in terms of
improving young children’s sociability and communication, we should talk to early
years teachers. In other words, it is position in relation to the issue that is important,
not position in a hierarchy.
Chapter 6 GETTING THE RIGHT INFORMATION 229
However, we should not be dogmatic about this. Above we said that opinion or hearsay
should not normally be used as data but there are circumstances when it is valid. When
someone in authority says ‘We have to’ or ‘I think we should…‘ then it is reasonable to
use this as information because it comes from a source who can make things happen.
Power in this context is important, even when those who exercise it are wrong. Equally, if
sufficient numbers of people say, ‘I think that…’, then even their unsubstantiated views
become legitimate information because they exercise a political influence, irrespective of
whether they are right or wrong. So sources can be credible if they have the possibility of
changing situations, and this possibility comes from individual authority or a collective
voice that can influence decision making processes.
Reputation
Is the third factor to consider in establishing how authoritative a source is. Reputation is
the way in which someone is perceived, either in general by society or more specifically by a
professional or other group. In most cases reputation is far less important than knowledge
relevant to the research but there are investigations where it is crucial. Examples are research
into how policy is determined, why a problem occurred or something went wrong, whether
guidelines are being followed in practice, in other words research into issues that could
show someone in a less than favourable light. Here reputation is inextricably tied up with
truthfulness. Our task as researchers is to ensure the accuracy of the data we collect and
for this the responses we receive from people involved with the issue have to be truthful.
How do we ensure that someone does not misrepresent what happened in order to
present him or herself in a better light? The answer is triangulation, testing and comparing
evidence from different sources and making the process known to all involved. This is
where reputation becomes important and, for respondents, the issue is that untruths may
come out. So when we talk about reputation, it is not a source’s reputation itself that is
important (though this is always good for validating a research claim) but the fear of
losing that reputation. It is the possibility of this that is the researcher’s lever for gaining
the truth. We should recognise, though, that there is an ethical dimension here because
our tool for ensuring the accuracy of our data is public or professional humiliation. We
can either be absolutist and say that truth is paramount or reach a judgement about the
significance of our research in relation to the possibility of a personal scandal. Fortunately
these issues are unlikely to arise when you are just starting out in education research.
If getting the right data from an authoritative source is the bedrock on which good
research is based, the question then arises, how perfect does everything have to be?
To answer this, we return to the points briefly made in Chapter 1. Of course, what
would be ideal is always best but what is ideal often runs foul of the real world in which
we all live and work. There has to be a balance between what is ideal and what is possible.
It is important that we do the following for each research project we are involved with.
t Plan the whole research programme. Specify the data required, identify the data
sources, determine the quantity of data required for the analysis and the number of
sources needed to generate this quantity. Note here that we have to work backwards
because, in order to convince others with our explanation, we may need a specific
amount of data to generate the information we require. We have to plan forwards
and, following this, backwards. We also should specify the way in which the data
will be processed and analysed in order to release the information content. Finally,
for each of our activities, data collection, data analysis and data reporting, we have
to enumerate the resources required to carry out the task and the time each one will
take. It is a good idea to plot these quantities against a timeline. Figure 6.1 shows
Chapter 6 GETTING THE RIGHT INFORMATION 231
2005 2006
Jul Aug Sep Oct Nov Dec Jan Feb Mar Apr May Jun Jul Aug Sep
Observation of
user activity
1.1: Identify groups
1.2: Focus groups
1.3: Web surveys
1.4: Analysis and report
Design evaluation
3.1: Identify demonstrators
3.2: Build mockups
3.3: Evaluation trials
3.4: Analysis and report
Dissemination
4.1: Website build
4.2: Conference/workshop
feedback
4.3: Delphi exercise
4.4: Journal/conference
paper presentation
4.5: Final report
As well as being authoritative, the people and organisations that give us data have to
be representative of something meaningful. A city administration that manages to turn
around education provision in schools would be representative of good systems organi-
sation and management. A survey of the views of a selection of early years senior prac-
titioners in the UK on the quality and value of their professional qualifications could
be representative of judgements nationwide of the ability of universities and colleges
to organise professional education in a sector that they have long ignored. A survey of
what happened (in terms of student grades and performance) in Sweden following the
removal of selection procedures for entry to secondary education could indicate what
234 Research Methods for Education Part 2 The Process of Data Collection
might happen in England if the same policy were to be adopted. (The School Admissions
Code in England and Wales seeks to ensure equity in access to schools. It is overseen by
the Office of the Schools Adjudicator.)
In order for our data to be representative, we have to ensure that we can explore a
more general agenda or issue through the data that we collect. If we deal with everyone
or every organisation in a group, for example, every school operating and recognised as an
academy in the UK, or every teacher dismissed for having an inappropriate relationship
with a pupil in a given period, the issue of being representative is not a problem because
we are considering what statisticians call the population. However, if the population is all
PhD students currently registered with Swedish universities or teachers in their first year
of employment in the Netherlands, then the task of contacting them may not be so easy.
In situations like this we select people or organisations from the whole population in a
process that we call sampling.
We will now deal with probably the most important stage in the research process. In
it we shall look at the following:
t why we sample;
t the basic principle of sampling, linking the sample to the population;
t in the section below on research practice we shall look in detail at different approaches
to sampling, when to use a particular approach, how many people or organisations to
include in a sample and, finally, what can go wrong with samples.
parking spaces provided on all university campuses. The only option in this case is to
sample and to make contact with those who can be identified. This poses problems for
analysis and relating the findings of the sample to the population but it is all that the
researcher can do. There are also many situations in which we can estimate the size of the
population (for example, children under 16 who have one or more parents with AIDS)
but the only realistic solution to finding out how this impacts on their lives is to carry
out a sample.
Target population All of the potential sources of data Size may be known or
determined by the research issue. unknown.
Sampling frame Every instance that can be used to Not every instance may
generate research data. be identified.
Sample The instances that are selected to Extent to which it is
provide information. representative of the
population.
Chapter 6 GETTING THE RIGHT INFORMATION 237
If we were researching all forms of childcare in an The crucial thing about the sampling frame is that it
area, we could define our population in terms of should identify every element of the population in
organisations listed in a directory such as Yellow Pages order to make the sampling robust and in order for
or a classified telephone directory. However, while us to be specific about the nature of the relationship
these sources will give us an immediate population between the sample and the population.
to sample, the likelihood is that they will not identify When we are unable to identify and enumerate
all the organisations and carers. As a sampling frame every element in the population, the sample frame
they are, therefore, limited. changes in character and is only indicative of the link
Test this out in an area known to you. We could use between sample and population. We may have an
both of the following websites as our sample frame. estimate of the number of men and women with
HIV/AIDS but we cannot identify and enumerate each
t Go to Yell.com: identify childcare providers. one. If we carry out a sample of HIV/AIDS sufferers,
t Go to Family Information Services for the same we have to assume that it is representative, we can-
area: identify childcare providers. not prove that it is.
Sample choice
The first part of this section has outlined the criteria we can use to judge whether a source
is appropriate and the principles we use to determine whether we collect data from the
whole population or a sample. In this part we turn to the practice of carrying out a
sample and the methods open to us.
1
2
3
4
Number each case in the 61
target population to 77 s
s
generate the sample frame 387 s
s
s s
s s
s s
429
The random numbers are used to identify the schools. Once these have been selected,
we then contact them to request the data that will answer our research question.
In drawing a simple random sample we have to take account of one decision we
might have to make: what if the same data source were chosen twice? Fortunately this is
less of a problem if we use a random number generator rather than a random number
table because we can usually suppress duplicate numbers. But what do we do when
we meet this situation? The answer is that we will almost certainly reject the second
selection. Think of it in terms of the respondent to be contacted. There would be some
head scratching on receipt of a second request for information that was accompanied by
a note saying, ‘We know that you have already sent us this information but please send
it to us again’. This decision means that we have chosen to sample without replacement.
There may be situations when it is appropriate to sample with replacement, that is, count
the same set of data more than once, but they are rare. In general, re-using data reduces
the efficiency of the sample, that is, the sample is a poorer description of the population.
Think of it like this. If, in our sample of 510 Danish schools, we allow replacement and
if we have 10 duplicate samples and if each of these is one of the 10 lowest staff/student
ratios, the picture we get of all of the schools will be distorted. It is this potential for
Chapter 6 GETTING THE RIGHT INFORMATION 241
distortion that means that sampling with replacement is less efficient than sampling
without replacement.
that must be contacted to give us the data we require. A numerical example will
show this.
Once we have determined the size of the sample (and we shall see how to do this
later in this section), we know the proportion of the total population to be sam-
pled. In this case, the sample size is 400 households out of a total population of
10,000 houses. This means that we have to make contact with just 4% of the houses
(400/10,000). With random sampling we would have drawn 400 random numbers
to identify the sample cases, with systematic sampling we do it differently.
t This proportion (4%) is used to determine the selection of the houses that are
contacted: 4% means that 1 house in 25 is contacted. Once we know this, the rest
is straightforward. The first step is to establish some means of identifying houses.
It is easiest to give each one a unique number. The process for doing this is shown
in Figure 6.4, which shows a portion of the Swedish town of Jönköping taken from
Google Earth. Moving up and down each street, we give each house a number. We
continue this until every housing unit in the sample area is numbered. Starting
with the first identifying number, we draw a random number between 1 and 25 to
determine the first household to contact. Figure 6.4 shows that we selected the 25th
house in the sequence. After contacting the inhabitants and obtaining our data, we
count 24 households along our route and select the 25th (number 50 in the sequence).
5 to 9 10 to13 17 to 14 18 to 23 31 to 24 25
This process is repeated. If a household refuses to take part or is absent, we then move
to the next available household.
Systematic sampling can be adapted to many situations. It can be used to sample
pupils in a classroom, a telephone survey of parents with children at a school, stu-
dents entering a university library, primary schools in an area, education districts and
many more.
(c) Stratification
Stratification is a procedure that is applied to random and systematic sampling rather than
a separate approach to sampling. We shall first look at how to stratify a sample and then
at the benefits of stratification.
Stratification works by separating the population (and thus the sample frame)
into categories. These categories are usually meaningful in terms of the analysis that
we subsequently intend to carry out. For example, it would be reasonable to think of
populations structured by gender, or age group, or family size. It would be equally
reasonable to structure a sample by type of school, university department or year of first
enrolment at college if these are the factors we intend to investigate in our research. The
crucial thing is that we should know what proportion of our total population is male or
female, or is a single child or has siblings, or is a primary school or secondary school.
The classification can have more than two classes; for example three age groups each
split by gender. All that is necessary is that we should know, before we start, what the
proportion each group is of the total. The purpose of stratifying like this is to minimise the
use of resources in data collection without affecting sampling quality. What we are doing
is breaking up the sample into units that are more homogeneous than the population
overall. Because there is less variety in each unit or stratum, we need make contact with
fewer cases than if we were to sample the population as a whole or, as in the example
below, we would obtain sample results that are a far more accurate representation of the
population.
The proportions in each stratum or unit are then used to allocate the sample. Let
us imagine that a population of schools is classified into four types of curriculum
specialisation: business studies, sports, science and other. The information we have for
our school population is that 18% specialise in business studies, 27% in sports, 36% in
science and the remainder (19%) in other subjects. If our sample size is 280, then we have
enough information to stratify our sample. The simple rule is that each stratum accounts
for the same proportion of the sample as it does for the population. Table 6.3 shows the
results for our example. The total sample of 280 schools is built up from 50 specialising
in business studies (280 3 18%), 76 in sports (280 3 27%), 101 in science (280 3 36%)
and 53, the remainder. The schools are sampled either randomly or systematically.
Chapter 6 GETTING THE RIGHT INFORMATION 245
Summarising stratification
t The groups (strata) that are used must be meaningful in terms of the research issue or
analysis.
t The relative size of each group in relation to the total population must be known.
246 Research Methods for Education Part 2 The Process of Data Collection
t The sample size of a stratum is determined by the stratum’s proportion of the total
population.
t Samples are chosen by a random or systematic procedure.
t Overall sample size may be reduced by treating each stratum as a population and
determining the sample size for each stratum separately.
England
394
840
Cluster sam pie of
805
909
841 808 806 807
North West
Yorkshire and secondary schools
Sampie procedure the Humber
816
Sam pie frame
• 150 local education
811
• Sampie types
888
890 380 383
810
381
authorities (lEAs)
889
in nine administrative
340 342
831
892
ofwhich 894
860
855
926
Sampie procedure
comprehensive 2,815 West
893
336 335
333
332 330
856
857
874
• Select a sampie of
grammar 164 Midlands 331
937 928
873
935
education authorities
modern 115 884
885
826 820
East of England whose schools will
middle 250 821
919
then represent all
other 23 secondary schools in
916 881
931 825
801
803
866
887
882
England
Academies
867
27
802
800 865
936
886
850
South East
933
908
880
879
Greater London
sample. Thus, if the total sample is 400 schools, grammar schools (equivalent to
gymnasia in many other European countries) would be 164/3,367 per cent (that is,
4.87% = 19 schools) of the total.
t Cluster analysis works differently. In this case the sample frame is the 150 local edu-
cation authorities in England. A number of these are sampled and their schools are
taken to be representative of all the schools in England.
We can now see from this example that stratification seeks to segment the population
horizontally (the types of school are found throughout England), while clustering
segments the population vertically (it assumes that all school types are found in each
area). In terms of sampling theory, stratification seeks to maximise homogeneity in the
sample (that is, schools in any one type are more similar to each other than to schools in
other types), while cluster sampling seeks to replicate the variety found in the population
as a whole.
Cluster sampling is likely to produce benefits in terms of reduced cost. With the
example in Figure 6.5, it is far easier to identify local education authorities than it is to
identify the 164 grammar schools in England – and then to repeat this for all the other
types of school. However, the theoretical evidence suggests that cluster sampling is not as
effective in reproducing the features and characteristics of the population as stratification.
Indeed, because clusters are unlikely to replicate the character of the whole population,
there is a strong likelihood that cluster sampling may produce weaker estimates of
population characteristics than simple random sampling. Because of this uncertainty,
cluster sampling should only be preferred over other methods when cost, time or ease
of access to data sources are a major consideration or when it is not feasible to identify
and enumerate individually all possible data sources. However, on the plus side, because
the selection process is random, this enables us to make use of the statistical procedures
in Chapter 11.
Case Study 6.3 illustrates the use of cluster sampling in research. It describes three
studies, each of which shows a different aspect of cluster sample design.
used as the data source. With multi-stage sampling, the final stage would be to randomly
sample pupils from the year group.
Like cluster sampling, multi-stage sampling is effective in managing large populations
and, especially, populations distributed over a large area. By concentrating the actual
process of data collection, the procedure allows resources to be concentrated, with the
likelihood that effort and costs are reduced. Case Study 6.4 describes the use of multi-
stage sampling in the Family Spending Survey.
Multi-stage sampling can also be used effectively by individual researchers, as the Greek
researcher, Theodora Papatheodorou (Papatheodorou, 2000), has shown in her study of
the methods used by Greek nursery teachers to manage behavioural problems. Figure 6.6
Chapter 6 GETTING THE RIGHT INFORMATION 249
The sample frame consists of a postcode address file As with all surveys, there are rules to deal with prob-
called the ‘small user’ database. A ‘small user’ is de- lems (such as more than one household at an address
fined as any address that receives fewer than 50 postal or defining what makes a household ineligible for selec-
items a day. This definition excludes large businesses tion). In your samples, you should try to identify what
and organisations such as hospitals and colleges. The might cause problems so that you can set up rules be-
database is updated twice each year. forehand rather than make them up as you go along.
The sample design is multi-stage. *Each country has its own definition of ‘head of household’ (or
an equivalent term). In the UK the definition is one that con-
t First, the UK is divided into a number of Primary tinues to evolve to reflect changing social relationships. Prior to
Sampling Units (PSUs). The PSUs contain the basic the 1980s, the head of household was assumed to be a man in
data sampling units, the small user address file. The a household where there was a man and a woman. in the case
of cohabitees (for instance, young people sharing accommoda-
source of these data units is the Post Office’s list of tion), the head was the person who was responsible for paying
addresses nationwide. The PSUs are clusters of ad- for the accommodation. A decade later, as the once assumed
dresses from the small user address file. relationship between household and family was breaking up,
t Second, the PSUs are clustered by administrative the head of household was defined as the ‘household reference
person’, the person named as owner or on a rental agreement.
area, initially grouped by Government Office Re- Where two or more people were named, the household refer-
gions and, within each region, by metropolitan ence person was the one with the highest income. The ‘highest
and non-metropolitan districts. income householder’ is now used as the principal indicator.
is a diagrammatic representation of the process she used. First she randomly sampled
regions, selecting three out of 10. The regions selected are shown in blue in Figure 6.6.
From the three regions, she then selected nine sub-regions. She sampled 25 teachers out
of all those employed in each of the sub-regions, giving a total sample population of
225 teachers to represent all nursery teachers in Greece. A total of 154 teachers responded
to her survey. Her goal was to understand which management approaches were used in
what circumstances. Teachers described their approaches and Papatheodorou classified
them as behavioural, cognitive or punitive. All three approaches were found to be used by
teachers and Papatheodorou’s assessment was that there was more of an emphasis on res-
olving short-term problems than resolving the long-term issues that might underlie them.
Multi-phase sampling is a variant on multi-stage sampling. With this approach the
population is sampled at two levels, a general level and at a more detailed level for a
250 Research Methods for Education Part 2 The Process of Data Collection
Nation
Regions
Sub-
regions
25
teachers
smaller (but still randomly drawn and representative) set of the data population. The
procedure allows the findings at the more detailed level to be applied to the more general
level. An example will help illustrate this. Let us imagine that we are investigating teacher
recruitment and teacher retention. We could use a cluster approach to identify our sample,
first selecting areas and then the sample schools within them. Having identified our sample
schools, let us say 88, we then ask each head teacher to complete a survey. This would ask
about staff turnover in general and in particular subjects, about the numbers of applicants
for posts both currently and historically, about how well qualified the applicants were
and so on. The second phase would be to treat the 88 schools in our first sample as a
population and conduct a survey of teachers in a sample of these. If we manage this process
well, we can apply the results of the second survey to the wider population. We would
ask questions about job satisfaction and what affected it, the quality of management and
support, problems of pupil behaviour, whether teachers had been approached with job
offers, how long teachers intended to stay and why they would move. In other words, we
would look at the issue from the perspective of teacher experience and seek to relate that
to the broader picture of staffing given us by the head teachers.
A multi-phase sampling approach is well worth consideration by individual
researchers and small research teams because it allows detailed insights to be collected in
a cost-effective way. For example, at the more general level a research programme might
collect data on people’s behaviour, while a carefully drawn sub-sample could collect
additional data on attitudes, feelings and emotions that might drive those behaviours.
Simple random For smaller studies where the population t Costs increase with sample size.
can be defined and itemised. t If using a table of random numbers,
ensure first number is randomly selected.
Systematic For larger populations where we can t Need to know size of population to
create a sequence in the data, e.g. people determine sample fraction.
passing a point, houses in a street. t Selection of first survey point should be
random.
Stratification To achieve a more accurate sample (or t Link stratification to factors that might
to save cost if you are happy with less ac- influence or be associated with outcomes
curate results). (e.g., age and attitude, social status and
behaviour).
t Setting up a stratified sample frame can
be costly.
Cluster For large populations where we can ’drill t Higher sampling error than random
down’ to identify groups to represent the sampling.
whole population. t Clusters should not be uniform in
character.
define the sample are also likely to be frameworks through which we analyse the resulting
data. In all likelihood, we would group our parents into those with one child, those with
two, those with three and those with four or more and use this classification to process
the data we collect. Let us assume that the proportions are as given in Table 6.5. From the
family types we are using to classify our parents, it is possible to see our research questions:
t Does the number of children in the household affect parental satisfaction with holi-
day play schemes?
t Does whether a parent is working affect parental satisfaction?
t Does the number of parents in a household affect parental satisfaction?
t Are levels of satisfaction of working parents different from those of non-working
parents (this is called the interaction effect) and does this differ for the number of
children in the household?
The percentages in the second column of Table 6.5 represent the proportion of all
households with children in a population of 2,000 households. If the sample size is
300 (we shall see, when we look at how to calculate sample size, that this implies we
are happy to establish levels of parental satisfaction plus or minus 5%), we multiply
the household type percentages by 300 to give the sample size for each household type.
Thus, for a single working parent with 1 child:
1.8% of 300 is 1.8/100 3 300 5 1.8 3 3 5 5
Having established the size of each quota, we then identify locations where we can
expect to meet the sort of people we are interested in. It could be a combination of
locations at different times, outside school gates or in a shopping centre at the weekend.
The surveyor identifies likely respondents (excluding, for example, those who appear
to be too young or too old) and the first question is always to establish whether or not
they meet one of the quota categories (‘Do you have children? Does your child/do your
children attend holiday play schemes?’).
Before we leave this example, look again at the quotas. There could be problems here.
In the quota sample that we have constructed it is difficult to use a sample of one to
establish the views of a whole category. In this situation, we can either:
t scale up the whole sample (we would need at least 15–20 in the smallest category –
this would make the whole sample between 4,500 and 6,000); or
t we can choose to be non-proportional and have a minimum size or equal quota per
group. This places more emphasis on the data analysis and the comparison of behav-
iours, judgements and attitudes rather than on the characteristics of the population; or
t we can merge categories (and quotas). This is what we would be likely to do in this
instance. But even then we may have to increase our sample size.
Quota sampling can be very effective and it is relatively straightforward but before we go
down this route, there are some other things to take into account.
t First, while it is usually fairly easy to begin filling up the quotas, it can take a frustrat-
ingly long time to find the final subject (likely to be the working single parent with
more than one child).
t Second, it is important to think how the data, once collected, will be analysed.
t Third, if we want to say something sensible in our conclusions, we have to base it
on a reasonable number of people (as a rule of thumb, not less than 30 per group).
Even then, the sample size we start with might not be the sample size we finish up
254 Research Methods for Education Part 2 The Process of Data Collection
with – after we have started to process our results, we might find that we need more
respondents to make the issue clear.
t Finally, while quota samples can generate good results, this is not guaranteed. It all
depends on how good the selection of cases in the sample is. Bias is possible. Read
Case Study 6.5 for an example.
who are respected by their peers. The snowball sampling technique works by an interviewee
or respondent identifying other possible respondents for the researcher. The assumption is
that people with the attitudes, beliefs of behaviours that we are interested in are more likely
to know others with similar attitudes, beliefs or behaviours than the general population
or even us as researchers. With this approach it is important to identify several initial
contacts in case one of the referral chains goes cold. Researchers have to have a very good
understanding of the issue and a sense of where initial contacts might be found. They have
to appear credible to the contacts, otherwise there may well be a reluctance to expose a
sensitive friendship or contact network. For this reason the approach is probably best used
by established researchers. Anyone conducting their first research enquiry should think
seriously whether there is an alternative approach. Internet technology, though, may have
some solutions. Baltar and Brunet (2012) showed that the use of social network sites led
to an increase in the number of respondents, possibly because the respondents feel more
confident when they see the researcher’s profile in the public domain.
While snowball sampling presented here is purposive in nature, we should be aware
that sampling concepts and theory are always moving forward. This is the case with
snowball sampling and the development of what is being called respondent driven
sampling. This seeks to establish estimates of a snowball sample’s reliability using a
mathematical procedure to model the population. By bringing snowball sampling much
closer to probabilistic principles, it is probably one of the most important developments
in sampling theory in recent years. It is too advanced to deal with in an introductory text
such as this but it can be followed up by reading the work of the American sociologist,
Douglas Heckathorn (see, for example, Heckathorn, 1997).
one expert which others they would recommend, a type of snowball process. Where
the research is more concerned with experience and intimate knowledge of a situation,
the selection criteria are likely to include the degree of exposure to the situation. Each
research project will generate its own criteria.
There are particular circumstances in which specialist group sampling is important. One
has already been noted, constructing a focus group. Another is prediction and what is called
‘foresight planning’. This is a process whereby people work collaboratively to develop scen-
arios of what the future might be like. Case Study 6.6 has an example of this particular
approach to explore what would be appropriate vocational education in the Netherlands.
In the context of this damning assessment, we should ask why such a flawed system
ever saw the light of day and what value there is in using it. There are obvious benefits
to media organisations. Programme makers use it to involve their audience and create
audience participation. This encourages the audience to return and helps keep viewing
or listening figures high. Newspapers choose topics (and sometimes phrase questions) in
ways that appeal to their core readership. This appeals to readers’ values and reinforces
the paper as an outlet for their opinions. Again, the benefit is to the bottom line of the
papers.
But when should academic and professional researchers use it? A Web survey is cer-
tainly convenient for the researcher because completed surveys can be fed directly into
a database for analysis. It could be used within an organisation to allow members to
express their opinions about how it is run or its development strategy. It is often used
to allow students to express their views of a course. It can also be used for a preliminary
exploration of a field, where the object is to identify issues that can be explored further
within a representative sample. In this case the fact that enthusiasts or those with strong
opinions are more likely to respond is a positive advantage. If the issue is sensitive, then
a self-selecting sample may be the most effective way to generate a database for more
detailed surveys. For this reason the method appears to be used more in health education
research than other areas of education research. The sort of example we might meet is
young people self-reporting smoking, drug use and sexual activity.
Quota Small to medium sized samples where it t Many analytical statistical procedures are
is important to isolate the views/actions based on the assumption that the data
of well-defined groups. Usually quicker is collected on a random basis. Quota
to collect sample data than with sampling does not meet this condition.
probability samples. Note, however, that with some techniques
this assumption can be relaxed without
affecting the analysis greatly.
Snowball Where the population is hard to reach t Researcher must be credible to gain entry
or where the population is defined by to a network.
personal knowledge or reputation. t Need to have sufficient awareness of the
issue being investigated to judge whether
contacts and leads are appropriate.
Specialist group When the research calls for a specialist t Develop criteria for selection of group
perspective or input. Frequently used for members.
futures analysis. t Put in place a policy of what to do if
members drop out in the survey process.
Convenience Use for preliminary studies or when the t Do not attempt any statistical predictions
population is small or when time is of unless the group is demonstrably
the essence. representative of a larger population.
Case study To represent a particular situation t More a case of selection according to the
(good, bad, typical) or to understand features under investigation rather than
the process that has led to an outcome. sampling.
t Justify selection.
Self-selecting Targeted studies. Especially appropriate t Demonstrate how representative group is
for Web-based surveys. of larger population.
t Ensure against multiple completion of
survey and take care to check that a group
does not ‘fix the vote’.
prevent a group with a particular interest from rounding up its members and getting
them to complete the survey? Both of these will generate bias. With any sample, however,
it is important to show that the group is representative of something other than itself. We
have to be able to talk about a population. If we cannot do this on the basis of a sample,
it hardly seems worthwhile collecting the sample in the first place.
How do we choose which sampling method to use? The principal condition that we
have to satisfy is that we should use the method that gives us the best results for the
circumstances of our research. In selecting a method we should take account of the
following points.
t The purpose of our research: If we specify a hypothesis that we want to disprove on the
basis of probability – the likelihood of the hypothesis being wrong (see Chapter 3
Chapter 6 GETTING THE RIGHT INFORMATION 261
again), then our choice is highly constrained; we must use a probabilistic sample. We
should only use snowball, specialist group, convenience and self-selecting samples
and case studies for the purposes for which they are intended. For general surveys the
choice is usually a random approach or quota sampling.
t The nature of our analysis: If we intend to use a statistical procedure to infer a
relationship or to determine the characteristics of the parent population, we again
constrain our choice and must use a probabilistic sample.
t When speed is of the essence: We can choose between a probabilistic or non-
probabilistic sample. In general it takes more time (and cost) to collect data using
random approaches than with quota sampling.
t When resources are limited: Our choices are limited in the same way as when we have
to sample quickly.
The next question is, How large should our sample be?’ This is difficult to answer at this
stage because the answer depends on understanding something about the character of
a statistical population that we call the normal distribution. (We shall look at this in
more detail in Chapter 12.) There are several ways of determining sample size, but the
most precise relies on being able to calculate some statistical measures of our sample that
we shall only meet in Chapter 12. Because of this a detailed answer to the question, ‘How
large should a sample be?’ is given in Appendix 1.
However, there is another solution to determining sample size, which is to use
sample size calculators that we can find on the Internet. There are several on-line
calculators for determining sample size but using them is not necessarily as straight-
forward as it may seem. For one thing, we have to know the size of our population,
and as we saw there are some probability-based approaches to sampling that allow
us to drill down through our population to draw a sample without necessarily hav-
ing a precise knowledge of the population size. In addition, the calculators are not
always clear which sampling method they are predicting for. If they do not say, then
we should assume either a simple random or systematic procedure. We also have to
be aware that the sample size calculators usually assume that what we are interested
in investigating is divided into only two classes, a and not a. This last assumption is
the basis, for example, of polling the popularity of a political party (such as the Green
Party in the UK, Sweden and the Netherlands), where we either support or do not
support the party. While we can configure education data into this a and not a clas-
sification (for example, young people with a criminal record and those with no crimi-
nal record, or classroom assistants who have an educational qualification and those
who do not), we are far more likely to be dealing with data that is divided into more
than one class (such as age groups, attainment levels, number of days’ unauthorised
absence) or is actually a continuous value (such as measures of IQ or age). In these
cases, if we use an on-line calculator we implicitly have to restructure our data into
two classes. In the long run, this may be less than satisfactory in terms of its effective-
ness in estimating sample size when compared with some of the approaches we shall
meet in Chapter 12. Having said this, however, on-line calculators are far quicker than
any other method.
262 Research Methods for Education Part 2 The Process of Data Collection
50 43 49 44 49 47 49
100 74 95 80 96 87 98
150 97 138 108 141 123 145
200 116 179 132 185 154 191
250 131 218 151 227 182 236
300 143 255 169 267 207 280
400 162 324 196 343 250 365
500 176 387 217 414 286 446
1000 214 629 278 706 400 806
2000 239 917 322 1091 500 1351
3000 249 1082 341 1334 545 1743
4000 254 1189 351 1501 571 2040
5000 257 1264 357 1622 588 2271
10000 264 1447 370 1936 624 2938
20000 267 1560 377 2144 644 3444
50000 270 1636 381 2291 657 3841
Sample size calculators are often available as free resources from commercial compa-
nies. Amongst the most straightforward to use are those provided by:
t Creative Research Systems – www.surveysystem.com/sscalc.htm.
t Raosoft – www.raosoft.com/samplesize.html.
t National Statistics Service (Australia) – www.nss.gov.au/nss/home.NSF (access via
statistical references).
Table 6.7 gives sample sizes with particular margins of error for populations of known
size based on various assumptions. The sample sizes are derived from on-line sample
size calculators. The population from which the sample is drawn is shown in the left-
hand column (it ranges from 50 to 50,000). The other columns show the conditions
that affect the sample size. Each column has two conditions: the amount of error we are
prepared to accept (which we call the confidence interval) and the level of assurance
that we will be within this margin of error (which we call the confidence level).
t The confidence intervals shown in the table are 5% and 2%. The best way to under-
stand these is in terms of the opinion polls that we read in newspapers. If these report
that ‘42% of people will vote for…’ and that ‘the margin of error is 2%’, it means that
between 44% (42 1 2) and 40% (42 – 2) will vote for…
t The confidence levels shown in Table 6.7 are 90%, 95% and 99%. These mean that
for 90% of the time we want our samples to be within the error we specify or, for 95%,
our samples should be within the error 95% of occasions and the same for 99%.
Putting these two conditions, confidence interval and confidence level, together we
see that for any sample size we are demanding greater levels of accuracy in our sample
as we move from left to right. So, if our population is 200, we need 116 respondents to
Chapter 6 GETTING THE RIGHT INFORMATION 263
meet our condition (the proportion of the population who will vote for a political party
or the proportion of schoolchildren who will have school dinners) of being within 5%
of actual value for 90% of occasions. Now let us look at the figures in the table. Some of
them make sobering reading. We should note the following:
t The sample sizes increase from left to right in the table because the sampling condi-
tions become more rigorous from left to right.
t The increase in confidence interval from 5% to 2% produces a marked increase in
sample size that is particularly apparent in larger samples.
t The increase in confidence level, from 90% to 95% and 99% has a more limited but
still significant effect on sample size.
t It is barely worth the effort of setting up a sample design if the population is 100 or
below and the benefits of sampling are questionable if the population is 200 or below.
t Most individual researchers can cope with a sample size of 200 to 500. These are
shown in blue in Table 6.7. These demonstrate the trade-off between accuracy and
the ability to manage a survey single-handedly. As the size of population grows, we
have to limit the levels of accuracy we aspire to.
It is important to understand the significance of sample size in relation to the value
we can place on research conclusions. A review of research on thinking skills published
in academic journals carried out by researchers at Newcastle University (led by Professor
Steve Higgins) as part of an initiative to identify effective practice (Higgins et al., 2005),
specifically draws attention to sampling as an issue that was considered when deciding
whether to include research in the assessment or not.
Researchers and journal editors should note that studies were often excluded
because basic information… (such as number of pupils involved) was not included
in the published papers. Moreover, the details of sampling strategies and full sets
of results were frequently omitted. Abstracts sometimes referred to data which was
not then reported in detail. (p. 4)
The point is well made, especially in the context of Table 6.7, that without information
on sample size, population size, sample method and confidence limits and levels, we
cannot assess whether the researchers have reached a correct, incorrect or even valid
conclusion. Activity 6.3 will help you appreciate the value you can place on other
people’s research.
Activity 6.3 What level of reliability can you place on research results?
This activity applies your knowledge of sampling on sample and population size and confidence
techniques and practice to published research. Pre- level, use a sample size calculator (such as http://
pare a: www.surveysystem.com/sscalc.htm) to check the
confidence interval. If you find a study with a small
1. checklist of sampling techniques;
sample size, you might be surprised.
2. template to show the sampling technique used in
t For each article, judge whether the sampling
a research paper and, for probability samples, the
method is appropriate. Pay particular attention to
confidence interval and level selected, the sample
the non-probability approaches that are used.
size and the population.
t How many articles provide information on popula-
t Find five journal articles that have used sampling tion and sample size that allow you to assess the
techniques. For each one, complete the template reliability of the conclusions?
as far as possible. Where you have the information
264 Research Methods for Education Part 2 The Process of Data Collection
Let us look now at what can go wrong with a sample. Making the theory of sampling
work in practice may not always be straightforward. Things can go wrong which can
affect the quality of the data collected. However, unless we are faced with a complete
disaster the situation can normally be salvaged. In this section we shall see what we can
do to resolve problems when things go wrong.
t Build the likelihood of drop out into the sample from the start. The implication of
this is that we over-sample from the beginning. As long as we can afford it, this is not
a problem.
t If we have not done this, then we can replace like with like (another university in
the case of our study of students settling in). This is not the best solution since we
are assuming that the same sample characteristics will generate the same response
pattern.
t If this is not an option, then we can weight the data from any category that is under-
represented. We will know which categories these are by comparing the sample to the
population. In our survey of students, we might find that male and female represen-
tation is acceptable in the sample but that overseas students are under-represented. If
this is just one case, there is not likely to be a problem but as the under-representation
increases, we have to hope that the (few) cases we are left with are still typical of the
category. This would be a problem, for example, if the university with the outbreak
was small and the next smallest was twice the size.
Summary
Sampling is the bedrock on which research is based. If the sample is bad, the analysis
and research results are worthless. The following points are the principal messages
from this section:
Further reading
From purposive to probabalistic sampling undertakes the statistical analysis and the accompany-
Heckathorn, D.D. (1997) ‘Respondent Driven Sampling: ing manual.
A New Approach to the Study of Hidden Populations’,
Probabilistic sampling
Social Problems, 44(2), 174–199.
Douglas Heckathorn shows how the use of a statistical Barnett, V. (2002) Sample Survey Principles and Methods,
modelling approach called Markov Chain Analysis can 3rd Edition, Arnold, London.
be used to give probability estimates for a snowball Sheaffer, R.L., Mendenhall III, W. and Ott, R.L. (1996)
sampling procedure. His work was on drug users and Elementary Survey Sampling, Duxbury Press, London.
he used one user to identify others. He classified those For a fuller introduction to statistical theory in sampling,
who were identified according to their ethnicity and see Vic Barnett. Sheaffer et al. take us into the higher
was able to calculate the probability that a contact reaches of probability sampling approaches and pro-
would identify someone of the same or different eth- vide significant detail on stratification, systematic and
nicity. This data can be manipulated mathematically single and two-stage cluster sampling. There is a useful
to estimate the size of the drug user population. chapter on estimating population size and a good sec-
For more information on respondent driven sampling, tion on how to conduct a sample when the questions
visit www.respondentdrivensampling.org. This website are sensitive or have a high likelihood of not being
has the opportunity to download a programme that answered truthfully.
References
Baltar F. and Brunet, I. (2012) ‘Social research 2.0: Fitz, J., Taylor, C. and Gorard, S. (2002) ‘Local Education
virtual snowball sampling method using Facebook’, Authorities and the Regulation of Educational Mar-
Internet Research 22(1), 57-74. kets: Four Case Studies’, Research Papers in Education,
Clark, B.R. (1998) Creating Entrepreneurial Universities, 17(2), 125–146.
UNESCO, Pergamon, Oxford.
Chapter 6 GETTING THE RIGHT INFORMATION 267
Heckathorn, D.D. (1997) ‘Respondent Driven Lintonen, T.P., Konu, A.L. and Rimpelä, M. (2001)
Sampling: A New Approach to the Study of ‘Identifying Potential Heavy Drinkers in Early
Hidden Populations’, Social Problems, 44(2), Adolescence’, Health Education, 101(4), 159–168.
174–199. Papatheodorou, T. (2000) ‘Management Approaches
Higgins, S., Hall, E., Baumfield, V. and Moseley, Employed by Teachers to Deal with Children’s Behav-
D. (2005) ‘A Meta-analysis of the Impact of the iour Problems in Nursery Classes’, School Psychology
Implementation of Thinking Skills Approaches on International, 21(4), 415–440.
Pupils’, in Research Evidence in Education Library, Samdal, O., Nuttlam, D., Wold, B. and Kannas, L.
EPPI-Centre, Social Science Research Unit, Institute of (1998) ‘Achieving Health and Educational Goals
Education, University of London. Through Schools – a Study of the Importance of
Kloep, M., Henday, L.B., Ingelsrigsten, J.E., Glendinning, the School Climate and the Students’ Satisfaction
A. and Espnes, G.A. (2001) ‘Young People in with School’, Health Education Research, 13(3),
“Drinking Societies” Norwegian, Scottish and 383–397.
Swedish Adolescents’ Perceptions of Alcohol Use’, Van Zolingen, S.J. and Klaasen, C.A. (2003)
Health Education Research, 16(3), 279–291. ‘Selection Processes in a Delphi Study About Key
Lindley, D.V. and Scott, W.F. (1995) New Cambridge Qualifications in Senior Secondary Vocational
Statistical Tables, 2nd Edition, Cambridge University Education’, Technological Forecasting and Change, 70,
Press. 317–340.
Chapter contents
Introduction 269
7.3 Taking stock and preparing for the next step 293
Summary 295
Further reading 296
References 296
Chapter 7
Learning themes
t How to open up a research question and By the end of this section you will:
dig down into the research issue.
t Know how to break down a question into
t How to model a research enquiry. component parts and show the relation-
ship between them as a system.
t How to define the variables for a research
enquiry. t Be aware of types of variable and the rela-
tionship between them and indicators.
t How to define the data (indicators) that
will describe the variables. t Know the issues that can affect data qual-
ity and take action to ensure that they do
t What standards our data must reach. not arise.
Introduction
In this section we shall look at how we can for educational researchers for local
‘break open’ our research question and investigations, for national investigations and
through this devise a model of the factors for international comparative investigations.
that we think may be influencing our research However, using secondary data is not without
issue. We will look at a useful process which risk. There is no doubt that the vast bulk of
will enable us to be more precise about our secondary data are of excellent quality and
data needs. This section is an important collected according to well-established and
link between our previous discussions of high-standard audit, census and sampling
data character and types in Chapter 4 and procedures. So, if this is the case, where is the
the approaches we can use to collect our risk? It comes from the possibility that there
data that are considered in Chapters 8 and is a gap between the data needs generated
9. It continues the theme of ensuring that by the research question and the information
the data we collect are of good quality. In that is available. It may be that the data are
Chapter 4, we saw that there was a wealth presented at the wrong spatial scale or that
of secondary data that we can use. Secondary the categories used for the data are not quite
data are, undoubtedly, an enormous resource right for our investigation. In other words,
270 Research methods for Education Part 2 The Process of Data Collection
there may be an element of compromise be sure that our data have the right focus and
between what we want for our research and the right level of detail to enable us to explore
what is available. If the compromise that we, our research issue and answer our research
as researchers, have to make becomes so great question. The approach we shall consider in
as to call into question the meaningfulness of this section will help us determine whether
our analysis and conclusions, then, in these primary data will meet our needs and, if
situations, we have to collect our own data. If not, it will then help us identify the specific
we have to generate primary data, we have to secondary data we need to assemble.
The step that we are now taking in our research is to move from the research question
to collecting the data that we need to answer our question. We know already that
the issues to consider (because we considered them in Chapter 6) is from whom (or
what) we should get our data and how we should identify them. If we were ever to
imagine that this was a relatively straightforward process, just a matter of asking some
questions, read Case Study 7.1 which shows that even professional researchers can
collect data that do not meet the needs of the research. However, before we can identify
our data sources we have to be much more specific about the implications of our
research question.
Let us suppose that we have been retained as an educational consultant to look at
the teacher supply over the medium term. There are two areas for us to research, the
first is teacher retention and the second is teacher recruitment. Let us further imagine
that there is satisfactory information about teacher retention and that a decision has
been taken to concentrate the research on the likely supply of young graduate teachers.
This, broadly, is the topic identified by Anthony Stones for his PhD research in New
South Wales, Australia (Stokes, 2007). The context for his research was that the range
of employment opportunities for graduates has increased over time and, as a result,
the proportion of graduates studying education and intending to become a teacher
fell from 21.3% of the total in 1983 to 9.7% in 2004. This context informed the way
in which the research question was deconstructed. The particular perspective that he
emphasised was why students chose not to study education. From the topics that were
investigated, we can see that his research focus was on the criteria that influenced career
decision making. These were identified through a literature search and by just thinking
about the issues prior to data collection. The decision criteria selected are presented in
Table 7.1 (factors are ranked from high to low). The interesting aspect of this study was
that students were asked what sort of salary increase would persuade them to become
teachers. For those who were unsure whether they wanted to teach, it would take a sal-
ary increase of 20% to make over 50% of them change their minds and for those who
did not want to teach, the salary increase would have to be over 40% to make over half
of them change.
Chapter 7 CRACKING THE RESEARCH QUESTION 271
v
Case study 7.1 What I say is not what I mean
There are two types of social science researcher, those The answer, of course, is that it was, so we ought to
who believe what the person opposite is saying, and ask another question, ‘How was the decision made?’
those who are sceptical, who record what they are told Concern over a healthy diet represented a market
but always ask themselves the question, ‘Why is this opportunity but no company jumps in without
person telling me this?’ This case study shows what researching the nature and extent of the opportunity –
happens when the people who believe what they are and this is where things began to go wrong. The market
told are in a position to influence decisions. researchers asked the wrong questions. A spokesman
for the American chain, Wendy’s, said, ‘We listened to
In recent years the message about the need for healthy
consumers who said they wanted to eat fresh fruit but
eating has been growing stronger and stronger. Most
apparently they lied’ (The Guardian, 23 August 2006).
supermarkets in the UK now have a ready meal selec-
tion, which is calorie concerned and pays attention From a research perspective, the problem was that
to the balance of ingredients. Organic foods are now the researchers were not sufficiently sceptical. If the
more widespread. Even the fast-food chains reacted whole world is concerned about obesity, ‘junk food’
and, in 2005, put more salads and fruit on an equal and the need to eat healthily, it is not surprising that
footing with burgers and fried chicken on their menus. someone will say that they want fresh fruit but that
While the salads remain, the fruit has disappeared deep down they would really rather have something
from many of them. The reason, of course, is that the that is starchy, sugary and filling.
restaurants could not sell it and a high wastage rate
There is a message in this for all researchers – dig deep
meant higher costs and lower profits. It was a commer-
to make sure that you probe the issue and consider how
cial decision. Yet before we just accept this, we ought
people might respond, why they would respond in
to ask ourselves, ‘Wasn’t it a commercial decision to
those ways and whether their responses would give you
introduce it in the first place?’
the data you need. And if they don’t, start over again.
What can we learn from this that can inform our own research?
It is this – the overarching question or issue has to be broken down into specific
questions or themes that determine the process of generating data. In this example of
career decision making, we can infer the progression set out in Figure 7.1. At the top
272 Research methods for Education Part 2 The Process of Data Collection
level, the overarching issue was, ‘Why did the number of students entering education
programmes decline?’ The more focused question was, ‘What was the basis on which
students chose their careers?’ This led to the identification of types of influence and,
finally, the specification of factors to test (structured around rewards, social attitudes and
personal costs). The process of breaking down the research question is an analytic one. It
is informed by the literature but, finally, it is a process of critical and analytical thinking.
It is this stage that very often determines how strong or weak a piece of research is.
The remainder of this section gives advice on how this process can be tackled.
Who is involved? Why did it What is creating What is the goal? What was the
happen? change? object?
Who is in Who was What are the What must happen to What has
authority? involved? outcomes? achieve this? happened?
What are the Who was What can intervene Who is responsible What has
consequences? affected? to affect the process? for . . .? happened that
was not intended?
What influences Who gained? Who will be What has to change? Are the
the decisions? involved? consequences
beneficial/bad?
Who is the Who lost? When does/will it Who wins/loses? Who has gained/
constituency? happen? lost?
Who stands to When did it How could it be
gain/lose? happen? better?
Should it have Why did it happen
happened like like this?
that?
What went
wrong/right?
Traditionally learning in higher education is accom- the introduction of virtual learning environments
plished through a combination of face to face con- (VLEs). In broad terms, these establish new ways
tact between students and tutors and student private for tutors and students to interact. Some of this
study, within frameworks often established by tutors. is synchronous but most is asynchronous, that is,
While this model still exists, it is beginning to be student and tutor do not have to be present at the
eroded. The reasons for this are threefold. same time. In this situation the tutor establishes
a programme of work for students to follow and
t First, the significant growth in student numbers in provides support with reading, testing and perhaps
many parts of the world has meant that tutors can on-line communication.
no longer give the same level of attention to indi-
vidual students. The question we should ask of this innovation is,
t Second, university identities and status are deter- ‘How suitable is it for higher education?’ Treat this
mined by their visibility in research terms. For aca- as your research question. Your task is to dissect the
demics, their research contribution is often the key issue and construct a series of further questions that
to career progression. In these circumstances indi- you think should be answered in order to reach a
viduals and institutions rebalance the effort that satisfactory conclusion. The word you have to crack
goes into research by reducing the amount that open is ‘suitable’. To give you a start you can range
goes into teaching. over everything from the outcomes achieved by tra-
t Third, the cost of employing academics is high (rel- ditional forms of teaching and learning, through the
ative to institutional revenues). This, in combina- nature of the higher education experience, to the
tion with other processes, has led to an increase in purpose of higher education.
the ratio of students to tutors. When you have created your own list of questions,
It is not surprising that both institutions and read the paper by Koskela et al. (2005) on this very
academics have sought new ways of managing topic. Look to see how many of your questions they
the staff–student interaction. The most significant asked. If there are differences between your and
developments in the last decade have occurred with their questions, can you say why?
are at work in our research issue or the dimensions that need to be tackled. In many
instances we will have sufficient to construct a prototype explanation, a model, of what
might be happening. How we represent this model will be dealt with in this next section.
The modelling approach is found in all approaches to research, quantitative,
qualitative and mixed methods, though we should remember that the approach of
much qualitative research is, first, to assemble the data, then sort and distil them prior
to devising a model. In this case we develop a ‘proto-model’, a first stage to help define
our data needs. Whatever our research approach, deductive, inductive, quantitative or
qualitative, modelling creates a framework for the hypotheses or ideas or questions that
are being tested. One of the most common ways of representing a model is as a process,
a set of influences on actions or activities or events and the outcomes that result. We call
this type of model a ‘systems model’.
subject areas, for example understanding the complex causation that might lead to cli-
mate change and the idea that a creature, plant, economy or city functions as a system. In
education, we are familiar with the term ‘education system’, the framework of education
provision that takes children in at the age of around 4 or 5 and enables their intellectual
and emotional development so that they can emerge at between 16 and 25 equipped to
function in society. The education system continues to provide further learning and de-
velopment opportunities for adults. However, the systems that we are usually concerned
with in our research are usually at a far smaller scale than this. For instance, understand-
ing bullying as a set of influences and factors that create the bully and the bullied, or the
conditions and reasons that interact to give rise to a high proportion of nursery teachers
and assistants leaving their posts in their early thirties.
There are two types of system, closed or open.
t A closed system has a boundary, across which there are no interactions whatsoever.
The computer programmes we use are closed systems. They may interact with other
programmes but only within a boundary set by the programmer. Realistically, in
education, we have no closed systems.
t Everything we deal with in education is an open system and has the potential to inter-
act with everything else. The selection of a school may be affected by the location of
the school relative to where the parents live (or parent lives) as well as its educational
programme. A child’s performance at school may be affected by the level of parental
support as well as the quality of teaching. Parental support may be related to socio-
economic status. Socio-economic status may affect where we live. We can represent
this as a diagram (Figure 7.2). This diagram is a first stab at representing a system.
The situation we face in education, as in all the social sciences, is of clusters of strong
interactions with fewer links joining the clusters together either directly or indirectly.
Figure 7.2 shows the elements of a system:
t processes at work, shown by the arrows (for example, socio-economic status
affecting where parents live by virtue of how much they can afford to pay for
housing);
t entities or connections or states (school, parental support) that initiate, create or
influence a process or are affected by a process;
t processes that can be classified as inputs (creating, interacting, influencing – socio-
economic status mentioned in the first bullet point) or outputs (a changed state
that may be an input elsewhere, a child’s performance at school, for example, being
affected by the quality of the school and the type and level of support offered by
parents).
Home
School
location
Socio-economic
status
Child's Parental
performance support
Figure 7.2 School choice and child performance represented in system terms
276 Research methods for Education Part 2 The Process of Data Collection
A system is a dynamic process. We need to reflect this in the way we think about
research. A question we need to ask ourselves, therefore, is ‘What happens when the
system we identify keeps on working?’ Even with a model as crude as that in Figure 7.2,
we can see that the process is likely to produce ‘sink’ schools, that is, schools to which
many parents do not want to send their children. In social terms, this is an unstable state
that will create a strong likelihood of undesirable consequences. In educational terms we
would like to achieve a more stable state where a child’s performance is not adversely
affected by the school they attend or the parents they have. What inputs would we have
to add to Figure 7.2 to achieve this? By asking this question, we can begin to look for
evidence of schools that defy our model and then identify the factors and processes at
work that create the difference.
t First, set out the processes or outcomes identified by other researchers. In terms of
pupil performance, it could be just pupil attainment, or high pupil attainment or
low pupil attainment or value added (by the school).
t Next, identify the inputs to these processes and classify their role. Are they causa-
tive (creating or initiating a process) or do they influence or interfere with a pro-
cess? This sort of classification is often highly judgemental, usually because our
ideas are speculative and what we want to evaluate in our research.
t Then, identify the nature of the impact. Is it positive or negative? Does it predis-
pose decision A, B or C? At the same time assess the strength of the impact.
t Finally, we add in our own conjectures and ideas. At this point, we have the mater-
ial for the final stage.
t Representing: There is no right and wrong way of representing system interactions. The
most simple is shown in Figure 7.2, where the entities, the things that are affected and
the things that cause them to change are shown in boxes, and the processes (inputs
and outputs) are shown by arrows. One way to show all of the information we have
gathered is to use different colours for different studies and superimpose them on one
diagram. Some solutions work well and others less well. If the diagram becomes too
crowded, we have to do something different. What should emerge is something that
represents our view of the research to date and which highlights (and is implicitly a
rationale for) the research we wish to do. Case Study 7.2 gives an example of this.
The process of representing our thinking about a research issue is improved by
practice. Activity 7.2 will help you think in a ‘systems’ way and construct a model of a
research issue.
Transformational Commitment to
leadership school mission
Collective Commitment to
teacher efficacy community
partnerships
SES
teacher commitment would have a modest but sig- In a time when change is effected by introducing new
nificant effect of learning outcomes. ways or working, attainment targets, new organisa-
t Teacher participation in community partnerships tional structures, new leaders and new training pro-
had the greatest impact on achievement. grammes, it is an interesting and valid question and
raises the possibility that we should see leadership as a
An interesting aspect of their conclusion is to question
‘soft power’ as well as command and control.
how teachers can enhance their efficacy and self-belief.
You might look at papers in academic journals and of analysis, start by reading Gary Marks’ paper (2006)
ask, ‘Where is the systems diagram?’ It is true that on low achievement. He sets out the entities for you
more papers do not include it than do. First, we have to represent quite clearly. In the first paragraph there
to exclude all those following a broadly inductive are things to put on your diagram. The section on
approach. For the rest, a systems diagram may be ‘Previous studies’ should enable you to devise quite a
a valid way of representing the research issue. Just full diagram. As you construct it, think about whether
because it was not published should not lead us to there is anything else that you would insert.
assume that it was not used by the researcher(s) to After you have thought about the issue by yourself,
identify the variables they wanted to study and look at the research on attainment funded through the
to show how the variables interacted. This activity department for education in the UK and its predeces-
practises systems thinking by reconstructing the sors. Go to the website and navigate to the research
research issue in a systems format. You can do this and statistics homepage. Follow the link to the re-
with any appropriate academic paper but if you want search page and select, as your keyword, ‘attainment’.
to be fairly certain of coming up with the right sort Can you enrich your diagram through this source?
Chapter 7 CRACKING THE RESEARCH QUESTION 279
Reduction in Unemployment
Delivery
youth reduction
vector
unemployment programme
Drug dependency
Physical incapacity
Intervening
Stress
variables
Access to transport
Value of education
seemed happy, contented, unhappy by observing every 15 minutes whether they were
laughing, crying, smiling, angry, etc. and counting the frequency for each category. An
indicator, therefore is something that we can measure, whereas a variable is not necessar-
ily capable of being measured directly.
This idea of identifying indicators for our variables is important, because there will be
many variables in our education research where the link between the variable and our
data is not clear or is a matter of judgement. We need, therefore, to explain our choice of
indicators. This important stage is the link between the conceptual/theoretical/planning
stage of our research and the empirical/operational stage.
Indicators generate data that are proxies for our variables. That is, they represent
the variables though they may not be exactly the same as the variables. For example, if
we wanted to measure student motivation but only had access to data collected by the
school, we could use the number of times a student was recorded as being late or absent
(on the assumption that poor motivation was more likely to be associated with lateness
and absence). This is what we mean by the indicator being a proxy.
Our research is only as good as the indicators we choose. In most cases the selection
will be uncontroversial but where there is more than one option, the choice has to be
justified. One of the challenges facing teachers is the spread of abilities in their class. One
of the policy decisions that schools take is whether pupils should be grouped by ability in
some or all of the subjects that are taught. Not surprisingly, whether whole class teaching
or grouping students by ability produces better results is a topic for educational research.
In 2005 Judith Ireson and colleagues published a study on the effects of grouping by
ability on pupil attainment (Ireson et al., 2005). While their finding, that when ‘students
follow essentially the same curricula, ability grouping has little impact on attainment’
(p. 454) contributes to the debate about selection and grouping, we are more interested
at this time in their approach to the research and the indicators they used for the concept
of ‘ability grouping’. They needed to measure ‘grouping’ and they chose to reflect two
attributes.
t First, they determined the extent of grouping and developed a two-fold classification
of mixed ability teaching: partial grouping and complete ability grouping. Using this
classification they could assess the proportion of classes that were taught in a group
mode.
t Second, they reflected the extent of a pupil’s experience of grouping, in terms of num-
ber of years in a five-year period.
Using these indicators, they thus developed an estimate of grouping at the level of the
school and at the level of the individual pupil. Their indicators enabled them to measure
the variable they were interested in, ability grouping.
We can appreciate, with this last example, that our variable (the concept of ability
grouping) has become more complex (than, for instance, age or gender) but what we
also have to understand is that there are far more complex concepts that we have to
grapple with in social research. What is quality in education? Is pupil attainment only
concerned with examination grades, or should it include social and personal dimensions
as well? How good is a school, college or university? How effective is a teacher? What is
social exclusion? The level of difficulty in cracking the problem of ability grouping is minor
in comparison with these issues, each of which can be explained in several different
ways. They are much more complex and they have many dimensions. Which ones do we
choose to represent as an indicator and which do we omit? The task is almost certainly
likely to be too great for a lone researcher. Fortunately, many of these concepts have been
researched and standard indicator sets produced.
282 Research methods for Education Part 2 The Process of Data Collection
The indicators that are generated for complex concepts such as those above are called
compound indicators because several indicators come together to represent the different
attributes of the variable. Read Case Study 7.3 for an example of this. The composition of
an indicator is a matter of choice and judgement. The fundamental requirement of any
indicator is that it should validly represent what it purports to represent and that it should
be reliable (in space and time) in representing it. Many composite indicators are collected
by public agencies or bodies and are usually compiled to assess some aspect of public
policy. In this situation it is possible to select attributes or change definitions in ways that
give a positive assessment of the variable. Take what appears to be a simple concept like
unemployment. In the UK, after the Second World War, the calculation of unemployment
was conceptually simple – those registered as unemployed as a percentage of the total of
employees and unemployed. After the boom years of the 1960s, with unemployment
rising, the definition was changed to the number of registered unemployed over 18 as
a proportion of the workforce (that is, employees, self-employed, those in the armed
forces, those on government training programmes and the unemployed). Why were these
changes made? Because they reduced the apparent level of unemployment. The changes
reduced the rate because (a) the numerator, the number of registered unemployed, fell
(the school leaving age was raised to 16 in 1972, so unemployed 14- to 16-year-olds
did not count) and (b) the denominator was increased with the inclusion of the self-
employed, armed forces and those in training). In 1983 the definition was changed again
so only those claiming unemployment benefit were counted. Again this led to an apparent
reduction in unemployment. Clearly what researchers must do when using indicators,
especially composite indicators, is to examine what is included.
Domain Attribute
Compound indicators are used in public policy (including education) in three ways:
1. To define the extent and/or location of a problem, for example, this is why the
Governments of England and Wales, Scotland and Northern Ireland have devised an
index of deprivation that identifies localities where deprivation (evaluated on a range
of dimensions) is high (see Case Study 7.3).
284 Research methods for Education Part 2 The Process of Data Collection
2. For comparative analysis: for example, the Physical Quality of Life Index combines a
measure of adult literacy with infant mortality and life expectancy.
3. For longitudinal analysis, that is, looking at how situations in different countries or
regions change over time, the Index of Social Progress (developed by Professor Richard
Estes of the University of Pennsylvania) combines 40 indicators covering education,
health, economy, environment, social conditions, welfare, demography, arms
expenditure, status of women and cultural diversity. The 2000 report on Europe shows
that Sweden and Denmark had the highest index score (107), Belgium, Netherlands
and the UK had an index of between 95 and 97 and that the average score for all of
the states of Eastern and Western Europe was 87.4, which was marginally above the
score for the USA (Estes, 2003).
Compound indicators are important for educational researchers because we can use them
to provide a single measure of complex variables. Deprivation, poverty, the digital divide
and social exclusion are all issues that impact on education, in terms of education provision,
access to education, experience of education and educational attainment. Each of these is
multi-faceted and multi-dimensional. There is no simple measure for any of them. The Social
and Cultural Planning Office in the Netherlands has used an index to the measure quality
of life of Dutch citizens (Life Situation Index, published biennially as The Social State of the
Netherlands). This index brings together measures of housing, health, leisure activity, consumer
durables, sport activity, vacation, social participation and mobility. The United Nation’s
Human Development Index integrates measures of life expectancy, education and standard of
living. In the UK, the Department of Communities and Local Government produces the Index
of Child Well-Being, a compound of measures to reflect material well-being, health, education,
crime, housing and environment. The Index is calculated for localities and seeks to reflect
the quality of life of children. By using these indicators and many other measures, they can
be associated with such things as educational progression, educational attainment, access to
higher education and even school quality reports. They become, in terms of our categorisation,
an independent variable which is assessed in terms of its impact on the dependent variables
(such as educational attainment and access to higher education).
leave increases with age. Looking back on our sample, we see that our male teachers are
predominantly young and our female teachers are old. Gender may have an effect but it
is masked by the effect of age.
This example shows the danger in much education and social science research. The
issues we investigate are so complex and the influences upon them so multi-faceted that
it can be difficult to isolate the influence of one variable. As a research community we
have coped with this in two ways:
First, we have developed (and used) a greater range of qualitative approaches to gain
insights into the issues (see Chapter 11). If we are concerned to understand more about
low achievement, we could gain a deeper understanding through case studies of family
lifestyles than we could by profiling the socio-economic characteristics of low achievers
and their families. It is partially because of the complexity of education that we have seen
such an increase in qualitative studies in the last 20 to 30 years.
The second way we try to manage complexity is by developing more powerful ana-
lytical (usually statistical) tools (see Chapter 13). For example, we know that processes
operate at different levels. A pupil’s overall academic performance is affected by what the
school does as well as by what is within their control and family circumstances. Recogni-
tion of this situation throughout the social sciences led to the development of multi-level
analysis. These powerful techniques allow us to gain insights into complex issues in ways
that help us to shape policy actions. Read Case Study 7.5 for more about this.
286 Research methods for Education Part 2 The Process of Data Collection
So far we have found that if we are to have good information, we need to:
t find the right people to give it to us, that is, people who are knowledgeable about the
issue and who, by being representative of a group, allow us to make valid statements
about that group;
t ensure that we have broken down our research question into smaller and smaller
questions so that we can identify the variables we will use to answer our questions
and the indicators that will reveal those variables to us.
Now we are going to begin the process of finding out how we can actually collect this
data. We have looked already in Chapter 4 and seen in some detail the wealth of second-
ary data that is available, with primary data the options available to us are really quite
wide ranging, from obtaining our data in a highly structured way to obtaining them
using more flexible and opportunistic means. Some of the decisions we make about
which method to use are influenced by the nature of the environment we are working in
and the nature, particularly the sensitivity, of the issue we are investigating. For example,
if we are investigating an aspect of institutional organisational failure, it might be unwise
to announce both the purpose of our research and our presence because the tendency of
people who are involved in failure is to gloss over or even suppress the evidence. If, on
the other hand, our investigation involves dealing with school children then we have to
make clear the issue we are studying, who we are and the methods we will use not just to
the school but to the parent/s as well. In both cases, we have to understand that there are
critical issues about the subjects knowing the purpose of our research. As we have sug-
gested, though, we might come to different decisions in different circumstances.
However, our choice is influenced not only by the relationship between the research
topic and the research environment, but also by the nature of the data we expect to col-
lect. The data collection methods we can use are not equally appropriate for all types of
data. We cannot even say that if we are collecting data on behaviour, use this approach,
and if we are collecting data on opinions, use this other method, because some opin-
ions, for example, are best revealed by behaviour rather than being expressed in writing
or through speech. And, in addition to all of this, we have to judge whether we have the
time or the resources to use a particular method. It all seems very complex but in real-
ity the selection of an approach is usually straightforward and in most cases we do have
alternatives. On top of this, the people we are dealing with are usually only too willing
to help – though this, as we shall see, can be a problem on some occasions.
t Getting data from a t Opening up an issue. t Preliminary research. t Obtaining data from those
large number of people. with limited verbal skills.
t Group interviews and discussions: a session led by one or more researchers which
involves probing groups of respondents using a question, discussion or stimulus/
response format.
t Observation: where the researcher, who may or may not be known to the subjects,
collects data using approaches that range from structured to opportunistic. Observa-
tion takes place in or captures real time in the research environment.
Interviews also take place in real time though the location of the interview need have
nothing to do with the issue being researched. Group sessions are likely to occur in an
environment removed from the research issue. Questionnaires are flexible in terms of
when and where they are completed. There are other data sources we can use, for example,
documents, but obtaining secondary data is more of question of assembly than collection.
As we shall see there are variations within each of these four categories and we shall
meet these in the next two sections. At this point we need only to know the circumstance
in which each method is best used. Table 7.3 sets these out. However, before we discuss
what these situations are, we should remember that if the circumstances require us to
use a less appropriate alternative we have to reach a decision whether the compromise
in data quality is so great as to undermine the validity of the measurement (and of our
analysis and interpretation).
t Questionnaires: The principal situation in which we use questionnaires is when we
want to access a large number of people. Scale is particularly important for hypothesis-
based quantitative research where we need minimum numbers in categories in order
to draw statistically valid conclusions. In this situation, where we have to process
large amounts of data, we need that data to be structured. Questionnaires do this
admirably. (Chapter 8 considers questionnaires in detail.)
t Interviews: Interviews can range from asking different respondents a series of common
questions to a conversation around a topic. Their most important characteristic is flex-
ibility. If a question is misunderstood, it can be rephrased. If a point is made that throws
new light on an aspect of interest, then we can explore it further. It is this ability to re-ask
questions that helps researchers establish whether they are being told the truth or a ver-
sion of the truth that places the interviewee in a good light. This ability to probe and
open up an issue is not the only reason for using an interview approach. The particular
strength of the interview is that the respondent’s voice is heard and the words chosen, the
Chapter 7 CRACKING THE RESEARCH QUESTION 289
phrasing used, the pauses and exclamations, and the tone of the reply could all nuance
the interpretation of the response. (Interviews are dealt with in detail in Chapter 9.)
t Group interviews (focus groups) and discussions: Focus groups can be used at an
early stage in an investigation to obtain insights prior to constructing a questionnaire.
As a method of collecting data they can be used to gain a collective perspective speed-
ily. Because a focus group is discussion-based, it provides a way of establishing how
stable people’s views and opinions are. Its real strength, however, is that it can be used
to explore the deeper significance of the responses given by members. It can be used to
determine motives and values but this usually requires an approach that goes beyond
discussion. (We look at focus groups and how to manage them in Chapter 9.)
t Observation: Direct observation of people’s actions, behaviour and attitude is an
alternative to asking them about it. Observation is preferred when we are not sure
that verbal responses (often from memory) will be accurate. This is especially true if
the variable we are interested in alters in intensity or duration and not just presence
or absence. For example, observation will show us how long a pupil’s attention span
was before signs of boredom set in. We can use observational approaches as well with
groups whose ability to verbalise is limited. This would include young children and
infants and people with profound or severe learning difficulties. (Observation is dealt
with in Chapter 9.)
While it is appropriate to see these approaches as alternatives best suited to particular
circumstance, we should also look at how they can be brought together in our research
strategy to deliver better quality information. If our research is into awareness of drugs
and drug use amongst 13- to 15-year-old children, a questionnaire given to an appro-
priate sample would give us the broad picture but a focus group would give us deeper
insights and a richer picture. In this way we can combine scale and depth in our analysis
and demonstrate that our detailed insights are applicable to a wider population.
we are introducing the idea of goodness in explanations at this stage – it has to be part
of every researcher’s consideration before data collection begins. This is why, in this
chapter, we explained how to break down the research question. Case Study 7.6 gives
an example of goodness in explanation at work. Goodness in explanation is actually
quite a complex area of scientific philosophy which this short paragraph hardly does
justice to. For more detail, read Peter Lipton’s book on the topic (Lipton, 2004).
How do these two conditions actually work? If we are monitoring interactions
amongst pre-school children with a view to creating a typology of social development
and if one of our measurements is ‘spoke with other child/children’, then this is likely
to be a fairly blunt (and ineffective) measuring tool because it will not differentiate the
child who spoke once from the ones who spoke many times, or the one who spoke to
one other child from the one who spoke to different children. The consequence of this
is that our second way of looking at goodness, goodness in explanation, is unlikely to be
met because our data will not allow us to show how types and amounts of interaction
are associated with types and levels of social development.
Before we move on to look at data collection in detail (in Chapters 8 and 9), we
should be aware of how our data collection can go wrong. Goodness in measurement
has two components – accuracy and precision. Together they ensure that our measure-
ment of data is valid.
t Accuracy refers to the way in which we describe the attribute in which we are inter-
ested. For our measurement to be accurate, there must be a constant relationship
between our measurement and the true value.
t Precision means that we or other researchers, using the same approach to gather data,
should be able to replicate our measurements. This, of course, assumes that the context
in time and place is unchanged. If it is not, then the change itself could be a variable.
Table 7.4 shows how accuracy and precision can interact in any investigation. Accuracy
and precision have two states, either they are accurate or not accurate or precise or not
precise. There are four possible combinations of these states. These combinations are
shown as ticks (when data are accurate or precise) and crosses (when the data are not
accurate or not precise). Out of the four possible combinations, only one is acceptable for
analysis and for creating an explanation – when our data are both accurate and precise. In
all other situations we have a problem. Imagine a dartboard or circular target, our object
is to hit the centre. If our aim were precise but not accurate, our darts would be clustered
away from the centre, for example in the outer segment of number 1. If they were accurate
but not precise, they would be distributed within the inner ring. If they were neither precise
nor accurate, they would be spread randomly over the board. Only when they are clustered
in the red bulls’-eye can we reasonably say that they are both precise and accurate.
It is important, therefore, that we should be aware what the problems of measure-
ment can be before we start.
Precise ✓✓ 3✓
Not Precise 3✓ 33
292 Research methods for Education Part 2 The Process of Data Collection
collection is specified in advance. This usually requires some preliminary work and
pre-testing of the methodology. The same sort of problem can arise with a single in-
vestigator, who allows the basis for data collection to evolve.
t A second thing that can generate measurement inaccuracy is an interaction effect be-
tween researcher and those being researched. A desire to please may lead to a respond-
ent giving more positive answers to questions. Hostility can have an opposite effect.
Knowledge of being watched can lead to people behaving differently from normal. (We
will see how to manage this interaction effect at other points in Chapters 8 and 9.)
t Inaccuracy also occurs when the measuring instrument is erratic. A thermometer that
does not register the correct temperature is of little use. Researchers can find that they
have constructed a broken thermometer at any time but its likelihood is greater in
two circumstances, the first, when we measure attitudes on a numerical scale and the
second, the construction of composite indicators. The problem with attitude scaling
is that we can never be sure that one person’s ‘very’ is not another’s ‘quite’. Imagine
two students doing a mathematics test. They get the same answers for each question
and therefore the same test score. They were also asked to judge each question (the
research was into whether the range of questions sufficiently discriminated between
those with a high level of capability and those who were less capable). Where one
answered very difficult or easy, the other answered quite difficult or easy. The inference
is that they are making the same judgements but that they are using different terms
to describe them. The problem for us is that these terms constitute our measurement
scale. There are ways of establishing valid scales but they are not always applied. If you
think that this is not important, then look at John Richardson’s research on response
bias (Richardson, 2012). He examined student responses to a Course Experience
Questionnaire and a Revised Approaches to Studying Questionnaire and found
two biases, an acquiescent bias in which students showed a tendency to agree with
questionnaire items and an extreme response bias, the tendency to use the extreme
points of the scale. Fortunately the biases did not invalidate the findings with respect
to student opinions but they did mean that those opinions were not always accurately
measured. With composite scales there are two possible sources of error – what is
included and how the elements are combined. For example, just how do we estimate
value added in academic development and how do we combine this with something
that is just as difficult to measure, the social responsibility of pupils? For some
researchers, these are crucial issues and the best test of the validity of their estimates is
the critical assessment and approval of peers.
Part 1 Part 2
As a research co-ordinator, you have to assign question- Having thought about the factors that you should
naire administrators, interviewers and group leaders take into account in pairing data collectors with
for data collection. You appreciate that for interviews, data providers, put the following term into any Web
questionnaires administered by interview and focus search engine: interviewer response effect. Make a
groups there is a social interaction between those col- list of all the interactions that researchers have tested
lecting the data and those giving it. You are also bound and their significance.
by legislation (and good practice) relating to equality
of opportunity for the people who apply to work for
you. In assigning people to the following data collec- Part 3
tion tasks, what would inform your decisions?
The final task in this activity is to identify at least ten
1. A survey of voter intentions in a multi-ethnic com- papers dealing with an aspect or aspects of education
munity. that interest you and which use interviewers or group
2. The lives of women in a first generation, non-host facilitators. For each one, consider whether there
language speaking immigrant community. could have been an interviewer response effect. If so,
3. Rural lifestyles. is the possibility examined by the author(s)?
suppositions and perceptions while a focus group (or interview) is more likely to
capture the feelings and concerns of the workers. The likelihood of error is greater
with the first than with the second.
While there are ways of trying to overcome the difficulties set out above, the worst
problem we face as a research community is not being sufficiently critical of the research
output. We should be sure that the data that are produced are a valid portrayal of the
issue. For this we need to be able to judge the selection of respondents and the appropri-
ateness of the procedure used to collect the data. To do this, we must have access to detail
that is often lacking. Transparency is crucial if research is to be convincing. Activity 7.3
takes us further into the issue of accuracy and transparency.
Represent the
research issue
Define variables
Deterrnine indicators
of the research issue or question. We have seen that there are many potential sources
for this but most will take us through a literature search (the theme of Chapter 5). The
next stage sees two decisions made in tandem. We determine our research approach
and we dissect the research question. Unless our approach is pure induction, we should
think of representing our thinking in diagrammatic terms. This allows us to define
the nature of our variables. Finally, we have to deconstruct our complex variables to iso-
late the attributes necessary to resolve our research question. This allows us to determine
the indicators.
While this may appear to be a straightforward sequence, we should bear in mind that
solving our two areas of uncertainty, the right sources and the right data, are inextricably
connected. We have to move between them, making sure that the decisions we make for
one are compatible with the other. If they are not, then we must revisit our decisions
and remove the incompatibilities. Only when we are confident that our procedures are
robust can we move to the next stage, actually collecting our data.
It is important to understand that the foundations of our research should be deep and
solid, so at this point we shall review what we have to do to ensure that this is the case.
t Is our research driven by moral principle or ethical purpose? We reviewed some
positions that researchers have taken in Chapter 2. The desire to achieve change in a
situation is no bad thing but it presents dangers for researchers – of preferring some
data to others, of interpreting results in line with our views when other interpretations
Chapter 7 CRACKING THE RESEARCH QUESTION 295
are also possible. The defence against this is to make our position clear and to ensure
that our methodology is robust.
t The way we approach our research is influenced by a number of factors:
t Do we believe that all research (or all research on this topic) should adopt a spe-
cific paradigm?
t Do we think that we should adopt paradigms used by others?
t Do we think that our approach should be determined by the circumstances of the
issue, including the availability of data?
We looked in detail at quantitative and qualitative paradigms in Chapter 3. The issue
for us as researchers is not about which one should we choose but whether there is no
choice (because the one we use is determined by how we think research should be con-
ducted). If our choice is not constrained, what we do is a response to the situation we
face. The flexibility this implies, however, and the compromises we may be forced into,
create dangers.
t Are we clear about our research question? We looked at this in Chapter 2, where we
related types of research question to particular research approaches. In this section
we saw how important it was to break down the research question so that we could
identify the variables at work and the indicators that would yield data.
t In Chapter 6 we saw how important it was to understand just how the data sources
we would approach were representative of something. This is important if our re-
search conclusions are to have any value or worth in another situation.
t In this section we learnt that the procedures we use to collect data can influence the
quality of the data that we collect. Another pitfall for us to negotiate!
The consequence of all of this for our research is that we should move backwards
and forwards, testing out our possible courses of action. If the research process has deep
foundations, our own research methodology should have foundations that are equally
deep. The consequence of this is that data collection should not proceed until all the
ramifications have been explored and a way forward emerges that will produce good data.
Summary
t The quality of our research is determined by the quality of the data we collect.
Quality data comes from the right people and is the right data to answer our
research question.
t The link between the research question and the data we need has been the theme
of this section. Modelling the research issues was introduced as a way of ensuring
that we get to the right data.
t We saw how the process of ‘questioning the question’ enabled us to identify factors
and influences that might be at work in our issue. We learnt that the proper term
for ‘factors’ and ‘influences’ is variable.
t Variables can have different dimensions. For every dimension that is relevant to our
investigation we need an indicator. Indicators describe variable attributes. We can
draw on established compound indicators for complex variables.
t We judge how good our data are in terms of accuracy and precision.
296 Research methods for Education Part 2 The Process of Data Collection
Further reading
Systems modelling The paper by Bela Banathy and Peter Jenlink is an excel-
Checkland, P. (1999) Soft Systems Methodology: A 30-Year lent overview of systems approaches and an explora-
Retrospective and Soft Systems Thinking, Systems Practice, tion of how they might be used in education research.
Wiley, Chichester.
Goodness in explanation
Banathy, B.H., and Jenlink, P.M. (2004) ‘Systems Inquiry
and its Application in Education’. In D.H. Jonassen Lipton, P. (2004) Inference to the Best Explanation, Rout-
(Ed.) Handbook of Research on Educational Communica- ledge, London.
tions and Technology (pp. 37–57), Lawrence Erlbaum, Peter Lipton’s text on scientific philosophy is readable
Mahwah, NJ. and the arguments he sets out are attractive and reflect
The name ‘Peter Checkland’ is synonymous with systems the way that, intuitively, many researchers appear
modelling. His work focuses on ‘soft systems’, essentially to work. The argument rests on the link between
a process developed to cope with questions or problems explanation and evidence. Assuming that the evidence
where there is no clear answer. This, for instance, could is correct, then out of all explanations there will be a
be, ‘How do we reduce teenage pregnancy?’ In this book, ‘best explanation’. Much of social science research is
Checkland describes the origins and development of soft likely to work within this framework. If circumstan-
systems thinking as a preface to the text published tial evidence of a crime points a finger at a particular
20 years ago. The process he describes is easy to under- person and no others, the best explanation we have
stand and highly applicable to educational research. is that they did it. What this comes down to is that in
seeking to explain something, we collect data that we
use to accept that our explanation is correct.
References
Estes, R. (2003) ‘European Social Development Trends: Mayston, D. (2006) Educational Value and Programme
Development Challenges of the “New Europe”’ in Evaluation, Research Report RW87, DfES.
Vogel, J. (ed.), ‘Good Times and Hard Times in Richardson, J. (2012) ‘The Role of Response Biases in the
Sweden During the 1990s’, Report 100: Living Relationship Between Students’ Perceptions of Their
Conditions Series, Statistics Sweden, 435–468 Courses and Their Approaches to Studying in Higher
(available at http://www.sp2.upenn.edu/~restes/ Education’, British Educational Research Journal
praxis/Estes_Papers.html). 38(3), 399–418.
Ireson, J., Hallam, S. and Hurley, C. (2005) ‘What are the Ross, J. and Gray, P. (2006) ‘School Leadership and
Effects of Ability Grouping on GCSE Attainment?’, Student Achievement: The Mediating Effects of
British Educational Research Journal, 31(4), 443–458. Teacher Beliefs’, Canadian Journal of Education
Koskela, M., Killti, P., Vilpola, I. and Ternoven, J. (2005) 29(3), 798–822.
‘Suitability of a Virtual Learning Environment for Stokes, A. (2007) ‘Factors Influencing the Decisions
Higher Education’, Electronic Journal of e-Learning, of University Students to Become High School
3(i), 21–30 (available at www.ejel.org). Teachers’, Issues in Educational Research, 17(1),
Lipton, P. (2004) Inference to the Best Explanation, 127–145.
Routledge, London. Usang, B., Akuegwu, B., Udida, L. and Koley, F. (2007)
Marks, G.N. (2006) ‘Influences on, and the ‘Academic Staff Research Productivity: A Study
Consequences of, Low Achievement’, Australian of Universities in South-South Zone of Nigeria’,
Educational Researcher, 33(1), 95–115 (available at Educational Research and Review, 2(5), 103–108
www.aare.edu.au/aer/contents.htm). (available at www.academicjournals.org/ERR.
This page intentionally left bank
Chapter contents
Introduction 299
Summary 336
Further reading 337
References 337
Chapter 8
ASKING QUESTIONS
Learning themes
t The types of question we can construct. By the end of this section you will:
t How to administer questionnaires and get t Understand what types of information we
them to respondents. can gather using questionnaires and rating
scales.
t How to generate a good response.
t Be able to construct an effective
t How to collect good data. questionnaire and rating scale.
t Thinking about data analysis as part of t Be able to determine an appropriate way
questionnaire design. of administering a questionnaire for any
research situation.
Introduction
Questionnaires are amongst the most popular asked and there are many examples of poor
of data gathering instruments. We encounter layout, such as routes through the question-
them on a regular basis in our daily lives. They naire that take respondents to a dead end be-
come unsolicited through the post and appear fore the questions are finished and questions
as pop-ups on our computer screens. With so that do not allow for all possible responses.
many questionnaires in existence, we might Like a scalpel in a surgeon’s hands, a question-
think that they are easy to put together and naire needs a sensitive touch. In this section
effective as a way of obtaining data. How- we shall learn what to do in order to produce
ever, one of their most common character- a questionnaire that will get us good data to
istics is that they are poorly constructed. We work with.
can find problems with the way questions are
300 Research Methods for Education Part 2 The Process of Data Collection
First we will look at the three basic elements of a questionnaire, the information we
can collect, the question types we can use and getting a gradation of responses by us-
ing scales. Constructing a questionnaire is obviously much more than putting these ele-
ments together and we shall consider the things that will make a questionnaire an effec-
tive research instrument in the section on practice.
we might initially think that the environmental impact means that we should buy things
that are locally sourced. However, if the questions also cover what we know about the
system of production and the significance of the trade for the lives of those who work in
the sectors, we might begin to think that the economic development and the impact on
people’s life chances for poor parts of the world outweigh the environmental considera-
tions. Completing such a questionnaire may affect the position we adopt.
As researchers we use open questions to do certain things.
t First, we can use them to obtain a richer picture of some aspect under investigation.
If we were researching the demand for early years childcare, it would be important
to understand what factors influenced parental choice. A question such as, ‘Please
describe how you went about selecting your childcare?’ would give us the data in the
respondents’ own words (and the space we leave in the questionnaire is an indication
of the detail we want).
t Second, we want to be sure that the structured questions we have asked have not
omitted a significant response. We can often do this by adding a category ‘other’ to
our list and asking those who mark it to give some detail. On other occasions we
can ask a supplementary question. If part of our questionnaire is concerned with the
workload of teachers and we have asked questions on the number of hours worked
in the past week and what they did, we could finish the section by asking them how
typical this was and their views on their work/life balance.
t Third, we usually want to use direct quotes from respondents that they give in open
questions to reveal insights and to give personality to a written report. Not only does
this provide an authentic voice, it adds emotion and passion and enables us to con-
vey in a powerful way issues and perspectives that are important to the interpretation
and explanation.
Table 8.1 gives examples of open and closed questions. All the closed questions
give no freedom to respondents to reply in their own ways. Closed questions can be
used for straightforward categorisation (Are you male/female?), to collect information
about which people might feel sensitive if you asked them to give specific quantities (for
example, income, age) and to extend people’s thinking about an issue (factors influenc-
ing university choice). We should be aware that responses to some questions (such as
income) frequently contain an aspirational element. We should check whether there has
been overstatement by benchmarking our results against national surveys of income.
Open questions can be structured (limiting people to three factors) but, whatever for-
mat, all allow people to use their own words. This, however, is an issue if people’s ability
to express their thoughts is limited. Certainly age is a factor to consider here; responses
from young children should normally be obtained in ways other than questionnaires.
And if we are researching learning disability, we should consider whether questionnaires
are the most appropriate means of collecting data. Language capability may also be a lim-
iting factor in using questionnaires if we are dealing with lower socio-economic groups
or people with low educational attainment. It certainly will be a factor if a response is
expected in a language other than someone’s first language (for example, if a South Asian
or East European parent whose English is limited is asked to respond in English).
Our choice of whether to use open or closed questions should be determined by the
issue, by the respondents we expect to have and by the balance between the two types
that we want in our questionnaire as a whole. There is no compelling evidence that one
or the other is better at collecting quality data. Indeed, the evidence is rather more robust
in suggesting that they are equally good. An American research group led by Howard
302 Research Methods for Education Part 2 The Process of Data Collection
t Are you: male t What do you think are the three most important
female things that make for a good lecture?
1.
2.
3.
t Please indicate how much income from all sources t What are your favourite school meals?
(e.g. from parents, regular part-time work) you
receive each week
under £5
between £5 and £10
between £10.01 and £20
between £20.01 and £30
between £30.01 and £40
between £40.01 and £50
between £50.01 and £70
over £70
t Please identify the five most important factors in t Tell us how the building work at school affected
terms of their importance in choosing a university you.
at which to study. Place 1 next to the factor you
think most important, 2 next to the second most t What do you think are the principal issues that the
important and so on. college has to address?
t The university’s reputation
t The department’s reputation
t The research done in the department
t The department’s quality assurance rating
t The number of applications for each place
t What the department says about teaching and
learning
t The success of students in getting jobs
t The quality of the social life
t The town it is in
t How close it is to my home
Schuman explored this issue in the context of how people perceived the threat of nu-
clear war (Schuman et al., 1986). They concluded that while open questions produced a
larger range of responses, the categories they formed were small in size. In other words,
the responses were highly individual. If there is interest in obtaining a greater breadth
of response within a closed question structure, the method is to add ‘other’ as a category
and to ask respondents to add detail.
While Table 8.1 gives a hint about the types of information that we can collect, it is
worth going into this in more detail. We will do this in the next part of this section.
Research issue –
Hoped-for information
problem
Research issue –
Information
answer
Chapter 6.) The framework is shown in Figure 8.1. Our purpose in asking questions is to get
at the information that we know (or hope) exists and which will help us resolve our research
issue. The information we hope exists is defined by the research issue. We identify the infor-
mation that we think will be useful to us and phrase our questions to obtain it. Once we have
asked our questions, we collect data. The purpose of our analysis is to release the information
content of the data and use it to provide some sort of answer to our research problem.
With this background, let us now look at what types of information we might be in-
terested in. Table 8.2 shows types of information and the question type that can be used
to gather it.
category as does expressing preference for a set of alternatives. For example, faced with a
major financial deficit, a university might ask lecturers to prioritise possible actions such
as asking full-time staff to teach more hours, asking full-time staff to have more students
in their classes, deciding not to run low enrolment classes – all in the expectation that
the salary bill for part-time staff would fall. We could ask pupils whether they approve
of school policies on bullying, exclusion, unauthorised absence and we could present
them with complex scenarios that test their judgements in ambivalent situations (a girl
excluded for persistent lateness and non-attendance resulting from her taking on the role
of primary carer for her siblings – and keeping the situation to herself).
Understanding the types of information we can access gives us a different perspective
on questionnaire design. It helps us think about our research and places at the fore-
front of the process the need to answer the question, ‘What do we need to know?’ By
understanding that there are types of information, we can ask ourselves which ones we
might need in our research and once we have identified the need for a particular type of
information, it (a) is often far easier to identify who might be in a position to give it to
us and (b) makes it more straightforward to draft the questions. Activity 8.1 provides an
opportunity to think about these issues.
The context for many of the decisions that school you here? As you start to untangle the issues in your
teachers have to make and the actions they take own mind, go back to the information framework
about the children for whose education they are and ask:
responsible are the lifestyles and social and family
t What facts do I need to know about their lives and
circumstances of those children. While this context
lifestyles?
is important at all ages, it is particularly potent for
t For which issues do I need to show whether their
young people on the edge of adulthood. Sometimes
understanding is accurate?
researchers want to know aspects of this context in
t Are there any schemas or mental constructs that I
order to better understand the issue (for example,
need to expose?
if the issue were under-performance, it would be
t Do I need to ask them to analyse situations or
valuable to know about paid work, socialising and
events? If so, where will this help meet my brief?
exposure to drink and drugs). It is also valuable for
t Are there any areas where appreciating the young
teachers, especially if they have a pastoral role, to
people’s values are important?
have insights into the lives teenagers lead.
Imagine that you have been asked by a teachers’ In your exploration of the issue, you may decide that
union to design a questionnaire that can be used by some types of information are not required. If so,
teachers to appreciate the lifestyles of their pupils. explain to yourself how and why you reached this
The intention is that head teachers should be a able decision.
to authorise its use in their schools. You are not re- Because understanding the lives of young people is
quired at this stage to produce the questions. What so important, you will not be surprised to learn that
the Steering Committee of the project wishes to a survey similar to the one you have been thinking
see are the types of information you think will give about was conducted in 1998 and 2003 in the UK
teachers the insights they require. Using the frame- (except Northern Ireland). You should look at the 2003
work outlined in section 9.3, outline the information Young People’s Social Attitudes Survey questionnaire
you would collect to indicate the young people’s so- to see what sorts of information the researchers felt
cial life outside school. For this age group (16–18), the was required and compare it with your own. The two
Steering Committee has also asked for some sense of will almost certainly be different but that does not
how prepared young people are to enter an adult mean that yours is not as good. The Young Person’s
world. Your initial thoughts are that you might ex- Social Attitudes Survey is available at: ukdataservice.
plore their knowledge of the political system, their ac.uk/get-data.aspx. For background on youth policy
commitment to family and willingness to share in and studies in the Netherlands, visit the website
family life, their feelings about prejudice and their of the Netherlands Youth institute at http://www
moral values. What sort of information would help .youthpolicy.nl/.
306 Research Methods for Education Part 2 The Process of Data Collection
Mark a point on the line above to indicate how much you like to study
mathematics
Mark a point on the line above to indicate how much you like to study
mathematics
rely on the words we use sitting in a semantic space that is more or less common to all of
us. Subjects are asked to make a mark on the line to indicate their own position (on liking
to study mathematics). This may give a spurious sense of accuracy. How precise should
the measurement be – to one or two decimal places? There is also the question of how
long it would take to extract the data from the measurement scales. We would have to use
a ruler to measure each response and for many researchers this is one task too many.
t Second, we can define the scale in terms of a numerical progression. This is shown in
Figure 8.3 (a)–(e). Words still define the end points of the scale but steps are constructed
so that the scale is broken up into unit lengths. We intuitively know that the steps in (a)
and (b) are different in size because the length of the scales is the same (both begin and
end at the same point) while the number of steps is different. Example (c) shows an-
other way of representing the scale, as spreading out from a mid-point. Numerical scales
like these have the advantage for researchers that the data is easy to extract. The data are
coarse (there are only five or seven steps between the extremes as opposed to the infinite
number of steps in Figure 8.2) but there is a very real question of our ability to differenti-
ate attitudinal and value-based conditions at fine levels. There is another problem that
we have to face and that is whether the distance between the scale items (that is between
1 and 2, 2 and 3, and 3 and 4, etc.) is always the same. If this is confusing you, then think
1 2 3 4 5
(a) On the scale above, mark how much you like to study mathematics
1 2 3 4 5 6 7
(b) On the scale above, mark how much you like to study mathematics
−1 −2 −3 0 1 2 3
(c) On the scale above, mark how much you like to study mathematics
(d) On the scale above, mark how much you like to study mathematics
(e) On the scale above, mark how much you like to study mathematics
back to what we said about measurement scales (see section 4.4). Are we measuring on
an ordinal or an interval scale? Unfortunately for us it is actually the former and this will
affect the type of numerical manipulation and analysis we can perform. For instance, it
is advisable not to calculate average values until the data has been transformed. This,
however, becomes less of a problem as sample numbers increase.
t Third, we can construct the scale using words (see Figure 8.3 (d) and (e)). Like the use
of ordinal numbers, this creates a series of units in the scale. However, the problem
that was apparent with the numbers is even more pronounced with words as the scale
units because there is no pretence at equal intervals. In fact, we have to start from
the positions that the intervals are not equal. This does limit our ability to analyse
the data unless it is manipulated to allow us to do so. This can be done through a
procedure referred to as item analysis that looks at the pattern of responses for each
scale. This, however, is a topic for another text.
All we can do is highlight some of the issues to consider so that they can be considered
in the context of a specific research problem.
t Do we need specifically to identify a mid-point? If so, we should use a numerical
scale with a zero mid-point or a verbal scale with an apparent mid-point. But we
should remember that a mid-point can be a ‘lazy option’. We look at this issue in
greater detail when we consider scaling methods.
t It is possible to have the best of all worlds by using a scale with an uneven num-
ber of scale positions (5 or 7 are common). The third or fourth scale positions (see
Figure 8.3 (a) and (b)) can be variously interpreted as a mid-point or a point where
the respondent is ambivalent between the options.
t Should we use words or numbers? Numbers have the advantage that we understand
them and we can make the assumption that people with the same view will mark the
same position. When we process the numbers, however, what do they represent other
than being closer to or further away from an end point? We find it easier to describe in
words than numbers. If we use a verbal scale we must take steps to demonstrate that
people use descriptors such as ‘a little’, ‘not much’ and ‘very’ in much the same way.
t Do we want to minimise the number of key-strokes when we transfer our data from
the survey form to a digital file? If so, we should use a simple numerical scale (1–5 or
1–7) – half the key-strokes of a scale with a neutral mid-point!
t How many scale positions should we use? The more scale positions we have, the
more potential there is for people to use them differently. Individually we tend to
lose the ability to discriminate between scale positions and the number increases.
Conventionally, few scales are longer than seven scale units. Seven and five scale
units are common. For children, even these scales may be too long. We look at this
issue in greater detail below.
310 Research Methods for Education Part 2 The Process of Data Collection
We are now going to look at the process of developing a questionnaire. This is far more
than just writing a series of questions related to the research issue. It requires that we
work at two scales, at one extreme deciding on the appropriateness of the individual
words we use and at the other ensuring that the layout and appearance of the question-
naire is both meaningful and attractive to respondents.
t be aware of our respondents’ appreciation of the issues – parents, for example, may
be able to explain how they chose a school for their child but may find it difficult to
comment on what principles a school should use to select its pupils;
t be aware of our own reading capability and understanding – do not use jargon, tech-
nical words, abbreviations that are part of your world but are not likely to be known
by your target respondents.
Questionnaire 1 Questionnaire 2
2 10
314 Research Methods for Education Part 2 The Process of Data Collection
percentages in the response categories are percentages of those who chose to respond,
not of the whole sample.)
t Second, the proportion supporting a legal penalty for euthanasia (and, by impli-
cation opposing euthanasia) in questionnaire 2 was far higher than those who ex-
pressed a negative attitude in questionnaire 1.
t Third, suppressing the ‘don’t know’ category forced those with this attitude either not
to respond or to oppose euthanasia. Whoever said that attitudes were stable?
The implications of these two studies are significant for questionnaire design. They
show quite unequivocally that:
t The tone of the question (general versus specific) can influence the number of re-
spondents who are prepared to voice an opinion.
t The way we phrase the question can affect the responses we get. This is particularly
true if the issue polarises opinion.
t ‘Don’t know’ as a response category is a lazy option for some respondents. If we use it
in our questionnaires, we need to be certain that it is a genuine option and we should
develop a device that seeks to prevent a ‘lazy response’ by our subjects. For example,
we could ask them to explain why they marked the ‘don’t know’ category.
For a third example of the impact of question phrasing, read Case Study 8.1. Perhaps
more than the others, this brings home the significance of the way in which we use words.
From these examples, what are the issues that we have to be aware of?
1. Where there is confusion about an issue (praying and smoking) or where the issue is
emotive, the way we word questions is important. This does not mean that it is not
important for other questions, only that it is less likely to have an impact.
2. Different ways of phrasing questions (positive/negative, reflecting general attitudes/
exploring penalties) may tap different dimensions of a complex issue and may not be
true alternatives to each other.
3. The whole question of the words we use is surrounded by a high level of uncertainty.
While we can understand that people are more prepared to allow things than to for-
bid them, circumstances may change this. High levels of concern about terrorism, for
example, might see more people prepared to forbid free speech. Where there is potential
Chapter 8 ASKING QUESTIONS 315
for uncertainty, this must be tackled before the questionnaire is given to people. This
is an important issue that is only rarely reflected in published research that uses ques-
tionnaires. Where there is a possibility that it exists, good practice suggests that ques-
tions ought to be asked in different formulations at a pre-test stage and only used if the
responses are more or less the same. If they are not, then more investigation is required
before an appropriate version of the question materialises.
sciences. STILE is a specialist database: Statistics and Indicators on the Labour Market in
the e-Economy. Its archive is structured by language, by country and by topic. While many
of the substantive questions are of little relevance to educationalists, diligent searching will
identify questions on, for example, training, equal opportunities, health and safety and
such contextual areas for education as social exclusion and flexible working. The Finnish
Social Science Data Archive allows searching and browsing by title, by topic and by key
word. Topic themes relevant to educationalists include child research, education, educa-
tional psychology, educational research, family studies and youth studies. Data are not
available on-line but have to be ordered.
1http://epp.eurostat.ec.europa.eu
2http://www.uis.unesco.org/template/pdf/wei/manuals/UOE2007manual_volume%201.pdf
questionnaires. The research team then analysed the patient responses from all surveys
and were able to show that:
t the outcomes as measured by the questionnaire reflected the measurements of other
health outcome questionnaires;
t the level of variation in patient responses to individual sets of questions was low;
t when patients were given the questionnaire on a second occasion, shortly after the
first, their answers were largely the same; and
t when patients whose coughs had improved were given the questionnaire again, the
improvement was reflected in their responses.
The conclusion we draw from this is that when a questionnaire is an effective instru-
ment for collecting data in one language, it is likely to be as effective in another language
if it is well translated, that is, when linguistic equivalence is established. This issue is
one that should particularly concern educational researchers dealing with cross-cultural
international comparisons.
what may well be a draining experience. Against this, it is an opportunity, because of its
neutrality, impersonality and distance, for people to express their feelings. Questioning
people about sensitive issues (such as how teachers and classmates cope with the death
of a pupil, exploring a person’s sexuality, assessing the extent of violence in the family)
raises quite significant questions of ethics. Will any of the participants be damaged by
the experience? How secure is the information from prurient eyes? What should the
researcher do if there is evidence of unresolved harm or criminality? These questions
would daunt even seasoned researchers and so are unlikely to be within the capability
of those just starting out. There are, however, issues of principle that, as abstract con-
cepts, we can ask people to consider, despite their sensitive nature. Euthanasia is a case
in point. The care of people in the final stages of a terminal illness is a matter faced by
medical staff on a daily basis. It is not surprising, therefore, to find that there have been
many surveys of medical staff on euthanasia in the context of pain relief and end of life
care. A survey of Italian doctors in 1999 showed that only a minority of doctors endorsed
it. A survey of Norwegian doctors revealed similar findings. An international survey of
Scandinavian physicians showed that Danish and Swedish doctors were more liberal.
While education research is unlikely to deal with anything as difficult as euthanasia, how
people respond to sensitive issues is something that we, as researchers, must be aware of.
Methodological studies suggest that the following are the key dimensions that we should
consider when asking questions about issues that people might find difficult or sensitive.
t The importance of the issue to the respondent. Things that are important to us gen-
erate strong reactions. If we were researching why a school was under-performing
in terms of student attainment, it would be significant if we were to find that staff
teaching mathematics, sciences and modern languages were implacably opposed to
the school’s policy (initiated by the head teacher) of whole group teaching.
t Direct experience of or exposure to the issue generates knowledge, and helps us po-
sition that knowledge (for example, good/bad). It can also generate an emotional
response (for instance, if we have direct experience of child abuse, either through
having suffered it or having identified it).
t Regularity/frequency of the issue in a personal context will affect how we view it or
how we respond to it. If the principal or vice chancellor of a university comes round
to speak to us two or three times a year and if we have the opportunity to run into
them at coffee or lunch, we are more likely to have a favourable impression than if
the opposite were the case.
t The social perspective in a person’s reference group may well affect us. If our immedi-
ate subject colleagues think that we should revamp the curriculum, the chances are
that we will go along with it.
t The social perspective in the broader community may influence our views on bigger
issues. For instance, support for capital punishment has fallen steadily over the past
50 years in many developed countries.
While we must consider the implications of these issues in our research, we still have to
produce the question. Our key ally in producing the questions is the privacy of the ques-
tionnaire and the anonymity we should afford the respondent. A study by Anthony Ong
based on his doctoral research (Ong and Weiss, 2000) explored the relative significance
of these in relation to student cheating. The research procedure began with a class test in
which books and support materials were left scattered around. The students were told to
ignore them but they were marked so that researchers would know if students had ac-
cessed them. The second stage of the research presented the students with a questionnaire
Chapter 8 ASKING QUESTIONS 319
Table 8.5 Proportions of subjects answering ‘YES’ to questions under conditions of anonymity and
confidentiality
on student behaviour, one question related to cheating. Table 8.5 shows some of the
questions they asked and the proportions acknowledging the behaviours. Half of the
respondents were told that their responses would be anonymous (that is, the researchers
would not be able to identify individuals with a response) and the other half was assured
that their responses would be given in confidence (that is, the researchers would know
what they had said but would not release the information). These conditions established
(for the respondents who cheated) whether they could be identified as cheats. In each
group the researchers were able to identify the students who cheated (‘the peekers’ in
Table 8.5). The design of the research was sophisticated. In methodological terms, Ong
could identify the impact of a strong guarantee (anonymity) as compared with a weaker
guarantee (confidentiality) and assess each in relation to the nature of the question.
Overall, guarantees of anonymity produced a higher response for questions about
behaviour frowned on by society than conditions of confidentiality (look at the pro-
portions admitting to driving under the influence of alcohol and shoplifting). This is
confirmed when the responses of those who cheated were extracted: 74% of cheats (the
‘peekers’) admitted cheating under conditions of anonymity, while only 25% admitted
it under conditions of confidentiality. Where behaviour could be reputation enhancing
(gun ownership, use of marijuana), confidentiality allowed some respondents to over-
state the situation.
What conclusions can we draw from this research:
t under-reporting is likely for sensitive issues;
t anonymity and confidentiality will both aid the process of obtaining responses but
anonymity has the greater impact;
t people will respond to direct questions of a personal nature;
t responses to questions on sensitive issues are indicative and are not an accurate esti-
mate for the population.
320 Research Methods for Education Part 2 The Process of Data Collection
54% found gambling personally acceptable Many gambling experts believe that Internet gam-
for everyone bling will continue no matter what the government
does to try to stop it. Do you agree or disagree that
87% agreed that the Government should not the federal government should allocate government
tell people what to do with their time resources and spend taxpayer money trying to stop
and money adult Americans from gambling on-line?
83% agreed that adults should be allowed to
take reasonable risks Comment: Another long question. The opening state-
ment could influence responses. The phrase ‘spend
Convincing figures – until you start to examine the
taxpayer money’ is emotive. One of the principles of
questions. Let us look at some of them.
professional practice of the American Association for
The United States Congress is currently (2007) con- Public Opinion Research is that ‘we should not know-
sidering legislation that would attempt to abolish on- ingly select research tools and methods of analysis that
line gambling. With the assumption of something yield misleading conclusions’. In your judgement, do
that is morally offensive to almost everyone, such as the questions meet that principle?
child pornography, do you think it is appropriate for
the federal government to restrict what adults do on Source: www.onlinegamingmythsandfacts.com/polls.htm
the Internet in the privacy of their own homes? (accessed 9 January 2008)
Chapter 8 ASKING QUESTIONS 321
Bias can occur in questions when they are presented or contextualised in such a way
that it is made difficult for the respondent to agree or disagree with it. For example, if the
question were, ‘Now that English is the first foreign language that people in many other
countries learn, there is no reason to require pupils to study a modern language. Do you
agree or disagree?’, it would almost certainly influence responses. From a research per-
spective, the fact that other people learn English is only one of several factors that should
influence why children in the UK should learn a foreign language.
Bias can also occur when an issue is associated with a person, an event or an object
that is clearly established with their/its own image in the eyes of respondents. If we
ask, ‘Organised sporting activities, such as those provided by David Beckham’s Football
Academy, can help overcome disaffection. Do you agree or disagree?’ our respondents
are likely to be influenced by the David Beckham association. This is called a ‘prestige’
effect. The reality is that the question is perfectly valid without it (and the evidence is that
such programmes can counter disaffection).
Another way of affecting responses is to impute the question rather than state it di-
rectly. For example, the direct question is, ‘Should pupils be taken out of school by their
parents for a family holiday?’ The imputed question (called an elliptical question be-
cause respondents can fill in the missing words to create a question) is, ‘Parents should
not take their children out of school for a family holiday’. The research evidence is that
the latter construction generates more extreme views than the former. The guidance for
avoiding bias is to pare the question down to its essentials. There should be nothing in
the phrasing that competes as an additional trigger to the stimuli of the questions.
What we have dealt with so far are the issues we have to consider in determining our
approach to scaling. Now we shall consider some actual approaches to opinion and
attitude scaling. There are a great many methods we could consider. However, we shall
only look at two that use rating scales of the sort that we have discussed so far. At this
point we should emphasise the distinction between a rating scale used as a measure-
ment device and an attitude scale that combines rating scales to give a single statistic to
represent a person’s viewpoint. The former can be used to provide specific information
about a person’s viewpoint in a questionnaire. An attitude scale is a collection of rating
scales that represent the full range of viewpoints on a topic. The individual rating scales
that comprise the attitude scale have been tested to ensure that, when combined, the nu-
merical value that results is representative of a person’s attitude on the whole spectrum
of attitudes. We shall see this distinction in the first example below.
Numbers of students
is a single peak for each question. When this happens it suggests that (a) respondents
are only responding to a single stimulus and (b) we have drawn a sufficiently large
sample. To understand why this should be the case, we have to understand something
that we have not yet covered, the normal distribution (we look at this in Chapter 13).
At the present time, we just have to accept that what we are looking for is a single
peak. Where there is more than one peak (for example, ‘To enjoy myself before start-
ing work’), we have a difficulty. Do the two peaks suggest that we have not sampled
enough people and that were we to increase the sample size the respondents would be
concentrated between the two existing peaks? This is not likely where the two peaks
are either side of the scale mid-point. More likely, there are two distinct groups of
students each with a different outlook on life which affects their reaction to this state-
ment. Only further investigation will reveal the answer but, if there are two groups,
we have to reject the statement in our attitude scale because it is multi-dimensional,
not uni-dimensional.
t Second, we need to be certain that the statements reflect the attitude range and are
not bunched. Rating the statement in terms of how positive or negative it is will
help here. In broad terms we should aim to have an even progression of statements
through the attitude range. We should reject statements where our judges are po-
larised and discard those that are too similar to others. As the end of this stage we
should have a Likert scale that will perform effectively. However, we will not have the
statistical analysis to demonstrate this to others. For this, we have to master an area
called reliability analysis, which, again, you will have to read in a specialised text.
Weak
Passive
(Activity)
Unattractive
Attractive
(Evaluation)
Active
Strong
(Potency)
How do we choose the adjectival opposites that form the semantic differential scales?
We can offer the following guidance.
a range of scales so that all of these dimensions are covered, usually several times. The
next step after this is to conduct a thorough literature search (a) to identify how other
researchers have deconstructed the concept we intend to research (this can suggest ad-
jectival descriptors) and (b) to identify any that have used this technique. If their scales
work, we should consider using them. A wider literature survey of the technique itself
(rather than the subject we intend to research) will also throw up successful scales. Fi-
nally, we should think about our topic and identify other concepts that may be useful to
our research. We use all of these sources to build a set of scales and, from the list we have,
we should reduce it to between 15 and 30. Below this number we may not be tapping all
the nuances we require, above it and fatigue will set in amongst the subjects.
The semantic differential is easy to operationalise. Respondents understand quite
readily what they have to do. Completion of the scale can be done on an individual basis
but, equally, it can be done by groups. This makes data collection very effective. Analysis
is straightforward, either using some of the methods considered in Chapters 12 and 13,
or using a specific measure of ‘difference’ to show how respondents (or even stimuli)
cluster together. Let us see how this works.
Imagine that our research is into parents’ ideas of education and learning. In particu-
lar, we want to see what sort of image they have of the value of education and learning.
We could give a selection of parents in a town a range of ten semantic scales against
which to judge ‘the value of education’. What we want to do is see how far the parents
differ in their judgements and whether they cluster together in their attitudes. How can
we show that one person is similar or different to another? The method is quite simple.
t Each parent uses the same ten scales. If we compare their judgements on each scale,
we can measure how close to each other they are.
t We compare their judgements by:
(i) subtracting the score of parent A on scale one from the score of parent B on scale
one and squaring the result.
(ii) We do the same for scale two and continue doing the same up to scale ten.
(iii) We add up the ten squared differences and take the square root of the result.
This figure is a measure of the difference between parent A and parent B.
t We repeat this for parent A and parent C and parent A and Parent D and so on until
we have compared parent A with every other parent.
t We then do the same for parent B and then parent C and so on until every parent has
been compared with every other parent.
t This yields data that shows how close everyone is to everyone else. With even up to
20 people, we could make sense of the data but beyond that we have to use a pro-
gramme such as SPSS to show the clusters. This not only produces the cluster analysis,
it will also calculate interpersonal differences.
The semantic differential continues to be used in social and educational research. See
Case Study 8.4 for an example.
chosen, it is important to ensure that there is some ‘quality control’ in the selection of the
scale terms – and it is this that may swing the choice one way or the other. The scale terms
(statements in the case of a Likert scale and adjectival pairs in the case of the semantic
differential) should not be plucked out of the air and used. Not only do they have to
relate to the issue, they should also constitute a reasonable representation of attitudes. In
the case of Likert scales, this does mean that we should go through quite a long testing
procedure unless we use a scale already developed and tested by someone else. (It is fair
to say, though, that many published studies appear not to have gone through this testing
procedure without undermining the integrity of the results). With the semantic differen-
tial, as long as we choose adjectival pairs that reflect a balance of the three underpinning
328 Research Methods for Education Part 2 The Process of Data Collection
dimensions of meaning– evaluation, potency and activity – we are likely to ensure that
we reflect a wide range of views. Doing this can be less time-consuming than establishing
the appropriateness of a set of statements. However, as research becomes more advanced,
there is some methodological evidence that semantic differential scales may provide a
better representation of viewpoints. In a study of how teachers react to academically
gifted students, a British and an Australian academic, Professor John Geake and Professor
Miraca Gross, stated that they preferred the semantic differential because:
t they believe that the agree/disagree character of Likert scales generates neural activity
in different sides of the brain and has a tendency to produce ‘middling’ judgements,
and
t more down to earth, they think the object of the Likert scale is so obvious that it
affects responses (Geake and Gross, 2008).
In a comparative test of the two methods a Norwegian team, led by Oddgeir Friborg,
found that when 334 students completed the two scaling procedures, the semantic dif-
ferential data indicated that the students were more consistent in their judgements, that
they clustered more and that the patterns determined by the semantic differential data
produced a better fit with the data (Friborg et al., 2006).
If the first step in getting a good response to questionnaires is to find people who are pre-
pared to answer our questions, the second is to ensure that they answer them. How the
questionnaire appears to them is a key aspect of this. Here, we shall identify some goals
that we should set ourselves and suggest what we can do to achieve them.
We can go a long way to achieving commitment to the survey with our sequencing
of questions. We should try to make our respondents feel that it is a waste if they do not
complete the questionnaire. This is what we can do:
t Put straightforward questions, ones to which they know the answer, at the beginning.
This is why questions on gender, age, etc. that create the respondents’ profile are often
at the beginning.
t Put sensitive questions towards the end. If we need to ask about income, then this is
better placed after people have responded to substantive questions on the research
topic and have begun building commitment.
t Put difficult questions, ones that require thought rather than a quick response,
towards the end.
t Do not put open-ended questions at or too close to the beginning because people
have to think about how they want to respond. It is also good practice not to put
in lines on which people can put their responses. This is likely to limit answers; just
leave a blank space.
t How will the data be put into a form for analysis? If the questions are closed and
we have access to an optical character reader (OCR – a device that scans a ques-
tionnaire page, reads the boxes that have been marked by the respondent and
loads it into a data file for subsequent analysis), we can put the data straight into a
spreadsheet. If we do not have an OCR, then the data has to be inputted manually.
Figure 8.5 shows how this is done. Conventionally the rows of a spreadsheet are
our subjects and the columns are our questions. We will find it far easier to in-
put the data if we put the column number next to the question or the response
boxes. Check also whether there are any specific requirements of the tools that
you will use or formats that are required (for instance, column separation between
Chapter 8 ASKING QUESTIONS 331
Questions
Respondent #1
1. How old are you? (3) 0 – 16 1 2. What is your gender? (2) Male 1
17 – 30 2 Female 2
31 – 45 3
46 – 60 4
61 – 75 5
above 75 6
Constructing a questionnaire requires one set of decisions, how to get it in front of re-
spondents requires another. And the two sets of decisions affect each other. We have four
ways of putting questionnaires in front of our subjects: by bringing the subjects together,
by taking the questionnaires to them (where they work or in their homes), by mailing
them and by sending them by email or providing access to them by the Internet. Each
has its pros and cons that are summarised in Table 8.8.
There are, however, downsides. It is difficult to get all subjects together at one time.
There will be people who cannot make it and this will impact on sample design. Get-
ting people together is an issue if there is to be more than one questionnaire. Group
sizes are usually smaller than with other approaches. At a very practical level, there
may be a problem with people finishing at different times. If some are allowed to leave
before others, this may result in some people choosing not to complete the survey in
order to leave.
be different from other rank and file members. Others we are likely to bring together
to administer questionnaires are schoolchildren. This is particularly so if we are ad-
ministering the questionnaire through the school. We have to consider the staff in
the school and make the task straightforward for them. Rather than have a random
sample of children, it is likely that we will sample years or classes and administer the
sample in class groups.
t Is it likely that our respondents will need advice and support to answer our ques-
tions? The groups this might refer to are children, those with limited learning dis-
abilities and the elderly. If our questionnaire deals with technical or professional
issues that our respondents may not be familiar with, we should include these people
as well. In these conditions we would bring the people together or administer the
questionnaire using the drop and collect method. If our research was into using the
Internet to create a stronger partnership between parents and schools, we would need
to know (a) the level of technical provision at the children’s homes, (b) the level of
parents’ technical understanding and (c) their interest in working in partnership with
the school. Given that many of the questions would be technical in nature, we could
administer the questionnaires in two ways, either at parents’ open days (when they
come to talk about their child’s progress) or using a drop and collect approach. The
latter might be preferable because it would include those parents who do not attend
open days. It is possible to provide support via email or the Web but this is not likely
to be effective for the groups or topics described.
Even by applying everything we have learnt – and more – we can still produce a question-
naire that is less than effective. That is why, when we have produced a draft questionnaire,
we should test it to identify any flaws and failings. A process for testing a questionnaire
is set out in Table 8.9.
We should begin the pilot by having some people with expertise in questionnaire
design look over the first draft. This may well remove many of the potential problems.
They will be able to point out any leading or respondent-influencing questions, prob-
lems of routing through the questionnaire, questions that lack clarity or where there are
opportunities for respondents to add detail. They will advise whether the question will
produce the data we need to answer our research question and whether we have been
too optimistic with the amount of data we want (if it is too long, respondents will not
complete it). New eyes will see issues that are not apparent to us. Two or three people is
all that it takes.
After this we revise the questionnaire and take it to some users to complete. We will
not be using their responses, so (unless it is a national sample) we should ensure that
they are not in our sample frame. We do not have to sample in order to identify them,
just find some people with similar characteristics to the population we will sample. For
instance, if we are producing a survey for teachers in schools a, b and c, we could get
some teachers in school d to complete the questionnaire. When we have received the
completed questionnaires, we should go through them to determine that the questions
have been appropriately answered and to ensure that respondents have answered the
questions we required of them. We should, with the pre-test, be able to identify which
respondent completed which questionnaire. While this may not be necessary in the final
survey, it is important at this stage so that we can go back to the respondents to probe
more deeply into how they completed the questionnaires.
The sorts of issues we should follow up are:
t whether the questionnaire was straightforward to complete or whether it caused
them any confusion;
t about questions that caused them any difficulty in responding;
t whether they felt the questionnaire touched on all the topics that they think impor-
tant (if not, ask them what other questions/issues should be introduced);
t about the layout, sequence of questions, the visual impression, their feelings on see-
ing it for the first time.
One approach (and we shall learn more about it in Chapter 9), is to sit with respond-
ents as they complete the questionnaire and ask them to comment on the questions as
they answer them.
We should, as well, ask at least some of the respondents to follow the instructions for
returning the questionnaires. This is most important for postal questionnaires. It is one
way of ensuring that we have put a return address on the form (and not just the covering
letter – things can get separated) and that the address is correct. Having done all of this,
we should be able to put a questionnaire out into the field that most people will be able
to use. But every questionnaire is a learning experience and there will be something that,
with hindsight, we will wish we had changed.
Questionnaires are amongst the most used of data collection instruments in social science
and education research. There are good reasons for this. They can generate lots of data
and they are apparently easy to use. However, there are considerable differences between
good and bad questionnaires, that is, those that are effective in their task and those that
should never have seen the light of day. The difference, though, between good and bad
questionnaires can be slight, so with forethought all researchers should be able to produce
questionnaires that are effective and efficient. Now you should try – look at Activity 8.2.
336 Research Methods for Education Part 2 The Process of Data Collection
You are working at your desk when you get an in- t the attitudes of young people to computers and
dication that you have just received an email. You new technology.
open it and see that it is from a colleague in another
country. On reading the message, you find that they In preparing your questionnaire you should:
are inviting you to work with them and other spe- t think about the issues and jot down your ideas;
cialists on a survey of the use young people make of t search to see if others have researched the topic
new technology. The survey is to be conducted in EU and, if so, look at their work;
states. You read that there is to be a meeting in two t think about the age of the respondents and how
weeks’ time and the participants are asked to bring a this might affect the way you ask the questions;
draft questionnaire. They have attached the brief for t consider how a very large survey would be pro-
the survey. You reply that you will be there. cessed and how this may affect the questions.
Now you have to prepare the questionnaire. The
The research was undertaken as part of the OECD
brief for the survey is to see:
Programme for International Student Assessment
t how far access to and use of computers and new (PISA). This programme conducts periodic studies of
technology varies for young people (aged 15–16) the performance of students in OECD countries on a
throughout the EU; standard test. Look at the website: http://www.pisa.
t how young people use computers and how confi- oecd.org. It is one that you may find useful if you are
dent they feel with the technology; interested in comparative education analysis.
Summary
t Developing a questionnaire requires that we think ‘in the round’. How we deliver
our questionnaire can affect our question types and layout. How we process data
for analysis should be reflected in the questionnaire layout.
t We should have a balance between closed and open questions that takes account of
(a) our respondents and their ability to manage different question types and (b) our
analysis and the resources we have available. For us, there is a trade-off between
new data and high cost.
t We should know what type of information we require from a detailed analysis of
our research question.
t We should not think that all questions have to be new ones. If there is a tried and
tested question, we should use it.
t Questionnaires should look appealing and should not be off-putting to respondents
because they are too long. Respondent interest and commitment is inversely related
to the length and complexity of the questionnaire.
t Biased questions give useless data.
t Scaled questions are useful but they have to be well constructed.
t Remove problems before they can affect results by piloting the questionnaire.
Chapter 8 ASKING QUESTIONS 337
Further reading
Bradburn, N., Sudman, S. and Wansink, B. (2004) Asking J.B., Kurtzman, H.S. and Cain, V.S. (eds), Lawrence
Questions: The Definitive Guide to Questionnaire Design, Erlbaum, Mahwah, NJ.
John Wiley, Chichester. There are many texts on questionnaire design that could
Oppenheim, A.N. (2000) Questionnaire Design, Interview- be suggested. However, I always recommend Oppen-
ing and Attitude Measurement, Continuum, London. heim. First published in 1966, it has stood the test of
Shaeffer, N.C. (1999) ‘Asking Questions About Threaten- time and is a particularly good introduction to a wide
ing Topics: A Selective Overview’, Chapter 7 in The range of attitude measurements. Bradburn’s book
Science of Self-report: Implications for Research and Prac- contains much practical advice and Shaeffer’s article
tice, Stone, A.A., Turkkan, J.S., Bachrach, C.A., Jobe, gives good guidance on a tricky issue.
References
Avramidis, A., Bayliss, P. and Burden, R. (2000) ‘A Jenkins, J., Russell, W.A. and Suci, G.J. (1958), ‘An Atlas
Survey into Mainstream Teachers’ Attitudes Towards of Semantic Differential Profiles for 360 Words’,
the Inclusion of Children with Special Educational American Journal of Psychology, 71, 688–699.
Needs in the Ordinary School in one Local Education Kroeger, S. and Phillips, L. (2007) ‘Positive Behaviour
Authority’, Educational Psychology, 20(2), 191–211. Support Assessment Guide: Creating Student-Centred
Friborg, O., Martinussen, M. and Rosenvinge, J. (2006) Behaviour Plans’, Assessment for Effective Intervention,
‘Likert-based vs. Semantic Differential-based Scorings 32(2), 100–112.
of Positive Psychological Constructs: A Psychometric Lowe, H. and Cook, I. (2003) ‘Mind the Gap: Are
Comparison of Two Versions of a Scale Measuring Students Prepared for Higher Education?’, Journal of
Resilience’, Personality and Individual Differences, Further and Higher Education, 27(1), 53–76.
40(5), 873–884. Martin, E. (2006) Survey Questionnaire Construction,
Geake, J. and Gross, M. (2008) ‘Teachers’ Negative Affect Research Report Series#2006-13, Director’s Office US
Towards Academically Gifted Students: An Evolution- Census Bureau, Washington DC.
ary Psychological Study’, Gifted Child Quarterly, 52(3), Ong, A.D. and Weiss, D.J. (2000) ‘The Impact of Ano-
217–231. nymity on Responses to Sensitive Questions’, Journal
Hagelin, J., Nilston, T., Han, J. and Carlsson, H.E. of Applied Social Psychology, 30(8), 1691–1708.
(2004) ‘Surveys on Attitudes Towards Legalisation Schuman, H., Ludwig, J. and Krosnick, J.A. (1986) ‘The
of Euthanasia: Importance of Question Phrasing’, Perceived Threat of Nuclear War, Salience, and Open
Journal of Medical Ethics, 521–523. Questions’, Public Opinion Quarterly, 50, 519–536.
Heise, D.R. (1965) ‘Semantic Differential Profiles for Sudman, S. and Bradburn, N.M. (1982) Asking Questions:
1,000 Most Frequent English Words’, Psychological A Practical Guide to Questionnaire Design. Jossey Bass,
Monographs, 79(8), whole edition. San Francisco.
Huisman, A.N., Wu, M.Z., Uil, S.M. and van den Berg,
J.W.K (2007) ‘Reliability and Validity of a Dutch Version
of the Leicester Cough Questionnaire’, Cough, 3(3).
Chapter contents
Introduction 339
9.1 Talking and listening: the context for individual and group interviews 340
(i) All interviews are not the same 341
(ii) Beyond the standard question 342
9.2 Watching and listening: the context for observation as a data collection
method 347
(i) What can we use observation to do? 348
Summary 390
Further reading 390
References 391
Chapter 9
Learning themes
t The strengths and weaknesses of inter- By the end of this section you will:
view, collective interviews and observa-
t Understand the strengths and weaknesses
tion.
of different ways of interviewing and
t The procedure for setting up and conduct- observing and be able to determine which
ing interviews, collective interviews and approach could be used in any research
observation. situation.
t The difference between types of interview t Be aware of the different stimuli that can
and interviews and focus groups. be used to create an interviewee’s response.
t The role of the researcher in the data t Be able to develop and operate an inter-
collection process and the pressures that viewing and observational approach to
researchers face. data collection.
t Ethical issues associated with these meth- t Be aware of ethical and other issues that
ods of data collection. can affect interviews and observation
and be able to take account of them in
research design.
Introduction
As we saw in Chapter 8, questionnaires are, than questionnaires and allow the investigator
even with open-ended questions, a highly to open up and explore avenues of enquiry.
structured data gathering process. Their benefit In other words, the route for data collection
is that the data they collect directly address the is not fixed; it can change as the process of
research question; their downside is that if the collecting the data changes our understanding
researcher fails to cover aspects of the research about the issue being researched. This is a
issue in the formulation of the questionnaire, powerful opportunity – the problem is that
important data are not collected. The methods many ‘opportunities’ lead only to dead ends.
that we shall look at here, interviews, focus So when we are considering data collection
groups and observation, are more flexible methods and thinking about which ones to use,
340 Research Methods for Education Part 2 The Process of Data Collection
we should always remember their potential to questionnaires.) A common strategy is for one
distract us and blur the focus of our research approach (for example, questionnaires) to be
enquiry. Having said this, they can also expose used to establish a broad picture. Another
new and perhaps more significant questions. approach, for example, focus groups, is then
used to understand the mindsets and behav-
While we shall deal with interviews, focus iours of particular groups. Alternatively, focus
groups and observations separately, we should groups can highlight issues that become the
note that there is nothing written in research basis of a questionnaire. Such combinations
methodology to say that they should be used are now standard practice in research design.
in a mutually exclusive way. (If anything they Talking, listening and watching are the back-
work well together with each other and with bone of data collection.
With this understanding as a context for the way in which we can use interview
approaches, in this part we shall:
t review types of interview;
t look at alternatives to verbal stimuli and verbal responses;
t consider some ethical consequences of the invasive character of interviewing;
t appreciate the critical role of the interviewer as the instrument of data collection.
Interview type
Individual Collective
In-depth Cognitive
Administration
Face to face
Phone
CATI
effectively, spoken questionnaires. There is little to add to the guidance already given (in
Chapter 8), other than (i) the interviewer cannot deviate from the script and (ii) the
refusal and non-completion rates are lower than with sending surveys by post or drop
and collect but the labour cost is greater. Structured questionnaire interviews are exten-
sively used by market researchers.
The types of interview we shall consider in more detail in this section are semi-
structured, the two forms of ‘evolving’ interview (evolving because the direction of the
interview is determined by the answers given by answers to preceding questions) and the
two forms of collective interview.
Figure 9.1 also shows ways in which the interview can be administered.
t Because a person-to-person relationship constitutes the heart of the interview, face to
face is the most common approach.
t Telephone interviewing is only appropriate for structured or semi-structured interview-
ing because. There are reports of its use in group interviews and focus groups but as
Hurworth (2004) points out discussions may be less spontaneous, there is an ab-
sence of non-verbal communication clues, it is difficult to use stimulus material un-
less it is sent in advance or accessed synchronously via the Internet. Against this, they
are widely used by human resources departments for staff recruitment, are far cheaper
than bringing people together and may well be more suitable in bringing together
hard to reach individuals. From the researchers’ viewpoint, we have to consider the
process of recording responses. It is much easier to tick boxes than summarise con-
versation. There is always the possibility of recording the interview but this raises
(a) questions of its legality if permission is not asked and (b) questions over the qual-
ity of the response if it is. If people know that their answers are being recorded over
the phone, they are more likely to be guarded in what they say, with the result that
the interview can become bland. This is less likely face to face, when interviewer and
interviewee can establish a rapport.
t Computer assisted telephone interviewing (CATI) should, at this point in time, only be
used by researchers at the beginning of their research careers with structured inter-
views because of the need to key in responses in real time. There are experiments and
developments with computer-based interviewing, especially associated with health
and trauma. Cater (2011) used Skype to interview members of the US armed forces
on the trauma of amputation. She recorded the interviews directly to the computer
and so was able to give the interviewee her full attention. Hinchcliffe (2009) used the
Internet to interview people with communication and learning difficulties. She found
cost benefits, flexibility in scheduling interviews and the interviewees’ perception of
anonymity an advantage and while some interviewees found it difficult to adjust to
the technology, most found it less stressful than face to face interaction. However,
despite the pace of change in technology and increased data collection opportunities,
she recognises that the approach still in its infancy. My conclusion is that Internet
enabled research is not sufficiently advanced to cope with all the nuances and contin-
gencies that social research requires.
(a) Pictures
Artistic representations, shapes and photographs have all been used to elicit responses in
interviews. The content of the image is directly related to the nature of the investigation.
The benefits of using pictures from a research strategy perspective is that they provide a
common stimulus for each interviewee, that is, a potential source of variation is removed
from the range of influences. The use of pictures also stimulates recall. Pictures can be
particularly important when dealing with groups whose verbal skills may be limited.
This would include children and people with learning difficulties.
Pictures can be used to assess the importance that people ascribe to objects. This is
what Harvey Ells did in his study of food choice by children and young people (Ells,
2001). He interviewed 7- and 8-year-olds and 14- and 15-year-olds in their classrooms
and usually in pairs. The children were shown pictures and photographs and asked to
comment on what they saw. The advantage of using pictures was that it was significantly
cheaper than taking respondents to food stores and food service outlets. Ells’ paper deals
with a range of methodological issues and how they were (or were not) resolved. For
instance, he reports that pictures were used particularly successfully with young children,
though in small group interviews some children did exaggerate their responses. Prosser
and Burke (2011) argue that images are not just a stimulus for children, they enable
researchers to adopt a ‘childlike perspective’ and, in doing so, to get closer to a child’s
view of the world. As they point out, ‘Words are the domain of adult researchers and
therefore can be disempowering to the young. Images…on the other hand, are central to
children’s culture from a very early age and are therefore empowering.’ Their paper is a
sound review of visual imagery techniques and cogent plea to see the world through the
eyes of a child.
Pictures have also been used as a stimulus for storytelling (‘Tell me what is going on
here’). The basis for using pictures in this way comes from psychology where the thematic
apperception test is used to explore personality and motivation. In this type of test (called,
generically, projective techniques) subjects interpret a set of ambiguous images; their inter-
pretations are used to gain insight into their views of the world and their relationships with
other people on the basis that they ‘project’ their unconscious into their narrative. Outside
psychology, projective approaches have been used to help people express opinions and to
determine and understand attitudes. Miriam Catterall and Patrick Ibbotson have reviewed
how a range of projective techniques might be used in education research (Catterall and
Ibbotson, 2000). While much of the stimulus material is visual, they point out that verbal
stimuli and role-play can also be used to explore people’s viewpoints. Their assessment of
the advantages and drawbacks of using such techniques is summarised in Table 9.1. On the
advantage side, projective techniques are easy to use in combination with other approaches,
Table 9.1 Advantages and disadvantages of projective techniques for data assembly
Advantages Disadvantages
such as questionnaires or focus groups. When used with focus groups they can be a means of
getting to a deeper level of understanding than is possible by simply asking questions such
as ‘Why do you say that?’ or ‘Why do you think that?’ This versatility means that they can
be used effectively with both a qualitative and quantitative design. The difference from the
respondent’s perspective is that projective techniques are different from what most people
would expect of data collection. The different approach to data collection is intriguing, re-
quires more active involvement than, say, a questionnaire and, in consequence, is likely to
be more interesting. As Catterall and Ibbotson say, ‘Projective techniques can also be fun to
complete once respondents get over the initial surprise, self-consciousness and embarrass-
ment at what they are expected to do’ (p. 249). This can, however, be a double-edged sword
because it might be the case that they are most suited to younger rather than older subjects
or those with a more outgoing personality. For sensitive topics (such as abuse, sex, death),
projective approaches may be a means of overcoming some people’s reticence in talking
about the issue or of discussing how it affects them. They can also be used to help subjects
empathise with someone else’s perspective. What is it like to be bullied, to be old or to go to
a new school? Catterall and Ibbotson used completion of speech bubbles in a cartoon show-
ing students being told by a lecturer about an assessment, not only to reveal their own views
about the type of assessment but also to get students to think of assessment from the point
of view of the lecturer. Finally (and this is, for many researchers, the real benefit), projective
techniques throw up ideas and insights that may not have occurred to us when we designed
our questionnaire or set up our interview schedule. On the other side of the equation is the
fact that some researchers might use projective approaches to probe into areas that might
be distressing for the participants. If our intention is to expose deep feelings that might
otherwise not surface, this could be an invasion that might be detrimental to the well-being
of the subject. Properly we should take advice from an expert (usually a psychologist) before
we implement a projective approach. From a data perspective, there is concern that projec-
tive approaches tell us more about the subject’s take on modern culture (which is why they
are used extensively in consumer research) than give us insights into attitudes and emotions.
Many of the stories that people tell about the images are similar. This supports the concern
about cultural-based interpretations but there is a further methodological issue to concern
us, the fact that different data analyses can give different results and the possibility that dif-
ferent researchers using the same analytic method can deliver different interpretations. For
many quantitative researchers this would lead to the method being dismissed but we should
be aware that when it comes to insights into the way people think, we have to accept that
people are different and those differences can lead to more than one interpretation.
Another approach used by researchers has been to give subjects cameras and ask them
to take photographs that highlight issues relevant to the research investigation. Tuula
Heinonen and Maria Cheung used cameras in their study of the lives of rural women in
China (Heinonen and Cheung, 2007). Their contacts were trained to use cameras and were
then brought together after they had taken their photographs. They selected photographs
that were the most significant for them and, in groups, they described the pictures and then
explored why they were taken and what made what was in the picture important to them.
What is revealed was an insider’s view of life in a Chinese village. Putting it in context, the
alternative as a research strategy was a long period of observation, but always as an outsider.
to reconstruct what they were doing as they watched the video (essentially a ‘think aloud’
approach). Broadly the same approach was used by Howard Tanner and Sonia Jones,
two Welsh researchers, who wanted to establish whether ICT improved mathematics
learning. Their research tested the use of technology such as whiteboards and data pro-
jectors. At the heart of their research was video-stimulated reflective dialogue, in which
lesson videos were shown to pupils in groups to stimulate dialogue and discussion. The
findings were clear: interactive oral work was stimulating; ‘fun’ and explanatory feedback
were important; the technology was perceived to be modern (as compared, for example,
with OHPs) and produced clear text and diagrams; the opportunity to try things out,
make mistakes, erase the screen and start again was valued. Using mini write-boards (dry
wipe boards) and, for older pupils, video feedback, was a strong stimulus for reflection.
This last point raises the question whether this research technique ought to be used more
to support student learning.
The advantage of using still or moving images is that they remove differences in
experience as a variable in the research design. When should they be used? This is a
more difficult question to answer. Clearly they act as a stimulus in situations where
it might be difficult to generate a response to questions. They could be used with
children or with people whose experience of what is being investigated is limited.
The latter is important because it allows the researcher to present a stimulus image
that is closely related to the type of data that is required to answer the research ques-
tion (for example, in the study above where the researchers could focus on different
forms of ICT in use in the classroom). This is clearly a benefit in terms of research
design because it seeks to make the issues being investigated the principal variable.
Projective techniques could be used to add detail and depth to many questionnaire
designs but from a methodological point of view it is important to be able to show a
link between the results of analysing the questionnaire data and the insights from the
projective technique. This may not be easy if the sample size of the group completing
the projective test is too small.
(c) Activities
Activities that interviewees or focus group members are asked to perform are the final
category of alternative stimulus. Asking people to represent what their understanding is,
what they feel and what their emotions are pictorially is one approach that is common.
Using dolls, drawing images and organising images (for example, selecting photographs
or postcards to represent how lecturers feel about their college) have also been used.
In education research such approaches are especially common with young children
whose verbal skills may be insufficiently developed to provide researchers with the data
they need. Activity-based stimulus requires subjects to be actively involved and to think
about what they are asked to do (questions are usually responded to immediately).
This may promote recall and organisation of ideas. It is especially useful when dealing
with issues that are hard to verbalise or which the respondent is uncomfortable with.
These approaches are, for instance, widely used in explorations of abuse or trauma (such
as bullying or the death of a family member). Drawing was used by Fiona Leach and
Shashikala Sitaram in their research into sexual harassment of schoolgirls in India (Leach
and Sitaram, 2007). Working in groups the girls drew maps to show where they had seen
or experienced abuse taking place. The researchers wanted to understand what the girls
felt about the different types of abuse. Their approach was to construct an ‘abuse spider’,
where each leg of the spider represented a type of abuse. The girls placed coloured stickers
346 Research Methods for Education Part 2 The Process of Data Collection
on the legs (each colour represented a level of seriousness) to represent how serious they
felt the abuse was.
The representation of the world using alternative approaches may require the re-
searcher to view the data as a metaphor. This, of course, requires a different type of
analysis and an understanding that underpins the ability to understand the language
of metaphor. Much of the work on metaphor analysis has been based on language,
where there can be an issue about when words are being used in an everyday conver-
sational sense and when they have a deeper meaning. For a discussion of this complex
area (yet one that is increasingly important for qualitative analysis), Rudolph Schmitt’s
paper (Schmitt, 2005) offers a good introduction to the way in which the concepts can
be operationalised. He suggests that we can see metaphor in the following ways:
t As a means of representing our results: he gave an example of medical provision,
which could be represented as a ‘franchise’, as a ‘mission’ and as a ‘nurturing process’.
t To describe the research process, for instance, thinking of qualitative data as a ‘kalei-
doscope’ of colour whose pattern emerges only when viewed through an analytical
process.
t Identifying metaphors used by interviewees. Thomas and Beauchamp (2011) set out
specifically with the objective of getting their respondents to talk in metaphor in their
study of newly qualified teachers making their ways in the profession. They explore
the difficulties some teachers had in thinking in this way and the examples they gave
them to start them off. Asked about how they viewed themselves just before they
began teaching, typical responses were:
t The captain of the boat. I have to take these people (students) somewhere and
there are storms and high waves.
t A coat hanger. Everything hangs on you and you have to support everything. If
not, it falls.
In their first year of teaching, respondents said:
t A survivor of the Titanic but who didn’t have a lifeboat and had to swim to shore.
t A gerbil on an exercise wheel who is eager to make efforts but getting nowhere.
Thomas’ and Beauchamp’s work gives us insights into teachers’ perceptions of them-
selves and their roles and shows how thinking in metaphors can provide us with a
rich body of evidence.
Professor Kara Chan’s research on children’s perceptions of material possessions
uses metaphor (Chan, 2006). The research issue she explores is of interest to both
educationalists from the perspective of value development and to marketers from the
perspective of selling to children. Her method was to give each child a sheet of paper,
blank but for a stimulus statement: ‘This child has a lot of new and expensive toys’
and ‘This child does not have a lot of toys’. The children were asked to draw some-
thing that illustrated the text. The drawing activity was then followed up by a short
interview. The drawings were then analysed and 28 categories of objects and facial
expressions (these were the metaphors) were devised to represent the meanings that
the children were expressing. Branded goods were a metaphor for being well-off, for
instance, while patches on clothes and toys were a metaphor for being poor. Chan
was able to show that even very young children could express emotional attachment,
social meaning, personality and self-image. As this work shows, the interpretation of
the data to create (and extract) meaning is a crucial step. (We shall look at this again
in Chapter 11.)
Chapter 9 TALKING, LISTENING AND WATCHING: OTHER APPROACHES TO DATA COLLECTION 347
For all education’s commitment to qualitative research, observation comes a weak second
to talk-based investigations when we see what is published. This is surprising, first, because
verbal communication is a limiting factor for some groups which are of interest to us
(young children, for example) and, second, because we have plenty of evidence that what
we see influences our judgements more than what we hear. A good (and very influential)
example of this appeared in 1968 when an American psychologist, Robert Rosenthal, and a
high school principal, Leonore Jacobson, published a short (only 11 paragraphs long) but
ground-breaking paper on how teachers’ expectations of their pupils become self-fulfilling
(Rosenthal and Jacobsen, 1996). The summary for their paper states:
Within each of 18 classrooms, an average of 20% of the children were reported
to classroom teachers as showing unusual potential for intellectual gains. Eight
months later these ‘unusual’ children (who had actually been selected at random)
showed significantly greater gains in IQ than did the remaining children in the
control group.
What Rosenthal and Jacobson identified was an effect but not the process that
caused it. Their findings began a programme of observational research into how chil-
dren pick up the clues as to what is expected of them and how they are regarded.
Of course, obvious praise or criticism has an impact but the wealth of research evi-
dence suggests that non-verbal communication is rather more important. Elisha Babad
quotes one example:
When a student fails to answer a teacher’s question, the teacher would continue to
be in eye contact with that student for a brief period (measured in seconds or mil-
liseconds) before turning their eyes away. It turns out…that the duration of this eye
contact following failure varies as a function of teacher expectancy: the duration
would be longer if the teacher expects the student to be capable of answering the
question and shorter in the case of a low expectancy.
(Babad, 2005, p. 290)
Small clues, such as brief eye contact over a long period, can contribute to lower
academic attainment and progression in children from lower socio-economic groups,
as well as social and ethnic stereotyping and that ill-defined but possibly pervasive con-
cept, institutional racism. There does not have to be a notice saying ‘You are not very
good’; pupils will pick up the signs from a run-down and poorly equipped school. Such
signs say eloquently ‘You are not worth spending money on’. And since pupils respond
to this behaviour and to these clues, we should pay more attention to observation as an
approach to data collection because behaviour is an important source of information.
Observation can deliver worthwhile data but only if it is done well. There is a sense
amongst some new researchers that observation is easy, all you have to do is walk around
until something strikes you. The very great danger of this is that it is the most unusual
things that are the most striking and the most unusual things are not usually the most
useful for understanding and explaining ordered situations and relationships. Observa-
tion is an organised process with structures and protocols that are the guarantee that data
are valid and reliable.
348 Research Methods for Education Part 2 The Process of Data Collection
observation was being developed as a means of enquiry quite independently. Perhaps the
most famous research (and the most important for educationalists) was Jean Piaget’s ob-
servation of his own children, which was to lay the foundations of our understanding of
child development. Piaget observed his children in the natural setting of their home lives,
though he did test out his evolving ideas on child development in play-type activities. As
Gerard Duveen (2000) observes in his analysis of Piaget’s observational approaches, ‘…
some of the observations…are straightforward records of his children’s activity, many…are
also records of interventions… A re-arrangement of things around the child would produce
a new situation in which the child’s reaction could be observed…’ (Duveen, 2000, p. 88).
Duveen notes that his methodology combined sound ethnographic with clinical practice.
While much educational scholarship in these early years of the twentieth century was
based on the exploration and implementation of philosophical ideals (‘what should be’
and not ‘what is’) there were examples of educationalists using observation to source
their data. For example, Marion Kirkpatrick’s discussion of rural schools (in the USA)
published in 1917 is clearly a reflection based on his own experience. Partly descriptive
autobiography (the first day as a schoolteacher in rural Kansas), rather nostalgic, anecdo-
tal, strongly reflective and contemplative, it was written as a guide for those who were be-
ginning their careers. It gives a picture of education, rural life and farming communities
at the beginning of the twentieth century. While the analysis does not result from struc-
tured observation, it is, nevertheless, the distillation of experience and contains insights,
so we should regard it as an authentic research document. By the middle of the century,
it was clear that developments in observation methodology in anthropology, psychology
and sociology were making their way into education and that educational research was
becoming conceptually and methodologically better founded. It was in this period that
we start to see an expansion of education journals (for example, Education Review in the
late 1940s and the Scandinavian Journal of Education Research and Educational Research in
the 1950s) that provided a platform for the publication of observational studies.
How have education researchers responded to the opportunities afforded by an obser-
vational methodology? A number of areas can be identified.
t Child development: amongst the topics that have been studied in recent years are social
competence of children, interaction between children and their teenage mothers,
between children and carers (foster mothers, childcarers), development outcomes
and home environment, friendship patterns (in the context of desirability) and the
behaviour of gifted children.
t Socialising behaviours: themes investigated include play and social interaction, role
play for citizenship, learning to be friends, sibling synergy and teacher and school
governor socialisation.
t Anti-social behaviours: studied using observational techniques include drug use, haz-
ardous drinking (amongst university students!), teaching for active citizenship, ag-
gression and peer rejection, homelessness, attention-deficit hyperactivity disorder
and many studies on disaffection.
t Group cultures: have been explored in terms of cultural diversity and learning, subject
cultures and disciplinary learning, new technology and learning, enhancing ethnic
diversity in higher education and youth cultures.
t Territoriality: studies around this theme often intersect with youth culture, sociali-
sation and anti-social behaviour. Other themes researched are: gender construction
amongst immigrants, organisation of work teams and groups, teenagers’ appropria-
tion of space and gangs.
350 Research Methods for Education Part 2 The Process of Data Collection
t Practice observation: for practice improvement and the maintenance of quality stand-
ards is well established in medical and teacher training, teaching and lecturing, co-
teaching (teaching a class jointly) and the assessment of vocational and practical skills.
While these themes are ones that can be picked up from the research literature, ob-
servational research is not limited to them. It can form part of any investigation, quan-
titative or qualitative in nature. The data that are collected can be processed into usable
information from which we can draw our research conclusions using either quantitative
or qualitative methods. When we develop our data collection strategy, we should see if
there is a role for observation. In order to do this, we have to appreciate its characteristics.
A naturalistic setting
The first thing we should note about observation is that it takes place in a natural setting
for those being observed. Thus, for example, we would observe pupils in a classroom,
young children playing in a playground, members of a school or college at the institu-
tion, senior management team in a meeting, young people at leisure, family life at home
and so on. In all of these situations, the setting is one that people normally inhabit as
part of their day-to-day activities. There is another type of ‘natural setting’ that we ought
to understand, one that is recognised and accepted for what it is but which is not part
of the normal experience. For example, if, as part of an investigation into socialisation,
we were to put a group of infants into a room with play facilities and then video what
happened, their environment would be unknown but the equipment and the setting
would be, so it could be classified as a ‘natural setting’. The key conditions of the idea of
a natural setting are that there are no conditions or variables that are not usually present
which could influence or disrupt what is being observed.
An unfolding investigation
A second characteristic of observational approaches is that the pathway to an end point
is not pre-determined. Observation is an emergent procedure. It is discovery-based. As
we learn more our ideas develop and our thinking changes. There are few limits on what
we can do and where we can go. An observational approach is flexible; it can merge very
easily into other methods of data collection or alternative data sources (such as statistical
series or documents). With this opportunity, however, comes a great danger. It is easy to
mistake a line of thinking that opens up a route that we believe will lead us to answer the
question, only to find that it is a cul-de-sac.
A holistic viewpoint
Observation takes a view of the whole circumstance. While there is a focus to an in-
vestigation, the context is an equally important source of data that aids understanding
and interpretation. It might, for example, be difficult to understand bullying at school
without reference to home life or to appreciate a tutor’s approach to teaching without
understanding the academic culture of the subject. The implication for the researcher is
Chapter 9 TALKING, LISTENING AND WATCHING: OTHER APPROACHES TO DATA COLLECTION 351
that it is important to know the background in order to give depth and perspective to
the subject.
A changing world
Most (but not all) observation takes place over a period of time. This is often a pre-
condition for effective data collection in a naturalistic setting. While changing conditions
are part of the natural setting, they are a potential agent of influence; in fact they may
be the factor being investigated. For instance, if we were looking at the amalgamation of
two schools, we should be interested in observing the reactions of teachers and adminis-
trative staff. Not only does amalgamation create extra work, it also reduces the number of
posts. Both are stressful situations. The issue for the researchers is to be able to partition
the dynamics of the world into background noise and significant influences.
Obtrusive Unobtrusive
covert’ if we are not known, and describe us as ‘participant’ if we are actively involved in
what we are researching or ‘non-participant’ if we are not involved.
may be sound to reward people for their time but we should always remember the Latin
phrase, caveat emptor, let the buyer beware!
Covert participation such as Weatherford’s is not without its problems and issues.
Amongst these is the question of safety and security. Dennis Rodgers spent a year in
Nicaragua researching life amongst the poor. He has written a compelling account
(Rodgers, 2007) of how his research led him into gang membership. Facing danger, a
knife fight, stealing, street fighting with other gangs, drug taking, gun fights, using gre-
nades and gaining a reputation for ‘fighting dirty’ – no ethical consideration of a PhD
proposal can prepare a researcher for these experiences (and one can only hope that he
did not mention them in his letters home!). Fortunately educational researchers are rarely
faced with such dangers. However, as the American researcher Robert Labaree (2002)
shows, being an insider researcher raises its own issues. His work, on university govern-
ance, was undertaken while he was a member of the senate of the university where he
worked. This situation raises an issue, both for the researcher and, as importantly, for the
reader. Being an insider certainly gives access to information, some of which may be priv-
ileged, and produces insights, for example into power relationships and decision making
outside the public arena, that otherwise would not be available. But being part of the
power structure can lead us to form relationships with others to shape events and direct
the very processes that we are studying. When we are actively involved, our involvement
and how we view an issue can influence our interpretation of the evidence. This last point
is a serious methodological issue for the investigator. How active do we have to be to be
taken seriously yet not change the character of the world we are investigating? And for the
reader, what reliance can we place on research undertaken by an interested party? If we
find ourselves in this situation, we have to make clear how we have resolved the conflict
and this requires, at the very least, that we recognise it and make explicit our own values.
School University
Close examination of text. Increase in text coverage, need for time management.
Importance of the teacher’s opinion. Importance of literary theory and cultural/historical
context.
Commentary at the beginning of school courses. Development of autonomous thinking and argument.
Seating in rows, not conducive to interaction. Discussion – active involvement in lectures and
seminars.
High level of preparation for the examination. Little preparation for exam.
first-year undergraduate programme, she was aware that students found the adjustment
to university learning difficult. They were concerned about feeling disorientated and dis-
organised, how to manage literary context and theory and how to analyse texts, as well as
their ability to manage the volume of work in the time available. While a research meth-
odologist might question the detail of her method, there is no doubt that her approach
of finding out what went on in English teaching at school was a sound one. She chose
two schools in which to observe classroom teaching and found the same approaches in
each. She found contrasts between school and university (see Table 9.3).
Despite these differences, Gillian Ballinger felt that there were similarities that would
prepare students for the transfer: a discursive approach to enable individuals to establish
and justify their own viewpoints, an appreciation of the importance of context to litera-
ture and the importance of the audience and staging in understanding plays. Her paper
concludes with some general guidance for university tutors. While the investigation is
not as rigorously set up as Persson’s, it is certainly valid as a means of appreciating the
learning environment that provides the basis for university transfer, and her analysis of
differences and similarities is clearly informed by a sound understanding of the purpose
as well as the methods of higher education teaching.
This idea of observation as a learning process is taken further by Seonaidh McDonald
who reviewed shadowing as a mean of gathering data (McDonald, 2005). Shadowing is
more usually thought of as a work experience activity but, because it involves someone
following someone else over an extended period of time, it is admirably suited as a
means of data collection. McDonald identifies three ways of using shadowing:
1. As experiential learning: the traditional way of looking at shadowing; in research terms
we might use it in an initial stage of an investigation to build up our background
knowledge.
2. To record behaviour: we follow people in order to collect data on their behaviour that
will be useful for our research. Three American researchers, led by Professor Vernon
Polite, followed 58 school principals to assess the transference of learning from a
professional development programme into work practice. An interesting perspective
on their research is that they fed back their analysis to their subjects to shape their
further development (Polite et al., 1997).
3. To see the world from another perspective: this approach is less instrumental than recording
behaviour, where there is often a strong analytic framework (reflecting data needs and
variables to be tested). The emphasis is on the lived experiences of the subjects and the
Chapter 9 TALKING, LISTENING AND WATCHING: OTHER APPROACHES TO DATA COLLECTION 355
purpose is to understand the world through their eyes. This moves the research strategy
firmly into the realm of phenomenology (see Chapter 2). As McDonald says, ‘shadow-
ing has made a leap from being used as a neutral measuring and recording (quantita-
tive) tool to the means of generating a narrative to first develop and then share insight
into a role (qualitative)’ (p. 466).
Shadowing as a method for education researchers can be used to understand how
people perform roles, spend time and interact with others. It could be used in a study of
school management, teacher burn out or student workloads. It is an approach that can
be used with others; the researcher need not be a mute observer but can, for instance, ask
questions to gain understanding.
As we have seen, interviews can take several forms and one of the skills of the researcher
is to choose the approach that best meets the needs and circumstances of the research.
At the heart of the interview process researchers have to make a fundamental choice, the
relationship of the interviewer to the interviewee(s); is the interviewer a neutral facilita-
tor or engaged and sympathetic? Is the interview an informal discussion or a schedule
of issues to be explored? Which approach is adopted is influenced by the personality of
the interviewer and how they relate to others, by the needs of the research programme
and by how far the interviewer yields responsibility for the interview and the data it
generates to the interviewee. If not a lot, then the interviewer controls the process usu-
ally through a framework of questions; if a lot, then at best the interviewer steers the
interviewee who gives a response in a sequence that seems appropriate to the issue. In
many ways this balance between interviewer and interviewee parallels the choice of
questions in a questionnaire, closed questions, when the interviewer controls the inter-
view, and open questions, when the interviewee explores the issue in a way they want
to with just prompts and questions from the interviewer. This latter approach is called
a narrative interview. Hollway and Jefferson (2008) provide excellent insights into the
approach. Both styles of interview can be used in the approaches to interviewing we
consider next.
19. How did/do your experiences of relationships 21. What do you see as the key ingredients for a
compare with the expectations you had when you successful relationship?
first started discovering men/women?
20. How have your relationships with your partners
differed to those with your friends? What do you
get from one and not the other?
The keys to successful interviewing are to use a tone War. What sort of attitudes prevailed to sex and mar-
and words that the interviewee will be comfortable riage? How would she feel about answering ques-
with. This requires a high level of empathy: the best tions on these topics?
interviewers put themselves in the person’s shoes Now put yourself in the woman’s shoes and write
and try to imagine how they would speak, how they out your answers to questions 7 and 8. Then com-
would react, how they would feel and they use this pare them with what the interviewer actually said
awareness to phrase the questions and to determine by reading the interview (extract one) at www.eds.
what follow-up questions to ask. ac.uk/qualidata/support/interviews/semi.asp. Your
Look at the interview guide in Case Study 9.1. Imag- questions will be different, so look to see how the
ine how you would ask the questions of a woman in interviewer kept the pace of the interview going in a
her 80s. Imagine what sort of world she inhabited in conversational way.
her late teens, probably just after the Second World
Against this, as with all interview approaches, there is a trade-off between the quan-
tity of data collected and its richness. The data are indeed rich and deep but, unless
resources are abundant or the data are obtained from few people, we will find it dif-
ficult to use all of the data in our analysis. This contrasts with questionnaires, where
the data for each person are ‘thinner’ but the scale of the data collection allows it to be
segmented into meaningful chunks (for example, the behaviour or attitudes of teach-
ers as compared with technical and administrative staff, or the career intentions and
aspirations of pupils in private schools as compared with those in state schools). The
higher cost of interviews as compared with questionnaires is not just a consequence
of employing more people for longer. There is, as well, a cost of training interview-
ers, debriefing them and monitoring the data they collect. If a research programme
cannot afford a burst of interviewers at the right time and has to make do with fewer
interviewers over a longer period, then there is a danger that the answers given by
those later in the interview cycle might have been affected by conditions in the wider
world. For example, if our research is into the choices young people make with respect
to their futures and the media report on the high salaries in the financial sector and
low starting salaries in teaching while interviewing is ongoing, later interviewees may
be affected by what they read, hear or see. The possibility is that this will sway some
people’s replies and influence the results that we obtain. Finally, there is always the
possibility that respondents will construct replies that place them in a better light.
The principal defence against this is for the interviewer to be aware of the possibil-
ity, to have good background information, to follow up with questions that reflect
knowledge of a different interpretation and, if all else fails, to triangulate the response
through other interviews or documentary sources.
Chapter 9 TALKING, LISTENING AND WATCHING: OTHER APPROACHES TO DATA COLLECTION 359
Advantages Disadvantages
The advantages and disadvantages of semi-structured interviews are set out in Table 9.4.
UK. The asylum seekers were in the hands of agents. In some cases the agents offered
them a ‘menu’ of destinations and prices, in other cases the agents only dealt with ac-
cess to the UK. Where asylum seekers chose the UK, their decision was influenced by its
image as a safe and tolerant refuge, links (including colonial links) with the UK and the
ability to speak English. The researchers found that the asylum seekers did not have a
clear appreciation of asylum procedures or benefits, so their decisions to come to the UK
were unaffected by such considerations.
With the in-depth approach, the role of the interviewer is not to slavishly follow a
structure but to stimulate a response. Interviewees should be helped to find their own
way through the issue, express things in the sequence they want to follow and use words
that they want to use. The interviewer can kick-start the process and move it along but
control and manipulation of the response is not really an option, as it is with semi-
structured interviews. Robinson and Segrott (2002) used a ‘topic guide’ that highlighted
the issues they wanted to cover. In relation to understanding the migrants’ choice of
destination, these included:
t Did research participants choose a particular destination country?
t Did the choice of destination alter during the journey itself?
t Was help sought from facilitators in the country of origin?
t Who was this, i.e. an individual or an organisation?
t Did such facilitators offer a choice of destination? What kind of guidance/advice did
they give on possible destinations?
t Did they help with travel documents, i.e. passports?
t Was help sought from facilitators in the UK?
t Did the research participant consider seeking refuge in a neighbouring country, rather
than a more distant country like the UK? Which country was this? If this option was
not pursued, why was this the case?
t What other countries were considered besides the UK? Why were these countries con-
sidered?
t Were they unable to gain entry to them?
t Were some of these other countries rejected as options by asylum seekers themselves?
t What were the reasons for these decisions?
t How does the UK differ from other considered destinations?
t Did research participants intend to settle in the UK, or did they intend a temporary
stay?
These are clearly issues to be addressed in the discussion rather than questions to be
put directly to their respondents.
The advantages and disadvantages of in-depth, unstructured interviews are summa-
rised in Table 9.5. In-depth interviews provide an opportunity for a detailed exploration
of an issue. They are particularly useful for case studies, policy research, action research
and research where a timeline is important and where opinion and meaningfulness are
significant. They are extensively used in feminist research where the interview has been
used to encourage or stimulate actions that can change the circumstances and life tra-
jectory of some of the women who participate (see Case Study 9.3). While it can be
used to collect factual information, it is particularly valuable where the research is con-
cerned with the values and mindsets that lie behind the facts. In researching the decision
of a school in England to seek academy status (independent state schools where local
Chapter 9 TALKING, LISTENING AND WATCHING: OTHER APPROACHES TO DATA COLLECTION 361
Advantages Disadvantages
t Issue-based t Time-consuming
t Good for explorations of what people find t Requires common interviewing
meaningful in their lives protocol
t Emphasis on free-response of interviewees t Interviewer has to be credible
reduces influence of interviewer t Respondent may not take part
t Interviewee may change reality
education authorities have no responsibility for funding), we would want to know what
values, beliefs and expectations drove the decision and whether there are any conflicts
between the decision makers and the decision influencers (the school governors, the
senior management team, local education authority, other schools) and those whose
roles and interests may be affected (the teachers, the local authority, the community and
the unions).
Inevitably, interviews take time to arrange and administer, so there is a clear relation-
ship between the scale of the research and its cost. Interviews may be conducted over a
period of time but this again raises the issue of the stability of responses. Small groups of
researchers often carry out interview research but this requires that interviewers have to
be knowledgeable to be credible and well-prepared and briefed to ensure the operation
of common standards. This also requires that the results of the interviews are audited to
ensure that there has been no deviation from guidelines. Finally, we should note two
problems in respect of the informants. First, a question of access; what if one or more
informants does not wish to be interviewed? Documentary sources may provide some
evidence, but the question for the researcher is whether someone’s absence undermines
the research. Second, in post-event questioning (whatever the outcome of an issue),
many people will reconstruct reality and present a version of events that places them in
the best light. Interviewers should always remain sceptical.
they are given must draw them into the right analytic and descriptive mindset. For
instance, most children would find it straightforward to describe what they are
doing and then lead to the thought process with the question, ‘And why are you
doing it like that?’
t However, even the involvement of the interviewer can also be a problem. Too
many questions can affect concentration and can alter the problem solving pro-
cess. Where the line falls between getting the respondent across the starting line
and interventions that affect the pace and direction of the race has to be a matter
of judgement.
How have think aloud methods been used in education research and with children?
The focus in all cases is on solving a problem or resolving an issue and the goal is either
to improve something or to build models of problem solving or decision making. The
areas where research has been most abundant are:
t learning for disabled children;
t gifted children;
t acquiring literacy and reading skills;
t approaches to subject teaching;
t development of on-line learning platforms and packages;
t methodology, especially questionnaire testing.
Case Study 9.4 provides an example of think aloud approaches in use.
The approach has also been used in a methodological context, to provide feedback
on what was in people’s minds when they responded to a questionnaire. The argument
for using the think aloud technique was well presented by Laura Desimone and Kerstin
Le Floch in 2004. They showed how it could be used at the piloting stage to improve
questionnaire design. Alice Bell (Bell, 2007) provides an excellent example of why it is
a valuable technique. Her guidance for nurse researchers gives sound advice for anyone
conducting questionnaire research with children and affirms that questionnaires can be
used with children as young as seven. She outlines the strengths of using think aloud
procedures in testing questionnaires and provides two examples of how their use led to
questionnaire improvements:
t where children misinterpreted personal computer as meaning personal, for their use
alone;
t where children did not understand the abbreviation ‘pc’ for personal computer.
These are useful examples because they show that even the smaller things can trip us
up! Amy Homes and Lorraine Murray used the approach in their review of ethnicity clas-
sifications in the Scottish Census (Homes and Murray, 2008). Ethnicity is an important
area for survey designers because, as their report shows, the potential for causing offence
and confusion is great. They asked interviewees a question, allowed them to answer it and
then asked them what was in their minds when they responded. Their report is well worth
reading not just to see the cognitive interview method at work but also to become aware
of how important it is for social researchers to deal sensitively with issues of ethnicity.
Natasha Maybe they just thought, ‘Oh aye, I’ve had enough’ to start with the strong
stuff and then like, maybe take a step back and then start smoking instead
‘cause they’re still getting attention fae [for] it and they feel cool.
Naomi Maybe they think that smokin’s nothing – it’s no’ going to harm you – but it
really does, you can get cancer an’ a’ that. [Continues.]
While paired interviews can deliver excellent data, the method is not without its prob-
lems. These are principally practical in nature. Amongst the problems reported by re-
searchers are:
t distractions if the interview is not isolated from other activities;
t lack of commitment from one of the interviewees;
t an interviewee leaving the session before the interview was completed;
t one interviewee dominating the other;
t unwillingness to reveal feelings on a sensitive issue (which, in one instance, were
revealed in a one-to-one situation).
Notwithstanding these issues, some of which can be minimised by preparing the ground
beforehand, pair interviews have a distinctive place in the suite of methods we can use.
conditions constitutes a valid framework for choosing people, but the selection pro-
cess does not stop there. Should the groups mix social status or gender (there is some
evidence that higher status members and men dominate in group sessions) or should
they have a uniform status? One consequence of the latter is that more group sessions
may be needed.
t Giving thought to the setting: where the group takes place should not advantage any
single member. The location and the setting should be one where everyone feels com-
fortable. What is appropriate for teenagers may not be as suitable for head teachers.
The layout of the space should support interaction and not establish hierarchies. It is
more difficult to speak from the back of a room than the front, so a layout in which
everyone faces inwards is often the best option. It is also useful to offer refreshments
before the session starts as a means of breaking the ice.
t Conditions for success: it is important that the person (or persons) running a group
(usually called the moderator) appreciates that people want to talk. They have chosen
to be there because they have views and opinions. If the moderator does not engage
everyone, those who wanted to speak but did not will be disappointed. For group
members, coming together is a social event, an opportunity to meet people and join
in discussion and the social dimension has to be realised. If members are to move
beyond stating their positions into explaining, justifying and judging others, then
the group has to develop a collective sense of trust. Setting ground rules is a useful
foundation for this – how do people show that they want to speak? When should
they yield the floor? How should aggression be dealt with? Finally, once people have
spoken, they find it easier to speak again. Everyone should speak at the beginning,
during the introduction to the session. A useful starter is for everyone to explain why
they were interested in attending.
t Managing the process: the group moderator is central to achieving a worthwhile out-
come. The moderator sets both the tone and the atmosphere. They have to be neutral,
not passive – dull is not an option! The moderator has to show personality to give life
to the group and to maintain interest and commitment. At the same time, they have
to be a manager, encouraging the more reticent to speak, testing ideas, exploring the
implications of viewpoints and summarising positions.
t Stimulating thinking: it is important to move beyond statements into explanations and
justifications. How to ensure that this happens should be planned before the group
takes place. Questions are an obvious stimulus but pictures, videos and task activities
(for example, drawing tasks) have also been used. A successful focus group moves
beyond answering questions.
t Producing a record: the discussion is the data so we have to make sure that it is recorded.
Audio recording is the minimum (and it is probably best to have two recorders in
case one malfunctions). Video recording is useful to capture the non-verbal com-
munication. The group should have an observer who makes notes during the session
and the moderator and observer should produce reflective reviews at the end of the
session. All are potential sources of data.
Despite their obvious value as a research tool, focus groups have had a limited use in
education research and many of the studies seemed not to exploit their full potential. A
criticism of much current use of focus groups (not just in education research) is that data
collection remains at a surface level, is limited to answering questions in a group setting
and that the potential for interaction within the group to expose the backgrounds to and
values underpinning views and opinions is not realised. Achieving this requires thought
368 Research Methods for Education Part 2 The Process of Data Collection
at the planning stage. It is not difficult to get people to give opinions, it is difficult to help
them articulate why they hold those opinions. How we can do this is shown in the next
part of this section.
There are, however, examples of excellent practice. In 2012 the UK Home Office pub-
lished a report on public attitudes on youth crime. The authors, Jessica Jacobson and
Amy Kirby, used focus groups. They paid particular attention to sampling so that their
results had a wider currency. They chose representative areas and for each group (seven
to ten people) had a specific age and household distribution. They structured the process
with an introductory question (‘What they perceived to be the main problems of youth
crime and anti-social behaviour in their neighbourhoods, the causes of these problems,
and their impact’), followed by an activity (responding to crime scenarios) and a
final section exploring views on restorative and community justice as alternatives to the
criminal justice system. In addition we should note the potential in critical pedagogy and
action research focus group research. Both have a goal of working with people to achieve
change. Action research seeks to improve a process or a situation and hence outcomes.
Teachers frequently use action research to enhance the learning experience of students. The
teacher identifies a baseline of student performance, motivation and attitudes to learning,
varies the teaching and then measures the same indicators again. Critical pedagogy has
the objectives of delivering greater social justice and emancipation. In both cases there is
normally a strong element of empowerment when used in an educational context. Much
of the research in critical pedagogy is informed by the work of radical thinkers, like the
Brazilian Marxist Paolo Freire. Freire developed the concept of ‘conscientisation’, in which
learners are helped to identify the root causes of their social condition (rather than merely
address the symptoms). In this situation a focus group becomes a learning experience
for its members. Freire’s ideas fall solidly within a field that has been called emancipa-
tory learning, the idea that education, knowledge and understanding are fundamental to
overcoming disadvantage and achieving social and economic change. Not all educational
research and development can be mould-breaking in this way but finding things out with-
out there being a purpose behind the research can seem a bit pointless. Case Study 9.5
explores the use of focus groups and other methods to inform policy action.
The context t When was the right time to become sexually active?
t What are the consequences of teenage pregnancy?
The UK has the highest rate of teenage pregnancy in
t What are the barriers to communicating about sex-
Europe. Teenage pregnancy and sexually transmitted
ual health?
diseases (STDs) impact on all of us, whatever our life-
styles. We pay, through taxation, the costs of health t How should services be designed to ensure acces-
care and treatment. We meet, through taxation, the sibility?
support for single person families. Taxation revenues
fall through periods and days lost from work. The ex- These issues were discussed in a variety of ways – in
amples could continue. It is, therefore, in all of our in- pairs with feedback to the group, whole group brain-
terests that we understand behaviours that can lead to storming, and discussion and small group discussion.
teenage pregnancy and STDs. As part of this research The discussions were recorded.
effort the UK Government’s Teenage Pregnancy Re-
search Unit commissioned research on Bangladeshi, The findings
Indian and Jamaican young people. These three
groups were chosen because teenage conceptions The research study was detailed and thorough and the
amongst Jamaican and Bangladeshi young women are summary here cannot do justice to the richness of the
higher than the national average and those for Indian report.
young women are lower. t Cultural factors and family are strong influences
on young people and their attitudes to teenage
The methodology pregnancy.
t Gender differences are apparent in behaviour and
The first stage of the research programme was 75 in- attitudes. Males and females often receive different
depth interviews that established individual stories messages about sex from within the family.
and experiences. The data from the interviews in- t Young people’s attitudes are often more relaxed
formed focus groups whose purpose was to explore than their parents’. Parents feel powerless to main-
implications for service provision and delivery. The tain cultural norms but they used religion, educa-
groups were organised so that young people, parents, tion and social advancement as a means of mini-
service providers and community support organisa- mising shifts in attitude amongst their children.
tions for each ethnic group discussed the issues sepa-
t Knowledge and use of reproductive health services
rately. Ground rules for the group were set out and
was patchy but if they were to be used ethnic role
the purpose of the research and the outcomes of the
models and culture-specific resources were felt to
interviews were presented. The themes explored in the
be important.
groups were:
t How people obtained information about sex and Source: French, R.S. et al., (2005)
relationships.
(a) Consents
People should consent to take part in research programmes. Their consent should be
informed but what does this actually mean? Certainly they should be aware of and
370 Research Methods for Education Part 2 The Process of Data Collection
Table 9.6 Guidance on Interviewing Children (after Davies and Westcott, 1999)
understand the purpose of the research. They should know why they particularly have
been chosen. They should know how their information will be used and how identifi-
able and locatable it will be. They should know the limits of their involvement – will
they see research results, will they be able to comment upon the analysis and conclu-
sions and, if necessary, be able to refute them or withdraw their contribution? All of
this is fairly standard, until the last point. Expert or key informant, or what are called
elite interviews (talking to a head of education, or someone with a high profile and
considerable authority and responsibility), pose particular issues. Depending on the
issue, it might be quite easy to identify respondents, despite any guarantee of anonym-
ity that might have been given. If the research output is critical of someone’s role or
contribution or is blamed for an action or an outcome, should there be a power of
veto? How important is the research relative to someone’s position, prospects or even
job security?
From another perspective, we have to be sure that someone is able to give informed
consent. With children the situation is quite clear. Up to a legally stated age (usually 18
Chapter 9 TALKING, LISTENING AND WATCHING: OTHER APPROACHES TO DATA COLLECTION 371
(b) Disclosure
Confidentiality is a normal condition under which people give us data. Normally main-
taining confidentiality is not a problem but what happens in a focus group? While we
can recruit participants on the basis that what is said in the group is confidential to
the group, not everyone might subscribe to the same ethical standards. Imagine a fo-
cus group to consider sexual identity with young adolescents in a school. Of course
the researcher has to reinforce the confidentiality message, even to the extent of getting
participants to sign a non-disclosure agreement, but what if one of the participants talks
about the experience outside the group and identifies one of the group members as ad-
mitting to being gay. Should the researcher take it up with the pupil who has disclosed
the information (over whom they have no authority) or with the head teacher, thereby
breaching confidentiality? Should the pupil be reprimanded or punished? Acting after
the event is always difficult. It may be that a better approach would be to role-play the is-
sue (and others) with the group before the focus group discussion. That would represent
a commitment to the duty of care to the participants and could well shape attitudes to
the importance of sticking to an agreement.
There is another situation in which confidentiality is a challenging issue and it too
involves a conflict with a duty of care. Researchers can learn a great deal of sensitive
information (which may not even be relevant to the investigation) especially in one-
to-one interviews. What should a researcher do, having agreed to treat what is said
in confidence, if they learn that the respondent is self-harming or that another pupil
is carrying a knife? Do these constitute circumstances where the confidence can be
broken? The first course of action would be to try to get the interviewee to seek help
but if this is not successful then, on balance, acting to secure the safety of the inter-
viewee or the wider community delivers a benefit that exceeds the cost of a broken
commitment.
372 Research Methods for Education Part 2 The Process of Data Collection
(c) Risk
Interviewing can be risky. Going to people’s houses and meeting people on estates is a
necessary part of the data collection process if, for instance, we are researching aspects
of social exclusion. The leader of a research team has a responsibility to look after its
members. As a minimum, we should know where members are going when they are
off-site. There should be procedures that require them to phone in before and after
a meeting. They should normally visit in pairs. Again, the ethics require planning to
foresee problems and to produce solutions to situations we hope will never occur.
In an interesting experiment, Pellico et al. (2009) describe how they took nursing stu-
dents to an art gallery to assess, test and develop their observational skills. The whole
exercise took 90 minutes, of which the students spent 10 examining and sketching a
picture. They then viewed patient photographs and were asked to record their assessment
of clinical symptoms. The same photographs were given to a control group of nurses
who had not been on the art gallery visit. The results of their experiment were that those
students whose observation skills had been developed by the gallery visit made more
observations on the photographs, made more plausible clinical assessments and had a
greater range of diagnoses. The issue for the researcher using observation as a data collec-
tion technique is how to develop the skill of looking (and thinking while you look) so
that most of the rich and relevant detail is recorded. Janesick (2011) has some answers
to this with her exercises observing and describing a still life scene, exercises on observing
people and their behaviours, drawing upside down and developing the skill of reflection.
To summarise, we get better at observation by knowing and understanding our topic and
through practice. There is no short cut.
While observing, we need to record what we have seen, heard, smelt and touched. In
structuring a recording procedure, we have to make two decisions. First, what is it that
we are going to record? Second, at what point do we structure and code our observation?
as well. They found that 30% of teaching time was ‘traditional’, where the emphasis
was on knowledge transmission, 5% on developing independent learning (what they
called process orientated teaching) and 43% on engaging actively with the students
(which they called activating teaching).
t Verbal interaction is important both in terms of the content and the character of the
communication and between whom the communication occurs. Communication data
can convey information about attitude, mood and status, it can reflect interests and
concerns and it can be used to interpret actions and behaviour. Talk can be conversa-
tional, discursive, explanatory, persuasive, confrontational and positional in character.
A group of Dutch medical educators conducted an interesting study on verbal interac-
tion (Visschers-Pleijers et al., 2006). Their interest was in a model of learning that is
extensively used in medical education, problem-based learning. The concept behind
this is that being confronted with a problem creates a need to identify specific knowl-
edge and need-driven knowledge is more effectively anchored in the memory than
knowledge acquired through transmission, for example in lectures. In problem-based
learning, new knowledge is assembled and combined with existing knowledge to solve
problems. As a method of learning, it is argued that it is a better means of ensuring
transference than classroom learning. What the Dutch team did was observe the verbal
interactions of four seminar groups with the object of assessing how far the students’
time was spent on learning activities, especially those that advanced their ability to col-
laborate in resolving problems – a skill we should all value in our doctors. They found
that 80% of the time was spent on learning interactions, with cumulative reasoning
accounting for 63% of the time alone. This simple approach throws light on the effec-
tiveness of the learning process. Observation can be extensively used in classroom and
playground studies, decision-making meetings and social interactions.
t Image (presentational data) is an object, experience and context through which people
present themselves to the outside world. Such data include what we own, what we
choose to use, what we consume, how we dress, how we travel, where we travel, how
we spend our time. How we choose to use our time and money says something about
our values, our status and even our aspirations. If we were to think that this is not rel-
evant for children, then the research by a team led by a British sociologist, Christopher
Pole, into ‘pester power’ should make us think again. Their findings (summarised on
the Cultures of Consumption website at http://www.consume.bbk.ac.uk/researchfindings/
newconsumers.pdf) show that (i) even for young people fashion is a marker of identity,
(ii) children as young as six understood ‘pop’ culture and (iii) that access to ‘labels’
influenced being excluded or included. Pole describes and comments upon the methods
used in the research elsewhere (Pole, 2007). The research team drew together data from
discussions with and questions to young children, children’s drawings and observation,
such as wardrobe audits. The context of his commentary on the research approach is
the move away from neutral observation to understanding the lifeworld and lived ex-
periences of children. This makes the child much more of an active participant in the
research process and brings children into much closer contact with the researcher. As
Pole says, ‘Social research with children is now conducted within a highly charged con-
text’ (p. 70) and, when the research theme is how children wish to present themselves,
clothe themselves and represent their bodies, discussions about parts of the body and
how they are displayed raise concerns about how far legitimate research can go before it
crosses a boundary of more general social concern for child welfare. Yet the issues with
which Pole is concerned, children’s consumption and their use of their own bodies, exist
in other areas of interest to education, not the least sexual identity, sexual awareness
376 Research Methods for Education Part 2 The Process of Data Collection
and the whole area of sex education. Understanding the lived-in world of children and
young people poses significant ethical and methodological problems for educational
researchers.
Table 9.7 Part of the coding framework used in a study of teaching and learning practice
Source: from ‘Towards Self-Directed Learning in Secondary Schools: What Do Teachers Do?’ Teaching and
Teacher Education, 17(7), 837-55 (Bolhuis, S. and Voeten, M. 2001), copyright 2001, with permission from Elsevier.
we develop our own framework or use someone else’s we should always be able to
(a) identify and (b) justify criteria that underpin the observation process. England had
a system of publicly-funded subject centres to bring together subject specialists in teach-
ing and learning at university level. Escalate, the subject centre for education (which
covers both teacher training and researching and studying education) had a checklist
of criteria that underpin ‘good practice observation’ and which is applicable to many
models of teaching observation at all levels (see Table 9.8). The key elements of this are:
t there should be an introduction that sets out what is to be achieved and a conclusion
that summarises what has been achieved;
t the lesson should be planned and not be an ad hoc reaction to circumstance;
378 Research Methods for Education Part 2 The Process of Data Collection
One of the concerns of educationalists is that the basis curriculum, quality of leadership and management
on which schools are publicly judged can distort the and overall effectiveness. Your task is to:
education that is offered to children and work to their
1. Consider how far the school’s observation lesson
disadvantage. Once standards are established against
observation criteria are designed to meet the Of-
which schools are assessed, head teachers are thought
sted subject provision criteria for teaching qual-
to shape the way teaching is provided and pupils are
ity. Look first to see whether the same words are
managed in order to demonstrate that the school
used and then look at the sense of each text to
meets (or exceeds) those standards. But is this actu-
see whether there is any conflict between them.
ally the case? In this activity you have two pieces of
2. Think about good and bad lessons that you have
evidence. One, on the left in the table below, are the
experienced. What made them good or bad? Are
criteria used by a school for lesson observation. This is
those factors reflected in either set of criteria?
the practice in which, usually, a senior or experienced
3. Consider the match between the school and Ofsted
member of staff observes the teaching of colleagues.
criteriain the following situations:
Often this is a subject expert such as a head of depart-
ment. Sometimes lessons are observed by a member of (a) lesson in a school that selects only the bright-
senior management, especially if there is an external est pupils
inspection due or if there are concerns about the com- (b) a class which has a wide range of abilities
petency of the teacher. On the right are the criteria (c) a class where 30% of the pupils have only
used by Ofsted (the quality assurance organisation for been in the country for less than two years
schools and colleges in England) to assess and grade and whose command of the language is
teaching quality in a subject (each subject has, in ad- limited.
dition, specific quality criteria). Ofsted’s approach is to
apply generic and subject specific criteria to each sub- How far will both the criteria cope with these situa-
ject in judgements of achievement, teaching quality, tions?
t Lesson plan has clear Outstanding (1) Much of the teaching in the subject is outstanding and
goals which are never less than consistently good. As a result, almost all
achieved. pupils are making rapid and sustained progress. All teach-
t Teacher shows good ers have consistently high expectations of all pupils. Draw-
subject knowledge and ing on excellent subject knowledge, teachers plan astutely
understanding. and set challenging tasks based on systematic, accurate
t Pupils are engaged assessment of pupils’ prior skills, knowledge and under-
with subject. standing. They use well judged and often imaginative
t Activities reflect indi- teaching strategies that, together with sharply focused
vidual pupil needs. and timely support and intervention, match individual
t Variety of teaching needs accurately. Consequently, pupils learn exceptionally
methods to enable all well. Teaching promotes pupils’ high levels of resilience,
pupils to learn effec- confidence and independence when they tackle challeng-
tively. ing activities. Teachers systematically and effectively check
t All pupils are stretched. pupils’ understanding throughout lessons, anticipating
t Assessment records are where they may need to intervene and doing so with no-
complete and inform table impact on the quality of learning. Time is used very
pupil development. well and every opportunity is taken to successfully develop
t Individualised Educa- crucial skills, including being able to use their literacy
tion Plans are referred and numeracy skills. Appropriate and regular homework
to during the lesson. contributes very well to pupils’ learning. Marking and con-
t Support staff and structive feedback from teachers and pupils are frequent
resources are effectively and of a consistently high quality, leading to high levels of
used to support learning. engagement and interest.
V
Chapter 9 TALKING, LISTENING AND WATCHING: OTHER APPROACHES TO DATA COLLECTION 379
t Challenging behaviour Good (2) As a result of teaching that is mainly good, with examples
is managed effectively. of outstanding teaching, most pupils and groups of
t Pupils are encouraged pupils, including disabled pupils and those who have
to reflect and helped to special educational needs, are achieving well in the
become independent subject over time. Teachers have high expectations of
learners. all pupils. Teachers use their well developed subject
t Pupils make progress knowledge and their accurate assessment of pupils’ prior
over time. skills, knowledge and understanding to plan effectively
t Pupils enjoy their work and set challenges.
and show a good at-
Satisfactory (3) Teaching results in most pupils, and groups of pupils,
titude.
currently in the school making progress in the subject
broadly in line with that made by pupils nationally with
similar starting points. There is likely to be some good
teaching and there are no endemic inadequacies across
year groups or for particular groups of pupils. Teachers’
expectations enable most pupils to work hard and
achieve satisfactorily and encourage them to make pro-
gress. Due attention is often given to the careful assess-
ment of pupils’ learning but this is not always conducted
rigorously enough and may result in some unnecessary
repetition of work for pupils and tasks being planned
and set that do not fully challenge. Teachers monitor
pupils’ work during lessons, picking up any general
misconceptions and adjust their plans accordingly to
support learning. These adaptations are usually successful
but occasionally are not timely or relevant and this slows
learning for some pupils. Teaching strategies ensure that
the individual needs of pupils are usually met. Teachers
carefully deploy any available additional support and set
appropriate homework and these contribute reason-
ably well to the quality of learning for pupils, including
disabled pupils and those who have special educational
needs. Pupils are informed about the progress they are
making and how to improve further through marking
and dialogue with adults that is usually timely and en-
couraging. This approach ensures that most pupils want
to work hard and improve.
V
380 Research Methods for Education Part 2 The Process of Data Collection
Intellectual stimulation
The class is conducted in a constructive learning environment in which the material of the lesson challenges the
students and encourages them to develop their skills and move beyond their existing levels of understanding.
Student participation
Students have opportunities to participate in the learning process in an active way that promotes their un-
derstanding, which gives them an opportunity to ask questions and relate the material to their own learning
development.
Use of appropriate learning resources
Which learning resources are appropriate depends very much on the topic being taught. Examples include:
t texts
t video and film
t visual material such as posters, pictures, diagrams, samples
t computer-based learning packages
t multi-media packages.
Good teaching does not necessarily involve high-tech equipment, more important is having the right stimulation
for learning. The students’ own knowledge and experience are often the most valuable resources.
Use of accommodation and equipment
t Best use is made of the accommodation and equipment available.
t Suitability of the room/lab/studio for the type of learning activity being undertaken.
t Seating arrangements are effective.
t Any health and safety issues have been identified and dealt with.
t the teaching approach should enable students to achieve what is to be achieved and
should actively involve students;
t the teacher knows the material;
t the material should challenge the students;
t the teacher should have good communication skills and should have a rapport with
the students.
How are criteria such as these translated to the observation form? They are a complex
set of ideas and having a separate space for each criterion is likely to result either in some
being omitted or a failure to enter a further record on sections already completed while
wanting to see if there is anything that can be said about an incomplete section. The
Education Department at the University of Exeter has an interesting solution, half-way
between a pre-code and a post-code (see Figure 9.2), with observations timed for at least
382 Research Methods for Education Part 2 The Process of Data Collection
Observer: Trainee:
Area of Learning/Topic:
Number of Pupils:
- High expectations
- Respect and consideration
- Promote positive values
- ELG/NC/NLS/NNS links
- Personal Knowledge and
understanding
- Monitoring strategies
- Constructive feedback
- Pupils
evaluation/reflection
Identification of needs
every three to five minutes and a set of prompts to inform observation by reminding the
observer of what to look for.
The alternative to devising a customised pre-coded observation form is to use one that
already exists. Because so much observation research with children is about capability in
social situations, many coding systems have been devised. It is impossible to list them
all but Kenneth Merrell, a leading American educational psychologist, identifies six that
have a general application (Merrell, 2008, p. 62):
t Child Behaviour Checklist – a rating scale for use in school.
t Behaviour Coding System – to analyse coercive and aggressive behaviour in school.
t Social Interaction Scoring System – for general social behavioural problems.
t Family Interaction Code – for home observation.
t Child’s Game/Parents’ Game – observation in 30 second units in a research setting of
child/parent interaction.
t Teacher Behaviour Code – to observe changes in parent teaching behaviour after training.
Another framework with a general application is the Social Interactive Coding System
developed, in 1990, by a team led by Dr Mabel Rice of the University of Kansas’ Language
Acquisition Studies Lab and used to measure the relationship between language and so-
cial interaction for children with impaired language capability (see Rice et al., 1990).
384 Research Methods for Education Part 2 The Process of Data Collection
All of these can be used by education researchers but they are only a few of the many
that are available. The Fast Track website provides a listing of many of these. (Fast Track
is an action-based project in the USA whose purpose is to research and then intervene to
prevent serious anti-social behaviour amongst adolescents.)
There are fewer coding systems in the public domain with an explicit educational ap-
plication but we should be aware of the Flanders Interaction Category framework (see
Table 9.9) because it is used to codify interactions between pupil and teacher in a teach-
ing and learning setting. The framework is based on a teaching and learning model that
engages the pupil. At the heart of the observation schedule are processes of initiating
action and responding. It was developed by an American educationalist, Ned Flanders,
in 1970 and used between 2001 and 2003 by Dr Mark Newman when at Middlesex
University (now at the University of London Institute of Education) for a study into
problem-based learning in nursing education. It has ten data coding categories, seven
relate to the teacher, two to students and one to silence. Data is recorded in these catego-
ries on a systematic basis (for example, every minute). This allows a quantitative assess-
ment of the nature of the interaction to be built. Newman used the Flanders coding to
understand how teacher and pupils worked in problem-based learning (where a prob-
lem determines the learning need). The results of observing two lessons, one a lecturing
approach and the other a problem-based learning approach, are shown in Figure 9.3
(Newman, 2004). The contrast is obvious. Pupil talk and response accounts for about
20% of the lecture-based lesson and about 66% of the problem-based lesson.
This ability to structure observation is important because, as we saw with Newman’s
work, it can generate quantifiable data. To this extent, pre-structured observations are
analogous to closed questions since both generate counts of data. In the same way, open
questions and post-observation coding have much in common too. Both are essentially
inductive, allowing the pattern to emerge after the response or the observation, and both
are particularly useful at an early stage of investigation when instruments and approach-
es are being tested. When observations are post-coded, the procedures common to all
observational approaches assume a great significance. All detail, not just apparently sig-
nificant events, should be recorded. The consequence of post-coding observations is that
we shall collect a great deal of data that subsequently we will not use. Because of this, the
process of sifting the data to pick out the nuggets that help resolve our research issue is
very important. (We shall look at how we can do this in Chapter 11.)
Response activity Accepts and clarifies an attitude or the feeling Pupil-talk – response.
or tone of a pupil in a non-threatening manner.
Praises or encourages.
Accepts or uses pupils’ ideas.
Asks questions.
Initiating activity Lecturing.
Pupil-talk – initiation.
Giving directions.
Criticising.
Silence or confusion
Chapter 9 TALKING, LISTENING AND WATCHING: OTHER APPROACHES TO DATA COLLECTION 385
Silence
Praises or encourages
Pupil talk – initiate
Accepts or uses idea
Pupil talk – response
Asks questions
Lecturing
Silence
Accepts or
uses idea
Asks
questions
Lecturing
Pupil talk
– initiate Giving directions
Pupil talk
– response
The nature of work, the pace of business evolution on a sessional contract. Your approach is to observe
and the pace of life itself are all changing rapidly. on-the-job training and to ask questions. Permission
The skills we acquired when we were young may not for the research has been granted by the chief ex-
be those we require now and are likely to be irrel- ecutive and you are carrying out your research at a
evant at some point in the future. Upskilling is in the branch plant. So with the necessary permissions in
interests of both employee and employer. Enlight- place, you approach employees. One after another
ened employers recognised that any learning activity they tell you that they are unavailable or ask you to
promotes an interest in learning that could lay the come back later. You decide to talk with the site man-
foundations for significant skill and capability devel- ager (with whom you have already spoken), only to
opment. The vehicle maker Ford established EDAP find that he too is ‘unavailable’. You go back to the
(Employee, Development and Assistance Programme) head office and shortly after receive a call to meet
in the late 1980s. Ford’s liberal interpretation of with the site manager. At that meeting, you are given
learning included (and still includes) golf lessons a proposition – speak to the people identified by the
and flower arranging as well as programmes with a manager and report back on what you find out. Are
more direct benefit to the company, such as language you being asked to spy?
training and technical updating. It became a model How would you respond, knowing that your research
followed by many other employers. career depended on getting good data? It is easy to
But what of an organisation whose view is more short be sanctimonious when it is not your future at stake!
term? You are researching the character of learning If you would like to find out what really happened
and training in an organisation and are particularly (plus examples of other moral dilemmas faced by
interested in how the system set up by the organi- researchers) read the paper, ‘A Different Kind of
sation manages personal and organisational goals. Ethics’ by Jason Ferdinand and others (Ferdinand
Some of the workforce is employed on a full-time et al., 2007).
contract and others are employed on a regular basis
came into a shop. Because of the number of people involved, they followed every
third shopper. This approach, however, was costly and was revised to be every third
shopper who was dressed in clothes in which goods could be concealed. There was
one discretionary element for observers. If the third person was not dressed in con-
cealing clothes but exhibited typical shoplifting behaviour within 15 seconds (such
as actively looking for employees and surveillance systems, fidgeting with packag-
ing), they could be followed. What the researchers noted was that after the revised
shopper selection process was introduced, the proportion of African-Americans and
Hispanics in the sample increased. The conclusion reached by the research team was
that social stereotyping was creating observational bias. This is an issue for educa-
tional researchers to be concerned about and we should reflect on our data to see if
it is present.
t The third area of bias we should be aware of is partisanship and over-identification. If
we are observing in conflict or choice situations, then it is difficult to remain neutral
if we wish to understand the processes and experiences involved (see Case Study
9.7 for an example of this). The implication is that if we want to reach this level of
understanding we have to take sides. While this immediately introduces bias (after all,
the perspective is one-sided), there must also be a concern that identification with the
group might lead to a potential (and not an impartial) representation of the group to
the academic community and society. Now the situations where this might arise in an
educational setting are probably few and far between but they will exist. It might occur
if we are studying gang formation, cultures and conflicts. Alternatively, we might be
looking at an attempt by a university to renegotiate the staff employment contract
and wish to understand the tactics of either university management or unions, or
by school governors in England to take a decision to change from local education
authority control.
Summary
t Data collection is part of a whole system of research. Decisions about data collection
cannot be taken in isolation, they have to be taken in conjunction with decisions
about research objectives, selection of sources and data analysis. Each impacts on
the other. In this section we have learnt about some techniques of data collection –
how to do it. At this point we need to stand back and see a bigger picture.
t Data collection is a rigorous process. While flexibility is important, unless data collec-
tion is a structured activity, the data collected are likely to be worthless.
t Data are not confined to words. Images, behaviour, possessions are all potentially
useful.
t Data collection methods are not mutually exclusive. Interviews, focus groups and
observation perform different tasks and work well together.
t The methods we have looked at are invasive. Because of this ethical issues arise.
While principles (such as those outlined in Chapter 3) can guide us on many ques-
tions, there are some situations that will require others to help us reach a judge-
ment. It is the moral responsibility of researchers to know when further guidance
should be sought.
t Now you are at the end of Part Two of this text, you know about:
t primary and secondary data sources;
t how you should use literature throughout your research programme;
t how data selection and sampling are fundamental to the quality of your research;
t the need to conceptualise your research problem; and
t how you can collect data using questionnaires, interviews and observation.
You are, in other words, well-prepared to go out and collect your data. How you pro-
cess the results is what we shall look at in Part Three.
Further reading
Greenbaum – reaching beyond the stage of asking look at a specialist text. Sarah Pink’s book is the pick
questions by getting the group to think in different of the bunch. She deals with photographic and video
ways about the issue. Greenbaum looks at projective output from the perspectives of their production
techniques (for example, associating the issue with an and interpretation and non-visual ways of represent-
animal or a car) and provoking techniques (where we ing findings. Throughout there is a strong theme
push beyond the first response) (see Chapter 8). of philosophical, ethical and moral issues that are
embedded in the use of visual data in a text-based
On-line research
intellectual world.
Fielding, N., Lee, R.M. and Blank, G. (eds) (2008) The
Sage Handbook of Online Research Methods, Sage, London. Metaphor analysis
Hine, C. (ed.) (2005) Virtual Methods: Issues in Social Schmitt, R. (2005) ‘Systematic Metaphor Analysis as
Research on the Internet, Berg, Oxford. a Method of Qualitative Research’, The Qualitative
The Internet will soon become an established means of Report, 10(2), 358–394.
collecting data. We have protocols for sampling and The interpretation of deeper meanings is known as
interviewing in circumstances that we have consid- ‘metaphor analysis’. The idea that what we say and
ered here but what we need is something similar, that produce can reveal deeper insights about us has a
other researchers regard as good practice, for when we long tradition in the arts and philosophy but, for
obtain our data via the Internet. social researchers, the foundations for a research ap-
The collection of papers edited by Fielding et al. is proach were laid by two Americans, a linguist, George
much more of a heavyweight text than Hine’s. For Lackoff, and a philosopher, Mark Johnson, in their
the researcher wanting to work through the Internet, book Metaphors We Live By (1980, University of Chi-
there are sections on designing Internet research, data cago Press). Translating their ideas into operational
capture, Internet survey, virtual ethnography and the research practice with rules and procedures remains
Internet as an archival source. an issue for qualitative researchers. Rudolph Schmitt
has begun to do this.
Image-based research
Pink, S. (2007) Doing Visual Ethnography, Sage, London.
We touched on the use of image-based data but before
making use of this sort of material, it is important to
References
Babad, E. (2005) ‘Non-verbal Behaviour in Education’, Bolhuis, S. and Voeten, M. (2001) ‘Towards Self-Directed
Chapter 7 in Harrigan, J., Rosenthal, R. and Scherer, Learning in Secondary Schools: What do Teachers
K. (eds) A Handbook of Non-verbal Behaviour Research Do?’ Teaching and Teacher Education, 17(7), 837–855.
Methods in the Affective Sciences, Oxford University Cameron, C. (2007) ‘Understanding of Care Work With
Press, New York. Young Children: Reflections on Children’s Independ-
Ballinger, G. (2003) ‘Bridging the Gap Between A Level ence in a Video Observation Study’, Childhood, 14(4),
and Degree: Some Observations on Managing the 467–486.
Transitional Stage in the Study of English Literature’, Cater, J. (2011) ’SKYPE – A Cost-effective Method for
Arts and Humanities Higher Education, 2(1), 99–109. Qualitative Research’, Rehabilitation Counsellors &
Bell, A. (2007) ‘Designing and Testing Questionnaires Educators Journal, 4(2), 3–4.
for Children’, Journal of Research in Nursing, 12(5), Catterall, M. and Ibbotson, P. (2000) ‘Using Projective
461–469. Techniques in Education Research’, British Educational
Berkowitz, E. and Cichelli, T. (2004) ‘Metacognitive Research Journal, 26(2), 245–256.
Strategy Use in Reading of Gifted High Achieving and Chan, K. (2006) ‘Exploring Children’s Perceptions of
Gifted Underachieving School Students in New York Material Possessions: a Drawing Study’, Qualitative
City’, Education and Urban Society, 37(1), 37–57. Market Research, 9(4), 352–366.
392 Research Methods for Education Part 2 The Process of Data Collection
Cheeseman, J. and Clarke, B. (2007) ‘Young Chil- Web-Based Diaries for Data Collection with Par-
dren’s Accounts of Their Mathematical Thinking’, ticipants with Social Communication Difficulties’,
Mathematics: Essential Research, Essential Practice, International Journal of Interdisciplinary Social Sciences,
Volume 1, Proceedings of the 30th Annual Confer- 4(1), 181–198.
ence of the Mathematics Education Research Group Hollway, W. and Jefferson, T. (2008) ‘Researching De-
of Australia, 192–200, available at www.merga.net.au fended Subjects with the Free Association Narrative
(accessed January 2008). Interview Method’, Chapter 5 in Given, L. (ed.) The
Dabney, D., Dugan, L., Topalli, V. and Hollinger, R. SAGE Encyclopaedia of Qualitative Research Methods,
(2006) ‘The Impact of Implicit Stereotyping on Sevenoaks, California, 296–315.
Offender Profiling: Unexpected Results from an Homes, A. and Murray, L. (2008) ‘Cognitive Question
Observational Study of Shoplifting’, Criminal Justice Testing Scotland’s Census Ethnicity Classification’,
Behaviour, 33(5), 646–674. Scottish Government Social Research, available at
Davies, G. and Westcott, H. (1999) ‘Interviewing Child www.scotland.gov.uk/socialresearch.
Witnesses under the Memorandum of Good Practice: Hurworth, R. (2004) ‘Telephone Focus Groups’, Social
A Research Review’, Police Research Series, Paper 115, Research Update, 44, University of Surrey.
Home Office. Irenka Suto, W.M. and Greatorix, J. (2008) ‘What Goes
Desimone, M. and Le Floch, K.C. (2004) ‘Are We Asking Through an Examiner’s Mind? Using Verbal Protocols
the Right Questions? Using Cognitive Interviews to to Gain Insights into the GCSE Marking Process’, Brit-
Improve Surveys in Education Research’, Educational ish Educational Research Journal, 34(2), 213–234.
Evaluation and Policy Analysis, 26(1), 1–22. Jacobson, J. and Kirby, A. (2012) ‘Public Attitudes to
Drury, J. and Stott, C. (2001) ‘Bias as a Research Strategy Youth Crime: Report on Focus Group Research’,
in Participant Observation: the Case of Intergroup Occasional Paper 105, Home Office.
Conflict’, Field Methods, 13(1), 47–67. Janesick, V. (2011) ‘Stretching’ Exercises for Qualitative
Duveen, G. (2000) ‘Piaget Ethnographer’, Social Science Researchers, Sage, London.
Information, 39(1), 79–97. King, R. (1978) All Things Bright and Beautiful: A Socio-
Ells, H. (2001) ‘Talking Pictures in Working School logical Study of Infants’ Classrooms, Wiley, Chichester.
Lunches: Investigating Food Choice With Chil- King, R. (1984) ‘The Man in the Wendy House: Research-
dren and Adolescents’, British Food Journal, 103(6), ing Infants’ Schools’, Chapter 5 in The Research Process
374–382. in Educational Settings: Ten Case Studies, Burgess, R.G.
Estell, D.B., Farmer, T.W., Cairns, R.B. and Cairns, B.P. (ed.), Falmer Press, London.
(2002) ‘Social Relations and Academic Achievement Kirkpatrick, M.G. (1917) The Rural School From Within,
in Inner-City Early Elementary Classrooms’, Inter- J.P. Lippincott, Philadelphia and now available from
national Journal of Behavioural Development, 26(6), ERIC (ED392563).
518–528. Kyronlampi-Kylmanen, T., and Maatta, K. (2011) ‘Using
Ferdinand, J., Pearson, G., Rowe, M. and Worthington, F. Children as Research Subjects: How to Interview a
(2007) ‘A Different Kind of Ethics’, Ethnography, 8(4), Child Aged 5 to 7 years’, Educational Research and
519–543. Reviews 6(1), 87–93.
French, R.S. et al. (2005) Exploring the Attitudes and Labaree, R. (2002) ‘The Risk of “Going Observational-
Behaviours of Bangladeshi, Indian and Jamaican Young ist”: Negotiating the Hidden Dilemmas of Being an
People in Relation to Reproductive and Sexual Health, a Inside Participant Observer’, Qualitative Research,
report for the Teenage Pregnancy Unit NO. RW53, 2(1), 97–122.
available at www.dfes.gov.uk/research/data/upload- Leach, F. and Sitaram, S. (2007) ‘Sexual Harassment and
files/RW53.pdf (accessed January 2008). Abuse of Adolescent Schoolgirls in South India’, Edu-
Heinonen, T. and Cheung, M. (2007) ‘View from the cation, Citizenship and Social Justice, 2(3), 257–277.
Village: Photonovella With Women in Rural China’, Lewis, A., and Porter, J. (2004) ‘Interviewing Children
International Journal of Qualitative Methods, 6(4), and Young People with Learning Disabilities:
35–52. Guidelines for Researchers and Multi-professional
Highet, G. (2003) ‘Cannabis and Smoking Research: In- Practice’, British Journal of Learning Disabilities,
terviewing Young People in Self-Selected Friendship 32(4),191–197.
Pairs’, Health Education Research, 18(1), 108–118. McDonald, S. (2005) ‘Studying Actions in Context: A
Hinchcliffe, V. (2009) ‘Internet Mediated Research: The Qualitative Shadowing Method for Organisational
Viability of Synchronous Online Interviewing and Research’, Qualitative Research, 5(4), 455–473.
Chapter 9 TALKING, LISTENING AND WATCHING: OTHER APPROACHES TO DATA COLLECTION 393
Merrell, K.W. (2008) Behaviour, Social and Emotional Rice, M.L., Sell, M.A. and Hadley, P.A. (1990) ‘The Social
Assessment of Children and Adolescents, Lawrence Interactive Coding System (SICS): An On-line, Clini-
Erlbaum, New York. cally Relevant Descriptive Tool’, Language, Speech and
Ministry of Justice (2011) Achieving Best Evidence in Hearing Services in Schools, (21), 2–14.
Criminal Proceedings: Guidance on Interviewing Victims Robinson, V. and Segrott, J. (2002) Understanding the
and Witnesses, and Guidance on Using Special Measures. Decision-Making of Asylum Seekers, Home Office Research
Crown copyright. Produced by the Ministry of Justice, Study 243, available at http:// www.homeoffice.gov.uk/
available at www.justice.gov.uk/downloads/victims- rds/pdfs2/hors243.pdf (accessed January 2009).
and-witnesses/vulnerable-witnesses/achieving-best- Rodgers, D. (2007) ‘Joining the Gang and Becoming a
evidence-criminal-proceedings.pdf, accessed January Broder: The Violence of Ethnography in Contempo-
2014. rary Nicaragua’, Bulletin of Latin American Research,
Newman, M. (2004) Problem Basing Learning: An Explora- 26(4), 44–61.
tion of the Method and Evaluation of its Effectiveness in Rosenthal, R. and Jacobsen, L. (1996) ‘Teachers’
a Continuing Nursing Education Programme, Middlesex Expectancies: Determinants of Pupils’ IQ Gains’,
University, London. Psychological Reports, 19, 115–118.
Pellico, L., Friedlaender, M. and Fennie, K. (2009) ‘Look- Schmitt, R. (2005) ‘Systematic Metaphor Analysis as
ing Is Not Seeing: Using Art to Improve Observa- a Method of Qualitative Research’, The Qualitative
tional Skills’, Journal of Nursing Education, 48(11), Report, 10(2), 358–394.
648–653. Sparling, J., Wilder, D., Kondash, J., Boyle, M. and
Persson, G. (2005) ‘Developmental Perspectives on Compton, M. (2011) ‘Effects of Interviewer Behav-
Personal and Aggressive Motives in Pre-Schoolers’ iour on Accuracy of Children’s Responses’, Journal of
Peer Interactions’, International Journal of Behavioural Applied Behaviour Analysis, 44(3), 587–592.
Development, 29(1), 80–91. Suto, I. and Greatorix, J. (2008) ‘What Goes Through
Pole, C. (2007) ‘Researching Children and Fashion: An an Examiner’s Mind? Using Verbal Protocols to
Embodied Ethnography’, Childhood, 14(1), 67–84. Gain Insights into the GCSE Marking Process’,
Polite, V., McClure, R. and Rollie, D. (1997) ‘The Emerg- British Educational Research Journal, 34(2),
ing Reflective Urban Principal’, Urban Education, 213–233.
31(5), 466–489. Thomas, L. and Beauchamp. C. (2011) ‘Understanding
Power, R. (1989) ‘Participant Observation and its Place New Teachers’ Professional Identities Through
in the Study of Illicit Drug Abuse’, British Journal of Metaphor’, Teaching and Teacher Education, 27 (4),
Addiction, 84, 45–52. 762–769.
Prosser, J. and Burke, C. (2011) ‘Image-Based Educa- Visschers-Pleijers, A., Dolmans, D., Leng, B., Wolfhagen,
tional Research: Childlike Perspectives’, Learning I. and Vleuten, C. (2006) ‘Analysis of Verbal Interac-
Landscapes, 4(2), 257–274 tions in Tutorial Groups: A Process Study’, Medical
Punch, S. (2002) ‘Interviewing Strategies With Young Education, 40(2), 129–137.
Adults: The “Secret Box”, Stimulus Material and Task- Weatherford, J.M. (1986) Porn Row, Arbor House,
based Activities’, Children and Society, 16(1), 45–56. New York.
Reid, C., Tom, A. and Frisby, W. (2006) ‘Finding the Whyte, W. (1943) Street Corner Society: The Social Struc-
“Action” in Feminist Participatory Action Research’, ture of an Italian Slum, University of Chicago Press,
Action Research, 43(3), 315–332. Chicago.
Data Data
Collection Analysis
Qualitative
procedures
(generic, grounded
theory, discourse,
others)
Chapter 11
Visual description
and analysis
Research
DATA Chapter 13 Research Strategies
problem
Chapter 14
Stat isticaI
description
Chapter 12
Stat isticaI
testing
Chapter 13
Part 3
In this final part of the text we turn to how we that can get through. While we are not yet at
can process our data. The object of the exercise the stage of breaking down our data on the
is to get our data to release the information basis of particle size, we should remember that
we need to answer our research question. As we are often dealing with both the big picture
we shall see, there are many approaches open and smaller scale effects. As an example, if our
to us (and even the ones we consider here are research were into classroom behaviour, one
far fewer than the whole range). It is important of our findings might be that many teachers
that we are familiar with many methods and would not confront what they considered to be
confident about using them because one of the unacceptable behaviour. If we follow up on this
research skills we should develop is the ability by asking ‘Why?’, we might find that the head
to construct an analytical package that does teacher was ineffective in dealing with children
what we want, dovetails with our methods of sent to him for admonishment or punishment.
data collection and is informed by any philoso- We might subsequently find that there was a
phies and paradigms we are working within. failure in leadership but that this could be par-
tially accounted for by the fact that the number
How should we look at data analysis? When of pupil exclusions was reflected in school per-
we get into the details of the methods, we will formance measures and that school governors
see that there are procedures we should fol- wanted to improve the school’s image. It is im-
low. As a generalisation, quantitative methods portant that we remain aware of the possibility
are more tightly constrained than qualitative, of this scale effect within our research enquiry.
but qualitative approaches, too, have ways
of working that are established and, in many The filters through which we can pass our data
ways, expected. Nonetheless, there is flexibility are:
in applying research methods that we should
make use of when the circumstances of our re- tVisualisation techniques.
search require it. tStatistical description.
tStatistical testing.
t Generic and specific approaches to the anal-
There is also another way in which we should
ysis of qualitative data.
think about data analysis. Essentially what we
are doing is filtering our data though a proced- The text finishes by returning to the templates
ure in order to focus our information on our re- we can use to frame our research (and that
search problem. This idea of data analysis as a we first met in Chapter 2). Our purpose here
filter is one we should pursue a little further. A is to show, with some examples, how studies
filter acts like a sieve and we know that the size were set up as a prelude to a final section on
of our mesh determines the size of the particles developing a research programme.
Chapter contents
Introduction 397
Summary 453
Further reading 454
References 455
Chapter 10
Learning themes
t How to ‘read’ a data set. t Use a range of graphical techniques to
portray data sets with one, two and three
t How to reorganise data in order to make or more variables in order to extract infor-
the message more apparent.
mation.
t Visual displays of simple data sets.
t Describing complex data sets.
By the end of this section you will be able to:
t Manipulate data tables in order to extract
information.
Introduction
If collecting good data is a necessary start to contains is blurred, unclear or even hidden. It
producing good research, then making them is our task as researchers to release the infor-
release their information is equally important. mation and apply it to our research question.
Sometimes this is straightforward and it is ob- This is what we shall do in this section in re-
vious what the data mean. However, there lation to statistical data, both published and
are occasions when the message that the data obtained through primary research.
398 Research Methods for Education Part 3 The Process of Data Analysis
The first step in getting the information out of data is to look at them. Often, however,
the data we are faced with can be difficult to penetrate. Here we shall look at some of the
problems we might encounter and how we can overcome them.
Reading a data set is often referred to as scanning and sometimes as ‘eyeballing’. This
can have the connotation of being a less than perfect, ‘quick and dirty’ method that has
the potential to lead to incorrect conclusions. We should say at the outset, therefore, first,
that while the approaches to scanning described here may be quick, they certainly are not
dirty; second, there is a framework we can use to scan data; and third, scanning is primar-
ily a procedure to throw up things that are worthy of further interpretation.
The methods we shall look at are:
r Visual inspection.
r Shaping.
r Stripping out.
r Compressing.
r Re-ordering.
r Reconstructing.
Source: From Table 15.12 of the Iran Statistical Yearbook 1384-5 (2005-6)
400 Research Methods for Education Part 3 The Process of Data Analysis
(1998 to 2005). We can see that, year on year, the numbers of students and staff in-
creased. The message is straightforward – but even this message poses questions. In
20 years, the numbers of students has trebled; why? Iran has a young population and a
high birth rate. But this is not the whole story. Look at the numbers of male and female
students at the beginning and end of the period. Far more girls are going to school now
than in the past. Not surprisingly, the number of teachers has increased as well but not by
the same amount as the number of clerical staff. The ratio of teachers to students is not so
different at the beginning and end of the period but for clerical staff it has almost halved.
Why is this? Is it because teachers refuse to do administrative work or because the system
was under-resourced at the beginning of the period? We can only guess at the answer.
There are many tables that we, as researchers, will come across that will be just like
Table 10.1. They will reveal a message and also intrigue us by posing further questions.
For this reason inspection of data tables can often be used as the start of a research en-
quiry. However, there will also be data tables that require us to work harder before we
can understand what it is they are trying to tell us. Table 10.2, produced by the Higher
Education Statistics Agency (HESA) in the UK, is a case in point. At first sight it looks
straightforward, a simple table with two variables, the subjects that were studied and
entry qualifications. But the complexity starts to kick in when we read the title and when
we look at the entry qualifications.
First, let us consider the title. The table deals with young entrants to full-time first degree
courses from low participation neighbourhoods. The context for this table is the assumption
that education is a means of creating a more equal society and that participation in
higher education provides a route to higher value employment and social and economic
advancement. The reason for collecting the data is to evaluate the extent to which UK
universities are committing themselves to Government policy to encourage young peo-
ple from poor backgrounds to enter higher education. ‘Low participation neighbour-
hood’ is an indicator of poor background and HESA has defined these areas using small
area statistics from the 2001 national census. Low participation neighbourhoods can be
as small as a few postcode areas or may be the larger part of a local authority. This is the
information we have to hold in our head when reading the table. We are also told that
the figures are percentages, but of what? We will come back to this shortly. Let us now
look at the entry qualifications. The categories are a bit of a mish-mash because some
constitute a scale (that is, they can be arranged in a sequence from high to low or vice
versa) and others can be put in any order we like. Standard entry qualifications are sub-
jects studied at the Advanced Level General Certificate of Education (A levels) and the
Higher Grade of the Scottish Certificate of Education (Highers). Students’ performances
in these examinations are converted into points; higher numbers of points are achieved
by a combination of studying more subjects and gaining higher grades. Categories 02
to 13 constitute a scale (the number of points increases). The other categories do not
constitute a scale (like the subjects of study along the top), because no one sequence is
intrinsically better than another. Now let us return to the actual percentages. Our first
task is to work out what these are percentages of. Do they sum to 100% along the rows
or the columns? If they do, our task is a little easier because we can begin to look for pat-
terns in terms of the interaction of rows and columns – but they do not! So all they can
be is a percentage of another data set. What can this be? It is the total on which the cell
value is based. For example, the cell value in blue falls in the row for foundation and ac-
cess courses and the column for computer sciences. The actual value, 26.6, shows that, of
those who applied for a course in computer sciences and were accepted on the basis of an
access qualification, 26.6% were from low participation neighbourhoods. Because every
cell is constructed like this, the consequence is that no cell can be compared directly with
Table 10.2 Qualifications and subjects of study of young entrants to first degree courses from low participation neighbourhoods 2005–6 (after
HESA table SP6)
Percentage
Subject of study
Entry qualifications 1 2 3 5 6 7 8 9 A B C D E F G H I J All subjects
01 A-levels/Highers with
unknown tariff pts 5.1 17.2 20.0 15.8 19.3 19.8 21.3 18.3 9.2 20.9 21.5 18.1 19.6 19.1 17.6 14.1 19.3 21.3 17.7
02 Tariff pts up to 100 18.0 18.1 9.5 20.3 17.7 19.4 15.8 14.1 19.3 19.8 16.4 15.0 19.7 17.9 19.4 17.5 19.4 18.0
03 Tariff pts 101–160 19.9 18.9 4.5 16.5 21.2 20.8 14.5 16.6 20.9 22.8 15.4 20.3 20.7 19.5 16.0 21.1 27.9 18.5
04 Tariff pts 161–200 16.3 19.7 5.5 18.6 19.5 15.8 13.4 12.7 18.1 21.1 12.9 16.2 18.2 19.6 15.8 19.3 15.6 16.7
05 Tariff pts 201–230 17.1 17.3 11.0 17.6 14.8 18.4 13.8 10.8 17.5 23.2 15.3 15.7 18.0 15.5 14.7 17.4 16.2 16.5
06 Tariff pts 231–260 4.7 15.5 16.6 9.4 14.3 13.1 16.1 10.6 10.3 15.6 18.3 13.3 15.7 16.2 16.0 13.5 17.2 24.1 15.0
07 Tariff pts 261–290 8.5 13.1 14.6 6.0 18.2 12.8 16.1 12.4 9.1 14.0 21.2 12.5 11.8 13.1 10.9 12.5 15.5 19.0 13.8
08 Tariff pts 291–320 13.8 13.2 15.0 5.4 14.0 17.1 14.5 11.0 9.3 13.4 18.6 11.8 10.6 12.5 12.8 12.3 15.8 14.4 13.3
09 Tariff pts 321–350 11.2 12.1 11.9 6.9 10.7 14.0 13.9 11.1 11.0 11.6 15.0 10.3 12.0 11.5 11.5 10.0 16.8 16.0 11.8
10 Tariff pts 351–380 7.6 10.8 11.2 11.5 9.5 12.8 15.6 10.5 6.2 10.0 13.2 10.5 13.0 9.1 8.6 10.6 17.6 16.1 10.7
11 Tariff pts 381–420 6.8 9.7 10.0 12.5 10.4 11.7 14.3 9.0 8.2 7.7 11.9 11.0 8.0 7.8 7.8 9.2 17.6 12.1 9.4
12 Tariff pts 421–480 6.2 10.3 10.4 9.7 9.3 10.1 13.2 7.0 6.8 7.3 11.6 10.5 8.1 8.8 7.5 9.3 14.2 8.9 9.0
13 Tariff pts 4811 7.9 14.3 8.3 14.8 6.9 9.4 7.9 8.6 6.9 7.6 10.6 8.5 8.9 8.0 6.6 9.3 13.9 12.4 8.5
14 VCE or GNVQ only 26.6 21.4 32.4 24.3 19.3 9.7 30.4 24.6 18.1 17.5 25.5 27.6 28.7 22.2
15 VCE or GNVQ and
A-levels or Highers 17.6 16.6 17.9 8.5 18.3 19.9 19.7 14.7 10.8 19.1 19.2 15.8 18.6 16.7 18.7 17.7 20.0 26.5 17.5
16 Baccalaureate 6.5 12.6 9.8 9.4 10.9 8.2 9.7 6.9 7.9 11.9 17.4 5.4 6.0 8.3 8.4
17 Foundation or access
course 27.7 20.3 17.4 26.6 17.1 18.4 20.7 21.0 17.7 9.9 18.9 14.2 12.2 17.2 14.3
18 BTEC, ONC, SCOTVEC
or equivalent 18.7 23.5 14.7 32.6 23.6 21.8 18.7 19.9 24.4 21.2 23.2 31.4 29.6 20.5 23.9 25.7 21.6
▲
Chapter 10 EXTRACTING THE INFORMATION FROM STATISTICAL DATA 401
Table 10.2 (continued)
402
Percentage
Subject of study
Entry qualifications 1 2 3 5 6 7 8 9 A B C D E F G H I J All subjects
19 Higher education
qualification 14.3 23.7 24.7 16.6 25.1 17.6 23.7 21.7 20.6 27.2 26.1 22.9 22.4 16.0 27.7 20.2 22.9 27.7 22.7
20 No previous qualification 31.8 18.9 18.6 21.8
21 Other qualifications not
given elsewhere 27.7 17.3 7.3 11.2 20.6 13.2 15.4 25.3 18.4 17.8 14.2 18.6 13.4 24.8 21.0 18.7 19.2
22 Unknown qualification 0.0 27.8 20.3 20.9 18.6 27.9 21.2 21.5 19.5 20.0 22.5 13.2 19.1 22.4 12.9 21.2 32.8 20.9
All qualifications 7.3 14.8 14.9 9.5 13.1 12.6 19.0 12.6 10.5 13.8 16.3 14.3 14.8 11.5 11.0 14.6 18.4 19.9 14.0
Percentages based on populations of less than 20 have been suppressed and are shown as a blank cell.
Source: HESA
Chapter 10 EXTRACTING THE INFORMATION FROM STATISTICAL DATA 403
Table 10.3 Percentage of young entrants to full-time first degree courses from low
participation neighbourhoods by subject
another cell because there is no guarantee that the denominator used to calculate the
percentages is the same. For the purposes of comparison, however, we can look at the
column and row totals.
The column totals represent the proportion of young entrants from low participation
neighbourhoods entering each subject area in 2005/06. Table 10.3 ranks these column
totals from low to high. The question for us as researchers is whether there is any pattern
here and, if there is, whether there is any meaning to it. If we divide the table into two,
the nine subjects with the highest percentage from low participation neighbourhoods
and the nine with the highest, then:
r lowest percentages: four of the nine subject areas are sciences and mathematics
(medicine, mathematics, physical sciences and engineering), two are humanities
(history and philosophy and languages) and two are vocational (agriculture and
architecture);
r highest percentages: two are broadly vocational (subjects allied to medicine and law),
two are sciences (biology and computer science) and the remainder are general areas
of study (though with some preparation for working life).
There is possibly an explanation behind this pattern. Can you think what it might be?
Begin by thinking about the target group, young people from low participation neigh-
bourhoods and then imagine yourself in that situation. What is the likelihood that you
will have good grades in mathematics and physics? Would you gravitate more to an aca-
demic subject or to one that offered you some sort of career future?
404 Research Methods for Education Part 3 The Process of Data Analysis
Where does this leave our interpretation of the cell values? In a bit of a limbo. We can
try to see if there are problems in relation to column or row totals but if there are, they
are none too clear and inspection will not reveal them. But let us reflect on how we have
got this far. We can summarise what we did as a series of rules.
1. Read the title of the table closely. Identify the variables and relate them to column
and row headings.
2. Understand all the terms in the title and the context that has led to them being ana-
lysed in the table. Think about educational, social and economic situations.
3. Understand how variables are structured in rows and columns. In Table 10.3 there
is only one variable each side. With some tables we can find up to four vari-
ables on each side, so we have to be able to read the hierarchy in which they are
presented.
4. Do the variables constitute a series or are they independent categories that could be
presented in any order?
5. Look to see if there are column or row totals. Do the cell values sum to one of them?
6. Is there any apparent pattern to the cell values? If not, is there a pattern if you re-
arrange the table by row or column totals?
Activity 10.1 gives you the opportunity to practise this type of analysis yourself.
Imagine our research is into interagency working in supporting children. The con-
text of our work, in the UK, is the ‘Every Child Matters’ policy. This seeks to ensure
that the agencies that have an interest in or responsibility for children’s welfare act in
a co-ordinated and integrated way and that there are no cracks between them that chil-
dren could fall through. The policy expects agencies to work together to resolve such
problems as truanting, the welfare of children in the family, bullying and youth crimi-
nality. Different agencies have different responsibilities in these and other issues but to
resolve the issues they need to collaborate. Imagine the position from the viewpoint of
a head teacher. There is a need to maintain contacts with children’s social services, the
youth justice system, the police, health service agencies, youth employment and advice
agencies – and these are just the main ones. Now imagine that the head teacher needs
to do this for every area in which the pupils live. From this perspective, interagency
working becomes much more complex. But how big is the problem? Table 10.5(a),
which shows pupil movements across education authority boundaries in London, may
give us some answers but, as it stands, it is far too complex. With 33 administrative
units (local education authorities) the matrix gives over 1,000 potential pieces of data.
We need to focus on the core information – just how many pupils are educated in
the borough in which they live? By answering this question, we will get an idea how
many agencies in different authorities head teachers will have to become involved
with. Fortunately these data are easy to access; they lie on the diagonal but even here
it is difficult to read, so we have to extract it (Table 10.5(b)). Alphabetical listing helps
us find the position for any authority but is not useful when ‘reading’ the data. For this,
we should arrange the data as an ascending or descending series (Table 10.5(c)). Now
we have some sort of answer to our question about the scale of the issue. For some
authorities it is much more of a problem than others. The head teachers in Kensington
and Chelsea (an education authority in London) have over half of their pupils living in
another authority but those in Tower Hamlets (another authority) have about 5%. Are
more head teachers like the ones in Kensington and Chelsea or like Tower Hamlets?
We cannot answer this directly but, on balance, because about 60% of the authorities
have between 30% and 55% of their pupils living outside the authority, we can say the
issue of cross-boundary liaison is quite significant. And why is this important? Look
Benchmarking one’s own performance against others in specific subjects. The focus of the test is usually
is an accepted way of improving standards in busi- reading, mathematics and science. PISA tests were
ness performance. It is now found in education with conducted in 2000, 2003 and 2006 and are sched-
the construction of league tables of schools and uni- uled to be repeated in 2009, 2012 and 2015. You can
versities. League tables, however, are only as good read more about PISA (and even attempt some of the
as the data used to compute them and many tables tests) on the website: www.oecd.org./pisa.
are heavily value-based. There is, however, a bench- Your task is to make sense of the table below, which
marking process that is well-conceived and whose is taken from Table 5.4 of the 2006 survey. The letters
data is robust. PISA, the Programme for International in the table give reasons for missing data. What pat-
Student Assessment, is operated through the Organi- terns do you detect (a) in the performance of students
sation for European Co-operation and Development (look at the differences between public and private
(OECD). The approach is to administer a standard test schools) and (b) in the non-availability of data? You
to young people aged 15 (that is, to students at the can read what the OECD statisticians said on page 230
end of the period of compulsory schooling in most of the 2006 report (available from the website).
states) to evaluate their knowledge of and capability
▲
406
Percentage of students and student performance on the science, reading and mathematics scales, by type of school
Results based on reports from school principals and reported proportionate to the number of 15-year-olds enrolled in the school
Government-independent
private school
Government or public schools (Schools which receive less than
(Schools which are directly controlled or managed Government-dependent private schools 50% of their core funding –
by: i) a public education authority or agency, or ii) (Schools which receive 50% or more of their funding that supports the basic
a government agency directly or a governing body, core funding – funding that supports the basic educational services of the
most of whose members are either appointed by a educational services of the institution – from institution – from government
public authority or elected by public franchise) government agencies) agencies)
Performance
Performance on the Performance on the on the
Science Reading Mathematics Science Reading Mathemat-
scale scale scale scale scale ics scale Science scale
Percentage Mean Mean Percentage Mean Mean Percentage
of students score score Mean score of students score score Mean score of students Mean score
OECD
Australia w w w w w w w w w w
Austria 90.7 511 491 506 8.4 503 479 488 0.9 c
Belgium w w w w w w w w w w
Canada 93.0 532 524 524 4.3 578 578 582 2.7 c
Czech Republic 96.2 514 482 510 3.5 492 481 487 0.2 c
Denmark 76.1 492 493 511 22.8 507 499 521 1.1 c
Finland 97.6 564 547 549 2.4 c c c 0.0 a
France w w w w w w w w w w
Research Methods for Education Part 3 The Process of Data Analysis
Germany 94.3 514 494 502 5.5 555 535 544 0.2 c
Greece 94.9 469 455 455 0.0 a a a 5.1 545
Hungary 84.2 500 478 485 13.1 533 516 529 2.7 c
Iceland 98.9 490 484 505 1.0 c c c 0.1 c
Ireland 41.8 488 494 483 54.8 519 530 511 3.4 558
Italy 96.4 476 469 462 1.2 c c c 2.4 c
Japan 70.1 537 501 528 1.0 c c c 28.9 526
Korea 53.7 524 554 549 31.5 505 543 527 14.8 552
Luxembourg 85.6 490 481 495 14.4 465 470 460 0.0 a
Mexico 89.7 402 402 398 0.0 c c c 10.3 455
Netherlands 33.0 524 505 526 67.0 527 509 534 0.0 a
New Zealand 95.5 527 518 519 0.0 a a a 4.5 603
Norway 98.1 484 482 488 1.9 c c c 0.0 a
Poland 98.4 497 507 495 1.0 c c c 0.6 c
Portugal 91.1 471 469 463 6.9 484 470 479 2.1 c
Slovak Republic 92.3 487 465 491 7.2 506 484 507 0.5 c
Spain 65.3 475 446 466 24.6 503 482 495 10.1 537
▲
Sweden 91.7 501 504 501 8.3 531 539 522 0.0 a
Switzerland 95.5 511 499 530 0.9 c c c 3.6 513
Turkey 99.5 424 447 423 0.0 a a a 0.5 c
United Kingdom 93.8 510 492 492 0.2 c c c 6.0 597
United States 92.6 485 m 470 0.8 c m c 6.6 554
Partners
Argentina 67.5 364 342 354 24.8 441 430 430 7.7 447
Azerbaijan 99.1 382 351 475 0.0 a a a 0.9 c
Brazil 92.4 375 378 353 0.0 a a a 7.6 482
Bulgaria m m m m m m m m m m
Chile 46.9 409 412 385 44.9 447 453 418 8.2 514
Colombia 82.7 379 378 361 5.1 431 440 415 12.3 412
Croatia 98.6 494 478 467 0.6 c c c 0.7 c
Estonia 98.1 531 500 514 1.4 c c c 0.6 c
Hong Kong 7.5 570 562 575 90.7 541 535 546 1.9 c
China
Indonesia 60.7 403 403 404 13.5 352 344 341 25.8 389
Israel 73.4 449 435 438 20.3 471 457 461 6.3 475
Jordan 80.6 410 390 373 1.3 c c c 18.1 468
Kyrgyzstan 99.4 320 283 308 0.0 a a a 0.6 c
Latvia 100.0 490 479 486 0.0 a a a 0.0 a
Liechtenstein c c c c c c c c c c
Lithuania 99.3 487 469 485 0.7 c c c 0.0 a
Macao-China 3.8 463 453 473 68.5 504 487 517 27.6 535
Montenegro 99.8 412 393 400 0.0 a a a 0.2 c
Qatar 91.1 338 301 304 0.1 c c c 8.8 434
Romania 100.0 418 396 415 0.0 a a a 0.0 a
Russian Federa- 100.0 479 440 476 0.0 a a a 0.0 a
tion
Serbia 99.4 436 401 436 0.6 c c c 0.0 a
Slovenia 97.7 517 493 503 2.3 c c c 0.1 c
Chinese Taipei 65.0 549 509 567 0.0 a a a 35 501
Thailand 83.5 422 417 418 6.1 389 385 379 10.5 433
Tunisia 98.2 388 384 368 1.8 c c c 0.0 a
Uruguay 84.9 416 397 414 0.0 a a a 15.1 496
Note: Values that are statistically significant are indicated in bold (see Annexe A3).
a The category does not apply in the country concerned. Data are therefore missing.
c There are too few observations to provide reliable estimates (i.e. there are fewer than 30 students or less than 3% of students for this cell or too few schools for
valid inferences).
m Data are not available. These data were collected but subsequently removed from the publication for technical reasons.
w Data have been withdrawn at the request of the country concerned.
x Data are included in another category or column of the table.
Chapter 10 EXTRACTING THE INFORMATION FROM STATISTICAL DATA 407
408 Research Methods for Education Part 3 The Process of Data Analysis
2004
at it from the perspective of a head teacher first. With pupils from five authorities, the
head teacher will have to deal with five sets of children’s services, five sets of youth
justice boards, five sets of police services (and many more local policing teams) and so
on. And each of the other agencies will have to deal with schools and childcare teams
in more than their own borough. It is a system that, unless managed well, has the po-
tential to fail many children.
Stripping out data, like other methods of reorganising data, can also leave us with new
questions. In this case, is there any pattern in the authorities at the bottom and at the
top? If we were to explore this further, we would want to know whether authorities with
higher proportions of incoming pupils:
r were surrounded by authorities with few schools on their borders;
r had higher levels of pupil attainment than surrounding authorities;
r had a similar socio-economic profile to surrounding authorities;
r had a large proportion of children attending private schools.
Table 10.4(b) By year, by activity and by age (excerpt from table)
Other Other
types of types of
ballgames ballgames Gymnastic, Weight
Football Handball by groups Badminton Tennis individual Swimming aerobic Athletics Dance Spinning training
7–9 years 2004 34.9 17.9 1.9 9.4 1.9 0.5 41.5 26.9 0.9 11.3 0.5 0.9
10–12 years 2004 39.8 15.9 5.5 19 8 4.2 34.9 16.6 2.8 9.3 0.3 1.4
13–15 years 2004 27 13.5 5.1 21.9 9.6 2.8 16.3 11.2 3.9 10.7 0.6 6.7
7–9 years boy 2004 49.6 10.3 2.6 11.1 1.7 0.9 41.9 22.2 0 4.3 0 0.9
10–12 years boy 2004 53.2 16.5 5.8 25.2 9.4 6.5 37.4 10.8 2.2 2.9 0 1.4
13–15 years boy 2004 36.6 8.5 6.1 26.8 12.2 3.7 15.9 4.9 3.7 3.7 0 6.1
7–9 years girl 2004 16.8 27.4 1.1 7.4 2.1 0 41.1 32.6 2.1 20 1.1 1.1
10–12 years girl 2004 27.3 15.3 5.3 13.3 6.7 2 32.7 22 3.3 15.3 0.7 1.3
13–15 years girl 2004 17.9 17.9 4.2 17.9 7.4 2.1 16.8 16.8 4.2 16.8 1.1 7.4
Unknown 2004 60 20 0 0 0 0 40 20 20 40 0 20
7–9 years 2004 4.2 1.4 11.3 9.4 0.5 0.5 14.6 6.6
10–12 years 2004 4.8 4.8 8.7 11.4 1 1.7 13.5 10
13–15 years 2004 3.9 6.2 9 5.1 1.7 3.9 7.3 14.6
7–9 years boy 2004 6 0.9 11.1 2.6 0 0 13.7 7.7
10–12 years boy 2004 7.2 5 10.8 0.7 1.4 2.9 11.5 9.4
13–15 years boy 2004 4.9 8.5 11 0 3.7 7.3 8.5 17.1
7–9 years girl 2004 2.1 2.1 11.6 17.9 1.1 1.1 15.8 5.3
10–12 years girl 2004 2.7 4.7 6.7 21.3 0.7 0.7 15.3 10.7
13–15 years girl 2004 3.2 4.2 7.4 9.5 0 1.1 6.3 12.6
Unknown 2004 0 0 0 0 0 0 0 20
Chapter 10 EXTRACTING THE INFORMATION FROM STATISTICAL DATA 409
410
2004 Football 34.9 39.8 27 49.6 53.2 36.6 16.8 27.3 17.9 60
Handball 17.9 15.9 13.5 10.3 16.5 8.5 27.4 15.3 17.9 20
Other types of 1.9 5.5 5.1 2.6 5.8 6.1 1.1 5.3 4.2 0
ballgames by groups
Badminton 9.4 19 21.9 11.1 25.2 26.8 7.4 13.3 17.9 0
Tennis 1.9 8 9.6 1.7 9.4 12.2 2.1 6.7 7.4 0
Other types of 0.5 4.2 2.8 0.9 6.5 3.7 0 2 2.1 0
ballgames individual
Swimming 41.5 34.9 16.3 41.9 37.4 15.9 41.1 32.7 16.8 40
Research Methods for Education Part 3 The Process of Data Analysis
Gymnastic, aerobic 26.9 16.6 11.2 22.2 10.8 4.9 32.6 22 16.8 20
Athletics 0.9 2.8 3.9 0 2.2 3.7 2.1 3.3 4.2 20
Dance 11.3 9.3 10.7 4.3 2.9 3.7 20 15.3 16.8 40
Chapter 10 EXTRACTING THE INFORMATION FROM STATISTICAL DATA 411
Table 10.5(a) Cross border movement of secondary1 school pupils2,3 resident in London, 20074
percentage of pupils
Wandsworth
Westminster
Southwark
Greenwich
Lewisham
School LA
Islington
Lambeth
Hackney
Camden
Resident LA
City of London 3.9 1.3 5.2 2.6 3.9 1.3 37.7 35.1 6.5
202 Camden 71.2 – 0.1 0.8 9.4 0.7 0.3 – 0.1 10.7
203 Greenwich – 71.4 – – – 0.1 2.3 0.6 0.1 – 0.1
204 Hackney 2.0 – 55.9 0.1 14.6 0.1 0.3 – 0.2 13.5 – 1.2 –
205 Hammersmith and Fulham 0.5 – 51.2 7.2 0.1 – – 8.9 1.4
206 Islington 17.1 – 2.5 0.7 63.9 1.0 1.3 0.4 0.4 – 1.9 –
207 Kensington and Chelsea 1.2 23.6 – 45.4 0.4 0.1 – 5.4 10.7 0.1
208 Lambeth 0.1 0.2 0.1 0.6 – 1.0 49.2 0.7 13.5 – 14.6 5.6
209 Lewisham 11.3 – 0.1 – 0.7 69.0 5.5 0.1 0.1 0.3 –
210 Southwark 0.1 1.6 – 0.2 0.1 0.2 6.5 8.4 74.7 0.3 0.7 4.1 –
211 Tower Hamlets 0.1 0.8 1.3 0.1 0.2 0.1 0.2 – 1.2 94.2 – 0.4 0.1
212 Wandsworth – – – 5.5 – 0.8 2.9 0.1 0.4 72.0 0.7
213 Westminster 2.2 4.3 0.6 10.2 1.5 0.3 1.5 71.5 –
301 Barking and Dagenham – – 0.2 – 0.1 – 0.1 0.7 – 0.1 84.2
302 Barnet 2.3 – 0.1 0.1 0.1 – – – 0.4
303 Bexley 4.8 – – – – 0.2 0.3 – – –
304 Brent 3.1 – 1.7 – 1.2 – – – – 0.1 3.3
305 Bromley – 0.7 0.3 0.3 1.8 0.3 – –
306 Croydon – – – 0.1 – – 2.2 0.2 0.4 – 0.9 0.2
307 Ealing – – 2.2 – 0.9 – 0.1 0.2
308 Enfield 0.4 – 0.6 – 0.5 – – 0.1 – 0.1 –
309 Haringey 2.0 – 2.8 0.1 2.3 0.1 – – 0.1 0.3 –
310 Harrow – 0.7 0.2 – 0.1
311 Havering – – – – – 0.1 – 3.1
312 Hillingdon – 0.4 0.3 – – – –
313 Hounslow – 0.8 – 0.2 – – –
314 Kingston upon Thames 0.4 – – 0.5 –
315 Merton – – 1.1 0.1 1.5 – 0.2 8.1 0.2 –
316 Newham 0.1 – 0.2 – 0.1 – – – 0.1 1.9 – 0.1 0.6
317 Redbridge – – 0.1 – – – 0.6 – 2.4
318 Richmond upon Thames 2.6 0.4 – 0.3 –
319 Sutton 0.1 – 0.1 – 0.5 –
320 Waltham Forest 0.3 0.7 – 0.2 – – – – 0.3 0.1 0.1
1 Includes: maintained mainstream secondary schools (including middle deemed), City Technology Colleges and
Academies.
2 Includes: solely registered and main registration of dually registered pupils. Excludes pupils reported to be boarders.
3 Aged 11–15 as at 31 August 2006.
4 Excludes: unmatched records (i.e. with missing or invalid postcodes).
(x suppressed data) (. not applicable)
▲
Source: DFES
412 Research Methods for Education Part 3 The Process of Data Analysis
percentage of pupils
Outside London
Waltham Forest
Hillingdon
Redbridge
Hounslow
Haringey
Havering
Newham
Croydon
Bromley
Harrow
Merton
Enfield
Sutton
Bexley
Ealing
Brent
1.3 1.3
1.6 0.1 – 0.1 0.3 0.2 – – – – – 0.1
18.8 – 5.5 – – – 0.1 – – 0.8
– 0.2 – – 1.7 5.7 0.1 – – 0.6 0.3 2.2 0.6
1.9 – 5.7 – – 0.1 0.1 14.6 0.3 0.7 6.9 – 0.1 0.1
0.1 0.1 0.1 1.2 4.4 – – – – 1.7
4.8 0.1 1.7 0.1 – 0.2 4.3 0.1 0.3 1.4 – 0.1
– – 0.5 9.2 – – 0.1 – 0.1 3.1 – 0.1 1.0 – 0.1
1.2 – 8.7 2.0 – – – – – – – – 0.2 – 0.6
0.1 – 0.7 1.8 – – – – – 0.1 – – 0.1 – 0.1
– – – – – 0.1 0.2 0.6 0.1 0.1 0.1
– 0.9 – 0.1 – 0.4 3.1 6.9 4.7 1.2 0.1
5.5 – 0.4 – – 0.2 0.8 – – 0.1 – – 0.1
– – 0.1 8.3 2.4 3.4 0.1 0.4
5.9 – 3.6 2.8 1.2 – 0.4 – 3.1
86.0 4.3 – – – – – – – 4.1
74.0 – 2.8 – – 4.7 0.5 0.3 – – – – – 0.5
3.1 84.3 4.9 – – 0.3 4.0
– 4.6 76.5 – – – – – – – 1.2 8.6 4.8
1.7 77.9 – – 1.1 5.0 8.9 0.3 – 0.1 – 1.3
0.3 – 86.8 2.9 – – 0.1 – 0.2 2.5
0.4 – – 13.1 74.7 – – 0.1 0.4 0.4
8.3 – 1.1 – – 72.3 6.2 – – – – – 5.5
– – 87.9 0.3 0.9 0.1 7.4
0.2 – 3.9 0.7 85.0 2.8 – – – 0.1 – – 6.3
– – 2.0 – – 0.6 76.8 1.1 – 9.8 – 8.4
– 0.1 – 0.5 77.0 3.1 6.9 3.4 7.9
– – 3.4 – – – 4.7 67.1 – 0.1 12.1 1.1
– – – – – 0.1 0.9 – 92.3 2.1 – 1.1 0.3
– – 0.1 0.1 1.5 – 2.3 86.0 – 0.8 5.9
– – 0.1 – 0.1 12.0 3.7 1.3 75.9 – 3.5
– 3.9 3.1 3.5 – 80.7 7.9
– – 1.1 0.5 0.3 1.4 5.7 88.1 1.0
Chapter 10 EXTRACTING THE INFORMATION FROM STATISTICAL DATA 413
In summary, stripping out data may answer an immediate question, but it is likely to
highlight other questions that need to be answered.
pattern is the starting point of data exploration. The problem, as we have seen already,
is that the balance between ‘message’ and ‘noise’ in the data often favours noise. This
occurs because there is a relationship between the visibility of the message and the
scale at which the data are presented. What do we mean by scale? Imagine that we
have pupil attainment data for all the pupils in an area. If we look at a data file of
individual cases there is so much detail that we will not be able to see any pattern.
What we do is compress it and, depending on how we compress it, different patterns
will emerge.
r If we put them into classes (0–20, 21–40, 41–60, etc. where the values are attainment
scores) and count the number of pupils in each class, then we might see a pattern (we
would expect to see a curve revealing that there are more average pupils than high or
low attaining ones. If this were not the case then we should be extremely suspicious
of the tests we gave the children in the first place).
r If we compressed the data by putting them into postcode classes, we might find that
different postcodes had very different patterns of pupil attainment. However, if the
classes or postcodes are too small, the pattern may still not be visible, so we have to
combine classes or postcodes still further until we can read the message clearly. Put-
ting it another way, the categories or classes we choose and their size are the lenses
through which we look at the data.
Table 10.6 shows this process at work. Table 10.6(a) shows the number of schools and
the number of pupils in the Netherlands by the type of school in 1994/95 and 2004/05.
As it stands, the table is difficult to read. We can understand what it should be telling
us but its message is not clear. However, when we consolidate the non-public schools
Table 10.6(a) Schools and pupils funded by the state in the Netherlands
Note: In the Netherlands there are wholly state funded public schools and state subsidized private and religious
foundation schools.
Source: Statistics Netherlands
Chapter 10 EXTRACTING THE INFORMATION FROM STATISTICAL DATA 415
Table 10.6(b) Schools and pupils funded by the state in the Netherlands with the categories consolidated
and primary and secondary schools, as in Table 10.6(b), we can begin to understand
something of what the data wants to tell us. First, the number of schools in both catego-
ries has fallen in the ten years since 1994/05. Second, in both the number of students has
risen. We can draw one conclusion from this and raise a number of questions. We must
conclude that school sizes in both categories have increased. But what are the questions
in the data?
r We might expect pupil numbers to increase because of the natural increase in popula-
tion but has the increase been the same in both categories? There are two percentages
we can calculate that will give us the answer.
r First, we can calculate the percentage by which each category has increased:
State-funded public schools
Clearly the rate of increase in student numbers is greater in the religious and
private schools.
r Second, we can calculate each category’s share of the pupil total in each year:
State-funded religious and private schools
Table 10.7 Expenditure per child in US dollars on early childhood education in OECD
countries in 2004
Australia — Australia —
Austria 6,106 Mexico 1,794
Belgium 4,915 Korea 2,520
Canada — Slovak Republic 2,575
Czech Republic 3,178 Czech Republic 3,178
Denmark 5,323 Switzerland 3,581
Finland 4,282 Japan 3,945
France 4,938 Poland 4,045
Germany 5,489 Hungary 4,231
Greece — Finland 4,282
Hungary 4,231 Norway 4,327
Iceland 6,114 Sweden 4,417
Ireland 4,948 Portugal 4,461
Italy 5,971 Spain 4,617
Japan 3,945 OECD average 4,741
Korea 2,520 Belgium 4,915
Luxembourg — France 4,938
Mexico 1,794 Ireland 4,948
Netherlands 5,807 New Zealand 5,112
New Zealand 5,112 Denmark 5,323
Norway 4,327 Germany 5,489
Poland 4,045 Netherlands 5,807
Portugal 4,461 Italy 5,971
Slovak Republic 2,575 Austria 6,106
Spain 4,617 Iceland 6,114
Sweden 4,417 United States 7,896
Switzerland 3,581 United Kingdom 7,924
Turkey — Canada —
United Kingdom 7,924 Greece —
United States 7,896 Luxembourg —
Turkey —
turning an alphabetical listing into a table where countries are ranked on the basis of
their expenditure makes things much clearer (Table 10.7 column (b)). What can we pick
out as the main messages?
r The difference between the lowest rank (Mexico) and the highest is over $6,000. That
is, the UK spends four times as much on early childcare education as Mexico.
r 10 of the 25 countries for which we have data spend plus or minus $500 of the OECD
average on early years education. In other words, this is a fairly balanced distribution.
There is another indicator of this; can you see it? Look at the number of countries
outside a spend of plus or minus $500 of the OECD average. This idea of a balanced
distribution could be important if we wished to conduct a more advanced statistical
analysis (see Chapter 13 for more on this).
While there is a relationship between how rich a country is and its position in the
ranking (Mexico and Korea have lower per capita incomes than the USA or UK), it is not
the only one. How do we explain Switzerland and Japan spending less than $4,000 when
other countries spend more? Why do Belgium, France and Ireland spend about $3,000
less than the UK and why are the UK and USA so much higher than other countries?
Before we reach any conclusions, we have to be sure that the basis on which the data are
collected are the same. If they are then we have to look for the answers in the political
priorities behind the spending.
Ranking is not the only way to re-order data. Any way that creates a pattern that can be
interpreted is useful. In most cases the criteria we can use will be known beforehand. All
that is necessary is that we ‘attach’ the criteria to the observation of data group. Table 10.8
shows an example of this. The table provides data on the proportion of pupils in each Lon-
don borough that is eligible for free school meals. This measure is a well-used measure to
indicate deprivation and disadvantage. The first two columns show the percentage of pupils
in each borough eligible for free school meals. If the table were just these two columns, it
would be difficult to read; there is too much information for us to see any pattern. So how
can we reorganise our data? The obvious way is in geographical terms. All we need do is cre-
ate a two-fold classification, inner and outer London. In this table we have used the official
categorisation of boroughs but even if there were not one, we could create our own. Our first
task is to tag each borough as inner or outer London (columns 3 and 4). We then pick out
each group separately and rank them in terms of the proportion eligible for free school meals
(columns 5 and 6, 7 and 8). Immediately the association between geographical location and
eligibility for free school meals is apparent, from under 10% in Kingston and Richmond
(next to each other in south west London) to over 50% in Tower Hamlets (the old dock area
just east of the City of London and the traditional first stop for immigrants).
Many of the data investigations we can carry out will show a spatial pattern. However,
just because we have shown a spatial element to the pattern does not mean that we have
found a cause. Our arguments have to be more sophisticated than that. Space itself is
neutral, it is how we value and use location that is important. The factors that we are
looking for are often associated with access – but accessibility is much more than just
transport. We have to think about access to job opportunities, to well-qualified teachers,
to welfare and social services. So we have to dig deeper and understand the processes at
work that create the pattern. In our case free school meals are used as a proxy for relative
poverty but poverty does not have a single cause. Some people may not wish to enter the
labour market, others may want to but may be discriminated against and yet others may
be unable to because they are unable to speak the language or because they are prevented
by legislation. The advice is always to go beyond the first pattern, particularly if it is geo-
graphical in character, and see if other factors are at work.
418 Research Methods for Education Part 3 The Process of Data Analysis
Table 10.8 Nursery and primary schools pupils in London Boroughs eligible for free school meals,
January 2006
% of Outer Inner
pupils London London Outer London Inner London boroughs
London borough eligible borough borough boroughs ranked ranked
The Index number for the total number of pupils in 1993/04 is:
466,605
IN 5 3 100 5 100.7
463,309
Index numbers are a means of comparing different series on a standardised basis. In
each case the base year is assumed to be 100. If we look at columns 5 and 6 in Table 10.9
we can immediately see differences in the rate of increase for the total number of pupils
and those receiving additional training and a bilingual education. The increase for the
latter is much greater – over the entire period about four times as great.
If we look at the rate of increase in the numbers receiving additional training in
Norwegian, we have to start with 1997/98 as our base year should always be the same.
What we can see here is that the growth in pupils with additional training in Norwegian
grew at the fastest rate (column 9) compared with those receiving a bilingual education
and the total number of pupils.
The index numbers we calculated are straightforward. We can also use the concept
to combine information from different data sets. For example, if we wanted to bring
together measures of children’s welfare into an index of well-being, we could draw on
(depending on the country) a range of health, social, economic and educational data.
What we might actually use and how we might combine them depends on how we
define ‘well-being’. A Professor of Sociology, Kenneth Land, led a team that attempted
this for the USA (Land et al., 2001). They defined seven dimensions of well-being: social
relationships, material well-being, health, safety, educational attainment, participation
in community and emotional and spiritual well-being (see Table 10.10). For each they
identified a number of indicators, 28 in all. The judgements they had to make were
(i) which data sets to use, (ii) what weighting to give to the different dimensions and
(iii) how to combine different measurement scales. The task is not difficult but does
420 Research Methods for Education Part 3 The Process of Data Analysis
Table 10.9 Pupils in Norwegian schools with training in their native language and in Norwegian
1 2 3 4 5 6 7 8 9
Domain Indicators
require a rationale for the decisions. Using this index Land and his colleagues were able
to show that, overall, the well-being of children and young people in the USA was no
better in 1998 than it was in 1975. It would also be feasible to construct one for admin-
istrative areas in the UK. Activity 10.2 is an opportunity to attempt this.
Another approach to reading data and seeing the messages contained within them is to
transform the data from numbers into a visual representation. Why should we do this?
Well, there is considerable evidence that we can assimilate and make sense of visual im-
ages more easily than number blocks. It is a process we use in our everyday lives. The
patterns we identify (or recognise) are bigger than the elements that comprise them.
Numbers only have a relationship to other numbers up or down a numerical scale; when
reorganised to present a visual stimulus we can judge them in terms of their proximity
(clustering), shape, colour and categorisation, amongst other things.
This is why data visualisation is so important in data analysis. The surprising thing is
that so many texts present it as the ‘nursery slopes’ of data analysis, aimed particularly
422 Research Methods for Education Part 3 The Process of Data Analysis
at readers unable to manage quantitative analysis. There are two main reasons why we
should dismiss this position. First, graphical methods are very useful in an initial explor-
ation of the data. If a pattern is identified, it can help us determine our next steps, for
example, an assessment of the significance of relationships between the variables. The
second reason is equally important. Graphical methods have high levels of communica-
bility, even to non-experts. Graphical presentations get our message over far more effec-
tively than mathematical statements. We should, therefore, treat data visualisation with
the seriousness it deserves.
r Second, there are two peaks, at 520 and 490 (statistically we say that this pattern is
‘bi-modal’). This pattern is of interest to us because it may be significant in research
terms. There are two things that might create this pattern. The countries with stems of
52 may have something in common and those with stems of 49 may have something
in common but the things they may have in common are different for the two groups.
Here the difference would be in the underlying processes that produced the data –
mathematics teaching in these countries. However, the pattern could also be produced
by the samples of pupils who took the test not being broadly similar in each country.
Here the difference would be the result of the way in which the data were collected.
Table 10.11 Mean scores in student performance on the mathematics scale, PISA 2006
OECD Partners
Australia 520 Argentina 381
Austria 505 Azerbaijan 476
Belgium 520
Brazil 370
Canada 527
Bulgaria 413
Czech Republic 510
Denmark 513 Chile 411
Finland 548 Colombia 370
France 496 Croatia 467
Germany 504 Estonia 515
Greece 459
Hong Kong – China 547
Hungary 491
Indonesia 391
Iceland 506
Ireland 501 Israel 442
Italy 462 Jordan 384
Japan 523 Kyrgyzstan 311
Korea 547 Latvia 486
Luxembourg 490
Liechtenstein 525
Mexico 406
Lithuania 486
Netherlands 531
New Zealand 522 Macao – China 525
Norway 490 Montenegro 399
Poland 495 Qatar 318
Portugal 466 Romania 415
Slovak Republic 492
Russian Federation 476
Spain 480
Serbia 435
Sweden 502
Switzerland 530 Slovenia 504
Turkey 424 Chinese Taipei 549
United Kingdom 495 Thailand 417
United States 474 Tunisia 365
OECD total 4 484 Uruguay 427
OECD average 498
54 9 7 7 8 549 7 7 8
53 0 1 53 0 1
52 7 0 0 3 2 5 5 52 7 0 0 3 2 5 5
51 0 3 5 51 0 3 5
50 4 5 6 1 2 4 50 4 5 6 1 2 4
49 1 6 0 0 5 2 5 49 1 6 0 0 5 2 5
48 6 6 0 486 6 0
47 6 6 4 47 6 6 4
46 6 2 7 46 6 2 7
45 9 459
44 2 442
43 5 435
42 7 4 427 4
41 7 5 1 3 417 5 1 3
40 6 406
39 9 1 399 1
38 4 1 384 1
37 0 0 37 0 0
36 5 365
35 35
34 34
33 33
32 32
31 8 1 318 1
r Third, the distribution is elongated. Statistically, this is called a ‘long tail’. This could
indicate either that the countries with low scores tested less able pupils or, more
likely, that the school system is less capable of developing the mathematical skills of
the students.
We have used the stem and leaf approach to establish the shape of the data set and
to identify its key features. This has enabled us to pose further questions and to suggest
reasons for the patterns. It has been easy to do this and it has been quick. There are, how-
ever, questions that we can still put to this data. In Table 10.12 column (b), the countries
are divided into two groups, OECD members and partners. Is there any pattern amongst
these? It is easy to test this by differentiating the two groups in terms of colour. What is
our conclusion? Well, more of the tail consists of OECD partners but partners are also
represented at the top end as well. What we might conclude from this is that partnership
itself is not a factor in explaining the pattern, though level of economic development
may be. The important thing to note here is that we have used the stem-leaf visualisation
to help us ask questions that take our analysis forward.
Chapter 10 EXTRACTING THE INFORMATION FROM STATISTICAL DATA 425
3,000.0
2,500.0
2,000.0
000 children
1,500.0
1,000.0
500.0
2000
2003
2006
0.0
Organised Selected other Computer
sports activities activities
Number (‘000)
2000 2003 2006
separate graphs showing gender differences. The lessons from this are that (a) visual
communication is important and (b) we should manipulate our data to produce the
best visual effects.
For those without access to these programs, it is possible to produce histograms on-
line. Figure 10.3 shows a histogram of the average of scores in mathematics for students
in different countries. This is the same data used in the stem and leaf diagram above. This
is an attractive analysis because not only does it show the shape of the distribution (the
dominance of high scores), it also allows us to see which countries scored what, thereby
giving us an insight into another dimension of analysis (the level of economic develop-
ment and the score). Figure 10.3 was produced on IBM’s Many Eyes system (www.many-
eyes.com). Registration is required.
A very special type of histogram is the age–sex pyramid. This is used to portray the
gender composition of a country or area for each age group. Figure 10.4 shows a pop-
ulation pyramid for the Netherlands. It follows the convention that (i) the age group
is shown on the vertical axis, (ii) gender is on the horizontal axis and (iii) females
Chapter 10 EXTRACTING THE INFORMATION FROM STATISTICAL DATA 427
2000 Female
1,200.0
2003 Male
000 children 1,000.0 2003 Female
2006 Male
800.0
2006 Female
600.0
400.0
200.0
0.0
Organised Other Computer
sport activities activities
2000 Male 2000 Female 2003 Male 2003 Female 2006 Male 2006 Female
are on the right and males on the left. This diagram is taken from the website of Statis-
tics Netherlands. Like some other countries, Statistics Netherlands has an interactive
viewer that allows users to move backwards and forwards in time. Figure 10.4 shows
the projected demographic structure of the Netherlands in 2030. What it reveals is a
typical mature population structure with a low birth rate and a low death rate (popu-
lation only starts to decline rapidly from the age of 70). To understand the bulges
between 28 and 40 and between 56 and 65, we have to go back to the conditions at
the times those people were born. The end of the Second World War was 65 years
ago and in many countries birth rates grew rapidly with the removal of that threat.
The bulge between 28 and 40 was the period when those born after the war had their
children.
Another useful source of national demographic structures is the US Census Bureau. On
its site, we can access demographic data for most countries of the world. Figure 10.5(a)
and (b) show population pyramids for Germany in 1991 and 2006 and (c) shows India
in 2006. These have been chosen to show particular features of population pyramids.
Germany: 1991
(a) MALE FEMALE
100+
95–99
90–94
85–89
80–84
75–79
70–74
65–69
60–64
55–59
50–54
45–49
40–44
35–39
30–34
25–29
20–24
15–19
10–14
5–9
0–4
4.0 3.5 3.0 2.5 2.0 1.5 1.0 0.5 0.0 0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0
Population (in millions)
Germany: 2006
(b) MALE FEMALE
100+
95–99
90–94
85–89
80–84
75–79
70–74
65–69
60–64
55–59
50–54
45–49
40–44
35–39
30–34
25–29
20–24
15–19
10–14
5–9
0–4
4.0 3.5 3.0 2.5 2.0 1.5 1.0 0.5 0.0 0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0
Population (in millions)
India: 2006
(c) MALE FEMALE
100+
95–99
90–94
85–89
80–84
75–79
70–74
65–69
60–64
55–59
50–54
45–49
40–44
35–39
30–34
25–29
20–24
15–19
10–14
5–9
0–4
70 60 50 40 30 20 10 0 0 10 20 30 40 50 60 70
Population (in millions)
r The pyramids for Germany and India for 2006 are characteristic shapes for two
types of society, developed and developing. The ‘beehive’ shape for Germany
shows a mature population with a low birth rate and death rate. The triangular
shape of India’s pyramid shows a youthful population with a high birth rate and
high death rate.
r Comparison of population structures over time will tell us something about the dy-
namics of the population. Those in the 0–4 category in Germany in 1991 were in the
15–19 group in 2006. If we compare the sizes of the bars for the two years, we see
that the sizes of both male and female groups have increased. There can be only one
cause, in-migration.
r Pyramids allow us to look back in time. The fall in birth rate between the 35–39
and 30–34 age groups in Germany (2006 pyramid) should make us ask the ques-
tion, ‘What happened in the early 1970s?’ Was it economic decline in the old Ger-
man Democratic Republic or changing attitudes as a result of the introduction of
the pill?
r In Germany in both 1991 and 2006, there are more males born than females.
This is as nature intended. If we look at the raw data, for every 100 females in the
0–4 age group, there are 105 males. In India, there is an imbalance between males
and females, but it is more pronounced. The data show that for every 100 females
there are 110 males. The question we should ask is ‘Where did the females go?’ And
the answer is not very palatable. They were born and then they were killed.
The examples of population pyramids we have looked at so far have been downloaded
from the Internet. We can even do this for local authorities in England and Wales via the
Census website (http://www.ons.gov.uk/ons/interactive/vp2-2011-census-comparator/
index.html) and for Scotland via SCROL, which has a mapping facility (Scotland’s
Census Results OnLine at www.scotlandcensus.gov.uk). Often, as researchers, we have
to look at smaller areas such as civil wards or areas we define using small area statistics.
In these cases, we have to create our own pyramids. Activity 10.3 is an opportunity to do
this and it gives us an idea how we can use this method of portraying data in educational
research.
The table below gives demographic data on the age/ r Is the population over 55 going to be the same,
sex structure of two areas in England, East Dorset and increase or decrease? If the planning rate of home
Luton. Your task is to construct population pyramids care provision is 6.5 per 1,000, what is your esti-
in one of the following ways: mate of the number of home carers for the over
65s in both locations in 10, 15 and 20 years’ time?
r by hand How many vocational training places would you
r using a spreadsheet by combining separate tables provide and when?
for males and females r Can you explain the differences in the two pyra-
r using an online service (such as Statisticum, www. mids? Would your explanation make any differ-
statisticum.org). ence to your estimates of school and vocational
Then answer the following questions: training places?
set that shows us the wealth of people in a country, we can produce a graph showing
what proportion of people own what proportion of the wealth. An even distribution
of resources will produce a graph that is a straight line – 10% of the people have 10%
of the resources, 20% have 20%, 30% have 30% and so on. Unequal distributions are
shown as a cumulative frequency curve that deviates from this straight line and the
greater the deviation the greater the inequality. For example, in Slovakia, the richest
20% of the population earn about 32% of the income. In the UK, the proportion is
about 43%. Measuring inequality is a specialist area with its own techniques which
432 Research Methods for Education Part 3 The Process of Data Analysis
are beyond the remit of this text, but the pattern of a small proportion having a lot (or
being responsible for a majority of the problems) is found extensively in social data.
We can find references to it as the 80:20 rule, though this is more a metaphor rather
than the actual ratio. Recognising imbalance in distributions is important for social
researchers.
How do we actually construct cumulative frequency graphs? The method is straight-
forward.
r First, arrange the classes in sequence from low to high (or vice versa) and lay out the
data as a table which shows the number of units of data in each class.
r Second, add the data in the second class to those in the first. Note the result. Then
add the data in the first and second classes to those in the third and continue to
the end.
The procedure is shown in the tables in Figure 10.6. The data in the table is from
the grades awarded students in a whole school year shown in the chart. The grades
are broken into 10 classes and the numbers of students in each class are shown in the
second column. The third column shows the number of students up to the grade. Thus
one student gained a mark 0 to 10 and two students 0 to 20, and so on. Figure 10.6(a)
shows the histogram for the data and Figure 10.6(b) the cumulative frequency curve
for the data. This type of graph is also called an ogive (possibly from an arabic word
80
70
Number of students
60
50
40
30
20
10
0
10
20
30
40
50
60
70
80
90
0
10
to
to
to
to
to
to
to
to
to
to
0
11
21
31
41
51
61
71
81
91
Marks
Series 1
0–10 1 1
11–20 1 2
21–30 4 6
31–40 12 18
41–50 49 67
51–60 68 135
61–70 32 167
71–80 18 185
81–90 15 200
91–100 12 212
250
200
Number of students
150
100
50
0
10
20
30
40
50
60
70
80
90
0
10
to
to
to
to
to
to
to
to
to
to
0
11
21
31
41
51
61
71
81
91
Marks
Series 1
meaning ‘arch’ which describes the shape of a particular cumulative distribution). The
ogive shown in Figure 10.6(b) gives the amount of data in any class that is less than the
total (for instance 135 students have marks between 0 and 60). It is also perfectly rea-
sonable to construct an ogive from the total and create a curve that, at any point, shows
the amount of data that is more than the lowest quantity (we just sum from high to
low – in our example, 77 students scored a mark higher than 60). A particularly use-
ful feature of ogives is that we can convert the data in the distribution to percentages
and show the proportion of the data set up to a class. This is useful when we compare
data sets.
Ogives are shown as continuous lines because the data (though broken into classes)
is continuous. We assume, for example, that there is a gradual progression in the num-
ber of students 0 and 100. Most spreadsheet percentages will allow the construction
of ogives. Figures 10.6(a) and (b) were constructed using the spreadsheet in MS Excel.
Activity 10.4 uses ogives in a very typical way with examination grade distributions. The
data are taken from a UK examination board and refer to examinations (A levels) taken
by students at the age of 17 or 18. The table in Activity 10.4 shows the grades obtained
(A is the best and U the poorest) for three subjects at different times. Constructing ogives
may only be a staging point in our analysis. If there are no differences in the curves there
is nothing to explore but if there are, then they should be (a) assessed to see whether they
are large enough to be important and (b) explored to see if they can be explained. This is
something we shall look at in Chapter 13.
The data below show the grades awarded to students r Is there any evidence that assessment distributions
in three subjects (Business Studies, English Literature differ between the 2006 and 2007 assessments? If
and Mathematics) at two sittings. Use the data to con- there is, does it matter?
struct ogives and to answer the following questions. How would you assess whether different examina-
r Is there any evidence that different subjects have tion boards appeared to have the same standards in
different assessment distributions? Could this imply assessing the same subjects?
different standards of assessment or different cul-
tures?
2007 January Business Studies 9.6 41.7 69.1 90.4 97 100 230
2006 May/June Business Studies 17.8 43.3 70.2 89 97.5 100 22,577
2007 January English Literature 30.8 50 88.5 96.2 100 100 52
2006 May/June English Literature 24.2 48 72.6 91.5 98.8 100 8,482
2007 January Mathematics 22.2 53.3 75.4 88.6 96.4 100 167
2006 May/June Mathematics 35.3 56.6 75.1 87.9 96.6 100 9,137
proportion of pupils in different class sizes. The production of a pie chart by hand is
straightforward. All we have to do is convert our raw data into percentages of the total
and then use these percentages to define the sizes of segments of a circle. Table 10.13
sets out the calculation. The data shows the composition by grades of academic staff in
English universities in 2006/07. There are five categories of staff: professors, senior lec-
turers and researchers, lecturers, researchers and other staff. For each category we know
the number of staff and the sum of these, 169,005, is the total number of academic
staff employed by English universities. Our next step is to convert the number in each
category as a percentage of the total. Thus 16,485 professors are 9.70% of the 169,995
Chapter 10 EXTRACTING THE INFORMATION FROM STATISTICAL DATA 435
total number of academic staff. We then use these percentages to define a segment of a
circle: 9.7% of 360° is 35°. Using a protractor, we then mark off the number of degrees
around a circle we have already drawn and draw lines from the centre of the circle to the
circumference.
There is, however, an easier way to construct pie charts by using the graphics op-
tion in a spreadsheet. Figures 10.7(a), 10.7(b) and 10.7(c) were drawn in just this way.
These data were selected to show the strength as well as the weaknesses of pie charts as a
method of extracting the information from data. Their strength is that they immediately
show dominant categories, in this case white and other ethnic groups. The weaknesses
are that:
r we lose precision – we can only hazard a guess at the dominance of white students
(about 80% of the total) and our estimates for other groups are likely to be wide of
the mark;
r we lose the ability to determine the relative importance of categories as the number
of categories increases – and the ethnic composition shown here is a very crude one;
Other
Other Asian
Chinese
Asian/British Asian
Black/Black British
White
Other
Other Asian
Chinese
Asian/British Asian
Black/Black British
White
Other
Other Asian
Chinese
Asian/British Asian
Black/Black British
White
r it is difficult to compare one pie chart with another – are there more Asian/British
Asian men or women undertaking full-time research?
r pie charts cannot deal with complex data sets (that is gender and ethnicity at the
same time). With three sets of data, we produced three pie charts. If we wanted also
to look at students enrolled on taught postgraduate and undergraduate courses we
would construct another six pie charts. With nine pie charts to compare, there is just
too much information to take in.
So when should we use pie charts? The answer is, as little as possible, because they
are less effective than other methods (such as bar charts) and certainly they should be
avoided when the number of categories is greater than five or when we are presenting
more than four sets of data.
14
12
10
0
1983 1985 1987 1989 1991 1993 1995 1997 1999 2001 2003 2005
% Female labour force
% Total labour force
% Male labour force
1983 1984 1985 1986 1987 1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005
Unemployment,
female (% of
female labour
force) 13 14 13 13 14 12 12 11 10 7 8 8 9 8 7 6 4 4 3 4 4 5 6
Unemployment,
male (% of male
labour force) 14 15 13 12 7 7 6 5 5 4 5 6 6 5 4 3 3 2 2 3 4 5 5
Unemployment,
total (% of total
labour force) 14 14 13 12 10 9 8 7 7 6 6 7 7 6 6 4 4 3 3 3 4 5 5
Chapter 10 EXTRACTING THE INFORMATION FROM STATISTICAL DATA 437
From the line graph, we can immediately pick up the main messages. Over most of
the period there has been a substantial fall in unemployment. Over most of the period,
female unemployment has been higher than male. While the lines tend to follow each
other, there was a period in the mid to late 1980s when the trend in male and female
rates was divergent.
Line graphs are good at communicating information. They point us towards things
that can or should be explained. Their weakness is the extent to which our interpre-
tation can be manipulated. Figure 10.9 shows us the problem. Here there is an im-
plicit test of the assumption that more highly educated people are responsible for
the growth in Internet use. However, the graph shows the number of Internet users
mushrooming while the number of graduates is a flat line (it’s there at the bottom of
the graph!). The reality is that the number of graduates increased by 50% in just seven
years. The problem arises (a) because the vertical scale is configured to the number of
Internet users and (b) because of the scale difference in the two data sets. The number
of Internet users is, for instance, between 80 and 120 times the size of the number of
graduates in a year. Often we meet the reverse of this – steep slopes on our line graphs
that result from the exaggeration of the vertical scale, something that often results
from a section of the scale above 0 being cut out. The only solution in our case is to
produce two graphs.
16,000,000
14,000,000
12,000,000
10,000,000
8,000,000
6,000,000
4,000,000
2,000,000
0
1999 2000 2001 2002 2003 2004 2005 2006
Graduates
Internet users
Figure 10.9 Growth in numbers of graduates and Internet users, the Netherlands 1999–2006
Chapter 10 EXTRACTING THE INFORMATION FROM STATISTICAL DATA 439
In summary, line graphs are effective communicators of patterns and trends. Several
graphs can be plotted on the same axes for effective comparative assessment. While we
should concentrate on ‘reading the line’, we should not ignore the axes. Manipulation of
these can lead us into mistaken interpretation.
(vi) Scatterplots
Scatterplots (also known as ‘scattergraphs’) also deal with bivariate data, data sets
with two variables. Unlike line graphs, however, where there is only one quantity
of variable (b) at any point of variable (a) (or vice versa), scatterplots can have
more than one quantity of any variable for the other. If this sounds confusing, then
it becomes clear when we construct the scatterplots. Figures 10.10(a), 10.10(b)
20
% cautioned or convicted
15
Outlier?
10
0
2 4 6 8 10 12 14 16 18
% misusing substances
Figure 10.10(a) Scatterplot: outcomes of looked after children, London boroughs 2007 (criminal
activity and substance abuse)
70
60
50
% 5GCS Es A*–G
Outlier?
40
30
20
10
0
2 4 6 8 10 12 14 16 18
% abusing substances
Figure 10.10(b) Scatterplot: outcomes of looked after children, London boroughs 2007
(education and substance abuse)
440 Research Methods for Education Part 3 The Process of Data Analysis
70
60
50
% 5GCS Es A*–G
40
30
20
10
0
0 2 4 6 8 10 12 14 16 18 20
% cautioned or convivted
% identified as having
misused substances % cautioned or convicted % with 5 GCSEs
in year during the year A* to G
9.6 7 52
0.0 0 –
6.8 6 49
4.4 6 33
9.9 19 50
16.9 6 38
8.3 15 63
4.7 2 41
11.0 5 40
5.8 6 48
4.9 7 36
5.2 7 41
4.3 11 44
7.9 7 33
11.0 11 59
8.6 11 50
3.3 4 24
5.2 8 41
3.2 4 35
7.0 8 43
5.6 8 44
7.4 8 28
9.3 14 53
6.2 13 39
and 10.10(c) show plots for data on children looked after by a selection of Lon-
don boroughs in 2007. ‘Looked after children’ are those placed in local authority
care. There is concern about this group because of the fear that the absence of a
stable home environment can be a platform for under-achievement and deviant
behaviour. Column 1 (p.441) in the data table for the scatterplots shows the pro-
portion that misused substances, Column 2 the proportion cautioned or convicted
and Column 3 the proportion with five or more GCSEs at grades A* to G (the final
assessment stage of the UK National Curriculum). Figures 10.10(a), (b) and (c)
show pairs of data plotted against each other. Again a spreadsheet was used to pro-
cess the data.
(a) shows the per cent cautioned or convicted against percentage misusing sub-
stances.
(b) the per cent with five GCSEs against the per cent abusing substances.
(c) the per cent with five GCSEs against the per cent cautioned or convicted.
In each case there appears to be an association between the pairs of variables, with
low scores on one variable being matched by low (or lower in terms of the spread of
the distribution) on the other. There is clearly a relationship here between all combi-
nations of data pairs. Because an increase in one is associated with an increase in the
other, we call this a positive relationship. If the reverse had been the case, we would
have called it a negative relationship. We may not be surprised at the relationship be-
tween police convictions and cautions and substance misuse but the fact that there is a
positive relationship between each of these and educational outcomes is more intrigu-
ing. We can see now why it is called a scatterplot. For each, there is a general trend but
no single line that will join up the points. Some points form a cluster, others are more
isolated. If a point is ‘by itself’, we call this an outlier. Possible outliers are shown in
(a) and (b).
Identifying outliers is an important stage in exploring data. They could show errors in
initial measurement or data recording. If the data are accurate, then they are even more
interesting. In Figure 10.10(a), for example, why is the percentage misusing substances
in the outlier borough four to five times greater than the borough with the smallest
percentage (connected by the blue line). But, as with all graphs, we have to be careful
because the scales of the axes can exaggerate differences and lead us into unreasonable
interpretations. For this reason, if we are using IT-based visualisation, it is always safest
to test the data on another system, just to make sure that there is no error in data process-
ing by a program.
When would we use a scatterplot?
r As an initial exploration of data to see if there were any patterns that might suggest
relationships that should be explored further. For instance, we would be interest-
ed in a plot of data that took the form of a curve or fell along a straight line. In this
case, we might then go on to a correlation or regression analysis (see Chapters 12
and 13).
r To differentiate between the usual (a cluster) and the unusual (an outlier) and to in-
vestigate both because both can have an important message. The usual shows us what
is happening in the majority of cases and the unusual shows us what is happening
in extreme cases – and, as we have seen in Figure 10.10, outliers are an indication of
things being done well or disastrously!
442 Research Methods for Education Part 3 The Process of Data Analysis
ng
isusi % GC
% M ances SEs A
t
+ subs *–G
+
+ +
+
% cautioned or
+ + convicted
s A*–G
+ + +
+ +
+ +
+
+ + + + + +
+ +
++ +
+ + ++ + +
+ + +
+ + +
+ + +
+ +
+ g
sin s +
isu nce
M a
% cautioned or % bst
su
convicted
(a) %G (b)
CSE
sA
+ *–G
+
% Misusing
substances
+ +
+ +
+ + ++++ ++
++
++ +
+ +
d or
u t ione d
a
% c nvicte
co
(c)
20
% cautioned or convicted
15 03
0.3
0 4
0.4
10 0.5
.
0.7
0.9
0 9
0.1
5 0.2
0 2
0.6
0 6
0 0.1
0 1
5 10 15
% Misusing substances
views are necessary because some may not show the pattern clearly. Figure 10.11(c)
shows clustering influenced by the variables less clearly than (a) or (b). The program
also produces ‘maps’ of the two variable distributions called density plots. These
show the concentration and intensity of data points. The data points are interpolated
by isopleths (contours). Figure 10.12 is a density plot of Figure 10.10(a). It plots
the per cent misusing substances on the horizontal axis and the per cent cautioned
or convicted on the vertical axis. It shows an ‘island’ with a ridge running in a south
east–north west direction. The elongated shape of the island indicates that as misuse
increases so do cautions and convictions. But what about the ridge? Imagine a sec-
tion along the length of the line in Figure 10.12. It would show a steep rise up to
the top of the ridge and then a more gentle slope down. The core of the relationship
between the two variables is the peak of the distribution, where the shading gives
way to lighter tones. This is where the strongest relationship between drug misuse
and convictions is. The pattern of spread around the core is shown by the outer isop-
leths and as we move away from peak of the ridge, the influence of the core relation-
ship weakens and other influences, which may be individual or random, become
stronger.
What three-dimensional plots offer us is new insight into and perspectives on the
patterns that may exist in our data. They reinforce the fact that much research is about
looking for order. What we have seen so far is how we can look for order in simple data
sets but our research can be more complex than this. When this happens, our visualisa-
tion must become more sophisticated.
Teachers
Years worked
White Mixed Black Asian Other White Mixed Black Asian Other
M F M F M F M F M F M F M F M F M F M F
White Mixed Black Asian Other White Mixed Black Asian Other
M M M F M F M F M F M F M F M F M F M F
the data are telling them. We shall see some of these techniques in Chapter 13 but
it is important to point out that advances in computing power are producing data
visualisation methods that can give us insights into the message of complex data.
This section is not at the cutting edge of visualisation but it is in advance of where
education and much social science research currently sits. This much is apparent
from the fact that the number of papers in education journals that make use of these
methods is miniscule.
The types of data set we are likely to encounter will either be hierarchically organised
or assemblies of simple data sets. Hierarchically organised data have multiple catego-
risations. For example, teachers can have worked for one year, two to five years, six to
ten years or over ten years and can be classified by ethnicity and by gender. This type of
structure is shown in Figure 10.13. This is quite a simple hierarchical structure of three
levels but even with only the highest level of ethnic categorisation used in the census in
England and Wales, it still creates a table with 40 cells, which is a lot of data to manage.
Let us now see how we can visualise complex data sets.
ten or 20 years. What we have to do first is create the profile dimensions. How many
should there be? If the number is too few, we lose richness in our description; if it is
too many we are swamped by detail and lose the ability to discriminate. Generally
researchers seem to use between 8 and 12 profile dimensions. These dimensions are
then plotted as radii at equal intervals around a point. If there are 12 profiles, they
are plotted at 30° intervals and if there are 8, they are plotted at 45° intervals. The
score on a profile dimension is represented by the length of a line from the central
point. When all lines have been drawn, the end points are joined together to show
a shape.
These are the principles underpinning the star plot. Fortunately, we do not have to
draw them by hand because most spreadsheet programs will do it for us. The effective-
ness of the method can be seen in Figures 10.14(a) and 10.14(b), which profiles two
areas in England, Penwith in Cornwall and Richmond in London. The difference be-
tween the two areas can clearly be seen in the different shapes that are produced. The
value of this method is apparent when we produce plots for many areas. In this case we
can use shape to classify areas into the same group. Classification is very important in
the research process.
Penwith Richmond
6 strong
Employer obligations
5 moderate
Unemployment Rate 3
Proportion White 2
Urban Stress**
Affluence*
Population
1 weak investing
1 2 3 4 5 6 7
High Low Low High
Employee obligations
Med Med Med Med
(b)
Low High
High Low
(a)
pected – good behaviour, appropriate work rates, after school activities. The faces in
Figure 10.15(b) reflect aspects that are better in the relationship between employer
and employee with a happier appearance and things that are worse with a sadder ap-
pearance. The sad appearances show an intention to move employment quickly, low
levels of job satisfaction, poor health, irritability. The reverse is the case with the hap-
pier faces. This example shows that the technique can handle not just social, economic
and demographic data, but also emotions, attitudes, hopes and expectations, all of
which are key dimensions to understanding such things as achievement and under-
achievement, teacher burnout and parent satisfaction with the education their child
or children receive.
The construction of Chernoff faces has been held back by inadequate computing and
mapping capability but with the expansion of the visualisation of large data sets we can
expect data analysis programs to build in the Chernoff facility. We will then be able to
use them to handle and categorise complex data sets that are beyond the limits of star
plots.
then be plotted with another variable or variables. We could, for example, construct
a matrix chart to portray quality in higher education institutions. One axis would
be the institutions (nominal), the other would be quality criteria (categorical) and
for each criterion at each university we could put a symbol to represent the qu-
ality score (interval if we put the raw score or ordinal if we use the rank score for the
institution).
Simple forms of matrix chart can be constructed in MS Excel but it is not a straight-
forward procedure. IBM’s Many Eyes is a more sophisticated system and has the ad-
vantage that different combinations of variables can be displayed (that is, different
combinations can be tested). Figure 10.16 shows a matrix chart of data from Mexico
using data from David Gallardo of the Consejo Nacional de Evaluacion de la Desar-
rollo Social in Mexico. The chart shows three variables; the level of social disadvan-
tage on the vertical axis, location on the horizontal axis and home ownership as the
bubble size. The controls at the bottom of the display allow different variables to
be brought into the analysis. It is this feature that shows us how the method can be
used – as a means of exploring data and looking for associations as well as a method
for communicating our interpretations. In Gallardo’s example, we can see that the
administrative units are separated out by level of social disadvantage and that the
bubbles show the proportion of home ownership. The bubbles are all the same col-
our. We can change this to colour code the level of disadvantage or, more usefully,
an overall index of disadvantage. But home ownership is not the only variable we
can analyse. We can also look at the average number of occupants in each house
and we can change the bubbles from the actual value to the percentage each is of the
row total.
The matrix chart is primarily a means of exploring data to see what messages it has for
us. It should be used in the early stages of an investigative process. It does reveal pattern
but in many cases we can bring out the significance of the pattern better with numerical
techniques.
Chapter 10 EXTRACTING THE INFORMATION FROM STATISTICAL DATA 449
300,000
250,000
Number of students
200,000
150,000
QUALITY C
AL ITY B
AL Y D
A
E
ITY
100,000
IT Y
QU ALIT
Q AL
QU
QU
U
50,000
0
2006/07 2005/06 2004/05 2003/04 2002/03
(a) Stacked chart (b) Metroglyph
of disability, (b) the number with a disability between 1994 and 2005 where data
is available and (c) those who dropped out and those who completed a diploma
course. The power of the technology is shown when we look at Figure 10.18(b). The
data is now reorganised to show the pattern by year. Note at the top of the display,
‘year’ has been dragged across to make it the prime variable. We can also change
those who gained a certificate to those who gained a diploma by altering the right
hand bottom tab. If we wanted to see the pattern of disabled students who gained
a qualification, we could set the bottom left tab to ‘certificate’. As well, the program
allows us to pass over our data cells and reveal the data on which they are based and
also to zoom in on areas. Tree maps are clearly an acceptable and powerful tool for
visualisation.
We have looked at visualisation methods that will handle one, two or three or more
variables.
r For single variables (age or ethnicity or assessment grades, for example) we can use
stem and leaf diagrams, bar charts, histograms, and cumulative frequency plots
(ogives).
r For pairs of variables (ethnicity paired with assessment grades or obesity paired with
family income or teacher recruitment over time), we can use a scatterplot or a line
graph.
r For three or more variables (ethnicity, gender and assessment), we can use three-dimen-
sional graphs, star plots, Chernoff faces, matrix charts, concept maps and tree maps.
In terms of how we are going to use the methods, all can be used to portray data and
all can be used to explore data but some are better at one thing than the other.
r For data exploration, stem and leaf diagrams, scatterplots and all the multi-dimen-
sional methods are particularly appropriate.
r For portraying suppositions or conclusions, bar graphs, histograms, line graphs and
scatterplots are all effective. Pie charts are better at portraying data than exploring
data but are not particularly effective for either.
Activity 10.5 is an opportunity to develop your skills in data analysis and visualisation.
452 Research Methods for Education Part 3 The Process of Data Analysis
A. Complex data tables r In which areas is the literacy rate highest and in
1. You have been asked to prepare a short report which three areas is it lowest?
for your vice chancellor on aspects of your uni- r Are there areas where the male and female lit-
versity’s performance on the UK higher education eracy rates vary markedly? Match these areas
indicators. The indicators are available at www. with states that share the same characteristic.
hesa.ac.uk then follow the link under Products & r Use the chart button at the head of the table
Statistics to performance indicators. to portray the data. What do you think of the
r Identify the data structure of Tables 1a and result? Can you suggest an alternative method
1b. What is the difference between these two of visualisation?
tables in terms of the students each describes?
r Answer the question ‘Over the period 2002/03 – B. Data portrayal
2006/07 has the proportion of students from low The table below gives a range of data for pupils in select-
participation neighbourhoods in London’s uni- ed schools in a London borough. Your task is to visualise
versities increased/decreased or stayed the same?’ the data (either in its entirety or by selection) to see what
2. You are researching the role of education in mod- patterns emerge. Key Stage 2 is the final stage of prima-
ernisation of the state and social and economic de- ry education (age 10/11), Key Stage 3 is assessed at age
velopment. One of the indicators that you are using 14 and Key Stage 4 at age 16. Value added is calculated
of educational progress is literacy. Visit the UNESCO on the basis of pupil data. It seeks to assess the contribu-
website and follow the literacy links to the table on tion of the school experience to a pupil’s development as
regional literacy rates. UNESCO: http://stats.uis.un- measured by academic tests. As a guide, you may want
esco.org/unesco/ReportFolders/ ReportFolders.aspx. to start by looking at each column separately, then at
r Describe the data structure of the table. In par- pairs of columns and then at combinations of three or
ticular, explain the classification of regions. more columns. This is an exercise in data exploration.
▲
Chapter 10 EXTRACTING THE INFORMATION FROM STATISTICAL DATA 453
School Column 1 Column 2 Column 3 Column 4 Column 5 Column 6 Column 7 Column 8 Column 9
Column 1 Per cent of pupils with special educational needs (SEN), both with and without statement of need.
Column 2 Per cent half days lost through absence.
Column 3 Average point score for pupils at Key Stage 3, 2004.
Column 4 Average point score for pupils at Key Stage 4, 2006.
Column 5 Value added between Key Stage 2 and Key Stage 3, 2006.
Column 6 Value added between Key Stage 2 and Key Stage 4, 2006.
Column 7 Mean verbal test score.
Column 8 Mean quantitative test score.
Column 9 Mean non-verbal test score.
Summary
Data interpretation and presentation has, for a long time, been relatively ignored as
an area of analysis. It is, however, the foundation on which sound analysis is built.
The methods, techniques and strategies that we have looked at in this section are the
starting points for a long journey of practice and adaptation; practice because using
them has to become second nature and adaptation because the approaches have to be
selected to suit the occasion and modified to match the data. What should be apparent
from this section is that the field of visualisation is moving forward rapidly. The innova-
tions seem to occur in the sciences. They have been adopted in the fields of business
and management. So far, the social sciences, including education, have lagged behind.
t Data are the researcher’s raw material. As researchers we have to extract the infor-
mation from the data, understand its meaning and apply it to our research problem.
t We cannot achieve this unless we read data correctly. Simple tables are straightfor-
ward enough, but many data, especially those from official sources, are complex.
They can have a wide range of categories and multiple levels of categories, usually
with one nesting inside others. Both of these situations create data tables whose
scale is so great that we cannot take them in in one glance.
▲
454 Research Methods for Education Part 3 The Process of Data Analysis
t When data tables are large and complex we need to adopt strategies for making
them give up their information. These strategies include:
t inspection to identify the data structure;
t shaping and rearranging tables to make them more readable;
t stripping out data that is not needed;
t compressing data by combining categories;
t re-ordering data to make comparison easier;
t reconstructing data in another form.
t Our object is to identify pattern and order in our data. We do this because we are
interested in what is usual, what is exceptional and what is related. Visual inspection
of data does not always reveal everything to us. In the past we used the techniques
we shall meet in Chapters 12 and 13. However, improvements in computing power
mean that we now have another way of looking at data, by visualising them.
t There are many ways of portraying data. We have to choose a method that reflects
the nature of our data.
t If we are dealing with one variable, we call this univariate data.
t If we are dealing with two variables, we call this bivariate data.
t If we are dealing with more than two variables, we call this multi-variate data.
t Methods suitable for univariate data include stem and leaf plots, bar graphs and
histograms, and cumulative frequency graphs or ogives.
t Bivariate data can be represented on line graphs and scatterplots. It is possible to
add a third (or more) variable by altering the form of the plot of the two principal
variables. The scatterplot then becomes a means of portraying multi-variate data.
t Other visualisations for multivariate data include:
t divided bar charts;
t star plots;
t metroglyphs and Chernoff faces (metaphor mapping);
t matrix charts;
t tree maps.
t The range of visualisation types is increasing and we should remain abreast of the
opportunities. Most of the visualisations can be performed by readily available
software.
t Visualisation is not only important for revealing the structures within the data, it is
essential as a tool for communication, our interpretation and understanding.
t As researchers, it is important that we stay abreast of development and innovation
in data visualisation because it is likely to become as important as established meth-
ods of data analysis such as the statistical tests we shall look at in Chapter 13.
Further reading
The field of data visualisation is fast moving and many visualisation, the website of the Department of Math-
books will become out of date very quickly. William ematics and Statistics at York University (Canada) has
Jacoby’s slim text, Statistical Graphics for Visualising an excellent review from the earliest maps to 2004
Data, Sage, Thousand Oaks, Calif., 1998, is a use- (http://www.maths.yorku.ca/SCS/Gallery).
ful first introduction because it explains and devel- To keep up to date, just type in the search term ‘data
ops some of the basic principles. For a history of visualisation/vizualisation’ into your search engine.
Chapter 10 EXTRACTING THE INFORMATION FROM STATISTICAL DATA 455
References
Anderson, E. (1957) ‘A Semi-Graphical Method for the Land, K., Lamb, V. and Mustello, S. (2001) ‘Child and
Analysis of Complex Problems’, Proceedings of the Youth Well-Being in the United States, 1975–98:
National Academy of Sciences, 13(3), 923–927. Some Findings and a New Index’, Social Indicators
Chernoff, H. (1973) ‘Using Faces to Represent Points Research, 56(3), 241–318.
in a K-Dimensional Space Graphically’, Journal of the Rigotti, T. and Mohr, G. (2005) ‘German Flexibility:
American Statistical Association, 68, 361–368. Loosening the Reins Without Losing Control’, Chap-
de Vyder, F. (ND) Dutch Education: A Closed or an Open ter 5 in Employment Contracts on Well-Being Among
System, OECD, available at www.oecd.org/dataoecd/ European Workers, de Cuyper, N., Isaakson, K. and
1/33/191730.pdf (accessed May 2008). de Witte, H. (eds), Ashgate, Aldershot.
Chapter contents
Introduction 457
Summary 514
Further reading 515
References 516
Chapter 11
Learning themes
t Criteria for assessing quality in qualitative By the end of this section you will:
research.
t Understand how you can ensure quality in
t A generic qualitative method for process- data analysis.
ing data.
t Appreciate the significant part coding
t The classification and coding of data – the plays in qualitative data analysis.
core of the qualitative process.
t Know how to code data and be aware of
t How to build data into coherent blocks of how technology can help you.
knowledge.
t Be aware of approaches to qualitative
t How different qualitative methodologies data analysis and understand when to use
approach data analysis. them and how to apply them.
Introduction
One of the assumptions that new research- perspective it is exactly because qualitative
ers can often make is that because qualitative data are non-numeric that qualitative analysis
data takes the form of words, objects, pictures, is more complex than quantitative. The impli-
actions and behaviours and does not require cation of this is that qualitative research can-
numerical processing, they are inherently eas- not consist of a sequence such as, ’we asked
ier and more straightforward to analyse than them and now we are telling you what they
quantitative data. This is a myth that we should told us’. This approach is naïve. Good quali-
nail immediately, not the least because it is tative research is far more sophisticated than
often associated with the assumption that this. It has to be much more than mere re-
qualitative analysis is ’easier’ than quantitative portage. Commentary that is the product of
(and the implication that it is, as a consequence, understanding the field and insights into the
less robust). Looking at this from the opposite structure of the data is an essential element.
458 Research Methods for Education Part 3 The Process of Data Analysis
consequence of this is that while there are characteristics that bring qualitative researchers
together, there are epistemological, political and social perspectives that produce debate
(and division) that is every bit as fierce as that which once existed between quantitative
and qualitative protagonists.
There is, therefore, a sense that qualitative research is more a federation of practices
than an integrated framework for conducting research. Notwithstanding this, there are
process characteristics (that is, how the research is conducted and the data analysed) that
are common to many of the approaches (even if they are sometimes given a different
name). It is these in relation to data processing that constitute the heart of this section.
the same basic criteria as used in quantitative research and remained within a positivist
paradigm’ (Spencer et al., 2003, p. 39). In other words, there are some researchers who
will be bound by procedure and criteria and others who will not because they believe
that the criteria and procedures mimic quantitative research. The approach of Spencer
and her colleagues is to accept the divisions on the issue (divisions which, by implica-
tion, will marginalise some qualitative research traditions from a policy and practice
influencing role) and classify qualitative research on the basis of its acceptance or rejec-
tion of the applicability of quality criteria. They identify the following three approaches
to qualitative research.
1. The rejection of quality criteria on the basis that, because there is no single ’correct’
way of representing reality, there can be no ’correct’ way of collecting and analysing
data. It is this category that Onwegbuzie and Daniel criticised on the basis of ’any-
thing goes’.
2. An acceptance of the concept of quality, which is to be judged in terms of the
particular character of qualitative research. We met this argument in Chapter 3
and noted there the issue of the use of different words to describe quality issues
for qualitative research and the discussion whether they represented a meaningful
distinctiveness or just a semantic difference. This is closest to Onwegbuzie and
Daniel’s position.
3. Rejection of the position that quality can be thought of in terms of the credibility of
a study and its replacement by a mix of an ethical standpoint, a political viewpoint,
a desire to improve and emancipate or a commitment to the issue and to those being
researched. In other words, the idea that quality in qualitative research is being honest
and sincere. We shall see examples of this approach when we look at ’critical’ qualitative
research. However, because the researcher is committed to a viewpoint, there are other
qualitative researchers who would not accept the findings.
Despite these different positions, as readers of research we need some assurance that the
research has been done well. The solution by the researchers from the National Centre for
Social Research is to present a framework against which research can be assessed without
constructing a strait-jacket that forces qualitative research to conform to a specific model.
The framework they developed is guided by four principles that for most researchers will
be unproblematic.
1. Research should contribute to advancing knowledge or understanding.
2. The research design should address the research question.
3. Data collection, analysis and interpretation should be rigorous, systematic and trans-
parent.
4. Research claims should be credible with plausible arguments about the significance of
the evidence.
These themes have been embedded in this text.
The research team translated these principles into 18 questions to ask of research.
Table 11.1 summarises the questions that relate to data collection and analysis. The
questions are all phrased in such a way that we can reach a judgement on how convinced
we are by a piece of research and, if we have misgivings, where their source is. For each
question, the research team identifies indicators. Not all of these have to be present,
but sufficient have to exist to enable us to have confidence. Most of the indicators (for
example, for sample design) can be assessed through the text, but some (for example,
literature review) require pre-existing specialist knowledge.
Chapter 11 EXTRACTING THE INFORMATION FROM QUALITATIVE DATA SETS 461
1. How credible are the t Can the reader can see how the researcher arrived at his/her conclusions; the
findings? ’building blocks’ of analysis and interpretation are evident.
t Do the findings/conclusions ’make sense’ and do they match other knowledge
and experience.
2. How has knowledge/ t Is there a literature review that summarises knowledge and key issues raised by
understanding been previous research.
extended by the t Are the aims and design of the study set in the context of existing knowledge?
research? t Are findings presented in a way that offers new insights?
t Is there a discussion of the limitations of evidence, what remains unknown or
what further information/research is needed?
3. Scope for drawing wider t Is there a discussion of what can be generalised to wider population and how
inference – how well is the sample was drawn/case selection made?
this explained? t Is there a description of the context of the study so that its applicability to
other settings can be assessed?
t Is there a discussion of how hypotheses/propositions/findings may relate to
wider theory and/or a consideration of rival explanations?
t Is evidence to support claims for wider inference supplied?
4. How defensible is the t Is there a rational for the study design and an explanation of how it meets the
research design? aims of the study?
t Are reasons given for different stages of research, the use of particular
methods, data sources etc?
t Is there a discussion of the limitations of research design and the implications
for the Evidence?
5. How well defended t Is there a description of study locations and how and why they were chosen?
is the sample design/ t Is there a description of the population and how the sample relates to it?
target selection of t Is there a rationale for the selection of sample/settings/documents?
cases/documents?
6. Sample composition/ t Is the profile of sample/case described?
case inclusion – how t Is there a discussion of any missing coverage?
well is the eventual t Are reasons for non-participation and non-inclusion of cases/documents given?
coverage described? t Are access and methods of approach described and how these might have
affected participation/coverage discussed?
7. How well was the data t Is there a discussion of who collected data, what procedures/documents used
collection carried out? for collection/recording and evidence of checks on origin/status/authorship of
documents?
t If interviews/discussions/conversations were not recorded, were justifiable
reasons given?
t Are the conventions for making field notes given?
t Is there a discussion of how fieldwork methods or settings may have influenced
data collection?
8. How well has the t Do we know the form of the original data?
approach to, and t Are the reasons or the of the data management method/tool/package given?
formulation of, the Are they reasonable?
analysis been conveyed? t Can we see how descriptive analytic categories or labels have been generated
and used?
t Can we see how analytic concepts/typologies have been devised and applied?
9. Contexts of data t Are background details of study sites or settings give?
sources – how well t Are participants’ perspectives placed in a personal context?
are they retained and t Are the origins/history of written documents given?
portrayed?
▲
462 Research Methods for Education Part 3 The Process of Data Analysis
10. How well has diversity t Are diversity/multiple perspectives in the evidence shown?
of perspective and con- t Are negative cases, outliers or exceptions discussed?
tent been explored?
11. How clear are the t Are there clear conceptual links between analytic commentary and the original
links between data, data?
interpretation and t Is there evidence of how/why particular interpretation/significance are assigned
conclusions? to specific aspects of data?
t Can we see how explanations/theories/conclusions were derived and how they
relate to original data
t Are cases which do not fit with the explanation considered?
12. How clear are t Are assumptions made explicit?
the assumptions/ t Are ideological perspectives/values/philosophies made explicit and is their
theoretical perspectives/ impact on the research design and choice of methods explained?
values that have shaped t Is the possibility of error or bias may explored?
the form and output of
the research?
For researchers new to qualitative research, the framework has two advantages. First,
it helps us read the research output and appreciate whether the research process is ad-
equately developed and explained to allow us to have confidence in the findings. Sec-
ond, we can turn this whole process around and see just what we have to do in order to
produce research findings that our professional community will find convincing. In the
present context, questions 8 to 12 in the table are important. If this information is to
be presented in a thesis, report or journal article, we have to be sure that our processes
are sound and our decisions with respect to data analysis accurately recorded. Just what
these processes and decisions are we shall see in sections 11.3 and 11.4. Activity 11.1 will
show you how far other researchers have met the quality test.
The questions in the quality framework in Table 11.1 edu/ssss/QR/. Read the following two papers from vol-
are generic and could, perhaps with some tweaking, ume 13, number 1:
be used for almost any style of research. For this ac- Gonzalez, L., Stallone Brown, M. and Slate, J.R. (2008)
tivity, however, you should only use it in relation to ’Teachers Who Left the Teaching Profession: A Quali-
qualitative or predominantly qualitative research. tative Understanding’, pp. 1–11.
Go to any educational research journal and select five Tercanlioglu, L. (2008) ’A Qualitative Investigation
articles that use a qualitative methodology. Their ap- of Pre-Service English as a Foreign Language (EFL) –
proaches to research need not be different and the Teacher Opinions’, pp. 137–150.
articles can be selected from a single volume. Assess
On the basis of all the academic papers that you have
the articles against questions 3, 5, 6, 7, 8 and 13. If
read, write a short paragraph to advise editors of
authors fail to address the quality indicators, is there
journals how they should strengthen their peer re-
any pattern to this failure?
view and selection processes.
As a comparison, go to the website for the Qualita-
tive Report, an on-line journal, at http://www.nova.
Chapter 11 EXTRACTING THE INFORMATION FROM QUALITATIVE DATA SETS 463
Navigating your way through qualitative research can, at times, seem like wandering
through a maze where the shape of how the paths fit together eludes us. There are no
frameworks that will happily encompass all that qualitative researchers do. We will
find one person’s methodology being used by someone else as a research strategy and
the same terms being used to describe different practices and the reality is that all the
terminology is being used legitimately. Nonetheless it is important to understand key
elements even if we cannot fit them together easily as a jigsaw. We will begin with an
overview of method that is common to much qualitative research.
Data collection
Data analysis
Data preparation
Organising data
Data interpretation
3. Organising data: a sequential procedure in which links between data units are built,
evaluated and, perhaps, rejected for the whole procedure to start again. This ’clump-
ing’ of data can take place at different levels. Basic data units are aggregated into first
level clumps and first level to second level and so on as long as the data and interpre-
tations warrant it.
4. Interpretation of data: this is a test of the meaningfulness of the basic data structure and
data aggregation. An interpretation need not be acceptable and can lead to data analysis
being repeated.
In Figure 11.1 are two boxes on either side of the analysis core. These represent influ-
ences upon the analysis and the dotted lines show that they need not be present. On the
left side is existing information about the issue, both research findings and professional
debates. On the right side is the researcher’s personal perspective. These two boxes con-
stitute context and they are present throughout the research process.
In the following sections we shall explore the stages in the core of qualitative data
analysis in more detail in order to gain insight into how the process is operationalised.
the second stage, coding (section 11.5). Text-based data (reports and other documents)
are also usually in a form we can work with immediately. There are, however, data types
that have to be transformed before we can work with them.
1. Observations: field observations and associated notes may not be in a form in which
they can be used directly. We need ordered data and when we are in the field our
observation sheets can get crumpled or covered in coffee. We can use an observa-
tion sheet to make a note of an idea that has just occurred to us. They can become
separated from our descriptions of the setting. If we are using original manuscripts,
typescripts or products, they may be too valuable to be worked upon directly. What
we have to do in these cases is some combination of the following:
t copy the original data;
t link observations with information on who was observing and details of the set-
ting;
t transcribe shorthand notes and comments into text readable by others.
2. Speech: interviews, conversations, speeches and presentations that have been cap-
tured on an audio recording medium can be analysed directly but it is more usual
for them to be transcribed into a text format. This, however, requires decisions to be
made.
t Are only the words transcribed or do we include speech fillers, such as ’umm’, ’er’
and ’ah’?
t Do we transcribe speech in dialect directly or do we transform it into something
more like standard language?
t Are we only interested in the words or is the way they are spoken also of interest?
t Are questions important? If so, they have to be recorded at the time or video record-
ing used.
Dr John Bailey, a British medical researcher who uses qualitative methods, argues
that the issues that underpin these questions are fundamentally methodological
and are concerned with how far the researcher wishes to report an authentic voice and
how completely communication (words, tone, emphasis, gestures, mannerisms and
so on) should be recorded (Bailey, 2008). If we are researching young people how
important is it that we record how they actually speak in ’street talk’ rather than in
grammatical English? If we use the latter, what information will we miss? In reaching
decisions about the character of our transcription we have to think about what type
of data we need in order to get to grips with our research issue. There is, therefore,
no single ’correct’ transcription. The fact that we have to take these decisions shows
that transcription is not just a technical matter. This highlights a further issue, that
transcription is often reduced to audio typing by a clerical assistant. While it is accept-
able to use research resources to do this, we should not imagine that the typescript
itself is the transcription; it is just the base layer onto which we, as the researchers,
have to graft other aspects of communication, such as sighs, laughter, smiling, scowl-
ing and so on. And then we have to represent the meaning of all of this. Look at
Activity 11.2.
A final point that should be made in relation to preparing all types of data for analysis
is that it may not be necessary to use all the data that we have. There can be no hard and
fast rules on this but as a guideline we go into our data knowing what our research ques-
tion is. It is legitimate not to use data that is not relevant to our question. Data selection
466 Research Methods for Education Part 3 The Process of Data Analysis
Lynne Truss’s book, Eats, Shoots & Leaves: the Zero Visit the cemetery where famous people are buried
Tolerance Approach to Punctuation, explains the daily except Thursday.
role of punctuation in conveying meaning. Look at Woman without her man is a savage.
the sentences below and see how many meanings
Dog for sale eats anything especially fond of
you can convey with a change of emphasis and use
children.
of pauses.
They were alright.
The tourist saw the astronomer with the telescope.
With thanks to the people who put these sentences
The landlord painted all the walls with cracks.
on their websites.
is an important part of data preparation and, as a guide, we should look at and review all
the data we have as a first step. Selecting out data is important in resource terms (allow
three to six hours to transcribe an hour’s worth of recording) but there is a downside.
The benefit of a researcher personally transcribing everything is that it is an important
engagement with the data. Insights into data structure and meaning are rarely instanta-
neous, they usually require time to develop.
from a team led by Professor Brendan McKay of the Minister, Yitzhak Rabin and even the death of Princess
Australian National University. This paper set out a Diana, are predicted. Moby Dick may be a classic novel
set of technical reasons why the earlier results were but the author was no soothsayer. Case proven!
spurious. In an earlier test, Professor McKay was able If you want to know more about false messages, look
to show that War and Peace gave just as good results. at Professor Mckay’s webpage: http://cs.Anu.Edu.
He followed this up by demonstrating that, in Moby Au/~bdm/dilugim/. If you want to hear the songs sung
Dick, the assassinations of the Indian Prime Minis- backwards: http://www.Thepeek.Com and search for
ter, Indira Gandhi, the American Presidents, John F. ’hidden messages’. Remember the lesson, we can often
Kennedy and Abraham Lincoln, the Israeli Prime find what we are looking for in our data. Stay sceptical.
The manual* gives definitions and illustrations of all The interpersonal process coding framework is typical
of these situations. of pre-determined codes in that it has to strike a bal-
3. The coding of Affect is derived from observation of ance between recording detail and representing broad
facial expressions, voice tone, and body language. behaviours. The benefits of using such a coding sys-
The six categories are: tem are that someone has done the hard work for us
1. happy and the method has usually been tried and tested. The
2. caring choice for us as researchers is often between building
3. neutral something bespoke ourselves or walking into a coding
4. distress salesroom and getting something off the shelf.
5. aversive
6. sad. *Rusby. J., Estes, A. and Dishion, J. (1991) The Interpersonal
Process Code (IPC), Oregon Social Learning Centre, Eugene,
Oregon (available at http://www.oslc.org/resources/codemanu-
als/interpersonalprocesscode.pdf).
2. We can devise a coding system before our analysis based on theory and existing
knowledge. This is sometimes referred to as concept mapping. Effectively, what we are
doing is constructing a schematic model that shows relationships between variables
and then using our data to test it. Read Case Study 11.3.
3. Third, we can allow the coding structure to emerge from our data. This is where the
’mystery’ is and we shall consider it in more detail below.
Chapter 11 EXTRACTING THE INFORMATION FROM QUALITATIVE DATA SETS 469
bear in mind, though, that we can move from descriptive to interpretive coding. For
example, a study of children’s play on housing estates began with observations of
play (using a pre-existing coding framework that recorded age, gender, play activity
and play environment) and concluded by inferring some of the drivers for selection
of play locations such as safety and preferring to congregate with other children and
be part of a social group (Wheway and Millward, 1997).
t How many codes should there be? There is no answer to this. At first we should not
restrict ourselves to a particular number on the grounds that, at a later stage we can
always get rid of some. At that later stage, if we are coding manually, it can be difficult
to cope with many more than about 50 codes. And the danger of using computer
technology is that, at the time, it can appear to be easier to create another code rather
than think more deeply about what is being said and whether an existing code is
applicable. Methodological literature has tried to help us by creating the concept of
’saturation’ (a point at which coding has extracted maximum ’goodness’ out of the
data) but it is rarely explained or illustrated. Glen Bowen, an American qualitative
researcher, developed a rule of thumb that ’a data category was considered saturated
if it was reflected in more than 70% of the interviews, confirmed by member checks
(interviewee feedback on the analysed data), resonated with key informants, and
made sense given prior research’ (Bowen, 2008, p. 148). This is a useful indicator
because transparency about the number of codes we have is necessary if we are to
ensure that the audience for our research is not to believe that finality in the analysis
was determined by frustration or the need to meet deadlines rather than an obvious
and natural completion to the analysis.
t How should we make the tags? If we are doing this by hand, the best way is to make
several copies of the data and then annotate and copy. If we are dealing with text,
we could use a selection of highlighter pens. It is likely that as we progress through
the text some of our ideas may change, which will affect the coding. In this case we
note the change from old to new thinking, identify the data we have been working on
as Mark 1 coding and start over again with a clean copy. There are also computer pro-
grams we can use to help us handle the data. When we use these, the tagging process
is usually one where we highlight text or a portion of an image or a section of speech
and then click on a code.
t Should data only be tagged once? If we can devise a coding framework that is mutually
exclusive all well and good, but it is difficult. It is possible that we will start like this
but then things will get messy and codes will overlap. In fact, it is almost inevitable
that this will happen because as our coding framework grows, we cannot hold it all in
our minds. However, it is a good goal to set ourselves to work towards removing this
overlap at the end of each pass through our data. It will help us to be clearer about
our concepts and though it may require us to create more codes, there is a benefit in
the long run.
t How do we work through the data? Do we work through it case by case, looking at each
video of children playing in turn or reading each person’s interview transcript in turn
or do we look for aggression in all the videos of co-operative play (or in the responses
to question 4 and then all the responses to question 5)? The answer is that we have to
strike a balance between following our noses and being systematic. If something oc-
curs to us when we are marking up one piece of text, we should make a note of where
we are and what we are going to do, then we should examine the other texts to see if
there are similar meanings. For example, if we are coding transcripts of teachers’ in-
terviews about managing pupil behaviour and one teacher mentions that some pupil
behaviour deteriorates through the day, we should go back and see if anyone else has
made any allusion to the time factor. This could be important and could be related to
energy levels and dietary habits. We should always have by our sides a record book or
even a template of what analysis we have done on what data. As a guide, most people
seem to begin by looking at sets of data in the round. Some then follow their noses,
others continue to look at one set of data after another but look at them several times
Chapter 11 EXTRACTING THE INFORMATION FROM QUALITATIVE DATA SETS 473
over to make sure that each is reviewed in the context of our developing ideas. Lesley
Reid, at the University of Edinburgh, provides a good example of how this occurred
in her study of how teachers assess pupils’ writing (Reid, 2007). She noted that ’inter-
viewees were prompted to talk freely on three (sic) related issues:
t their own teaching career backgrounds;
t their knowledge and experience of writing pedagogy;
t knowledge and experience of writing assessment;
t their views of the role of pupils in the assessment process’ (pp. 136–137).
These themes provided the framework for first level descriptive coding. The research
team then met to discuss the project, their ideas were shared with the participants and
out of this there emerged additional coding to cover, for example, ’length and type of
experience’. The ’writing pedagogy’ codes were branched to include comments about
resources used and teaching approaches adopted (for example genre-based, process-
focused). Teachers’ comments about writing assessment were coded under technical
skills, planning, vocabulary use, creativity, writing structure. Their comments about
the role of pupils in the assessment process were coded under ’written and verbal
feedback, motivation, prioritising criteria’ (p. 137). Teachers were interviewed a sec-
ond time after they had met again to discuss assessment practice and additional codes
were developed from this to cover ’primary/secondary sector differences, the effect of
formative assessment practices and the importance of the context of text production’
(p. 137).
t How do we know that our coding is accurate? Well, coding is all a matter of interpreta-
tion so it is possible that our interpretation may be different from someone else’s
yet still be valid. This might be the case if I adopted a neutral stance to the data and
someone else came with a critical perspective. However, it is a good idea to get some-
one else (or some other people) to code some examples of the data and to compare
the outcomes. The differences can be discussed and used to inform our own coding
framework.
The process of coding and tagging data is not a clean one. It will go slowly at first as
we feel our way around the data. There are likely to be false starts – but this is no bad
thing because each time we learn something new. However, it does (usually) all come
good at the end!
through the data, we create codes – and some of these may be better than the ones we
have already developed.
t At this point, we have a coding structure in which we have reasonable confidence. It
is at this point that we often compare our coding structure and tagging results with
what others produce. Our next step is to assess the quantity of data in each code. We
want to ensure that our coding structure is not too coarse so that we have lots of data
in a few codes nor too fine so that we have a lot of codes with little data. In neither
case will we get a good sense of the information in the data set. If we are assessing the
amount of data in each code by hand, the best way is to transfer the tagged data onto
a record card and mark its location or origin on the card plus the name of the tag.
When we have done this for all of the tags, we assemble the cards into separate piles.
This allows us to do the following:
t compare the data in each pile to confirm the consistency of the tagging process;
t look at the largest piles and ask whether they contain subsidiary concepts or facts;
t look at the smallest piles (and it is likely that we will finish up with a ’pile’ of one
or two cards) and ask whether the code has a meaningful role to play.
We may, at this stage, restructure our concepts. Again, there is technology that can
help us with this.
t Our next step is to look at the codes and see if we can detect any common or over-
arching themes. In all likelihood we will approach this from different directions.
We could begin by comparing each of our coding categories in relation to the others
and asking, ’Why is this one different from that one?’ Our answer, ’Because it is…’,
will establish characteristics which may themselves be capable of being grouped. In
other words, by establishing how codes are different, we might be in a position to
identify some that have features in common. We could also try to construct our own
framework. Which codes deal with events, emotional states, environments, processes,
values? We could take our piles of cards and ask, ’How similar or different are these
two categories?’ and then use our answer to place them relative to each other. This
grouping of codes is a process that takes time and it needs time if it is to be effective.
What we are trying to capture and represent are the threads that hold the data togeth-
er. This is a creative process. It requires knowledge, understanding and insight. We
half see solutions but then they become shrouded in mist but, with time, links, pat-
terns and meaningful explanations do emerge. A group of doctoral students (Dye et
al., 2000) has described this process as moving from ’raw data bits’ to ’final category
array’. Their model is reproduced in Figure 11.2. The ’raw data bits’ represent the
initial tagging. The ’initial category set’ represents the first stage of bundling codes
together. These were progressively reviewed and refined until an overarching frame-
RAW DATA
BITS Category Set Refinement Refinement Category Array
work (final category array) was produced. Other themes are shown relative to each
other within the overarching theme. What this representation emphasises is that cat-
egories overlap. Clean and neat solutions are rarely an option.
In coding, experience is the great teacher. Activity 11.3 provides an opportunity to
make a start with some authentic data.
Most of what we read about bullying amongst young 16. will swear and then attempt to beat us. One day
people is about our own country. It does not take he tried to beat me, my son
much research, however, to realise that what goes on 17. run outside and throws him with stones. He was
in our country is replicated in many others. Visionary so furious since that day
is an EU-funded project that brings together experts 18. when he is drunk they fight. I am worried because
to address the problem and explore strategies to my son is no longer happy.
solve it (www.bullying-in-school.info). It collects re- 19. He hangs out with a group of boys. They are so
ports that demonstrate that bullying is a worldwide troublesome, I receive
issue. One report, from The Independent on-line edi- 20. complaints from neighbours that they take their
tion (South Africa) noted that ’Schools in Garteng are children’s money, tease them,
beset by racialised groups, gambling, bullying and 21. beat or even ask them to do nasty things. They
pupils who aren’t scared to use weapons on one an- will sit at the corner when a
other’ and reported research that ’children who bring 22. girl passes they tease at her, call her names and if
iPods, PsPs and expensive mobile phones are targeted she does not respond they
by children who want those items, while in poorer 23. laugh at her.
areas, violence happens over money and food’. 24. R: And what else?
In October 2004 Maraumo Maria Malematsa submit- 25. P: I think here at school they have called me com-
ted a thesis on bullying in a South African school as plaining that he had taken
part of the assessment for her Master’s degree in Edu- 26. money from one boy. Daily during break he will
cation (M.Ed.). A transcript of one of the interviews is go to the toilets ask money or
reproduced below: Your task is to code it to bring out 27. take money from other boys but he targets same
the salient issues. boys. If they don’t oblige he
28. will beat them after school. He will force others
1. R: Good morning, relax madam, as I have already to write homework’s for him
explained to your son do you 29. especially clever girls.
2. agree that I use a tape recorder to interview you? 30. R: Hmm…
3. P: Err… but ag, yes you may proceed. 31. P: Their group will beat other children at the street.
4. R: Tell me about your son’s behaviour at home. I am so worried he is now (Transcript page -1-)
5. P: I have serious problems with him even my 32. out of hand I have tried to call his uncle to talk
neighbours are complaining. with him but all in vain. But I
6. R: What type of problems? 33. think the cause of this bully behaviour is how my
7. P: It has started after marrying his step-father and husband treats him. I think
losing my job. 34. he feels not loved because he will not even sit
8. R: And with us or view the television. I
9. P: He will just sit there and sulk not talking to any- 35. really don’t know what to do. I am just afraid that
one even his sisters. My he will end up in jail or
10. husband after being paid will come home drunk. 36. commit suicide.
He will start by accusing him 37. R: You have said the cause of this bully behaviour
11. with things he had not done. At first he used to is my husband.
keep quite then now because 38. According to you is your son a bully?
12. he swears at him calling him names like I am not 39. P: He does not act like other kids. Now and then,
your father, you’re hopeless the school or neighbours
13. you will not make it in life and stupid my son an- 40. complain about him. I don’t know really but bully
swers back. is not maybe a person/child
14. R: What do you do? 41. who takes money by force from others or maybe
15. P. I sometimes intervene but I am always caught beat small boys?
in the middle. My husband 42. R: Ja, you are right.
V
476 Research Methods for Education Part 3 The Process of Data Analysis
43. P: So he is a bully. And I still say my husband is a 57. on talking with my son maybe he will gain some
cause, there is no peace in confidence.
44. our house. It frustrates him. 58. R: I will do so. According to you what should the
45. R: Where do you think they bully mostly? school do to stop
46. P: At school because he spends most of his time at 59. bullying?
school. 60. P: The school principal should talk to teachers. A
47. What should the school do planning on how to stop
48. P: Maybe try to talk with children who beat oth- 61. bullying should be written. Parents and school
ers or take their money. But I leaders should be called and
49. think if you have more children like my son call a 62. asked for their opinion. I think even in the class-
parents meeting maybe we rooms bullying should be
50. can came up with a better solution. 63. taught so that other learners should know that it
51. R: What else can we do? is not right to bully. If a
52. P: Involve other children who see what bullies do 64. learner had bullied his or her parents should be
daily ask about their called or the school should
53. opinions. Maybe you can draw up a plan and tell 65. have a special punishment. To stop bullying I
all the children that bullying think punishment must be used.
54. is not a fine thing. But mam to my problems I 66. But I think teachers too must stop bullying or
think I am going to discuss them maybe some are not aware that
55. with child protection unit people. My sister has 67. they hurt learners. I think they should learn more
told me about them they say about bullying.
56. they help. Maybe my husband will stop drinking 68. R: Thank you very much, bye.
and swearing. Please keep
t If there is a poor level of intercoder agreement, the reasons are reviewed by the code
designers and the coders. If the reason is that coders misunderstand the code criteria,
then retraining and redrafting produces a solution. If the reason is that codes are am-
biguous, then the criteria are reviewed by the team and more sharply conceptualised.
t As the coding structure is stabilised and released for use, consistency checks are con-
ducted on a regular basis.
Figure 11.5 Functional characteristics of text analysers (after Weitzman and Miles, 1995)
It deals in considerable detail with the processes outlined here and illustrates them in
relation to three leading programs; ATLAS-ti, MAXqda and Nvivo. The general advice
with respect to selecting software is to examine what programs offer in relation to what
the research enquiry requires. For example, if we are going to use one of the specific
approaches to data analysis that we look at later in this section rather than the generic
approach we reviewed in section 11.2, it is important that the technology works in the
way the technique requires. If, however, you are at the start of your research journey (as
many readers of this text will be), this advice is not particularly helpful. Most people
in this position will make their choice in one of the following ways: if they are part of
a university, then they will choose the one the university subscribes to or, if not, then
price is the criterion. Realistically, when we are learning and gaining experience, this is
a perfectly acceptable strategy. However, as we embark on a major piece of work (such
as doctoral research or a funded piece of academic or professional research), we should
review the software options in more detail because some of the programs are based upon
quite distinct method (and methodological) differences. As the German sociologist, Udo
Kelle, has pointed out, The Ethnograph is rooted in phenomenological and ethnographical
Chapter 11 EXTRACTING THE INFORMATION FROM QUALITATIVE DATA SETS 483
approaches, MAXqda has its conceptual origins in Max Weber’s concept of ’ideal types’
and AQUAD was influenced by a ’Popperian methodological approach’ (Kelle, 1997).
We cannot escape the fact that, in much qualitative research, methodology, software and
interpretation are inextricably linked.
The increasing sophistication of most generic software packages has seen concerns
grow about their potential to distance a researcher from their data. Indeed the range of
options within programs has probably led to more ’experimentation’ with the data, that
is, sorting and sifting data and generally trying things out just to see what emerges. This
image of ’experimentation’ is picked up by a Czech social scientist, Zdenek Konapasek.
His argument is that modern software is more than a tool, it is a ’virtual environment
for…practice-based knowledge making’ (Konapasek, 2008). He points out that the tech-
nology is ’the interface in which and through which we do thinking’. He could also rea-
sonably argue that it is also an aid to our creative thinking.
Notwithstanding Konapasek’s representation of the consequences of technology for
qualitative research, it is clear that computer assisted qualitative research and analysis is
sufficiently well established for continued innovation in product development. Hyper-
linking within data is becoming established. This is likely to drive the need for inbuilt
multi-dimensional analysis and visualisation of relationships. It is, however, the auto-
mated transcription of the spoken word that will make a qualitative strategy more attrac-
tive for many researchers.
Qualitative analysis, so far, has been presented as a generic approach. However, this is
something of a simplistic assumption, as we shall see below. Qualitative researchers can
approach their data from a large number of competing positions, each legitimate but
each possibly producing different explanations. On the one hand are researchers who
believe there is a single reality in the world they study (and, by implication, a single,
correct explanation of events). On the other hand there are many more researchers who
believe that the researcher constructs one out of countless explanations. Then there are
some, like Gerald Cupchik, a Canadian psychology professor, who see a half-way house
where there are common patterns of daily life (Cupchik, 2001). Realism, constructivism
and constructivist realism – the field of qualitative research can resemble a jungle that
poses many dangers for the innocent researcher. But as if this were not enough, there are
differences in the way we can go about our research. Some qualitative researchers adopt
a deductive approach and others (many more) have an approach that is inductive. Then
there are those who approach qualitative research from the perspective of critical theory
who believe that ’facts’ should be looked at from the perspective of power and authority
relationships and that the realist/constructivist debate is an irrelevance. Add to this the
fact that researchers come with their own personal baggage, their subject culture, their
research preferences, their personal goals (the box on the right side of Figure 11.1) and
we see that qualitative research is not just an exceedingly complex field, it is one that is
in a constant state of flux, shifting and developing all the time. It is impossible to construct
a simple representation of qualitative research because the field itself is not coherent.
All a text like this can reasonably do is pick out some islands of (relative) stability and
describe how data analysis happens there. This is what the next sections do.
484 Research Methods for Education Part 3 The Process of Data Analysis
towards critical discourse analysis. This idea of ’critical’ as an approach can be applied
to other types of qualitative analysis (such as critical narrative analysis) but they are
not considered here.
t The fourth and fifth approaches, testing other people’s findings and testing theory,
are effectively the same issue and the approaches we might use are the same. Tem-
plate analysis ensures that the specifics of how other people have worked are reflected
in our approach. For the other approaches, we use other people’s findings or theory
constructs to direct what we look for in our analysis.
Let us now look at these approaches in detail.
National During
standardised At school During compulsory compulsory
assessment system entry primary education secondary education
emerge from the data (bottom up), the strategy with template analysis is based on work-
ing top down, at least in the initial stages.
In general, a template is a pre-determined framework of concepts that are represented
as codes and the template we will use will be specific to our research issue. Where does a
template come from? The answer is from one of the following three sources:
1. It is one that has been used before. If this is the case we are using it either to test the
applicability and reliability of the method or to see whether results obtained in an
earlier study are applicable elsewhere.
2. Alternatively, it could arise from the nature of the research question. If we were evalu-
ating a policy or a process (such as a policy on the transfer of children from primary
to secondary school or the process of transfer in a particular area), the template is
likely to be dependent on the objectives or goals of the policy.
3. Finally, it can come from any combination of existing knowledge about an issue and
the philosophical perspective that the researchers choose to work within. Most tem-
plate analysis research seems to have its origins here. We shall look at two examples
of this in more detail.
As An points out, the bulk of the evidence outlined above shows that high stakes testing
harms the curriculum. However, we should not ignore the fact that a minority of studies
showed improvements in curriculum (if a pupil-centred pedagogy, greater content and
integration of knowledge are felt to be better in educational terms). The question we
should now be asking is this, ’If testing works in some situations and not others, why
not?’ An has pointed out the anomaly, the next research agenda is to find out why it
exists.
t the research questions – how nurses received, judged and used the feedback and its
value and utility; and
t the concept of social phenomenology developed by Alfred Schutz (Schutz 1967).
While this is too large a topic to explain here, it is, in essence, an assumption that
our life circumstances and our social interactions create shared meanings and un-
derstanding, which (a) establish an apparently objective world and (b) establish our
experience (knowledge) that we use to communicate with, assess and judge others
and operate in the shared world.
Drawing on these two sources, the coding framework they constructed consisted of six
categories of code: motives, social relationships, the meanings of social actions, systems of
relevance, ideal types and ’common sense’. In social relationships, for example, they dif-
ferentiated three interactions: ’we-relations’, where there was interaction, ’thou-relations’
where the relationship was not reciprocal and ’they-relations’ where the relationship
lacked individuality and personal identity and rested on a shared understanding of what is
488 Research Methods for Education Part 3 The Process of Data Analysis
Children are an important segment of the econo- t A fall in advertisements for products from 69% of
my. They are significant consumers both in terms of the total in 1993 to 55% in 1999.
what their parents spend on them and in their own t An increase in food advertising from 69% of
right. The UK bank, Halifax, conducts an annual survey the total product advertising in 1993 to 78% in
of the money children receive each week from their 1999.
parents (’pocket money’). In 2006, the average for chil- t An increase in the total of fats and sweets from
dren aged 7 to 16 was £8.20 with more for girls (£8.66) 36% of the food advertisements in 1993 to 53% in
than boys (£7.72) and more for children in London 1999 and fast-food restaurants from 23% in 1993
(£11.71) than for children elsewhere. Advertising is an to 28% in 1999. There were no advertisements
important means of communicating with consumers. for vegetables, fruit and protein-rich foods.
Our broader interest as education researchers should
mean that we are interested in the message contained Your tasks are:
in advertisements targeted at children.
t To judge whether the quantification of qualitative
This is the issue that an American professor of nu- data counts as a valid indicator in qualitative re-
trition, Carol Byrd-Bredbenner, investigated. She search. You should review other literature on this
looked at the content of advertisements broadcast on topic. A useful starting point is the website of the
Saturday mornings in 1993 and 1999. Her template Advertising Education Forum (http://www.aefo-
was one that had been used in other studies. It was rum.org). What else would a qualitative researcher
used to record information about the advert, length, want to know?
type (commercial product, TV programme) and prod- t To identify where else this type of approach could
uct advertised. In addition it coded the following for be used (e.g. analyses of classroom behaviour).
food: the type of food, the individuals in the adver-
tisement and compliance with advertising industry Source: Byrd-Bredbenner, C. (2002) Family and Con-
guidelines. The results were as follows: sumer Sciences Research Journal, 30(3), 382–403.
The second feature that creates distinctiveness is the focus of the enquiry. Klaus Krip-
pendorf, a German communications scientist working in the USA, who has been highly
influential in the development of content analysis, has identified the following areas
where the method can be used effectively (Krippendorf, 2004). He refers to these areas
as ’types of logic’.
t Extrapolations of:
t trends (see Carol Byrd-Bredbenner’s study described in Activity 11.3);
t patterns, to see whether two or more data sets show characteristics (in education,
for example we might look at press coverage of academic standards when assess-
ment grades improve year on year);
t differences, the obverse of our search for patterns. Do two data sets (such as a
school prospectus now and one ten years earlier) exhibit different characteristics?
t Establishing standards:
t identifying a feature to prove that it exists in a communication (in education this
could be a satisfactory specification of a pupil’s misdemeanour when parents are
informed that their child is to be suspended from school;
t evaluation of communications to show the level of factual accuracy or the exist-
ence of bias (for example, in newspaper reporting of educational outcomes);
t to reach judgements, an issue of great concern to education at the present time is
plagiarism. Content analysis can be used to demonstrate stylistic and language
use differences.
Chapter 11 EXTRACTING THE INFORMATION FROM QUALITATIVE DATA SETS 491
Whether the potential to use quantitative assessments (content analysis was originally
a quantitative approach) or its value in certain types of study makes content analysis
sufficiently distinctive as a method to warrant a place in a separate category is debat-
able. Certainly, as it explores relationships amongst concepts, it comes closer to other
types of qualitative analysis. What is apparent, however, is that its use is widespread in
the social sciences, from law to psychology and it is especially strong in media studies.
In education and research into young people it has been used to explore topics as var-
ied as the character of young people’s personal information on MySpace (Hinduja and
Patchin, 2008), group learning processes (Malatalo-Siegl, 2008), school evaluation in
the Netherlands (Blok et al., 2008) and the quality of life of ’street children’ in South
Africa (Vuyisile, 2006).
But grounded theory is more complex than this because we should refer not to
grounded theory but to grounded theories – and for many researchers it is important to
distinguish between them. Equally, however, there are others who believe that arguing
over the differences between them is akin to counting the number of angels that can
stand on a pin-head, that is, it is ultimately pointless.
Process Process
Constant comparison Memoing
Theoretical
Stage Three
coding
process we saw in section 11.2. What are the data units and where do the codes come
from? The data units are those segments of the data that contain something of mean-
ing in the context of the research. They can be anything from an expression or word,
through a sentence to a paragraph or a longer section of data. For example, if we are
investigating why parents do not act before legal proceedings are taken against them
for not ensuring that their children go to school and we are told that, ’I couldn’t make
sense of the letters that I got’, we might code this as ’complex communication/dif-
ficult language.’ The codes come out of the text and may be words used in the text or
terms chosen by the researcher. The codes will undoubtedly be influenced by where
the researcher is coming from. Dr Barry Brooker’s study of what stakeholders in a
school understood by leadership uses open coding from the classic grounded theory
approach to analyse interview and documentary data. Figure 11.7 shows an example
of this first stage coding process (Brooker, 2005, p. 58).
2. Selective coding: this moves us forward into a process of grouping codes together on the
basis of some overarching idea. If a parent told us that ’I didn’t get any letters brought
home’, with the implication that the child failed to deliver them, we might code this
as ’failure to ensure effective communication’. With the other code, ’complex com-
munication/difficult language’ we might put it into a higher level code called ’poor
communication’. The process seeks to find a link between codes that will bind them
to a core idea, the theoretical code. Because grounded theory assumes that a core idea
exists, it can be a trying process. It is, perhaps, for this reason that Kathy Charmaz
uses the terms ’initial’ and ’focused’ coding to describe the processes of substantive
494 Research Methods for Education Part 3 The Process of Data Analysis
Leadership for me, particularly as the school has grown, or Ifobably right from the
begi n ni ng too, but particularly now, is the leadershi p of people, because if you don't have
a happy staff, a staff who feel they are valued and feel that they've got a say and feel that
you k40w they're actually parttf a team, then you do n't have a soul. You don't have a
team ~hat works together.
(initial) and selective (focused) coding because (a) they make no assumption that
there is a single unifying theme and (b) they actually describe the process of a ’first
stab’ followed by ’creating order’ (Charmaz, 2006) In summary, the selective coding
process is one in which possible links between codes are batted to and fro between
many combinations of codes. The process continues until the package of substantive
codes linked with selective codes is fit for purpose.
t Theoretical coding: this is the final stage. It is the development of a core idea that
links all the codes together. Our aim is to develop an explanation that fits the se-
lective codes. However, we should not assume that a theoretical code a single
word or even a phrase. It probably will be a statement that explains the relation-
ship between the codes as variables. Such a statement is, in other words, the theo-
ry that grounded theory seeks to develop. The statement we seek to construct is an
explanation of the structures that we have isolated and relationships between
them. In our example so far, we might conclude that ’parents whose child does
not attend school are concerned about their child. Their inaction is because they
do not understand the sequence of events leading to prosecution and a key reason
for this is poor communication practice by the school and authorities’. This is the
theoretical statement. In edging towards a theoretical statement, we seek to ex-
plain ’why?’ In other words, the theory is causal or implies causality. In trying to
explain ’why’ we may find that certain things are not as clear or as well specified
as we would like. For this reason it is permissible (and expected) that we go back
and reclassify and recode. Case Study 11.4 gives an example of the derivation of a
theoretical code.
Let us now take a closer look at the two processes identified in Figure 11.6.
t Constant comparison: our grounded theorisation is built upon the coding process.
One of the problems with coding is that we can overwhelm ourselves with codes.
Our process has to be (a) effective in capturing the meanings in the data set and (b)
it has to be efficient in the way it does this. What do we mean here by ’efficiency’?
Putting it simply, we have to get as much meaning out of the data as we can with
Chapter 11 EXTRACTING THE INFORMATION FROM QUALITATIVE DATA SETS 495
the minimum number of codes. We can never know when we have achieved this
but the constant comparative method is our means of getting close to an optimal
solution. The approach works in the following way. Let us assume that we are cod-
ing responses to a series of questions that subjects could answer in their own way.
We might begin by looking at the first subject’s response and coding it. We then
move to the second subject and repeat the process. At this point we would see if (a)
the codes we gave to the second subject are applicable to the first (we will already
496 Research Methods for Education Part 3 The Process of Data Analysis
have done the reverse of this because we would know what codes we had developed
for the first subject) and (b) whether the two sets of codes overlap. If the codes
overlap, we would define new codes to ensure that no idea or element can be coded
into two codes. If one seems to be a sub-case of another, we would start to pencil
in a hierarchy. With the third subject we apply the codes from the first two subjects
and see if there are any more we would like to create. If we do, we would go back
and apply these to the first two subjects. This process continues with all subjects
and is essentially the same whatever data sets we have and however we choose to
work with them. In reality (though Glaser might argue otherwise), in undertaking
this process a researcher is not cocooned with just the data set and isolated from
the world of scholarship. That is, the process is not conducted in isolation. Experi-
ence, knowledge and understanding are all brought to bear on the comparative pro-
cess. While the process is as we have described it here, examples in the literature of
the process at work are virtually non-existent and most researchers appear to gain
experience on a trial and error basis. Nonetheless, the constant comparison of sub-
ject with subject and data with emerging concepts (selective and finally theoretical
codes) is one of the engines that push grounded theorisation from one stage to the
next. It is, however, an engine that can grind exceedingly slowly and can become
tedious and mind-numbing. Two Norwegian medical researchers, Tove Giske and
Barbara Artinian, report that it took them 18 months to code their data using the
constant comparative approach – and their sample consisted of interviews with just
18 patients (Giske and Artinian, 2007)! While constant comparison is the method
we use to edge towards being effective and efficient, it does not tell us when we are
there. Grounded theorists use the concept of ’saturation’ to help identify this point.
Saturation is achieved when new data does not create new insights (that is, it fits
into existing categories). In his work on community-based anti-poverty projects in
Jamaica, Glenn Bowen reports that the use of the constant comparative method led
to his sample size of interviewees growing from 24 to 36, so it can also have an effect
on the scale of research activity (Bowen, 2008).
t Memoing: the second process that drives grounded theory forward is memoing. This
is such an obvious process that it hardly seems worthwhile giving it a name but it is
very important. As we work with our data, ideas occur to us and we have to make sure
that we do not lose them. Memoing is the process of recording these ideas. It is such
a useful and important method of recording thoughts and hunches and the evolution
of our analytic ideas and frameworks that it is found extensively in other qualitative
methods. Barry Brooker wrote in a memo:
During the interview the principal was keen to soften any references to areas
that were not going as well as she would really like… it did leave me wonder-
ing whether some actions she undertakes, which seemed to be purely for caring,
human reasons, are also done for management and political reasons.
(Brooker, 2005, p. 57)
Melanie Berks and her colleagues at Monash University in Australia identify what we
can do with memos:
t Record the how and why of our decisions.
t Comment on the data, which helps us extract meaning (this is Brooker’s example
above).
t Record thoughts and ideas that may be useful subsequently in framing an analysis.
t Open up the study to other researchers – and other interpretations (Berks et al., 2008).
Chapter 11 EXTRACTING THE INFORMATION FROM QUALITATIVE DATA SETS 497
If grounded theory had stopped here, it would have been a fine achievement and
testament to a successful collaboration between two researchers from two different re-
search traditions. Since Glaser and Strauss’ work became known, however, qualitative
research has expanded greatly and with this expansion has come innovation and change
in method.
need for flexibility to suit circumstances. A change to method, however, means that
it is possible to produce a different theoretical interpretation from that which Gla-
ser and Strauss’ original method would have generated. Which raises the question,
’Who’s right?’
But how significant are these differences? Strauss and Corbin still have three stages to
their approach and, while they give the three stages different names from those used in
classic grounded theory, they perform really quite similar functions.
t The first stage, open coding, tags units of data. With Strauss and Corbin, however, this
usually requires line-by-line coding rather than ’chunks of text’ coding.
t The second stage, axial coding, groups open codes, much the same goal as Glaser
and Strauss’ selective coding. There are, however, some procedural differences. It may
be helpful to give an example of axial coding. Two American researchers in special
education, Jeannie Lake and Bonnie Billingsley, explored the nature of conflict that
can arise between parents and schools. They conducted open-ended interviews, with
parents, school officials and conflict mediators exploring the cause, circumstances
and resolution of disagreements and used grounded theory to process their data.
The open coding generated labels such as ’lack of factual information’, ’lack of legal
information’, ’misunderstanding of parental rights’, ’not understanding the operation
of educational systems’ and ’inability to judge program quality’ (Lake and Billingsley,
2000, p. 243). These were bundled together to create an axial code called ’knowl-
edge’. The process is this straightforward but it requires an excellent command of
language to define the umbrella concepts that describe the axial code. At this stage
two brains are often better than one!
t The third stage, selective coding, is the selection of a theme to be the core theme on
which the explanation rests. Again, the object is not so different from the purpose in
classic grounded theory.
And, for the record, Strauss and Corbin’s model also incorporates the constant com-
parison and memoing processes.
To the outside observer, therefore, it may not seem that the differences between the
classic approach to grounded theory and what Strauss and Corbin were saying were all
that great. In fact, many pragmatic researchers accept the differences with a shrug and an
assumption that ’needs must’. But to some, the differences were important and not only
could not but also should not be papered over.
You do not write, as Glaser did, ’Strauss’ book is without conscience, bordering on
immorality’ without feeling that you have been betrayed (Glaser, 1992, p. 5). Glaser’s
response to Strauss and Corbin was a defence of the classic approach but its effect was
to fuel debate. Methodologists have sought to define, understand and bridge the differ-
ences. Udo Kelle (2005) says that while Glaser may have a point, it is overstated.
What was not happening, however, was any slowing up in the use of grounded theory.
Researchers recognised the debate, largely ignored it and went ahead with their research,
using the method as they saw fit. This has given rise to what Bill Warburton has called
’the extended grounded theory family’ (Warburton, 2005), that is, a whole series of vari-
ants on classic grounded theory that have been developed to meet the needs of particular
situations or philosophical perspectives. In many regards this continued use and adapta-
tion of grounded theory shows the attachment of the academic community to the ap-
proaches. However, there is a sense amongst some researchers (see, for example, Jones
and Noble, 2007) that there is a lack of rigour in its implementation and that ’anything
goes’. This is a potentially serious criticism and we should ensure that it is not levelled
Chapter 11 EXTRACTING THE INFORMATION FROM QUALITATIVE DATA SETS 499
In 2001, the decennial census in the UK estimated Just these statistics are painful enough and while
that there were 175,000 young people whose task we might guess at the impact on the carers, there
it was to care for relations, principally grandparents, is no substitute for direct engagement with them.
parents and siblings: 13,000 of them did this for over Your task is to apply any variant or combination of
50 hours a week. In 2004, the organisation Carers UK grounded theory to the young carers’ own voices and
published a profile of young carers based on a sample to prepare a short report explaining and demonstrat-
of over 6,000. The survey showed that: ing your method and summarising your findings on
the consequences of being a young carer. The data,
t Over half lived in one parent families. available on the Web are of two types: a film made
t Most of those cared for are mothers. by young carers and a blog and discussion group used
t Physical health was the reason for half of the care by young carers.
needs and mental health problems for 29%.
t 60% of young carers provide domestic help, 48% Film: There2Care: http://youtube.com/watch?v= a73iPj_
general and nursing care (with 18% providing in- EHE.
timate personal care), 81% provide childcare and Discussion: The Princess Royal Trust for Carers: http://
82% provide emotional support. www.youngcarers.net/community/forums.
t 10% of young carers care for more than one person. Source: Dearden, C. and Becker, S. (2004) Young Carers in
t 65% of young carers have provided care for three the UK, The 2004 Report, Carers UK, London.
years or more.
at us. The best way to ensure this is to (a) be transparent and (b) report our method in
some detail. Activity 11.5 is an opportunity to practise this.
accurate description of what many of us actually use it for, word processing, graphic
design, surfing the Web and playing music and video. It does not take too much imagi-
nation to think that one day soon we will be calling it something different. Language
evolves to reflect and enable intellectual and material innovation. At the same time it is
the means through which we construct messages that others will interpret with the mean-
ings we intend, that is, we use language to create shared beliefs and expectations. This last
idea probably requires a little unpicking for its sense to be clear. The words we use have
a meaning. Over time, meanings become stable (though they can change in the future).
Take ’community’; what images and values does this conjure up? We might imagine a
rural scene of village life, where a small-scale location focuses interaction amongst the
people. Or we might imagine an urban scene where an ethnic group has chosen to live
together. In both cases, geographical scale is important and we might use terms such
as ’togetherness’, ’supportive’ and ’sharing’ to express processes and values that are tied
up with the idea of community. The word, then, expresses a way of living which, when
we deconstruct it, is one we value. A scale or so up from the community is the nation.
Again it is a concept we value. Within a nation, we are ’the same as’ and ’different from’.
However, while both community and nation can be argued over, the idea of nationhood
is far more contested. In Belgium, the Flemish and the French may feel more similar to
their neighbours than they do to each other. In Britain, political devolution has strength-
ened Welshness and Scottishness and may have weakened Britishness. In England, the
idea of a nation whose identity rests on shared culture and language has its roots in
the tenth century. In Italy and Germany, the idea of a nation was still an aspiration in the
nineteenth century. So when we use the words ’community’ or ’nation/national’, context
is clearly important. And now, with the EU, nations are progressively sharing actions to
create, of all things, a community but a community that is very different from the one
we looked at initially. Our interest, from the perspective of discourse analysis, is that the
use of the term ’community’ within a European context represents an attempt to imbue
the supra-national idea with the warmth and affection that we associate with its use at
the local level. The debate over what is to be shared and how it is to be managed and the
relative importance of national and supra-national institutions will, inevitably, either
redefine what community means or lead to a new concept and a new word.
While this example may not have immediate implications for the processes of educa-
tion, there are others that do. For example, take the concept of partnership. This has a
general use meaning of working together for a common good and a legal meaning of a
business enterprise structured in a particular way. In the UK, its use in public service pro-
vision has been growing. From the 1980s onwards, funding for regeneration and develop-
ment was offered to public and private institutions signing up to a partnership agreement.
Since then, the term has been extended to cover normal working relationships of both a
collaborative or contractual nature that have, to some degree, always been around. Part-
nership for Schools was the body that delivered the UK Government’s ’Building Schools for
the Future’ and ’Academy Development’ programmes (it ceased to exist in 2012 and its
responsibilities were transferred to the Education Funding Agency). The Partnership for
Schools Board of five people had representatives from the private and public sectors and
the civil service but not actually education, thereby, at one stroke, stretching the concept
of partnership to breaking point. The Independent/State Schools Partnership programme
is a funding initiative to strengthen collaboration and raise standards between the two
sectors. The Safer Schools Partnership is a mechanism for creating effective links between
schools, the youth justice system and the police service. School Sports Partnerships sought
to establish collaboration in sports development amongst local schools as a means of ac-
cessing public and National Lottery funding (closure of the programme was announced
Chapter 11 EXTRACTING THE INFORMATION FROM QUALITATIVE DATA SETS 501
in 2010). In these and other partnerships, the heart of the partnership association is ac-
cess to funding, which somehow does not fit easily with what many would understand
by partnership, that is, collaboration for a common good.
What all these examples show is that we use language to achieve outcomes and re-
frame realities and that the way we use language is not always the way that we have
used language in the past. This is what discourse analysis is concerned with and, more
than many, we as educationalists should appreciate that words are an extremely strong
weapon in enabling us to attain the futures we want.
Margie Wetherell, a Professor of Social Psychology, and her co-authors identify four
ways in which we can look at and analyse discourse (Wetherell et al., 2001, pp. 6–10).
1. A focus on words and the way we express ourselves. In this situation we are interested in
how context (such as social situation, cultural background, group affiliation) influences
how we use language. While this could include the study of dialect and the historical
evolution of language, more relevant for the educationalist might be how young people
create language that enables their discourse to be hidden from others. In 2005, the BBC
published a Lexicon of Teenspeak on its website (http://news.bbc.co. uk/1/hi/ maga-
zine/4074004.stm). Other websites also publish ’jargon busters’, so, if you are in the
dark when someone you interview says, ’He’s my blud and dis my yard’, you now know
where you can go for help. This new way of speaking even has a name – multi-cultural
London English. But it is not just ’teen speak’ that we should be interested in. We could,
for example, set the meaning in context. If a school is criticised in a report for unruly
pupil behaviour and lack of pupil achievement but damned with faint praise for its well-
written policies, there is more than a strong inference that aspirations are not translated
into action and it is the context that tells us this. And there is an interesting point on
which to end this first way of looking at discourse.
2. A focus on language in use is the second type of analysis. Here the focus is on interac-
tion between subjects and how language, the words, expression, emphasis, is used. This
may involve the detailed analysis referred to above but because language is batted back
and forth, we are interested in how one contribution shapes a response and how the
interaction between people creates meaning or understanding. Three Dutch researchers
show this method in use in their study of the way in which special needs pupils solve
mathematical problems (Aalsvoort et al., 2006). They video-recorded the interactions
of pupils and teachers as the pupils solved the problems (’sums’) and analysed them
in terms of ’materials used; questions raised during the sum; supportive activities, e.g.
offering help when the student seemed to fail in solving sums independently; and task
structure, e.g. learning to apply strategic knowledge by using spatial and numeral repre-
sentation of 1s and 10s correctly’ (p. 306). Their results are interesting because they indi-
cate that ’students can be more effective partners in processes that elicit mathematisation
if they are allowed to take part in the meaning-making of the strategies involved in doing
the sums’ (Aalsvoort et al., 2006, pp. 315–316). The implication of this is that learning
is effective when they are allowed and encouraged to find their own way of expressing
ideas and testing their interpretations with the teacher.
3. Focus on language pattern. Here, in the third type of analysis, the focus shifts from
the way language is used to the context in which it is used. We saw in the first approach to
discourse analysis that context helps create the meaning. Surfing, for example, means dif-
ferent things when we are on the beach or on the Internet. When we talk about language
pattern, however, we usually mean more than just individual words, rather the character
of the whole set of text or script. Two researchers at the University of Central Arkansas
in the USA, Shoudong Feng and Tammy Benson, looked at the language patterns of pre-
school children. The setting for their study was a computer suite in a pre-school centre
502 Research Methods for Education Part 3 The Process of Data Analysis
that they attended in order to give them a good start when they moved to formal school.
They videoed almost four hours of activity at times when the children had a free choice
of what they could do. From the recordings they extracted sections where the children
interacted with each other and coded the type of interaction. The coding system they
used was intended to show how the language met the children’s immediate needs and
desires. This was based on Michael Halliday’s thesis that language meets seven objectives
for children (Halliday, 1973):
t they can express wants;
t they can give instructions and so manage other people’s behaviour;
t through language they can get on with others;
t they can use it to describe themselves and how they are feeling;
t through language they can find out about their world and things in it;
t they can be creative and construct imaginary worlds, and
t they can give information and instructions.
What the researchers found was that almost all the children’s speech fell into these cat-
egories, though managing other children’s behaviour, using language to find out about
their environment and giving information and instruction were dominant. Their analysis
raises an issue about whether the results reflect the development stage that the children
have reached or whether the nature of the environment constrained behaviour and thus
language use (and development). If the latter, we have to pay more attention to the char-
acter of the environment and the nature of the activity in early years learning.
4. A focus on the link between language and the nature and structure of society. This type
of analysis seeks to understand how language is used to bolster our positions in a social
setting. At a macro level, it is used to show how language is a vehicle that can be used to
maintain privilege and assert status. As Wetherell says, ’Controversy is basic to this form of
discourse analysis because it involves the study of power and resistance’ (p. 9). Sarah Ohi,
an Australian academic, worked within this perspective in her exploration of the early years
literacy programme in the Australian state of Victoria. She identified that the literacy model
that was proposed was founded upon assertion and not an actual analysis of evidence. Her
discourse analysis of the policy documentation and teaching manuals showed that:
t there were few direct connections to primary sources of research evidence;
t implementation was heavily influenced by US example and there was no attempt to
assess whether the Australian situation created specific difficulties;
t there was no invitation or opportunity for teachers to assess primary evidence (be-
cause references were not cited and when they were they were not always referenced
in the text);
t there was no invitation for teachers to assess the model and this contributed to a sense
of being deprofessionalised (Ohi, 2008). We shall meet this type of approach to dis-
course analysis again when we look at critical discourse analysis (see page 508).
be noted that the quantity is much less than text-based) is related to a focus on pattern
and the association with the nature and structure of society (points 3 and 4 above).
Fiona Ormerod and Roz Ivanic, both then members of the Literacy Research Group
at Lancaster University, explored how children ’make meaning’. They looked at this
through project work, where interest, choice and control were managed by each child
and were reflected in the output. Their analysis looked at the materials children included
in their projects, the models they constructed, the images they drew, those they cut out
and used and the graphical quality of their writing and image framing. They noted that
’the choice of materials and the ways in which they are used to construct images can ap-
pear to communicate, in a very subtle way, something of the child’s attitude towards the
subject represented’ (Ormerod and Ivanic, 2002, p. 76). They note, for example, that ’in
Denise’s project on pigeons and doves, her use of a soft pencil and gently rounded repeated
strokes seems to communicate a sensual appreciation and protective fondness for the beauti-
ful fragile creature that she is depicting’. In another instance they observe, ’In the football
project produced by Robbie and four of his friends, there is a cutting of a photo from a
magazine, showing a player committing a foul, and next to it the word “Fowl” is written
boldly, in large letters, using thick black fibre-tip pen… The distinctive way in which the word
is written appears to communicate both confidence in the writer’s knowledge of the rules
of the game and a strong sense of shock and disapproval’ (pp. 76–77). In both cases the
emphasis is mine and not the authors’. We can see with these examples that through the
way pictures and words were constructed and the way they created emphasis the authors
were able to gain insight into how children were learning to express their knowledge and
understanding.
an individual perspective. As an aside, this raises the issue that interviewing to produce
a narrative must allow the personal voice to come through, that is, research interests and
the researcher should not inhibit the way an individual wishes to respond.
The thinking behind narrative analysis is that the personal voice reflects the priorities,
concerns, values and attitudes of the narrator and gets us closer, as researchers, to an in-
dividual’s personal experience or an institutional perspective (in the case of a school, for
example). Narrative is, of course, a form of discourse. Its distinguishing feature is that the
teller determines the sequence through which the data are revealed. There are different
types of narrative. Some report a sequence of events with a beginning and an end, some
are hypothetical reactions to stimulus ideas and some report things that happen on a
regular basis. In each case, we need our data in a particular form, namely that it should
be the writer’s or speaker’s own voice. The consequence of this is that narrative analysis
conditions the type of data we collect and the way we collect it. This is a good example of
why it is far better to plan our research procedure in detail and not to blunder through,
making decisions about method as we go.
But exactly how is narrative analysis carried out? To get to grips with this we have to
understand that, in broad terms, narrative analysis can deal with three types of data.
1. Research interview data: the interview is normally conducted in such a way as to mini-
mise interviewer interaction or interruption. Questions are phrased to encourage the
interviewee to speak for longer periods and to be responsible for the sequence of the
narrative. The type of instruction that might be used is, ’Tell me about…’
2. Existing narrative that exists in report or story form. We could, for example, use as our
data quality inspection reports, written statements of policy, responses to policy op-
tions, emails and blogs written by members of our target research community.
3. Written accounts constructed for the research. Gwendolyn Lloyd did this in a study on
how mathematics teachers in training got to grips with their professional role (Lloyd,
2006). She asked her subjects to compose fictional narratives that dealt with classroom
situations that supported and conflicted with curriculum statements on recommended
teaching themes and approaches.
Once we have the data, the framework within which we analyse it is rather different
from the tagging approach we considered earlier. Analysis can range along a continuum
from being more tightly focused to being more loosely focused. We shall look at exam-
ples of each in turn.
More focused approaches usually adopt a way of looking at the narrative developed
by an American Professor of Linguistics, William Labov. Heather Richmond, a Canadian
teacher educator, used Labov’s approach in her study of the experiences of adult learners.
She collected her data through focus groups and individual interview. This yielded ac-
counts that were expressed in the learners’ own words and in the sequence and with the
emphases that they wanted. She organised each narrative into a ’storymap’. This is a pic-
torial or tabular representation that seeks to place the narrative into various categories:
time (past present and future), personal state (ranging from disorganised and confused
to organised and clear) and experience (self, family, community and schooling). Each
storymap was then looked at through Labov’s frame (see Richmond, 2002). This consists
of four categories:
1. Orientation – a description of context, when and where something occurred, the na-
ture of the location, who was there.
2. Abstract – a summary of an event, story, narrative.
Chapter 11 EXTRACTING THE INFORMATION FROM QUALITATIVE DATA SETS 505
These categories are perspectives through which to process the data. What we have
to be aware of is that the ’facts’ are presented in order to achieve a specific perception,
which may colour the attitude of the listener to the narrator. Where this is identified, it
is additional research evidence. It does, however, move the analysis away from a consid-
eration of structure towards interpretation of meaning and intent. Richmond’s paper is
methodological in character and does not go into substantive research findings. How-
ever, Ilze Grobler who used the approach for her PhD study of teaching and learning in
higher education (Grobler, 2006) showed the change that happened to learners as they
moved to a collaborative learning approach and the importance of reflection in creating
(or as Grobler expresses it ’co-constructing’) knowledge. Qualitative research gives us
quite different insights into the learning process compared with those from quantitative
research. The latter may tell us what works (or what produces good outcome measures)
but qualitative research tells us what the process is and why it does or does not work.
More loosely focused approaches to narrative analysis are not any less rigorous but
they are less wedded to a procedure. They often make use of the coding procedure we
met in section 11.2 and it can, in fact, often be difficult to distinguish what makes a piece
of research a generic piece of qualitative analysis and what makes it a narrative analysis,
other than that the data are personal narratives. Heather Fraser, a Canadian social work
researcher, adopts a less formalised approach in her review of how to undertake narrative
research (Fraser, 2004) and she makes some very specific points about the culture and
standpoint of narrative research. She argues that narrative researchers have to find a way
between the idea that action is conditioned either by circumstance or by individual deci-
sion making, and the approach of narrative research is that of the chef who sees cooking
as an art form and does not try to stick to recipes. She clearly makes the point that narra-
tive researchers have to process their data in ways that extract the most or best informa-
tion and should not be constrained to follow any particular procedure. Her guidance
for analysing narratives is that we consider it in four stages, interpretation, comparative
experiences, the political dimension and the identification of common patterns.
t Interpretation: this is the stage in which we seek to pick out the salient features of
what we are being told. We have to read the narrative at an overall level to see if we
can characterise the story as a ’type’ and also at a more detailed level (perhaps even
read it on a sentence by sentence or line-by-line basis) in order to identify important
themes. She gives an example of how these two levels of interpretation can intersect.
In the middle of a narrative a respondent said, ’And I realised years ago that I always
had this feeling because I’d received so much charity in my life that I had to give back
to society’. Fraser called her story ’paying the debt’. Table 11.5 shows questions that
she thinks a researcher might usefully ask of the data at this stage. These direct us to
a close reading of the narratives and enable us to gain insight. Is there, for example, a
persuasive intent? Is there obvious difficulty in talking about something?
t Comparative experiences: at this point we want to see if the narrator places their story
in any cultural or social context. Does the narrator talk about themselves (what Fraser
calls the intra-personal experience), or about relations with others (the interpersonal)?
Are there references to popular culture (for instance ’a lot of people think…’) or social
relationships? In this process we are not putting these interpretations on the narrative
but looking to see if the narrative itself makes reference to them.
506 Research Methods for Education Part 3 The Process of Data Analysis
Stage Questions
t The political dimension: the next stage takes this further by looking for evidence of a
value base in either or both moral or political terms. Fraser suggests (Table 11.5) that
it is appropriate to look for evidence on the experiences of groups. She suggests by
class or gender or age or race amongst others. This could equally be family structure
or class size.
t Common pattern and differences: in the final stage we try to identify common themes
to and differences between narratives. Her questions here are far more general than at
earlier stages and are designed to challenge us and our conclusions. She seeks to re-
move the possibility of bias by asking, ’What would other theorists say?’’Are relevant
portions of the narrative interpreted fairly?’’Are narratives that challenge established
findings or the researcher’s pre-conceived ideas given full weight?’
It will be clear that Fraser’s less formalised approach has elements that we can see in
other approaches to the analysis of text. Progressively moving away from the detail of the
narrative, compare and contrast assessment and placing the accounts in a social, political
and cultural context are all found elsewhere.
Because narratives are individual constructs (that is, produced without influence or
interference by real people), this focuses the researcher’s attention on why the particular
elements of the narrative were selected (and not others) and why they were presented in
the way they were (and not others). In particular, a researcher will have to determine the
Chapter 11 EXTRACTING THE INFORMATION FROM QUALITATIVE DATA SETS 507
importance of elements and how this is reflected in their location in the narrative. Here
we have to acknowledge that there may be differences between speech and written text,
between reportage and argument and between factual and rhetorical styles. Narrative
analysis clearly requires a different set of understandings and insights compared with
other discourse analysis and novice researchers should understand the implications of
these genres before tackling the method.
[text] Brackets Indicates the start and end points of overlapping speech.
Equal sign Indicates the break and subsequent continuation of a
single utterance.
(# of seconds) Timed pause A number in brackets indicates the time, in seconds, of a
pause in speech.
(.) Micropause A brief pause, usually less than 0.2 seconds.
. or r Period or down arrow Indicates falling pitch or intonation.
? or p Question mark or up arrow Indicates rising pitch or intonation.
, Comma Indicates a temporary rise or fall in intonation.
- Hyphen Indicates an abrupt halt or interruption in utterance.
>text< Greater than/less than symbols Indicates that the enclosed speech was delivered more
rapidly than usual for the speaker.
<text> Less than/greater than symbols Indicates that the enclosed speech was delivered more
slowly than usual for the speaker.
° Degree symbol Indicates whisper, reduced volume, or quiet speech.
ALL CAPS Capitalised text Indicates shouted or increased volume speech.
underline Underlined text Indicates the speaker is emphasising or stressing the
speech.
::: Colon(s) Indicates prolongation of a sound.
(hhh) Audible exhalation.
tPS III
High dot Audible inhalation.
(text) Brackets Speech which is unclear or in doubt in the transcript.
((italic text)) Double brackets Annotation of non-verbal activity.
Source: Transana Keyboard Shortcuts and Transcript Notation, Henne, R. (and subsequently modified by
Woods, D.), http://www.transana.org/support/documentation.htm (November 2008).
setting. He quite clearly shows ’the machinery of talk with which leadership is enacted’
(Clifton, 2006, p. 202). With organisational management and leadership so important
now throughout the educational sector, particularly in countries where schools are
gaining more freedom from public administration, the processes of exerting authority
should be better understood, so that institutional managements can be better trained.
Case Study 11.5, though not educational, is an excellent example of the use of conver-
sational analysis to understand organisational failure.
Timed pause
Pilot in charge
(18.0)
PIC we’ll go down to fortythree hundred to there, (0.5) and if you c’n wind in
thirtyfour fifty,
(0.6)
PIC and when we- (0.9) when we get over there wind in twentyseven eighty.
(0.3)
PIC °that’ll be the minimum°. Speech between ° is quiet
(1.8)
PIC see how it looks.
(2.5) Rising pitch
PIC just for a giggle,
(6.4)
Sound PIC ah::: you c’n put the steps in there too if you wouldn’t mind.
prolonged
(1.5) Speech between <..> slower than normal
PIC >but you only need<to put the steps in <below the lowest safe>.
Melbourne MELB Gulf Air one (triple/two four) eight, (0.2) contact me on one two eight
control_ decimal one ( ).
(1.4)
The overall judgement was that the two were not Source: Nevile, M. and Walker, M. (2005) ’A context for error:
working as a team. This was the context in which the using conversation analysis to represent and analyse recorded
voice data’, Aviation Research Report 132005/0108, Australian
pilot instructed the co-pilot to set the altitude alert.
Transport Safety Bureau.
The co-pilot set 2,300 feet, which the pilot acknowl-
edged. The correct setting should have been 3,100 feet.
The aircraft hit the mountains at 2,250 feet.
discourse analysis as an approach do so from a radical perspective and see their research
as a challenge to a status quo that reinforces discrimination and embeds inequality.
This simple description smoothes over many divergent perspectives and anyone wishing
to engage with critical discourse analysis should read further to understand the differ-
ences and the disagreements. This section indicates these approaches in broad terms and
shows how they have been used in educational research.
510 Research Methods for Education Part 3 The Process of Data Analysis
As Rebecca Rogers and others comment in their review of critical discourse analysis in
educational research, it ’sets out to describe, interpret, and explain the relationships be-
tween language, social practices, and the social world’ (Rogers et al., 2005, p. 376). They
concluded their review by noting that most studies limited themselves to analysing how
power was reproduced and how it could be challenged. They also commented that the
methods used were not clearly described. This is a criticism in relation to transparency
that we have met before. If readers cannot see the process, they cannot judge the output.
What sorts of area have educational researchers looked at? Helen Oughton looked at
the UK Government’s strategy for up-skilling adults, ’Skills for Life’ (Oughton, 2007).
The context for this policy is two-fold, meeting the needs of economies that require an
updated knowledge and skills base, and secondly, the economic and social cost of those
who, by virtue of their educational attainment, find themselves towards the margins of
society. Oughton’s data were documents concerned with explaining the goals of ’Skills
for Life’ and the curriculum that was specified. Her focus was numeracy. Her conclusion
is that the documentation presents numeracy as ’a set of competences to be achieved in
response to employer demand’ (p. 273) when it could have been justified on the basis
of ’empowerment, personal achievement and critical literacy’ (p. 273). This conclusion
reflects a great gulf between those who see education as a means of being employed
and those who see it as a process of personal development that comes from being self-
motivated. It is the contrast between a programme whose outcomes are measured by the
number that can pass a test and one where success is measured by the number that go
on to further study.
Inge Askehave, a member of the Department of Language and Culture at Aallborg
University, looked at the market orientated focus that many universities are now em-
bracing (Askehave, 2007). Her data were international prospectuses from four countries,
Finland, Scotland, Australia and Japan. She showed that international prospectuses
adopted a common structure with common elements, each of which conveyed a message
and which, together, conveyed an impression (marketers might call this the ’brand’). Her
detailed analysis of one university’s brochure exposes the image being created: ’natural
beauty, historic feel and friendly atmosphere’ (p. 731). She digs deeper into the text and
shows that the ’university’ is projected as supportive and enabling. Her method is clearly
outlined and her analysis, from an educationalist’s viewpoint, is interesting. Again, as
with Oughton’s work, we find a representation of education that is economically func-
tionalist (in this case the functionalism is a focus upon the student as a ’customer’ rath-
er than as a learner) and very different from the liberal tradition that many academics
espouse.
Donald Gillies shows that this shift in educational purpose is deeply rooted in British
educational policy. Gillies explores the concept of ’excellence’ and notes its association
in modern times with a ’crisis narrative’, a sense that problems exist (because of an ab-
sence of excellence) which must be addressed by policy action (Gillies, 2007). He identi-
fies linkages between quality inspection in Scottish schools and the European Founda-
tion for Quality Management’s idea of excellence. What Gillies does most effectively is to
show the sleight of hand with which the term is used in different circumstances and that
while, as an idea, it is difficult to object to it, its implementation may have consequences
that may be objectionable (in particular, the application of a business quality model that
measures quality in terms of assessment outcomes and which serves to distort the educa-
tion process and adversely affects children’s experiences and achievements).
What do these studies show us? Very simply, that words are used to frame an issue,
that language is used to identify a solution and that the continuous reiteration of words
and phrases sets the agenda and, over time, shifts a status quo. And this is the power of
Chapter 11 EXTRACTING THE INFORMATION FROM QUALITATIVE DATA SETS 511
critical discourse analysis, it can help us understand what is happening and deflate the
bubble. It is not, however, a method of research for the novice.
(d) Semiotics
The idea of semiotics and the interpretations we put on objects, features and events has
been touched on before (see Chapter 2). Here we shall look at how semioticians work
and on research that has relevance for education. Our intention is to give an appreciation
of what it is aiming to achieve, not an account of its procedures.
Semiotics takes as its starting point that we understand the world as a series of signs
and symbols and that when we see, for example, a palace, we understand not just as a
grand building but a symbol of wealth and status. Semiotics is, for this reason, tradition-
ally described as being ’the science of signs’, in other words it is concerned with how we
create and transmit meaning. Semiotics developed as a field of research practice in the
early part of the twentieth century. There were two separate traditions, one in the USA
and the other in Europe. There was little intellectual exchange between them and the
one only really understood the existence of the other in the second half of the century.
It was the European tradition that began to have the greater impact in the social sci-
ences because it came out of the same sources as post-modernism and post-structuralism
(again, see Chapter 2). The ideas became part of popular culture with the publication of
Umberto Eco’s The Name of the Rose. Eco is an academic semiotician and his book incor-
porates semiotic themes. Dan Brown’s The Da Vinci Code does the same.
The purpose of semiotics is to identify signs in text, images, film, in fact any medium
through which we can communicate. There are three types of sign.
1. Icons: are representations of something real. A photograph, for example, is an icon.
Icons can take the form of images (our photograph), drawings (a sketch map, for
example) or metaphors.
2. Index: is an indicator of something else. If our joints ache and we have a fluctuating
temperature, it could indicate that we have ’flu.
3. Symbols: are constructed and agreed within a socio-cultural context. Symbols require in-
terpretation. Our palace example above shows what we mean by this. Symbols are all
around us. Look at the washing instructions on your clothes, for example. And when
you are browsing the Internet, which symbol do you press to go to your home page?
A cowrie shell and the €20 or £20 note in your pocket are all symbols of value. The
advertisements we see on our TVs and in our newspapers and magazines often embrace
symbolism. It does not take much imagination, for example, to understand the symbol-
ism in the advertisement in Figure 11.8.
These are the basic structures that semioticians work with and they use them to de-
construct communication, in other words break it up into its component parts. The thing
to be communicated (a fact, an idea, a value) is called the signified and the sign that
communicates is called the signifier. A crown, for example might be the signifier for
monarchy and the monarchy (or the king or queen) is what is signified. At this point
we have to introduce the idea of shades of meaning and to do this, we draw a distinc-
tion between denotation and connotation. We understand what we mean by a ’lesson’,
a coming together of students and teachers with, hopefully, a learning outcome. This
is what the idea of a lesson denotes. But when the music teacher, art teacher, biology
teacher or mathematics teacher all close their eyes, the image they see in their mind’s eye
may well be very different. The differences they attach to the core concept constitute the
connotative meaning. So for the biology teacher, a lesson takes place in a laboratory, for
512 Research Methods for Education Part 3 The Process of Data Analysis
the art teacher in an open space where students work at their own pace and so on. All are
lessons but the lessons are not all the same because the connotations for each are differ-
ent. Connotation can also be judgemental. If two teachers are talking about a pupil and
one says, ’Oh, her’, it represents recognition, but if she says, ’Oh, her’, with the emphasis
on ’her’, it says something beyond recognition that implies something bad.
Semiotic analysis is more complex and detailed than this but this overview is suf-
ficient for us to understand the method’s approach. Now let us look at how semiotic
research may be useful to educationalists.
The Norwegian education researcher, Marit Holm Hopperstad, used a semiotic ap-
proach to understand aspects of child development (Hopperstad, 2008). She wanted to
develop insights into how children made meaning and explored this by analysing their
drawing and play activity. Her study shows how we can use semiotics with groups who
are unable to verbalise what is happening or their reactions or emotions. Her theoretical
position was that drawing implies the selection of forms in order to make or represent a
meaning, so the artwork that is produced is a semiotic object. What Hopperstad wanted
to do was to understand how play informed the construction of the artwork. Her method
was to observe children drawing and she videoed their activity. Her observation notes,
the video record and the children’s drawings constituted her data. She analysed their
talk by linking talk and conversation sequences to particular art outcomes. Hopperstad’s
work is fascinating but too detailed to consider here. From a substantive point of view,
what emerges from her study is that children work and rework meaning in their drawing
Chapter 11 EXTRACTING THE INFORMATION FROM QUALITATIVE DATA SETS 513
and methodologically we see that drawing can be used to understand children’s ability
to make meaning.
Another area where educational researchers can usefully use a semiotic approach is in
understanding young people’s search for and creation of identity because this may well
be the context for understanding behaviour. Research on topics as diverse as adolescent
magazines (Currie, 1999), suicide behaviour (Anderson et al., 2005), body shape (Social
Semiotics, 2005) as well as the more obvious area of clothes (Morgado, 2007, on hip-hop
style) all help us appreciate in what ways young people can construct an image and cre-
ate an identity that should be recognised by those they want to recognise it. This type of
research, though outside education, is relevant to educationalists.
(e) Phenomenography
Phenomenography sets out to understand how individuals perceive the world in which
they operate. It makes no assumption that there is a single real world and accepts that
there can be many worlds, all real to individuals. In many ways the actual methods used
for data analysis in phenomenography need not be very different from those outlined
above. What is different about phenomenography is its starting position. In order to
appreciate how others see the world phenomenography has to be successful in data col-
lection. There are no specific phenomenographic methods of data collection so, using
whatever method is appropriate (interviews, diaries, personal accounts), the researcher
has to be sure that they understand what the subject is saying and also has to be sure
that the subject is saying what they are actually experiencing. In other words, there is
an additional stage in data production, interaction between researcher and subject, that
produces mutual understanding and agreement.
Phenomenography was developed by a Swedish professor of education, Ference Marton,
as a means of understanding learning from the student’s point of view (Marton, 1986). It
was Marton who developed the framework of deep and surface learning, something that
every student of education is (or should be) aware of. Deep learning has the outcome that
the knowledge is retained (which implies it is understood) and can be applied in new or
different situations. Surface learning is typified by rote learning, retained for a short time
then forgotten and therefore incapable of being used elsewhere. Given the assumptions
behind university level education, that students should be capable of independent
and self-directed learning, able to transfer their knowledge and understanding to other
situations and skilled in analysis, synthesis and argumentation, it is not surprising that it
should come under investigation and equally not surprising that the student’s perspective
should be placed at the forefront. As a concept deep and surface learning was rapidly
accepted by educationalists worldwide and it was this conceptualisation of learning
that probably explains the spread of phenomenography as a method. Marton’s analytic
method has much in common with other approaches. Essentially the data are investigated
and ’categories of description’ identified. These categories have the character of being
(a) elements common to subjects and (b) generalisations that result from groupings of
ideas into themes.
Many of the studies using a phenomenographic approach are studies of the learning
or teaching process. A New Zealand educationalist, Lois Ruth Harris, used the approach
to study teachers’ understandings of students’ commitments to learning (Harris, 2008).
Her starting point was that we know something about student disengagement from edu-
cation, especially its social and demographic associations, but we know little about the
circumstances in which students engage with what education can offer. Harris collected
her data through interview. Phenomenographers try to come at their data without any
514 Research Methods for Education Part 3 The Process of Data Analysis
pre-conceptions and to achieve this they state their viewpoint before commencing their
analysis. Harris even went so far as to limit her reading on the subject to methodological
and not substantive texts. She analysed the interview texts to identify ideas and themes
relevant to engagement. She picked out the most frequently occurring themes and identi-
fied passages that typified the theme. From these she extracted a generic theme, called a
category of description. The outcome of her research was that she was able to distinguish
six dimensions to engagement in learning.
1. Participating in classroom activities and following school rules.
2. Being interested in and enjoying participation in what happens at school.
3. Being motivated and confident in participation in what happens at school.
4. Being involved by thinking.
5. Purposefully learning to reach life goals.
6. Owning and valuing learning (Harris, 2008, p. 65).
Phenomenography has gained wide acceptance in education research because of its
application in understanding the learning process. It is not necessarily limited, however,
to learning. There is no reason why it cannot be applied to other situations where it is
valuable to reconstruct an individual’s view of the world. What is it like to be classified
as a ’failing teacher’ and to be in danger of losing one’s job? What is it like to be a male
teacher in a girls’ school (or vice versa)? What is it to enjoy learning in a culture where
this is not accepted? Understanding each of these situations better might lead to our
managing them better when things begin to go wrong.
Summary
1. Qualitative analysis is not an easy option. There are many approaches that can be
adopted. Each has its own views about the nature of reality and the nature of the
evidence provided by the data. These have to be understood because if they are not
we can finish up making assertions and drawing conclusions that are not warranted
by the philosophical foundations of the method. Compared with the comfort of
established research procedures and standards of judgement that exists in positivist
research, qualitative research is a minefield for the unwary and dangerous for the
naïve.
2. Most qualitative research is presented as taking place in natural settings (though
data collected through focus groups may, at times, fall outside this categorisation)
and as being within an interpretivist tradition. The implication of this is that the
researcher has to deal with many realities and find a way through them that seems
to make sense.
3. We must not assume that qualitative research lacks rigour. There is as much need
to demonstrate the quality of qualitative research as there is to ensure the qual-
ity of positivist research. Without demonstrating quality, what purports to be
research is just speculation. The best test of quality is transparency in proced-
ure. Much published research lacks this. There is considerable flexibility in the
way in which data can be collected and analysed. We should take advantage of
this flexibility if either or both our research question or research issue require
it. Flexibility, however, reinforces the need for transparency since it does mean
that academic peers rather than established research procedure are our test for
quality.
Chapter 11 EXTRACTING THE INFORMATION FROM QUALITATIVE DATA SETS 515
4. The number of qualitative approaches that we might use is large, larger even than
the set we have considered in this section. The approaches vary along a number of
dimensions:
t How predefined the methodological structure is – from highly structured
approaches such as grounded theory, to more loosely structured approaches such
as phenomenography.
t What influences methodology – from philosophical positions (as with critical dis-
course analysis) to conventions about research procedure, conversation analysis,
for example.
t The nature of the evidence provided by the data, from evidence of reality to
evidence about what people say about reality.
The way these approaches have been represented in this section is not definitive.
Other authors have, quite legitimately, presented their explanations in different ways.
5. While there are methods that have distinctive approaches (conversation analysis, for
example), many could draw on what we have set out as core processes. Semiotics,
grounded theory, discourse analysis and critical discourse analysis, for instance, all:
t use a range of data;
t transform data into a form suitable for analysis;
t code data;
t generalise from the data to produce themes.
6. The processes of data analysis, from coding to theming, can all be undertaken on
computers. This becomes a realistic option for more than small amounts of data.
Commercially available and free Web resources can also be used for early stage
explorations of data.
7. The choice of approach for data exploration and analysis should be made on the
basis of the research issue and the assumptions of the method.
t Do you think you are dealing with people’s experiences (in which case you might
consider, amongst others, phenomenography, content analysis or grounded the-
ory) or what people say about their experiences (look towards a type of discourse
analysis)?
t Do you think that the message is explicit in your data (perhaps conversation anal-
ysis) or will you have to dig deeper (perhaps semiotics)?
t Do you want to test other people’s findings (consider template analysis) or
develop your own (any other method)?
t Do you think that the conclusions should stem from the data or can you interpret
the data in the context of other findings (that is, Glaser’s model of grounded
theory or Strauss’ model)?
These are the types of question that have to be asked of any qualitative research
project. If they are not, then it is easy to make the wrong choice. Qualitative analysis
may have fewer ’must have’ conventions in respect of procedure but its assumptions
dictate the shape of the research.
Further reading
Corbin, J. and Strauss, A. (2008) Basics of Qualitative discourse analysis. This is a book to read if you are
Research: Techniques and Procedures for Developing unclear of your own philosophical position. It is, how-
Grounded Theory, Sage, Thousand Oaks, Calif. ever, not for the faint-hearted.
The latest edition of the text originally written by Strauss
Narrative analysis
and Corbin is still an excellent introduction to im-
plementing grounded theory. Kathy Charmaz’s book Webster, L. and Mertova, P. (2007) Using Narrative
covers the same ground, though in less detail. Enquiry as a Research Method: An Introduction to Using
Critical Event Narrative Analysis in Research on Learning
Discourse and conversation analysis and Teaching, Routledge, Abingdon.
Andersen, N.A. (2003) Discursive Analytical Strategies, The Often we are interested in turning points in people’s
Policy Press, Bristol. lives or the impact and effect of critical incidents. This
Fairclough, N. (2003) Analysing Discourse, Routledge, book shows how to conduct such an investigation.
London. Usefully, there is a whole chapter of examples of nar-
McClure, M. (2003) Discourse in Education and Social ratives so that researchers can see what they should
Research, Open University Press, Buckingham. aim to produce.
Rapley, T. (2007) Doing Conversation, Discourse and Docu-
Semiotics
ment Analysis, Sage, London.
Discourse analysis is a complex field. These four books Harnson, C. (2003) ’Visual Social Semiotics: Understand-
will take you to different parts. Each takes a different ing How Still Images Make Meaning’, Technical Commu-
approach. nication, 50(1), 46–60.
McClure provides a broad overview with case studies Van Leewen, T. (2005) Introducing Social Semiotics,
showing discourse analysis in different contexts. After Routledge, London.
reading in the introduction to Chapter 5 that ’like any Van Leewen’s book is a highly readable overview of what
other texts, research texts…are fabrications’, you begin can be a complex subject. It takes a broader view than
to look at research literature in a different way. just analysing material as signs, signifiers and signified
Rapley provides a sound introduction to some impor- and shows how people assemble resources to commu-
tant approaches to qualitative analysis. Chapter 3 is nicate an impression, an idea, a belief. It defines and
insightful in terms of ethical guidelines and advice on explains the principles of social semiotics and outlines
recording conversation and Chapter 4 on the practical- the frameworks used in analysis.
ities of recording outlines a procedure that will ensure Visual imagery is a growing area of data analysis, yet
you will obtain data. Chapters 6 to 9 look at ways of examples of how to go about it are few. Claire
extracting meaning from the data. Examples are well Harnson, an American analyst, has written a short
drawn to make points clear. review of images from a semiotic perspective in
Fairclough’s book has a much more ’hands on’ ap- the journal Technical Communication (available as
proach. More than with McClure, you will have to get a download from Amazon USA). With examples to
used to the technical terminology used in discourse explain principles and questions that will point
analysis but the ideas are well explained. researchers in the right direction, her paper is an
At the heart of Andersen’s book are the theoretical excellent introduction to a difficult area.
positions that determine the various approaches to
References
Aalsvoort, G., Harinck, F. and Gossé, G. (2006) ’Script Anderson, M., Standen, P. and Noon, J. (2005) ’A Social
Identification: A Way to Understand How Students Semiotics Interpretation of Suicidal Behaviour in
Choose Strategy During Mathematization?’ European Young People’, Journal of Health Psychology, 10(3),
Journal of Special Needs Education, 21(3), 301–319. 317–331.
An, W. (2007) ’High Stakes Testing and Curriculum Askehave, I. (2007) ’The Impact of Marketisation on
Control: A Qualitative Metasynthesis’, Educational Higher Education Genres – The International Student
Research, 36(5), 258–267. Prospectus as a Case in Point’, Discourse Studies, 9(6),
723–742.
Chapter 11 EXTRACTING THE INFORMATION FROM QUALITATIVE DATA SETS 517
Grobler, I. (2006) ‘Co-contructing Knowledge in a Lake, J. and Billingsley, B. (2000) ’An Analysis of Factors
Psychology Course for Health Professionals: A that Contribute to Parent School Conflict in Special
Narrative Analysis’, PhD thesis, University of Pretoria Education’, Remedial and Special Education, 21(4),
(accessible online http://upetd.up.ac.za/thesis/ 240–251.
available/etd-06212007-121700 accessed December Lewins, A. and Silver, C. (2006) ’Choosing a CAQDAS
2013). Package: A Working Paper’, available at http://caqdas.
Halliday, M. (1973) Explorations in the Functions of soc.survey.ac.uk/index.htm (accessed July 2008).
Language, Edward Arnold, London. Lewins, A. and Silver, C. (2007) Using Software in Quali-
Harris, L.R. (2008) ’A Phenomenographic Investigation tative Research: A-Step-by-Step Guide, Sage, London.
of Teacher Conceptions of Student Engagement in Lloyd, G. (2006) ’Pre-service Teachers’ Stories of Math-
Learning’, Australian Educational Researcher, 35(1), ematics Classrooms: Explorations of Practice Through
57–79. Fictional Accounts’, Educational Studies in Mathemat-
Henne, R. (and subsequently modified by Woods, D.) ics, 63(1), 57–87.
(2008) Transana Keyboard Shortcuts and Transcript MacQueen, K., McLelland, E., Kay, K. and Milstein, B.
Notation, http://www.transana.org/support/ (1998) ’Codebook Development for Team-based
documentation.htm (accessed November 2008). Qualitative Analysis’, Cultural Anthropology Methods
Hinduja, S. and Patchin, J. (2008) ’Personal Information (renamed Field Methods), 10(2), 31–36.
of Adolescents on the Internet: A Quantitative Content Malatalo-Siegl, K. (2008) ’From Multiple Perspectives to
Analysis of My Space’, Journal of Adolescence, 31(1), Shared Understanding: A Small Group in an Online
125–146. Learning Environment’, Scandinavian Journal of
Hopperstad, M. (2008) ’How Children Make Learning Education, 52(1), 77–95.
Through Drawing and Play’, Visual Communication, Marshall, H. (2002) ’What Do We Do When We
7(1), 77–96. Code Data?’, Qualitative Research Journal
Hsieh, H-S. and Shannon, S. (2005) ’Three Approaches 2(1), 56–70, http://www.latrobe.edu.au/aqr/
to Qualitative Content Analysis’, Qualitative Health journal/1AQR2002.pdf.
Research, 15(9), 1277–1288. Marton, J. (1986) ‘Phenomenography: A Research
Johnson, B. (2011) ’The Speed and Accuracy of Voice Approach to Investigating Different Understandings
Recognition Software-Assisted Transcription Versus of Reality’, Journal of Thought, 21, 28–49.
the Listen-and-Type Method: A Research Note’, Matheson, J. (2007) ’The Voice Transcription Technique:
Qualitative Research, 11(1), 91–97. Use of Voice Recognition Software to Transcribe
Jones, R. and Noble, G. (2007) ’Grounded Theory Digital Interview Data in Qualitative Research’,
and Management Research: A Lack of Integrity?’, Qualitative Report, 12(4), 547–560.
Qualitative Research in Organisations and Morgado, M. (2007) ’The Semiotics of Extraordinary
Management, 42(2), 84–103. Dress: A Structural Analysis of Interpretation of
Kelle, U. (1997) ’Theory Building in Qualitative Research Hip-Hop Style’, Clothing and Textiles Research Journal,
and Computer Programs for the Management of Tex- 25(2), 131–155.
tual Data’, Sociological Research Online, 2(2) at http:// Nevile, M. and Walker, M. (2005) ‘A Context for Error:
www.socresonline.org.uk/socresonline/2/2/1. html Using Conversation Analysis to Represent and Ana-
(accessed July 2008). lyse Recorded Voice Data’, Aviation Research Report
Kelle, U. (2005) ’“Emergence” vs “Forcing” by Empirical 132005/0108, Australian Transport Safety Bureau.
Data? A Crucial Problem of Grounded Theory Ohi, S. (2008) ’The Teacher’s Role in the Research-
Reconsidered’, Forum Qualitative Social Research, 6(2), Policy-Prexis Nexus’, Australian Journal of Education,
Art. 27, available at http://nbn-resolving.de/urn.nbn. 52(1), 95–109.
de: 0114-fqs0502275(accessed July 2008). Onwegbuzie, A. and Daniel, L. (2003) ’Typology of
Konapasek, Z. (2008) ’Making Thinking Visible With Analytical and Interpretational Errors in Quantitative
Atlas.Ei: Computer Assisted Qualitative Analysis as and Qualitative Educational Research’, Current Issues
Textual Practices’, Forum Qualitative Social Research, in Education, (on-line) 6(2), available at http:/ cie.
9(2), Art. 12, available at http://nbn-resolving.de/ ed.asu.edu/volume6/number2/ (accessed July 2008).
urn:nbn:de:0114-fqs0802124(accessed July 2008). Ormerod, F. and Ivanic, R. (2002) ’Materiality in
Krippendorf, K. (2004) Content Analysis: An Introduction Children’s Meaning Making Practices’, Visual
to its Methodology, Sage, Thousand Oaks, Calif. Communication, 1(1), 65–91.
Chapter 11 EXTRACTING THE INFORMATION FROM QUALITATIVE DATA SETS 519
Oughton, H. (2007) ’Constructing the “Ideal Learner”: Spencer, L., Ritchie, J., Lewis, J. and Dillon, L.
A Critical Discourse Analysis of the Adult Numeracy (2003) Quality in Qualitative Evaluation: A Framework
Core Curriculum’, Research in Post-Compulsory for Assessing Research Evidence, Cabinet Office,
Education, 12(2), 259–275. London.
Pettigrew, S. (2002) ’A Grounded Theory of Beer Strauss, A. (1987) Qualitative Analysis for Social Scientists,
Consumption in Australia’, Qualitative Market Cambridge University Press New York.
Research, 5(2), 112–122. Strauss, A. and Corbin, J. (1990) Basics of Qualitative
Pinsof, W. (1986) ’The Process of Family Therapy: The Research: Grounded Theory Procedures and Techniques,
Development of the Family Therapist Coding System’, Sage, Newbury Park, Calif.
pp. 201–284 in The Psychotherapeutic Process: A Tercanlioglu, L. (2008) ‘A Qualitative Investigation
Research Handbook, Greenberg, L. and Pinsof, W. (eds). of Pre-Service English as a Foreign Language
Guildford Press, New York. (EFL)–Teacher Opinions’, The Qualitative Report,
Rademaker, L., Grace, E. and Stephen K. Curda (2012) 13(1),137–150.
’Using Computer-assisted Qualitative Data Analysis Vuyisile, M. (2006) ’The Quality of Life of “Street Chil-
Software (CAQDAS) to Re-examine Traditionally dren” Accommodated at Three Shelters in Pretoria:
Analysed Data: Expanding Our Understanding of the An Exploratory Study’, Early Child Development and
Data and of Ourselves as Scholars’, The Qualitative Care, 176(3–4), 253–269.
Report, 17 (43), 1-11. Walker, D. and Myrick, F. (2006) Grounded Theory: An
Reid, L. (2007) ’Teachers Talking About Writing Assess- Exploration of Process and Procedure, Qualitative
ment: Valuable Professional Learning?’ Improving Health Research, 16(4), 547–559.
Schools, 10(2), 132–149. Warburton, W. (2005) ’What Are Grounded Theories Made
Rice, M., Sell, M. and Hadley, P. (1990) ’The Social In- Of ?’, available at http://eprints.soton.ac.uk/16340/
teractive Coding System (SICS)’, Language, Speech and (accessed July 2008).
Hearing Sources in Schools, 21(2), 2–14. Weitzman, E. and Miles, M. (1995) Computer Programs
Richmond, H. (2002) ’Learners’ Lives: A Narrative for Qualitative Data Analysis: A Software Sourcebook,
Analysis’, Qualitative Report, 7(3), available at Sage, Thousand Oaks, Calif.
http:// www.nova.edu/ssss/QR/QR7-3/richmond. Wetherell, M., Taylor, S. and Yates, S. (2001) Discourse as
html(accessed August 2008). Data, Sage, London.
Rogers, R., Malanchavavil-Berks, E., Mostey, M., Hui, D. Wheway, R. and Millward, A. (1997) Children’s Play:
and O’Garro Joseph, G. (2005) ’Critical Discourse Facilitating Play on Housing Estates, Chartered Institute
Analysis in Education: A Review of the Literature’, of Housing.
Review of Educational Research, 75(3), 365–416. Willett, R. (2006) ’Poofy Dresses and Big Guns: A
Rusby, J., Estes, A. and Dishion, T. (1991) The Interper- Post-Structuralist Analysis of Gendered Positioning
sonal Process Code, Oregon Social Learning Centre, Through Talk Amongst Friends’, Discourse, 27(4),
Oregon. 441–455.
Schutz, A. (1967) The Phenomenology of the Social World, Wright, P. and Davis, A. (2008) ’Adolescent Parenthood
Northwestern University Press Evanston, IL. Through Educators’ Eyes: Perceptions of Worries
Social Semiotics (2005) ’Thinking Fat: Special Issue of and Provision of Support’, Urban Education, 43(6),
Social Semiotics’, Social Semiotics, 15(2). 671–695.
Chapter contents
Introduction 521
Summary 561
Further reading 561
References 562
Chapter 12
Learning themes
t The questions that statistical description terms using a range of methods and
can answer. understand the implications of using
these methods.
t The three key characteristics of data sets,
how typical any value is, how spread out t Understand how the standard deviation
the data are and the shape of the data set. is a gateway to more advanced statistical
reasoning (standard error, sample size,
t Describing data expressed through differ- statistical testing).
ent types of measurement scale.
t Appreciate that data can be transformed
t Expressing the degree of association to approximate to a normal distribution
between one data set and another. and know how to effect transformations
t Assessing trends over time can show a using logarithms.
pattern. t Understand the principles of smoothing
t Working with real data sets. time series.
By the end of this section you will: t Understand how to represent and measure
the association between two variables and
t Understand the use of statistical notation. be able to calculate a measure using dif-
t Be able to describe the shape of a data set ferent data types.
(centre, spread, peak, lean) in numerical
Introduction
It is almost inevitable that, at some point in a view to developing a teaching approach
our research, we will have to deal with num- that helps them conceptualise mathematical
bers. Even if our research is firmly rooted in concepts), we could show the significance of
the qualitative tradition, it is likely that we will our research in terms of the cost of remedial
need to demonstrate the significance of our support at later stages in the educational sys-
study in a broader context. If, for example, our tem, the proportion of pupils who failed to
research was into understanding why some reach a standard or the percentage of adults
children have difficulties with numeracy (with whose numerical capability was insufficient to
522 Research Methods for Education Part 3 The Process of Data Analysis
enable them to function effectively in society. explore the performance data were averages
We could estimate the cost to the economy of and percentages.
inadequate numeracy. All of these numbers
are simple descriptors of situations but with- It is not just the demonstration of the fact that
out them our study cannot be benchmarked quantitative analysis can help us draw conclu-
in terms of its broader importance. But while sions that is important here, it is also that the
our research would be of significance, its real methods used were so simple. There are, of
value arises when we use the insights we have course, more complex procedures for answer-
gained and develop a new approach to teach- ing questions like, ‘is the difference between
ing that will help students learn. This is what these two sets of data significant?’, And we
a joint British/Dutch research team that looked shall meet some of them in chapter 13. What
at strategies for solving mathematical problems Anghileri and her collaborators did was show
used by English and Dutch 9- and 10-year-old a marked difference in performance just by
pupils did (Anghileri et al., 2002). They were describing the data mathematically. And this
able to show that the approach of the Dutch is what this section is all about, using statisti-
pupils (and the teaching approach that under- cal summaries to draw out the characteristics
pins this) was more effective than that of the of data sets so that, by inspection, similarities
English. And the only statistics they used to and differences can be identified and assessed.
degree to which a policy has been attained, practice improved or a social problem al-
leviated), so that we can look at them in relation to other instances. Summaries focus
on the message and suppress the noise.
r Second, description is the means by which we show our understanding. Descriptive
analysis is the process through which we move towards an understanding and our
description is a statement of where we have reached in our understanding. We cannot
show our understanding without describing it and the description of data is a power-
ful tool at our disposal.
r Third, we can describe a position in time. This creates a platform for establishing dif-
ference between now and a point in the past or the future and is the basis for further
exploration of our understanding about the dynamics of a situation. Without descrip-
tion we cannot take our thinking further and move from an understanding of ‘What
is?’ to formulate such questions as ‘What might be?’, ‘What should be?’ or ‘What
could be?’
British educational psychologists, led by Peter Smith, created a profile of those who
were bullied at school (Smith et al., 2004). They categorised them as ‘non-victims’,
‘escaped victims’, ‘continuing victims’ and ‘new victims’ and used a variety of self-
report and teacher reports to create the profile. The indicators they used included how
much they enjoyed school, their likes and dislikes about school, patterns of friendship,
whether they had ever bullied, their scores on a behavioural profile (that measured
emotion, conduct, inattention, peer relationships, social behaviour), school attend-
ance and the outcomes of being bullied. Their results, presented as percentages and
average scores, revealed that the groups differed in their adjustment to school and the
way they functioned in school. The results also showed that continuing victims of bul-
lying were judged by teachers as having high rates of conduct disorder, hyperactivity
and emotional problems. The benefit of this research is that it gives us insights not only
into bullying but also into the link between being bullied, classroom disorder and,
perhaps, exclusion.
Another approach to profiling is to assemble statistics from national sources. This is
what Pamela Meadows and Daniel Roegger, two researchers under contract to the UK’s
Department for Work and Pensions, did in order to profile low income homeowners
(Meadows and Roegger, 2005). This is the sort of research that in education we should
be aware of because we need to understand the dimensions and consequences of low
income for educational attainment and processes. Using data from the Parents and Chil-
dren Study, the Family Resources Survey, the Survey of Housing, Family Spending, the British
Household Panel Survey, the House Condition Survey and the Poverty and Social Exclusion
Survey, they were able to create a profile of low income homeowners in terms of income
levels, house value, mortgages, repossessions, age, size and condition of home, their
neighbourhood, patterns of consumption, savings, wealth and debt. While 60% of low
income homeowners are over 60-years-old, this still leaves about 5.5 million house-
holds who are under 60, a significant proportion of which will have children or young
dependants.
A Belgian study has done just this (Willems et al., 2005). A sample of just over 6,000
school children in Ghent (30% of the population) revealed an infection rate of 8.9%.
The following additional data were also collected:
r Personal data – age and sex.
r School data – school, class.
r Hair length, colour and type.
r Family data – number of children in the family, socio-economic status.
The results are shown in Table 12.1. This profile suggests association between vari-
ables. More girls than boys had hair lice. Infestations decreased from kindergarten to
6th year primary classes and increased with hair length and members of children in the
family. They were higher for lower socio-economic groups. We have, here, the basis for a
conceptual model but before we put it together we need think whether the associations
are direct or indirect. Why should gender per se be an influence? Only because girls might
have longer hair. To what extent are large families associated with lower socio-economic
status? Further investigation is clearly required but the use of simple descriptive statis-
tics has taken our understanding forward. And why should we as educationalists be
interested in this research on head lice? Because we want to understand the reasons for
absence and the causes of bullying, amongst other things.
% %
In Chapter 10, we learnt how to represent data graphically. In particular we saw how we
could use diagrams to represent the pattern in the data. We shall do exactly the same in
this section and learn how to make numerical statements that represent data character-
istics and form.
If we look back to Figure 10.6(a) we can see that there is a definite shape to the pattern
in this data. As we shall see, this shape and its characteristics are important in quantita-
tive analysis because it is the link between data description which we cover in this sec-
tion, and more advanced statistical analysis that we look at in the next.
Let us look in a little more detail at some of the characteristics of a distribution. We
need to understand what these are because they can be used to describe how one distri-
bution differs from another. There are four aspects of the shape of the distribution that
we should focus on: where the centre is, the spread of data around the centre, the height
of the distribution and whether or not it is symmetrical.
r The first thing that we should notice about a distribution is that there is a centre, the
point around which the observations are equally distributed. Imagine the distribu-
tion as a balance, with the same weight of observations on each scale. We can describe
where this point is using measures such as the mean, mode and median. All express
the ‘averageness’ of a distribution. We often use this measure to say something about
our data. For example, we might say that the average score of pupils in school A on a
language test was 48 and in school B it was 61. When we express the performance of
the pupils in this way, we think we are identifying a difference that may be important.
r The second characteristic of a distribution that can interest us is that there is a spread
of data around the centre. In some cases most of the data will be close to the centre
and in other cases the data will spread widely from the centre. In describing data it is
useful to be able to capture the extent of the spread of the data. We can measure this
spread. As Figure 12.1(a) shows, it is perfectly possible to have two collections of data
centred on the same point but with different spreads and it is important, when we
describe our data, to be able to tell people about this difference.
r The third feature of a distribution is how high it is. In Figure 12.1(b) the distribu-
tions are of different heights, though in both they have the same centre and the same
spread. This relationship between height and spread is called peakedness. We can
produce numerical descriptors of peakedness as well.
r Finally, the fourth characteristic a data distribution may have is that it can be sym-
metrical or that it can lean to the left or right side. This is shown in Figure 12.1(c). This
imbalance between left and right is called skew. The coloured curve is skewed to the
right and the black shade to the left. We can describe skew in numerical terms as well.
It is important that we describe our data sets accurately because our description can
highlight that it may be worth investigating further. Why do pupils in two schools score
such different grades? If test scores are all concentrated around the average does this
mean that all the students are much the same or is the examiner only using part of the
mark scale? While we shall look at each of our characteristics in turn, we should be
aware, at the outset, that central tendency and spread are the most important. The final
Chapter 12 EXTRACTING INFORMATION FROM QUANTITATIVE DATA 527
(a)
(b)
(c)
points we should make about the numerical descriptors we shall meet is that (a) in cal-
culating them, we shall use different measurement systems and that (b) in describing
some of these features of distributions, we shall combine indicators from these different
measurement systems. This is not at all difficult but it may be useful to refresh memories
by reading section 4.4 again.
average when we mean usual. ‘Middle’, ‘more’ and ‘usual’ are all different words we use to
express typicality and each one is different. Statistics is the same and each of these ideas
has a statistical equivalent. In this section we shall learn how to calculate them, what
they mean and how to use them.
∑ represents the process of summing all the values of x from the first (1) to the nth
l
– indicates the division process
n is the total number of xs
As you read articles and books you will come across this sort of notation. It is not
(usually) put in to confound us. It is a shorthand way of saying ‘this is how I did the
calculation’ and it can be important for us to know this because some calculations can be
done in different ways (especially with statistical tests) and each way should be used only
in particular circumstances. As researchers, we should be able to read what the notation
is telling us. As we can see, with this example, it need not be too difficult.
Table 12.2 shows the number of free school meals taken in primary schools in a Scottish
education authority on a census day in 2008. Data from 53 schools are shown. With this
amount of data it is difficult to get a sense of the typical, so calculating the mean value
is important. But calculations from official data sets like this can be tricky. The left-hand
column has several asterisks. These indicate either where the number was four or less or
where it could not be worked out. We have to decide on what basis to calculate the mean.
r If we choose to ignore the schools for which we have no information, then we divide
the total number of pupils taking free school meals (2,077) by 46 to give a mean
of 48.2.
r If we count the schools that have asterisks, we can assume either that no one had free
school meals (we divide our total by 53 to give a mean of 39.2) or that four in each
school had free school meals (which would increase our grand total by 28 pupils and
give us a mean of 39.72).
Which mean value should we take? First, we should investigate to see if we can iden-
tify the reasons for the missing data. If, after this, we still have no solution, we have to
decide whether to ignore the schools for which there are no data and just use the data we
have (which will over-estimate the true mean) or set the missing values to zero (which
will under-estimate the true mean). In some cases it may be better to over-estimate (for
Chapter 12 EXTRACTING INFORMATION FROM QUANTITATIVE DATA 529
Table 12.2 Free school meals taken in primary schools in a Scottish education
authority (2008)
49 49 72 72 40 40 93 93
1 1 31 31 42 42 8 8
10 10 65 65 15 15 25 25
17 17 24 24 28 28 8 8
102 102 2 2 47 47 16 16
22 22 35 35 * 0 63 63
21 21 * 0 14 14 103 103
21 21 * 0 * 0 62 62
51 51 59 59 118 118 86 86
10 10 7 7 31 31 60 60
84 84 42 42 74 74 95 95
25 25 62 62 15 15
* 0 91 91 52 52
* 0 * 0 79 79
example, if allocating funding) but generally we should go with the data we have, while
noting that our best estimate of the mean may be within a range.
While there may be issues with missing data, the method of calculating a mean is ac-
tually straightforward. Much of the data we have to work with, however, may be grouped
into classes (which statisticians refer to as ‘bins’) because this is the way the data is pre-
sented to us by official sources. This poses a problem. How do we calculate a mean in
these circumstances? It is quite simple. We assume that the data in each class is evenly
distributed and then represent each class by its mid-point. Table 12.3 shows the number
of nursery and primary school teachers by age. What is the mean age?
r Our first step is to look at the age groups. Most of the classes have a range of five years,
25–29 for example. Two have no limiting boundary, under 25 and over 60.
r The easiest classes to deal with are those with an age range of five years. What we do
is determine the mid-point of the class. The easiest way to do this is to add the upper
class boundary to the lower class boundary and then divide by two. So 30 plus 34
equals 64 and half of 64 is 32.
r Where the class does not have a value for class boundary we have to create one. At
the lower end, we assume that teachers start at age 21 and, at the upper end, that they
retire at 65.
r Our final step is to assume that every teacher has the age of the mid-point of the class.
We have to do this because we do not know from our table their actual ages. It will
give us the most reliable estimate. If we used the lower limit of the class we would
under-estimate the total ages of teachers in the class and if we used the upper limit we
would over-estimate the total ages. We then multiply the number of teachers in each
class by the mid-point of the class to give the total ages of teachers in the class.
r To calculate the mean age of teachers, we sum their ages (6,741,050) and divide this
by the total number of teachers (166,700). This gives a mean age of 40.44.
This method is quite simple and well within the capability of anyone aiming to do
research in education but sometimes the technique can get in the way of the implica-
tions of the result. This might be the case here. Let’s look at the data again. What stands
out? The number of nursery and primary teachers rises from first entrants (213,750 in
the under 25 group) to 780,800 in the 30–34 group then falls (629,000 in the 35–39
group) and then rises again, peaking in the 50–54 age group. The mean age actually
falls in the trough. So, in this case, the mean age (40.44) of teachers does not identify
the most usual teacher. This is something we need to watch out for. The mean is suscep-
tible to the influence of extreme values and to the character of the data distribution.
Our calculations are perfectly correct but the mean gives a better indication of typicality
when the pattern of the distribution falls off on the lower side of the mean much the
same as it does on the upper side, in other words when the distribution is symmetrical.
1 16 31 60 91
2 17 35 62 93
7 21 40 62 95
8 21 42 63 102
8 22 42 65 103
10 24 47 72 118
10 25 49 74
14 25 51 79
15 28 52 84
15 31 59 86
r Our first step is to find the class that contains the median teacher, that is, the 83,350th
teacher if they were all arranged in rank order of their ages. To do this it is easiest if we
calculate the cumulative frequency of our data set. That is, we add the numbers in a
class to the numbers in all preceding classes. This is shown in Table 12.5. The cumula-
tive frequency for the 25–29 group is the frequency for the group (30,100) plus the
frequencies of all preceding groups (9,500), a total of 36,900. The process continues
with all groups.
r The second step is to identify the category that contains the median teacher. There
are a total of 166,700 teachers, so the median teacher is number 83,350. The median
teacher, therefore, is found in the 40–44 age group (shown in blue). We know that
the median teacher is in this group because there are 98,000 teachers up to the age of
44 and only 81,000 up to the age of 39. Since 81,000 is less than 83,350 the median
teacher cannot be below 39-years-old and because 98,000 is greater than 83,350, the
median teacher cannot be older than 44.
r The third step is to identify where in the category the median item is. Here we again
assume that the data in the category are evenly distributed. What we have to deter-
mine is the difference between the cumulative frequency up to the median category
532 Research Methods for Education Part 3 The Process of Data Analysis
(81,000) and the median value (83,350). In our case this is 2,350. We then express
this as a percentage of the frequency in the median category (17,000) and multiply
this percentage by the range of the median category (5). In other words, we are saying
that if our median teacher is 2,350/17,000 per cent of the total in the group (actually
13.8%), the age of the median teacher is 13.8% of the range of the age group (13.8%
of 5), that is, 0.69. All we do is then add this lower age limit of the group (40) to give
a median age of 40.69. We can express this procedure in a fearsome-looking but quite
straightforward formula:
n
LV 1 2 CF
2
Median 5 ? r
f
LV 5 value of the lower limit of the median class
n 5 total number
CF 5 cumulative frequency of class below median class
f 5 frequency in median class
r 5 range of median class
In our example:
166,700
40 1 2 81,000
2 2,350
?r5 ? 5 5 40.69
17,000 17,000
r An alternative to the calculator method is to draw an ogive (see section 10.3 for the
method of how to do this) and read off the 50% value. The benefit of this approach
is that all the calculation can be done via a spreadsheet. Figure 12.2 shows an ogive
created via Calc, the spreadsheet program in the Open Office Suite. We can find the
median value, 50%, and read off the median age, about 40.7. This graphical method
is less precise than the calculation method but it is fast.
100.00
75.00
Cumulative frequency
Per cent
50.00
25.00
0.00
under 25 30–34 40–44 50–54 60 +
Age
Unlike the mean, the median is not affected by extreme values. Each value has the
same weight as every other value. What does this mean? If there were 40 items, the 1st
and the 40th have the same influence when we identify the median. If we were calculat-
ing the mean and the largest value (the 40th item) was an outlier that was three times
as large as the 39th value, it would exert an undue influence on the value of the mean.
The reason for this is that we are using two different measurement scales. With the
mean, we use an interval scale. With the median we are often using an ordinal scale
(that is, one based on rank order and not a standard unit of distance). Because this
is a weaker measurement scale (that is, it is less precise), the median is a less precise
estimator of central tendency. But against this, it is easy to calculate, is not distorted
to the same extent as a mean can be by extreme data values and allows us to rapidly
assess our data.
35,000
30,000
25,000
Numbers of teachers
20,000
15,000
10,000
5,000
0
under 25 25–29 30–34 35–39 40–44 45–49 50–54 55–59 Over 60
Age
We can also use the data on pupils taking free school meals (Table 12.4) to identify
the mode. This, however, involves us in an additional step, creating the class boundaries
that we use to represent the data. This is where we have to exercise our judgement, as we
shall see with the following examples.
Figure 12.4(a) shows the data with 11 classes. This gives two modal classes, 10–19
and 20–29 pupils per school, with two subsidiary peaks. If we reduce the number of
classes to 5 (Figure 12.4(b)), the modal class moves to 19 and under. Now if simply
changing the number of classes can do this to a statistical indicator, what can we do in
order to identify what the ‘right’ number of classes should be? This is actually a complex
mathematical area and clearly there has to be a relationship between the number of data
units and the number of classes (and, by implication the class interval) in order to show
sufficient detail (that is, neither suppress information nor swamp interpretation with
detail). There are some rules of thumb, however.
r Sturges’ rule states that the number of classes is given by the equation:
1 1 3.3 3 log n where n is the number of observation
Herbert Sturges, an American statistician, developed this rule of thumb in 1926.
r An alternative is to use the equation:
5 3 log n
This equation appears as custom and practice in some social sciences and appears
to have no formal or identifiable source.
Both of these formulations have been criticised because they have no basis in statis-
tical theory. Because of this it is better if we do not use them. The Rice rule (so called
because it was developed by staff in the Statistics Department at Rice University in the
Chapter 12 EXTRACTING INFORMATION FROM QUANTITATIVE DATA 535
Number of schools
5
0
19
29
39
49
59
69
79
89
99
9
0
10
<
to
to
to
to
to
to
to
to
to
>
10
20
30
40
50
60
70
80
90
Number of pupils
14
12
10
Number of schools
0
< 19 20 to 39 40 to 59 60 to 79 > 80
Number of pupils
Figure 12.4(b) Five data units
USA) seems to give results that approximate to more advanced rules that are beyond this
text to explain. The Rice formula is:
23 n where n is the number of data units
Table 12.6 shows the number of classes each method would generate for data sets
of different size. As the data sets increase in size, the Sturges model compacts the
536 Research Methods for Education Part 3 The Process of Data Analysis
12
10
Number of Schools
8
0
< 14 15 to 29 30 to 44 45 to 59 60 to 74 75 to 89 90 to 104 > 105
Number of Pupils
Rice 6 7 9 12
Sturges 6 7 8 9
5 ⴛ log 7 8 10 12
distribution, which will lead to a loss of detail. The 5 3 log n model may over-
estimate the ideal number of classes with small amounts of data. Both of these conclusions
assume that the Rice model gives more appropriate results. Figure 12.4(c) shows our
school meal data with 8 classes. This shows detail and gives a sense that the distribution
is more skewed than multi-modal.
1–8 5
9–15 5 1–15 10
16–21 4 16–29 9 1–16 11
22–25 4 30–51 9 17–40 12
26–40 5 52–76 9 41–68 12
41–51 5 77–118 9 69–118 11
52–62 5
63–76 4
77–91 4
92–118 5
interquartile range is the range from the bottom of the second quartile to the top of
the third quartile, that is, the quartiles immediately below and immediately above the
median. The interquartile range represents the 50% of the data set in the middle of the
distribution. We can, however, still be faced with problems.
From Table 12.7 column(c), we can estimate the interquartile range as 17 to 68
(a range of 52). But if we look at the actual values in our distribution they are 17 and
65, a range of 48. However, because we had to adjust the numbers in our class to take
account of the fact that 46 is not exactly divisible by four, even this interquartile range is
an over-estimate since it contains 24 out of 46 observations, that is, 52% and not 50%.
We can have a range of 50% of the data values if we use 23 of them. How do we do this?
The answer is to calculate where a theoretical bottom and upper value would be by tak-
ing 11.5 (half of 23) data units from a theoretical median (41) and adding 11.5, giving
19 to 64 (a range of 45).
The method we choose depends on the degree of accuracy we require. The ‘theo-
retical’ method is the best. However, we will often use the concept of interquartile
range to get a rough and ready impression of the data. Speed may be more important
than precision. However, it does not take a great deal of effort to create a more precise
measure.
going to affect the value of the overall deviation so we have to ‘remove’ the effect of size as
best we can. The best way to do this is to calculate the mean deviation for each data set. In
our case we divide the total deviation (10) by the number of data units (5). The answer is
2. If we had 50 units of data in our set we would divide the total deviation by 50. We call
this figure (the sum total of all the mean squared differences from the mean) the variance.
We have not finished, however, because there is a loose end in our mathematical ar-
gument. We used the device of squaring numbers to get rid of negative values. This has
the effect of overstating the value of the difference in the data set and we have to reverse
the effect. We do this by taking the square root of our mean difference: 2 5 1.414. This
is the standard deviation. The variance is the square of the standard deviation. (You will
come across this again in Chapter 13.)
We know sufficient about statistical calculation to be able to express this whole pro-
cess in a formula:
∑ ( x2x )
2
δ5
n
Where
␦ 5 standard deviation (sigma, lower case)
1 5 square root
x 5 a data unit
x 5 mean
Σ 5 sum of all (sigma, upper case)
n 5 the number of data units
An easier formulation of this formula if we calculate a standard deviation manually is:
25
∑x 2
2 x2
n
Height in cm Frequency
90 to 100 4
100 to 110 11
110 to 120 34
120 to 130 237
130 to 140 1,328
140 to 150 1,652
150 to 160 454
160 to 170 35
170 to 180 3
180 to 190 2
190 to 200 2
over 200
3,762
Table 12.9 Layout for calculating standard deviation from grouped data
Height in cm B C D E F
of group Mid-point Frequency B3C B2 (C 3 E)
The original data are in columns A and C. Column B shows a value of the mid-point
of each height category. Column D is the product of multiplying the mid-point of each
category by the frequency in the category. Column E is the square of the mid-point value
and Column F our estimate of the sum of squares, the product of multiplying the fre-
quency (Column C) with the square of the mid-point (Column E).
542 Research Methods for Education Part 3 The Process of Data Analysis
∑ fx ⎛ ∑ fx ⎞
2
2
2⎜ ⎟
∑f ⎝ ∑f ⎠
of 1. What was particularly important for statistics (and so for us) was that, in the nor-
mal distribution, it was further established that within this one standard deviation there
were 68% of the data units of the distribution. Just think about this and remember that,
in these early days, this was all to do with gambling and people were interested in the
chances of different outcomes occurring. This finding puts us in a very interesting posi-
tion, just at the point where we move from describing characteristics of shape to describ-
ing to judging likelihood, in other words, we are at the doorway that leads into the world
of probability. If we know that 68% of outcomes lie within one standard deviation, then
we also know that there is a 68% chance of one of those outcomes occurring.
Later in the century another important discovery was made which allowed this impor-
tant feature of the normal distribution to be extended to the real world. Let us imagine we
have a group of 30 children that we wish to sample to test their IQ. If we want our sample
size to be 5, there are 142,506 ways of combining 5 students out of the 30. Now the inter-
esting thing is that if we were to draw each of these samples and calculate the mean IQ of
the group, we would find that there were more combinations that would produce a mean
value at or close to the true mean and fewer combinations that would produce a mean that
was very different from the true mean. Visualise this for yourself, a peak of samples in the
centre close to the true mean and the number of samples falling away as we move away
from the true mean. In other words, it is a normal distribution. Let us summarise this. If
we draw all of the samples of a given size from a population, the distribution of the means
of these samples will be a normal distribution, irrespective of whether the parent popula-
tion followed a normal distribution. So the parent population need not be normal but the
distribution of sample means (the so-called sampling distribution) is always normal. This
has important implications for sampling, which are further developed in Appendix 1.
So how, then, do we interpret the standard deviations we have calculated for our data.
Let us look again at the data on the height of 10-year-old girls. We saw that the mean
height of girls was 141.21 cm with a standard deviation of 9.38 cm. This means that 68%
(remember this was established theoretically) of our population will lie within 9.38 cm
of our mean, that is within the range 131.83 cm to 150.59. (Actually it is not exactly 68%
but we do not need to be absolutely precise at this point).
But this is not all we can do to use our new knowledge about the standard deviation.
We can also estimate how confident we are in our estimation of the mean. Of course,
we can only do this if we have sampled our data, because if we were dealing with the
population we would be completely sure. And if we are not confident in the value of our
mean when we are working with the whole population, we are not very good at pressing
the keyboard of our calculator or computer. If we are dealing with a sample, we can
544 Research Methods for Education Part 3 The Process of Data Analysis
calculate a value called the standard error of the mean which will enable us to say within
what limits the true mean will lie. The formula for this is:
δ
Standard error of x 5
n
It is not important to understand how this formula is derived but the underlying princi-
ple is that it is the sampling distribution (the distribution of all possible samples of a given
size which we have just looked at) from which the sample is drawn that is important. So
for any sample statistic, there is a 68% chance that the true value lies within one standard
error. For statisticians, however, 68% is pretty hit and miss. Something more accurate is
usually required. Again, the characteristics of the normal distribution provide the solution.
If we double the standard deviation (2d), then we account for about 95% of the popula-
tion and if we treble it (3d), we account for about 99%. Again, these figures (developed
through statistical theory) are not precise but are sufficient for our purpose at present.
Let us examine this with our age data:
9.38
One standard error of x 5 5 6.54
3762
2 3 9.38
Two standard errors of x 5 5 13.08
3762
3 3 9.38
Three standard errors of x 5 5 19.62
3762
The sample mean height of 10-year-old-girls is 141.23 cm. Our best estimate of the
population mean is that it is:
r 141.21 6 6.54 (147.75 cm to 134.67 cm) with a 68% probability (one standard error).
r 141.21 6 13.08 (154.29 cm to 128.13 cm) with a 95% probability (two standard errors).
r 141.21 6 19.62 (160.83 cm to 121.59 cm) with a 99% probability (three standard errors).
Note that as the probability goes up the range within which the true mean sits in-
creases as well. If we think about it, this is what we would expect.
There is one other statistic that we should be aware of, the coefficient of variation. This
simple measure expresses the relationship between the standard deviation and the mean.
It is particularly useful when comparing different data sets because it allows us to express
the spread as a proportion of the mean. The coefficient of variation is given by the formula:
δ
Coefficient of variation 5
x
Because the coefficient of variation is standardised according to the size of the popula-
tion or sample, it measures the relative degree of spread. In our age data the coefficient is:
σ 9.38
5 5 0.07
x 141.21
Sometimes the coefficient is expressed as a percentage (in this case it would be 6%,
that is, the standard deviation is 6% of the mean). How might we use this statistic?
Imagine we were analysing assessment grades in academic courses. For each subject we
can calculate the mean score and the standard deviation and, for each, the coefficient of
variation. What would a comparison of these show? With a large number of students
taking each examination, we would hope to see the coefficients of variation being much
the same for each subject. If they were not, this might be an indication that some subjects
Chapter 12 EXTRACTING INFORMATION FROM QUANTITATIVE DATA 545
had an assessment template that encouraged full use of the marking scale and others did
not. We could calculate the coefficient of variation for each examiner in a subject and, as
quality control, identify who was out of line.
Q1 Q2 Md Q3 Q4
(a) Symmetrical distribution
Q1 Q2 Md Q3 Q4
(b) Distribution skewed left
Q2 is the same as Q3. In each of these quartiles there are 25% of the data values. In Figure 12.7
(b) the distribution is skewed left. Each quartile still has 25% of the data values but the
range of Q2 is less than Q3. From this we can devise some rules:
r If the range of Q1 1 Q2 is less than Q3 1 Q4, there is a positive skew.
r If the range of Q1 1 Q2 is greater than Q3 1 Q4, the skew is negative.
r If the range of Q2 is less than Q3 the skew is positive.
r If the range of Q2 is greater than Q3, the skew is negative.
While these relationships describe the skew, they do not actually measure it. For this,
we have to use the mean and standard deviation.
n ( x − x )3
?∑
(n − 1) (n − 2) σ
Where n is number of data units
x is an individual unit of data
x is the mean of the data set
This formula is quite straightforward and, fortunately for us, it is also the one used by
common spreadsheet programs such as MS Excel and Open Office Calc. A symmetrical
distribution has a value at or near 0. The direction of skew is indicated by positive and
negative results.
For many advanced data analyses it is important that we demonstrate that our data are
normally distributed, that is, we demonstrate that skew is not present to any significant de-
gree. So if our statistical analysis requires normality, we should be able to demonstrate it.
Positive kurtosis
Normal
Negative kurtosis
10.6 1.025
15.2 1.182
16.0 1.204
16.1 1.207
16.2 1.21
16.7 1.223
16.9 1.228
17.1 1.233
18.1 1.258
18.2 1.26
18.7 1.272
19.4 1.288
19.6 1.292
19.7 1.294
20.4 1.31
21.6 1.334
25.9 1.413
28.6 1.456
One of the more common transformations that we use is not to base our calculations
on the absolute values of the data, but to use the logarithms of those values. The effect of
this is to reduce the importance of differences between the values of extreme data units.
It will, therefore, reduce skew but not kurtosis. This is better than nothing.
Table 12.10 shows the effect of this transformation on a data set consisting of the
body mass index of a group of children. The effect of using the logarithm of the data
rather than the base data is effectively to normalise the distribution in terms of skew. It
marginally increases the kurtosis measure. We would, in this case, be advised to use the
transformed data in any subsequent analysis exploring causes or consequences.
In 2004 two American researchers criticised an earlier study that looked at the relation-
ship between educational attainment, school size and socioeconomic status (Howley, C.
and Howley A., 2004). At the heart of their critique was the existence of skewed distribution
of schools by size. The author of the original study wrote a response (Lee, 2004), noting that
‘school size is rarely normally distributed. Rather, it is positively skewed, with a long right-
hand tail’ (p. 7) and commented that ‘Many other studies of school size have used school
size (or grade cohort size or even school district size) without correcting for the non-normal
distribution’ (p. 8). The original study used another approach to cope with skew, namely
put schools into classes, numbered the classes sequentially and then used the class numbers
as variables in a statistical analysis. The issue debated by these authors is not just technical
Chapter 12 EXTRACTING INFORMATION FROM QUANTITATIVE DATA 549
This activity is an opportunity to practise some of the modal classes? Comparing the means and medi-
methods of describing the characteristics of a data ans, is there any suggestion that the distributions
set. The data we shall use (see table below) are taken are skewed?
from Statistics Norway (http://www.ssb.no/english/ 3. Calculate the standard deviation of the data for
subjects/04/02/utlaerer_en/) and show the total num- 1992, 1995, 1999 and 2003 by hand. Are the re-
ber of teachers of given ages each year over a 12-year sults much the same for each year? Then enter
period. the data into a spreadsheet and repeat the calcu-
lation using the spreadsheet. Are the results the
1. Identify the modal class for the total number of same?
teachers from 1992 to 2003. Is it always the same? 4. What is the standard error of the mean for the
If not, can you suggest why it shifts? four years? Write down what the standard error
2. Calculate the mean and median ages for teach- implies for any one of your years. Are the results
ers in 1992, 1995, 1999 and 2003. Do the values much the same for the four years?
support your interpretation of the data based on
Age
Year Total –29 years 30–39 years 40–49 years 50–59 years 60– years
because the way in which non-normal data are analysed, as raw or transformed data, may
well affect the results and the analysis may well have educational or financial implications.
Now that we have looked at these descriptive approaches, test out how they might be
used in Activity 12.1.
look at the outcomes over time of policy initiatives or trends in behaviour such as smok-
ing and teenage pregnancy, or pupil attainment. We are interested in how things change
over time and whether the change has a pattern to it. In this section, we will consider
how we can describe and determine patterns in a time series. Table 12.11 presents data
showing the number of births in England and Wales over the four quarters of the year
from 1996 to 2006. As we know this data can be represented as a graph. Figure 12.9 is a
graph of this data, drawn using Calc in the Open Office Suite. There is a pattern to this
data that seems to reproduce itself over a short period. This is referred to as its periodicity
and we can see it in Figure 12.9 as the line peaks every fourth quarter. In addition, suc-
cessive peaks are lower than the ones preceding them for about half the period, then the
opposite happens (the change occurs around 2001). The same is true with the bottoms
of the troughs. This suggests another periodicity over a longer period. What can we do
to see whether our eyes are deceiving us and to emphasise the periodicity of the data?
Source: Birth statistics: Review of the Registrar General on births and patterns of family
building in England and Wales, 2006, Office for National Statistics, Table 2.1.
180.0
160.0
140.0
120.0
100.0
Quarter years
A simple yet often effective analysis is to calculate a moving average (also known as
a running mean). This is a straightforward process. If we look at the data for 1996 in
Table 12.11 we can demonstrate the method with a basic three period moving average.
We would normally start with three period set because this has the effect of smoothing
the pattern without distorting the shape. The calculation is to look at three time periods,
total up the number of live births over these periods, find the average for the three peri-
ods and use this as the value for the middle period. Thus:
Period ending
1996
March 157.3
∑x
June 158.1 5 161.8
3 ∑x
September 169.9 5 164.1
December 164.2 3
We add together March, June and September totals and divide by three to give a
revised June figure (161.8). Then we add June, September and December and divide by
three to give a revised September figure. We continue this for all of our periods and then
plot our revised figures.
We can do this calculation for any length of period. Figure 12.10 shows the results for
a moving average over three quarters (black) and over five quarters (light blue) plotted
against the raw data (blue). Note the difference between Figures 12.9 and 12.10 – the
y scale in Figure 12.9 has been shortened deliberately to exaggerate the vertical difference.
This has been done to demonstrate the following points:
r The five period moving average reduces the peaks and troughs and brings out the
long period trend.
175
165
155
145
135
125
Quarters
Figure 12.10 Moving average of live births in England and Wales 1996–2006
552 Research Methods for Education Part 3 The Process of Data Analysis
r The three period moving average suppresses the marked peaks and troughs but re-
tains the shape of the distribution (for example, peaks usually coincide with peaks on
the original data and troughs with the troughs).
r This is not the case with the five period moving average where it sometimes peaks at
points where the original data set has a trough. This phenomenon is referred to as
phase shift.
This analysis confirms what we suspected when we first looked at the data. It becomes
particularly useful when we are dealing with a great many data plots where the seasonal
periodicity (or short term periodicity) can be masked by influences that are more pro-
nounced than the trend but which occur on a random basis.
If we want to explore the periodicity of the data in more detail, we can look at the dif-
ference between the moving average and the original data. The difference is called the
residual. If the original data is greater than the moving average, the residual is positive
and if it is smaller then the residual is negative. Why would we do this? We have to un-
derstand that what we are doing with our moving average is to emphasise the trend.
Sometimes, however, it is what happens around a point in time that is equally interest-
ing. If a head teacher has a long run of data on annual school enrolments, this can
be used to establish a long run trend, even to the extent of predicting enrolment for
the forthcoming year. If the school then introduces a marketing campaign, there is a
baseline against which to judge its effect. Figure 12.11 is a plot of the residuals between
the base data and the five period moving average. Two things stand out. The peaks of the re-
siduals are broadly the same and the bottoms of the troughs are broadly the same. From this,
we can deduce that the seasonal effect of birth rates that are higher than the trend in late sum-
mer is 5,000 to 6,000 and of lower birth rates than the trend is 6,000 to 8,000 in late winter.
If we wanted to, we could make a ‘seasonal adjustment’ by adding or removing these figures.
The moving average is an easy introduction to time series analysis and it shows us
how we can go about identifying if there are different periodicities. We can see that there
are periodicities in the data of Table 12.11, one short about six months and the other
15
10
–5
–10
Quarters
long over 20 years. We have seen that we can identify tipping points and cycles from peak
to peak. This distance between peaks is called the wavelength and it is the conceptual link
between our analysis and more advanced approaches.
How can we improve on our descriptive analysis? There are three ways of making our
exploration of time series more sophisticated. We shall outline them here but operation-
alisation is best done with a computer package.
1. Weighted moving averages: Our moving average treated each data unit as having
the same weight. For example in a three period moving average, each period has a
notional weight of one. We multiply this weight by the period total (which does not
change the total) before adding the three years together and dividing by three. We can,
however, apply a weighting function that gives each unit of data a different weight. We
do this particularly if we are smoothing over long time periods (say five periods or more)
to prevent ‘phase shift’ (we met this idea above) when a downturn in the actual data is
converted to an upturn in the smoothed data or vice versa. Our object with applying a
weighting function is to make the underpinning pattern in our data clearer. The usual
approach is to weight values according to their distance from the central value, with the
weighting decreasing as we move away from the central point. In our five period moving
average, for example, instead of weighting each data unit as 1, we could allocate weights
of 1/10 to the first and fifth values, 1/5 to the second and fourth and 2/5 to the third,
a ratio of 4:2:1 from the centre to the extremes. Table 12.12 shows how this is done.
The data is taken from Table 12.11. The base data are the actual totals for each quarter.
The weights indicate the value that is to be plotted. For example, .4 of 169.9 is 68.0
(rounded to one decimal place) and then we calculate the indicated proportions of the
other periods. Note that the weights add up to 1. Column 3 is the base data weighted
and column 4 is the sum of all the weighted data shown as plotted for the third period.
There are many sophisticated exponential smoothing functions we can use as well.
2. Data decomposition: These methods start with the idea that there are mathemati-
cal equations that can describe the data sets. The object of this approach is to extract the
underlying pattern, often with the object of predicting future states. In broad terms the
equations fall into two groups:
r the first, where the elements of the equations are added together:
Time series 5 trend 1 seasonal effect 1 random effect
r the second, where the elements are multiplied together:
Time series 5 trend 3 seasonal effect 3 random effect
The object of this analysis is to find an equation that best describes a pattern through
time (often with the hidden assumption that it can be used for forward planning). This
157.3 .1 15.7
158.1 .2 31.6
169.9 .4 68.0 163.9
164.2 .2 32.8
158.1 .1 15.8
554 Research Methods for Education Part 3 The Process of Data Analysis
approach makes use of the advanced techniques in regression and correlation that we
shall meet in the next section.
3. Spectral analysis: This suite of methods starts from the assumption that any time
series is composed of a set of different curves. Our moving average analysis, for example,
suggested a six month and a 201 year periodicity. We can illustrate the principle of spec-
tral analysis using a made up example. Figure 12.12(a) shows a time series. It does not
matter what the data might represent, just that there are a pattern of data over time. We
can see that the data line follows the 150 value apart from a slump in the middle of the
series. Now let us see how Figure 12.12(a) was constructed. Look now at Figure 12.12(b).
This shows the data that were used to construct Figure 12.12(a). The bottom data set
shows a periodicity of about 12 units of time. The middle curve shows a periodicity of
24 units of time and the top line shows a periodicity of 6 units of time. These periodici-
ties represent the wavelength. In addition, we should note that the heights of the curves
200
150
Data
100
50
0
1 3 5 7 9 11 13 15 17 19 21 23 25
Time
Series1
120
100
80
Data
60
40
20
0
1 3 5 7 9 11 13 15 17 19 21 23 25
Time
Series1
Series2
Series3
(that is, from the trough to the peak) are different. This is called the amplitude of the curve.
With this example we have started with some made up data and combined them to cre-
ate the data set in Figure 12.12(a). Spectral analysis starts with the data set and then tries
to find the set of curves that will best describe the data. It decomposes the time series
into a set of curves that differ in (a) wavelength (the distance between the peaks) and
(b) amplitude (the vertical distance from the peak to the trough). The object is to identify
the waves that, when added together, account for most of the variability in the data set.
With this approach we can see that we not only get a very good description of the data
but also a powerful technique for prediction.
The intention of all of these approaches to time series analysis is to find a good way
of describing the pattern in the data and identifying explanations that can be associated
with the descriptions. Sometimes these explanations take the form of external influences
(what, for example, might affect the six month periodicity in births?), while others con-
stitute a feedback loop within the data itself (girls born in year 0 are likely to produce
children in 20 to 30 years’ time). Case Study 12.1 shows a time series analysis that threw
up some interesting questions.
education before someone can enter the academic job The implication of this work for educational research-
market. Allow for the need to satisfy demand over a ers is that our methods of analysis of quantitative data
period of time (four to ten years is not unreasonable) may confine us to short and medium term interpreta-
and the 13 to 18 year cycle is found to be the period tions and that within at least some of our data sets
necessary for supply processes to meet demand. And there may be lurking deeper processes that we need to
this process became more apparent in the twentieth understand if our attempts at modelling and predict-
century because it was fuelled by the increasing social ing situations are to be effective.
mobility that led to a growth in student numbers.
40
30
Use of AV/IT
20
10
0
0 20 40 60 80
Age
Use of AV/IT in week
r Where the lines cross is the median point for the cluster.
r Last, join the two points together for the regression line.
Regression line
Data points
355.0
335.0
315.0
295.0
275.0
255.0
235.0
215.0
195.0
175.0
5.0 10.0 15.0 20.0 25.0 30.0
Deprivation measure
11, shown in Figure 12.16(a), where the increases in both are positive and 21, shown
in Figure 12.16 (c) where an increase in one is related to a decrease in the other. If the
range of the coefficient is 11 to 21, what happens at 0? This is shown in Figure 12.16(b),
which shows a random scatter of data with no relationship between the variables.
With this framework to help us, the correlation coefficient (known as r) for the data in
(c) Negative
Figure 12.15 is –0.45. Again, the correlation coefficient was calculated using a standard
spreadsheet program. What does the figure tell us? First that it is an inverse relationship
(as deprivation increases, academic performance decreases) and second, that there is a
spread of data around the predicted values of y, that is, y values are not always the same
proportion of x. This clearly is a far more typical situation for the sort of data that we
shall meet in our educational research and there are good reasons for it. We cannot pre-
dict exactly how a change in the level of deprivation will affect the academic performance
of every child in an area because:
r The measure of deprivation is a blunt measure and not specific to any one child.
r Some families cope better on low incomes than others and some on higher incomes
are less concerned with their child’s education than others in the same situation.
r Children themselves are not all the same. They have different skills and capabilities,
not all of which are reflected in academic testing. And on top of this some may be just
going down with an illness on the day of a test and others may be just plain grumpy
and not in the right mood.
These sorts of circumstance apply to most of the data that we meet in education so it
will be a rare situation that one of our correlation coefficients gets close to 1 or 21. Far
more typical is the –0.45 that we obtained in Figure 12.15 but even here we can see that
there is a clear association between the two variables. We will see how to understand
what a correlation coefficient is telling us in Chapter 13 when we meet something called
the coefficient of determination.
When we use a spreadsheet to calculate a correlation coefficient we use interval
data. But as we know, there are circumstances in which the data we have are ranked.
Fortunately, we can also calculate a correlation coefficient with ranked data. The
method of calculation is quite different from the one in spreadsheets (usually this is
the Pearson coefficient, named after the statistician who developed it). When we have
ranked data we calculate a Spearman coefficient (developed by an English psycholo-
gist, Charles Spearman). The interpretation of the correlation coefficient is, however,
exactly the same. Table 12.13 shows the rankings of countries on the basis of their
PISA mathematics test results for state and independent private schools. The calcula-
tion is straightforward:
r Set out the table with the area (or person or organisation) ranked on the two variables.
r Square the difference in rank values (for example (1 2 11)2 5 100.
r Sum the squared differences: 406.
r Substitute this in the formula:
6 ∑ d2
12 where d2 is the rank difference squared
n3 2 n
n is the number of pairs of data
6,406
51 2
10,648 2 22
2,436
51 2
10,626
51 2 .229 5 .771
The correlation coefficient (in this case identified as rs (after Spearman) is 1.771.
This is a strong correlation and suggests that when mathematics teaching is good in
560 Research Methods for Education Part 3 The Process of Data Analysis
Table 12.13 Rankings for 2003 PISA mathematics tests in state and
private schools in 22 countries
1 11 100
2 1 1
3 9 36
4 9 25
5 2 9
6 3 9
7 4 9
8 5 9
9 6 9
10 7 9
11 7 16
12 14 4
13 18 35
14 12 16
15 22 49
16 16 0
17 13 16
18 19 1
19 21 4
20 17 9
21 15 36
22 20 4
406
private schools in a country, it is also likely to be good in state schools in that country.
The implication of this may be that mathematics is more embedded in the culture (or
parental expectations) of some countries than others and so more highly regarded.
While the calculation of the Spearman coefficient is easy, with large data sets it can
become a chore. Fortunately there are a number of online calculators available on the
Internet.
This section acts as an important link between the data we have to work with and the
final conclusions we draw in relation to our research questions. The approaches to data
analysis that we have looked at are usually known as descriptive techniques and we
Chapter 12 EXTRACTING INFORMATION FROM QUANTITATIVE DATA 561
use them to produce a descriptive analysis. This is not so different from what we do in
qualitative analysis. We can use these descriptive analyses in two ways which reflect our
approach to conducting our research enquiry.
r We can come to a data set with no (or at least few) pre-conceptions of what we will
find or what we should be looking for. Our descriptive analysis will show us what is
usual and what is unusual. This could be the start of identifying our research ques-
tion. Why does this area have student outcomes that are so different from other simi-
lar areas? Why is student satisfaction so much better in that department?
r We can come expecting to find particular differences, for example, why boys’ attain-
ment is lower than girls’, why the academic attainment of some ethnic groups is
greater than others. These positions are established by previous research or theoreti-
cal analysis. We want to test them as propositions and so construct hypotheses.
Whichever approach we adopt, inductive or deductive, descriptive analysis can help us.
Summary
t We can use numerical statements to (a) summarise the appearance of our data and
(b) summarise relationships in our data.
t We can describe central tendency (using mean, mode and median), spread (using
range and deviation), skew and peakedness.
t In terms of relationships, we have looked at time series, regression and correlation
(and regression and correlation also say something about the appearance of the
data).
t Different types of data (interval data and data counts) require different approaches
to calculating descriptive measures and give us much more flexibility in selecting
data for analysis.
t Every method that we have considered is straightforward and easy to understand.
This is significant given many people’s predisposition to give up on numeracy. All
the techniques are within the capability of every educational researcher.
t In many instances the laborious computational work is taken out of our hands when
we put our data into spreadsheets. All we have to do is understand the methods
conceptually.
t Overall, in this chapter we have added significantly to the toolbox of techniques at
our disposal. We may not use them in every investigation but they are there for us
when we need them. Just as important as this is that these techniques are a step-
ping-stone to even more powerful ways of looking at data. We shall look at these in
the next section.
Further reading
Salkind, N.J. (2008) Statistics for People (Who Think) They This easy-to-understand book covers the material we
Hate Statistics, Sage, London. have looked at in this section. Its attraction is that it
makes use of spreadsheets to do the calculations.
562 Research Methods for Education Part 3 The Process of Data Analysis
References
Anghileri, J., Beishvizen, M. and van Putten, K. (2002) Howley’, Education Policy Analysis Archives, 12(53).
‘From Informal Strategies to Structured Procedures: Retrieved [9/10/2012] from http://epaa.asu.edu/
Mind the Gap’, Educational Studies in Mathematics, epaa/v12n53/.
49(2), 149–170. Meadows, P. and Roegger, D. (2005) ‘Low Income
Counahan, M., Andrews, R., Buttner, P., Byrnes, G. and Homeowners in Britain: Descriptive Analysis’,
Speare, R. (2004) ‘Head Lice Prevalence in Primary Research Report No. 251, Department for Work and
Schools in Victoria, Australia’, Paediatrics and Child Pensions, HMSO.
Health, 40(11), 616–619. Mueller-Benedict, V. (2000) ‘Confirming Long Waves in
Falagas, M., Matthaion, D., Rafailidis, P., Panos, G. and Time Series of German Student Populations 1830–
Pappas, G. (2008) ‘Worldwide Prevalence of Head 1990 Using Filter Techniques and Spectral Filter
Lice’, Emerging Infectious Diseases, 14(9), available at Techniques and Spectral Analysis’, Historical Social
http://www.cdc.gov/EID/content/14/9/1493.htm (ac- Research, 25(3/4), 36–56.
cessed September 2008). Nevill, S. and. Chen, X. (2007) ‘The Path Through
Hossain, D., Gorman, D., Williams-Mozel, J.and Darlene Graduate School: A Longitudinal Examination
Garvey (2008) ’Bridging the Gap: Identifying Needs 10 Years After Bachelor’s Degree’, U.S. Department of
and Aspirations of Indigenous Students to Facilitate Education Institute of Education Science.
their Entry into University’, Australian Journal of Indig- Smith, P., Talamelli, L., Cowie, H., Naylor, P. and Char-
enous Education, 37, 9-17. ham, P. (2004) ‘Profiles of Non-Victims, Escaped
Howley, C. B. and Howley, A.A. (2004, September 24) Victims, Continuing Victims and New Victims of
’School Size and the Influence of Socioeconomic School Bullying’, British Journal of Educational Psychol-
Status on Student Achievement: Confronting the ogy, 74(4), 565–581.
Threat of Size Bias in National Data Sets’, Education Willems, S., Lapeere, H., Haedens, N., Pastels, I., Naegert,
Policy Analysis Archives, 12(52). Retrieved [9/10/2012] J.-M. and De Maeseneer, J. (2005) ‘The Importance of
from http://epaa.asu.edu/epaa/v12n52/. Socio-economic Status and Individual Characteristics
Lee, V.E. (2004, September 24) ‘Effects of High-school on the Prevalence of Head Lice in School Children’,
Size on Student Outcomes: Response to Howley and European Journal of Dermatology, 15(5), 387–392.
This page intentionally left bank
Chapter contents
Introduction 565
Summary 617
Further reading 618
References 618
Chapter 13
Learning themes
t The principles underpinning a statistical t Understand the principle of standardising
test of significance. data and be able to read a Z table and
calculate and interpret a Z statistic.
t The types of question we can use statistical
tests to answer. t Know how to select a statistical test and
be able to justify a selection.
t The factors we have to take into account
in choosing a test. t Understand the basis on which a range of
statistical tests work, the conditions those
t How to interpret test results.
tests require, be able to apply the tests
By the end of this section you will: and derive and interpret the test statistic.
t Understand the principles of statistical t Appreciate that complex data sets require
testing and the concepts of statistical sig- advanced statistical techniques and be
nificance and impact/effect. aware of those techniques.
Introduction
This section will help us reach a judgement subject, our likes and dislikes, our beliefs of
about our research evidence. If we stop to what is right and what is wrong, our sense of
think about the process of making judge- the boundaries between what is acceptable
ments, for example, how we judge some and what is unacceptable. In other words,
things better, some things worse, some peo- our rational assessment is fed through an
ple good and some bad or some accused emotional interface to produce our judge-
guilty and some innocent, we probably imag- ment and our judgement is a very personal
ine a process of looking at the evidence, bal- thing. Our goal in this section is to remove
ancing the pros and cons and then reaching the subjective element from the decision
a decision. Our decision, however, is likely making through probability-based statistical
to be influenced by our feelings about the analysis.
566 Research Methods for Education Part 3 The Process of Data Analysis
point do we begin to suspect that there might be two groups and at what point are we sure
that there are two groups? Putting it another way, at what point does the difference between
the numbers become significant? This question is at the heart of this section and, as we shall
see, we can use statistical tests to help us reach a conclusion.
But what is a statistical test? The answer to this question is that it is a procedure we fol-
low to help us reach a decision about our data. We use a test to establish whether things
are the same or different (for example, whether any difference in the student outcomes
from mixed or single sex schools is important enough for us to change the character of
the way we educate children) or whether things are related to each other (for example,
does improving the ratio of teachers to students improve outcomes for students?). At
the core of the procedure is an estimation of probability; the probability, for example,
that the improved academic performance of girls in a group of single sex schools is to be
found in every girls’ school. With probabilities we can be more certain about our judge-
ments. If the performance of girls in 90% of single sex schools were better than girls’
performance in mixed sex schools, most people would find that figure pretty convincing,
some might even say that it is a significant difference. And this is what a statistical test
does; it helps us establish the significance of our findings.
There is, however, a caveat to this. There are other statistical distributions that can
and are used for statistical testing but the underpinning principles of a statistical test are
pretty much the same. All we have to know is which distribution to use, which is effec-
tively the same as saying which test we should use. So, as we progress the argument, bear
in mind this discussion. The world is often more complex than when it is first presented,
but if it is not presented simply at first, it is often difficult to grasp an idea and then move
forward with it to understand more complex issues.
34.1435% of values
x Z=1
Mean
The next step may not seem to have much point now but it connects with something we
learnt in Chapter 12. What we have to do is understand a characteristic of the normal distri-
bution and we need to do this because it is the link to understanding the idea of statistical
testing. We can measure along the horizontal axis and, for each unit we measure, we can
calculate the proportion of the distribution between the origin (the centre of the distribu-
tion where the x and y axes intersect) and any point. In other words, for every miniscule
distance we travel along the x axis, we can measure the proportion of the distribution
between that point and the origin (the mean). From this we could produce a table of meas-
urements for each unit of distance along the x axis. Welcome to the Z table! The Z table, also
called the normal distribution function, shows the proportion of the normal distribution for
a series of standard units from the mean. Figure 13.1 shows this. Moving from the mean
to point (a) shows the proportion of the population or sample in the area under the curve
between the mean and point (a). Figure 13.1 shows this proportion as 34.1435% of all the
data in the distribution. Table 13.1 is an extract from a Z table. The Z values are given in the
first column (these are notional units of distance along the x axis). In the second column
are the proportions of the distribution between the mid-point and values of Z and in the
third column are the proportions between values of Z and the extreme of the distribution.
Stay with the argument for a moment longer and all will become clearer. Let us look
at three values of Z. When Z is 1.0, 34.1435% of values are accounted for. When Z is
2.0, the proportion is 47.725% and when it is 3.0, the proportion is 49.865%. These
.1 3.9828 46.0172
.2 7.9260 42.0740
.3 11.7911 38.2089
.4 15.5422 34.4578
.5 19.1462 30.8538
.6 22.5747 27.4253
.7 25.8036 24.1964
.8 28.8145 21.1855
.9 31.5940 18.4060
1.0 34.1435 15.8565
1.2 38.4930 11.5070
1.4 41.9243 8.0757
1.6 44.5201 5.4799
1.8 46.4070 3.5930
2.0 47.7250 2.2750
2.2 48.6097 1.3903
2.4 49.1802 0.8198
2.6 49.5339 0.4661
2.8 49.7445 0.2555
3.0 49.8650 0.1350
570 Research Methods for Education Part 3 The Process of Data Analysis
proportions are only half the distribution, the right-hand side. For each value of Z, there
is an equivalent proportion on the left-hand side. If we add the proportions from the
right and left sides together, then either side of the mid-point there are:
68.29% of values when Z is 1.0
95.55% of values when Z is 2.0
99.93% of values when Z is 3.0
At this point, something might strike you. These percentages are virtually the same as the
proportions we saw in Chapter 12 that were 1, 2 and 3 standard deviations from the mean.
Actually, in Chapter 12 we only used approximate values and the values above are the actual
ones. Remember this while we go on to look at another way of looking at the Z distribution.
As well, for each of our values of Z, we can show the percentage of the values lying
beyond any point a. Thus, for example, when Z is 1.0, 15.865% of the values are greater
than Z. These values are given in column three of Table 13.1. When we look closely at
columns two and three, we will see something that should strike us as so obvious that we
will wonder why we have not spotted it before. Each pair of values sums to 50%. This is
not surprising because the two together are exactly half of the distribution.
The question now for us is, ‘What can we do with our research data so that we can make
use of the Z table and the fact that in a normal distribution a given proportion of all the data
values will lie within a specified distance of the mid-point?’ Let us go to the next section.
Let us just consider what this formula is doing. We are subtracting the mean from any
unit of data and expressing the result as a proportion of the distribution’s standard devia-
tion. Z is, effectively, units of a standard deviation. Since one standard deviation is always
68.29% of a normal distribution (we know this from the theoretical work done in rela-
tion to gambling almost 300 years ago), we have put our data into a form where it can be
directly compared with a normal distribution. Let us return to our two questions on the
consequences of allocating children to the closest school. If our mean travel distance for all
children is 2.7 km, the standard deviation 3.5 km and the question is, ‘What is the prob-
ability of a child living more than 2 km from the school?’, the Z value of our data unit is:
2.7 2 2
5 0.2
3.5
If we look at column three of Table 13.1 (column three because our question was the
probability of a child living more than 2 km from school) we see that the proportion
living beyond 2 km is 42.0740. However, we have to remember that this table only gives
us half of the distribution (look at Figure 13.1 again to see what we mean), so we have to
double our figure to give 84.148. Therefore the probability is 0.842 (the proportion has
been rounded to three decimal places).
For the second question, we have to work the table in reverse and find the Z value
equivalent to our proportions. Table 13.1 is a little crude for this but we can make a good
stab at it. Let us take 70% as an example. In Table 13.1 we can look for 70% in column
two (column two because that gives the proportion of the distribution between the mean
and Z and it is this that will enable us to determine the distance). However, look as we
might, we won’t find it. Until we realise that, again, we have to deal with both sides of
the distribution. In fact, what we have to look for is 35% (half of the 70% in half of the
distribution). We see in the table that Z 5 1 gives us 34.1435% and Z 5 1.2 gives us
38.4930. Our best guess from this table is that Z is 1.05 or thereabouts. Let us now put
these figures into our formula and see what we get:
2.7 2 x
5 1.05
3.5
2.7 2 x 5 1.05 3 3.5
2 x 5 3.675 1 2.7
5 6.375
So our answer is that if we allocate 70% of children to their nearest school, the fur-
thest any one of them will travel is 6.375 km (we can ignore the ‘–’ sign above because
the side of the distribution we look at does not matter in this instance).
If you have sharp eyes you may have noticed something different about the equations
we have used here. It is a technical matter but if we do not explain it, you could be con-
fused when you read quantitative research paper and books. Some terms in this formula
are different. For example, we have used S to stand for standard deviation, rather than d
that we have used before. Why? The answer lies in convention. Statisticians use different
symbols to signify values derived from a population and values derived from a sample.
The formula for Z set out above is the formula that denotes the data is a sample of the
population. If we were dealing with data for the population as a whole, we would write
the formula:
x 2m
Z5 where μ 5 mean
δ
␦ 5 standard deviation
572 Research Methods for Education Part 3 The Process of Data Analysis
We have actually started to do something interesting in this part of the section and
that is to use the characteristics of the normal distribution in our analysis and to inform
our conclusions. Let us see what else we can do with it.
87 2 49
Best 1.959
19.4
28 2 49
Worst 21.082
19.4
Now let us think about these Z values. Table 13.1 shows that when Z is 1.959, only
about 2.3% of the population exceeds this score (the right-hand column). A Z value of
0.82 (the worst student) is exceeded by about 85% of the population. The teacher was
pleased with these results because their worst student is better than the worst nationally
and their third best student is in the top 3% of students nationally. The evidence is that
their pupils, on the whole, perform somewhat better than the rest of the population. The
issue is whether their students perform significantly better than the national student set.
We cannot quite answer this yet but we are getting closer to an answer.
The teacher also calculated the mean mark for all their students. This was 60.35. They
calculated the Z value as:
60.35 2 49
5 0.5851
19.4
With this Z value, about 72% of all the values in a normal distribution fall below it.
(In Table 13.1 this is given by taking the value in the right-hand column from 100% or
Chapter 13 USING STATISTICS TO SAY SOMETHING SIGNIFICANT 573
adding 50% to the value in the left-hand column.) Most people would find this result
pretty convincing: 50% of the teacher’s students got below a mark of 60.35 and nation-
ally about 72% got below this mark.
Now there is a different way of looking at this value of 72% and that is to say that in
72% of all national tests, the national mean value will be lower than the teacher’s class
mean. Or, putting it another way, only in 28 years in 100 will a national test average
exceed theirs. This is starting to sound interesting because it is expressing the result in a
way that brings out the significance of the difference.
We have not yet quite reached the point of implementing a significance test (we
look at these from section 13.3 onwards) nor yet pinned down what we actually
mean by ‘significance’ in a research context (we look at this in section 13.1(vi)) but
we have seen the principles underpinning the idea of significance testing. This, quite
simply, is to:
t establish a statistical distribution as a model;
t examine the research data in the context of the model;
t use the distributional characteristics of the model to make probabilistic statements
about the data.
And while we have used the Z distribution to give us insights into the idea of testing,
we should not ignore the fact that creating Z values for our survey data can be used to
answer important research questions.
Finally, in this part, we should point out that there are several websites with calcula-
tions for Z that will remove the need for calculating values by hand and looking up the
results in tables. How easy can it get!
t First, we have to go back and examine this idea of association and add a caveat to the
conclusion above. If we seek to establish through a statistical test whether two sam-
ples (such as our two groups of students who took the science test) are different from
each other, we assume that if we show that they are not different, that they are (close
to being) the same. Very often we call the tests that do this ‘tests of association’. But
in this case what is being tested is whether data are drawn from the same population
or whether they represent two different populations, not whether there is causation.
Association has different implications according to the circumstance in which we use
it, so we have to be careful how we use it.
t Second, we have to be specific about the actual hypothesis that we are testing when
we use statistical techniques. The reason for this is that we are working within a pos-
itivist framework. (To refresh our minds of what the implications of this are, we
should read Chapter 2 again.) One thing we have to note is that our hypothesis, the
one we actually test, has to be constructed as a null hypothesis. (Look at Chapter 3
for details on this.) The hypothesis we seek to test is one of no significant difference
between the variables.
t Third, if we reject the null hypothesis, we should have ready an alternative hypoth-
esis. This is often called the research hypothesis – and it is the one we are really
interested in. The way we phrase the research hypothesis has important implica-
tions for the procedures we follow with statistical tests. If our null hypothesis is that
‘there is no difference between A and B’, our research hypothesis could be one of the
following:
Chapter 13 USING STATISTICS TO SAY SOMETHING SIGNIFICANT 575
If we express the likelihood of something (that these children are better at mathemat-
ics than those because of the way they were taught) in probability terms (the mean score
of children taught this way is 15% higher than those taught using another approach),
then we can reasonably ask the question, ‘Is this difference significant?’ To understand
how statisticians have reasoned this question (we say ‘reasoned’ because the issue is one
of logic and not numeracy), we have to go back to our hypothesis, which is, as we know,
a null hypothesis. In this case our null hypothesis would be something like, ‘There is no
difference between the two groups that have been taught in different ways’. The question
we ask ourselves is, ‘Is the null hypothesis true?’ The evidence we have is that children
do better with the new method. But we cannot take a decision yet because there is an-
other thing to consider. If our comparison were based on five students in each group,
we would be hesitant about being definitive in our claims. It is entirely possible, for
example, that the five students who were taught using the new approach just happened
to be bright students (that is, our sample was not balanced). If there were 20 students in
each group, we would be more confident in our claims that the new method constituted
a breakthrough and if this was a large test with 5,000 students we would almost certainly
be justified in saying ‘case proved’. It is clear then that the difference has to be interpreted
in the context of sample size and the implications this has for the possibility of bias in
the sample. What we will find with our statistical tests is that when our sample is small,
the difference between groups has to be large and when our sample is small, even small
differences can be significant.
So let us go back to our question, how does a statistician determine whether the result
of a statistical test is significant? All statistical tests will generate two important values.
One is the test statistic. This is the number that results from putting our test data into the
formula for the test. The other is a probability value associated with this statistic and it
is the one we are interested in. Remember that we are testing a null hypothesis that there
is no difference between the two groups. This is the same as saying that both groups are
drawn from the same population. Just think about this for a second. If the two groups are
from the same population then, assuming that our sampling is sound, the means should
be much the same. If the means are very different and if the null hypothesis is correct,
then we must be sampling from different ends of the population, one side is those who
are better at mathematics and the other those who are poorer. We ask ourselves what the
likelihood of this is and answer, ‘Not very likely’. What the probability value shows us
is the chance of getting a set of results like those we have if the two groups are actually
drawn from the same population.
As you might expect, however, it is not quite as straightforward as this because of what
we can and cannot do with the null hypothesis. (For a discussion on why we cannot prove
that the null hypothesis is true but we can prove that it is false, go back to Chapter 3 again,
especially Case Study 3.3.) The question we ask of the null hypothesis is, more accurately,
‘What is the likelihood that it is not true?’ Our probability statistic tells us the answer.
Some examples will help us to understand this. If we have a probability of 0.001 from
our test, this means the likelihood of the likelihood of it arising through the influence of
chance alone is 1 in 1000; to put it another way, that the null hypothesis has a 1 in 1000
(0.1%) chance of being ‘true’, or, in other words, a 99.9% chance of not being true. If
our probability is 0.999, this means that the null hypothesis has a 99.9% chance of being
‘true’, or, in other words, a 0.1 chance of not being true. In the first case we would reject
the null hypothesis and in the latter case we would fail to reject it. To summarise, signifi-
cance testing is a way of dismissing one factor, chance, that can influence the result. If we
dismiss chance as an influence, we then have to think what other influences may be at
work to give the results we have obtained.
Chapter 13 USING STATISTICS TO SAY SOMETHING SIGNIFICANT 577
We can find in the research literature different ways of expressing the interpretation of
the probability level. They are similar but they can throw you if you have not met them
before. Here are three more ways of saying what we have just explained, all taken from
published material. Significance is explained in terms of:
t the probability of rejecting the null hypothesis when it is true (or when we should
not have rejected it);
t the probability that the differences are due to chance;
t the probability that if we were to draw the samples again we would get the same
result.
We still, however, have not pinpointed the idea of significant difference. How do we
know whether our probability value of .0365 is significant or not? Let us return to an
argument that we have touched on before. If we were researching whether teachers who
undertook no educational duties or activities in their lunch break exercised better class
control after lunch than teachers who patrolled playgrounds or marked work and our test
generated a probability value of 0.5000, would we accept that undertaking no lunchtime
work led to more effective behaviour management if it occurred 50% of the time? We
would clearly be unwise to because the implication is that for the other 50% of time the
two groups of teachers would either be the same or those who worked during the lunch
break would be better. So, how many times would those who did not work at lunchtime
have to be better at managing their classes than those who worked before we think that
the difference is significant? If it happened 999 times out of 1,000 most people would be
convinced. If it happened 75% of the time many people would accept that the teachers
who did not work were better at controlling their classes – but not statisticians. Statisti-
cians are conservative and while they would accept 99.9% as convincing, there is a lower
limit below which convention says they do not go. That limit is usually 95% (statistical
programs present this probability value as 5%, a 1 in 20 chance, because the hypothesis
they use is a null hypothesis).
Why do statisticians work with this limit? Now this is an interesting question be-
cause it shows how custom and practice can become so embedded in a subject that it
becomes an established truth that is not to be challenged. The level of 5% was estab-
lished in 1925 by Ronald Fisher in his book Statistical Methods for Research Workers.
This has been described as the most influential statistics book in the twentieth century.
Whether this is true or not I am not qualified to say, but what is evident is that much
of what Fisher wrote in that book has been republished in texts in the intervening
years, including this one. He later wrote ‘it is a common practice to judge a result
significant, if it is of such a magnitude that it would have been produced by chance
not more frequently than once in twenty trials. This is an arbitrary, but convenient,
level of significance for the practical investigator’ (Fisher, 1925, p. 191). The 5% level
came about because he wanted to use Karl Pearson’s tables of probability values for
statistical tests in his 1925 book. However, Pearson and Fisher had been at war over
Fisher’s use of a statistical test since 1922 (though it is fair to say that the two had been
feuding for some time before this) and permission to use the tables was refused. Fisher
got round the copyright issues by printing only the probabilities associated with the
tails of the distributions between 0.1 (10%) and 0.01 (1%) and saying: ‘We shall not
often be astray if we draw a conventional line at.05.’ And we have been following this
advice since.
To summarise, the range within which statisticians normally work for statistical testing is
between 95% (95 out of 100 or 5% with a null hypothesis), and 99.9% (999 out of 1,000
578 Research Methods for Education Part 3 The Process of Data Analysis
or .001% with a null hypothesis). Conventionally 95% is used more. However, before we
accept these conventions, we should understand something of the history of statistical test-
ing. Much of the work on developing statistical method was carried out in industrial quality
control processes and in medical testing. Drug manufacturers, for example, work necessar-
ily at very low margins of error. We would not be very happy taking a drug if we knew that
there was a 1 in 10 chance that the manufacturing process got the dosage wrong or that it
could have serious consequences for us if we took it. Similarly, car component manufac-
turers have to work to high standards because failures cost money. With social sciences,
however, extremely robust standards are not necessarily called for because the things we
investigate are more ‘fuzzy’ and much less clear cut. We know from many studies that the
educational outcomes of children from single parent families are poorer than those from
two parent families. Writing in the Lancet, Swedish researchers led by Gunilla Weitoft have
shown that health outcomes are poorer also (Weitoft et al., 2003). Their data shows that
the death rate of girls from single parent families was 22.1 per 1,000,000 and for boys it was
49.1 compared with 17.0 for girls in two parent families and 29.6 for boys. Their studies
showed ‘increased risk of psychiatric disease, suicide or suicide attempt, injury or addiction
in children in single parent households’ (p. 294). The increased risk that was established is
statistically significant (at the level of 95%) but (and this is an important ‘but’) there were
also circumstances where differences existed but they were not as marked but might have
entered the conclusions had a less rigorous (90% or even 85%) level been selected.
The implication of this is truly important for education researchers. We have to think
about significance in two ways, first as statistical significance and second as significant
in the context of the research topic. Where we find interesting results whose probability
would indicate we should not reject the null hypothesis, we should think again and con-
sider whether our result is ‘significant’ because the assumptions of statistical significance
are too rigorous for the data we can collect on the issue. We should always remember
that when we are dealing with people, differences may not reach conventional statistical
levels but they are, nonetheless, personally and societally important. However, whatever
level of significance we choose as our benchmark, we have to justify it, either in conven-
tional statistical terms or in terms of our research topic.
a b
a1b two-tailed test
a one-tailed test
b one-tailed test
t In a one-tailed test, only one side of the distribution concerns us, the left deals with
one research hypothesis (A is greater than B) and the right with the other (B is greater
than A).
t With a two-tailed test (when our hypothesis is only that A and B are different), we
deal with both sides of the distribution.
This distinction, as we shall see, is important, because it ties in with something we
learnt in relation to Z tables, that sometimes, depending on what we are trying to deter-
mine, we have to double the probabilities associated with an event or outcome. This may
sound difficult but it is not as long as we are clear about the exact research hypothesis
that we are using. Let us now look at the implications of this. All will become clearer in
the following text.
A two-tailed test
Acknowledging this caveat, that there may be circumstances in education research where
being less rigorous than is conventionally the case may well be justified, let us now see
what happens in a two-tailed test. Figure 13.2 shows this situation in general terms. If
our level of acceptance or rejection of the null hypothesis is set at 5% (we will use the
formulation found in statistical programs, rather than the 95% we have used so far),
in a two-tailed test we allocate 2.5% of our probability to each side of the distribution
because we are sampling both sides of the distribution, that is, we are not specifying in
what way our two data sets might be different. If it were 10%, we would allocate 5% to
each side.
Let us look at this in terms of our Z table (Table 13.1). We cannot be exactly accurate
with this table but we can get pretty close (if we use an on-line checker or a more detailed
set of tables, we can and should be more accurate). Look at column three and find the Z
value closes to 2.5%. It is between 2.2750% and 3.5930%. So 2.5% of the values in the
distribution give a Z value of just below 2 (actually 1.96). If our test statistic is greater
than this we should reject our null hypothesis because in more than 5% of cases where
we obtain this test value from our samples, the difference does not occur by chance. If
580 Research Methods for Education Part 3 The Process of Data Analysis
our test statistic fails to reach the critical level (that is, our research hypothesis cannot
be accepted), we cannot accept our null hypothesis. All we can say is that there is not
enough evidence to reject the null hypothesis.
A one-tailed test
The principles of conducting a one-tailed test are exactly the same as for a two-tailed
test but instead of only saying that our two data sets, A and B, will be different, with
a one-tailed test we say that one will be bigger than the other and specify which one
will be bigger. This has an important consequence for rejecting or not rejecting the null
hypothesis.
In the two-tailed test we saw that we split the critical acceptance/rejection level
between the two tails of the distribution. In our example we put the 5% we are in-
terested in into both ends of the distribution. In a one-tailed test, we only look at
one half of the distribution because our research hypothesis is that A is bigger than
B (or vice versa). Look again at our Z table (Table 13.1) and we see the implications
of this. Again we look at column three to find where 5% probability is. We can see
that it falls between 1.6 and 1.8 (actually 1.645). This raises an interesting issue about
the discriminatory power of one- and two-tailed tests, as we shall see in the next part
of this section. The process of rejecting or nor rejecting the null hypothesis is exactly
the same as for the two-tailed test but what are the implications of using one or the
other? Let us imagine that our test statistic (the value we obtain from putting our data
into the formula for the test) was Z 5 1.8. With a one-tailed test we would reject the
null hypothesis because the critical value of Z is 1.645 (our value is 1.8) but with a
two-tailed test we would not reject the null hypothesis because the critical value of Z
is 1.96 (and our value is 1.8). So the way we phrase our research hypothesis is very
important indeed.
Null hypothesis
Decision True False
t If the null hypothesis is false, we make a correct decision if we reject it and an incor-
rect decision if we do not reject it. This is called a Type II error. Again, in our example,
we would make a Type II error if we said that class management in the afternoon was
not affected by whether the teacher did or did not work at lunchtime when the reality
is that not working would lead to better class management.
Which, if either, is the more serious mistake to make? Imagine a man is being tried
for a capital offence. If he is innocent but he is found guilty, this is a Type I error. If he
is guilty but found not guilty at trial, this is a Type II error. Is it better to hang him if he
is innocent, or release him if he is guilty? Most people would say that the finality of the
former (a Type I error) is the more serious. The implication of this is that we should do
as much as we can to avoid a Type I error.
Let us now think about this in relation to the value at which we will reject or fail to
reject our null hypothesis. Table 13.3 helps us here. It shows us the risk associated with
making a Type I or Type II error at various critical levels of rejecting the null hypothesis.
It shows that the risk of making a Type I error increases as the critical level becomes
less rigorous (that is, there is less risk at 0.01% than there is with 0.1%). It also reveals
that the risk of making a Type II error decreases as the critical level becomes less rigor-
ous (that is, there is more risk at 0.01% than there is at 0.1%). This situation has three
implications:
t First, there is a trade-off between a Type I and Type II error when we change the
critical value at which we test our hypothesis. Perhaps it is for this reason that many
researchers stick to 5%.
t Second, if we wish to minimise a Type I error (to execute an innocent man or incor-
rectly reject a true null hypothesis) then we should choose a lower critical value as our
benchmark.
t Third (and this is particularly important in terms of our determining whether our
research hypothesis is directional or non-directional), at any given critical value, a
two-tailed test includes a Type I error in both tails of the distribution. For example, at
the 5% critical value, a two-tailed test is effectively testing both halves of the distribu-
tion at the 2.5% level.
Table 13.3 Where the risk is with Type I and Type II errors
90% – 0.1
Risk of
making an
error as
critical
value of
95% – 0.05
testing
null
hypothesis
changes.
99% – 0.01
582 Research Methods for Education Part 3 The Process of Data Analysis
Most statistical packages for computers have, as their default, a two-tailed test (that
is, the direction of any difference is not specified). It is usually straightforward to re-
interpret the test statistic as a one-tailed test. However, there are some arguments against
doing this.
t First, the research hypothesis may specify the wrong direction (bigger than rather
than smaller than) leading to the wrong side of the distribution being tested. You
may say that this is no big deal since all we have to do is change the direction of our
research hypothesis. However, while this might take us from a state of not rejecting
the null hypothesis to one where we reject it, it does raise questions about how we
derived our research hypothesis in the first place.
t Second, the use of a one-tailed test without specifying that it is one-tailed can lead to
great confusion in the literature.
t Third, we may be tempted to use a one-tailed test when a two-tailed test just fails the
critical value at which we would reject our null hypothesis. The idea of a statistical
test is to place as much decision making as possible in the hands of procedure and
out of the hands of the researcher, so to use a one-tailed test like this is unethical. No
doubt it happens because researchers like to show that their results are significant!
In summary, there are no hard and fast rules about when to use a one-tailed or two-
tailed test but there are conventions. These are (i) to use a two-tailed test in preference to
a one-tailed and (ii) to use the 5% critical value for significance, other things being equal
(though we can argue for something less robust).
Activity 13.1 provides an opportunity to assess how far academic authors specify the
sorts of thing that we have identified as good practice.
In this section we have identified some principles of t Whether a research hypothesis was correctly speci-
good practice in research procedure. Let us see how fied and, if so, whether it was directional or non-
many published research papers conform to these directional.
principles. t Whether the test statistic was specified as being
Identify about 20 research papers that make use of a one- or two-tailed.
statistical testing procedure. It does not matter which On the basis of your analysis, what conclusions would
tests they use. The papers should be drawn from at you draw about the rigorousness of educational re-
least three journals. For each paper identify: search?
t Whether a null hypothesis was correctly specified.
Chapter 13 USING STATISTICS TO SAY SOMETHING SIGNIFICANT 583
children show a great improvement, the rest show some or a little improvement. Our
test reveals that the significance of the difference between A and the control is much
the same as B.
This is undoubtedly an important finding but which test has a greater impact overall
on reading improvement? By looking at the data from another perspective we can inter-
pret the result differently. In this case, we look at the size of the impact and not the sig-
nificance of the difference as a way of differentiating the two methods. Let us understand
this in a little more detail. With both methods the difference is significant at about the
2% level, in other words we would get our results only 2% of the time by chance if our
samples were actually from the same population. But we still do not know which of the
two new methods produces the better results. All we know is that both are better than
the control method.
We can actually determine this though if we look at something called the effect size.
This is an estimate of the scale of the impact that interventions make and not on the sig-
nificance of the difference. The concept of effect size is used extensively in assessments of
interventions in the learning process because it is especially important that interventions
that produce the greatest leaning benefits are adopted. To measure effect size we have to
know the standard deviation and the mean. The statistic that is generated, Cohen’s d, is
calculated as:
x intervention group 2 x control group
d5 where x is the mean
δ
δ is the standard deviation
Calculation can either be straightforward (if the standard deviations are the same) or
rather more complex if we have to use both standard deviations. The latter situation is
more usual but fortunately there are several on-line calculators that can be used which
do the job for us. If we assume our research data generated the following mean reading
scores and standard deviations:
then d is .49 between A and control and 0.64 between B and control. As a benchmark, 0.2
suggests a small effect, 0.6 a moderate effect and 0.8 or above a large effect. The implica-
tion of this result (remember both new approaches have the same statistical significance)
is that method B has the greater effect and that if we were investing in training for teachers
we should train them in method B rather than method A. It is true to say that effect size is
not extensively reported in education research and most inferential statistical analyses are
built around the existence of a null hypothesis. This situation, though, is being challenged
and there is a growing band of social researchers who are challenging null hypothesis
testing. As Sun et al. (2010) state: ‘Researchers are interested in the probability of the null
hypothesis being true given the data collected. However, NHST (null hypothesis signifi-
cance testing) estimates the probability of obtaining the data given null hypothesis is true,
a logic that is a reverse of research logic.’ They point out, as well, that the use of a thresh-
old value to determine whether or not to reject the null hypothesis is arbitrary; if the
threshold is .05, the difference between a test statistic of .049 and one of .051 is not great,
but one leads us to reject the null hypothesis and the other to not rejecting it. Sun and his
584 Research Methods for Education Part 3 The Process of Data Analysis
colleagues carried out an interesting piece of research to see how far authors of academic
papers were beginning to report effect size. This followed a move by two important aca-
demic associations in the USA (the American Psychological Association in 1994 and the
American Educational Research Association in 2006) to provide guidelines on reporting
effect size in the associations’ journals. They found that out of over 1,500 journal arti-
cles in the two disciplines published over a three-year period, 2005–2007, only 6% were
published in AERA journals which suggests (and only suggests, because we do not have
data on the number of articles in each subject area) that quantitative educational research
lags behind qualitative. Yet the opportunities for educational researchers concerning the
impact of an organisational or teaching innovation are significant, for example, strat-
egies for teaching reading or numeracy in early years, ways of getting hard to reach par-
ents to engage with school, the relative effectiveness of computer assisted or Web-based
learning, the impact of different staff development practices – the list could go on. Evans
and Gibbons (2007) used effect size measures as well as standard statistical tests in their
evaluation of the impact of interactivity in multi-media testing. Their research design was
simple and sound (and easily replicable for an undergraduate or postgraduate project).
They designed a multi-media learning activity (to teach how a bicycle pump worked) and
developed two versions, one with simulations and self-assessment and the other without.
They then tested the students’ knowledge and understanding and found that while the
two methods produced no significant differences in terms of retention of knowledge,
students using the interactive learning system with self-testing and simulation were better
at transferring their understanding. Their interpretation was that interactivity produces
deeper learning and a greater ability to use the understanding in problem solving.
This part of the section has introduced judging effect size as an alternative to the more
conventional hypothesis testing. However, the topic has not been fully explored and if
your research is into impact assessment, then it would be worthwhile developing your
understanding further.
t The second column gives the context for the test. A ‘goodness of fit’ test compares our
observed data with another data set. This could be the population from which it is
drawn (in which case we test how good our sample is) or the data as we would expect
them to be if they were arranged according to some principle we establish (this could
be setting up the data as a normal distribution). Context also describes the samples
themselves. If we have two or more samples and they are independently drawn (that
is, one does not influence the other), then we can see what tests we can use. The num-
ber of variables we are dealing with is also considered under context.
t The third column gives the measurement scale.
This table can be used, therefore, to help researchers select a scale. First we have to
decide which type of question we are asking. Next we have to look at the way we have
sampled, the number of variables we have and so on. And finally we have to consider on
what scale our data will be measured. This broad sequence should help us to focus on
which tests are appropriate and can be used to identify tests that are not considered here.
Because most calculations will be completed using a statistical package, our approach
is to explain what the test does and to illustrate this in broad terms with examples. There
are few detailed computations. With this background, showing which tests are appropri-
ate in what circumstances, it should be straightforward to follow the requirements set
out in the manual of whatever statistical package is being used to input the data and then
follow the guidance for interpreting the test statistic.
Before we look at the tests that we can use for establishing difference (or sameness, they
are two sides of the same coin), we should make one point very clearly. Most researchers
will not restrict themselves to one test. They do this for the following reasons:
t Their data will be based on different measurement scales, counts of people, interval
data such as income or age, quasi-interval data from rating scales and so on. Because
of this, they will need either to reduce everything to a common scale or use different
tests. Most choose to do the latter.
t If the data is sub-divided and then tested (for example, if different age groups or catego-
ries of teacher and teaching assistant are used as populations in their own right), data may
not fulfil the requirements of some tests. In particular, sample size may be small. This
might restrict the use of non-parametric techniques. If this might be an issue, it should be
foreseen at the planning stage and the overall sample size increased accordingly.
t Finally, different tests may be used at different stages of the enquiry. Some of the
more straightforward tests might be used to establish the accuracy of the sample
while more sophisticated tests are used to explore and identify significant associa-
tions within complex data sets.
For these reasons, it is important that education researchers know more about statisti-
cal testing than they need to for any research enquiry. And, with time and experience,
knowledge of tests will increase to way beyond what is considered in this text.
A B C D
before we can establish the significance of that difference. If one mean is £1.57 and second
£4.80 the difference could be significant, yet, in another case, if one mean is £1.89 and an-
other £6.37 the difference may not be significant. What else do we need to know to answer
the question whether two means are significantly different? Figure 13.3 will help us under-
stand what we need to do. The diagram shows four distributions A, B, C and D. Let us im-
agine that A is one sample (those who stay in school for lunch) and B, C and D our second
sample. Let us make it clear: B, C and D are not three samples, they are three examples of our
second sample. Obviously if B were the sample, the chances of the mean of B being signifi-
cantly different from the mean of A is less than if either C or D were the sample. Clearly the
difference is not just the difference between the means but the extent to which the distribu-
tions overlap. This is what Gossett started to look at in his research because it seemed to be
the way in to making a statement about probability. His approach, however, was not to look
at the standard deviations themselves but at the likelihood that the sample means were an
accurate estimation of the population mean. If we look at just one sample, we can calculate
the standard error of the mean (see Chapter 12 for an explanation of this). With two samples
we do not look at the standard error of the mean of each sample but at the standard error of
the difference. We know that the standard error of the mean gives the range within which the
mean will lie with a given probability. The same is true of the standard error of the difference
in the means. To calculate this, we need to know the standard deviations of our samples.
What these terms mean will become clearer when we see them in action.
Let us now look at the circumstances in which we can use the t test. It is important to
know these because the circumstance changes the formula we use.
The test generates a statistic, t, that we interpret for its significance. We can interpret
it through three sources:
1. a book of statistical tables (see, for example, Lindley and Scott, 1995);
2. an on-line look-up;
3. a statistics package.
The test statistic has to be interpreted according to the size of the sample. This is
because this test, in common with most of the tests we shall look at, generates a test
statistic that is a function of sample size, thus large data sets are likely to produce large
test statistics. We do not use sample size itself, however, but something called degrees
of freedom. The degrees of freedom in a test are always the sample size less one. So in
a sample of 58, there are 57 degrees of freedom. The explanation of degrees of freedom
as a concept is difficult (the mathematics are complex) but it is to do with the fact that
once a certain number of parameters of a data set have been established, only a given
number can vary without altering the character of the data set. In many ways, it is easi-
est to accept it as an act of statistical faith and remember that it is (n-1). However, as we
shall see with some other statistical tests, what n is in the context of degrees of freedom
is not always the same.
Now let us think about our test result. The probability value generated by the test tells
us the likelihood that the sample is an accurate sample of the population. If we generated
a probability statistic of 0.38 with a sample of 30, we would not reject our null hypothesis
that the sample and population mean are not different (in fact the statistic suggests it is a
very good sample). If our probability statistic were .021, we would almost certainly reject
our null hypothesis and examine our sample to see what had gone wrong. Think about
what these probabilities mean.
Good practice suggests that the literature should be replete with examples of the
t test being used to test the accuracy of a sample. Searches, however, suggest the op-
posite. It is good practice to demonstrate that data constitutes an accurate sample be-
cause without this being shown, there can be no extension of the arguments from the
analysis of the sample to the population. The failure to observe good practice was noted
by American researchers, led by Linda Zientek, who reviewed quantitative research in
teacher education (Zientek et al., 2008). They comment: ‘For readers to make well-
informed decisions on the basis of a study, the essential elements of study’s design
sample and analysis should be reported’ (p. 209). Only 9% of articles included what
was required to replicate the study and only 17% compared sample characteristics with
population. When this does not happen, there is no evidence for generalising the re-
sults. It is poor practice.
The top line of the equation means we subtract the mean of sample y from sample
x. We ignore any negative values. The second line means we add the mean variance of
sample x to the mean variance of sample y and take the square root. This form of the t
test is used extensively. Most researchers (and statistical packages) make the assumption
that sample sizes and variances (the squares of the standard deviations) are unequal. If
they are not a different formula (not given here) should be used.
Examples of the use of the t test to assess differences between samples abound in the
literature. Welsh researchers used the t test (in Minitab) to compare boys’ and girls’ daily
nutritional intake. They found a significant difference at the 95% level between boys’
and girls’ energy intake (Thomas et al., 2007). Adedeji Tella used it to compare the differ-
ence between boys’ and girls’ motivation to study mathematics and found the difference
to be significant at 95% (Tella, 2007). Two Swedish researchers, Asa Ahlin and Eva Mörk,
used the t test in an interesting way to assess whether the characteristics of municipalities
in Sweden that officially reported school expenditure were significantly different from
those that did not. This was an important test because it determined whether the conclu-
sions from those that reported could be extended to other municipalities. The test did
not detect a significant difference (Ahlin and Mörk, 2007).
developed were an effective discriminator of language development and for this they
matched their samples in terms of age, sex and socio-economic background. Differ-
ences between the two groups as evaluated by the t test were significant at either 94%
(language comprehension) or 99% (sentence development and expressive vocabulary)
(Schlichting and Spelberg, 2003).
When we use the t test in this way, with matched samples, we are not concerned with
the difference between the means of the two distributions but the mean of the differ-
ence between the two situations for the individuals taking part. So if we were looking
at a class of 15 students and their knowledge of current affairs before and after a lesson,
we are less interested in the means for the group before and after the lesson and more
interested in the change in individual scores between the two tests. The test statistic
then becomes:
where x is the mean of the effect (in our example the mean of the improvement in
knowledge)
s is the standard deviation of the effect
n is the number of participants
(ii) x2 (chi-square)
The chi-square test (also written as ‘Chi-Squared’, which is probably more accurate
but chi-square gets more on-line search hits) is extensively used by social science
researchers. This is not surprising because it is easy to understand and interpret. Un-
like the t test which uses interval data, chi-square requires data to be in a counted
form (numbers of people in different socio-economic groups, number in categories
1–5, 6–9, etc.). This means that chi-square is a non-parametric test. The most com-
monly used form of chi-square (and the one found in most statistical packages and
which we shall consider here) was developed by the British statistician Karl Pearson
(originally Carl Pearson, but when he was at the University of Heidelberg his name
became changed).
What the chi-square test does is compare the actual distribution of data with a theoret-
ical distribution of the same size. We shall see how we put this into operation with some
examples later. While it works with a different type of data from the t test, it makes the
standard assumption that samples are randomly drawn and, like most non-parametric
tests, its discriminatory ability weakens as the sample size becomes smaller.
The issue of sample size with chi-square is an important one so we should understand
it in a little more detail. As sample size decreases, so the number of counts in each of our
groups or categories for instance people with an IQ of 70 to 80 is likely to fall. When the
values in categories (in chi-square we call these cells) become low (generally below 5) the
test becomes a poor discriminator of significance. As a rule of thumb, a limit of 10% of
the total data counts should be in cells with fewer than five counts. If this does occur, we
have two solutions, either to increase the sample (and we should have thought of this
at the start of our planning!) or we combine cells. However, it is not just at low sample
sizes that we experience difficulties. When the sample size is large (over a thousand),
594 Research Methods for Education Part 3 The Process of Data Analysis
chi-square has a tendency to show differences as significant. It may be that there is a bet-
ter test that we could use but with large samples it may be safer to use a higher level of
significance, say 99 %. There is an implication in this that we should be aware of. Many
of the data we use may be expressed in percentages. Can we calculate chi-square using
percentages rather than actual totals? As a general rule the answer is ‘no’. If the actual
number of counts is small, say 35, and if we use percentages, chi-square thinks that our
sample is three times as large, with the consequence that it overstates the probability sta-
tistic. If the actual data count is between 90 and 110, then there is probably no impact.
With large sample sizes, using percentages will give us a more conservative test statistic
but the overall result is that we will not know where we are with our test.
Let us now see how chi-square can be used.
From this table it is now clear why we call the values in each group ‘cell values’.
The test statistic is quite straightforward to calculate:
Σ (O 2 E)2
x2 5
E
where O is the observed value
E is the expected value
S is the sum of the calculation for all the cells
So, for example, the calculation with our data is:
have to bear in mind that we are testing a null hypothesis, that there is no significant
difference between our two distributions. The probability value (often just labelled p
or pr in output from statistical packages) is the chance that the sample is drawn from
the population if the null hypothesis (that there is no difference between the two dis-
tributions) is correct. We have to set the probability level we obtain in our test against
the threshold level that we established at the outset of our test. Let us assume that
we established a threshold level of .05. This means that there is only a 5% chance of
drawing the sample that we are testing when the null hypothesis (that there is no dif-
ference between the sample and the population) is true. Now in our case, this prob-
ability is either 30% (rounded from .298) or 34% (.340). Neither of our probability
values is close to a point at which we would reject the null hypothesis, which, given
that we want our sample to be a good one, is a good result.
For a small number of values, these calculations are easy to do with a calculator. These
were done using an on-line calculator. Often the advantage of an on-line calculator or
commercial statistical package is that they will provide degrees of freedom and the test
probability level. Without these, we have to look up the test statistic in a table of values
or use a table on-line.
∑ (O 2 E )
2
E
However, in this case we derive the expected distribution in a different way.
The expected value for any cell is the product of that cell’s row and column probabili-
ties. This sounds complex but it is, in fact, extremely straightforward. The row probability
is given by row total as a proportion of the grand total (that is, the total number of cases)
Predominantly Predominantly
healthy Mixed unhealthy Totals
and the column probability by the column total as a proportion of the total number of
cases. The product of this is given by:
row total 3 column total
grand total
An example will show what an easy calculation this is. If the row total is 22, the col-
umn total 37 and the grand total 226, then substituting these in our formula, we find
that our cell expected value is 3.6:
22 3 37
5 3.6
226
This chi-square statistic can be calculated manually or by an on-line calculator or by
a statistical package. If the calculation is not done by hand, care must be taken to use a
calculator or package that deals with two-way data and not one way. The benefit of using
an on-line or package calculator is that most will generate the chi-square test statistic, the
number of degrees of freedom and the probability value for the statistic with the given
degrees of freedom. If we calculate the statistic manually (and it is straightforward as
long as we can multiply, divide, add up and subtract in the right order), we should know
how to determine the degrees of freedom. Even this is easy:
rows 2 1 3 columns 2 1
In our example (Table 13.5):
(2 2 1) 3 (3 2 1) 5 1 3 2 5 2
Chi-square is easy to understand, simple to operationalise and is a robust method that
has stood the test of time so it is not surprising that it has proved to be popular amongst
education, social and business researchers. An interesting example looked at how far a
sample of UK schools conformed to school food standards. Kaklamanou, Pearce and Nel-
son (2012) from the School Food Trust conducted research into whether there were dif-
ferences in food provision in academies (independently managed but funded directly by
central Government) and schools maintained by local authorities. This is an interesting
study not only because it blends qualitative and quantitative approaches but also because
the Academies Act (2010) specifically exempted academies from meeting compulsory
school food standards introduced in 2006 (on the grounds, as the Department for Edu-
cation put it, that ‘academies will always do the best for their pupils’ (Kalamanou et al.,
2012, p. 3). For lunch provision the researchers found no difference between academies
and maintained schools but in terms of break-time foods, provision of cakes, biscuits,
confectionery and savoury foods was significantly worse in academies. The chi-square test
played only a small part in the research programme but it did enable the authors to make
a strong statement about the absence of a national set of standards and inappropriate food
provision. A Finnish researcher, Professor Päivi Tynjälä, used the test to look at the learn-
ing experiences and learning outcomes of students taught in a traditional way and those
taught on a constructivist way. She was able to show that those who learnt in a constructiv-
ist environment acquired more diversified knowledge (Tynjälä, 1999). Because she only
had one degree of freedom, she chose to test her data for significance at the 99% level.
In another study, Andre Brouwers and Welko Tomic of the Dutch Open University
used chi-square in a study of teacher burnout and its relationship with self-efficacy, the
extent to which a teacher believes they can affect student performance (Brouwers and
Tomic, 2000). They measured three dimensions of burnout: emotional exhaustion, dep-
ersonalisation and personal accomplishment and evaluated self-efficacy on a Likert scale
598 Research Methods for Education Part 3 The Process of Data Analysis
Working out chi-square a Nigerian researcher, Michael tude to educating in the mother tongue. He used chi-
Ejieh, looked at an issue faced by many educators in square to test whether their attitude (two categories)
developing countries (and by many parents in small was related to their subject of study (four categories).
countries). Should children be educated in their
t How many degrees of freedom is this?
mother tongue or in a global language? He investi-
gated the attitude of primary level student teachers His x2 statistic was 3.019. Is this significant at the 95%
and found that they had a generally negative atti- level?
(see section 8.3(i) and 8.3(ii)). They were particularly interested in how burnout and
self-efficacy changed over time, and developed a series of models to represent this. The
relationship between these models and the data set was tested using chi-square. This is a
sophisticated analytic study with important implications for the process of managing the
education system and teachers. We should remember, however, that their conclusions
rest upon a simple but valid application of a robust statistical technique, chi-square, that
is equally available to new researchers.
Activity 13.2 is a chance to test your ability to interpret x2 output.
300
250
200
150
100
50
0
1 2 3 4 5 6 7 8 9 10 11 12 13 14
should note about the test. It is less sensitive if the maximum difference is at the tail of the dis-
tribution. In this case, comparison with another test such as chi-square may be warranted.
Let us now see how we can use the test.
t Set the data out as a cumulative distribution between 0 and 100% (or use raw data if
the samples are the same size) and identify the maximum difference (D).
t Compare this with a critical value for D that takes account of significance level and
sample size. Note that the calculation takes account of the sample size of each distri-
bution.
Again, this procedure can be accomplished using statistical packages and on-line cal-
culators.
The KS test was used in this way by three Croatian researchers, led by Ivan Prskalo,
in a study of the impact of the Bologna process (the co-operation by EU member states
to create transparency and commonality in European higher education) (Prskalo et al.,
2007). Their study group was pre-school and primary trainees and the focus of the study
was their preparation in physical education. In particular they were interested to discover
whether the Bologna process has affected either absolutely or relatively instruction hours
of teacher trainees. In a detailed analysis of how curriculum hours were used before and
after the Bologna changes, they were able to show that the changes that had occurred
were significant at the 90% level. They go on to discuss the implications of the changes
in terms of the preparation of trainees for the workplace.
Male Female
men and women and our research hypothesis is that we are systematically under-mark-
ing women. This is a one-tailed test (because our research hypothesis is directional).
Using an on-line calculator, we obtain a test statistic for a one-tailed probability of 0.07,
so we do not reject my null hypothesis if I use the conventional level of significance of
0.05%. If I had obtained these figures, however, I would be concerned that we were so
close to the critical value and I would want to conduct more research. One thing I could
do is go back in time and look at the grades given by this colleague in the past. I would
add these data (for this year) to the grades awarded last year and conduct another test
and I would repeat this by adding in data for the year before that as well.
58 87 64
73 54 57
27 77 51
56 45 40
32 40 58
one variable (modules) and we would compare the sum of the variances for the individ-
ual modules with the sum of the variances for all the observations treated as one data set.
of 163.8/290.5. These two values are derived from the sum of squares divided by
the degrees of freedom. F is the test statistic that we look up if we are using tables of
significance to see what its associated level of probability is.
t Most calculators, however, will perform this for us. In this case the probability is .583.
Putting this another way, 42% of the times we reject the null hypothesis, the null hy-
pothesis would be correct. This is such a high error rate that we cannot reject the null
hypothesis.
Test your own ability to interpret F ratios with Activity 13.3.
While our example used made up data, three South t specify the null hypothesis for each table
African researchers used ANOVA to look at the ef- t specify at what probability level you reject or do
fects of streaming boys and girls in mathematics not reject the null hypothesis
classes. They sampled nearly 1,500 students in four t express our acceptance/rejection of the null hy-
schools, one mixed but with segregated classes, one pothesis
mixed with mixed classes and two single sex, one for t look at all the results and suggest what an effec-
boys and one for girls. Individual pupil data were tive policy response might be.
mathematics test scores. Tables 1, 2 and 3 show the
results. Your task is to: Source: Bosire et al., 2008
Undergraduate Postgraduate
Conventional class Test results for undergraduates Test results for postgraduates
taught conventionally taught conventionally
Problem-based class Test results for undergraduates Test results for postgraduates taught
taught using problem-based method using problem-based method
Computer-based Test results for undergraduates Test results for postgraduates
learning taught using computer-based method taught using computer-based
method
Corrected model
Intercept
Student level F S
Learning model F S
Student level * learning model F S
Error
Total
Corrected total
Values entered
B1 B2 B3
A1 22 37 26
47 54 39
36 66 53
49 59 49
37 48 33
54 40 37
A2 54 61 54
35 68 37
41 52 49
42 46 58
27 40 61
52 59 56
ANOVA summary
Source SS df MS F P
bg 1305.2222 5
rows 312.1111 1 312.111 2.91 0.098366
columns 754.8889 2 377.444 3.52 0.042340
r3c 238.2222 2 119.111 1.11 0.342717
wg 3219.3333 30 107.311
Total 4524.5556 35
on each of the three learning models (B1 to B3). The lower part of the table shows the
output statistics. We had three null hypotheses:
1. Student level (rows): Null hypothesis is ‘there is no significant difference between
undergraduates and postgraduates’. The probability statistic for this is 0.098. If our
critical level is 0.05 (95%), we would not reject our null hypothesis (but with only six
subjects, we should not be too dogmatic in our judgements).
2. Learning model (column): Null hypothesis is ‘there is no significant difference in
student performance in the three learning situations’. The probability statistic here is
0.042, which is within the 5% level, so we can reject our null hypothesis. When we
look at the data there is evidence that students perform better on the conventional
learning model.
3. Interaction between student level and learning model (r 3 c): The probability statistic
here is 0.34. We cannot reject the null hypothesis.
The outcome of our analysis is that the learning model is the prime factor in determining
outcome grades and while there are differences between undergraduate and postgraduate
students in grades these are not significant at the 95% level. There is no evidence that under-
graduates do better than postgraduates with any learning model or vice versa.
ANOVA is an adaptable and powerful technique. It is based on identifying and meas-
uring associations between data sets on the same basis we test relationships between data
sets. This underlying statistical model is called the linear model. We will now look at its
use in the identification of significant relationships.
In Chapter 12 we saw that we could describe a relationship between two variables dia-
grammatically in a regression line and numerically in a correlation coefficient. Here we
want to see how we can test the significance of relationships and then what statistical
models we can use to describe the complexities of some of these relationships.
6 .8 4 2.67 0.056
400 .43 398 9.5 ,0.001
We start with our null hypothesis, in this case that there is no relationship between
the variables. The test statistic is straightforward:
n 22
t5r.
1 2r 2
where n is number of pairs of variables used to calculate the correlation coefficient
r is the correlation coefficient that we calculated
However, it is even easier to do the calculation on-line or through a statistics package.
Table 13.13 shows the results for our two examples. We should note the following:
t Sample size (n) and the correlation coefficient (r) are given in the first two columns.
The top row is our sample of six and the bottom row our sample of 400.
t The degrees of freedom are always the number of pairs minus two, one for each vari-
able.
t The results of the t test are given in the last column, which shows the probability
that the statistical relationship occurs by chance. In the case of the sample of six, it is
about 5%; in the case of the sample of 400, it is about 0.1%. What this is telling us is
that our lower correlation coefficient is, in fact, far more significant.
The test that we have used here can also be used with Spearman’s rank correlation
coefficient (see section 12.2(vii), 12.2(vi)).
There is another way in which we can assess how accurate our sample correlation
coefficient is as an estimate of the population correlation coefficient. This is to establish
the limits within which the population correlation coefficient lies with a certain level of
confidence. Confidence is expressed as a probability. It can be any level of probability
but 95% is the convention. With a 90% probability the confidence interval (the range
between the confidence limits) will be larger than 95% and with a 99% probability the
interval will be smaller. Most researchers stick to the middle!
How do we obtain (and interpret) a confidence interval? The calculation is based on
the Z table that we met early in this section. The good news is that there are many on-line
calculators as well as statistics packages that will produce the answers for us so we do not
have to resort to doing it ourselves. Let us use the coefficients we tested for significance
in Table 13.12.
t Look first at the correlation coefficient of 0.43 based on a sample of 400. Our on-line
calculator determines that at a 95% level the coefficient ranges from 0.347 to 0.506
(remember, the coefficient was significant at about the .01% level). How do we inter-
pret this? Many people would say that this means that 95% of the time the true mean
falls within the range .347 to .506. It may be a little fussy to make this point but this
is not quite right. The confidence limit is established on the basis of sampling and
we should more accurately say that in 100 samples, 95 will contain the population
mean. The difference between the two interpretations is subtle yet meaningful.
Chapter 13 USING STATISTICS TO SAY SOMETHING SIGNIFICANT 609
t Now let us look at our other set of data. Here the coefficient of 0.8 was based on
six pairs of data. Our on-line calculator determines that at a 95% level the coefficient
ranges from 20.032 to 0.977. This implies that despite the fact that the coefficient
was significant at about the 5% level, it is virtually worthless because the confidence
interval is so great.
There are two other points we should make about confidence intervals and limits.
First, they can be calculated for other measures (such as mean, regression lines). Most
calculation can be done for us using statistical packages. Their interpretation is the same
as above. The second thing to note about confidence intervals is what they tell us about
sample size and sampling. We know from our example above that the confidence inter-
val is large with a low sample size and becomes smaller as sample size increases. Let us
imagine that we have a correlation of 0.7. With a sample size of 20, the confidence limits
are from .374 to .872. With 100, they are from .584 to .788 and with 500 from .653 to
.742. As researchers with limited resources, we have to get the best estimate of correlation
at the lowest cost and there is clearly a trade-off between accuracy and cost. It is for this
reason that we should take particular care to determine a sample size that will give us the
accuracy we need. If you have not done it yet, now is the time to re-read section 6.6 and
follow this up by reading Appendix 1 now that we have covered more technical material.
Factor Indicator
tion (Weiss et al., 2003). This is an important issue given the policy thrust (in the UK
especially) to see employment as a way of relieving child poverty and overcoming
the likelihood of adults reproducing the inequalities of their own childhoods. The re-
searchers collected data on 390 children drawn from three different cities in the USA.
Table 13.15 sets out how the variables they used related to mothers’ involvement.
Note how low correlations were significant at 95 and 99% levels. This shows the im-
portance of sample size. The results have policy and practice implications. They show
that ethnicity was not an influencing factor, that part- and full-time work influenced
involvement in different directions – part-time work was positively associated and full-
time work negatively associated – and that mother’s education and age were positive
influences, that is, better educated and older mothers were more involved with their
children’s education. From a research design perspective, what is equally interesting is
the way in which the researchers used qualitative methods (interviews and observa-
tion) to gain an understanding of the processes at work in the mothers’ lives. These
data were analysed using a coding methodology on QSR NUD*IST (now Nvivo, see
section 11.2.(c)). The benefit of this two-pronged approach was, as the authors say,
that ‘emergent findings from one method helped to shape subsequent analyses per-
formed by another method’.
While partial correlation offers insights, it misses something as well. It is rather like
undertaking a two-way ANOVA but not looking at the interaction effect between the vari-
ables. Multiple correlation helps us do this.
do well in tests), it is not surprising that we get a high correlation coefficient. If there
is no change in the character of those applying to higher education, then we might ac-
cept that this correlation model is an accurate predictor of output. There is, however,
a strong move in some countries (the UK included) to expand the number of young
people from disadvantaged backgrounds entering higher education. If we keep the
same correlation model, we would find with this policy that the correlation coefficient
goes down as the policy becomes more successfully applied because of the likelihood
that students from disadvantaged neighbourhoods will have poorer academic qualifi-
cations. Quantitative analysis will give us clear answers but we always have to think
about its implications.
of the teaching approaches associated with the programme; they used questionnaires to
collect data of teacher characteristics and a log completed by teachers which classified the
extent to which they complied with the teaching specification; and, finally, they used a
range of statistical measures (mean and standard deviation, correlation, regression and
t tests) to process and interpret the data. They used the stepwise procedure to associate
teacher and school characteristics with the extent of unaltered curriculum use and found
that the principal determinants (at either 5% or 1% levels) were a school policy on sex
education and a range of curriculum beliefs such as teacher benefits, personal beliefs
and self-efficacy (a belief in one’s own ability). It is fair to say, though, that this is a
little used technique amongst educational researchers. While it solves a problem, much
quantitative analysis has moved on to using other techniques that are briefly described
in point(iv) below.
They took 447 young people involved with the youth justice system and tested them on
two scales:
1. One that measured anti-social processes (self-centredness, callousness and the ten-
dency to behave impulsively).
2. Another that measured childhood psychopathy (personality disorders).
They then correlated each individual’s scores on these two scales with whether or not
they had been charged with specific offences. It was this ‘yes/no’ element that made the
point-biserial coefficient the one to use. There were 68 correlations each for the anti-
social processes and psychopathy scales, 136 in total. With this number of correlations
at, for example, the 95% level, we can expect some six or seven of them to be significant
by chance; though just which ones they might be we have no way of knowing. Because of
this likelihood, on the basis of the coefficients they obtained they were unable to reach
any firm conclusions about the broad relationship between criminality and what these
tests measured. Indeed, some of the correlations suggested that higher test scores might
be associated with a lower propensity to engage in criminal activity! We might ask why
their results failed to confirm a hypothesis that many would find intrinsically appealing.
It could be that the simple categorisation of whether the people had or had not commit-
ted the named offences was too crude and that a composite measure would have yielded
a better result as it would differentiate single from multiple offenders. It could also be
that by testing only those who had a criminal record they were not dealing with a large
Chapter 13 USING STATISTICS TO SAY SOMETHING SIGNIFICANT 615
enough population and that had they included people who had no record, the greater
range in test scores might have generated higher correlation coefficients. This assessment
makes an important point. Just because we have not rejected our null hypothesis does
not mean that our research hypothesis is wrong. What may be at fault is the way we
have gone about the research (though this is not to say this was the case with Boccacini’s
group).
In 2006, two educational researchers at Goteborg this sector were over-sampled in order to produce
University in Sweden, Eva Myrberg and Monica robust research data. Out of a total of 10,632 pupils
Rosen, reported on a study that looked at reading tested, 1,034 attended private schools.
achievement amongst third grade pupils (age 10) in A selection of Myrberg’s and Rosen’s test results are
public and private schools. The underlying issues that set out below. Your task is to complete the tables,
the research tackled was the extent to which afflu- either by calculating the test statistic and assessing
ence and social status led to a privileged allocation its significance or by assessing the significance of the
of educational resources with the consequence of test statistics that are given.
differential educational outcomes and implications
for the quality of education in public schools in the
context of a national goal of equality in educational Question 1
provision.
What statistical test would you use to compare
Their method was to conduct the same test with stu- whether the difference between the means was sig-
dents in public and private schools. Because private nificant? Use the test and assess whether the differ-
schools only accounted for 3% of the total, pupils in ence was significant.
▲
616 Research Methods for Education Part 3 The Process of Data Analysis
Telling stories
Never/almost never 12.1 9.7
Sometimes 50.4 45.0
Often 37.6 45.3 7.803
Reading to child
Never/almost never 1.6 1.0
Sometimes 28.4 19.8
Often 70.0 79.2 12.165
Source: Myrberg, E. and Rosen, M. (2006) ‘Reading Achievement and Social Selection in Independent Schools in
Sweden: Results from IEA PIRLS 2001’, Scandinavian Journal of Educational Research, 50(2), 185–205.
Chapter 13 USING STATISTICS TO SAY SOMETHING SIGNIFICANT 617
But this encouragement for researchers to look to quantitative approaches comes with
a caution. Not only do we have to be aware of what the techniques do, we also have to
be aware of what conditions they require for their data and the assumptions that a test
requires. For this, we have to look outside the black box that is the statistical program
and be clear that the assumptions it makes are the same as we require. Before we use a
package test, we should read what it says on the label.
This section has brought us almost to the end of our research journey. All that remains
to be done is to see how the research package comes together. This is what we shall look
at in Chapter 14.
Summary
t We have learnt how the standard deviation is the bridge between describing our
data and being able to assess the significance of our results. A key aspect of this is
the Z table, which gives the probabilities within the normal deviation for all values
of Z. We have learnt how to convert our data into Z values and seen how we can
then make probabilistic statements about our data.
t We have seen that statistical testing only answers two types of question, ‘How dif-
ferent are these’? and ‘Are these two related?’ We refreshed our memory about the
way in which we phrase the question in a statistical test, noting that all a test would
allow us to do is reject a null hypothesis.
t This chapter introduced the idea of a level of significance, expressed in probabilistic
terms, for rejecting or not rejecting the null hypothesis – what is the probability that
this result has been produced by random processes? We found out that statisticians
conventionally use the 95% (5%) or 99% (1%) thresholds but added a warning note
that social scientists and education researchers could and probably should be a little
more flexible.
t We also learnt that significance was not necessarily the only indicator of a statistical
test that we should examine. We pointed out that significance testing ignores the
effect size of a relationship, which may be more important in some situations. As
well, we looked at confidence level, something that is particularly important with
small sample sizes.
t We learnt that the alternative to the null hypothesis is the research hypothesis and
that we should understand the implications of specifying it as a two-directional or
one-directional hypothesis, in particular how critical it is for interpreting the level of
significance.
t We have been introduced to a range of statistical tests and now understand that
we choose a test primarily on the basis of the question we want answered and the
character of our data. This section has made it clear that there are many more tests
available than it described but the tests introduced here are a good foundation for
reading research literature and for researchers beginning their research careers. We
have learnt that even if we have complex data sets there are tests that are within
our capabilities to use as new researchers, so we should not necessarily confine our-
selves to ‘easy’ questions.
t Perhaps, however, the most important thing to have learnt from this section is that
technology has taken the difficulties of computation and assessing significance out
of our hands and that undertaking quantitative research is not only more straight-
forward, it is also, arguably, easier than qualitative research.
618 Research Methods for Education Part 3 The Process of Data Analysis
Further reading
Cohen, L. and Holliday, M. (1996) Practical Statistics for sidered, it deals with more tests that a new researcher
Students, Paul Chapman Publishing, London. might use. It is written in a straightforward way that,
Ellis, P. (2010) The Essential Guide to Effect Sizes: Sta- if you have followed this section, you will manage.
tistical Power, Meta-Analysis, and the Interpretation of Neave and Worthington’s book also provides more
Research Results, Cambridge University Press. tests for the beginner researcher but in their case, all
Neave, H.R. and Worthington, P.L. (1988) Distribution are non-parametric. Ellis’ text provides a good next
Free Tests, Unwin-Hyman, London. step in effect size analysis.
Advising on further reading for statistical analysis could After this level, the analytic path leads onto multi-variate
be a lengthy undertaking, so great is the range of analysis, statistical procedures based on different
methods and texts available. Instead, I have chosen theories of probability (especially Bayes’ theorem)
only to direct you to the next level. and non-linear modelling.
Cohen and Holliday’s book is included because,
although it covers some of the material we have con-
References
Ahlin, A. and Mörk, E. (2007) ‘Effects of Decentralisation Lehmann, E.L. and D’Abrera, H. (2006) Nonparametrics:
on School Resources: Sweden 1989–2002’, Working Statistical Methods Based on Ranks, Springer, New York.
Paper 9, Uppsala University, Department of Econom- Lindley, D. and Scott, W. (1995) New Cambridge Statisti-
ics, Uppsala. cal Tables, Cambridge University Press, Cambridge.
Boccacini, M., Epstein, M., Poythress, N., Douglas, Myrberg, E. and Rosen, M. (2006) ‘Reading Achieve-
K., Campbell, J., Gardner, G. and Falkenbach, D. ment and Social Selection in Independent Schools in
(2007) ‘Self-Report Measures of Child and Adoles- Sweden: Results from IEA PIRLS 2001’, Scandinavian
cent Psychopathy as Predictors of Offending in Four Journal of Educational Research, 50(2), 185–205.
Samples of Justice-Involved Youth’, Assessment, 14(4), Neave, H. (2011) Statistics Tables: For Mathematicians,
361–374. Engineers, Economists and the Behavioural and
Bosire, J., Mondoh, H. and Barmao, A. (2008) ‘Effect of Management Sciences, Routledge, London.
Streaming by Gender and Student Achievement in Peterson, D., Finn-Aage, E., Taylor, T. and Freng,
Mathematics in Secondary Schools in Kenya’, South A. (2007) ‘Youth Violence in Context: The Roles of
African Journal of Education, 28(4), 595–607. Sex, Race and Community in Offending’, Youth
Brouwers, A. and Tomic, W. (2000) ‘A Longitudinal Violence and Social Justice, 5(4), 385–410.
Study of Teacher Burnout and Perceived Self-efficacy Prskalo, I., Fiudak, V. and Neljcik, B. (2007) ‘Educating
in Classroom Management’, Teaching and Teacher Future Pre-School and Primary School Teachers
Education, 16(2), 239–253. to Teach Physical Education: Bologna Process in
Evans, C. and Gibbons, N. (2007) ‘The Interactivity Croatia’, Kinesiology, 39(2), 171–183.
Effect in Multimedia Learning’, Computers & Educa- Schlichting, J. and Spelberg, H. (2003) ‘A Test for Meas-
tion, 49 (4), 1147–60. uring Syntactic Development in Young Children’,
Fisher, R.A. (1925) Statistical Methods for Research Language Testing, 20(3), 241–266.
Workers, Oliver and Boyd, Edinburgh. Stumpf, H. and Stanley, J. (2002) ‘Group Data on High
Hofer, T., Przyrembel, H. and Verleger, S. (2004). New School Grade Point Averages and Scores on Academic
Evidence for the Theory of the Stork, Paediatric and Aptitude Tests as Predictions of Institutional Gradu-
Perinatal Epidemiology, 18(1), 88–92. ation Rates’, Educational and Psychological Measure-
Kaklamanou, D., Pearce, J. and Nelson, M. (2012) ments, 62(b), 1042–1052.
Food and Academies: A Qualitative Study, School Food Sun, S., Pan, W. and Wang, L. L. (2010). ‘A Comprehen-
Trust. sive Review of Effect Size Reporting and Interpreting
Chapter 13 USING STATISTICS TO SAY SOMETHING SIGNIFICANT 619
Practices in Academic Journals in Education and Weitoft, G.R., Hjern, A., Haglund, B. and Rosen, M.
Psychology’, Journal of Educational Psychology, 102(4), (2003) ‘Mortality, Severe Morbidity and Injury in
989–1004. Children Living With Single Parents in Sweden: A
Tella, A. (2007) ‘The Impact of Motivation on Students’ Population-Based Study’, The Lancet, 361, 289–295.
Academic Achievement and Learning Outcomes in Wiefferink, C., Poelman, J., Linthorst, M.,
Mathematics Among Secondary School Students in Vanwesenbeeck, I., van Wijngaarden, J. and
Nigeria’, Education Journal of Mathematics, Science and Paulussen, T. (2005) ‘Outcomes of a Systematically
Technology Education, 3(2), 149–156. Designed Strategy for the Implementation of Sex Edu-
Thomas, N-E., Cooper, S-M., Graham, M., Boobler, W., cation in Dutch Secondary Schools’, Health Education
Baker, J. and Davies, B. (2007) ‘Dietary Habits of Research, 20(3), 323–333.
Welsh 12–13 Year Olds’, European Physical Education Zhou, L. and Fan, Z-Z. (2007) ‘Discrepancy Between
Review, 13(2), 247–256. Native English Speaker Teachers’ Teaching Styles
Tynjälä, P. (1999) ‘Towards Expert Knowledge? A Com- and Chinese English Learners’ Learning Styles’,
parison Between a Constructivist and a Traditional US–China Education Review, 4(9), 15–20.
Learning Environment in the University’, International Zientek, L., Caprano, M. and Caprano, R. (2008)
Journal of Educational Research, 31(5), 357–442. ‘Reporting Practices in Teacher Education Research:
Weiss, H. et al. (2003) ‘Making it Work: Low-Income One Look at the Evidence Cited in the AERA Panel
Working Mothers’ Involvement in Their Children’s Report’, Educational Researcher, 37(4), 208–216.
Education’, American Educational Research Journal,
40(4), 879–901.
Chapter contents
Introduction 621
Summary 648
Further reading 649
References 650
Chapter 14
Learning themes
This section looks at examples of actual t Begun the process of developing your own
research strategies. By the end of the section, research programme.
and the text, you will have:
By the end of this section you will:
t Appreciated how many research
strategies require research questions t Be aware of research strategies open to
to be formulated in a specific way. you and be confident in your ability to
justify one of your own.
t Looked at research through the perspec-
tive of six research frameworks: hypoth- t Be confident in your ability to plan and
esis testing, experiment, ethnography, prepare your own research programme.
case study, action research and evaluation. t Understand the value of your research to
t Seen how researchers have put together both yourself and to society.
research methods within these frameworks.
Introduction
Grasping the ideas, understanding the meth- The guidance for reading this text makes it
ods and appreciating the issues covered in clear that the parts and sections do not have
this text is an enormous undertaking. There to be mastered in sequence. However, there
is, however, one more key point that remains is clearly a sequence in the way in which it is
to be emphasised. This is that the research has organised. By and large this reflects the way
to be conceptualised as a whole before we in which many social science researchers will
start. This has been touched on before (for put together a research programme. The
example, when we talked about the need to process, however, is far more than a sequence
think about how data were to be analysed at of free-standing stages. Conceptualising the
the same time as identifying how we would research process is far more than deciding on
obtain our research evidence) but what is dif- the research question, deciding what data
ferent now is that it is the core theme of this are needed and where they will come from
section. and then analysing them. Certainly starting
622 Research Methods for Education Part 3 The Process Of Data Analysis
the process without knowing the route to way we arrange things to ensure that
be taken is more than likely to be a recipe when things do go wrong we still have an
for research disaster. Organising a research exit route.
programme means that we should:
The capability to do all of this rests firmly
t have a plan for what should happen at upon having a sound appreciation of a wide
each stage; range of research methods but it is fair to
t understand the possible implications that point out that our capability is enhanced by
a procedure at one point might have for a practice and experience. By this point in the
decision we have to take subsequently or text we can assume that the foundations of
even previously; an understanding of research methods are
well on the way to being laid. The purpose of
t appreciate that we can get blown off
this section is to start building the experience
course by unforeseen circumstances and
that turns students into effective researchers.
that we should build flexibility into the
working with particular data collection and analysis methods, there are no hard and
fast rules to this. There is considerable flexibility in how we go about the process of
answering the research question and usually there are alternatives to the way that
might first suggest itself. While these alternatives are rarely discussed in research
literature, we should explore them, if only to convince ourselves that the obvious
approach is the best one. One will, however, begin to take shape. It will be influenced
not only by our research question but also our understanding of data availability, the
resources we have, our knowledge of what methods might work and what methods
have been used.
Let us now return to the peaks. They are all frameworks within which people can work at
the start of their research careers. Our intention is to understand why they were appropri-
ate to our examples and to see how the researchers worked within the research approach.
The research strategies that we shall examine are:
r Case study.
r Evaluation.
r Ethnography.
r Action research.
r Experimentation.
r Hypothesis testing.
r Research questions.
adopted other approaches but their interest was in looking at an island of success in a sea
of, if not failure, then performance below expectations.
(ii) Evaluation
(a) Phrasing the research question
One feature of school life that is beginning to change in many developed countries
is that schools are becoming less isolated from other agencies concerned with social
and behavioural issues. Society is thus taking a more holistic view of how to deal with
problems that may have a cause in one part of a child’s world but have consequences in
others. Typical of society’s concern is anti-social behaviour and how to tackle it.
Rachel Sandford, Rebecca Duncombe and Kathy Armour, all at Loughborough Uni-
versity in England, looked at this very topic. They observed that ‘sport/physical activity
programmes are popular tools [to confront anti-social behaviour], based on a prevailing
belief in their potential to instil positive attitudes, traits and values, and to secure a wide
range of benefits for young people who are disaffected with, or disengaged from, one or
more aspects of society’ (Sandford et al., 2008, p. 419). However, they also noted that
‘there is a lack of robust, empirical evidence to support those beliefs’ (p. 420).
Having established a need for research to inform the issue, these researchers could
have gone in different directions. They could have made a quantitative assessment
of outcomes on a national scale; did people experiencing a sport/physical activity
intervention in their lives offend less? Instead, they chose to look in detail at two specific
programmes. In both cases it was an evaluation of what has happened. The path they
chose was to look at projects that sought to involve young people with physical activity.
There is an important point emerging here. Above we looked at case study as the
strategy for resolving a question. Here, evaluation is the strategy and the projects are case
studies, that is, a case study is the lens through which we look at the data. There is a subtle
difference though with the case study we looked at above. There, the interest was in what
we could learn from the case study, what went right and what went wrong. Here, our
interest is in whether an approach is effective in delivering policy goals.
that ‘have included climbing, abseiling, horse-riding, skiing, tennis, football, martial arts,
aerobics and circus skills’ (p. 424). The programme ends with some sort of competition
and a celebration party. With over 7,000 young people having been on the programmes in
the projects they looked at, the researchers had access to sufficient data for the quantitative
assessment they wanted. Their approach was to examine student profiles and behaviour
before and after involvement with the sports programmes. These profiles (which covered
attendance, behaviour, confidence and self-esteem) were produced by teachers periodically
during the programme period. All the evidence they collected, from the objective number
of referrals to the impressionistic teachers’ views of behaviour and attitude, points to an
improvement in student behaviour. But as they suggest, the need now is to understand
what was happening, in particular the processes responsible for the improvement.
Evaluation is, however, a flexible strategy, and need not be confined to questions
about policy. Another example will show us how an evaluation strategy can be a way of
shaping understanding in order to develop principles of good practice. Rob Bowker, an
English academic, is interested in teaching and learning strategies, in particular how well
understood the strategies for learning outside the school environment were (Bowker,
2002). There is a belief that school trips are beneficial, as much in improving social
relationships, group dynamics and individual motivation as in knowledge acquisition.
Realising these benefits, however, is dependent on the teacher having a clear idea of what
should be achieved and structuring the visit to ensure that the goals are achieved.
The focus for his research was the Eden Project, a reconstruction of global biomes in
climate controlled environments in the south west of England. Bowker identified five
questions:
r What are the Eden Project’s educational aims?
r What are the learning objectives of schools visiting the Project?
r What strategies are used by schools visiting the Project?
r What strategies are used by the Project to communicate their messages?
r What strategies are effective in achieving the school’s objectives and the Project’s aims?
His approach was to base the research on schools that organised their own visits. This
clearly is a better test of the ability of teacher and the Eden Project to understand each
other than if the schools had made use of the Eden Project’s own visit programme. Data
was obtained from pupils, teachers from eight primary schools and through an analysis
of the presentation methods used by the Eden Project.
Three schools identified specific curriculum links as the purpose for their visit, others
identified social benefits. Obtaining data from pupils was achieved by observation on
the day of the visit and follow-up interviews in schools. These had to be tailored to the
age of the children and use was made of visual prompts. The focus of the questioning was
to see how far the children appreciated the purpose of the Eden Project, how effectively
the displays communicated that purpose and what differences existed in pupil response
according to the way the visit was managed.
What Rob Bowker did, on the basis of his study, was to conceptualise a complex
relationship between the visit location (in this case the Eden Project), teacher and pupil
and deduce conditions for effective learning based on what he learnt from the pupils. He
concluded that effective strategies included:
r a high adult/child ratio of 1:4 or even 1:2 to provide effective guidance and focus;
r structuring activities within the child’s ability to concentrate (about 40 minutes, he
suggests);
628 Research Methods for Education Part 3 The Process Of Data Analysis
(iii) Ethnography
(a) Phrasing the research question
Ethnography is a research framework that can operate as a means of configuring the
whole approach to research (the research strategy) or as a means (perhaps together
with other methods) of collecting data. We shall explore its use as an overarching
strategy.
The resolution of conflict and injustice is central to the way societies operate. Legal
frameworks protect and punish but in some circumstances they are inadequate. Physical
bullying at school might be such a case. It is assault and so a criminal act but many schools
(and police services) prefer to use other means (such as restorative justice where offenders
are made to hear the real impact of their actions) rather than criminalise young people.
At a larger scale of social conflict, other approaches have been tried. In South Africa
the Truth and Reconciliation Commission operated on a national scale with a form of
the restorative justice model and the process has been central to healing the wounds
between ethnic communities. But reconciliation as a state of mind evolves and what was
acceptable may change over time.
This was the premise adopted by Krisztina Tihanyi and Stephanos du Toit, two South
African researchers, and the driver for change that they examined was the education
experience. As they point out, ‘The lives of young South Africans present an interesting
paradox: they have no first-hand experience of Apartheid, yet they are surrounded by
its economic and social legacies’ (Tihanyi and du Toit, 2005, p. 25). Their research
was ‘whether the racial integration of South Africa’s education system has presented a
space or opportunity for reconciliation’ (p. 25). In other words, is the education process
moving reconciliation on and helping the different communities live more at ease with
each other?
They could have sought answers from a mass survey of young people but it is
doubtful whether this would have generated any insightful results because it takes
Chapter 14 PUTTING IT ALL TOGETHER 629
more than a questionnaire to penetrate feelings and emotions to the level needed
to gain answers to the question. Instead, they spent time in schools and saw what
was happening. Their strategy was ethnographic and the schools they studied were
sample cases, that is, they were selected to represent different aspects of the school
system.
We find this cycle of action and reflection in Christina Bergendahl’s work on teaching
chemistry. Her question was whether different approaches to laboratory learning would
have different outcomes for students. Her approach was to set up a quasi-experiment in
which three groups of students were given different approaches to experimentation: a set
of procedures to be followed, an open-enquiry/problem solving approach and a revised
open-enquiry approach with more direction. She tested student attitudes to laboratory
experimentation before and after and found that the open-enquiry approaches were
received more positively.
The outcome of this research then led her to consider how approaches to teaching,
learning and assessment could enhance the development of higher order thinking skills.
With a further group of students she introduced a variety of teaching and assessment
methods and collected data from student notes, assessment responses, observation
and interviews on their impact on thinking skills. From this data she was not able to
identify any single approach that worked best for all students but she was able to show
that combinations of different approaches that required students to interact with the
learning process in different ways had the most impact on the development of higher
order skills.
What we should appreciate from this is that her action research strategy utilised a
quasi-experimental framework followed by an action research reflective model. This
is yet another instance of the fact that we can mix methods and methodologies with
strategies to meet the needs of the circumstances.
(v) Experimentation
(a) Phrasing the research question
Assessing the effects of an intervention or controlled change in circumstance is an
approach that is widely found in the natural and behavioural sciences. The strict
laboratory conditions that make it so successful there are not often available to education
researchers, so when we are faced with the sort of question experimentation deals with
(‘Do these fail less often?’), we have to follow the principles but accept that our ability
to control external variables is limited. We should refer to this as quasi-experimentation,
though we do not always do so.
An important process outcome for higher education is to enable students to function
as autonomous learners (this is one of the higher order skills that Christina Bergendahl
was concerned to develop in her chemistry students). For some groups of students, such
as trainee teachers, it is important not only that they achieve this goal but also that they
are effective autonomous learners because, without this capability, their effectiveness
as teachers will decline with time. Two Belgian researchers, Wil Meeus and Peter Van
632 Research Methods for Education Part 3 The Process Of Data Analysis
Petergern and a Dutch colleague, Joost Meijher, explored how they could improve the
capacity of their teacher education students to function autonomously. Their approach
was to introduce a portfolio method of assessment. The authors give an example of
what they mean by this: ‘During her practice, she gathers evidence of her competency –
e.g. lesson plans, a video tape of an interesting intervention, a summary of the pupils’
learning progress, etc. Furthermore, she uses these materials to reflect explicitly on her
competency’ (Meeus et al., 2008, p. 363). However, to judge its impact, they needed a
benchmark against which to assess it. This meant that some students had to continue
with the old model of working and assessment (the traditional dissertation). In this
way they set up a quasi-experiment where the main influence on learning outcomes was
the mode of production of the materials to be assessed and the learning processes that
underpinned each.
There is always an ethical issue with this approach to experimentation that we should
note. How do we minimise the disadvantage to any who might be disadvantaged? Is
there a level of disadvantage that should not be tolerated? If so, should this be zero?
Of course, the implication of zero disadvantage is that partial changes should never
happen and that it is better to have total change – but then there is a possibility that
everyone is worse off than they would have been. The ethical question is whether
relative disadvantage for some is worse or better than absolute disadvantage for all. The
alternative is that no change will ever occur. When faced with this sort of circumstance
our solution, perhaps, should be to ensure that no one is worse off than they would have
been had the intervention not occurred.
Course A 93 28 121
Course B — 53 53
93 81
Chapter 14 PUTTING IT ALL TOGETHER 633
student capability would (or should) have a significant impact on performance. The
researchers resolved this issue by looking at grades given to students in an earlier module
(teaching practice). They did this by conducting two t tests:
r comparison of students from Course A and Course B in the portfolio group;
r comparison of students from Course A in the traditional and portfolio groups.
In neither case was there a significant difference in the student performance levels.
They could, therefore, be confident that student capability was evenly distributed
between the dissertation and portfolio models.
The second issue that the researchers faced was that supervision quality could affect
outcomes. If better supervisors looked after one group and less effective supervisors were
with another then this could skew the results. It is difficult to judge supervision standards
before the event, so the researchers identified dimensions of supervision practice that
might affect student performance. These dimensions were:
r work experience of supervisors;
r learning as knowledge acquisition;
r learning as the ability to use knowledge;
r teaching as a process of encouraging and stimulating students;
r learning as a co-operative and collaborative activity;
r learning as a process of self-direction;
r confidence in students’ capacity for self-direction.
For each of these dimensions, the researchers constructed a set of scales (between
four and ten for each dimension). These scales constitute the indicators. They compared
how the supervisors in the two groups compared in each dimension and found that
there was no significant difference in any apart from confidence in the students’ capacity
for self-direction. They looked at this in more detail by investigating the differences
between:
r Course A teachers vs Course B teachers.
r Course A teachers in the traditional and portfolio groups.
r Course A and Course B teachers in the portfolio group.
r Course A and Course B teachers in the traditional group.
None of these differences was significant, though this could be due to low sample
size. It was clear that variation in supervision was unlikely to influence student out-
comes.
How, though, to measure the outcomes? The researchers were not interested in
academic performance as measured by student grades but by the relative ability of the
methods to build the capacity for autonomous learning. They measured this using two
existing scales:
r The Reporting Autonomous Studying Questionnaire.
r The Inventory of Learning Styles for Higher Education.
The students rated themselves on these scales before and after the dissertation
experience.
The Reporting Autonomous Studying Questionnaire was used to deliver an overall
estimate of metacognition (how aware we are ourselves of our own learning). The
634 Research Methods for Education Part 3 The Process Of Data Analysis
Experiences of Perceived
parental support discrimination
Stress
symptoms
Global
self-esteem
Finnish Sense of
identity mastery
The Finnish researchers set out to test their hypotheses by collecting data from 175 Viet-
namese school students, the test group, and 337 Finnish students, the comparison group.
The students were given the same survey instruments, series of attitude scales to measure:
r school adjustment;
r stress symptoms;
r self-esteem scale;
r sense of mastery;
r parental support;
r cultural identity;
r perceived discrimination;
r proficiency in Finnish.
School adjustment is clearly the dependent variable in the model and the others
are independent variables. Each student was measured on these scales and additional
demographic data (age, sex, length of residence in Finland) were also collected. The data
was analysed in the following way.
r First a form of ANOVA was used to look at differences within and between the two
groups. None of the demographic factors was related to school adjustment though,
not surprisingly, the length of time a Vietnamese child had been in Finland was
related to lower stress, higher self-esteem, proficiency in Finnish and sense of being
in control. However, the analysis did show that the Finnish students were less well
adjusted to school than the Vietnamese.
r The second stage of the analysis was to correlate all of the independent variables with
each other. The correlation coefficients then represented the strength of the or
linkages in the model. The results are shown in Figure 14.2. The researchers concluded
that their model fitted the data, though three relationships were not significant:
national identity did not increase the sense of mastery, proficiency in Finnish did not
affect Finnish identity and Vietnamese identity did not affect self-esteem.
r The third stage was to revise their model. This is shown in Figure 14.3.
Experiences of 15 Perceived 33
parental support discrimination
16 26 19 Stress 32
symptoms
,01 25 18 Global
09 self-esteem 25
Finnish Sense of
identity 01 mastery 33
17 Perceived 33
discrimination
Experiences of 27 22 Stress 33
parental support symptoms
15 Vietnamese School
identity 39 adjustment
Proficiency 21 Global
in Finnish self-esteem 25
Sense of
25 mastery 36
also explicitly questioned the extent to which it was not justified. This opened up the
way into some interesting research and particularly interesting conclusions. But before
we look at these, we should assess the options that were open to them in terms of data
collection.
To do this we have to think about the circumstances that can give rise to a shortage
of teachers. We can say that the problems are on the supply side, that is, not enough
people are entering the teaching profession. Alternatively, the problem could be with
the demand. Has this increased over time? A third model could see a combination of the
two, a shortage of teacher supply in some subjects or areas in part due to an increased
demand for teachers or an increase in the numbers leaving the profession. This analysis
might direct us to identify the following data needs.
r We could conduct a survey of undergraduates (because these make up the majority
of entrants into the profession) to understand their career intentions and the relative
attractiveness of different areas of employment. This would give us a sense of the
future supply of trainee teachers and whether there was anything that could be done
to strengthen the supply.
r We could conduct a survey of teachers to understand what aspects of the work they
disliked and what impact this might have on their career intentions. This would allow
us to judge whether factors within the control of school management or government
were contributing to people leaving the profession.
r We could survey those who have to recruit teachers (education authorities and
schools) and ask what problems they are having with teacher recruitment and
whether their need for teachers had changed in the recent past or was likely to change
in the immediate future.
Gorard’s team’s solution was not to embrace the second and third of these ap-
proaches. Let us think why. The second accepts the assumption that there is a problem
and does not question it. If there is not a problem, there is no need to survey. As well,
they may have been suspicious of the quality of data they would get. If I wanted to
improve my career lot, I might represent myself as more disenchanted than I really am.
So the second solution would require an expensive survey for a problem that might
not exist.
The third solution represents a difficult data collection problem. Where does respon-
sibility for teacher employment lie? In some cases with the school and in some with
a managing authority? Does this mean we survey authorities in respect of the schools
they are responsible for and, separately, schools where responsibility for management
lies with the school? This could be a sampling nightmare. It would almost certainly be
a costly exercise identifying the two populations. Of course, we could always sample
schools directly but against this we have to acknowledge that there will be an awful lot of
them. In this situation time and cost are factors we must take account of.
The only survey the team did was of undergraduates and, even then, it was set in the
context of a secondary data analysis. Obtaining primary research evidence ourselves
(for a discussion of primary and secondary data see Chapter 4) is an expensive
business, particularly so when the scale of our needs requires a national survey. In these
circumstances we should always look to see what official data exist. Gorard and his
team’s focus was England and Wales. Because of devolved governance, this meant that
data sources for England and Wales had to be used. Their secondary data sources for
teacher recruitment, teacher vacancies, teachers in post, teachers in training, recruitment
to training and outcomes were obtained from: the General Teaching Council for
Chapter 14 PUTTING IT ALL TOGETHER 639
Wales, the Teacher Training Agency (now the Teaching Agency), the Universities and
Colleges Admission Service, the Universities’ Council for the Education of Teachers,
the National Assembly for Wales, the Department for Education and Skills (now the
Department for Education and the Department for Business, Innovation and Skills),
National Statistics and the School Teachers’ Review Body (responsible for advising
the Government on teachers’ salaries and conditions). The fragmentation of the data
between these bodies was clearly an issue but it does put into perspective that the
effort of bringing the data together and the loss of detail through having to use less
than perfect data sets was still more cost-effective than a mass primary data collection
exercise. This data was supplemented with (a) a survey of students in higher education
(to determine what proportion might consider teaching as a career and to profile their
characteristics) and (b) focus groups with trainee teachers (to establish why they had
taken the decision to train).
So what were the conclusions?
r ‘There is no great crisis, either passing or looming and so no need for further major
and headline-grabbing policy initiatives at a national level’ (p. 434).
r ‘Teacher training, in general, is an increasingly popular option for graduates’ (p. 424).
r ‘Most teacher trainees are...successful, in terms of qualifying and obtaining a post’
(p. 424).
r ‘Any problems tend to reflect the regional and subject dispersion of teaching staff
rather than any national shortages’ (p. 425).
r ‘Almost none of the crisis account of teacher supply is well-founded’ (p. 426).
And, in passing, Gorard et al. have some critical things to say about previous work in this
area that has accepted, uncritically, the idea of a crisis.
What can we conclude from this, in relation to how we plan and conduct our research?
It is this. While some research questions will predispose us to adopt a particular strategy
(such as action research or evaluation), there are many that just encapsulate an issue
and set no conditions as to how we should approach the research. We are free agents.
Of course, we have the possibility of rephrasing our research question (for example, into
a null hypothis) but this is just a consequence of our freedom to choose. It is how we
choose to answer the question that is the key issue and while, from our perspective, this
is a matter of obtaining data from here rather than there, using this method rather than
that and analysing the data this way rather than that way, from the perspective of those
who read our research, they will judge it in terms of ‘Does it work?’ and ‘Is it convincing?’
This is the test that all researchers have to pass.
As we have said many times, at the heart of good research is a well-planned programme.
There are many occasions when this has to be done formally as part of the research
procedure (for example, when submitting a proposal for an undergraduate or
postgraduate project or a bid for externally funded research) but, even if the research
is for our own personal satisfaction, planning is still an essential process. Here we shall
look at the process of preparing a research proposal. What will become apparent is that
the process of preparing a proposal is a revision of the main themes in this text.
Why?
Literature Thinking
What if?
Researcher Policy and Research
What
Experience linking
should?
Personal Philosophical
values perspectives
Below are contexts for a research question. Your storyteller is one that many people will be able to
tasks are: conjure up. Listening, however, can cover a range
of engagements from the quietly bored to a child
r to identify which research strategies might be
actively experiencing the plot or place in their
appropriate for the issue;
mind’s eye. Do we know, though, how children
r to select one which you would use and to explain engage with books? If there are differences are
why; they due more to the type of book or to the
r to set out the research question you would seek to child? What could you do to find out?
answer. Source: Moschovahi, E. and Meadows, S. (2005) Early
Childhood Research and Practice, 7(2), no page num-
At the end of each context statement is a reference to
bers, available at http://ecrp.uinc.edu/.
the paper from which it came and where you can see
how the researcher(s) tackled the topic. 3. Individual performance is something that sports
stars and musicians have in common, and the pro-
1. Training programmes for students wishing to be-
cess of preparing for that outcome is long and
come primary or secondary teachers are obviously
arduous. Relationships are a crucial part of this
quite different. This reflects the development dif-
process, the sports coach and music teacher both
ferences in the children and a curriculum designed
play the same role with their students. The learn-
to meet the needs of each age group. There is al-
ing and training process is also important; differ-
ways an issue over how effective a professional
ent methods may produce different results. Very
programme is in developing students so that they
often, the model we have of the development
can meet the demands of the workplace. How
process is one where the student is a passive play-
would you assess whether courses for training pri-
er at the receiving end of instruction and advice.
mary teachers and those training secondary teach-
But what if we were to understand and appreciate
ers developed professional competence?
the student’s experiences, emotions and feelings,
Source: Wong, A., Chong, S., Choy, D., Wong, I. and could this improve development? Your task is to
Goh, K. (2008) Australian Journal of Teacher Educa- consider how this might be done with students at
tion, 33(3), 77–93, available at http://ajte.education. an early stage of learning a musical instrument.
edu.edu.au.
Source: Creech, A. and Hallam, S. (2006) ‘Every Picture
2. The story has had a place in the classroom Tells a Story: Pupil Representations of Learning the
experiences of young children for a great many Violin’, Educate, 6 (1), 35–56, Available at http://www.
years. The image of children sitting around the educatejournal.org.
Out of this process a research issue will emerge. Sometimes, however, we have to kick-
start the process. Activity 14.1 is an attempt to do just this. Learning how this happened
for others can also help us with the process. Geoffrey Walford, now Emeritus Professor
of Education at Oxford University, edited a book in which researchers reflected on the
research process (Walford, 1991).
r Stephen Ball (Professor at the Institute of Education in London) explains that his
research on the micropolitics of a school came out of a course on school organisation
he taught. He found that there was little literature and that the literature that existed
was poor quality.
r Barbara Tizard (Emeritus Professor at the Institute of Education in London) and Martin
Hughes (then Professor at Bristol University) describe a study they conducted with
others into 4-year-old girls at home and later at school. They explain that their work
had its origins in a debate about language and social class, its implications for language
development and education and the wider issue of the benefit of pre-school education.
r Geoffrey Walford’s stimulus to carrying out research on city technology colleges
(state-funded schools in England and Wales that are independent of local political
control) was their introduction as part of new Government policy.
Now look at Activity 14.2.
642 Research Methods for Education Part 3 The Process Of Data Analysis
Sometimes what a government thinks is important Write a reflective review that brings out:
can help us focus on an area of interest and oppor-
t the knowledge that you have gained;
tunity.
t the areas/themes that you think are particularly
Go to the Scottish Government’s website (www.scot- important in educational terms;
land.gov.uk), follow the Publications link and then t why you think they are important;
search ‘education’ or ‘early years’. Look through the t which topic you might be interested in exploring.
publications for the last two years, categorise them
and assess where the research priorities should be.
Either for a theme you identified in Activity 14.2 or a t Where are these data located? How will you assess
topic that interests you, assess it against the follow- them?
ing questions: t How long will it take to collect the data?
t Are there any additional costs other than your
t What evidence can you produce that others will time in collecting the data?
think that it is a significant issue? t What length of time is available for the project?
t What data will you require in order to draw
conclusions?
Chapter 14 PUTTING IT ALL TOGETHER 643
r The nature of the environment in which we are carrying out the research could be an
influence. It could be social concern about quality in a specific school or an educa-
tional concern, for instance differential rates of exclusion.
r We may focus on a specific aspect of an issue because that is manageable within the
resource envelope available to us. This should not be the total argument for carry-
ing out research but it is a reasonable supplementary argument for anyone with a
restricted timescale for the work (such as an undergraduate student).
What else can we use in the argument to bolster our claim for the validity and value of
our research theme? The answer to this is the claims made by other researchers. We use
what they say about their research to support our claim for our own research. The danger
of this is that the whole edifice of claims can become like a house of cards, liable to fall
down when a foundation is undermined. What we have to do to prevent this is ensure
that the claims are rooted in some incontrovertible evidence. Our literature search (see
Chapter 5) should be used to provide support for our claim. As researchers we should
enhance our skill in synthesising and shaping other people’s work to help make our ar-
gument. Activity 14.4 provides an opportunity for this.
There is, quite naturally, great interest in the a literature search to consider what evidence exists.
trajectories people follow through their lives. If, in You should assemble at least ten references, summarise
their background, people have a sound educational their conclusions and write a report of 500–700 words.
foundation, we often refer to their trajectories as ‘career
After this, look at Sections 3 and 4 of the Research
paths’ or ‘employment paths’. Because of the fast-
Report by Peter McCarthy’s research team from the
moving nature of the twenty-first century economy, we
University of Newcastle upon Tyne, below.
know that we have to embrace new skills and ways of
working and many employers as well as government
Source: McCarthy, P., Laing, K. and Waller, J. (2004)
supports us with lifelong learning opportunities. But
‘Offenders of the Future? Assessing the Risk of Children
what if our original premise is not met, that there is no
and Young People Becoming Involved in Criminal or
sound educational background, what then? Antisocial Behaviour’, Research Report 545, Department for
This is the question: ‘Are there patterns in the lives of Education and Skills, HMSO, available at http://webarchive.
young people that will predispose them to a trajectory nationalarchives.gov.uk/20130401151715/https://www.
that leads to behavioural problems, anti-social behav- education.gov.uk/publications/eorderingdownload/
iour and, ultimately, criminality?’ Your task is to conduct RR545.pdf (accessed November 2013).
Chapter 14 PUTTING IT ALL TOGETHER 645
To be successful at this stage we have to know about data sources, methods of collect-
ing data and methods of describing and analysing what we have collected. This means
that we have to be confident about everything from Chapters 6 through to 13! We have
to be able to argue for this method in preference to that method. There are two things we
can do to convince an audience:
r First, we can justify our selected methods (or non-use of methods) on the basis of
their strengths, weaknesses and requirements – that is, on the basis of their suitability.
r Second, we can demonstrate that the methods we propose have been successfully
used by others in similar circumstances. This is validatory evidence. And, once again,
everything has to be explicit.
As part of this whole process it is good practice to prepare a plan that sets out a schedule of
activities. If you are doing a project as part of a course, many institutions will require a plan
as part of the project proposal. The reason for this is that they need to be sure that you un-
derstand what you are taking on. A plan is a clear demonstration that you understand what
is required to make your strategy operational. If you do not appreciate what it takes to gather
the data (and many people new to research under-estimate the time required), this will be
apparent in the plan. A plan, however, is more than an allocation of time for a process in the
strategy because the process will inevitably have different elements to it. If data is going to
be gathered through a questionnaire, the plan should show how much time is allocated to:
r developing the questionnaire;
r identifying the population;
r testing the questionnaire;
r revising the questionnaire;
r establishing the sample size required;
r determining the sampling method;
r drawing the sample;
r contacting the sample;
r administering the questionnaire.
And for each of these stages we would have to ask, ‘What could go wrong?’ and build in
a contingency.
There should never be any time in a research programme when nothing is happening.
Amongst activities that can always fill time are:
r ensuring that we have the skills to undertake an activity later in the schedule (for
example, that we know how to input data into SPSS or any other program and that
we are confident that we can interpret the results);
r continuing a literature search and review;
r drafting sections of the final report (for example, the argument for the research, how
other people have approached the issue, the strategy and method adopted);
r reviewing the plan to see if our experience suggests we should make changes.
of what we have found. In most cases how we do this is fairly straightforward and will fit
into one of the following categories:
r We can explain what we have found in theoretical or conceptual terms.
r We can judge whether an action or activity has met its objectives and how modifica-
tions to processes can improve outcomes.
r We can identify courses of action that should not be followed to avoid failure or that
should be followed to ensure success.
With statistical and numerical data (both data that we have collected ourselves
and data already published), we should begin by portraying it and describing it.
Shape and pattern in our data are indicators of processes at work that we should
seek to understand and explain. The methods we looked at in Chapter 10 allow us
to do this and also to manipulate the data to explore relationships. While Chapter
10 introduced us to graphical methods, Chapter 12 introduced us to the idea of
numerical descriptions of data (the ideas of central tendency, spread, peakedness
and skew). Chapter 13 showed us how we could look for and assess association and
relationships in our data.
With non-numerical data (observations, text, audio and visual recordings) we have to
choose an approach that fits in with (a) our research position and (b) our research data.
Are we neutral researchers seeking ‘truth’? Do we believe that our task is to understand
‘truth’ as people themselves appreciate it? Do we think that how people respond to life
opportunities is conditioned by where they sit in terms of social, political and economic
relationships? And if we do, is our task as researchers to explain this to other researchers
or to help people understand their situation and free them from their situation? All
of these are positions that researchers can adopt and each pushes us towards a way of
working and a way of interpreting what we find. Our data and our research question
also influence our selection of a strategy. If our purpose is to see how people interact
through talking, then conversation analysis may be the first approach we should look
at. If our purpose is to see how organisations use words to present themselves, we may
look at discourse analysis or semiotics as an approach in the first instance. If we want
to build a model from our investigation, perhaps grounded theory should be what we
consider first.
What we should take away from this is how similar qualitative and quantitative
analysis of data actually are in their requirement that we understand the things that
affect our choice of method before we start. With quantitative analysis, the research
hypothesis and measurement scale are key influences. With qualitative analysis, we
should understand the influence of our philosophical position and data. In both cases
we are dealing with a research approach that is ‘joined up’, where what we do in one part
of the investigation affects how we proceed in another. If we do not have this overview,
we can find that we create problems for ourselves.
we should always remember that when we started out someone with experience thought
that the research theme was a good idea.
There are, however, some things that we can do to sustain our commitment. We are
driven forward by the goal of producing a report, dissertation or thesis. We recognise this
as the output of all our work. It demonstrates how we have responded to an issue and
embodies how we have intellectualised it. It is our product and its existence should be
a matter of satisfaction and pride. If, however, we focus only on the end product of all
our work, we will fail to appreciate the other benefits that doing research brings. Let’s
enumerate them.
r Research will inevitably take us into new areas. We will read new authors (and with
the worldwide Web available to us these can be drawn from all parts of the globe).
This process is stimulating. We begin to make connections, tie up things in different
ways and find new patterns. The process advances our thinking and our understand-
ing and, ultimately, puts us in a position to make real conceptual or practical ad-
vances that arise when our minds are stretched.
r Second, we become aware of new approaches (ways of collecting data, statistical
tests) that we may substitute for others in our plan. Even if we do not do this, we
have expanded our armoury of procedures.
r Third, we gain experience. We see what works and what does not. The changes we have
to make to the way we conduct our research may be the result of a failure in planning
or the result of something unforeseen. Either way, we have learnt something.
The great danger is that we will be so caught up with producing the end product that
we will lose sight of how the research process is changing us. It is good practice to reflect
on our work and to maintain an accurate analytical report or diary that identifies not
only what we did but what we learnt from doing it.
policy, we find that education research is still concerned with impact; why did or didn’t
this succeed? How can we deliver multi-agency working? Do schools with independent
governance achieve more for their pupils? It would be hard to judge that this knowledge
is not valuable.
But education research is valuable in another way too. Research is undertaken by
students, by teachers, by lecturers and by research professionals. For each of these, every
time they start a research project, they begin a journey of personal and professional
learning and discovery. Research is an exercise in solving problems and this has an
impact on the personal development of everyone who is or has been a researcher. As
individuals create knowledge, they grow their understanding and their confidence to
meet new challenges. This process has repercussions for the education sector. There is a
willingness to accept and act on a weight of evidence. There is active debate on goals and
methods, something that is absolutely necessary for the best ideas to emerge. All of this
grows capacity and capability within education and means that education as a sector is
more capable of meeting the challenge of change.
However, there are consequences in the way in which educational research happens
that can make life difficult for the new researcher. The educational research production
process is more like a cottage industry than anything else. This is not to say that the
research output is not good but that there is a large quantity of it worldwide and, until
recently, there was not a lot of joining up the massive research output taking place. For
this reason, it is important that every researcher looks at books and journals whose goal
is to synthesise research in particular fields, before starting any research enquiry. Because
what we intend for our research may well have been done before.
Research is a messy business. It has to be because the issues we face and the questions
we tackle are so varied and we have to address them in unique ways. As our knowledge
and understanding of the research process grows, so will we gain in confidence to do
it our way. The journey to become a good researcher is not short but it is satisfying.
Research well done is exhilarating and always a satisfying element of any career.
Summary
In this section we have been concerned with the process of shaping a research
approach. There are some key learning points to this.
Further reading
References
Bergendahl, C. (2003) ‘Acton Research as a Means of Moschovahi, E. and Meadows, S. (2005) ‘Young
Professional Development: Reflections on Research Children’s Cognitive Engagement during Classroom
and Action in University Chemistry Education’, Jour- Book Reading: Differences According to Book, Text
nal of In-Service Education, 29(3), 363–374. Genre, and Story Format’, Early Childhood Research
Bowker, R. (2002) ‘Evaluating Teaching and Learning and Practice, 7(2), no page numbers (available at
Strategies at the Eden Project’, Evaluation and Research http://ecrp.uiuc.edu/v7n2/index.html).
in Education, 16(3), 123–135. Sandford, R., Duncombe, R. and Armour, K. (2008) ‘The
Creech, A. and Hallam, S. (2006) ‘Every Picture Tells a Role of Physical Activity/Sport in Tackling Youth
Story: Pupil Representations of Learning the Violin’, Disaffection and Anti-Social Behaviour’, Educational
Educate, 6(1), 35–56. Review, 60(4), 419–435.
Gorard, S., See, B.H., Smith, E. and White, P. (2007) Tihanyi, K. and du Toit, S. (2005) ‘Reconciliation
‘What Can We Do to Strengthen the Teacher Work- Through Integration? An Examination of South
force?’, International Journal of Lifelong Education, Africa’s Reconciliation Process in Racially Integrating
26(4), 419–437. High Schools’, Conflict Resolution Quarterly, 23(1),
McCarthy, P., Laing, K. and Waller, J. (2004) ‘Offend- 25–41.
ers of the Future? Assessing the Risk of Children and Walford, G. (ed.) (1991 and 2003), Doing Educational
Young People Becoming Involved in Criminal or Research, Routledge, London.
Antisocial Behaviour’, Research Report 545, Westera, W., van den Henk, J. and van de Vrie, E. (2004)
Department for Education and Skills, HMSO, avail- ‘Strategic Alliances in Education; The Knowledge
able at http://webarchive.nationalarchives.gov. Engineering Web’, Innovations in Education & Teaching
uk/20130401151715/https//www.education.gov.uk/ International, 41(3), 317–328.
publications/eorderingdownload/RR545.pdf Wong, A., Chong, S., Choy, D., Wong, I. and Goh, K.
(accessed November 2013). (2008) ‘A Comparison of Perceptions of Knowledge
Meeus, W., Van Petegem, P. and Meijer, J. (2008) ‘Port- and Skills Held by Primary and Secondary Teachers:
folio as a Means of Promoting Autonomous Learning From the Entry to Exit of their Pre-service Pro-
in Teacher Education: A Quasi-experimental Study’, gramme’, Australian Journal of Teacher Education,
Educational Research, 50(4), 361–386. 33 (3), 77–93.
This page intentionally left bank
Appendix 1
In section 6.8 we saw how to calculate a sample size using on-line calculators. In this
Appendix, we shall see how we can do this for ourselves. To understand, you should
have read:
t Chapter 11 – calculation of the mean and standard deviation and standard error.
t Chapter 12 – derivation and use of z values.
The first approach to determining the size of our sample is to ask, ‘How accurate do we
want our sample to be?’ or, to put it another way, ‘How much error are we prepared to
accept?’ To understand how we reach a decision, we have to return to something we first
came across in Chapter 3. Here we looked at how many combinations of a given sam-
ple size there were in a population whose size was known. The answer, as the worked
example in Case Study 3.3 showed, is an awful lot. Let us think about all of these pos-
sible samples for a moment and, to help understand the point, let us imagine that our
research requires us to draw a sample of teachers from a single administrative area. The
samples we can draw will have two extremes.
t On the one hand, we could draw samples that are composed of the most typical teach-
ers. If we were to calculate the average age of the teachers in our samples, they would
be very close to the average age of all teachers.
t On the other hand, we could draw samples of the most atypical teachers. At one ex-
treme, there is a sample of the youngest teachers in the area and, at the other, an
equally valid sample of the oldest teachers. Remember that we are talking about all
possible samples that we might draw, so while the average age of all teachers might
be 37 years and 8 months, there are valid samples that could represent the average
age as 24 years and 2 months or 64 years and 9 months. While these are valid sam-
ples, they are not particularly representative. There are, of course, many, many other
Appendix 1 CALCULATING SAMPLE SIZE 653
Figure A1 Sampling distribution: plot of mean values of all samples of a given size
samples lying between these two extremes and distributed around the population
average. Fortunately for us, if we draw the sample randomly, then we are more likely
to choose average teachers simply because there are more of them.
If we were to plot all of our possible samples on a graph, then we would finish up
with a distribution like that in Figure A1. What this shows is that, on the extreme left and
right of the curve is a single sample of all of the youngest (on the left) and all of the old-
est (on the right) teachers in the area. In the centre of the distribution, around the aver-
age of the population, are many samples that generate an average teacher age close to the
population average. This distribution is called the sampling distribution. It is a theoretical
distribution that shows every possible sample of a given size.
One of the things we should note about this distribution is its shape. It is the same
as the normal distribution (that we first met in Chapter 12). This sampling distribution is
also a normal distribution. Whether we have a sample of 20 students, 50 students, 100
students or 1,000 students, the sampling distribution will always have this bell shape. The
normal distribution has some important characteristics. First, it is symmetrical (around
the population mean in our example). Second, we can define limits within which a given
proportion of the distribution (in this case of all possible samples for a given size) will
fall. These limits are called standard deviations (we met them in Chapter 12). In a normal
distribution the following are constant:
t 68% of the values in the distribution fall within one standard deviation either side of
the centre, 34% to the left and 34% to the right;
t 95% lie between two standard deviations of the central point, 47.5% to the left and
47.5% to the right;
t 99.7% lie between three standard deviations of the central point, 49.85% to the left
and 49.85% to the right.
We can use this information about the characteristics of a normal distribution to cal-
culate sample size. In essence, the question we shall be asking is, ‘How accurately do we
want to estimate the population average?’
654 Appendix 1 CALCULATING SAMPLE SIZE
Let us change our terminology a little and call the average the mean (see Chapter 12).
It is a simple calculation that most of us are familiar with – add up all of the values and
divide the total by the number of values. We can express it using a statistical equation:
n
x
x5 ∑
1 n
∑ represents the process of summing all the values of x from the first (1) to
1 the nth
— indicates the division process
n is the total number of xs
If we turn back to our sampling distribution, it is clear that the accuracy with which
the samples estimate the population mean decreases as the sample moves towards the
extremes of the distribution. The implication of this is that when we calculate a sample
mean, each and every one is likely to have an error in its estimate of the population mean.
The scale of the possible error is derived from the sampling distribution. For example,
there is a 68% chance that the sample is drawn from within one standard deviation of
the population mean. Thus the standard deviation of the sampling distribution is the
standard error of the mean. Again, we can express this as a simple statistical expression:
SE X 5 / n
deviation. From a statistical point of view, this is a pretty risky situation. It would be
better to be 95% confident, that is, two standard deviations. The formula then becomes:
(1.96 3 22.4)2
n5 5 77.1
5
The only value that we are not familiar with here is 1.96. This is called a Z score and
it is derived from the normal distribution (see Chapter 13). The Z value identifies the
proportion of the normal distribution that lies between the central point (0) and any
Z value. If we are concerned with both sides of the distribution, then the proportion is
double. The Z value for a 95% level of confidence is 1.96, that is, 95% of a normal dis-
tribution will fall 61.96.
1.22
90% (.10) where n is sample size
n
1.36
95% (.05)
n
1.63
99% (.01)
n
For values below 35, refer to tables.
n1 1 n 2
90% (.10) 1.22 where n1 and n2 are the two sample sizes
n1n1
n1 1 n 2
95% (.05) 1.36
n 1n 2
n1 1 n 2
99% (.01) 1.63
n 1n 2
Glossary
Analysis of variance (ANOVA) A statistical test that seeks to establish whether two or more
data sets are significantly different from each other. The test uses the variance in the data sets
and apportions the variance to that within the individual sets of data, that within the combined
sets of data and a residual (the difference between the sum of the individual sets and the
combined set).
Asynchronous A process or event that is not in time with another. For example, learning with
tutor feedback subsequent to student activity is asynchronous as is an interview conducted by
email.
Attitude A perspective held by a person in respect of an object, concept, idea, event or other
person. The attitude is usually represented as being for or against.
Attitude scale A means of quantifying a person’s attitude. The scale is usually a composite measure.
Bar graph (chart) A representation of the frequency of a categorical variable (such as ethnicity)
in which the horizontal axis represents the variable and the vertical shows the frequency (see
also histogram).
Benchmarking This term is used in two ways. It refers to the comparison of, say a school or
education authority, with another that has similar characteristics. It is also used in an evaluative
study to assess the impact of an intervention. The benchmark is a description of the situation
and outcomes before the intervention.
Bias A process that generates data that are not representative of the parent population. Bias
can occur through faults in sampling, poor practice in data collection and inappropriate (even
deliberate) misinterpretation of evidence.
Bivariate This means two variables. It is usually used where two variables (such as the number
of children with free school meals and the index of deprivation for their community) are plotted
against each other.
Boolean A form of algebra developed by George Boole and used in search engines. It allows us
to include/or exclude items, to group terms and so on.
Case study A study of an individual circumstance or situation (such as a failing school). The
case can be selected because it is unusual or because it is typical.
658 GLOSSARY
Categorical data Data that can be put into a group or category, such as independent and
public schools and the grouping of people according to their lifestyles. The classification of
data in this way enables some quantification (numbers of data units in the group) of data that
is, inherently, not quantifiable. For this reason, categorical data is sometimes referred to as
counted data.
Cause The idea that there are consequences when something occurs. The idea of a cause
creating an effect is often central to social science research and certainly essential if we conduct
experiments, construct hypotheses or develop theories or models.
Cause and effect A key approach in social science and educational research in which a
researcher attempts to show that a variable (the cause) produces and impact (the effect) in a
second variable or in a situation. The impact is usually measured as a quantity.
Census A survey of a whole population. Most states conduct a population census, typically
every ten years.
Central tendency The idea that in any bundle of quantified data we can identify values or
types of data that are most typical (see mean, median and mode).
Chernoff face A visual depiction of many variables in the form of a face, with variables taking
the form of the mouth, eyes, eyebrows, ears, nose and so on.
Chi-square (x2) A non-parametric statistical test that uses counted data to determine the
significance of any difference between an observed distribution of data and a distribution that
would be expected given specific assumptions.
Citation The process of referencing academic and other work that has informed a researcher’s
own study.
Closed question A question that gives the respondent no freedom to respond other than with
the categories pre-chosen by the researcher. Researchers can increase the flexibility of closed
questions by adding a response category ‘other’ and asking respondents to give details.
Cluster sampling A method of sampling that is useful for dealing with large data sets. The
population is divided into groups (clusters), often on a geographical basis. The clusters are
sampled as individual units and the members of the population within the selected cluster are
the source of data.
Codes The names given to themes selected for their significance to the research issue in
qualitative research (see coding and tags).
Coding The process of identifying concepts or themes from text, speech or behaviour. Codes
are umbrella ideas that link together basic data units. The coding process links together similar
words, ideas, actions or behaviour and progressively seeks to build them into an interpretative
hierarchy.
Coding frame This is the set of codes that have been devised to analyse data. The coding
frame should be sufficiently well developed and clearly explained so that two different
researchers will process the same data in the same way.
Coding software Software that can be used to handle qualitative data. It can be used to add
codes, bring codes together, add comments about the coding process and, in modern versions,
even predict the allocation of codes to data.
Coefficient of variation A measure of the relationship between the mean and the standard
deviation.
Cognitive interview An interview approach in which the interviewee is asked to place him
or herself in situations or to remember situations about which the researcher wishes to collect
data. The technique was developed for police witness interviewing.
Cohort A group in a sequence of groups. For instance, a cohort of students could be studied
from the time they enter the educational system to the time they leave.
Collective interview An interview with more than one person present and answering questions.
Compound Indicator An indicator of a variable that consists of several parts brought together
in a single measure.
Confidence interval This is the range within which the true value of an estimate will lie. For
example, if we sample voting intentions, we can say that 30% will vote for party A plus or minus
3%. The range 27% to 33% is the confidence interval. The level of accuracy we require (that is,
a smaller rather than a larger interval) will increase sample size.
Confidence level This is the proportion of times we would expect our sample to deliver an
estimate with a specified confidence interval (see definition). Confidence levels are expressed
in probabalistic terms. In social research 95% is commonly used, though 90% and 99% are also
found. The higher the confidence level required, the larger will be the sample.
Connotation Meanings that are attached to the literal meaning of a word by a group or
individual. Connotative meaning can be the image we bring to mind or the meaning we ascribe
to an image (see denotation).
Continuous data Data that can be measured either on a scale of infinite length or data that
over a range can have an infinite number of measurement possibilities. For example, when
measuring the height of adult males, there are an infinite number of measuring points.
Control variable A lens through which we examine data to see if it produces any
patterning in the character of the data. In survey research, for instance, we may use gender,
age, ethnicity and social status as control variables to see whether they reveal differences in
responses.
Conversation analysis A qualitative approach that explores the verbal interaction between
two or more people. The emphasis is on the pace, lilt and tone as much as the meaning.
Conversation analysis has a well-developed framework for analysing verbal interaction.
Correlation A means of measuring the degree of association between a pair of variables (see
correlation coefficient).
Critical theory Within social science and in this book, critical theory is a philosophical
perspective that informs the selection of a research issue and the analysis and interpretation of
research data. In broad term its perspective is to remove or reduce inequalities in society.
Cumulative frequency graph A distribution in which all occurrences below a value are added
together. When using data organised in classes, classes are arranged in a meaningful way (e.g.
from low to high) and the totals in each class added together, the first class added to the second
and then the first and second to the third and so on. Cumulative frequency graphs are also
known as ogives.
Data Facts, numbers, opinions, attitudes, actions, appearances and so on that we use in
any combination and from which we extract information that will help us answer a research
question.
Data archive A depository for primary research data. Data are usually stored in digital form.
Data visualisation An approach to data analysis that portrays data so that shape, patterning
and visual forms can be used to suggest processes at work.
Deductive A way of representing research in which the purpose of the activity is to test the
appropriateness of a theory or model. With this approach, the researcher postulates what will
or should happen and then collects data to test whether the world from which the data came
fits the theory or model.
Degrees of freedom A procedure used in statistical testing for taking account of sample size.
Conceptually, degrees of freedom in statistical tests are the number of data values that can vary
when we calculate the test statistic without changing that statistic. For the non-mathematician,
this is confusing so it is easier to stick with the idea that they allow for the influence of sample
size on the variability of our test statistic.
Dependent variable In a process (cause–effect) relationship, the dependent variable is the one
that changes as a result of a causative process (see independent variable).
Descriptive statistics A range of statistical approaches that are used to describe the shape and
characteristics of data. Conventionally the qualities that are described are averageness, spread,
peakedness and skew. Form in data can, however, appear as sequences and patterns in time or
space, so we should be aware of methods that describe data in these ways as well.
Directional hypothesis A positive hypothesis that specifies how variables differ, for instance
that a is larger than b.
Discourse analysis A qualitative approach that explores how we use words to construct as well
as convey meaning. The term has been used by several disciplines, which can create confusion
because the approaches can be quite different.
Ecological fallacy The assumption that a relationship that holds true for a group is equally
applicable to a specific individual. This is a danger in correlational analyses.
Effect size This is a quantification of the influence that one variable has on another.
Particularly with small samples, it can be a better estimator of impact than a significance test.
Ethics A branch of philosophy that deals with what is right and wrong, good and
bad. In a research context, ethics is concerned with the rights and protection of respondents
and of researchers, the misuse of data and accuracy, honour and responsibility in reporting
results.
GLOSSARY 661
Evaluation An approach to research that seeks to identify whether goals, outputs and
outcomes have been achieved, what processes lead to which outcomes and to assess the
effectiveness of actions, programmes and interventions.
Existentialism A flexible philosophical concept that can take on different nuances with
different people. Central to most are the ideas that our experiences shape our lives and that our
life choices are made on the basis of our freedom to choose. These ideas stand in contrast to
those espoused by structuralists.
Experiential learning A process of learning by doing (as opposed to rote learning or memorising).
Experiential learning has become significant as societies have become more concerned with skills
development and skill transfer from the learning environment to the work environment.
Experiment A research strategy that seeks to identify the effect of a single variable or a set
of variables acting both individually and in combination upon a situation. The experimental
condition seeks to ensure that the variables under test can vary but nothing else can.
Explanatory theory Theory derived from observation in the real world which seeks to explain
how that world operates.
Factor analysis A quantitative approach for detecting order in associations between variables.
The method seeks to identify which correlations between variables cluster together and calls
these clusters ‘factors’.
Factorial design An experimental design used to test the effect of several variables at the
same time or the same variable in different concentrations or, in the most complex situations, a
combination of the two.
Fisher’s test A statistical test for small samples cast as a contingency table.
Focus group A means of collecting data from a group of people. Members are selected
to represent a particular interest or set of interests that are significant to the research. The
strength of the focus group approach is that discussion amongst members can reveal feelings
and sensitivities that simply responding to a question will not.
Frequency distribution A chart that shows the number of counts of a variable or a data range
within a variable. We could construct, for instance, a frequency distribution of the ethnicity
of children in a school and a frequency distribution of their ages. The former would show the
number of children in each ethnic group and the latter the number of children in each age group.
Gantt chart A chart designed to show the sequencing and duration of activities in a process or
programme (such as a research programme).
Hawthorne effect A specific intervention effect (see definition) in which subjects change their
behaviour sometimes to confound the proposal being tested by the research. Named after a
factory in the USA where the effect was first identified.
Hermeneutics Hermeneutics is the discovery and study of meaning. To this extent it sits
above ways of researching (such as positivism). It can be used as a method to develop
other ways of discovering and studying meaning (for instance, the critique of positivism
that has given rise to qualitative methodology in general and humanistic approaches in
particular).
Histogram A visual representation of continuous data grouped into categories (or classes). The
horizontal axis of the histogram shows the class boundaries of the variable and the vertical axis
the frequency of occurrence. Also called histograph. See bar chart.
Holistic Referring to something that is looked at in its entirety. In a research context, it implies
interdisciplinarity. Education as a research field is implicitly holistic because it will often look at
the individual or organisation in a social, economic and political context.
Humanism A broad class of philosophical standpoints that highlight the need to understand
human experience. In terms of research practice, this becomes the focus of the research, and
people are seen as individuals and not as contributors to a statistical description.
Hypothesis A statement about the likely effect of a variable or variables on a situation. The
hypothesis can be derived from a theory as to what an effect should be or from a situation
observed elsewhere.
Index number A measure of an activity or quantity calculated in such a way that 100 is taken
as the base. Index numbers are often calculated for time series.
Indicator An indicator is a set of data that stand for a variable. For example, if our variable
was ‘boredom’ (and our study was into factors that affected student attainment), we could use
video evidence of student inattention or we could use self reports by students.
Inferential statistics A set of statistical approaches through which we can identify statistically
significant relationships between variables.
Informed consent The practice of ensuring that those who provide data are willing to do
so under conditions that are acceptable to both the researcher and the researched. Informed
consent is one of the mainstays of an ethical approach to research.
Intelligent design An argument in relation to evolution, the creation of the world and of
the universe that sees the order that exists as being the product of some supernatural process.
Its significance educationally is the push to teach intelligent design as part of a scientific
curriculum. Most scientists would argue that this is inappropriate as the underpinning theory is
not testable but an expression of belief.
Interquartile range A quantitative measure of spread, being the range between the 25% of
data values immediately above the median and the 25% immediately below the median when
the whole data set is ranked from high to low or vice versa.
Interval scale A measurement scale in which the difference between all scale points is the
same. There is no zero which represents no quantity of the thing being measured.
Intervening variable A variable which does not directly affect an independent variable but
which impacts on the process by which the independent variable impacts on the dependent
variable.
Intervention effect The idea that the process of investigation can influence what is being
investigated. This is of particular concern to researchers who believe that they should strive to
be neutral and that their observations should be unsullied. There is, however, much evidence
that this is not the case, from the well-known ‘Hawthorne effect’ (see reference) to the problem
of preventing interviewees over-identifying with the interviewer and the objects of the
investigation and shaping their responses to please the interviewer. At the other extreme there
are researchers who believe that it is their task to create change, that is produce intervention
effect in their subjects.
Kolmogorov-Smirnov test A statistical test that compares the difference between two
distributions (for instance, between the grades gained by boys and girls in a class test).
Leptokurtic A distribution that is more peaked than a normal distribution with the same mean.
Liberal Both a political philosophy and an approach to education. At the heart of the concept
is the idea of free choice tempered by social and economic responsibility. In educational terms,
it implies a broad education as a means of producing a balanced individual capable of exercising
a socially motivated (or constrained) choice.
Lifeworld This concept (linked with phenomenology) refers to the world as lived in and
experienced by the people in our investigation. The lifeworld is therefore an individual and
subjective idea (as compared with the idea that there is a ‘real world’ which most people
experience in the same way) (see phenomenology).
Likert scale A scale developed to represent the attitude or views of people. For example, if
we wished to assess the attitude of teachers to teaching as a profession, we could construct
a Likert scale. This would consist of a set of statements to which respondents would indicate
their degree of agreement on a scale (of normally five scale positions from agree to disagree).
The total of their scores for each statement would represent their overall attitude. There
are procedures for ensuring that a statement is appropriate and that there is an appropriate
balance of statements.
Line graph A graph showing the relationship between two variables both measured on
continuous scales. The points on the graph created by the two variables must fall on a
continuous line. A time series is also a line graph.
Longitudinal study A study in which changes over a long period of time (relative to the issue
being researched) are one of the prime interests. Longitudinal studies are often concerned with
the profiling of the characteristics of an age cohort over a period measured in decades.
Matched group An experimental design in which the test group and the control group are
matched in terms of key characteristics. These might be having children of the same gender, age
and social status or ensuring that the environment is the same for both test and control (e.g. by
using the same space for both).
664 GLOSSARY
Matrix chart A way of portraying three or more variables on a grid. The computer
programs that produce the visualisation are usually interactive and allow researchers to
manipulate data.
Mean A measure of the ‘averageness’ of a data set. The mean is usually calculated as the
arithmetic mean (there are others) in which the total of all values in the data set is divided by
the number of values in the data set.
Measurement scale The basis against which things can be measured. In research we use the
phrase in two ways, as a framework for describing types of measurement and as a term to
describe a length of measurement. In the first case, the most used framework for representing
measurement scales is nominal, ordinal, interval and ratio. In the second case we might ask
someone how far they agree with a statement on a scale 1 to 5, where 1 is completely agree
and 5 is completely disagree.
Median A measure of the ‘averageness’ of a data set. The median is the middle item in a data
set when the set has been ranked from high to low.
Metaphor map A generic name for visual representations of data that take a recognisable
form (see Chernoff face).
Methodology The approaches we use or can use to collect and analyse data. A methodology
can be just a collection of methods used in a specific instance though it is, more usually, an
association of methods that are commonly used together.
Mixed method design A research design that incorporates quantitative and qualitative
approaches either in parallel or sequentially.
Mixed model design A research design in which quantitative and qualitative approaches are
integrated throughout the enquiry.
Mode A measure of the ‘averageness’ of a data set. The mode is the value that occurs
most often. When applied to individual data values, it can produce a misleading estimate of
‘averageness’. For this reason it is more often applied to grouped data, and the measure is
referred to as ‘modal class’.
Modern (movement) The period prior to the postmodern. Recognisable in art and design and
in social studies. In the latter case it was typified by the identification of general processes and
the search for theory that could describe the social world and which could be used to improve
that world.
Moving average An approach for depicting the trend in a time series in which the value for
any one time point is the average (arithmetic mean) of that point and points either side. For a
three period calculation, we take the central value (time t) and add to that t21 and t11, divide
by three and ascribe the value to time t. The method then moves forward a period and adds t,
t21 and t12 and ascribes the value to t11 – and so on.
Multi-stage sampling An approach which requires that the population is broken up into
clusters whose individuals are then sampled. Those sampled are then assumed to be an
adequate description of the cluster character.
Multi-variate This means many variables. It is used to describe a situation in which several
variables (such as age, gender, socio-economic status and ethnicity) may be influencing a
situation (such as the number of young people from an area entering higher education). A set
of analytic procedures has been developed to allow us to assess the impact of such variables
individually and in combination.
Narrative analysis An approach to analysing spoken or written data whose significance is that
the data represent the authentic voice of the subject or of the subject as mediated by a third
person (e.g. the transcript of a statement given to a policeman).
Neo-conservatism or neocon A right wing political philosophy that believes in the primacy
of the free market and the freedom of the individual to act in his or her own interest. The
consequence of this is a belief in ‘small government’, that is minimum regulation.
Nominal data Data that are measured on a nominal scale (see definition).
Nominal scale The simplest scale of measurement in which objects are placed in classes and
their frequencies counted. Such data are also called ‘counted data’.
Non-directional hypothesis A positive hypothesis that specifies that two variables, a and b, are
different and not how they are different.
Non-response The people who choose not to participate in the data collection process. As
researchers we ought to assess whether the pattern of non-response introduces bias into the
data we collect.
Normative theory Theory derived from principles, which shows how the world would
operate if it were ruled by those principles. In research terms, the world of normative theory is
compared with the real world to assess how far the latter deviate from the former.
Null hypothesis The hypothesis that is actually tested with a statistical test. The hypothesis is
framed along the lines of ‘there is no significant difference between a and b’ for a two-tailed
test or ‘a is not larger than b’ for a one-tailed test.
666 GLOSSARY
Observation A unit of data or set of data observed at a specific time or over a specific period
(an observation). Also the process of collecting data by watching the source that generates
the data.
Observatory Borrowed from the observation of stars and planets, it is now used to describe
an organisation that collects information about an area or specific theme. Within the EU,
observatories have been encouraged that will profile localities.
One-tailed test The tail refers to a test distribution. In a one-tailed test, we examine the
probabilities associated with test result in one tail of the distribution only. This requires us to
have defined a directional positive or research hypotheses.
Open question A question that gives a respondent free reign in the formation of an answer.
Ordinal scale A measurement scale in which data can be ranked according to some criteria.
The rank value constitutes the measurement but the differences in rank may not be of equal
size.
Outlier In a set of data an individual unit or group of data that lies outside the broad pattern
of data. Sometimes an outlier can be the result of a problem with sampling, other times it can
be a valid but unusual outcome.
P value The p value is the probability of a specific test statistic under the null hypothesis. In the
past social scientists used to compare their test statistic with the values of test statistics at critical
values (such as 10%, 5%, 2.5%, 1%, .1%). With the advent of statistical packages, they calculate
the actual probability of the test statistic under a null hypothesis, thus making it easier for the
researcher to compare the test result with the critical value of acceptance/rejection of the null
hypothesis selected at the outcome of the research.
Paradigm An idea that at any one point in time all those working in a particular area, field or
subject adopt common ways of working and common ways of looking at issues.
Parametric statistics A family of statistical tests that assumes the data conform to a normal
distribution. While the assumption of normality can be challenging for educational researchers,
there are ways of transforming data to bring them closer to a normal distribution.
Partial correlation An approach to estimating the relationship between variables in which the
influence of one variable is assessed while holding others constant. The correlation coefficient is
interpreted as normal (see correlation coefficient).
Partial regression A representation of the relationship between one variable and another
while other variables are held constant.
Participant observation A means of data collection in which the researcher observes the
behaviour and lifestyle of the individual(s) or group(s) in which he or she is interested.
Sometimes those being observed are aware of the observer’s status (see visible researcher) and
sometimes they are not (see invisible researcher). The observer participates in the activities
being observed.
Percentile A 1% division of a data set that has been ranked from high to low.
Phenomenography A research approach that seeks to provide a rich and detailed description
of an individual’s or individuals’ experience. This description constitutes the data from which an
understanding is constructed. Phenomenography is an interpretivist approach (see definition).
Often confused with phenomenology, it differs in that it considers the specific experiences of
research subjects, while phenomenology is more concerned with general experiences considered
from the researcher’s perspective.
Pie chart A visual approach to the representation of data. Data are classified into mutually
exclusive classes and the size of each class is expressed as a proportion of the whole data set.
The proportions are used to construct segments of a circle (the ‘pie’).
Pilot (test/survey) An investigation that takes place before the main investigation and which is
designed to test and evaluate the effectiveness of the research procedures.
Platykurtic A distribution that is more flat than a normal distribution with the same mean.
Point-biserial correlation A correlation coefficient in which one of the data sets is continuous
and the other is counted data, usually for two mutually exclusive categories.
Policy analysis An approach to social research that either seeks to understand how policy was
determined (research about policy) or to identify what policy is needed or why policy is failing
or successful (research for policy). It makes use of a wide range of methodologies and methods.
Population The totality of a group from which a sample is drawn. This totality can be people
(the population of children in a schools), organisations (all schools in an administrative area)
and objects (all of the computers in a school).
Positive hypothesis See research hypothesis. The term positive hypothesis is used when we
want to draw a contrast with the null hypothesis.
Positivism (positivist) An approach to conducting social research that seeks to apply the
principles developed by the natural sciences. The key principles are the neutrality of the
researcher with respect to the problem and taking decisions about the results of the research
out of the researcher’s hands. Positivism seeks to emulate scientific research in seeking to
develop theory. Its approach is to postulate hypotheses which, if proven, can constitute building
blocks of a theory.
Postmodernism The period after modernism. The transition can be seen in many fields
including art and design and social studies. In the case of the latter, the broad indicator is
the shift from the study of, search for and identification of the general to appreciating the
significance of the individual, from the search for order to one of understanding why things are
different, from implementing global solutions to problems to valuing local solutions.
Praxis While the dictionary defines praxis as an habitual way of doing things, within the
emerging field of work-based learning and education more broadly, it is used to bring together
the idea that work activity is a blending of theory with practice, that practice informs theory
and theory then informs practice.
Primary data Data that are collected first hand from original sources (such as people themselves)
by the investigator and not statistics or data collected by others (see secondary data).
Probability-based sampling A method of drawing samples in which each unit of data has a
specific chance (probability) of being selected. The usual selection process is equal probability, in
which case the approach is also known as random sampling.
Probing The process of pushing a source to clarify an answer or to reveal more in an answer to
a question.
Probing question A question that is asked in order to clarify a previous response or whose
purpose is to get the respondent to reveal more of an issue.
Purposive sampling Sampling which has a specific purpose aligned to the goals of the
investigation. Purposive sampling is not probability based. Selection of a case study is purposive
as is the identification of subjects with relevant characteristics or behaviours (such as gay
teachers).
668 GLOSSARY
Qualitative data Often erroneously considered to be data that cannot be measured (see
quantitative data), qualitative data are more properly data that reflect beliefs, attitudes, views.
Such data can take the form of verbal statements, written accounts, behaviours, objects and
relationships. The meaningfulness of qualitative data arises from an understanding of personal
lives.
Qualitative research and analysis A research approach whose principal concern is how an
outcome comes about. It is more concerned with why 90% do this and 10% do that rather than
the fact that they do.
Quantitative data Data that are capable of being measured along one of several measurement
scales. Such data are held to exist in the ‘real’ world.
Quantitative research and analysis A research approach that seeks to identify pattern and
association in numerical data. It is closely linked with positivism which makes assumptions about
how research should be conducted.
Quartile A quarter of a data set that has been ranked from high to low (or vice versa). These
quarters are referred to as the upper quartile (the top 25%), the lower quartile (the bottom
25%), the upper middle quartile (26 to 50%) and the lower middle quartile (51 to 75%).
Quota sampling The specification of sample sizes (quotas) for specific groups. The sample
sizes (quotas) are designed to meet specific research goals. In some cases the quotas can reflect
relative proportions in a population (making them similar to stratified samples), in other cases
the sizes may reflect the need to get a sample large enough to make meaningful quantitative
statements. A distinguishing feature of many quota procedures is that respondents are
approached and accepted only if they meet quota specifications.
Radical Both a political philosophy and an approach to problem solving. In both cases it has
connotations of thinking unconstrained by conventional solutions.
Random A process in which every event has the same probability as every other event. This
situation is important in sampling if we wish to use statistical tests with our data. In this case,
each unit in our population should have the same chance of being selected.
Random sampling A way of drawing samples in each unit of data has the same chance of
being drawn as every other unit of data. This approach is also referred to as probability-based
sampling.
Range A quantitative measure of spread, being the difference between the greatest and the
smallest in a data set.
Rating scale A means of enabling subjects to express their level of agreement with a statement
or indicate their position with respect to a linear scale defined by the researcher.
Ratio scale A measurement scale in which the difference between all scale points is the same.
The scale has a zero which represents no quantity of the thing being measured.
Refereeing The process of peer review of papers published by academic journals. Peer review
of a paper by an independent and anonymous referee is a means of ensuring quality in
academic publication.
Reflection A high order analytic process through which to identify what has occurred and
why. It is a core process in action research and in anchoring personal learning. In this regard
experience and outcomes are assessed and reassessed in order to understand, learning
processes, capture learning gains and establish ways of moving forward.
GLOSSARY 669
Regression A line used to represent the trend in a data plot (see scattergraph). The line can
be inserted by eye but is more usually computed so that the total of the squares of the distance
between any data point and the line is minimised.
Representative In sampling, ensuring that the characteristics of the sample reflect those of the
population from which it is drawn.
Research agenda This is the priority given to research problems and themes. A university
department could have its research agenda but those with the power to influence what
research is done are those of funding bodies such as research councils, national governments
and the EU.
Research hypothesis The opposite of a null hypothesis and the issue that we are really
investigating. The research hypothesis can be non-directional (there is a difference between a
and b) or directional (a is larger than b).
Research issue The area of interest to the researcher. The first task of a researcher is often to
focus down the issue and isolate a research question or research questions.
Research philosophy A term used to describe the principles governing research practice. What
this description fails to convey, however, is that there is not just one research philosophy, so
while we can use the term in a portfolio sense, we have to talk about the different principles
that drive research practice. The study of these different sets of principles can also be referred
to as research philosophy.
Research problem More general than a research question, a research problem is an issue that
requires further focus before it can be investigated.
Research question A formulation of a research issue that isolates the specifics to be researched
and to which an answer is required.
Research strategy The overall approach to obtaining and processing data that will answer
the research question. A research strategy is informed by whether a researcher chooses to
work within a paradigm and the level of resources available. Essentially a research strategy is a
combination of methodology and method designed to meet the needs of a particular research
situation.
Response rate In survey research, the proportion of those contacted who take part.
It is usually expressed as a percentage (of respondents) of the sample. The response
rate is important in establishing the quality of the survey and in identifying the possibility
of bias.
Review journal A journal whose principal purpose is to summarise the latest developments in
a field.
Sample frame The section or area of the population from which the sample will be drawn.
In many cases the population will be the sample frame (for instance, if we want to interview a
sample of a year group in a school, the year group would be both the population and sample
frame). In other cases the sample frame will be a sub-set of the population. For example, if we
were studying the problems faced by head teachers in schools that drew their pupils from socially
deprived areas, it would be difficult to sample the whole population of head teachers in such
schools in a country, so we could identify three areas with different levels of deprivation and
sample the head teachers in these areas. In this case, the three deprived areas constitute the
sample frame.
670 GLOSSARY
Sample size The sample size is the number of people or bodies that participate in a
survey. Sample size is expressed both as a number and a proportion of the population. The
determination of sample size is important before a survey if data are to be processed to yield
results that meet standards of statistical rigour.
Sampling fraction The proportion of the population accounted for by the sample (that is the
sample as a fraction or percentage of the population).
Scale Used in two ways in social research: (a) A framework for measurement (attitude scale) (b)
a level at which data are presented and analysed. This level can be geographical (national, local)
or hierarchical (expenditure on food, leisure or expenditure on pre-pepared food, on fruit and
vegetable, on meat, on fish, on cereals etc.).
Scattergraph A way of portraying the interaction of two variables on a graph. Each axis
represents a variable. Data points are plotted against the two axes.
Scientism Both an approach to research and a belief about how research should be conducted.
Scientism believes in the primacy of a scientific approach to research, with key principles being
the neutrality of the researcher, the demonstration of the effect of an individual variable
or set of variables upon a situation and the development of a model to reflect a particular
circumstance or a theory to describe a general relationship.
Secondary data Data used by the researcher or the research team but collected originally by
another researcher.
Semantic differential A rating scale based on adjectival opposites that can be used to profile
respondent views about an issue.
Semi-structured interview An interview in which themes are identified and lead questions
specified but where the interviewer is given training to ask supplementary questions that will
provide the data needed by the research programme.
Semiotics An approach to qualitative research that seeks to identify the deeper meanings
behind behaviour and the way text and objects are used.
Sign The fundamental unit of semiotics. A sign stands for something. What it stands for is
socially and/or culturally determined. What it stands for (its meaning) can be analysed (see
connotation).
Significance level This is the level of probability at which a researcher chooses to reject the null
hypothesis. The most usual value selected is 5% but in some circumstances significance in social
or educational research terms could be less than this (for instance 10 or 20%).
Skew A pattern in data in which the data are not equally divided and balanced either side of
a central value (usually the mean). When plotted as a frequency distribution, the appearance of
the data set is that it leans to the left or to the right.
Snowball sampling Developed as a purposive sampling approach and often used for hard to
reach populations. The researcher uses subjects already identified to identify new subjects, the
assumption being that in hard to identify groups those already in the group are likely to know
other group members.
Soft system Whereas a system normally requires a quantitative relationship between inputs
and outputs, a soft system is an expression of the relationship between broader variables for
which there may not be a simple quantitative measure. For instance, if we were researching
children’s progress in pre-school, we might suggest that a contributory factor was the amount
GLOSSARY 671
of time parents spent reading with them. Further investigation suggests that socio-economic
status might be a factor that influences this. As well, we ought to consider whether one or both
parents worked was something to take into consideration. We are exploring a process but, at
this stage, in a non-quantitative way.
Special educational needs Conventionally this refers to the additional or special educational
or care provision required by students with specific physical or learning disabilities. Theoretically
it could also apply to those who are gifted in a general or specific context but it is now usual to
describe these as ‘gifted and talented’.
Spectral analysis A quantitative approach to the analysis of time series in which data plotted
through time is decomposed into a series of sine waves of different amplitude and wavelength.
When these curves are combined, they reproduce the original data set. The task of the
researcher is to identify the significance of the individual curves in terms of processes at work.
Standard error The idea that a characteristic of a sample reflects the characteristic of a
population within known limits. The standard error is usually calculated for the mean of
a sample. It is expressed as the sample mean plus or minus a value determined through
calculation with a 68% (approximately) or 95% (approximately) likelihood.
Star plot A way of portraying the sizes of a number of variables (usually more than five). The
variables are plotted in a circle on radii that are at equal angles from each other. The value of
the variable is shown by the length of the radius.
Statistical significance A way of assessing the likelihood that the difference between two
distributions could be random in nature. The conventional critical level of significance adopted
by statisticians is that an event occurring one time in 20 (5%) indicates that chance is unlikely to
be a factor. However, for some investigations this level may be too strict and researchers should
judge significance taking other factors into account.
Stem and leaf diagram A means of visually representing the digits of a numeric data set.
Numbers are broken down into component parts, such as tens and units. In this case the tens
constitute the stem and are written as a column of figures from high to low or vice versa. The
units constitute the leaf. These are written against the appropriate stem, thus the 2 of 52 is
place against 50 and the 5 of 55 against 50.
Stratification A procedure used in sampling that divides the population into groups (strata)
and allocates the sample size according to the relative sizes of the strata. The effect of this
is to reduce the sample size because we are sampling units of the population that are more
homogeneous than the population overall.
Student’s t test A statistical test for assessing whether two small samples are drawn from the
same population.
672 GLOSSARY
Summative assessment Used to describe the position a student has reached in terms of
knowledge, understanding and learning at a point in time (see also formative assessment).
Synchronous Something that happens at the same time as something else. Classroom teaching
and student interaction, for instance, are synchronous activities.
Synthetic phonics A method of teaching reading in which the reader is taught the ‘sounds’ of
letters and letter combinations, then break up the word into these sounds and assembles them
to make the word. For example, SH – I – P to make ship.
System A sequence of inputs and outputs that constitute a process. We can think of the
number of undergraduates in a university academic year as a system in which the inputs were
the number entering the year one plus any that were required to retake the year less those who
failed the previous year and were required to leave.
Tagging In grounded theory, the process of attaching a code to a piece of data (thereby
effectively converting data to evidence).
Template analysis Analysis of data according to a pre-existing framework. This framework can
be a standard template, a template devised for another piece of research (which we might use
to test previous results) or one devised to assess a theory or model.
Test statistic All statistical tests compute a test statistic, which is assessed against its probability
of occurrence under a null hypothesis. The test statistic is the outcome of the numerical
processes required by the test itself.
Theory A statement about functional and process relationships that create cause and effect.
Theories can be derived as a result of data analysis or prior to analysis and tested through data
collection and analysis.
Think aloud interview An interview approach in which a subject is asked to perform a task
and, at the same time, describes the thinking process that underpins his or her actions.
Time series A line graph that shows how a variable (such as birth rate or students in higher
education) changes over time.
Tree map A tree map is a way of representing data with a hierarchical structure in two
dimensions. The computer programs that produce the visualisation are usually interactive and
allow researchers to manipulate data.
Triangulation A process of verifying data. Data given by one source is confirmed by another
and preferably a third. The sources can be people, documentation, statistics, reports and so on.
Two-tailed test The tail refers to a test distribution. In a two-tailed test, we examine the
probabilities associated with test result in both tails of the distribution. This requires us to have
defined a non-directional positive or research hypotheses (that there is a difference).
Type 1 error The incorrect rejection of a null hypothesis, that is when there is no difference
between our samples, we conclude that there is.
Type 2 error The failure to reject the null hypothesis when it should be rejected, that is when
data sets are different, we conclude that they are not.
GLOSSARY 673
Univariate This means one variable. It is usually used in the context of a distribution. A
univariate distribution is one where one variable (such as age) is plotted in terms of frequency.
Unstructured interview A free form interview. Themes will have been identified by the
researcher but there is freedom to follow up on new themes, to return to themes as new
information is gathered and to probe for additional information. Unstructured interviews are
particularly useful for scoping an issue before more detailed investigation is carried out. They
are used extensively in policy analysis (see structured interview).
Variable A factor that can influence an outcome or be influenced by an outcome. The concept
of variable is important in modelling research situations.
Z score/value Z scores are values for a unit of data standardised in relation to the data sets
mean and standard deviation. Z scores associated with a research data set can be compared with
scores for a normal distribution, where the proportions of a data set lying at specific distances
from the mean are known. These known proportions are presented as a Z table and can also be
computed. A Z table shows the proportion of a normal distribution that lies between the mean
and a Z value, between Z values and below and above Z values.
Index
Note: Page numbers in bold are for Ariely, D. 51 Benson, Tammy 501–2
figures, those in italics are for Arnon, Sara 108–9 Benwell, Bethan 507
tables. Artinian, Barbara 496 Bergendahl, Christina 630, 631
Aalsvoort, G. 501 Askehave, Inge 510 Berkowitz, E. 364
academic culture 17 assessment practices 41 Berks, Melanie 496
academic researchers 7, 8, 30 association 574–5 Bessel’s correction 542
academic writing 219 attention 98 Bham, M. S. C. 76
acceptability 19 attitude scaling: methods 321–8; bias 161; citation 198; in observation
accuracy 291–2, 293 principles 306–9 387–8, 389; response, in
Acorn lifestyle classification 150–1 audio recordings 367; analysis of 465 questionnaires 292, 320–1;
action plans 23, 64 Audit Commission 58 sampling 116, 265
action research 45, 63–6, 67, 71, 103, auditing: research literature 87, 194; bibliographic search engines 207–8
137, 368, 630–1; key features resources 20–1, 231 Billingsley, Bonnie 498
631; uses of 64–6 Australian Education Index 15 bimodal distribution 533, 567
activity-based stimuli 345–6 autocoding 480 bivariate data 436, 439, 454
Adult Literacy and Life Skills Survey average see mean `blind’ experiments 127
(ALL) 9 Avramidis, Elias 327 Boccacini, Marcus 613–15
age-sex pyramids 426–9, 430, axial coding 498 Bolhuis, Sannelee 374–5, 376
431, 432 Bologna process 600
aggregation of data 163–5, 167 Babad, Elisha 347 Bonawitz, Elizabeth 75
Ahlin, Asa 592 Babbage, Charles 499 Boolean searches 212, 480
alcohol consumption 495 Bailey, John 465 boundaries, research 88
American Educational Research Balen, Rachel 50 Bourdon, Sylvain 468
Association (AERA) 52, 110, Ball, Stephen 641 Bowen, Glenn 496
208, 210–11, 584 Ballinger, Gillian 353–4 Bowker, Rob 627–8
American Evaluation Association 110 Baltar, F. 255 Bradburn, Norman 314
American Psychological bar graphs (bar charts) 425–9, British Education Index 15, 207
Association 584 451, 454 British Educational Research
An, Wayne 486–7 Bayes, Thomas 75 Association (BERA) 51, 52,
analysis of variance (ANOVA) 601–7; Bayliss, Phil 327 209, 210
one-way 602–3; two-way 604–7 Beauchamp, C. 346 British Educational Research Journal
analytic information 304 behaviour, as source of data 155, 458 (BERI) 210, 216
Anderson, Edgar 449 belief networks 75 British Sociological Association 52
Anderson, M. 513 Bell, Alice 364–5 Brooker, Barry 493
Anghileri, J. 522 bell curve 542, 568 Brouwers, Andre 76, 597–8
anonymity 52, 318–19, 386 Belohlav, James 46 Brunet, I. 255
anti-social behaviour 349, 626–7 benchmarking 58 bullying 475–6, 524
AQUAD 481, 483 benefiting from education research Burden, Robert 327
archives see data archives 646–8 Burgess, Ernest 38
INDEX 675
databases 15, 40; accessing literature Economic and Social Data Service existentialism 40, 41, 42, 89
via 206–8, 209; review 209 (ESDS) 177 experimental approaches 67, 121–8,
Davies, Graham 368 Economic and Social Research Coun- 138–9; design strategies
Davis, Anita Ann 471 cil (ESRC) Question Bank 315 122–7; key features 634;
deciles 537 Education Act (1944) 45 phrasing the research question
decision making, evaluation 57–60 Education Funding Agency 13 631–2; problems with 127–8
deductive method 107, 483 Education Network Australia 207 experimental groups 122, 123, 127
deep learning 513 Education Review 349 explanation, goodness in 289–91
degrees of freedom 591, 595, 602 Educational Research 349 explanatory theory 73–4
Delphi technique 256 Educational Research and Review 285 explanatory variables 279
Denmark 210, 371; Index of Social effect size 459, 583–4, 617 eyeballing see reading a data set
Progress 284; statistical data 177 Ejieh, Michael 598
denotation 511 electoral areas 167–70 F ratio 602–3
density plots 443 elite interviews 370 Facial Action Coding system 469
Department for Business, Innovation Ells, Harvey 343 factorial experimental design 125,
and Skills (BIS) 12, 13 emancipatory learning 368 126
Department for Children, Schools emotional behaviour and environ- factual coding 470
and Families (DCSF) 12, mental context 157 falsification of data 49
206, 210 Englund, T. 39 Family Spending 249
Department for Communities and environment, as source of data 156–7 Fan, Z.-Z. 599
Local Government 176 epistemology 36, 97, 109 Faraday, Michael 37
Department for Education (DfE) 12, EPSEM (Equal Probability of Selection Fast Track project 384
13, 174–6, 209 Method) see probability feminist research 360, 361
dependability 129, 459 sampling Feng, Shoudong 501–2
dependent variables 279, 585 ERIC 15, 40, 207 Ferdinand, Jason 387
deprivation, measurement of 282–3 Escalate 377, 380–1 Fereday, Jennifer 487–8
description 69, 70; case study as Estell, David 362 film 344–5
means of 54–5 Estes, Richard 284 Finland 210, 371; cluster sampling
descriptive coding 470 ethical issues 51–2, 90, 147, 195; in health education 248;
descriptive statistics see quantitative assessing 89; competing statistical data 8
descriptive analysis interests 17; confidentiality 52, Finlay, Linda 129
Desimone, Laura 364 319, 371, 373; consent 50, 52, Finnish Adolescent Health and Life-
determinism 157 369–71, 385–6; falsification of style Survey 248
development, and action research data 49; interviews and focus Finnish Social Science Data
65–6 groups 369–72; observation Archive 316
Dewey, John 73, 497 385–7; plagiarism 51, 490; Fisher, Ronald 577, 589, 601
dichotomous data 613 and research philosophy 35; Fisher’s test 600–1
directed content analysis 489 see also misuse of research Fitz, John 258
disclosure 371 The Ethnograph 481, 482–3 Flanders Interaction Category frame-
discourse analysis 484, 499–503, 646; ethnography 60–3, 66, 67, 68–9, 103, work 384–5
critical 484, 485, 508–11 139, 348; key features of 630; focus groups 103, 255, 289, 339,
disengagement and observation 388–9 phrasing the research question 340, 365, 366–8, 372; ethical
distributions: bimodal 533, 567; 628–9 issues 369–72; and naturalistic
characteristics of 526–7; flat European Educational Research enquiry 105; visual stimuli in
and peaked 546–7; leaning Journal 211 344, 367
(skewed) 526, 545–9, 567; European Foundation for Quality Foorman, Barbara 199–201
normal 542–4, 567, 568–9; Management (EFQM) 65 Fore, C. 76
sampling 543, 544 Eurostat 186, 317 foresight planning 256
divided bar charts 454 Eurydice 184–6 formative evaluations 59
Doyal, Len 102 evaluation 56–60, 66, 67, 70–1, France 55, 371
Drury, John 389 139, 626–8; impact 626; key Fraser, Heather 505–6
du Toit, Stephanos 628–9 features of 628; phrasing the Freire, Paulo 368
dual (pair) interviews 365–6 research question 626; process frequency diagrams see bar graphs
Durkheim, Emile 119 59, 626 Friborg, Oddgeir 328
Dutton, Dennis 218 Evans, C. 584 Friedman, Stephen 197
Dye, J. 474 Every Child Matters initiative 277, Frisby, W. 361
404–5 Futurelab 209, 210
Early Intervention Foundation 12–13 Evidence for Policy and Practice In-
EBSCO 207 formation and Coordinating Gallardo, David 448
ecological fallacy 119–20, 165 Centre 209, 210 Gallivan, Michael 135
INDEX 677
multi-level design 133 Norway 210; action research 65–6; organisational management and
multi-method approaches 72 cluster sampling in health practice, evaluation as means
multi-phase sampling 249–50 education 248 to improve 58
multi-stage sampling 247–9, 250, 251 NSPCC (National Society for the Ormerod, Fiona 503
multi-variable analysis 286 Prevention of Cruelty to Oughton, Helen 510
multiple correlation 611–12 Children) 209, 210 outcomes 65; evaluation of 59;
multivariate data sets 444–50, 454 null hypotheses 102, 108, 113, models’ influence on 76–8;
Mumford, Alan 74–5 114–15, 120, 574; and theory development as an
Mungazi, Dickson 74 statistical testing 576–82, 617; outcome 75–6
Murray, Lorraine 365 Type I error 580, 581; Type II outliers 441, 537
Myberg, Eva 615 error 581
Myrich, F. 497 pair (dual) interviews 365–6
objectives: of educational research paired comparisons 124–5
narrative analysis 484, 503–7 14–19; evaluation of 59 paired samples 592–3
narrative interviews 356 objectivity 129, 131 Papatheodorou, Theodora 248–9
National Bureau for Economic objects, as source of data 155–6, 458 paradigms 46–9
Research (USA) 147 observation 288, 289, 339, 347–55, parametric tests 586
National Centre for Education 374–89; bias in 387–8, 389; partial correlation 609–11, 612
Statistics (USA) 180, 181 characteristics of approach participant observation 62,
National Centre for Social Research 350–1; close 351–2; coding 352–3, 386
459, 460 data 376–7, 381, 383–5; data participants 50, 51; anonymity 52,
National College for School analysis 465; disengagement 318–19, 386; children as see
Leadership 13 issues 388–9; ethical issues children; consent 50, 52,
natural settings 350 385–7; lesson 376–85; non- 369–71, 385–6; withdrawal
naturalistic enquiry 105 participant 352, 353–5; 50, 52
neo-conservatism 43 participant 62, 352–3, 386; pattern searching 212
Netherlands: benchmarking 58; DANS remote 352 peakedness 526
(Data Archiving and Networked observatories 83–4 Pearson coefficient 559
Services) 315–16; education O’Farrell, Clare 45 Pearson, Karl 577, 589, 593
research 55; education system Ofqual (Office of the Qualifications Peirce, Charles 48
13–14; Index of Social Progress and Examinations Regulator) 13 Peled, Einat 51
284; journals 210; Life Situation Ofsted (Office for Standards in Pellico, L. 374
Index 284; quality management Education) 12, 13, 106, 209, perception 98
65; statistical data 178–9 210; lesson observation criteria periodicity of data 550–6
neutrality 27–8 376, 378–80 Persson, Gun 353
Neville, S. 522 ogives see cumulative frequency Peterson, D. 604
Newman, Mark 384 graphs Petter, Stacie 135
No Child Left Behind (USA) 277 Ohi, Sarah 502 Pettigrew, Simon 495
noise 163 one-tailed test 578, 579, 580–2 phenomenography 484, 513–14
nominal scales 154, 586 Ong, Anthony 38–19 phenomenology 39, 41, 42, 89;
non-parametric tests 586 ontology 35–6 social 487
non-probability sampling 238, Onwegbuzie, Anthony 110, 459, 460 Phillips, Linda 325
252–61, 266; case studies open questions 300–2, 303, 304, philosophy 35–46, 89
257–8, 259, 260, 261; 356, 384 phonics 9, 10, 199–201
convenience sampling 257, open (substantive) coding 492–3, Physical Quality of Life Index 284
259, 260, 261; quota sampling 495, 498 Piaget, Jean 75, 208, 349
252–4, 259, 260, 261; self- open systems 275 pictures see visual images
selecting samples 258–9, OpenStat 589 pie charts 435–6, 451
259–60, 261; snowball opinion polls 254 places, as source of data 156–7
sampling 254–5, 259, 260; optical character readers (OCRs) 330 plagiarism 51, 490
specialist group sampling ordinal data 153 planning research 16, 24, 230–2
255–7, 259, 260, 261 ordinal scales 154, 308, 533, 586 platykurtic distribution 546
non-response 116, 160, 264 Organisation for Economic Co- point-biserial correlation 613–14
non-verbal communication operation and Development polar diagrams 444
347, 367 (OECD) 9, 184–7, 317; Pole, Christopher 375
normal distribution 542–4, 567, Programme for International policy 219, 640; education policy in
568–70 Student Assessment (PISA) England 9, 12; evaluation as
normal distribution function 569 184, 336, 405, 422–4 tool for formulation of 58;
normative models 77 organisational change, action research shaping and testing 9, 11–12,
normative theory 73 and 64–5 13, 29, 30, 70, 71
680 INDEX
Polite, Vernon 354 professional researchers 7, 8, 17 quality assurance in schools 105, 106
politics 35, 70, 105–6; and critical Programme for International Student quantitative analysis 646; see also
theory 42–3; and education Assessment (PISA) 184, 336, quantitative descriptive
research 25–6, 30, 87 405, 422–4 analysis; statistical testing
Popper, Karl 37, 102, 635 Progress in International Reading quantitative data 148, 149;
population pyramids 428–9, 430, Literacy Study (PIRLS) 9 measurement of 153–4; see
431, 432 projective techniques 343–4, 345 also statistical data
portals (gateways) 206 proof: and qualitative research 108; quantitative descriptive analysis
Porter, J. 372 and quantitative research 521–62; distributions see
portraying data 421–53, 454, 646; bar 100–3, 108, 114–15 distributions; kurtosis 546–9,
graphs 425–9, 449, 451, 454; Prosser, J. 343 567; mean value 526, 528–30,
Chernoff faces 445, 447–8, proxy research 147 533, 539, 544, 546; median
451, 454; complex data sets Prskalo, Ivan 600 value 526, 530–3, 539, 545–6;
443–50; concept maps 451; psychology 74, 75, 98, 109, 343, mode value 526, 533–6; skew
cumulative frequency graphs 348–9 526, 545–9, 567;spread 526,
430–34, 451, 454; glyphs 449, publication 50 536–45; time series analysis
454; histograms 425–9, 451, Punch, Samantha 373 549–56
454; line graphs 437–9, 451; punctuation 466 quantitative research 47–8, 96,
matrix charts 448–9, 451; purpose of research 85–6 104, 129, 138–9, 139,
metaphor mapping 445–8, purposive sampling see non- 140; character of 98–103;
454; pie charts 434–7, 451; probability sampling hypotheses 37, 101–2, 107,
scatterplots 440–44, 451; star 108, 112–15, 120; and
plots 443–5, 446, 449, 451, qualitative data 148; measurement of positivism 37, 49, 98–9, 111,
454; stem and leaf diagrams 3, 152, 154; types of 149–51 121; problems with 115–20;
422–4, 451, 454; tree maps qualitative data analysis 457, 463–519, and proof 100–3, 108,
449–450, 451, 452, 454 646; coding see coding 114–15; and theory generation
positivism 37, 38, 39, 43, 49, 89, qualitative data; computer 99–100; see also mixed
98–9, 111, 121, 237, 635 assisted 477–83; content methods research
postgraduate students 7 analysis 484, 488–91; quartiles 537–9
postmodernism 44–5, 53, 89 conversation analysis 484, quasi experiments 124, 631
Power, Robert 352 507–8; critical discourse questionnaires 287, 288, 299–337,
practice oriented research 10, 11, 15, analysis 484, 485, 508–11; 340, 645; administration
29, 30, 350 discourse analysis 484, 499–503; 321–4; and anonymity
pragmatism 48–9, 67 grounded theory 105, 107, 318–19; and children 364–5;
praxis 11 484, 491–9, 503; interpretation confidentionality 319; piloting
precedence 153 of data 464, 470–1; narrative 334–5, 364–5; and projective
precision 292–3 analysis 484, 503–7; organising techniques 345; questions
predictive texts 466–7 data 464; overview 463–77; 310–21 (closed 300, 301–2,
presentation 51, 218 phenomenography 484, 303, 304; length 312; open
prestige effect 321 513–14; preparing the data 300–2, 303, 304; order effect
primary data 225, 232; sources of 463, 464–6; semiotics 484, 312; phrasing 313–15; on
154–8 491, 511–13; template analysis sensitive issues 317–19;
principles 49–52 484, 485–8, 492 simplicity 310–12; use of
privacy 386 qualitative research 38, 47, 48, 49, 89, existing questions 315–17;
probabilities 114; and standard 96–7, 128–32, 139, 140, 285; wording 310–12); response
deviation 539, 543; and character of 103–8; complexity bias 292, 320–1; scaling
statistical testing 115, 567, of 458–9; hypotheses 107, (methods 321–8; principles
572–3, 575–7, 578, 608, 617 108; and nature of proof 306–9); structure and layout
probability sampling 237–8, 239–51, 108; objectivity and integrity 328–31; and think aloud
266; cluster sampling 246–7, 131; quality in 459–62, approach 364–5
251; multi-phase sampling 514; quantitative indicators questions: closed or structured 300,
249–50; multi-stage sampling in 489–90; reliability and 301–2, 303, 304, 357, 384;
247–9, 250, 251; simple validity 130–1, 459; and and literature searches 205;
random sampling 239–42, theory generation 107–8; open or unstructured 300–2,
251, 261; stratification 244–6, triangulation 130–1; see also 303, 304, 356, 384
247, 248, 251; systematic mixed methods research quintiles 537
sampling 243–4, 251 quality: improvement 65; in quota sampling 252–4, 259, 260, 261
probability value 576 qualitative research 459–62,
problem-based learning 375, 384 514; of secondary data radar diagrams 444
process evaluation 59, 626 170–1, 183 Rademaker, L. 479
INDEX 681
random allocation of subjects 122–5 research problems 79, 80; scoping sampling 116, 160, 226, 233–66,
random sampling 239–42, 251, 261 82–5, 194 388; bias 116, 265; case studies
range of a data set 537–9, 545–6 research programmes 640–6; arguing 257–8, 259, 260, 261; choice
ratio scales 153–4 the case for 642–4; determining of method 260–1; cluster
re-ordering data 416–18 strategy and methods 644–5 246–7, 251; convenience 257,
reading 15–16, 196, 276; phonics research progression 23–4 259, 260, 261; frame 236,
approach to teaching 9, 10, research proposals 85–9 237, 264, 265; framework
199–201 research puzzle 6 of 236–8; and Internet 255,
reading ability: influences on research questions 67–72, 79, 80, 258–9; multi-phase 249–50;
121–2, 126; and questionnaire 138, 622–3; breaking down multi-stage 247–9, 250, 251;
development 310, 311; and 270–2; case studies 624–5; non-probability 238, 252–61,
social background 615–16 and data mismatch 234–5; 266; non-response 116, 264;
reading a data set 399–421, 453–4; developing and justifying 88–9; probability 237–8, 239–51,
compressing data 413–15; ethnography 628–9; evaluation 266; problems with 264–5;
re-ordering data 416–18; 626; experimentation 631–2; quota 252–4, 259, 260, 261;
reconstructing data 419–21; and mixed methods research reasons for 234–6; reliability of
shaping data tables 404; 135–7; posing 637–9 results 263; respondent driven
stripping out data 404–13; research strategy 138–9, 622–3; 255; self-selecting samples
visual inspection 399–404 action research as 630–1; case 258–9, 259–60, 261; simple
realism/realists 36, 483 study as 624–5; determining random 239–42, 251, 261;
reality, and qualitative research 105–7 644–5; ethnography as snowball 254–5, 259, 260,
reconstructing data 419–21 628–30; evaluation as 626–8; 261; specialist group 255–7,
refereeing 27 experimentation as 631–4; 259, 260, 261; stratification
references 217, 221 hypothesis testing as 634–6; 244–6, 247, 248, 251;
referencing: analysing 199; identifing 641; posing research systematic 243–4, 251; with/
selective 198 questions as 637–9 without replacement 240, 242
reflective practice 630 research students 7 sampling distribution 543, 544,
regression analysis 441, 609–13 researcher–subject relationship 130 653, 654
regression line 556–8 residuals 552 Sandford, Rachel 626–7
Reichel, Nirit 108–9 resources, auditing 20–1, 231 saturation 472, 496
Reid, Colleen 361 response bias 292, 320–1 scaling see attitude scaling
Reid, Lesley 473 review databases 209 Scandinavian Journal of Educational
relatedness 574–5 Review of Education Research 209, 210 Research 210, 349
relationship mapping 204–5 review journals 196–7, 208–9, 210 scanning see reading a data set
relativists 36 Review of Research in Education 209, scatterplots 439–44, 451, 556
Relevance of Science Education 210 sceptical perspective 26–7
(ROSE) project 300 Rice, Mabel 383 schemas 208–9, 304
reliability 129, 130–1, 459; of Rice rule 534–5, 536 Schlichting, Leesbeth 592–3
sampling results 263 Richardson, John 292 Schmitt, Dudolph 346
representativeness 18 Richmond, Heather 504–5 schools: admissions policies 45;
reputation 227, 229 Rigotti, T. 445 inspections 106; and parental
research agenda 79, 80–1 risk: assessment 21–2, 88; in choice 100, 101–2, 112–15;
research community 11, 14, 20, 79 interview situations 372 quality assurance in 105, 106
research hypotheses 113, 120, 574–5, Robinson, Vaughan 359–60 Schuman, Howard 302
578, 580, 617, 646; directional Robinson, Vicki 357 Schutz, Alfred 208, 487
or non-directional 580, 581 Robinson, William 119 Schwarz, Norbert 150
research issues 20, 67; determination Rodgers, Dennis 353, 386 scientism 37, 53, 89
of 81; exploring 8, 11; Roegger, Daniel 524 scoping the research problem
mapping 202–5; selection of Rogers, Rebecca 510 82–5, 194
79–85; systems representation rose diagrams 444 Scottish Education Research
of 276–7, 278 Rose, J. 9, 11 Organisation (SERA) 386
research literature 191–222, 640; Rosen, Monica 615 secondary data 154, 159–88, 225,
auditing 87, 194; identifying Rosenthal, Robert 347 232, 269; comparison 1
the research method in 194–5; Ross, J. 277 66–70; international data
misuse of 195–201; scoping running mean 551–3 sources 182–7; Internet
the research topic 194; as Russell, Helen 41 access to 159, 172–87; level
stimulus to research thinking of analysis 163–5; levels
192–4; synthesising research Samdal, O. 248 of confidence in 160–2;
findings 195 sample size 261–3; measuring 652–5 manipulation 168–9; national
research priorities 79 sample variation 127–8 data sourcwes 172–82;process
682 INDEX
of data collection 160–2; spatial analysis 570 STILE (Statistics and Indicators on
quality 170–1, 182; and Spearman coefficient 559, 560, 608 the Labour Market in the
research question mismatch special needs students 327 e-Economy) 316
234–5; resolving problems specialist group sampling 255–7, 259, stimulating research thinking 192–4
with 168–9; scale of data 260, 261 Stokoe, Elizabeth 507
presentation 162–3; spatial spectral analysis 554–5 Stones, Anthony 270
organisation of 167–70; types speech data analysis 465 Stott, Clifford 389
of 159 Spelberg, Henk 592–3 stratification 244–6, 247, 248, 251
Segrott, Jeremy 359–60 Spencer, Liz 459–60, 479 Strauss, Anselm 491, 492, 497–8
selective coding 493–4, 495, 498 spider diagrams 87 stripping out data 404–13
selective referencing 198 spread 526, 536–45; estimation of structured interviews 341–2
self-presentation 375–6 using grouped data 538–9; structured questions see closed
self-referenced data 162 range as measurement of questions
self-selecting samples 258–9, 537–9; standard deviation as Student’s t test 589–93, 607
259–60, 261 measure of 539–45 Stumpf, Heinrich 611
semantic differential scales 324–6, SPSS (Statistical Package for the Social Sturges rule 534, 535–6
327–8 Sciences) 588 style guides 51
semi-structured interviews 356–9 standard deviation 539–45, 546, 567, substantive (open) coding 492–3,
semiotics 155–6, 484, 491, 571, 653 495, 498
511–13, 646 standard error of the difference 590 Sudman, Seymour 314
sensitive issues in questionnaires standard error of the mean 544, 590, summative content analysis 489
317–19 654–5 summative evaluations 59
sequential estimation 655 standardising data 570–2 Sun, S. 583–4
sex education 612–13 standards 51 Super Output Areas 170, 173
sexual attitudes/health 369 Standards and Testing Agency 13 supervisors 8
shadowing 354–5 Stanley, Julian 611 surface learning 513
Shannon, Sarah E. 489 star plots 444–5, 449, 451, 454 survey and analysis approaches
shaping data tables 404 Stata 588 112–20, 121, 138, 160–1
Shaw, Ian 51 statistical data 69, 71, 159; extracting sustainability, evaluation of 59
significance 566–7 information from see Suto, Irenko 363
significance level 115, 617 portraying data; reading a Sweden 210, 371; education research
significance testing 567, 573, data set; international 184–7; 39, 55; education system 13,
575–82, 617 level of confidence in 160; 14; Index of Social progress
signifier/signified 511 national data sources 172–82; 284; quality assurance in
Silver, Christina 479–80 see also quantitative descriptive schools 106; statistical
Simons, Daniel 98 analysis data 180
simple random sampling 239–42, Statistical Science 466–7 Swedish Research Council 52
251, 261 statistical significance 575–8 symbolism 511
Simpson’s Paradox 165, 167 statistical testing 115, 116–17, 120, synthesis, of research findings 28, 195
SISA (Simple Interactive Skills 128, 565–619; choice of test synthetic phonics 9, 10
Analysis) 589 584–6; determining a test systematic sampling 242–4, 251
Sitaram, Shashikala 345–6 result 575–82; for difference systems model 274–7; variables
skew 526, 545–9, 567 587, 589–607 (analysis of 279–80
Slovakia, cluster sampling in health variance (ANOVA) 601–7; chi-
education 248 square test 593–8; Fisher’s test tagging data 471–3
Smith, Peter 524 600–1; Kolmogorov-Smirnov Tanner, Howard 345
snowball sampling 254–5, 259, test 598–600, 656; Student’s t target populations 236, 237–8
260, 261 test 589–93); non-parametric Tashakkori, Abbas 132
social classifications 150–2 586; and normal distribution Taylor, Barbara 199–201
social phenomenology 487 567, 568–70; and number of Taylor, Chris 109, 110, 134
social value of research 647–8 data sets 585; parametric 586; teacher burnout 76, 597–8
socialisation 349 for relationships 587, 607–15; teacher recruitment 637–9
sociology 38–9 variables 585, 586 Teaching Agency 13
Sokal, Alan 45 Statistics Canada 9 teamwork, coding qualitative data
Solheim, Erling 634 Statistics Netherlands 426–8 476–7
Sosu, Edward 109 Statpages 589 Teddlie, Charles 132
sources: authoritative 227–9; stem and leaf diagrams 422–4, telephone interviews 342
reputable 227, 229; stability 451, 454 Tella, Adedeji 592
over time 229–30 stepwise correlation 612–13 template analysis 484, 485–8, 492
Sparling, J. 368 Stevens, Peter 208 territoriality 349
INDEX 683
test statistics 576; calculation departments 12, 13, 25, visualisation of data see portraying
of 588–9 174–7, 209; government and data
text analysis software 477–83 public data sources 172–7; Voeten, Marius 374–5, 376
Textanz 478 Index of Child Well-Being 284; Vygotsky, Lev 73–4
thematic apperception test 343 Index of Social Progress 284;
themes: identifying research themes statistical data 172–7 Walford, Geoffrey 641
61–2, 642; and qualitative data United Nations Educational, Scientific Walker, D. 497
coding 473–5 and Cultural Organisation Warburton, Bill 498
theoretical coding 494 (UNESCO) 182–3, 317 Warford, Mark 77
theoretical perspective 15 United States (USA) 83; Census wavelength 553, 555
theory 77, 89–90; development of, Bureau 428; Centers for Weatherford, Jack 352
as an outcome 75–6; Disease Control and Web of Knowledge 207
explanatory 73–4; influence Prevention 488; cohort weighted moving averages 553
on goals 72–6; normative 73; studies 55; data sources Weitoft, Gunilla 578
and qualitative research 180–2; education theory 73, Weitzman, Eben 480
107–8; and quantitative 74; No Child Left Behind well-being indices 419–21
research 99–100 initiative 277 Wessa, Patrick 442
think aloud technique 363–5 univariate data 436, 449, 454 West, A. 45
Thomas, L. 346 Universities and Colleges Admission Westcott, Helen 368
Thomas, N-E. 592 Service (UCAS) 152, 176–7 Westera, Wim 624–5
Tihanyl, Krisztina 628–9 university admissions 152 Wetherall, Margie 501, 502
time, allocating 24, 88 university alliances 624–5 What Works clearing house 110,
time series analysis 549–56 unmatched groups 124–5 209, 210
timeline for research 230–1 Wheway, R. 471
Tizard, Barbara 641 validity 17–19, 129, 459 Whitaker, K. S. 76
Tom, A. 362 value added 286 Whyte, W.F. 130, 348, 352, 386
Tomic, Welko 76, 597–8 value of research 646–8 wicked problems 136–7
total quality management 65 values 35, 36, 46, 304–5 Wiefferinck, C. 612–13
transcription 465, 466, 477 Van Petergern, Peter 632 Willett, Rebekah 507
transforming data 547–8 variables 78, 294, 295, 436; and data word clouds 478
transparency 5, 18–19, 293, analysis 284–6; dependent word trees 478, 479
459, 514 279, 585; indicators 280–4, World Bank 186, 187
Treasury (UK) 177 294; and statistical testing 585, World Health Organisation
tree maps 449–450, 451, 452, 454 586; types of 279–80; see also (WHO) 248
triangulation 130–1, 134, 155, 229 independent variables Wright, Patricia Ann 471
Trotman, D. 39 variance 540; see also analysis of
Truss, Lynne 466 variance (ANOVA) Young People’s Social Attitudes
truth 16, 37, 47, 48, 97 video recording, of focus groups 367 Survey 305
Tukey, John 422 video surveillance 386
two-tailed test 578, 579–82 videos 344–5, 367 Z table 569, 579, 580, 617
Tynjälä, P. 597 visual images 343–5, 367; discourse Z values 569–70, 571, 572–3,
analysis 502–3; as source 579–80, 617
United Kingdom: cohort studies 55; of data 458; as stimuli Zhou, L. 599
Every Child Matters initiative 343–4, 367 Zientek, Linda 591
277, 404–5; government visual inspection of data 399–404 Zolingen, Simone 256