9449692
9449692
https://ebooknice.com/product/biota-grow-2c-gather-2c-cook-6661374
ebooknice.com
https://ebooknice.com/product/matematik-5000-kurs-2c-larobok-23848312
ebooknice.com
https://ebooknice.com/product/sat-ii-success-
math-1c-and-2c-2002-peterson-s-sat-ii-success-1722018
ebooknice.com
(Ebook) Master SAT II Math 1c and 2c 4th ed (Arco Master the SAT
Subject Test: Math Levels 1 & 2) by Arco ISBN 9780768923049,
0768923042
https://ebooknice.com/product/master-sat-ii-math-1c-and-2c-4th-ed-
arco-master-the-sat-subject-test-math-levels-1-2-2326094
ebooknice.com
(Ebook) Cambridge IGCSE and O Level History Workbook 2C - Depth Study:
the United States, 1919-41 2nd Edition by Benjamin Harrison ISBN
9781398375147, 9781398375048, 1398375144, 1398375047
https://ebooknice.com/product/cambridge-igcse-and-o-level-history-
workbook-2c-depth-study-the-united-states-1919-41-2nd-edition-53538044
ebooknice.com
https://ebooknice.com/product/digital-curation-36371378
ebooknice.com
https://ebooknice.com/product/digital-curation-in-the-digital-
humanities-preserving-and-promoting-archival-and-special-
collections-5138070
ebooknice.com
https://ebooknice.com/product/the-preservation-management-
handbook-34485152
ebooknice.com
https://ebooknice.com/product/fundamental-building-materials-4101584
ebooknice.com
Preserving digital materials 2nd Edition Ross Harvey
Digital Instant Download
Author(s): Ross Harvey
ISBN(s): 9783110253696, 3110253690
Edition: 2
File Details: PDF, 1.46 MB
Year: 2011
Language: english
Ross Harvey
Preserving Digital Materials
Current Topics
in Library and Information Practice
De Gruyter Saur
Ross Harvey
Preserving
Digital Materials
2nd Edition
De Gruyter Saur
ISBN 978-3-11-025368-9
e-ISBN 978-3-11-025369-6
ISSN 2191-2742
List of figures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . X
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
Chapter 1
What is Preservation in the Digital Age? Changing Preservation
Paradigms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
Changing paradigms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
The need for a new preservation paradigm . . . . . . . . . . . . . . . . . . . . . . . 10
Changing definitions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
Preservation definitions in the digital world . . . . . . . . . . . . . . . . . . . . . . 16
What exactly are we trying to preserve? . . . . . . . . . . . . . . . . . . . . . . . . . 21
How long are we preserving them for? . . . . . . . . . . . . . . . . . . . . . . . . . . 23
What strategies and actions do we apply? . . . . . . . . . . . . . . . . . . . . . . . . 24
Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
Chapter 2
Why do we Preserve? Who Should do it? . . . . . . . . . . . . . . . . . . . . . . 25
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
Why preserve digital materials? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
Professional imperatives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
New stakeholders . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
How much data have we lost? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
Current state of awareness of digital preservation problems. . . . . . . . . . . 37
Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
Chapter 3
Why There’s a Problem: Digital Artifacts and Digital Objects . . . . . . . 39
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
Modes of digital death . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
Digital storage media. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
Magnetic media . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
Optical disks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47
The future for digital storage media . . . . . . . . . . . . . . . . . . . . . . . . . 49
VI Contents
Chapter 4
Selection for Preservation – The Critical Decision. . . . . . . . . . . . . . . . 56
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56
Selection for preservation, cultural heritage, and professional practice. . . 57
Selection criteria traditionally used by libraries and archives . . . . . . . . . . 59
Why traditional selection criteria do not apply to digital materials . . . . . . 63
IPR, context, stakeholders, and lifecycle models. . . . . . . . . . . . . . . . . . . 65
Intellectual property rights and legal deposit. . . . . . . . . . . . . . . . . . . 65
Context and community. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66
Stakeholder input . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67
Value of lifecycle models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68
Developing selection frameworks for preserving digital materials . . . . . . 69
Some selection frameworks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71
How much to select? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73
Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74
Chapter 5
What Attributes of Digital Materials Do We Preserve? . . . . . . . . . . . . 75
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75
Digital materials, technology, and data. . . . . . . . . . . . . . . . . . . . . . . . . . 77
The importance of preserving context. . . . . . . . . . . . . . . . . . . . . . . . . . . 79
The OAIS Reference Model. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 80
The role of metadata . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82
Preservation metadata . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84
Preservation metadata standards. . . . . . . . . . . . . . . . . . . . . . . . . . . . 85
Persistent identifiers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 86
Authenticity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
Significant properties . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 90
Research into authenticity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95
Functional Requirements for Evidence in Recordkeeping Project
(Pittsburgh). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95
InterPARES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95
Trusted digital repositories . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97
Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98
Contents VII
Chapter 6
Overview of Digital Preservation Strategies . . . . . . . . . . . . . . . . . . . . 99
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99
Historical overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101
Who is doing what?. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 103
Criteria for effective strategies and practices. . . . . . . . . . . . . . . . . . . . . . 107
Broader concerns . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111
Standards . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111
Planning . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111
Policies. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 112
Sustainability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113
Typologies of principles, strategies, and practices. . . . . . . . . . . . . . . . . . 114
A typology of digital preservation? . . . . . . . . . . . . . . . . . . . . . . . . . . . . 118
Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 120
Chapter 7
‘Preserve Technology’ Approaches: Tried and Tested Methods. . . . . . 121
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 121
‘Non-solutions’ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 122
Do nothing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 124
Storage and handling practices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 124
Durable/persistent digital storage media . . . . . . . . . . . . . . . . . . . . . . 127
Analogue backups. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 129
Digital archaeology and digital forensics . . . . . . . . . . . . . . . . . . . . . 130
‘Preserve technology’ approaches . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 131
Technology preservation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 132
Technology watch . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 133
Emulation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 134
The Universal Virtual Computer . . . . . . . . . . . . . . . . . . . . . . . . . . . 138
Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 139
Chapter 8
‘Preserve Objects’ Approaches: New Frontiers? . . . . . . . . . . . . . . . . . 140
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 140
‘Preserve Objects’ approaches . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 140
Bit-stream copying, refreshing, and replication . . . . . . . . . . . . . . . . . . . . 142
Bit-stream copying . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 142
Refreshing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 142
Replication . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 143
Standard data formats . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 143
File format registries . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 147
VIII Contents
Chapter 9
Digital Preservation Initiatives and Collaborations . . . . . . . . . . . . . . . 168
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 168
Collaboration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 168
Typologies of digital preservation initiatives . . . . . . . . . . . . . . . . . . . . . 171
International initiatives and collaborations . . . . . . . . . . . . . . . . . . . . . . . 172
International services. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 173
The Internet Archive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 173
JSTOR . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 174
DuraSpace . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 175
LOCKSS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 178
MetaArchive Cooperative . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 179
International alliances . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 180
UNESCO . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 180
PADI . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 181
OCLC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 182
CAMiLEON . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 183
International Internet Preservation Consortium . . . . . . . . . . . . . . 183
Regional initiatives and collaborations . . . . . . . . . . . . . . . . . . . . . . . . . . 185
Regional services . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 185
NEDLIB . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 185
Regional alliances . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 185
ERPANET . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 185
European Commission-funded projects . . . . . . . . . . . . . . . . . . . 186
Digital Recordkeeping Initiative . . . . . . . . . . . . . . . . . . . . . . . . 188
National initiatives and collaborations . . . . . . . . . . . . . . . . . . . . . . . . . . 187
National services. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 188
AHDS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 188
Florida Digital Archive . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 189
National alliances . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 190
Digital Curation Centre . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 190
Digital Preservation Coalition . . . . . . . . . . . . . . . . . . . . . . . . . . 191
Contents IX
NDIIPP. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 192
National Digital Stewardship Alliance . . . . . . . . . . . . . . . . . . . . 193
HathiTrust . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 194
Sectoral initiatives and collaborations . . . . . . . . . . . . . . . . . . . . . . . . . . 195
Sectoral services . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 195
Cedars . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 196
Sectoral alliances . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 196
JISC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 196
Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 197
Chapter 10
Challenges for the Future of Digital Preservation . . . . . . . . . . . . . . . . 199
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 199
What have we learned so far? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 201
Four major challenges . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 206
Challenge 1: managing digital preservation . . . . . . . . . . . . . . . . . . . 206
Challenge 2: funding digital preservation . . . . . . . . . . . . . . . . . . . . . 208
Challenge 3: peopling digital preservation . . . . . . . . . . . . . . . . . . . . 211
Challenge 4: making digital preservation fit . . . . . . . . . . . . . . . . . . . 213
Research and digital preservation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 215
Conclusion: the future of digital preservation . . . . . . . . . . . . . . . . . . . . . 219
Bibliography . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 221
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 241
List of Figures
– The likely high costs of taking action, and the likely high costs of delaying or not
taking action (including the likelihood of loss of access)
– A mismatch between funding cycles and long term preservation commitments, even
for long existing institutions …, leading to the possibility that some preservation
commitments may have to be given priority over others
– Intellectual property and other rights-based constraints on preservation processes
and on the provision of access
– Administrative complexities in ensuring timely action is taken that will be cost-
effective over very long periods of time
– The need to develop and maintain suitable knowledge and systems to deal with
these challenges (National Library of Australia, 2008).
The need for digital preservation touches all our lives, whether we work in commercial
or public sector institutions, engage in e-commerce, participate in e-government, or use
a digital camera. In all these instances we use, trust and create e-content, and expect
that this content will remain accessible to allow us to validate claims, trace what we
have done, or pass a record to future generations (NSF-DELOS Working Group on
Digital Archiving and Preservation, 2003, p.i).
These words remain as relevant now as when they were written almost ten
years ago.
We cannot expect a technological quick fix. We now appreciate that the
challenges of maintaining digital materials so they remain accessible in the
future are not just technological. They are equally bound up with organiza-
tional infrastructure, resourcing, and legal factors, and we have not yet got
the balance right. These and other factors combine to make the task difficult,
although there are clear pointers to the way ahead. As Breeding (2010, p.32)
notes, ‘while the current state of the art in digital preservation falls short of an
4 Introduction
ideal system that guarantees permanent survival, much has been done to address
the vulnerabilities inherent in digital content’.
Both the library community and the recordkeeping community (archivists
and records managers), as well as an increasing number of other groups, are
energetically seeking solutions to the challenges of digital preservation. Over
the last decade there has been increased sharing of the outcomes of research
and practice. Developments in one community have considerable potential to
assist practice in other information and heritage communities. This book goes
some way towards addressing this need by providing examples from several
different communities.
Although much high-quality information is available to information pro-
fessionals concerned with preserving materials in digital form, most notably on
the web, its sheer volume causes problems for busy information professionals,
scholars and scientists, and individuals who wish to understand the issues and
learn about strategies and practices for digital preservation. Preserving Digital
Materials is written for these time-poor information professionals, scholars
and scientists, and individuals. Its synthesis of current information, research
and perspectives about digital preservation from a wide range of sources
across many areas of practice makes it of interest to a wide range of readers
from preservation administrators and managers who want a professional refer-
ence text to thinking practitioners who wish to reflect on the issues that digital
preservation raises in their professional practice. It will also be of interest to
students.
The reader should note two features of this book. Preserving Digital Mate-
rials is not a how-to-do-it manual, although it does include information about
practical applications, so it is not the place to learn how to apply the technical
procedures of digital preservation. It is not primarily concerned with digitiza-
tion and makes little distinction between information that is born-digital and
information that is digitized from physical media.
This book addresses four key questions which give the text its four-part
structure:
The first edition of Preserving Digital Materials (2005) used many Australian
examples, because Australian practice in digital preservation – from the library,
recordkeeping, audiovisual archiving, data archiving and geoscience sectors –
was often at the forefront of international best practice. This second edition of
Preserving Digital Materials provides a more international perspective, noting
major initiatives in the UK, the EU and the US since 2005. It is possible to do
this in 2011 because of the considerable quantity of material reported by these
and many other initiatives and readily available on web sites, in conference
proceedings and from other public sources.
As noted above, there is a considerable amount of high-quality information
available about preserving materials in digital form, much of it available on the
web. The accessibility that this provides is countered by the impermanence of
much web material, as noted in several chapters in this book. All URLs in this
book were correct at the time of writing.
The first edition of this book acknowledged my indebtedness to many
people, and these debts still remain. Producing the first edition I benefited from
discussions with many colleagues at that time. In particular, I acknowledged
the following individuals for their ideas and support: Tony Dean for suggesting
the example of Piltdown Man; Liz Reuben, Matthew Davies, Stephen Ellis and
Rachel Salmond for case studies; Alan Howell, of the State Library of Victoria,
and staff of the National Library of Australia, in particular Pam Gatenby, Colin
Webb, Kevin Bradley and Margaret Phillips, for their assistance with clarifying
6 Introduction
concepts. Some of the material in the first edition was based on interviews
with Australian digital preservation experts, whose assistance and encourage-
ment was invaluable: Toby Burrows, Mathew Davies, Ray Edmondson, Stephen
Ellis, Alan Howell, Maggie Jones, Gavan McCarthy, Simon Pockley, Howard
Quenault, Lloyd Sokvitne, Paul Tresize, and Andrew Wilson. Heather Brown
and Peter Jenkins provided examples, and their assistance and the permission
of the State Library of South Australia was gratefully acknowledged. Thanks
were due to Ken Thibodeau, CLIR, ERPANET and UNESCO for permission
to use their material. I acknowledged my gratitude to my then employer, Charles
Sturt University, which supported me by providing study leave in 2003. I was
fortunate to be based at the National Library of Australia as a National Library
Fellow from March to June 2003 and I greatly appreciated the generous sup-
port of its then Director-General, Jan Fullerton, and other staff of the National
Library. Finally, I acknowledged the unfailing support of Rachel Salmond in
this and others of my endeavours.
In writing the second edition of this book I have incurred new debts. In
addition to those noted for the first edition, I gratefully acknowledge students
who have enrolled in my courses on digital preservation at Yonsei University,
Seoul, and the Graduate School of Library and Information Science, Simmons
College, Boston. My ideas have been informed by conversations with people too
numerous to name, but I wish to particularly thank Jeannette Bastian, Michèle
Cloonan, Joy Davidson, Cal Lee, Michael Lesk, Martha Mahard, Seamus Ross,
Anne Sauer, Shelby Sanett, and Terry Plum. I am grateful for the support of
my current employer, the Graduate School of Library and Information Science,
Simmons College, Boston.
I must again acknowledge the unfailing support of Rachel Salmond. I owe
Rachel more than I can adequately express here for her help over three decades,
her editorial assistance and her patience with me as the preparation of this
book took over normal schedules.
Chapter 1
What is Preservation in the Digital Age?
Changing Preservation Paradigms
Introduction
To preserve, as the dictionary reminds us is to keep
safe … to maintain unchanged … to keep or maintain
intact. But the rapid obsolescence of information
technology entails the probability that any digital
object maintained unchanged for any length of time
will become inaccessible (Thibodeau, 1999)
Any discussion about the preservation of digital materials must begin with the
consideration of two interlinked areas: changing preservation paradigms, and
definitions of terms. Without a clear understanding of what we are discussing,
the potential for confusion is too great. In library and recordkeeping practice
we are moving rapidly from collection-based models, whose principles and
practices have been developed over many centuries, to models where collec-
tions are not of paramount importance and where what matters is the extent of
access provided to information resources, whether they are managed locally or
remotely. Archivists have considered, debated, and sometimes applied the con-
cept of non-custodial archives, where there is no central collection, to accom-
modate the massive increase in numbers of digital records. Librarians manage
hybrid libraries, consisting of both physical collections and distributed digital
information resources, and digital libraries. Other stakeholders with a keen inter-
est in digital preservation manage digital information in specific subject areas,
such as geospatial data or social science data. In the past this material, where it
existed, was maintained as collections of paper and other physical objects. The
practices developed and applied in libraries and archives are still largely based
on managing physical collections and cannot be applied automatically to man-
aging digital collections.
The changing models of library and recordkeeping practice require new
definitions. The old terms do not always convey useful meanings in the digital
environment and can be misleading and, on occasion, even harmful. In library
and recordkeeping practice we are changing from a preservation paradigm
where primary emphasis is placed on preserving the physical object (the arti-
fact as carrier of the information we wish to retain, for example, a CD) to one
where there is no physical carrier to preserve. What, then, does the term pres-
ervation mean in the digital environment? How has its meaning changed?
8 What is Preservation in the Digital Age?
What are the implications of these changes? The phrase benign neglect pro-
vides an example of a concept that is helpful in the pre-digital preservation
paradigm but is harmful in the new. It refers to the concept that many informa-
tion carriers made of organic materials (most notably paper-based artifacts)
will not deteriorate rapidly if they are left undisturbed. For digital materials
this concept is positively harmful. One thing we understand about information
in digital form is that actions must be applied almost from the moment it is
created, if it is to survive. Pre-digital paradigm definitions do not accommodate
new forms, such as works of art that incorporate digital technologies, and time-
defined creative enterprises such as performance art.
This chapter examines the effect of digital information on ‘traditional’
librarianship and recordkeeping paradigms, noting the need for a new preser-
vation paradigm in an environment that is dynamic and has many stakeholders,
often with competing interests. It considers the differences between born-
digital and digitized information, and defines key terms.
Changing paradigms
It is now commonplace to hear or read that we live in an information society,
of which one main characteristic is the widespread and increasing use of net-
worked computing, which relies on data. This is revolutionizing the way in
which large parts of the world’s population live, work and play, and how
libraries, archives, museums and other institutions concerned with preserving
documentary heritage function and are managed. New expectations of these
institutions are evolving.
The significance of these changes is readily illustrated by just one example.
The internet is rapidly becoming the first choice for people who are searching
for information on a subject, and a new verb, to google (derived directly from
Google, the name of a widely used internet search engine) has entered our
vocabulary. The sheer size and rapid rate of the internet’s growth mean that
no systems have been developed to provide comprehensive access to it. The
systems that do exist are embryonic and experimental, and the quality of the
information available on the web is variable. Attempts to estimate the rate of
the internet’s growth have included counting the number of domain names
over several years. There has been a dramatic increase in the number of domain
names since 1994, when only a small number were registered, rising to around
100 million at the start of 2001 and, ten years later, to almost 800 million in
2010 (Internet Systems Consortium, 2011).
These major changes – it is not too extreme to call it a revolution – raise
the question of how to keep the digital materials we decide are worth keeping.
The ever-increasing quantities being produced do not assist us in finding an
answer. ‘According to a recent study by market-research company IDC … the
Changing paradigms 9
size of the information universe is currently 800,000 petabytes. … but it’s just
a down payment on next year’s total, which will reach 1.2 million petabytes, or
1.2 zettabytes. If these growth rates continue, by 2020 the digital universe will
total 35 zettabytes, or 44 times more than in 2009’ (Tweney, 2010). Nor does
the rapidity with which changes in computer and information technology occur.
The challenges are new and complex for nearly all aspects of librarianship and
recordkeeping, including preservation.
There have also been changes in the ways in which information is pro-
duced and becomes available to communities of users. The internet is only one
of these ways. In the pre-digital (print) environment the processes of creation,
reproduction and distribution were separate and different; now, ‘technology
tends to erase distinctions between the separate processes of creation, repro-
duction and distribution that characterize the classic industrial model of print
commodities’ (Nurnberg, 1995, p.21). This has significant implications for
preservation, especially in terms of who takes responsibility for it and at what
stage preservation actions are first applied. For instance, in the industrial-mode
print world, acquiring the artifact – the book – so that it could be preserved
occurred by means such as legal deposit legislation, requiring publishers to
provide copies to libraries for preservation and other purposes. If the creator is
now also the publisher and distributor, as is often the case in the digital world,
who has the responsibility of acquiring the information? These points are noted
in more detail later in this book.
New ways of working and new structures are developing. Cyberscholarship
(known also as e-science or e-research) is based on ready access to digital mate-
rials and applies computing techniques to analyze, visualize and present results.
This research is typically highly collaborative, being based on the use of large
data sets produced and shared by international communities of scholars. The
practices developed in this cyberscholarship environment are significantly dif-
ferent from traditional practices. Other characteristics of cyberscholarship also
illustrate different practices. The enhanced ability to compute large quantities of
data, such as using visualizations and simulations, provide new possibilities,
some of which can be seen in the Electronic Cultural Atlas Initiative (ecai.org).
The generation of large quantities of data places heavy demands on how data are
stored and managed. Heavy emphasis is placed on sharing and re-using digital
information. All of these factors place different demands on how digital informa-
tion is managed, including on its preservation over time to ensure it remains
available and usable in the future. A 2008 study carried out for the Association
of Research Libraries provides examples of cyberscholarship in humanities, so-
cial sciences and scientific/technical/medical subject areas in the US (Maron and
Kirby Smith, 2008). Changing information practices in the humanities, in the
UK specifically, are described by Bulger and her colleagues (Bulger et al., 2011).
Cyberinfrastructure refers to the computer networks, libraries and archives,
online repositories and other resources needed to support cyberscholarship.
10 What is Preservation in the Digital Age?
– When materials are treated, the treatments should, when possible, be re-
versible
– Whenever possible or appropriate, the originals should be preserved; only
materials that are untreatable should be reformatted
– Library materials should be preserved for as long as possible
– Efforts should be put into preventive conservation, and aimed at providing
appropriate storage and handling of artifacts
– Benign neglect may be the best treatment (derived from Cloonan (1993,
p.596), Harvey (1993, pp.14,140), and Bastian, Cloonan and Harvey (2011,
pp.612-613)).
The definitions associated with the old preservation paradigm are firmly rooted
in the conservation of artifacts – the physical objects that carry the information
content. In fact, the term ‘materials conservation’ is sometimes used, especially
by museums. The definitions provided in the IFLA Principles for the Care and
Handling of Library Materials (Adcock, 1998), widely adopted in the library
and recordkeeping contexts, articulate principles firmly based on maintenance
of the physical artifact. The definition of Conservation notes that its aims are
to ‘slow deterioration and prolong the life of an object’, and that of Archival
The need for a new preservation paradigm 11
the information landscape has changed, thanks to the digital revolution. Libraries are
working to integrate access to print materials with access to digital materials. There is
likewise a challenge to integrate the preservation of analog and digital materials. Preser-
vation specialists have been trained to work with print-based materials, and they are
justifiably concerned about the increased complexity of the new preservation agenda
(Kenney and Stam, 2002, p.v).
Marcum’s description of the situation is still accurate ten years later. Addition-
ally research libraries are seeking new roles as experts in the curation of digital
materials (Walters and Skinner, 2011).
What is the new preservation agenda? How has the preservation paradigm
changed to accommodate it? Pre-digital preservation paradigm thinking does
include some useful understanding of digital preservation. For example, it
recognizes that copying (as in refreshing from tape to tape) is the basis of digital
12 What is Preservation in the Digital Age?
Further key elements are the scale and nature of the digital information we
wish to maintain into the future and the preservation challenges these pose.
The complexities of the variety of digital materials are described in this way:
Digital objects worthy of preservation include databases, documents, sound and video
recordings, images, and dynamic multi-media productions. These entities are created
on many different types of media and stored in a wide variety of formats. Despite a
steady drop in storage costs, the recent influx of digital information and its growing
complexity exceeds the archiving capacity of most organizations (Workshop on Research
Challenges in Digital Archiving and Long-term Preservation, 2003, p.7).
Arising from these key factors is the need for new kinds of skills. Current
preservation skills and techniques are labour-intensive and, even where ap-
propriate, do not scale up to the massive quantities of digital materials we are
already encountering. The problem cannot simply be addressed by technologi-
cal means. What kind of person will implement the new policies and develop
the new procedures required to maintain digital materials effectively into the
future? New kinds of positions, requiring new skill sets are already being es-
tablished in libraries and archives. Among key selection criteria for a Digital
Archivist at the MIT Libraries in Cambridge, Massachusetts, advertised in
May 2011, were:
People with these skill sets are still in short supply. But it is more than new
skills that is required. We need to redefine the field of preservation and the
terms we use to describe preservation activities.
Changing definitions
Pre-digital preservation paradigm definitions do not convey useful meanings
when they are applied to digital preservation. What are they?
14 What is Preservation in the Digital Age?
Currently ‘conservation’ is the more specific term and is particularly used in relation to
specific objects, whereas ‘preservation’ is a broader concept covering conservation as
well as actions relating to protection, maintenance, and restoration of library collections.
The eminent British conservator, Christopher Clarkson, emphasizes this broader aspect
when he states that preservation ‘encompasses every facet of library life’: it is, he says,
‘preventive medicine ... the concern of everyone who walks into, or works in, a library.’
For Clarkson conservation is ‘the specialized process of making safe, or to a certain
degree usable, fragile period objects’ and ‘restoration’ expresses rather extensive rebuild-
ing and replacement by modern materials within a period object, catering for a future
of more robust use.’ He neatly distinguishes the three terms by relating them to the extent
of operations applied to an item: ‘restoration implies major alterations, conservation
minimal and preservation none’ (Harvey, 1993, pp.6-7).
It is clear that professionals are revising their definitions of preservation from a once-
and-forever approach for paper-based materials to an all-the-time approach for digital
materials. Preservation must now accommodate both media and access systems …
while we once tended to think about preserving materials for a particular period of time
– for example, permanent/durable paper was expected to last for five hundred years –
we now think about retaining digital media for a period of continuing value (Cloonan
and Sanett, 2002, p.93).
the paradigms of any of the information professions come up short when compared with
the scope of the issues continuously emerging in the digital environment. An overarching
dynamic paradigm – that adopts, adapts, develops, and sheds principles and practices
of the constituent information communities as necessary – needs to be created.
Preserving the original bit-stream is only one part of the problem; equally im-
portant is the requirement to preserve ‘the means of interpreting, reading and
utilizing the bit stream’ (Deegan and Tanner, 2002).
The difficulties of definition are not helped by disciplinary differences.
There are, for instance, differences in the way archivists and librarians use terms.
Some terms, such as integrity and authenticity, arise from the world of archives
and were not, until recently, usually associated with the work of librarians.
These differences, however, pale in comparison with the significantly different
definitions used in the IT industry. How IT professionals think about the long-
term storage of data is a question that assumes importance for digital preserva-
tion because of the heavy reliance that information professionals place on their
skills and services. There is abundant evidence that they think very differently
about preservation. Definitions of archive, archiving and archival storage give
us some indication of the mindset of IT professionals. A selection of online
dictionaries of information technology indicate that the terms are used in two
ways:
1. The process of moving data to a different kind of storage medium: for ex-
ample, ‘archive … 2 verb to put data in storage … on backing storage (such
as magnetic tape rather than a hard disk)’ (Collins, 2002)
2. The process of backing up data for long-term storage: for example, ‘ar-
chive (v.) To copy files to a long-term storage medium for backup … On
smaller systems archiving is synonymous with backing up’ (Webopedia,
2011).
Few of the definitions located display any interest or concern with the reasons
why long-term storage might be required, although one earlier definition is a
notable exception: ‘archiving Long term storage of information on electronic
media. Information is archived for legal, security or historical reasons, rather
than for regular processing or retrieval’ (Gunton, 1993, p.11). Perhaps the mind-
set of IT professionals is better indicated by this excerpt: ‘You detect data that’s
not needed online and move it an off-shore store. When someone wants to use
it, go find the off-line media and restore the data’ (Faulds and Challinor, 1998,
p.280).
Preservation definitions in the digital world 19
A period of time long enough for there to be concern about the impacts of changing
technologies, including support for new media and data formats, and of a changing user
community, on the information being held in a repository (International Organization
for Standardization, 2003, p.1-11).
items, even if that is done for preservation purposes’. Such statements are
worth making firmly because of the misconception still too commonly encoun-
tered in the information professions that digitizing of analogue materials, usually
photographs or paper-based material, is sufficient for preservation purposes.
This is not the case, as a 2010 report of LIBER members (Ligue des Biblio-
thèques Européennes de Recherche, representing European research libraries)
indicates:
Making the digitised material available and visible online is only one of the challenges
faced ... Another lies in assuring long-term access to them. Digitised materials í like
other digital data í are also fragile items and need special measures and arrangements
in order to be accessible despite technological change. While the preservation of paper
documents is well understood and is supported by a well-established infrastructure and a
profession of librarians and other experts, the preservation of digital objects in general
and digitised material in particular is a relatively new task for libraries and poses great
challenges in terms of the expertise and resources required (Bergau, 2010, p.6).
In terms of how they are preserved, though, the definitions in these two sources
make no distinction between born-digital materials and digital materials created
by digitizing analogue materials. This is acknowledged in the Digital Preserva-
tion Coalition’s definition of digital materials, which covers both ‘digital sur-
rogates created as a result of converting analogue materials to digital form
(digitisation), and “born digital” for which there has never been and is never
intended to be an analogue equivalent, and digital records’ (Digital Preservation
Coalition, 2008, p.24). These definitions also make clear that it is not only the
bit-stream that we seek to preserve. In order to ensure access in the future to
digital materials, we also need to take account of other attributes of digital
materials. The UNESCO Guidelines indicate this in the definition of information
packages, which comes from the OAIS Reference Model in Chapter 5. In addi-
tion to the bit-stream, which is typically ‘not understandable or re-presentable’
by itself, ‘any information and tools that would be needed in order to access
and understand’ the digital materials must also be preserved (UNESCO, 2003,
p.39).
The definitions are also very clear about the need to maintain other attributes
of digital materials. To ensure that digital materials remain usable in the future,
access to them is required – and not simply access, but access to ‘all qualities
of authenticity, accuracy and functionality’ (Digital Preservation Coalition,
2008, p.24). This, in turn, requires definitions of authenticity, expressed by the
UNESCO Guidelines as the ‘quality of genuineness and trustworthiness of
some digital materials, as being what they purport to be, either as an original
object or as a reliable copy derived by fully documented processes from an
original’ (UNESCO, 2003, p.157). (Note the emphasis on the significance of
full documentation to ensure authenticity; this has important implications for
digital preservation, noted in Chapter 5.) Four further definitions in the
How long are we preserving them for? 23
Conclusion
This chapter has introduced some of the key concepts that are reshaping preser-
vation practice in the digital environment. It notes the need for new ways of
thinking about preservation and poses three key questions that need to be con-
sidered when we think about the preservation of digital materials:
These questions and other themes introduced in Chapter 1 are explored in the
rest of this book.
Chapter 2
Why do we Preserve? Who Should do it?
Introduction
Society, of course, has a vital interest in preserving
materials that document issues, concerns, ideas, dis-
course and events ... The ability of a culture to survive
into the future depends on the richness and acuity of
its members’ sense of history (Task Force on Archiv-
ing of Digital Information, 1996, p.1)
preservation worth doing, it is also, some suggest, a duty. Agresto, former head
of the US National Endowment for the Humanities, suggested that ‘we have a
human obligation not to forget’ (cited in Harvey, 1993, p.7) and that preserva-
tion is essential for the well-being of democracies that depend ‘on knowledge
and the diffusion of knowledge’ and on ‘knowledge shared’ (Harvey, 1993,
p.7). Even greater claims are made: ‘the ability of a culture to survive into the
future’ depends on the preservation of knowledge (Task Force on Archiving of
Digital Information, 1996, p.1).
The cultural and political imperatives that have led to preservation being
considered as fundamental have been explored in books, such as Lowenthal’s
The Past is a Foreign Country (Lowenthal, 1985) and Taylor’s Cultural Selec-
tion (Taylor, 1996), which persuade us that preservation is not simply the con-
cern of a limited number of cultural heritage institutions and professions, but
has dimensions that have significant impact, both limiting and sustaining, on
most aspects of society. There is, in fact, no single reason why we preserve
knowledge. Preservation, suggests Cloonan (2001, p.231), ‘has a life force
fueled by many (often disparate) sources’.
None of these reasons change when we consider the preservation of
knowledge encoded in digital materials, but the rhetoric alters to emphasize
economic rationales. Preserving digital materials is essential. If we do not attend
to it ‘what is at stake is the loss of data representing billions of dollars of in-
vestment in new intormation technology, new scientific discoveries, and new
information on which our economic prosperity and national security depend’
(NDIIPP, 2011, p.1). Evidential and accountability reasons are also commonly
given: ‘we expect that this [digital] content will remain accessible to allow us to
validate claims, trace what we have done, or pass a record to future generations’,
states the NSF-DELOS Working Group on Digital Archiving and Preservation
(2003, p.[i]), who also specify five conditions for preservation, any one of
which is sufficient to provide a benefit to society:
– If unique information objects that are vulnerable and sensitive and therefore subject
to risks can be preserved and protected;
– If preservation ensures long-term accessibility for researchers and the public;
– If preservation fosters the accountability of governments and organisations;
– If there is an economic or societal advantage in re-using information, or
– If there is a legal requirement to keep it (NSF-DELOS Working Group on Digital
Archiving and Preservation, 2003, p.3).
What are, and will be, the social contexts and institutions for preserving digital docu-
ments? Indeed, what new kinds of institutions are possible in cyberspace, and what
technologies will support them? What kind of new social contexts and institutions
should be invented for cyberspace? (Lyman and Kahle, 1998).
Such new social contexts are emerging and their digital content is deemed
worth preserving. The Library of Congress’s work in preserving Twitter content
(Watters, 2011) and the Schlesinger Library’s in preserving blogs (Dunn, 2009)
are two examples.
The very aims of preservation are also being questioned – ‘What are we
preserving? For whom? And why?’ (NSF-DELOS Working Group on Digital
Archiving and Preservation, 2003, p.2) – and the expanded number of stake-
holders in the digital age means that a range of different interests must be con-
sidered.
Professional imperatives
What do these changes mean for libraries and archives? Have there been signifi-
cant changes in their practices?
At one level, there has been little change. Libraries are still, to use Deanna
Marcum’s words, ‘society’s stewards of cultural and intellectual resources’
(Kenney and Stam, 2002, p.v). Preservation is nothing less than core business
for libraries who maintain collections for use in the future. One typical view of
the preservation role of libraries is Gorman’s statement:
Libraries have a duty to preserve and make available all the records of humankind.
That is a unique burden. No other group of people has ever been as successful in pre-
serving the records of the past and no other group of people has that mission today ... Let
there be no mistake: if we librarians do not rise to the occasion, successive generations
will know less and have access to less for the first time in human history. This is not a
challenge from which we can shrink or a mission in which we can fail (Gorman, 1997).
items to be used longer before they wear out, and by the ‘just in case’ argument:
‘It cannot easily be predicted what will be of interest to researchers in the
future. Preserving current collections is the best way to serve future users’
(Adcock, 1998, p.8).
Similarly, there has been no fundamental change in archivists’ secure under-
standing of their preservation responsibilities. They have typically placed the
physical care of their collections at least on a par with, if not at a higher level
of importance than, the provision of access to those collections. This ‘physical
defence of archives’ was indeed considered paramount by the British archivist
Sir Hilary Jenkinson, who formulated this influential statement in 1922:
The duties of the Archivist ... are primary and secondary. In the first place he has to
take all possible precautions for the safeguarding of his Archives and for their custody
... Subject to the discharge of these duties he has in the second place to provide to the
best of his ability for the needs of historians and other research workers. But the position
of primary and secondary must not be reversed (Jenkinson, 1965, p.15).
– Working with information creators to identify requirements for the long-term man-
agement of information;
– Identifying the roles and responsibilities of those who create, manage, provide ac-
cess to, and preserve information
– Ensuring the creation and preservation of reliable and authentic materials;
– Understanding that information can be dynamic in terms of form, accumulation,
value attribution, and primary and secondary use; …
– Identifying evidence in materials and addressing the evidential needs of materials
and their users through archival appraisal, description, and preservation activities
(Gilliland-Swetland, 2000, p.21).
Nor should we forget the specific legal reasons for preservation. In the case of
archives these reasons are often connected to administrative and political ac-
countability. For some types of libraries, statutory responsibilities require that
preservation is their core business. National libraries, for example, have a stat-
utory responsibility for collecting and safeguarding access to information pub-
lished in their countries.
While the traditional preservation responsibilities of libraries and archives
may remain the same, there has been significant change in the ways in which
they are interpreted and operationalized as digital materials have become prev-
New stakeholders 29
alent. In their report about repositioning research libraries Walters and Skinner
(2011, p.57) state firmly that
very few research libraries should have more than half of their infrastructure devoted to
physical collections at this point in time. The library needs to think of digital curation as a
core function of the library and to invest financial and other resources into it accordingly.
New expertise and new perspectives are required, without discarding the prin-
ciples of preservation developed for non-digital materials. Of greatest signifi-
cance is the need to engage with other stakeholders and ‘form new alliances and
partnerships’ (Webb, 2000). These new stakeholders may not always embrace
engagement willingly and will often need to be convinced of their roles, as
Hilton, Thompson and Walters (2010) point out when writing of donations of
digital material to the Wellcome Library in the UK.
Smith cogently summarizes the concepts in this section:
Society has always created objects and records describing its activities, and it has con-
sciously preserved them in a permanent way … Cultural institutions are recognised
custodians of this collective memory: archives, librar[ies] and museums play a vital
role in organizing, preserving and providing access to the cultural, intellectual and his-
torical resources of society. They have established formal preservation programs for
traditional materials and they understand how to safeguard both the contextual circum-
stances and the authenticity and integrity of the objects and information placed in their
care … It is now evident that the computer has changed forever the way information is
created, managed, archives and accessed, and that digital information is now an integral
part of our cultural and intellectual heritage. However the institutions that have tradi-
tionally been responsible for preserving information now face major technical, organiza-
tional, resource, and legal challenges in taking on the preservation of digital holdings
(B. Smith, 2002, pp.133-134).
New stakeholders
The new challenges of digital preservation call for the involvement of new par-
ticipants. No longer are librarians and archivists the main groups concerned
with preserving digital materials: it is increasingly evident that the cultural heri-
tage institutions traditionally charged with responsibility for preserving materials
cannot continue to carry this responsibility in the digital age without widening
the range of partners in their endeavours. Scholars and scientists who, increas-
ingly, base their research on large data sets, drug companies who need to prove
ownership of intellectual property, lawyers who must keep secure evidence in
digital form, are but a few of myriad potential stakeholders.
Not only are new kinds of stakeholders claiming an interest or claiming
control, but higher levels of collaboration among stakeholders are also com-
monly understood to be necessary for digital preservation to be effective.
30 Why do we Preserve? Who Should do it?
Narrowly focused localized solutions are not considered likely to be the most
effective. Cooperation ‘can enhance the productive capacity of a limited sup-
ply of digital preservation funds, by building shared resources, eliminating
redundancies, and exploiting economies of scale’ (Lavoie and Dempsey, 2004).
The preservation of digital materials has become ‘essentially a distributed proc-
ess’ where ‘traditional demarcations do not apply’ and one for which ‘an inter-
disciplinary approach is necessary’ (Shenton, 2000, p.164).
Collaboration is considered more and more as the only way in which viable
and sustainable solutions can be developed, as the problems are well beyond
the scope of even the largest and most well-resourced single institution.
(UNESCO, 2003, Chapter 11 explores collaboration in more detail, and Chap-
ter 9 of this book provides examples of collaborative activities).
Who, more specifically, are these new stakeholders? What are their preser-
vation roles in an increasingly digital environment? An early indication was
provided by the Task Force on Archiving of Digital Information, whose in-
fluential 1996 report set much of the digital preservation agenda for the fol-
lowing decade. This report suggested that ‘intense interactions among the
parties with stakes in digital information are providing the opportunity and
stimulus for new stakeholders to emerge and add value, and for the relation-
ships and division of labor among existing stakeholders to assume new forms’.
It proposed two principles, the first that information creators, providers and
owners ‘have initial responsibility for archiving their digital information ob-
jects and thereby assuming the long-term preservation of these objects’, and
the second that, where this mechanism fails or becomes unworkable, ‘certified
digital archives have the right and duty’ to preserve digital materials (Task
Force on Archiving of Digital Information, 1996, pp.19-20). Since 1996 the
landscape of digital preservation has become clearer and we now see significant
levels of collaboration, strong emphasis on and involvement of data creators as
first-line preservers, and the development of certified digital archives (trusted
digital repositories).
In addition to data creators and certified digital archives, there are other
new stakeholders. They include commercial services, government agencies,
individuals, rights holders, beneficiaries, funding agencies, and users (Hodge and
Frangakis, 2004, p.15). ‘Hardware and software developers, publishers, produc-
ers, and distributors of digital materials as well as other private sector partners’
(UNESCO, 2004, Article 10) can also be added to the list. All stakeholders
are learning how to work together, learning first to understand the languages
of other disciplines and then working out how complementary skills can fit
together. Collaboration was of course not unknown in the old preservation
paradigm, one example being the collaboration of scholars and librarians to
identify the core literature in specific discipline areas for microfilming and
scanning projects (see Gwinn (1993) for an example in agriculture). The extent
and nature of collaborative activity has, however, intensified. It is encapsulated
New stakeholders 31
The same examples are presented, even when they are no longer in the ‘lost or
compromised’ category: the BBC’s Domesday Project, NASA data, the Viking
Mars mission, the Combat Area Casualty file containing prisoner of war and
missing in action information for the Vietnam war, the first email, the first web
site, as described in more detail below. Attempting to answer the question
requires, first, a consideration of the issue of selection for preservation (noted
in more detail in Chapter 4). A common argument is that anything significant
is likely to be maintained anyway; so should we be concerned about the rest?
Some of the examples that follow assume that the first (email, web site, and so
on) is worth preserving; but is this necessarily the case? It is often a view
developed in hindsight. Betts tells us that Ray Tomlinson, principal engineer at
BBN Technologies in Cambridge, Massachusetts did not save the first network
email ever sent in 1972 because ‘it just didn’t seem worth saving … Even if
backup tapes did exist, they might not be readable. They were just mag tapes,
and after seven or eight years, the oxide starts falling off, especially from tapes
of that era’ (Betts, 1999).
The small number of specific examples located indicate how great the
problem of loss or compromise of digital materials could be. The most often
quoted, indeed overused, examples are those cited in the 1996 report of the Task
Force on Archiving of Digital Information. Because they have been reported
very widely since, they warrant quoting at some length. The report notes the
case of the US Census of 1960.
In 1976, the National Archives identified seven series of aggregated data from the 1960
Census as having long-term historical value. A large portion of the selected records,
however, resided on tapes that the Bureau could read only with a UNIVAC type-II-A
tape drive. By the mid-seventies, that particular tape drive was long obsolete, and the
Census Bureau faced a significant engineering challenge in preserving the data from
the UNIVAC type II-A tapes. By 1979, the Bureau had successfully copied onto industry-
standard tapes nearly all the data judged then to have long-term value (Task Force on
Archiving of Digital Information, 1996, p.2).
The report notes other lost examples, one of them the first email message ‘sent
either from the Massachusetts Institute of Technology, the Carnegie Institute
of Technology or Cambridge University’ in 1964 (Task Force on Archiving of
Digital Information, 1996, p.3).
Rothenberg reminds us of some examples noted in a 1990 US House of
Representatives report:
hundreds of reels of tape from the Department of Health and Human Services; files
from the National Commission on Marijuana and Drug Abuse, the Public Land Law
Review Commission, the President’s Commission on School Finance, and the National
Commission on Consumer Finance; the Combat Area Casualty file containing POW
and MIA information for the Vietnam war; herbicide information needed to analyze the
impact of Agent Orange; and many others (Rothenberg, 1999b, pp.1-2).
Discovering Diverse Content Through
Random Scribd Documents
TRANSCRIBER’S NOTES
Typos fixed; non-standard spelling and dialect
retained.
*** END OF THE PROJECT GUTENBERG EBOOK THE READER'S
GUIDE TO THE ENCYCLOPAEDIA BRITANNICA ***
1.D. The copyright laws of the place where you are located also
govern what you can do with this work. Copyright laws in most
countries are in a constant state of change. If you are outside
the United States, check the laws of your country in addition to
the terms of this agreement before downloading, copying,
displaying, performing, distributing or creating derivative works
based on this work or any other Project Gutenberg™ work. The
Foundation makes no representations concerning the copyright
status of any work in any country other than the United States.
1.E.6. You may convert to and distribute this work in any binary,
compressed, marked up, nonproprietary or proprietary form,
including any word processing or hypertext form. However, if
you provide access to or distribute copies of a Project
Gutenberg™ work in a format other than “Plain Vanilla ASCII” or
other format used in the official version posted on the official
Project Gutenberg™ website (www.gutenberg.org), you must,
at no additional cost, fee or expense to the user, provide a copy,
a means of exporting a copy, or a means of obtaining a copy
upon request, of the work in its original “Plain Vanilla ASCII” or
other form. Any alternate format must include the full Project
Gutenberg™ License as specified in paragraph 1.E.1.
• You pay a royalty fee of 20% of the gross profits you derive
from the use of Project Gutenberg™ works calculated using the
method you already use to calculate your applicable taxes. The
fee is owed to the owner of the Project Gutenberg™ trademark,
but he has agreed to donate royalties under this paragraph to
the Project Gutenberg Literary Archive Foundation. Royalty
payments must be paid within 60 days following each date on
which you prepare (or are legally required to prepare) your
periodic tax returns. Royalty payments should be clearly marked
as such and sent to the Project Gutenberg Literary Archive
Foundation at the address specified in Section 4, “Information
about donations to the Project Gutenberg Literary Archive
Foundation.”
• You comply with all other terms of this agreement for free
distribution of Project Gutenberg™ works.
1.F.
Most people start at our website which has the main PG search
facility: www.gutenberg.org.