The DELOS Digital Library Reference Model
The DELOS Digital Library Reference Model
507618
DELOS
Version 0.96
November 2007
Project co-funded by the European Commission within the Sixth Framework Programme
(2002-2006)
This volume is one of a series of texts written in the context of the DELOS Network of
Excellence on Digital Libraries (www.delos.info).
No part of this volume may be reproduced in any form or by any electronic or mechanical
means (including photocopying, recording, or information storage and retrieval) without
permission from the publisher.
1
Institute of Information Science and Technologies (ISTI)
Italian National Research Council (CNR)
Pisa, Italy
2
Department of Information Engineering (DEI)
University of Padova
Padova, Italy
3
Department of Informatics and Telecommunications
University of Athens
Athens, Greece
4
Humanities Advanced Technology and Information Institute (HATII)
University of Glasgow
Glasgow, United Kingdom
5
College of Information Studies
University of Maryland,
College Park, Maryland
6
Database and Information Systems Group
University of Basel,
Basel, Switzerland
The DELOS Digital Library Reference Model - 0.96
Table of Contents
Table of Contents................................................................................................................... 4
Table of Figures ..................................................................................................................... 6
About this Volume................................................................................................................. 8
PART III The DELOS Digital Library Reference Model Concepts and Relations......... 64
III.1 Introduction............................................................................................................... 65
III.2 Concepts’ Hierarchy.................................................................................................. 66
III.3 Reference Model Concepts’ Definitions .................................................................... 71
III.4 Relations’ Hierarchy................................................................................................ 150
III.5 Reference Model Relations’ Definitions .................................................................. 152
Conclusions ....................................................................................................................... 163
Appendix A. Concept Maps in A4 format........................................................................ 164
A.1. DL Resource Domain Concept Map..................................................................... 164
A.2. Content Domain Concept Map............................................................................. 165
A.3. User Domain Concept Map.................................................................................. 166
A.4. Functionality Domain Concept Map..................................................................... 167
A.5. Functionality Domain Concept Map: Access Resource Functions ........................ 168
A.6. Functionality Domain Concept Map: Specializations of the Manage Resource
Function......................................................................................................................... 169
A.7. Functionality Domain Concept Map: General Manage Resource Functions.......... 170
A.8. Functionality Domain Concept Map: Manage Information Object Functions ....... 171
A.9. Functionality Domain Concept Map: Manage Actor Functions ............................ 172
A.10. Functionality Domain Concept Map: Collaborate Functions............................... 173
A.11. Functionality Domain Concept Map: Manage DL Functions .............................. 174
A.12. Functionality Domain Concept Map: Manage & Configure DLS Functions ....... 175
A.13. Policy Domain Concept Map ............................................................................. 176
A.14. Policy Domain Concept Map: Policies’ Hierarchy ............................................. 177
A.15. Quality Domain Concept Map............................................................................ 178
A.16. Architecture Domain Concept Map.................................................................... 179
Index of Concepts and Relations ........................................................................................ 180
Bibliography ...................................................................................................................... 182
Table of Figures
Figure I.2-1. DL, DLS, and DLMS: A Three-tier Framework............................................... 16
Figure I.3-1. The Digital Library Universe: Main Concepts.................................................. 18
Figure I.4-1. The Main Roles of Actors versus the Three-tier Framework............................. 21
Figure I.4-2. Hierarchy of Users' Views ............................................................................... 22
Figure I.5-1. The Digital Library Development Framework.................................................. 24
Figure II.1-1. The Digital Library Universe.......................................................................... 27
Figure II.1-2. The Reference Model as the core of the Development Framework.................. 29
Figure II.1-3. A concept map showing the key feature of concept maps................................ 31
Figure II.2-1. DL Domains Hierarchy Concept Map............................................................. 32
Figure II.2-2. DL Resource Domain Concept Map ............................................................... 34
Figure II.2-3. Content Domain Concept Map........................................................................ 35
Figure II.2-4. User Domain Concept Map ............................................................................ 38
Figure II.2-5. Functionality Domain Concept Map ............................................................... 40
Figure II.2-6. Functionality Domain Concept Map: Access Resource Functions................... 40
Figure II.2-7. Functionality Domain Concept Map: Specializations of the Manage Resource
Functions ...................................................................................................................... 41
Figure II.2-8. Functionality Domain Concept Map: General Manage Resource Functions
Applied to all Resources ............................................................................................... 41
Figure II.2-9. Functionality Domain Concept Map: Manage Information Object Functions .. 42
Figure II.2-10. Functionality Domain Concept Map: Manage Actor Functions ..................... 42
Figure II.2-11. Functionality Domain Concept Map: Collaborate Functions ......................... 43
Figure II.2-12. Functionality Domain Concept Map: Manage DL Functions......................... 43
Figure II.2-13. Functionality Domain Concept Map: Manage DLS Functions....................... 44
Figure II.2-14. Policy Domain Concept Map ........................................................................ 45
Figure II.2-15. Policy Domain Concept Map: Policies’ Hierarchy ........................................ 46
Figure II.2-16. Quality Domain Concept Map ...................................................................... 47
Figure II.2-17. Architecture Domain Concept Map............................................................... 50
Figure II.5-1. 5S - Map of formal definitions........................................................................ 59
Figure II.5-2. 5S - DL ontology............................................................................................ 60
Figure A-1. Resource Domain Concept Map (A4 format)................................................... 164
Figure A-2. Content Domain Concept Map (A4 format)..................................................... 165
Figure A-3. User Domain Concept Map (A4 format).......................................................... 166
Figure A-4. Functionality Domain Concept Map (A4 format) ............................................ 167
Figure A-5. Functionality Domain Concept Map: Access Resource Functions (A4 format) 168
Figure A-6. Functionality Domain Concept Map: Specializations of the Manage Resource
Function (A4 format) .................................................................................................. 169
Figure A-7. Functionality Domain Concept Map: General Manage Resource Functions (A4
format)........................................................................................................................ 170
Figure A-8. Functionality Domain Concept Map: Manage Information Object Functions (A4
format)........................................................................................................................ 171
Figure A-9. Functionality Domain Concept Map: Manage Actor Functions (A4 format) .... 172
Figure A-10. Functionality Domain Concept Map: Collaborate Functions (A4 format) ...... 173
Figure A-11. Functionality Domain Concept Map: Manage DL Functions (A4 format)...... 174
Figure A-12. Functionality Domain Concept Map: Manage & Configure DLS Functions (A4
format)........................................................................................................................ 175
Figure A-13. Policy Domain Concept Map (A4 format) ..................................................... 176
Figure A-14. Policy Domain Concept Map: Policies' Hierarchy (A4 format)...................... 177
Figure A-15. Quality Domain Concept Map (A4 format).................................................... 178
Figure A-16. Architecture Domain Concept Map (A4 format)............................................ 179
Section I.4 introduces the main roles that actors may play within digital libraries, i.e., end-
user, designer, administrator, and application developer.
Section I.5 describes the reference frameworks that are needed to clarify the DL universe at
different levels of abstraction, i.e., the Digital Library reference model and the Digital Library
reference architecture.
Section I.6 reports concluding remarks on The Digital Library Manifesto.
Section II.1 introduces “PART II The DELOS Digital Library Reference Model in a Nutshell”
by summarising the content of the Manifesto and setting the basis for reading and using the
rest of the part.
Section II.2 presents the constituent domains by briefly describing their rationale and
providing each of them with a concept map that reports the main related concepts and
relations connecting them.
Section II.3 addresses the Interoperability issue. In particular, it describes this problem by
pointing out the instruments that the Reference Model makes available for dealing with it.
Section II.4 introduces the Preservation issue and discusses how the current model can be
used and eventually extended to capture this important aspect of the digital library universe.
Section II.5 discusses related works. In particular, this section highlights the similarities and
the differences between this reference model and similar initiatives like the 5S Framework
and the CIDOC Conceptual Reference Model.
Section II.6 reports concluding remarks on the DELOS Digital Library Reference Model as
presented in PART II.
Section III.1 introduces “PART III The DELOS Digital Library Reference Model Concepts
and Relations” by highlight the role of this part.
Section III.2 presents the hierarchy of the Concepts constituting the Reference Model.
Section III.3 provides a definition for each of the 217 Concepts currently constituting the
model. Each definition is complemented by the list of relations connecting the concept to the
other concepts, the rationale for having introduced this concept in the model, and examples of
concrete instances of the concept in real scenarios.
Section III.4 presents the hierarchy of the identified Relations.
Section III.5 provides a definition for each of the 52 Relations currently constituting the
model. Each definition is complemented by the rationale for having it in the model and some
examples of concrete instances of them in real scenarios.
Section Conclusions sums up the volume.
Acknowledgements
Many people have contributed to this activity at different level.
First and foremost, the authors wish to thank the DELOS Network of Excellence on Digital
Library Consortium that gave them the chance to work on the fascinating topic of building
foundations for Digital Libraries. In particular, we want to thank Costantino Thanos (CNR-
ISTI), co-ordinator of the DELOS of Network of Excellence and to his Deputy, Vittore
Casarosa (CNR-ISTI), for their continuous encouragement and for the many useful
discussions on the proposed conceptualisation.
The authors also wish to acknowledge the considerable input to the model received from the
participants to the DELOS Reference Model Workshop held in Frascati (Rome) on June 2006.
The comments, visions and insights received on the initial release of the model have been
very helpful for the rest of the activity. The workshop participants besides the authors were:
José Borbinha (DEI-IST-UTL), Martin Braschler (Zurich University of Applied Sciences
Winterthur), Vittore Casarosa (ISTI-CNR), Tiziana Catarci (Università degli Studi di Roma
“La Sapienza”), Stavros Christodoulakis (Technical University of Crete), Edward Fox
(Virginia Tech), Norberth Fuhr (Universität Duisburg-Essen), Stefan Gradmann (Universitat
Hamburg), Ariane Labat (EC), Mahendra Mahey (UKOLN), Patricia Manson (EC), Andy
Powell (UKOLN), Hans-Jörg Schek (ETH), MacKenzie Smith (MIT Libraries), Costantino
Thanos (ISTI-CNR), and Theo van Veen (National Library of the Netherlands).
This volume has also benefited from the comments and ideas discussed during two workshops
titled “Foundations of Digital Libraries”. The two workshops were held, respectively, in
conjunction with the ACM IEEE Joint Conference on Digital Libraries (JCDL 2007) and the
11th European Conference on Research and Advanced Technologies on Digital Libraries
(ECDL 2007). As it is not possible to list all the participants individually, we thanks all of
them collectively and we only mention the especially helpful comments received from
Edward Fox (Virginia Tech), Geneva Henry (Rice University), and Marianne Backes (Centre
Virtuel de la Connaissance sur l’Europe).
Thanks to Perla Innocenti (HATII) for reviewing the sections on Policy and drawing our
attention to the fact that we had not represented the time dimensions.
Thanks also to Professor Stavros Christodoulakis (Technical University of Crete) for actively
partaking to some of the Reference Model internal meetings and raising useful comments
about the management of multimedia content, and to Prof. Hans Scheck (Univiversity of
Konstanz), leader of the DELOS work package that gave rise to the Reference Model activity,
for the key contribution he has given to the start-up of this activity and for his useful
comments.
We are particularly grateful to Maria Bruna Baldacci (ISTI-CNR) for the notably help in
improving the readability of this volume and to Francesca Borri (ISTI-CNR) for her
contribution to the graphical editing of this and the other volumes forming the Reference
Model document suite.
I.1 Introduction
The term ‘Digital Library’ is currently used to refer to systems that are very heterogeneous in
scope and yield very different functionality. These systems span from digital object and
metadata repositories, reference-linking systems, archives, and content administration systems
(mainly developed by industry), to complex systems that integrate advanced digital library
services (mainly developed in research environments). This “overload” of the term ‘Digital
Library’ is a consequence of the fact that there is not yet an agreement on what Digital
Libraries are and what functionality is associated with them. This results in lack of
interoperability and reuse of both contents and technologies. This document attempts to put
some order in the field for the benefit of its future advancement.
1
http://www.thethirdmanifesto.com/
2
http://www.cs.cmu.edu/People/clamen/OODBMS/Manifesto/index.html
3
http://agilemanifesto.org/
4
http://www.gnu.org/gnu/manifesto.html
Relevant research and industrial efforts in the Digital Library (DL) field have now reached an
advanced, although heterogeneous stage of development, thus the time is right for this field to
obtain its own Manifesto.
I.1.2 Motivation
Digital Libraries constitute a relatively young scientific field, whose life spans roughly the
last fifteen years. Instrumental in the birth and growth of the field have been the funding
opportunities generated by the ‘Technology Enhanced Learning; Cultural Heritage’ (formerly
‘Cultural Heritage Applications’) Unit of the Information Society Directorate-General of the
European Commission and the ‘Digital Library Initiatives’ in the United States sponsored by
the National Science Foundation and other agencies.
Digital Libraries represent the meeting point of many disciplines and fields, including data
management, information retrieval, library sciences, document management, information
systems, the web, image processing, artificial intelligence, human-computer interaction, and
digital curation. It was only natural that these first fifteen years were mostly spent on bridging
some of the gaps between the disciplines (and the scientists serving each one), improvising on
what ‘Digital-Library functionality’ is supposed to be, and integrating solutions from each
separate field into systems to support such functionality, sometimes the solutions being
induced by novel requirements of Digital Libraries. These have been achieved through much
exploratory work, primarily in the context of focused efforts devising specialized approaches
to address particular aspects of Digital-Library functionality. For example, the ARTISTE
project [11] from Europe’s Fifth Framework Programme focused on how to develop an
integrated analysis and navigation environment for art images and analogous multimedia
content, the COLLATE project [56] from the same Programme focused on how to deal with
old film libraries, while the Alexandria Project [7] from NSF’s DLI-1 and DLI-2 Programs
focused on geospatially-referenced multimedia material. For the most part, every effort so far
has been distinct and, in some sense, isolated from the rest. Every project has started from
scratch to build a system supporting the particular needs specified in the project’s description.
Nevertheless, looking back at the individual achievements of all the projects, one may see
clearly that there is substantial commonality among many of them; the bottom-up
development of the field so far has provided enough ‘data points’ for patterns to emerge that
can encapsulate all efforts.
Despite the young age of the field of Digital Libraries, it has made a long journey from its
initial conception to the present state of the art and has reached a level of maturity that did not
exist fifteen years ago. There is substantial knowledge and experience that have been
accumulated. This warrants a process of self-declaration that will identify the principle ideas
behind the field; it is time for a Digital Library Manifesto to set the ground rules for the field
and lead to the development of reference documents that will capture the full spectrum of
concepts that play a role in Digital Libraries.
culture and by using multiple Internet-connected devices [110]. An offspring of that activity
concludes that Digital Libraries can become the universal knowledge repositories and
communication conduits of the future, a common vehicle by which everyone will access,
discuss, evaluate, and enhance information of all forms [111][112]. Likewise, in his
framework for Digital Library research, Soergel [181] starts from three very different
perspectives that different people in the community have on Digital Libraries, i.e., as tools to
serve research, scholarship, and education, as a means for accessing information, and as
providing services primarily to individual users. He then enhances further each one and fuses
them all together to obtain the main guiding principles for his vision of the field. On the other
hand, Belkin [20] states that a Digital Library is an institution in charge of providing at least
the functionality of a traditional library in the context of distributed and networked collections
of information objects. Lesk [140] analyses and discusses the importance of the terms
“Digital” and “Library” in the expression “Digital Library”, where the former term mainly
implies existence of software for searching text, while the latter term refers to existing
material that has been scanned for online access, and concludes that the research effort in the
field are not usually associated with the users’ needs. Borgman [32] notices that at least two
competing visions of the expression “Digital Library” exist: researchers view Digital Libraries
as content collected on behalf of user communities, while practising librarians view Digital
Libraries as institutions or services. Kuny and Cleveland [131] discuss four myths about
Digital Libraries and attempt to bring them down: (i) the Internet is ‘The’ Digital Library; (ii)
at some point there will be a single Digital Library or a single-window view of Digital Library
collections; (iii) Digital Libraries are means to provide more equitable access to content from
anywhere at any time; and (iv) Digital Libraries are cheaper instruments than physical
libraries. They conclude that Digital Libraries impose reinvention of the role of librarians and
library models.
In addition to such a variety of perspectives that may currently exist on what a Digital Library
is, the concept has evolved quite substantially since the early idea of a system providing
access to digitized books and other text documents. The DELOS Network of Excellence on
Digital Libraries [60] now envisions a Digital Library as a tool at the centre of intellectual
activity having no logical, conceptual, physical, temporal, or personal borders or barriers on
information. It has moved from a content-centric system that simply organizes and provides
access to particular collections of data and information, to a person-centric system that aims to
provide interesting, novel, personalized experiences to users. Its main role has moved from
static storage and retrieval of information to facilitation of communication, collaboration, and
other forms of interaction among scientists, researchers, or the general public on themes that
are pertinent to the information stored in the Digital Library. Finally, it has moved from
handling mostly centrally-located text to synthesizing distributed multimedia document
collections, sensor data, mobile information, and pervasive computing services.
This vision of Digital Libraries seems to resonate well with the concept of “Information
Space” that has arisen from the field of Computer Supported Cooperative Work (CSCW).
Snowdon, Churchill, and Frecon [184] have developed future visions about “Connected
Communities” and “Inhabited Information Spaces”, with the latter being closely related with
the vision of Digital Libraries, in that ubiquitous information is a prerequisite for CSCW. In
more detail, Inhabited Information Spaces are “spaces and places where people and digital
data can meet in fruitful exchange, i.e., they are effective social workspaces where digital
information can be created, explored, manipulated and exchanged”. Thus, “in Inhabited
Information Spaces, both information and people who are using that information (viewing it,
manipulating it) are represented. This supports collaborative action on objects, provides
awareness of others’ ongoing activities, and offers a view of information in the context of its
use”. Based on the above and according to the aforementioned DELOS vision of a Digital
Library, the latter provides an Information Space that is populated by a user community and
becomes an Inhabited Information Space through CSCW technology. The two fields
complement each other nicely, in that one focuses on access and provision of relevant
information while the other focuses on visualisation and sharing of information.
It becomes obvious that, as envisioned, “Digital Library” is a complex notion with several
diverse aspects and cannot be captured by a simple definition. A comprehensive
representation encapsulating all potential perspectives is required. This has led to the drafting
of ‘The Digital Library Manifesto’, whose aim is to set the foundations and identify the
cornerstone concepts within the universe of Digital Libraries, facilitating the integration of
research and proposing better ways of developing appropriate systems. Having this broad
scope, the Manifesto is followed by a set of separate reference documents, which stand
individually but can also be seen as parts of a whole.
The Manifesto exploits the collective understanding of Digital Libraries developed by
European research groups, including those that are partners in DELOS, and the results of
DELOS working meetings (e.g., San Cassiano in 2001, Corvara in 2004, and Frascati in
2006).
The rest of Part I of this volume presents the core parts of this Manifesto and introduces
central aspects of the Digital Library framework. It first presents an examination of the three
types of relevant “systems” in this area: Digital Library, Digital Library System, and Digital
Library Management System (Section I.2). Then, it describes the main concepts
characterizing the above systems, i.e., content, user, functionality, quality, policy, and
architecture (Section I.3) and it introduces the main roles that actors may play within digital
libraries, i.e., end-user, designer, administrator, and application developer (Section I.4). In
Section I.5 it describes the reference frameworks that are needed to clarify the DL universe at
different levels of abstraction, i.e., the Digital Library reference model and the Digital Library
reference architecture. Finally, Section I.6 concludes the Manifesto part.
produce Digital Library Systems. Depending on the philosophy it follows, a DLMS may
belong to one of the following three types:
• Extensible Digital Library System
A complete Digital Library System that is fully operational with respect to a defined core
suite of functionality. DLs are constructed by instantiating the DLMS and thus obtaining
the DLS. Thanks to the open software architecture, new software components providing
additional capabilities can be easily integrated. The DelosDLMS [175] is a prototypical
example of a system based on this philosophy.
• Digital Library System Warehouse
A collection of software components that encapsulate the core suite of DL functionality
and a set of tools that can be used to combine these components in a variety of ways (in
Lego®-like fashion) to create Digital Library Systems offering a tailored integration of
functionalities. New software components can easily be incorporated into the Warehouse
for subsequent combination with those already there. BRICKS [37] and DILIGENT [62]
are two prototypical examples of systems that are based on this philosophy.
• Digital Library System Generator
A highly parameterised software system that encapsulates templates covering a broad
range of functionalities, including a defined core suite of DL functionality as well as any
advanced functionality that has been deemed appropriate to meet the needs of the specific
application domain. Through an initialization session, the appropriate parameters are set
and configured; at the end of that session, an application is automatically generated, and
this constitutes the Digital Library System ready for installation and deployment. The
MARIAN framework equipped with the 5SL specification language represents an
example of this process [89].
Although the concept of Digital Library is intended to capture an abstract system that consists
of both physical and virtual components, the Digital Library System and the Digital Library
Management System capture concrete software systems. For every Digital Library, there is a
unique Digital Library System in operation (possibly consisting of many interconnected
smaller Digital Library Systems), whereas all Digital Library Systems are based on a handful
of Digital Library Management Systems5. For instance, through DILIGENT it is possible to
build and run a number of DLSs, each realising a DL serving a target community. The DL is
thus the abstract entity that “lives” thanks to the software system constituting the DLS.
5
To the extent that it is helpful, one may draw an approximate analogy between the world of Digital Libraries and the world
of Databases. A DBMS (e.g., the DB2, Oracle system, MySQL or PostgreSQL) corresponds to a DLMS, offering general
data management services. A DBMS together with all application software running on top of it at an installation corresponds
to a DLS. Finally, a DL corresponds to a so-called “Information System” that consists of the above software, its data, and its
users.
I.3.1 Content
The Content concept encompasses the data and information that the Digital Library handles
and makes available to its users. It is composed of a set of information objects organized in
collections. Content is an umbrella concept used to aggregate all forms of information objects
that a Digital Library collects, manages, and delivers. It encompasses the diverse range of
information objects, including such resources as objects, annotations, and metadata. For
example, metadata have a central role in the handling and use of information objects, as they
provide information critical to its syntactical, semantic, and contextual interpretation.
I.3.2 User
The User concept covers the various actors (whether human or machine) entitled to interact
with Digital Libraries. Digital Libraries connect actors with information and support them in
6
From here on, we shall use the terms “Digital Library” (or its acronym “DL”), Digital Library System (DLS)
and Digital Library Management System (DLMS) to denote the systems identified in Sec.2, while by the term
“digital libraries” we shall refer to the whole field of digital library research and applications.
their ability to consume and make creative use of it to generate new information. User is an
umbrella concept including all notions related to the representation and management of actor
entities within a Digital Library. It encompasses such elements as the rights that actors have
within the system and the profiles of the actors with characteristics that personalize the
system’s behaviour or represent these actors in collaborations.
I.3.3 Functionality
The Functionality concept encapsulates the services that a Digital Library offers to its
different users, whether classes of users or individual users. While the general expectation is
that DLs will be rich in capabilities and services, the bare minimum of functions would
include such aspects as new information object registration, search, and browse. Beyond that,
the system seeks to manage the functions of the Digital Library to ensure that the functions
reflect the particular needs of the digital library’s community of users and/or the specific
requirements relating to the Content it contains.
I.3.4 Quality
The Quality concept represents the parameters that can be used to characterize and evaluate
the content and behaviour of a Digital Library. Quality can be associated not only with each
class of content or functionality but also with specific information objects or services. Some
of these parameters are objective in nature and can be automatically measured, whereas others
are subjective in nature and can only be measured through user evaluations (e.g., focus
groups).
I.3.5 Policy
The Policy concept represents the set or sets of conditions, rules, terms and regulations
governing interaction between the Digital Library and users, whether virtual or real. Examples
of policies include acceptable user behaviour, digital rights management, privacy and
confidentiality, charges to users, and collection delivery. Policies belong to different classes;
for instance, not all policies are defined within the DL or the organization managing it. The
policy supports the distinction between extrinsic and intrinsic policies. The definition of new
policies and re-definition of older policies will be a feature of digital libraries.
I.3.6 Architecture
The Architecture concept refers to the Digital Library System entity and represents a mapping
of the functionality and content offered by a Digital Library onto hardware and software
components7. There are two primary reasons for having Architecture as a core concept: (i)
Digital Libraries are often assumed to be among the most complex and advanced forms of
information systems [78]; and (ii) interoperability across Digital Libraries is recognized as a
substantial research challenge. A clear architectural framework for the Digital Library System
offers ammunition in addressing both these issues effectively.
The concepts populating all the six area just introduced share many similar characteristics and
are all concepts referring to internal entities of a Digital Library that can be sensed by the
external world. Introducing a higher-level concept referring to any of them, namely,
Resource, enables us to reason about these characteristics in a consistent manner.
7
This is an appropriate adaptation of the ‘Architecture’ definition from the Glossary of CMU’s Software Engineering
Institute. http://www.sei.cmu.edu/opensystems/glossary.html
The six core concepts (Content, User, Functionality, Quality, Policy and Architecture) that lie
at the heart of Digital Library universe need to be considered in conjunction with the four
main ways that actors interact with digital library systems, as discussed in the next section.
Figure I.4-1. The Main Roles of Actors versus the Three-tier Framework
As shown in Figure I.4-1, each role is primarily associated with one of the three “systems” in
the three-tier framework.
I.4.1 DL End-Users
DL End-Users exploit the DL functionality for the purpose of providing, consuming, and
managing the DL Content and some of its other constituents. They perceive the DL as a
stateful entity serving their functional needs. The behaviour and output of the DL depend on
the DL’s state at the time a particular part of its functionality is activated. The state of the DL
corresponds to the state of its resources, which, as we have seen above, consist of the
collections of information objects managed by the DL, the set of authorized users, the DL’s
functionality, and its set of policies. This state changes during the lifetime of the Digital
Library according to the functionality activated by users and their inputs. DL End-Users may
be further partitioned into Content Creators, Content Consumers, and Librarians.
I.4.2 DL Designers
DL Designers exploit their knowledge of the application semantic domain in order to define,
customize, and maintain the Digital Library so that it is aligned with the information and
functional needs of its potential DL End-Users. To perform this task, they interact with the
DLMS providing functional and content configuration parameters. Functional parameters
instantiate aspects of the DL functionality that are to be perceived by the DL End-Users,
including the characteristics of the result set format, query language(s), user profile formats,
and document/data model employed. Content configuration parameters specify third-party
resources exploited by the specific DL, e.g., repositories of content, ontologies, classification
schemas, authority files, and gazetteers that will be used to form the DL Content. The values
of these parameters configure the way the DL will be presented to the DL End-Users, because
they determine the particular Digital Library System instance serving the Digital Library. Of
course, these parameters need not necessarily be fixed for the entire lifetime of the DL; they
may be reconfigured to enable the DL to respond to the evolving expectations of users and
changes in all aspects from policies to content.
Designers have for the Digital Library, as well as the requirements that the available resources
impose on the definition of the DL. DL System Administrators interact with the DLMS by
providing architectural configuration parameters, such the chosen software components and
the selected hosting nodes. Their task is to identify the architectural configuration that best fits
the DLS in order to ensure the highest level of quality of service. The value of the
architectural configuration parameters can be changed over the DL lifetime. Changes of
parameter configuration may result in the provision of different DL functionality and/or
different levels of quality of service.
These four roles described above encompass the whole spectrum of actors interacting with
digital libraries. Their models of the DL Universe are linked together in a hierarchical fashion,
as depicted in Figure I.4-2. This hierarchy is a direct consequence of the above definitions,
since DL End-Users act on the Digital Library, whereas DL Designers, DL System
Administrators and DL Application Developers operate on the DLS (through the mediation of
a DLMS) and, consequently, on the DL as well. This inclusion relationship ensures that co-
operating actors share a common vocabulary and knowledge. For instance, the DL End-User
expresses requirements in terms of the DL model and, subsequently, the DL Designer
understands these requirements and defines the DL accordingly.
The relationship of these three frameworks with the general digital library environment is
shown in Figure I.5-1. At the top there is the most abstract Reference Model, which guides the
more specific Reference Architecture and Concrete Architecture further down. In turn, these
should constrain the development and implementation of any actual system. The three
reference frameworks are the outcome of an abstraction process that has taken into account
the goals, requirements, motivations and, in general, the digital library market, as shown in
the left-hand side of Figure I.5-1, and the best practices and relevant research shown on the
right-hand side of the same figure. When these frameworks are adopted and followed by the
community, the resulting systems will be largely compatible with each other; the
interoperability thus afforded will open up significant new horizons for the field.
The rest of this volume focuses on the Reference Model part of this framework.
8
This picture was inspired by the “Reference Model for Service Oriented Architecture” document [147].
II.1 Introduction
Despite the large number of “systems” that are named “digital libraries”
[32][111][112][131][77][78] (where “system” has to be intended as set of interconnected
things forming a whole) there are not yet real underlying foundations for them. This limits the
digital library field to grow, as happens for any building to which no appropriate foundation
has been provided. Because of this lack it is really difficult or almost impossible to
systematize activities for evaluating and comparing digital library systems, teaching and even
performing further and focused research. The same holds for system design and development,
for promoting sustainable approaches and solutions that aim at maximising the re-use of
existing knowledge and assets, and at properly addressing community needs.
In January 2005, the DELOS Networks of Excellence on Digital Libraries [60] decided to
initiate the definition of a reference model for digital libraries as a necessary step towards a
more systematic approach to the research on Digital Libraries. In this context, a reference
model is meant as an abstract framework for understanding significant relationships among
the entities of some universe, and for the development of consistent standards and/or
specifications supporting that universe [147]. The route to reach this objective, summarised
below, has been traced in the Manifesto (Part I of this volume).
9
It is still under discussion whether two other players should be added to this list, namely
Institutions and Industries. The Institutions are meant as organisations, either concrete or
virtual ones, having the important role of forming the digital library. The Industries are meant
as the institutions performing economic activities concerned with the digital library, by
providing either the software or the service.
Figure II.1-2. The Reference Model as the core of the Development Framework
The rest of Part II of this volume provides an overview of the DELOS Reference Model by
illustrating the constituent concepts and relationships. It is structured as follows. The current
section is completed by information setting the stage for the rest, e.g., background material
necessary to understand the rest, graphical and notational conventions. Section II.2 introduces
the constituent domains of the model, highlighting the main concepts and relationships
characterising the domain model rationale. Section II.4 discusses the Preservation issue by
presenting the concepts and relations concerning it. Section II.3 presents the Interoperability
issue and discusses the main concepts and relations related to it. Section II.5 briefly
investigates related work on models for digital libraries and domains. Finally, Section II.6
concludes.
II.1.1 Motivations
The scope of this section is to shortly state what is meant by a reference model and why it is
useful in the area of digital libraries.
In the Information Society Technologies (IST) area, a reference model can be considered an
abstract framework that can be used to understand significant relationships among the entities
of a specific environment, and for the development of consistent standards or specifications
supporting that environment. A reference model is based on a small number of unifying
concepts and may be used as a basis for education. A reference model is not addressing
implementation details, but it provides a common semantics that can be used unambiguously
across and between different implementations.
A Reference Model is useful in a specific IST area, because it is abstract in nature and it can
be used by system designers as a template for inventing and designing new software systems,
and by final users to interact and use a system in an easier and more effective way.
One significant example of reference model can be considered the Open System
Interconnection (OSI) reference model which describes how information from a software
application in one computer moves through a network medium to a software application in
another computer. The model was developed by the International Organization for
Standardization (ISO) in 1984, and it is now considered the primary architectural model for
internetworking.
In a parallel way, a reference model in the area of Digital Libraries (DL) can be useful for:
• End users of a specific DL since the reference model can help them in more easily
identifying appropriate functions and search capabilities;
• Metasearch systems that need to access one or more DL and need to know in what way to
interoperate with each DL and what search possibilities each DL offers;
• Administrators who want to match DL functions and reach specific level of performance
to satisfy user requirements;
• Investors who want to evaluate a DL;
• Designers and application developers who want to consider design options;
• Students and educators who want to learn and teach about digital libraries and understand
the full breadth of their functions.
Figure II.1-3. A concept map showing the key feature of concept maps10
10
Figure taken from [159].
• Time Domain (i.e., concepts and relations needed to capture aspects of the time sphere
like time periods and intervals);
• Space Domain (i.e., concepts and relations needed to capture aspects of the physical
sphere like regions and locations).
The rest of this Part II of this volume illustrates the different domains listed above by
providing an overview of their concepts and relationships. In approaching models like the one
we are presenting here, it is important to keep in mind that these models have not to be
intended as “complete” or exhaustive, i.e., capable of representing all the possible facets of
the systems in the DL Universe, rather as cores of a model of such a Universe that can be
extended by specific communities to include the elements that are required to capture their
specific needs.
management or application purposes. Examples of a Resource Set in the various domains are
Collection in the Content Domain or Group in the User Domain.
11
A Concept linked to a Relation through a dotted line represents an attribute of the Relation
itself.
Information Objects may acquire a further specialization depending on the level of abstraction
at which they are specified. This lead to an abstract Information object by level of abstraction
concept, which is a container or placeholder to be specialised using any of several models. For
example, the IFLA FRBR model [107] distinguishes
• Work, for example, the general idea of a story;
• Expression, for example, the telling of a story in a text;
• Manifestation, for example, the graphic image showing the letters and words that make up
the text that is in common to all copies printed from the same typeset image;
• Item, for example, an individual printed copy of a manifestation.
Other divisions are possible. In particular, the FRBR distinction between Work and
Expression is hard to apply in the digital world and therefore problematic.
Information objects can also be specialized by the predominant role they play in their
relationship to other objects; the class Information object by relationship is the abstract
conceptual container for the classes which these objects give raise to, namely:
• Primary Information Object, an Information Object that stands on its own, such as a book
or a data set.
• Metadata object, an Information Object whose predominant purpose is to give
information about a “target” Resource (usually, but not always, a Primary Information
Object);
• Annotation object, an Information Object whose predominant purpose is to annotate a
“target” Resource (or a Region of it). Examples of such Annotation Objects include notes,
structured comments, and links. Annotation Objects assist in the interpretation of the
target Resource, or give support or objections or more detailed explanations.
This modelling style reflects a basic intuition that distinguishes this model from most DL
models or de facto standards, namely that an information object is not born as (say) an
Metadata or an Annotation, but becomes such by virtue of playing a certain role to other
information objects. The intuition is grounded on the simple observation that, for instance, a
Dublin Core metadata record is to be primarily modelled as a relational structure (record,
tuple, graph fragment) which may also be associated to the resource it describes; it is this
association that gives the structure the role of metadata. A similar case is for a piece of text; it
is primarily a piece of text, and becomes an annotation only when it is linked to a certain
Resource in a certain way. In other words, the long standing issue whether annotations are
content or metadata is just an ill-posed question.
From an organisational point of view, Information Objects can be grouped in Collections
(<belongTo>), i.e. groups of objects considered as a single entity for certain management or
application purposes. As Collections are Information Objects they inherit all Information
Objects’ modelling aspects and facilities, e.g. they can be annotated. Moreover, Collections
are a specialisation of the Resourse Set concept. Actually, Collections are characterised by an
intension (<hasIntension>) and an extension (<hasExtension>). The former is the criterion
underlying the grouping. The way this criterion is expressed can range from the explicit
enumeration of all the objects intended to be part of the group to logical expressions capturing
the characteristics of the Resources intended to be part of the group. The latter is the concrete
set of resources (Resource Set) matching the intension. These characteristics are implemented
differently in diverse systems leading to scenarios ranging from static to highly dynamic ones,
e.g. [46].
Another specialisation of the Resource Set concept usually associated to the Content Domain
is the Result Set. In traditional digital libraries this is the set of documents that are retrieved
by issuing a Query. In this context it represents the set of Resources, no constraints on their
type, resulting from a Query.
which refers to a social group of humans with shared interests. In human Communities, intent,
belief, resources, preferences, needs, risks and a number of other conditions may be present
and common, affecting the identity of the participants and their degree of cohesiveness.
Because of the broad scope of the Function concept, it is infeasible to enumerate and predict
all different types and flavours of Functions that may be included in any Digital Library. Each
Digital Library may have its own set of Functions depending on its objectives, its intended
Actors. Therefore, Function is specialized into five other concepts that represent still quite
general classes of activities, as outlined below (Figure II.2-5).
Manage Resource captures all activities that are related to creating new Resources, inserting
them into the DL, deleting old Resources from it, and updating existing ones, as well as
applying conversions and transformations on them. This transformation may lead to new
Resources that may be submitted to the DL or be merely applied when accessing the
Resource. These maybe specialized to individual Functions for each resource type (Figure
II.2-7).
Figure II.2-7. Functionality Domain Concept Map: Specializations of the Manage Resource Functions
Some of the Functions may be applied on the Resources and others are applied on the
metadata describing those Resources. The general Functions that may be applied on all
Resources are related to the creation, submission withdrawal, update, validation and
annotation of Resources. Figure II.2-8 presents these general Functions. These Functions may
be specialized for particular Resource types.
Figure II.2-8. Functionality Domain Concept Map: General Manage Resource Functions Applied to all
Resources
Manage Information Object (Figure II.2-9) contains Function concepts that capture creation,
processing and transformation for primary Information Objects, which are independent of any
other, e.g., Author, as well as other concepts that do the same for Information Objects that
represent other Information Objects or Resources in general (such as references to others,
compositions of others, etc.).
Figure II.2-9. Functionality Domain Concept Map: Manage Information Object Functions
Manage Actor contains Functions necessary for the management of individual Actors in the
DL, including their registration or subscription, their login and the personalization of the
functions they are entitled to (Figure II.2-10).
The third specialization of Function is intimately related to User Domain. It is the Collaborate
function, which captures all activities that allow multiple Actors to work together through a
DL to achieve a common goal.
12
The DLS service is an instance of an Architectural Component (cf. Sec. II.2.7)
appropriately configured through (and made available by) the DLMS
as a new kind of problem. Structural issues include gross organization and global control
structure; protocols for communication, synchronization, and data access; assignment of
functionality to design elements; physical distribution; composition of design elements;
scaling and performance; and selection among design alternatives”. The IEEE Working
Group on Architecture [105], however, recognises that there is more than just structure in
architecture, and defines it as “the highest-level concept of a system in its environment”.
Thus, this Group’s understanding does not consider the architecture of a software system
limited to an inner focus, rather proposes to take into consideration the system as a whole in
its usage and development environments.
For the purposes of this reference model, the architecture of a software system (at a given
point) is defined as the organization or structure of the system's significant components
(Architectural Component) interacting with each other (<use>) through their interfaces
(Interface). These components may be in turn composed of smaller and smaller components
(<composedBy>) and interfaces (Figure II.2-17); however, different Architectural
Components may be incompatible with each other (<conflictWith>), i.e. cannot coexist in the
context of the same system. The software industry and the literature when using the term
“component” refer to many different concepts. Here, we use the term “component” to mean
an encapsulated part of a system, ideally a non-trivial, nearly independent, and replaceable
part of a system that fulfils a clear function in the context of a well-defined architecture. Each
Architectural Component is a Resource, thus it inherits the Resource’s characterising aspects
(cf. Sec. II.2.1), e.g., it is uniquely identified. As any Resource, components have Metadata
(Component Profile) which provide fundamental information for managing them. These
Metadata specify characteristics like the implemented or supported Functions, the
implemented Interfaces, their governing Policies, and the Quality Parameters that specify the
various quality facets describing how and how well the component performs with respect to
some viewpoint.
Architectural Components interact through a Framework Specification; they must also be
conformant to it (<conformTo>). This framework prescribes the set of Interfaces to be
implemented by the components and the protocols governing how components interact with
each other.
Architectural Components are classified in Software Architecture Components and System
Architecture Components. These classes are used to describe the Software Architecture and
the System Architecture of a software system respectively
Software Architecture Components are realised by Software Components. Each Software
Component
• encapsulates the implementation of a portion of a software system (capturing Content,
User, Functionality, Policy or Quality Domains aspects of the DL universe),
• its usage is regulated by (<regulatedBy>) particular Policies (Licenses), and
• it is represented by an Information Object (<representedBy>).
Thus, the Resource representing the Software Component inherits the Information Object’s
characterising aspects (Section II.2.2), e.g. it can be enriched through Metadata and
Annotations.
System Architecture Components are realised by Hosting Nodes and Running Components.
A Hosting Node encapsulates the implementation of the environment needed to host and run
Software Components. A Running Component represents a running instance of a Software
Component (<realisedBy>) active on a Hosting Node.
interoperability is usually based on the use of a common ontology, e.g. Dublin Core. The
latter case, i.e., the gathering approach, is the less demanding among the three. In this case no
source takes care of its potential consumers because the exposition of its content so that it can
easily be used by third parties is not a requirement; i.e., in this case the resources are not
required to be interoperable.
The Interoperability issue has many commonalities with preservation and multilinguality.
Actually, multilinguality can be seen as interoperability over languages while preservation
can be seen as interoperability over time. Syntactic and semantic aspects pervade any form of
interoperability. Both these aspects are equally important and customary used to discriminate
between the aspects to be bridged. In practice, the semantic interoperability is deemed to be
more important and to require more sophisticated approaches than syntactic interoperability.
However, semantic interoperability cannot be achieved without reaching the syntactic
interoperability.
Among the various concepts reported in the Reference Model the following ones are deemed
as particularly important to interoperability:
• Resource <hasMetadata> Information Object makes it possible to capture any Metadata
for supporting interoperability.
• Resource <hasFormat> Resource Format makes it possible to capture the Resource
Format a Resource is compliant with. The notion of format is important for the correct
interpretation of a Resource. For instance, in order for DL A to use an Information
Object from DL B, DL A must be able to either deal with that Information Object’s format
(read it, create displays, etc.) or be able to convert it to a format it can deal with.
Ontology with its specialization Resource Format is very important for Interoperability.
Format specifications need to be preserved so that Information Objects using an old
format or a previous version of an existing format can still be interpreted. Likewise, the
different versions of a subject ontology need to be preserved so that subject metadata
prepared using a previous version of an ontology can be interpreted properly.
• Resource <associatedWith> Resource makes it possible to capture the context a Resource
has originated from. Having this knowledge is important for the correct understanding of
the Resource meaning. Seeing an information object in its original context is important for
the correct understanding of its meaning.
The following functions are especially important for interoperability
• Transform, a specialization of the Process Function of Manage Information Object. It
may include format conversions, information extraction, and automatic translation and
summarization techniques. Its specialization Convert includes conversion into a different
encoding (converting a text from pdf to word, an image to a different format or
compression scheme, etc).
• Import Collection, a specialization of Manage Collection, supports the selection of the
third-party information sources whose objects will populate the DL Content or be used as
Resource Metadata, for example, Actor Profiles.
• Export Collection, a specialization of Manage Collection, supports the export of an entire
digital library or pieces of it to create a mirror site or to create a backup copy. Also
making Information Objects, especially metadata, available to be imported by another
system (harvesting) is a possible result of this function.
• Compare, a specialization of the Analyze Function, may be used to ascertain whether two
instances of an information object are the same.
interpreting the object disappears, and migration to a different format is necessary. The
issue of format applies both to primary Information Objects and to Metadata Objects,
which are Information Objects as well.
Ontology with its specialization Resource Format lies at the heart of preservation systems.
Format specifications need to be preserved so that Information Objects using an old
format or a previous version of an existing format can continue to be interpreted.
Likewise, the different versions of a subject ontology need to be preserved so that subject
metadata prepared using a previous version of an ontology can be interpreted accurately.
• Resource <hasQuality> Quality Parameter makes it possible to capture the quality
parameters deemed relevant to the preservation issue;
• Resource <associatedWith> Resource support the capture of the context an Information
Object (in general, any Resource) originated from. This information facilitates the
interpretation of an object in case the context provides critical semantic value.
Moreover, the Reference Model introduces Functions that are crucial for preservation, as
follows:
• Transform – the family of Functions through which Resources (Information Objects)
represented according to a given Resource Format are transformed into Resources
(Information Objects) expressed according to another Resource Format improving the
capability to transport and interpret them across representation devices and time.
• Visualize – the Function supporting Resource (Information Object) rendering. This should
be equipped with facilities for preserving behaviour and functionality of information
objects across systems and time.
• Withdraw – the function making possible to drop Resources (Information Object) from a
Digital Library system. From a preservation point of view, this function should provide for
mechanism for deciding whether to maintain the withdrawn object in a secondary store or
to completely delete it.
• export – the function allowing exporting of an entire digital library or pieces of it. This
might be done to create a mirror site or a backup copy, or to move a digital library or
elements of it to another technological environment. The Resource resulting from the
execution of this function must have a Resource Format making itself interpretable and
importable by another system.
• compare – the function that allows a person or a computer program to ascertain the
identity or similarity between two instances of an Information Object (more generally, a
Resource). By combining this Function with the Quality Parameters asserting the
Information Object (more generally, a Resource) probability of being correctly interpreted
across time, it will be possible to automate the application of Preservation Policies.
• Configure DL – For preservation, the system should save the configuration state after any
changes are made to it.
• (actions) logging – the function recording the actions performed on the Information
Object (Resource) across time. This logging information (that can be considered a kind of
Metadata) can be used for preservation purposes in different ways, e.g.
o It allows for rollback operations, e.g. returning an Information Object (more generally,
a resource) to a state it has had at a particular time in the past;
o It provides for usage history of information objects (more generally, a resource) which
is important as context for later uses.
13
An Information Object is also a Legal Object, i.e. a material or immaterial item to which
instances of Right can be applied.
Actor, i.e., people that individually or as a group have the potential to perform actions of
which they can be deemed to be responsible, is introduced as specialisation of the Persistent
Item class. This concept presents many commonalities with the one introduced in the Digital
Library Reference Model and presented in Section II.2.2.
Another specialisation of the Persistent Item class is Appellation, i.e., any sort of identifier
that can be used to identify specific instances of all the classes. The two models dedicate a
different effort to model this aspect. While the Digital Library Reference Model introduces
the concept of Resource Identifier without taking care to specialise it the CIDOC-CRM
introduces many specialisations ranging from Object Identifier to Address, Title, and Date.
Finally, the CIDOC-CRM captures also aspect related to the notion of Functionality. In fact,
even if its goal is to provide an ontology for modelling cultural heritage information, some of
its classes aim at capturing the history and evolution of such information and thus can be
considered as a sort of Functions objects/information have been subjected to. In particular, the
role of the Activity class is to comprise “actions intentionally carried out by instances of Actor
that result in changes of state in the cultural, social, or physical systems documented”.
represents the highest-level concept of a digital library, which exists to serve the
information needs of its societies and to describe the context of its use.
These concepts are of general purpose and represents low level constructors. Using these
concepts, Gonçalves et al. introduced the whole DL ontology reported in Figure II.5-114.
14
Figure II.5-1 is extracted from [88].
15
Figure II.5-2 is extracted from [88].
This group started from a general purpose definition of digital library and identified three
non-orthogonal components within this digital library domain: the users, the data/collection,
and the chosen system/technology. These entities are related and constrained by means of a
series of relationships, namely
(1) the definition of the set of users predefines the range and the content of the collection
relevant and appropriate for them,
(2) the nature of the collection predefines the range of technologies that can be used, and
(3) the attractiveness of the collection content with respect to the user needs and the ease of
use of the technologies by these users determine the extent of usage of the DL.
By relying on these core concepts and relationships it is possible to move outwards to the DL
Researcher domain and create a set of researcher requirements for a DL test bed.
Recently [195], this model has been enriched by focusing on the inter-relationships between
the basic concepts, i.e. the User-Content relationship is related to the usefulness aspects, the
Content-System relationship is related to the performance attributes, while the User-System is
related to usability aspects. For each of these three aspects, techniques and principles for
producing quantitative data and implementing their evaluation have been introduced.
The Reference Model addresses similar issues through the Quality domain (cf. Sec. II.2.6).
While the evaluation framework takes care of identifying the characteristics of the DL
systems to be measure and evaluated the Digital Library Reference Model introduces this
notion at the general level of Resource, i.e., each Resource is potentially subject to various
judgement processes capturing different perspectives.
The Core Ontology of Software Components provides concepts needed to capture software
components related aspects like libraries and licenses, component profiles, and component
taxonomies. The notion of SoftwareComponent (having a Profile aggregating knowledge
about it) is the main entity in this ontology and it is formalised as a Class that conforms to a
FrameworkSpecification (a set of Interfaces). Moreover, the notion of SoftwareLibrary and
License completes the scenario by introducing notions for supporting the automatic check of
conflicting libraries and incompatible licenses. The similarities with the set of concepts
captured by the Reference Model Architecture Domain (cf. Sec. II.2.7) are evident. However,
it is important to notice that the way the dependencies between the various components are
captured by the Reference Model makes it able to be more flexible with respect to this point.
The Core Ontology of Web Services reuses all the other ones to establish a well-founded
ontology for Web Services. This is a very specific ontology that captures the component-
oriented approach in terms of standards for protocols (SOAP) and descriptions (WSDL). The
other interesting feature is to explicit introduction of the QualityOfService parameters that in
the case of the Reference Model are captured through the general relationship, i.e.,
<hasQuality>, between a Resource and its Quality Parameters.
III.1 Introduction
As already stated, a Reference Model is a conceptual framework aiming at capturing
significant entities and their relationships in a certain universe with the goal of developing
more concrete models of it. Previous sections have outlined the motivation for the creation of
the DL Reference Model, as well as an upper level description of its constituents. Conceptual
Maps of the Reference Model Domains have been presented and described, providing a brief
overview of each Domain concepts, the relations that bind them as well as the interaction
between concepts of different domains.
This part of the volume delves more deeply into the Reference Model constituent parts.
Concepts and relations are presented in a hierarchical fashion, providing thus an overview of
the specialization relations between them. Concept and relation definitions are provided for
each of the concepts and relations of the concept maps.
Each concept definition contains a brief definition of the concept, its relations to other
concepts, the rationale behind the addition of the concept and an example. Each relation,
accordingly, is described by a definition, a rational and an example.
C1 Resource
. C2 Resource Identifier
. C3 Resource Set
. . C4 Result Set (also <isa> Information Object)
. . C15 Collection
. . C20 Group (also <isa> Actor)
. C5 Resource Format
. C16Query
. C17 Ontology
. [ User Resource ]
. . C19 Actor
. . . C20 Group (also <isa> Resource Set)
. . . . C21 Community
. . C22Role
. . . C23End-User
. . . . C24 Content Consumer
. . . . C25 Content Creator
. . . . C26 Librarian
. . . C27 DL Designer
. . . C28 DL System Administrator
. . . C29 DL Application Developer
. . C12 Actor Profile (also <isa> Metadata)
16
“Classifiers”, i.e. items added to the hierarchy for organisational purposes are marked [in
squared brackets].
. [ Functionality Resource ]
. . C31 Function
. . . C32 Access Resource
. . . . C33 Discover
. . . . . C34 Browse
. . . . . C35 Search
. . . . C36 Acquire
. . . . C37 Visualise
. . . C38 Manage Resource
. . . . C39 Create
. . . . C40 Submit
. . . . C41 Withdraw
. . . . C42 Update
. . . . C43 Validate
. . . . C44 Annotate
. . . . C45 Manage Information Object
. . . . . C46 Disseminate
. . . . . . C47 Publish
. . . . . C48 Author
. . . . . . C49 Compose
. . . . . C50 Process
. . . . . . C51 Analyze
. . . . . . . C52 Linguistic Analysis
. . . . . . . C53 Qualitative Analysis
. . . . . . . . C54 Examine Preservation State
. . . . . . . C55 Statistical Analysis
. . . . . . . C56 Scientific Analysis
. . . . . . . C57 Create Structured Representation
. . . . . . . C58 Compare
. . . . . . C59 Transform
. . . . . . . C60 Physically Convert
. . . . . . . . C61 Translate
. . . . . . . C62 Convert to a Different Format
. . . . . . . C63 Extract
. . . . C64 Manage Actor
. . . . . C65 Establish Actor
. . . . . . C66 Register
. . . . . . . C67 Sign Up
. . . . . . C68 Login
. . . . . C69 Personalise
. . . . . . C70 Apply Profile
. . . . C71 Manage Function
. . . . C72 Manage Policy
. . . . C73 Manage Quality Parameter
. . . C74 Collaborate
. . . . C75 Exchange Information
. . . . C76 Converse
. . . . C77 Find Collaborator
. . . . C78 Author Collaboratively
. . . C79 Manage DL
. [ Policy Resource ]
. . C112 Policy
. . . [ Policy by characteristic ]
. . . . [ Policy by context ]
. . . . . C113 Extrinsic Policy
. . . . . C114 Intrinsic Policy
. . . . [ Policy by expression ]
. . . . . C115 Explicit Policy
. . . . . C116 Implicit Policy
. . . . [ Policy by application ]
. . . . . C117 Prescriptive Policy
. . . . . C118 Descriptive Policy
. . . . [ Policy by compliance ]
. . . . . C119 Enforced Policy
. . . . . C120 Voluntary Policy
. . . [ Policy by scope ]
. . . . C121 System Policy
. . . . . C122 Change Management Policy
. [ Quality Resource ]
. . C146Measure
. . . C147 Objective Measure
. . . C148 Subjective Measure
. . . C149 Qualitative Measure
. . . C150 Quantitative Measure
. . C151 Measurement
. . C152 Quality Parameter
. . . C153 Generic Quality Parameter
. . . . C154 Economic Convenience
. . . . C155 Interoperability Support
. . . . C156 Reputation
. . . . C157 Security Enforcement
. . . . C158 Sustainability
. . . . C159 Documentation Coverage
. . . . C160 Performance
. . . . C161 Scalability
. . . C162 Content Quality Parameter
. . . . C163 Authenticity
. . . . C164 Authoritativeness
. . . . C165 Freshness
. . . . C166 Integrity
. . . . C167 Preservation Performance
. . . . C168 Provenance
. . . . C169 Scope
. . . . C170 Size
. . . . C171 Fidelity
. . . . C172 Perceivability
. . . . C173 Viability
. . . . C174 Metadata Evaluation
. . . C175 Functionality Quality Parameter
. . . . C176 Availability
. . . . C177 Awareness of Service
. . . . C178 Capacity
. . . . C179 Expectations of Service
. . . . C180 Fault Management Performance
. . . . C181 Impact of Service
. . . . C182 Orthogonality
. . . . C183 Dependability
. . . . C184 Robustness
. . . . C185 Usability
. . . . C186 User Satisfaction
. . . C187 User Quality Parameter
. . . . C188 User Activeness
. . . . C189 User Behaviour
. . . C190 Policy Quality Parameter
. . . . C191 Policy Consistency
. . . . C192 Policy Precision
. . . C193Architecture Quality Parameter
. . . . C194 Ease of Administration
. . . . C195 Compliance to Standards
. . . . C196 Ease of Installation
. . . . C197 Load Balancing Performance
. . . . C198 Log Quality
. . . . C199 Maintenance Performance
. . . . C200 Redundancy
. [ Architectural Resource ]
. . C202 Architectural Component
. . . C203 Software Architecture Component
. . . . C204 Software Component
. . . . . C205 Application Framework
. . . . C206 Interface
. . . . C207 Framework Specification
. . . C208 System Architecture Component
. . . . C209 Running Component
. . . . C210 Hosting Node
. . C13 Component Profile (also <isa> Metadata)
. . C133 License (also <isa> Policy)
Selecting a Resource Identifying scheme implies a trade-off. Usually, the wider the scope of
the scheme, the more costly it is to set up and maintain the scheme. Ideally, the scheme
having the widest scope within the acceptable cost range, should be selected.
Examples:
• URI
• IRI
• ARKs
• Digital Object Identifier (DOI)
• Persistent handles
C3 Resource Set
Definition: A set of Resources, which is in turn a Resource, often defined for some
management or application purpose.
Relationships:
• Resource Set <isa> Resource
• Resource <belongTo> Resource Set
Rationale: The grouping of Resources is required in many operations of a Digital Library.
For instance, in the Content Domain, Collections are Resource Sets, so are search results
(Result Set) or a subset of the search results marked by an Actor. In the User Domain, Groups
are Resource Sets.
Examples:
• The set of Collections, Functions, and Actors forming a “virtual research environment”,
i.e., the set of Resources grouped to serve a research need.
C4 Result Set
Definition: A Resource Set whose constituent Resources are the result of a Query execution.
Relationships:
• Result Set <isA> Resource Set
Rationale: A set of Resources returned by the system as a consequence of an Actor that
issues a Query. Result Set are group of Resources highly dynamic and time dependant, i.e.
different Result Sets can be obtained by issuing the same Query in different time periods. This
is a consequence of the changes in the Information Objects and Collections made available in
the system.
Examples:
• The set of Information Objects representing Picasso outcomes retrieved by a Query
C5 Resource Format
Definition: A description of the structure of a Resource. May build explicitly on an Ontology
or imply an Ontology.
Relationships:
• Resource <hasFormat> Resource Format
• Resource Format is <expressionOf> Ontology
Rationale: The schema defines the properties and attributes of a resource and assigns a name
to this kind of structure. The resource schema of information objects (a kind of resource)
gives the structural composition of the object; for instance, the objects stored into a Digital
Library of Ph.D. thesis might share a common format called “thesis”, defined as an
aggregation of multiple parts: the cover page, the preface, a sequence of chapters, images,
audio files, and supporting evidence in the form of data stored in a database. For other types
of resources, such as users or policies, the schema describes the set of properties or attributes
by which the resources are modelled.
We do not make any recommendation on how a schema should be, or which schema best
works as “the” schema for a specific kind of Resource. From a practical point of view, this
leaves space for one of two options: (1) either the developers of a digital library choose some
schemas and make them part of the digital library conceptual model; or (2) they leave open
the possibility of “plugging in” any schema, in which case a suitable meta-model must be
selected for each resource type in order to express the various resource schemas handled by
the system; for instance JCR is a suitable meta-model for information objects.
Examples: --
C6 Content Domain
Definition: One of the six main concepts characterising the digital library universe. It
represents the various aspects related to the modelling of information managed in the digital
library universe to serve the information needs of the Actors.
Relationships:
• Digital Library <definedBy> Content Domain
• Digital Library System <definedBy> Content Domain
• Digital Library Management System <definedBy> Content Domain
• Content Domain <consistOf> Information Object
• Content Domain <organisedIn> Collection
Rationale:
The Content concept represents the information that Digital Libraries handle and make
available to their Actors. It is composed of a set of Information Objects organised in
Collections. Content Domain is an umbrella concept that is used to aggregate all forms of
information that a Digital Library may require to offer its services. Metadata play an
important role in the Content Domain because they describe a clearly defined category of
Information Objects in the domain of discourse.
Examples: --
C7 Information Object
Definition: The main Resource of the Content Domain. An Information Object is a Resource
identified by a Resource Identifier. It must belong to at least one Collection. It may have
Metadata, Annotations, and multiple Editions, Views, Manifestations. In addition, it may have
Quality Parameters and Policies.
Relationships:
• Information Object <isa> Resource
• Information Object <hasFormat> Resource Format (inherited from Resource)
• Information Object is <identifiedBy> Resource Identifier (inherited from Resource)
• Information Object <belongTo> Collection
• Information Object <hasMetadata> Information Object (Metadata)
• Information Object <hasAnnotation> Information Object (Annotation)
An Edition is an Information Object and thus a Resource, therefore it is independent from the
Information Object it is an edition of.
Examples:
• An Information Object representing a study may be linked to the following Information
Objects via <hasEdition> relationships:
o its draft version is an Edition
o the version submitted is an Edition
o the version published in the conference proceedings with colour images is an Edition
C9 View
Definition: An Information Object representing a different expression of another Information
Object, to which it is related via a <hasView> relation.
Relationships:
• View <isa> Information Object
• Information Object <hasView> Information Object
Rationale: This entity represents a view of an Information Object. This concept responds to
the diversity of expressions of the same object that instantiated using different digital
technologies. Views do not represent different physical aspects, rather they are mechanisms to
differentiate types of representations or visualisations that can be given to the Information
Objects. The concept of View fits very well with those used in the DBMS, in this context a
view is a virtual or logical table (i.e. the organisational unit of data) composed as the result of
a query over the actual data stored in potentially different table and different ways in order to
provide a new organisational unit presenting data in a more useful way.
Edition and View together capture the expression concept of the IFLA-FRBR model [107].
Examples:
• An example of view that can be envisaged over the same Information Object representing
a data stream of an environmental sensor consists of its raw form as a series of numerical
values or as a graph representing the evolution of the values measured by the sensor along
the time.
• Another example might consider an Information Object representing the outcomes of a
workshop, three different views of this object can be envisaged:
o the “full view” containing a preface prepared by the conference chair and the whole
set of papers accepted and organised thematically,
o the “handbook view” containing the conference program and the slides of each
lecturer accompanied with the abstract of the papers organised per session, and
o the “informative view” reporting the goal of the workshop and the title list of the
accepted papers together with the associated abstract.
C10 Manifestation
Definition: An Information Object representing the physical embodiment of another
Information Object, to which it is related via a <hasManifestation> relationship.
Relationships:
• Manifestation <isa> Information Object
• Information Object <hasManifestation> Information Object
Rationale: Like Editions and Views, Manifestations are derived from a relation
(<hasManifestation>). However, while the Editions and Views deal with the intellectual and
logical organisation of Information Objects, Manifestations deal with their physical
presentation. Another important difference is that Manifestations may, transparently to the
Actor, be dynamically generated through a possibly complex process, taking into account
Actor preferences, templates, size restrictions, and other factors.
Examples: Examples of manifestations are the PDF file or the Microsoft Word file of the
same paper, the MPEG file containing the video recording of a lecture, a file containing the
raw data observed by a sensor, an XML file reporting the results of a certain elaboration.
C11 Metadata
Definition: Any Information Object that is connected to one or more Resources through a
<hasMetadata> relationship.
Relationships:
• Metadata <isa> Information Object
• Resource <hasMetadata> Information Object (Metadata)
• Information Object <hasMetadata> Information Object (Metadata)
• Metadata <hasFormat> Resource Format that is an <expressionOf> Ontology (inherited
by Resource)
• Actor Profile <isa> Metadata
• Policy Metadata <isa> Metadata
• Component Profile <isa> Metadata
Rationale: The “classic” definition of metadata is “data about data”. However, it depends
from the context whether an object is or is not metadata. This is the main motivation leading
to the modeling of them as a derived notion from the instances of the <hasMetadata> relation.
Metadata are used for describing different aspects of data, such as the semantics, provenance,
constraints, parameters, content, quality, condition, and other characteristic. These data can be
used in different contexts and for a diversity of purposes; usually, they are associated with an
Information Object (more in general to a Resource through the <hasMetadata>) as a means
for facilitating the effective discovery, retrieval, use and management of the object.
There are a number of schemes for classifying metadata.
One of them consists in classifying metadata according to the specific role they play:
• Descriptive metadata, i.e. metadata that provide a mechanism for representing attributes
describing and identifying the Resource. Examples include bibliographical attributes (e.g.
creator, title, publisher, date), format, list of keywords characterising the contents. The
term “descriptive” is used here in a consistent, but broader sense than in “descriptive
cataloguing”.
• Administrative metadata, i.e. metadata for managing a Resource. This category of
metadata may include metadata detailing: (i) technical characteristics of the Resource, (ii)
the history of the operations performed on the Resource since its creation/ingest, (iii)
means of access, (iv) how the authenticity and integrity of the Resource can be verified.
• Preservation metadata, i.e. metadata designed to support the long term accessibility of a
Resource by providing information about its content, technical attributes, dependencies,
management, designated community(ies) and change history. Preservation Metadata have
been identified as essential for the long-term management of digital objects. The
Reference Model for an Open Archival Information System (OAIS) [49] provides an
Rationale: Annotations can support co-operative work by allowing Actors to merge their
intellectual work with the DL Resources provided by the DL to constitute a single working
context. Annotations can be used in various contexts, e.g.
• to express a personal opinion about an Information Object,
• to enrich an Information Object with references to related works or contradictory
Information Object,
• to add personal notes about a retrieved Information Object for future usage.
Annotations are not only a way of explaining and enriching a DL Resource with personal
observations, but also a means of transmitting and sharing ideas in order to improve
collaborative work practices. Thus, annotations can be geared not only to the way of working
of the individual and to a method of study, but also to a way of doing research, as it happens
in the Humanities.
As Annotations are Information Objects they may be in different formats, be expressed in
different media, be associated with metadata, and can be themselves annotated. Actually, in
literature there is an ongoing discussion whether annotations have to be considered either
metadata or information objects. For the time being an Annotation is modelled as an
Information Object because (i) it has been considered an additional information that increase
the existing content by providing an additional layer of elucidation and explanation of it, and
(ii) because of this, the Annotation itself takes the shape of an additional Information Object
which can help people in understanding the annotated Resource. Actually, the status of
Annotation is derived from the <hasAnnotation> relation linking Resources; this choice
settles the long-standing issue whether Annotations are to be considered as Information
Objects or as Metadata.
A final observation is about the evolving nature of the Information Objects and in general of
the Resources that may result in invalidating a previously expressed Annotation. Usually each
update results in having a new Edition thus it is sufficient to link the Annotation to the
appropriate version it refers to.
Examples: --
C15 Collection
Definition: A content Resource Set. The extension of a collection consists of the Information
Objects it contains. A collection may be defined by a membership criterion, which is the
intension of the collection.
Relationships:
• Collection <isa> Resource Set
• Collection <isa> Resource
• Information Object <belongTo> Collection
• Collection <hasIntension> Query
• Collection <hasExtension> Resource Set (set of Information Object)
Rationale: Collections represent the “classic” mechanism to organise Information Objects
and to provide focused views of the Digital Library Information Object Resource Set. These
focused views enable Actors to access to thematic parts of the whole; they can be created by
the Librarians in order to keep the set of Information Objects organised and to improve its
access and usage; further, they can be created by authorised Content Consumers in order to
implement their own personal views of the Digital Library Information Object Resource Set.
The definition and identification of the Information Objects constituting a Collection (the
collection extension) is based on a characterisation criterion (the collection intension). These
criteria can range from an enumeration of the extension to conditions that specify which are
the properties that information objects must satisfy in order to be collection members (truth
conditions).
Typically, Collections are hierarchically structured in sub-collections, but for generality we do
not include this structuring in the present model.
Examples: --
C16 Query
Definition: A characterisation criterion capturing the common traits of the Resources forming
a Resource Set.
Relationships:
• Query <isa> Information Object
Rationale: The notion of query is well known in the DB area where it indicates an
expression issues according to a query language, e.g. SQL, to obtain the data stored in the DB.
Digital Libraries, as well as other Information Retrieval systems, borrowed this term to
represent the information need of their users. In the case of Digital Libraries queries can be
expressed according to various query languages ranging from keyword-based to fielded
forms.
The notion of Query is foundational for the Search Function. However, it can be used for
other purposes. This reference model uses them to capture the intension (<hasIntension>)
definition of a Collection.
Examples:
• “Digital Library” is the representation of a query constituted by two tokens issued by an
user interested in retrieving Resources dealing with Digital Libraries;
• “subject=H3.7 Digital Library AND author=Arms” is the representation of a complex and
fielded query issued by an Actor interested in finding the Resources having metadata that
contains the specified values in the identified fields.
C17 Ontology
Definition: An ontology is a formal conceptualization that defines the terms about a domain.
Ontologies formalize a shared vocabulary about a domain [93]
Relationships:
• Ontology <isa> Information Object
• Resource Format is <expressionOf> Ontology
Rationale: The notion of ontology generalizes that of schema or format, as well as related
notions, such as that of thesaurus. Ontologies may refer to different aspects of Information
Objects, such as their structure, their content, their preservation and others. Although a Digital
Library might define and adopt its own proprietary formats, it is widely acknowledged that
standard representation models (e.g. Dublin Core for descriptive metadata, MPEG for the
structure of audio-visual objects, OAIS for preservation) enhance the interoperability and
reuse of Resources. The emergence of rich schemas, such as CIDOC Conceptual Reference
Model (CRM) [190], which enable content owners or holders to define articulated
descriptions of their digital assets, and to exploit such descriptions in accessing the
information or in managing complex applications around them demands greater flexibility at
the level of generalisation. Semantic Web technologies, notably the Web Ontology Language
(OWL), which builds upon Description Logics and the associated inferential capabilities is
another driver.
The reference model does not make any commitment to a specific Ontology, rather it assumes
that the various “systems”, DL, DLS, and DLMS, will be able to offer to its users the ability to
handle multiple ontologies either sequentially or independently. A mechanism to support this
could offer:
• An ontology language, able to represent any ontology the DL users may want to work
with (e.g. OWL);
• An ontology mapping framework, consisting of a language for expressing relations
between elements from different ontologies, and an associated engine to exploit such
mappings in query evaluation.
Examples: --
C18 User Domain
Definition: One of the six main concepts characterising the digital library universe. It
represents the various aspects related to the modelling of external entities, either human or
machines, interacting with the digital library.
Relationships:
• Digital Library <definedBy> Actor Domain
• Digital Library System <definedBy> Actor Domain
• Digital Library Management System <definedBy> Actor Domain
• Actor Domain <consistOf> Actor
Rationale: The User Domain concept represents the Actors (whether human or not) entitled
to interact with Digital Libraries. The aim of Digital Libraries is to connect such Actors with
information (the Information Objects) and to support them in consuming already available
information and produce new information (through the Functions). User Domain is an
umbrella concept that covers all notions related to the representation and management of
Actor entities within a Digital Library, e.g., the digital entities representing the actors, their
rights within the system, their profiles (Actor Profile) exploited to personalize the system’s
behaviour or to represent these actors in collaborations.
Examples: --
C19 Actor
Definition: A Resource that represents an external entity that interacts with the Digital
Library and it is identified by a Resource Identifier. Furthermore, it may have at least one
Actor Profile and it may belong to at least one Group and be regulated by a set of Policies. An
Actor may be characterized by Quality Parameters and may be linked to other Actors.
Relationships
• Actor <isa> Resource
• Actor is <identifiedBy> Resource Identifier (inherited from Resource)
• Actor is <regulatedBy> Policy (inherited from Resource)
• Actor <belongTo> Group
• Actor is <modelledBy> Actor Profile
• Actor is <associatedWith> Actor
the system, such as permissible Functions and accessible Resources. Members of a Group
inherit (part of) the characteristics from the Group but they may have additional
characteristics as described in their individual Actor’s profile.
Examples:
• Community is an example of Group.
C21 Community
Definition: A social Group.
Relationships
• Community <isa> Group
Rationale: A Community is a particular subclass of Group, which refers to a social group of
humans with shared interests. In human Communities, intent, belief, resources, preferences,
needs, risks and a number of other conditions may be present and common, affecting the
identity of the participants and their degree of cohesiveness. Community is a pre-existing
group of people with shared interests, which is online in the Digital Library, or a group that is
formed as Actors in the Digital Library interact with the Library’s contents or with other
Actors. For instance, in a Digital Library with publications, there may be the Community of
people interested in Artificial Intelligence and the Community of people providing test
collections for Information Retrieval algorithms. On the other hand, as Group it is a well-
defined user community identified by a specific Actor Profile and Resource Identifier. The
Profile records permissible Roles, Functions, and Resources according to specific Policies.
Examples: --
C22 Role
Definition: A set of functions within the context of an organisation with some associated
semantics regarding the authority and responsibility conferred on the user assigned role.
Relationships:
• Actor <play> Role
• End-User <isa> Role
• DL Designer <isa> Role
• DL System Administrator <isa> Role
• DL Application Developer <isa> Role
Rationale: The above definition comes from [74] and works in accordance with the policy
mechanism pervading the Policy Domain (Section II.2.5). A role is a kind of a pre-packaged
generic profile and may be viewed as a packet of statements identifying the kind of Functions
a Actor is eligible to perform within the system. Thus, a role may be stored as a profile that
represents an individual or (most likely) a population of users. Roles are also called
stereotypes in user modelling. An Actor can be assigned to a Role; this means, the Actor
inherits all the Role statements. Clearly an Actor can play different Roles at different times or
more than one Role at the same time. Apart from the four main Actor Roles defined (End
User, DL Designer, DL System Administrator, and DL Application Developer), the following
generic Role are distinguished within a DL context and are subsequently defined: Content
Consumer, Content Creator, and Librarian, which are sub-roles of the End-User role. Apart
from these roles and sub-roles that are prototypically defined in the reference model, any
digital library could, and should, define additional roles. A sub-role may be defined providing
it with some of the Functions of a generic Role. For example, a content annotator Role might
be a sub-role of information creator that entitles Actors to only annotate existing Information
Objects.
Examples: --
C23 End-User
Definition: The Role of the Actors that access the Digital Library for exploiting its Resources
and possibly producing new ones.
Relationships
• End-User <isa> Role
Rationale: End-Users exploit DL facilities for providing, consuming, and managing DL
content (usually Information Objects, in general Resources). It is actually a class of Actors
further subdivided into the concepts of Content Creator, Content Consumer, and Librarian,
each one of which usually has a different perspective on the Digital Library. For instance, a
Content Creator may be a person that creates and inserts their own objects in the Digital
Library or an external program that automatically converts artefacts to digital form and
uploads them to the Digital Library.
Examples: --
C24 Content Consumer
Definition: The Role of the Actors that access the digital library for consuming its Resources,
usually Information Objects, through the available Functions.
Relationships
• Content Consumer <isa> End-User
Rationale: A Content Consumer is any entity that accesses the Digital Library for exploiting
(part of) its Resources. A person that searches (Search function) the contents of a digital
collection or an external subscription service are instances of Content Consumers.
Examples: --
C25 Content Creator
Definition: The Role of the Actors that provide new Information Objects to be stored into the
digital library or update already existing Information Objects.
Relationship
• Content Creator <isa> End-User
Rationale: A Content Creator may be a human or a program or another system. For instance,
it may be a person that creates and inserts their own documents in the Digital Library or an
external program that automatically converts artefacts to digital form and uploads them to the
Digital Library.
Examples: --
C26 Librarian
Definition: The Role of the Actors that manage digital library’s Resources, namely
Information Objects and End-Users.
Relationships:
• Librarian <isa> End-User
Examples: --
C31 Function
Definition: A particular operation that can be realized on a Resource or Resource Set as the
result of an activity of a particular Actor. It is identified by a Resource Identifier. It may be
performed by an Actor or it may refer to the respective supporting process of the DLS.
Relationships:
• Function <isa> Resource
• Function is <identifiedBy> Resource Identifier (inherited from Resource)
• Function is <influencedBy> Actor Profile
• Function is <influencedBy> Policy
• Function <actOn> Resource
• Function is <regulatedBy> Policy (inherited from Resource)
• Function <hasQuality> Quality Parameter (inherited from Resource)
• Actor <perform> Function
Rationale: A Function captures any processing that can occur on Resources and is typically
perceived as a result of an activity of an Actor in a Digital Library. It can possibly involve any
type of Resource and can be potentially performed by any kind of Actor. For instance, not
only a user can Search the contents in a digital library, i.e., Information Objects, but also an
Actor can search for other Actors, a program can Search for offered Functions, and so forth.
Due to its broad scope, Function is specialized into a set of specific but still quite generic
subclasses, such as Access Resource. In practice, a Digital Library can use different
specializations and combinations of these Functions intended to different Actors and
Resources.
Examples:
• Access Resource
• Manage Resource
C32 Access Resource
Definition: The class of Functions which provide Actors with mechanisms for discovering
and accessing Resources.
Relationships:
• Access Resource <isa> Function
• Access Resource <retrieve> Resource
• Discover <isa> Access Resource
• Acquire <isa> Access Resource
• Visualise <isa> Access Resource
Rationale: This is a family of Functions that do not modify the Digital Library or its
Resources but help in identifying Resources intended to be simply examined and perceived by
an Actor or possibly further exploited through use of other functions, such as Manage
Resource functions.
Examples: Discover, Acquire and Visualise are three classic Access Resource functions.
C33 Discover
Definition: The family of Functions to find a Resource, which may be an individual one or a
Resource Set compliant with the specification of the Actor request, as expressed by a Query or
by browsing.
Relationships:
• Discover <isa> Access Resource
• Discover <actOn> Resource Set
• Discover <return> Result Set
• Search <isa> Discover
• Browse <isa> Discover
Rationale: Discover is the central Access Resource function, which acts on Resource Sets and
aims at retrieving desired Resources.
Examples: --
C34 Browse
Definition: It is an Access Resource function which lists Resources in a Resource Set ordered
or organised according to a given characteristic or scheme.
Relationships:
• Browse <isa> Discover
Rationale: The Browse function allows an Actor to explore Digital Library’s Resources and it
may be used alternately with Search for this purpose. A Digital Library can be equipped with
different Browse capabilities. For instance, it may provide a different ordering or grouping of
Resources, such as browse per-author, when a Collection of publications is explored for
searching the correct form of the name of an author, or through an ontology representing the
underlying Collection of Information Objects or the set of permissible Functions.
Alternatively, graphical representations of a Resource Set may be used for browsing DL
Resources. For instance, it may be possible to have a digital library Collection depicted by
using bubbles or areas of different size each representing a certain topic and then navigating
among those bubbles in order to investigate on the content of each. Another example is that of
a tag cloud17, i.e., a visual depiction of descriptors, namely tags, that are used to annotate
Resources. Tags are typically listed alphabetically, and tag frequency is shown with font size
or colour. The tags are usually hyperlinks that lead to a collection of items that are associated
with that tag.
Examples: --
C35 Search
Definition: It is an Access Resource function that allows an Actor to discover the Resources
matching a Query, which are returned as a Result Set. Search must be triggered by a Query.
Relationships:
• Search <isa> Access Resource
• Search <issue> Query
• Search <return> Result Set
17
http://en.wikipedia.org/wiki/Tag_cloud
Rationale: There are several types of Search that can be performed by different types of
Actors and for accessing different types of Resources. For instance, not only a person can
Search the contents in a digital library, i.e., Information Objects, but also an Actor can Search
for other Actors, a program can Search for offered Functions, and so forth. Furthermore, the
Query describing the desired objects may be based on the content of a Resource, its Actor
Profile, its metadata, its annotations and so forth, and any combination of them. The form of
the Query does not constrain the type of Resource retrieved, e.g. a textual query can be used
to retrieve Information Objects whose manifestations are videos or audio files. An important
characteristic of the Search function is the search paradigm adopted. For example, the
Information Objects desired may be described through a query specification or condition. This
may consist of an unstructured query condition, i.e. sequence of search terms, combined with
operators, such as “and”, “or”, and “not”, or it can be a structured or fielded search, where
query conditions are expressed in terms of the metadata fields, e.g. “all the information
objects on a given research topic created by a certain author and published in a specific period
of time”. Moreover, an important characteristic of the search functionality resides in which
model is adopted in identifying the pertinence of the objects with respect to a query, e.g. the
Boolean model or the vector-space model.
Examples:
• ‘Query-By-Example’, which is based on an example Resource provided by the Actor. This
allows end users for example, to Search for Resources similar to a provided sample image
as well as to Search for those deemed similar to an excerpt of an audio.
• ‘Relevance feedback’. This supports the iterative improvement of the search Result Set by
allowing the Actor to express a relevance judgment on the retrieved Resources at each
iteration step. It improves effectively the discovery mechanism and the user satisfaction
because it enhances the expressive power of the query language supported by the digital
library.
C36 Acquire
Definition: It is an Access Resource function supporting an Actor in retaining Resources in
existence past the lifetime of the Actor interaction with the system.
Relationships
• Acquire <isa> Access Resource
• Acquire <actOn> Resource
Rationale: This Function provides mechanisms such as locally saving and printing the
content or metadata related to Information Objects.
Examples: --
C37 Visualise
Definition: It is an Access Resource function enabling an Actor to graphically perceive a
Resource, such as an Information Object or an Actor Profile.
Relationships:
• Visualise <isa> Access Resource
• Visualise <actOn> Resource
Rationale: Resources may be complex and they may be comprised of several parts. For
instance, an Information Object may combine information manifested in different media. The
Visualise function must thus be tailored according to the End-User characteristics, like the
device it uses or its personal setting, as well as to the characteristics of the object to be
rendered. Visualization is any technique for creating images, diagrams, animations, and so
forth to communicate a message.
Examples: Animation or drawing of diagrams are examples of Visualise function.
all the objects waiting for being published by the librarians, or directly to the DL Resource
Set, i.e., the set of resource seen by DL Actors.
Examples: --
C41 Withdraw
Definition: It is a Manage Resource function supporting an Actor in withdrawing Resources
from the DL.
Relationships:
• Withdraw <isa> Manage Resource
• Withdraw <actOn> Resource Set
Rationale: --
Examples: --
C42 Update
Definition: It is a Manage Resource function allowing an Actor to modify an already existing
Resource.
Relationships:
• Update <isa> Manage Resource
Rationale: This function implies capabilities to modify the Resource.
Examples: In case of Information Objects, it may add a new Edition, or a new View to an
already existing Information Object.
C43 Validate
Definition: It is a Manage Resource function supporting the Actor in validating the quality
status of a DL Resource.
Relationships:
• Validate <isa> Manage Resource
Rationale: This function supports the Actor in validating the quality status of a Resource of
the DL. The Function makes use of relevant quality parameters.
Examples: --
C44 Annotate
Definition: It is a Manage Resource function allowing an Actor to create an Annotation about
a Resource.
Relationships:
• Annotate <isa> Manage Resource
• Annotate <createAnnotation> Annotation
Rationale: This function allows an Actor to add Annotations. Annotations are Information
Objects. Management of existing Annotations may be performed using Manage Resource and
Manage DL functions. Moreover, since there are different types of annotations, such as notes
and bookmarks, the Annotate function may allow for the definition of one or more types,
which comply with the different meanings of Annotation in use.
Examples: --
C47 Publish
Definition: It is a Manage Information Object supporting an Actor in making Information
Objects available into the DL according to certain Policies.
Relationships:
• Publish <isa> Disseminate
Rationale: The information objects become available within the DL in accordance with the
policies assigned to them.
Examples: --
C48 Author
Definition: It is a Manage Information Object function supporting an Actor in creating
Information Objects.
Relationships
• Author <isa> Manage Information Object
• Author <creates> Information Object
Rationale: This function enables the Actor to create information objects according to one of
the DL accepted information object’ Resource Format.
Examples: --
C49 Compose
Definition: It is a Manage Information Object function supporting the Actor in using (parts
of) existing Information Objects in order to build compound objects.
Relationships
• Compose <isa> Author
Rationale: This function encapsulates the capabilities to create new information objects by
re-using existing objects, either in part or as a whole. For example, the user may compose a
multimedia album by putting together audio files, song lists, and singer biographies.
Examples: --
C50 Process
Definition: It is a Manage Information Object function supporting the Actor in all activities
related with the transformation and analysis of an information object.
Relationships
• Process <isa> Manage Information Object
• Analyze <isa> Process
• Transform <isa> Process
Rationale: This function encapsulates the capabilities to analyze and transform information
objects in order to view, disseminate or extract information from them. This represents a very
important category of Functions as it contains fundamental activities for taking advantage of
the DL Content for scientific, educational and recreational purposes.
Examples: --
C51 Analyze
Definition: It is a Process function supporting the Actor in all activities related with the
analysis of an Information Object.
Relationships
• Analyze <isa> Process
• Linguistic Analysis <isa> Analyze
• Qualitative Analysis <isa> Analyze
• Statistical Analysis <isa> Analyze
• Scientific Analysis <isa> Analyze
• Create Structured Representation <isa> Analyze
• Compare <isa> Analyze
Rationale: This function encapsulates the capabilities to analyze information objects in order
to extract information from them. It includes Functions related to the analysis of the
Information Object content or metadata, for statistical, scientific, linguistic, preservation, etc
purposes.
Examples: --
C58 Compare
Definition: It is an Analyze function supporting the Actor in comparing two or more
Information Objects, either primary ones or their metadata.
Relationships
• Compare <isa> Analyze
Rationale: This function represents the group of functions relevant to the comparison of
Information Objects. This may be performed for many reasons, preservation being a very
important one among them.
Examples: --
C59 Transform
Definition: It is a Process function enabling an Actor to create different views or
manifestations of an Information Object (or a set of Information Objects).
Relationships
• Transform <isa> Process
• Physically Convert <isa> Transform
• Extract <isa> Transform
• Convert to Different Format <isa> Transform
Rationale: Different representations of an Information Object (or a set of Information
Objects) enable the Actor to perceive information at different levels of abstraction, as desired.
Such possible conversions may be achieved with the help of approaches such as format
conversions, information extraction, automatic translation and summarization techniques.
Examples: --
C60 Physically Convert
Definition: It is a Transform function supporting the Actor in creating new manifestations of
an information object.
Relationships:
• Physically Convert <isa> Transform
• Physically Convert <createManifestation> Information Object
• Translate <isa> Physically Convert
Rationale: This function represents a wide range of functions related to the transformation of
the content of the Information Object. This transformation may include translation, text to
speech and speech to text conversions, tables in texts into spreadsheet or database format, data
into graphs, from 3D to 2D, different medium (including from paper to digital form), images
into colour histograms etc.
Examples: --
C61 Translate
Definition: It is a Physically Convert function enabling Actors to perceive an Information
Object in a language different from the object’s or the user’s native language. In this context
languages can range from country languages, e.g. Italian, English, to community and cultural
languages, e.g. Muslim culture.
Relationships:
• Translate <isa> Physically Convert
Rationale: Digital libraries must support the access to the information objects in as many
different languages as possible to enhance the usage of their Content. This function enables
multilingual information access. Multilingual information access approaches include query
translation, information object translation and combinations of both.
Examples: --
Rationale: This function enables the user to create a new version (e.g. convert the object into
another encoding). Depending on the type of the object, different types of conversions may be
possible, such as conversion into different encoding (converting a text from pdf to word, an
image to a different format or compression scheme, etc). This is a Function particularly useful
for interoperability purposes.
Examples: --
C63 Extract
Definition: It is a Transform function enabling an Actor to obtain a different manifestation of
an Information Object (or a set of Information Objects).
Relationships
• Extract <isa> Transform
• Extract <createManifestation> Information Object
Rationale: This function enables the user to create a new manifestation of an object which
may contain several parts of it. An example of such a function may be the extraction of
citations or text summaries.
Examples: --
C64 Manage Actor
Definition: It is a Manage Resource function supporting the administration of the set of
Actors that access the digital library.
Relationships:
• Manage Actor <isa> Manage Resource
• Manage Actor <actOn> Actor
• Establish Actor <isa> Manage Actor
• Personalise <isa> Manage Actor
Rationale: This is a family of functions supporting the DL administrators in dealing with the
DL user management. In particular, they cover the creation of new Actors, remove already
existing ones, and regulating their rights, i.e. establishing the tasks they are entitled to perform
and the information objects they are entitled to use as well their profile and associated
personalization issues.
Examples: --
C65 Establish Actor
Definition: It is a Manage Actor function dealing with the specific issues of the creation of
the Actors and their recognition by the DL.
Relationships
• Establish Actor <isa> Manage Actor
• Register <isa> Establish Actor
• Login <isa> Establish Actor
Rationale: An important aspect of the management of the DL Actors is the user creation,
registration, login and application of their profile on their actions.
Examples: --
C66 Register
Definition: It is the Establish Actor function supporting the adding of a new Actor to the set
of those managed and recognised by the digital library.
Relationships:
• Register <isa> Establish Actor
• Sign Up <isa> Register
Rationale: This function is responsible for populating the digital library user community.
Usually the fewer are the requirements imposed on the registration of novel users, the harder
is for the system to ensure the identity of a user. The constraints imposed at registration time
are a direct consequence of the audience the digital library is designed for. All these aspects
are decided at the DL design time by the DL Designer and are related to policies and
requirements that define the available Actor Profiles.
Examples: --
C67 Sign Up
Definition: It is a Register function supporting Actors in actively requesting their registration
in the DL and possibly expressing an interest on particular aspects of the DL.
Relationships:
• Sign Up <isa> Register
Rationale: This function encapsulates actions relevant to the active request of the Actor for
being registered in the DL and having access to its content. It is closely related to
personalization as the Actor during this process may fine-tune certain aspects of its Actor
Profile.
Examples: --
C68 Login
Definition: It is an Establish Actor function that enables an Actor to establish its identity in
the DL.
Relationships:
• Login <isa> Establish Actor
Rationale: Login is performed by matching a set of qualities or characteristics that uniquely
identifies an Actor. Assurance of identification can be increased by a number of practices
appropriate to the need. These practices range from passwords to tokens, smart cards, and
public keys with Certificates. The system then performs authentication, and it may further
perform authorization of the user. The execution of this function should be regulated by
policies.
Examples: --
C69 Personalise
Definition: The class of Manage Actor that supports Actors in having a personalized access to
the Content and Functionality of the DL.
Relationships
• Personalise <isa> Manage Actor
• Apply Profile <isa> Personalize
Rationale: This is a family of functions dedicated to adapt aspects of a digital library to the
DL users needs. These aspects may range from the DL look and feel to the organisation of the
digital library Content so that it satisfies the personal interest of its users. A main group of
personalization functions contains customization and application of the Actor Profile to all
DL Resources, whereas other functions may be related with the user feedback to the DL in
order to improve provided Functionality and Content.
Examples: --
C70 Apply Profile
Definition: It is a Personalise function enabling the applications of the Actors to the various
types of Function offered by a digital library.
Relationships:
• Apply Profile <isa> Personalise
Rationale: This function assumes that the system (semi-)automatically constructs a profile
per-user. Then, profile information is used to personalize the DL functions, e.g. personalized
search, recommendations, and so forth.
Examples: --
C74 Collaborate
Definition: The class of functions that supports Actors in sharing information, working and
communicating effectively and efficiently with peers.
Relationships
• Collaborate <isa> Function
• Exchange Information <isa> Collaborate
• Converse <isa> Collaborate
• Find Collaborator <isa> Collaborate
• Author Collaboratively <isa> Collaborate
Rationale: This is a family of functions that consists of a set of capabilities dedicated to
support Actors in using the DL as a common workspace. Some of the contained Functions
may be specializations of other Functions, as well, related to information access.
Examples: --
C76 Converse
Definition: It is a Collaborate function that supports an Actor in conversing through the DL
system.
Relationships
• Converse <isa> Collaborate
Rationale: This is a group of functions that allows Actors to talk to peers and exchange views
and opinions through DL chat services, on-line forum or list servers.
Examples: --
preservation works for the DL Content. Preservation Policies are very important for the
preservation related functions.
For a comprehensive description of the Preservation issue please refer to Section II.4.
Examples: --
Examples: --
Rationale: This function is the highest-level function with respect to the management of
policies, i.e. all the other functions dealing with policies are constrained by its choices and
outcome. For instance, the Manage Policy Domain is constrained by the values specified
when invoking the establish policies function at DL design time.
Examples: --
Building Digital Library policies is a complicated task since they must serve the needs of
institutions of various types and sizes which work together in a continuously evolving
distributed environment.
Policies exist at different levels, some ensure the effective functioning of the organisation that
manages the DL and others relate more directly to Actor services and how they are provided
and accessed. They make manifest operational expectations in such areas as: collection
development and management guidelines; human resource policies; space use policies;
confidentiality practices; user registration and enrolment, library card and borrowing policies;
and service use policies, e.g., acceptable user behaviour.
While Policy Domain is a general term, specific aspect within a single area are covered by
Policy and are manifested through a document which usually consists of policy statement,
rationale, enforcement, responsible office (Policy <expressedBy> Information Object).
Examples: --
C112 Policy
Definition: A condition, rule, term or regulation governing the operation of a Digital Library.
Relationships:
• Policy <isa> Resource
• Resource <regulatedBy> Policy
• Actor <regulatedBy> Policy
• Function <regulatedBy> Policy
• Policy <expressedBy> Information Object
• System Policy <isa> Policy
• Content Policy <isa> Policy
• User Policy <isa> Policy
• Functionality Policy <isa> Policy
• Enforced Policy <isa> Policy
• Voluntary Policy <isa> Policy
• Explicit Policy <isa> Policy
• Implicit Policy <isa> Policy
• Extrinsic Policy <isa> Policy
• Intrinsic Policy <isa> Policy
• Descriptive Policy <isa> Policy
• Prescriptive Policy <isa> Policy
Rationale: A Policy regulates Actors consuming Resources through Functions with respect to
a validity interval (Time domain can be used here). Policy has a specific area coverage, for
example Registration Policy or Preservation Policy.
Policy may be descriptive (e.g., Collection Development Policy which explains what is the
content of the collection and how it will be developed in future) or prescriptive (there are
strict procedures to follow, e.g. Registration Policy).
The currently identifies policy entities should be considered as examples; they are currently
the most important in the digital libraries.
Examples:
Privacy and Confidentiality Policy is a Policy which describes what rules are followed to
assure privacy and confidentiality of the Actors. This is seen as a part of the DL system.
The same Policy within the Digital Library System is seen as the specification what Functions
should be present, and on the Digital Library Management System refers to the practical
implementation of the Functions.
C113 Extrinsic Policy
Definition: A Policy defined outside of and applied within the DL.
Relationships:
• Extrinsic Policy <isa> Policy
• Extrinsic Policy is <antonymOf> Intrinsic Policy
• Extrinsic Policy <isa> Policy by context
Rationale: Extrinsic Policy is a Policy that is imposed from a body outside the Digital
Library (e.g. legal and regulatory frame works). According to the type of the DL, the
regulatory framework might differ - a DL in the pharmaceutical arena will operate in a very
different regulatory framework from one in the area of tourism.
Examples:
Legal and regulatory frameworks of a specific country applied to a Digital Library developed
by a local body.
C114 Intrinsic Policy
Definition: A Policy defined inside of and applied within the DL.
Relationships:
• Intrinsic Policy <isa> Policy
• Intrinsic Policy is <antonymOf> Extrinsic Policy
• Intrinsic Policy <isa> Policy by context
Rationale: Intrinsic Policy manifests the Policy principles implemented in the DL.
A Policy that is defined by the DL or its organisational context that reflect organisation’s
mission and objectives, intended expectations as to how Actors will interact with the DL,
expectations of content creators as to how their content will be used.
Examples:
A Policy within the Policy of the respective Digital Library is an Intrinsic Policy.
C115 Explicit Policy
Definition: A Policy that has been stated and approved.
Relationships:
• Explicit Policy <isa> Policy
• Explicit Policy is <antonymOf> Implicit Policy
• Explicit Policy <isa> Policy by expression
Rationale: Explicit Policy is a Policy defined by the DL managing organisation and reflecting
the objectives of the DL and how it wishes its users to interact with the DL. The
implementation of Explicit Policy on the Digital Library Management System level
corresponds to the definition and Actor expectations.
Examples:
Limitation for upload of files over a specified size, e.g. over 1 MB, which is clearly stated in
the user interface in addition to the explanation within the text of the Submission and
Resubmission Policy.
C116 Implicit Policy
Definition: A Policy that is inherent in the DL either through accident of design or
undocumented development decisions, but was not explicitly planned or stated.
Relationships:
• Implicit Policy <isa> Policy
• Implicit Policy is <antonymOf> Explicit Policy
• Implicit Policy <isa> Policy by expression
Rationale: Implicit policies usually arise as a result of ad-hoc decisions taken at system
development level or as a consequence of the inadequate testing of DLS that result in an
interaction of policies resulting in unintended policy deployment.
This is an illustration how improper actions on Digital Library System level or Digital Library
Management System level can cause consequences for the DL.
Implicit policies should be avoided as they tend to be opaque, have unintended and
unexpected consequences which impact on the interaction of all Actor communities with the
DL.
Examples:
An implemented but not communicated to the actors limitation in the file size while
uploading or downloading resources from the Digital Library is an example of Implicit
Policy.
C117 Prescriptive Policy
Definition: A Policy that constrains or manages interactions between DL Actors (virtual or
real) and the DL.
Relationships:
• Prescriptive Policy <isa> Policy
• Prescriptive Policy <isa> Policy by application
Rationale: Prescriptive policies can cover a broad range of policies from the kinds of function
specific types of Actors can have access to those that govern collection development.
Examples:
Termination of file upload, if the file is of format which is not permitted, is an example of
action which is taken as a result of a Prescriptive Policy.
C118 Descriptive Policy
Definition: A Policy which provides explanation on a certain Policy.
Relationships:
• Descriptive Policy <isa> Policy
• Descriptive Policy <isa> Policy by application
Rationale: Descriptive Policies are used to present in the form of explanation the aspects of a
particular Policy. A Descriptive Policy is a Policy which describe modes of behaviour,
expectations of Actor interaction, collecting and use guidelines, but which do not manifest
themselves through the automated application of rules, as a Prescriptive Policy does.
Examples:
The Collection Development Policy describes what is the scope and coverage of the DL.
C119 Enforced Policy
Definition: A Policy which is deployed and strictly applied within the DL.
Relationships:
• Enforced Policy <isa> Policy
• Enforced Policy <isa> Policy by compliance
Rationale: An Enforced Policy is a Policy developed, deployed and strictly used in the DL.
Monitoring and reporting tools are necessary to follow up how the policy is being applied.
Examples:
Charging Policy which had been introduced into the DL is an Enforced Policy.
C120 Voluntary Policy
Definition: A Policy which is either not deployed within the DL, or which might be followed
by the Actor according to his own decision.
Relationships:
• Voluntary Policy <isa> Policy
• Voluntary Policy <isa> Policy by compliance
Rationale: Voluntary Policy basically means a Policy which is followed according to the
decision of the Actor. This is valid for all Policies which application is a matter of choice. In
some cases users may comply to policies which are not officially communicated within the
particular digital library – based on their previous experience with other digital libraries.
Examples:
The Collection Development Policy might be outlined in broad terms, but not enforced into
practice.
C121 System Policy
Definition: A Policy that concerns an aspect of a system as a whole, being it a Digital
Library, a Digital Library System, or a Digital Library Management System
Relationships:
• System Policy <isa> Policy
• Change Management Policy <isa> System Policy
• Connectivity Policy <isa> System Policy
• Support Policy <isa> System Policy
• Resource Management Policy <isa> System Policy
Rationale: This is a class of Policies which govern generic processes within the digital
library system in its entirety on the three levels (DL, DLS and DLMS).
Examples: System Policies cover most general processes in the digital library, such as
regulation of changes or management of resources.
C122 Change Management Policy
Definition: The purpose of the Change Management Policy is to regulate how changes are
being done on the three levels and within the six domains of a digital library in a rational and
consistent manner which would be effectively communicated to the Actors and would not
harm their routine work.
Relationships:
• Change Management Policy <isa> Policy
• Resource <regulatedBy> Change Management Policy
• Change Management Policy <govern> Manage DL Function
• Change Management Policy <govern> Manage DLS Function
• Change Management Policy <govern> Manage Resource
Rationale: The aim of Change Management Policy in DL is to ensure stability in the process
of restructuring and assure coherence of actions on the three levels (DL, DLS and DLMS).
The complexity of DL could be approached when in the process of change management the
issues relevant to the six basic areas – information objects, actors, policies, quality
parameters, architectural components, functions – and the three levels (DL, DLS and DLMS)
are addressed in a rational and consistent manner.
It is of highest importance to define roles and responsibilities in change management, and to
consider in detail the change management process and the support which the DLS and DLMS
should provide for its smooth execution.
Examples:
Quality Parameter Measures which demonstrate the change management progress may be
part of the Change Management Policy.
C123 Resource Management Policy
Definition: Policies defining how resources in the DL are allocated.
Relationships:
• Resource Management Policy <isa> Policy
• Resource Management Policy <isa> System Policy
• Resource Management Policy <govern> Resource
Rationale: Resource management is a key area within the organisation and use of resources
in the DL. Resource Management Policy is the Policy which describes the principles and
procedures related to this field.
Since Resources may be of different nature, this Policy would usually be a combination of
different actions and procedures.
Examples:
Checking the consistency of Resource Identifiers may be a task from the Resource
Management Policy.
C124 Support Policy
Definition: Policies describing the kinds of support that Actors can expect in using the DL
system and the Resources it contains.
Relationships:
• Support Policy <isa> Policy
• Support Policy <isa> System Policy
• Support Policy <isa> (should be) Explicit Policy
• Support Policy <isa> (should be) Descriptive Policy
• Support Policy <isa> (should be) Intrinsic Policy
• Support Policy <govern> Actor
A broader understanding of digital rights defines them as all human rights which are affected
by technology, including the rights to use computers, communication networks and resources.
Examples:
The right to access knowledge is affected by digital technology and not all people have equal
opportunities in this respect. The right to use without any license is another example.
C133 License
Definition: A Policy regulating the exploitation of a Resource.
Relationship:
• License <isa> Policy
• License <isa> Digital Rights Management Policy
• Resource <regulatedBy> License
• License <grantedTo> Actor
Rationale: License is the agreement by which the owner of intellectual property permits its
use. In digital libraries license may be issued for specific uses of resources, or for designated
functionality features which should be downloaded and installed by the users.
Examples:
GPL (GNU General Public License), a popular license for free software, GNU LGPL (Lesser
General Public License), BSD (Berkeley Software Distribution or Berkeley System
Distribution) are examples of software licenses.
C134 Preservation Policy
Definition: Policy defining the approach to preservation taken by the DL.
Relationships:
• Preservation Policy <isa> Policy
• Preservation Policy <isa> Content Policy
• Resource <regulatedBy> Preservation Policy
Rationale: Preservation Policy prescribes how to implement actions assuring long-term
preservation of Resources - decision making on archival needs, archiving practices, timing
issues, access to archived materials, subsequent preservation measures for already archived
materials, subsequent preservation measures for already archived material, maintaining
preservation metadata, issues of interoperability of preserved materials.
Examples:
Reuse of preserved materials is part of the Preservation Policy.
C135 User Policy
Definition: Policy regulating the User domain.
Relationships:
• User Policy <isa> Policy
• Digital Rights Management Policy <isa> User Policy
• User Management Policy <isa> User Policy
• Acceptable User Behaviour Policy <isa> User Policy
• Personalisation Policy <isa> User Policy
• Privacy and Confidentiality Policy <isa> User Policy
• Access Policy <isa> User Policy
Rationale: This is a class of Policies which govern processes related to User domain within
the digital library system in its entirety on the three levels (DL, DLS and DLMS).
Examples:
All policies which regulate issues regarding digital rights and user behaviour.
C136 User Management Policy
Definition: Policy defining how user management is handled.
Relationships:
• User Management Policy <isa> Policy
• User Management Policy <isa> User Policy
• User Management Policy <govern> Actor
Rationale: The User Management Policy enables to execute functions like issuing, managing,
changing, sharing accounts; administration rights; sharing resources between multiple users.
Examples:
Account management is part of the User Management Policy.
C137 Registration Policy
Definition: Policy describing the information that is required for Actors, human and machine,
to register with the DL and how this information is validated, managed, and maintained.
Relationships:
• Registration Policy <isa> Policy
• Registration Policy <isa> User Management Policy
• Registration Policy <govern> Actor
• Registration Policy <govern> Login Function
• Registration Policy <govern> Subscribe Function
Rationale: This policy explains how virtual and human users should register in order to use
the DL.
The DLMS should perform functions on user log-in, validation, management and
maintenance.
Examples:
Storage of sessions and IP addresses is an element from the Registration Policy.
C138 Personalization Policy
Definition: Policy enabling the DL to define what kinds of personalisation will be allowable
and under what circumstances.
Relationships:
• Personalization Policy <isa> Policy
• Personalization Policy <isa> User Policy
• Personalization Policy <govern> Actor
• Personalization Policy <govern> Personalize Function
Rationale: The Personalization Policy has two roles, on the one hand it enables to recognise
the user and his/her access rights, and on the other hand it enables DL to serve its Actors
guaranteeing better Quality Parameters by offering Information Objects (in general
Resources) which agree with user preferences. In the DLS the Functions which are used to
assure personalization are Apply Profile, Customize, Login and Subscribe.
Examples
The choice of representation layout on the basis of statistics of user behaviour is an example
of Personalization Policy.
C139 Privacy and Confidentiality Policy
Definition: A policy which outlines the terms by which the organisation that manages the DL
will handle personal information on its Actors.
Relationships
• Privacy and Confidentiality Policy <isa> Policy
• Privacy and Confidentiality Policy <isa> User Policy
Rationale:
Policies prescribing actor details from application and enrolment information through to actor
interaction data will be handled by the DL and the organisation that manages the DL.
Typically, the DL should only maintain personal information on Actors that is relevant to its
better functioning and services.
Data about the Actors could be entered directly by them (e.g. user names, passwords), or
obtained automatically (e.g., IP address).
The personal data collected should be protected against unauthorized access, destruction,
misuse, modification, improper disclosure and loss.
Different rules may be applied to the use of various types of personal information - e.g., email
addresses, postal address, log in names and passwords, users’ opinions entered through
webpages.
Privacy and Confidentiality Policy principles should be embedded in the DL Functions which
requires collection of data about the Actors (supplied or automatically collected).
Examples:
The use of e-mail addresses of the Actors for announcing new DL collection may be justified
as a part of the Privacy and Confidentiality Policy.
Selling or sharing with other organisations lists of e-mail addresses of the Actors is typically
not in line with the Privacy and Confidentiality Policy, unless the users agreed to this.
C140 Acceptable User Behaviour Policy
Definition: Policy covering how the Actors may or may not interact with the DL.
Relationships:
• Acceptable User Behaviour Policy <isa> Policy
• Acceptable User Behaviour Policy <isa> User Policy
Rationale: Acceptable User Behaviour Policy presents rules and regulations for appropriate
use of the DL content and services, prescribing what a user can do and what he/she should
refrain.
Examples:
Regulations on copying material from a DL are part of the Acceptable User Behaviour Policy.
Rules for citation of the source of material from a DL are part of the Acceptable User
Behaviour Policy.
Rules on downloading images of workstations for within-institutional use of DL are part of
the Acceptable User Behaviour Policy.
Examples: The opinions of the users expressed in a DL forum or blog can be used as a source
for qualitative measure of important issues for the users (content analysis is one of the popular
techniques to analyse texts).
C150 Quantitative Measure
Definition: A Measure based on unit of measurement which is expressed via numerical
values.
Relationships:
• Quantitative Measure <isa> Measure
Rationale: Quantitative measures are based on collecting and interpreting numerical data.
There is a wide range of statistical methods for their analysis.
Examples: Quantitative measure is applied when collecting data and calculating the mean
time spent by the users in locating content.
C151 Measurement
Definition: The action of, and the value obtained by, measuring a Quality Parameter in
accordance with a selected Measure.
Relationships:
• Quality Parameter <measuredBy> Measurement
• Measurement is assigned according to (<accordTo>) a Measure
Rationale: See Quality Parameter.
Examples: See Quality Parameter.
C152 Quality Parameter
Definition: A Resource that indicates or is linked to performance or fulfilment of
requirements by another Resource. A Quality Parameter is evaluated by (<evaluatedBy>) a
Measure, is <measuredBy> a Measurement, and expresses the assessment
(<expressAssessment>) of an Actor.
Relationships
• Quality Parameter <isa> Resource
• Quality <expressedBy> Quality Parameters
• Resource <hasQuality> with respect to Quality Parameter
• Actor <expressAssessment> about Resources according to Quality Parameters
• Quality Parameter <evaluatedBy> Measure
• Quality Parameter <measuredBy> Measurement
• Quality Parameter <affectedBy> Resource
• Generalic Quality Parameter <isa> Quality Parameter
• Content Quality Parameter <isa> Quality Parameter
• Functionality Quality Parameter <isa> Quality Parameter
• User Quality Parameter <isa> Quality Parameter
• Policy Quality Parameter <isa> Quality Parameter
• Architecture Quality Parameter <isa> Quality Parameter
Rationale: Quality parameters serve the purpose of expressing the different facets of the
Quality and provide information about how and how well a Resource performs with respect to
some viewpoint. They express the assessment of an Actor, being it human or not, about the
Resource under examination. They can be evaluated according to different Measures, which
provides alternative procedures for assessing different aspects of a Quality Parameter and
assigning it a value. Quality Parameters are actually measured by a Measurement, which
represents the value assigned to a Quality Parameter with respect to a selected Measure.
Note that the Resource under examination in a Quality Parameter can be either a singleton
Resource, as for example in the case of the Integrity of an Information Object, or a Resource
Set, as for example in the case of the Orthogonality of a set of Functions.
Finally, a Quality Parameter may be affected by other Resources, such as other Quality
Parameters, Policies, or Functions; this allows us to create a “chain” of Resources which
leads to the determination of the Quality Parameter at hand. For example, the Availability is
affected by Robustness and Fault Management: indeed, when Function is both robust and
able to recover from error conditions, it is probable that also its Availability is increased. As a
further example, Economic Convenience may be affected by Charging Policy, since the latter
is responsible for the definition of the charging strategies.
Note that, being a Resource, a Quality Parameter may have Metadata and Annotations linked
to it; the former can provide useful information about the provenance of a Quality Parameter,
while the latter can offer the possibility for adding comments about a Quality Parameter,
interpreting the obtained values, and proposing actions to improve it.
Please note that the groupings of Quality Parameters in broad categories, such as Content
Quality Parameter, are made from the perspective of the Resources under assessment, in the
case of the example mainly Information Objects. This means that User Content Parameter
does not concern issues such as User Satisfaction or Usability, where the Actor is the subject
who makes the assessment, but in this group the Actor is the object of the assessment from
different points of view, such as the User Behaviour. Nevertheless, the active role of an Actor
in expressing an assessment is always preserved in the Quality Parameter by the fact the
Actor <expressAssessment> about a Resource in each Quality Parameter.
The definition of Quality Parameter complies with the notion of quality dimension used in
[17].
Examples: In order to make clear the relationship between Quality Parameter, Measure, and
Measurement, we can take an example from the information retrieval field. One of the main
Quality Parameters about an information retrieval system is the effectiveness, meant as its
capability of answering to the user information needs with relevant items. This quality
parameter can be evaluated according to many different Measures, such as precision and
recall [171]; precision evaluates effectiveness meant as the ability of the system to reject
useless items, while recall evaluates effectiveness meant as the ability of the system to
retrieve useful items. The actual values for the precision and recall are Measurements and are
usually computed by using standard tools, such as trec_eval18, which are Actors, in this case
not human.
C153 Generic Quality Parameter
Definition: A Quality Parameter that concerns an aspect of a “system” as a whole, being it a
Digital Library, a Digital Library System, or a Digital Library Management System.
Relationships:
• Generic Quality Parameter <isa> Quality Parameter
• Reputation <isa> Generic Quality Parameter
18
http://trec.nist.gov/trec_eval/
Compliance To Standards may affect Interoperability Support since their use makes easier to
interact with other systems.
The cost estimation of interoperability may be a component of the Economic Convenience
measure.
Examples: Interoperability Support problems can cause delays or impossibility to fulfil user
requests; thus they are also related to user satisfaction.
C156 Reputation
Definition: The Generic Quality Parameter which reflects the trustworthiness of a Digital
Library.
Relationships:
• Reputation <isa> Generic Quality Parameter
Rationale: The Reputation concerns the “good name” of a Digital Library, the credit it owns
by the user community, and its ability of being a point of reference.
Other Quality Parameters may greatly affect the Reputation and we may consider it as a sort
of overall indicator of the appreciation of a Digital Library.
Examples: Examples of aspects which influence the Reputation of a Digital Library are
whether a Digital Library provides Resources that can be regarded as true, real, impartial,
credible, and conveying the right information.
Examples of Quality Parameters which influence Reputation are: Economic Convenience,
Usability, Dependability, and so on.
C157 Security Enforcement
Definition: The Generic Quality Parameter which reflects the level and kind of security
features offered by a Digital Library.
Relationships:
• Security Enforcement <isa> Generic Quality Parameter
• Security Enforcement <affectedBy> Digital Rights Management Policy
• Security Enforcement <affectedBy> Access Information
• Security Enforcement <affectedBy> Configure DL
• Security Enforcement <affectedBy> Enabling Function
• Security Enforcement <affectedBy> User Behaviour
Rationale: This parameter reflects the capability of the Digital Library to support the
management of different levels of security as expected by users, content depositors, rights
owners, and librarians themselves.
Security Enforcement may be affected by both Policies and Functions. In particular the
Digital Rights Management Policy impacts the level of Security Enforcement of a Digital
Library, since it defines how the content has to be controlled. The Access Information
functions and their implementation influence Security Enforcement, since they provide Actors
with mechanisms for consuming Information Objects; the Configure DL functions impact
Security, since the possibility of correct and careful configuration of the Digital Library is a
pre-requisite for security; the Enabling Functions, such as Authentication, Authorization, and
Encryption, contribute to enforce the security of a Digital Library. Finally, also the User
Behaviour can affect the Security Enforcement, since an Actor my compromise security for
example by a careless use of its username and password.
Rationale: This Generic Quality Parameter provides an overall assessment about how well a
Resource performs from different points of view, e.g. efficiency, effectiveness, efficacy, and
so on
Examples: The response time upon invocation of a Function is an example of a generic
Performance, indicator; the presence of delays and/or jitter is an example of Performance
indicators more tailored to the multimedia and streaming contexts; precision and recall are
widely used Performance indicators in the information retrieval field.
C161 Scalability
Definition: The Generic Quality Parameter measuring the capability to increase Capacity as
much as needed.
Relationships:
• Scalability <isa> Generic Quality Parameter
Rationale: --
Examples: --
C162 Content Quality Parameter
Definition: A Quality Parameter that concerns an aspect of the Content main concept.
Relationships:
• Content Quality Parameter <isa> Quality Parameter
• Authenticity <isa> Content Quality Parameter
• Integrity <isa> Content Quality Parameter
• Provenance <isa> Content Quality Parameter
• Freshness <isa> Content Quality Parameter
• Preservation Performance <isa> Content Quality Parameter
• Size <isa> Content Quality Parameter
• Scope <isa> Content Quality Parameter
• Authoritativeness <isa> Content Quality Parameter
• Fidelity <isa> Content Quality Parameter
• Perceivability <isa> Content Quality Parameter
• Viability <isa> Content Quality Parameter
• Metadata Evaluation <isa> Content Quality Parameter
Rationale: This is a family of Quality Parameters, reflecting the variety of facets that
characterize the quality of the Content, in particular Information Objects, in a Digital Library.
Examples: Content quality is a moving target in a sense, but the requirements to the level of
quality of various materials in the digital library and its scope have to be presented in the
Collection Development Policy.
C163 Authenticity
Definition: The Content Quality Parameter reflecting whether an Information Object retains
the property of being what it purports to be.
Relationships:
• Authenticity <isa> Content Quality Parameter
Rationale: The definition takes into account the results and experience of the InterPARES I
project19 [66][67].
Examples: The methods for data protection are key to assure authenticity of resources.
Document sealing engines which timestamp and sign digitally every item in the digital library
are an example of a solution which creates the proof that the documents they have not been
modified from the original.
C164 Authoritativeness
Definition: The Content Quality Parameter measuring the reliability of Resource origin
based on the trustfulness to the creator of the Resource or the frequency of accessing/referring
to the resource by Actors.
Relationships:
• Authoritativeness <isa> Content Quality Parameter
Rationale: The authoritativeness parameter measures the quality of being widely accepted or
frequently accessed. It is helpful to compare digital libraries with similar or identical scope.
NISO Z39.7 Library Statistics and ISO 11620 Library Performance Indicators suggest
measures of usage especially for the libraries.
Examples: The authoritativeness could be measured by estimating the number of visitors
(general number, or different users). Another possibility is to gather transaction information
(number of downloads and print-outs).
C165 Freshness
Definition: The Content Quality Parameter measuring the Information Object quality of
being current and promptly updated.
Relationships
• Freshness <isa> Content Quality Parameter
Rationale: This parameter evaluates whether an Information Object and the information it
carries are fresh and updated with respect to the task at hand.
Examples: For example, a stream of data coming from a sensor which monitors the
temperature and the pressure of a patient should be updated a regular intervals in order to
provide meaningful information for a physician.
Another relevant example is a Digital Library keeping weather forecast information, where it
is important to know if this information is updated and reflects the current weather conditions.
C166 Integrity
Definition: The Content Quality Parameter measuring the Information Object quality of
being complete and integer.
Relationships:
• Integrity <isa> Content Quality Parameter
Rationale: This parameter encompasses both the extent to which an Information Object is of
sufficient breadth, depth, and scope for the task at hand, as pointed out in [17].
Examples: From the point of view of data protection, integrity should guarantee that there are
no losses in the stored resources. This is an important parameter connected with the
preservation of the content.
19
http://www.interpares.org/
Rationale: The scope parameter helps to understand the coverage of a Digital Library both in
the sense of Content, and in the sense of Functionality. While the Size provides quantitative
insight, Scope is more qualitative-oriented.
Examples: A Digital Library could contain the complete collection of works of a certain
author, time period, genre. This is a content-related example.
C170 Size
Definition: The Content Quality Parameter measuring the magnitude of Resource, Collection
or a Digital Library as a whole.
Relationships:
• Size <isa> Content Quality Parameter
• Size <isa> Quantitative Measure
Rationale: Sizes can be provided according to different measures: for example, numbers of
items, pages, bytes, articles, words, images, multimedia files. The evaluation of the size of a
digital library is helping the user to get an idea about the resources. Size is also important
parameter for the architecture and functionality of the DL.
Examples: The physical size of a collection calculated in bytes is important for estimation of
the migration effort.
C171 Fidelity
Definition: The Content Quality Parameter measuring the accuracy with which an electronic
system reproduces given a Resource.
Relationships:
• Fidelity <isa> Content Quality Parameter
Rationale: The Fidelity parameter is used to evaluate to what degree a particular
representation of a given Resource is different from its very first original representation.
Examples: The rendition of a text document may be identical to its original appearance in the
word processing software used at the time of creating the document, but may also
significantly differ from its original appearance especially in layout – this difference is
expressed through Fidelity.
C172 Perceivability
Definition: The Content Quality Parameter measuring what effort a Actor needs to invest in
order to understand and absorb a Resource.
Relationships:
• Perceivability <isa> Content Quality Parameter
Rationale: The Perceivablity parameter is used to evaluate how easy a Actor would
understand and retain the information/knowledge within a Resource from the Content domain.
This quality parameter is essential for evaluating which resources are most likely to be well
understood within a specific target group of users.
Examples: When numerous resources in the digital library represent the same topic,
perceivability may help to choose those of them which are most likely to be quickly
understood. Quite often images might be found as having higher perceivability than texts.
Perceivability may also be used for answering the needs of special groups of users, for
example providing audio content to visually impaired users.
C173 Viability
Definition: The Content Quality Parameter measuring whether the Resource’s bit stream is
intact and readable with the existing technology.
Relationships
• Viability <isa> Content Quality Parameter
Rationale: Viability is essential for preservation activities within a digital library. It would
estimate whether a digital object could be read and manipulated with the existing hardware
and software.
Examples: The minimum time specified by the supplier for the media’s viability under
prevailing environmental conditions.
C174 Metadata Evaluation
Definition: The Content Quality Parameter measuring characteristics of Metadata.
Relationships
• Metadata Evaluation <isa> Content Quality Parameter
Rationale: Metadata Evaluation is essential for various processes in the digital library, and
most specifically in tasks related to access, preservation and operability. According to a
functionality-oriented definition of Guy, Powell, and Day, “high quality metadata supports the
functional requirements of the system it is designed to support”. Metadata evaluation could be
as simple as checking whether metadata (or specific metadata elements) are available; or it
could be a more sophisticated evaluation of incomplete, inaccurate, or inconsistent metadata
elements. In the most detailed case metadata evaluation would be a compound parameter
consisting of several others – for example Completeness, Accuracy, Provenance,
Conformance to Expectations, Timeliness, User Satisfaction, Perceivability. This combination
would depend on the purpose of the metadata evaluation.
Examples: Completeness in the context of Metadata evaluation could be used to measure
whether a minimal required set of elements is available in the metadata records.
C175 Functionality Quality Parameter
Definition: A Quality Parameter that concerns an aspect of the Functionality main concept.
Relationships:
• Functionality Quality Parameter <isa> Quality Parameter
• Usability <isa> Functionality Quality Parameter
• User Satisfaction <isa> Functionality Quality Parameter
• Availability <isa> Functionality Quality Parameter
• Dependability <isa> Functionality Quality Parameter
• Robustness <isa> Functionality Quality Parameter
• Fault Management Performance <isa> Functionality Quality Parameter
• Capacity <isa> Functionality Quality Parameter
• Orthogonality <isa> Functionality Quality Parameter
• Awareness of Service <isa> Functionality Quality Parameter
• Expectations of Service <isa> Functionality Quality Parameter
• Impact of Service <isa> Functionality Quality Parameter
Rationale: This is a family of Quality Parameters, reflecting the variety of facets that
characterize the quality of the Functionality, in particular Functions, of a Digital Library.
Examples: --
C176 Availability
Definition: The Functionality Quality Parameter which indicates the ratio of the time a
Functions is ready for use to the total lifetime of the system.
Relationships:
• Availability <isa> Functionality Quality Parameter
• Availability <affectedBy> Robustness
• Availability <affectedBy> Fault Management
• Availability <affectedBy> Capacity
Rationale: Availability is a fundamental parameter for assessing the quality of a Function,
since Actors may be very disappointed when they try to use a Function and this is not
available.
Availability may be affected by other parameters, such as Robustness and Fault Management:
the former guarantees that a Function will continue to work and be available even in the case
of bad input; the latter guarantees that a Function will be able to recover from an error
condition and thus continue to be available. Finally, also Capacity may affect Availability,
since in case of starvation of resources a Function may stop to be available.
Availability typically parallels with Dependability.
Examples: In the telephone services, high levels of availability are requested, the well-known
“five-nines”, the 99,999% of up-time of the system, since nobody expects to pick up the
received and to not hear the signal.
C177 Awareness of Service
Definition: The Functionality Quality Parameter which measures how well the perspectives
Actor of a Digital Library are aware of its existence and Functions.
Relationships:
• Awareness of Service <isa> Functionality Quality Parameter
Rationale: To measure the Awareness of Service, most often surveys are used. To increase
the Awareness of Service, awareness system could be established as a DL functionality
component.
Examples: Awareness of Service for target user groups is important component of the current
information literacy.
C178 Capacity
Definition: The Functionality Quality Parameter measuring the limit on the number of
requests that a Function can serve in a given interval of time.
Relationships:
• Capacity <isa> Functionality Quality Parameter
Rationale: Capacity determines how many concurrent requests can be successfully served.
It may affect Availability, Dependability and Performance. Indeed, when a Function operates
beyond its Capacity, Availability MAY be compromised since the Function may stop
working, as in the case of denial of service attacks; similarly, Dependability and Performance
may be negatively affected if the Function does not complete its works or takes too much to
complete.
Examples: --
Orthogonality may affect Usability and may also affect User Satisfaction, when the usage of
the DL might get too complicated.
Examples: The idea of orthogonality is that the same function is invoked by the same
commands, from the same menu entries.
C183 Dependability
Definition: The Functionality Quality Parameter measuring the ability of a DL to perform a
Function under stated conditions for a specified period of time.
Relationships:
• Dependability <isa> Functionality Quality Parameter
• Dependability <affectedBy> Capability
Rationale: Dependability reflects whether a given Function works correctly without
producing errors.
Capacity may affect Dependability, since in case of starvation of resources a Function may
not work properly.
Examples: When a Actor types the URL address of a portal which gives access to a Digital
Library, he/she expects that the address is correctly resolved and to be redirected to the
correct site and not to a wrong one.
C184 Robustness
Definition: The Functionality Quality Parameter measuring the resilience to ill-formed input
or incorrect invocation sequences of a Function.
Relationships:
• Robustness <isa> Functionality Quality Parameter
Rationale: Robustness is a key parameter which may affect other Quality Parameters, such as
Security Enforcement or Availability. Indeed, many kinds of attack which compromise the
functioning of a service or gain un-authorized access to services, are based on ill-formed
input, such as buffer overflows.
Examples: --
C185 Usability
Definition: The Functionality Quality Parameter which indicates the ease of use of a given
Function.
Relationships:
• Usability <isa> Functionality Quality Parameter
• Usability <affectedBy> Orthogonality
Rationale: The Usability reports how much a given Function makes it easy for a Actor to
achieve its goals.
It can be evaluated by using different Measures: for examples, the Actor can indicate on a
subjective scale the degree of Usability of a Function, or the time need to complete a task can
be measured.
Examples: --
C186 User Satisfaction
Definition: The Functionality Quality Parameter which indicates how much an Actor is
satisfied by a given Function.
Relationships:
• User Satisfaction <isa> Functionality Quality Parameter
• User Satisfaction <affectedBy> Usability
• User Satisfaction <affectedBy> Documentation Coverage
• User Satisfaction <affectedBy> Performance
• User Satisfaction <affectedBy> Availability
• User Satisfaction <affectedBy> Dependability
• User Satisfaction <affectedBy> Orthogonality
Rationale: The User Satisfaction parameter reflect how much an Actor is satisfied by the
capabilities offered by a given Function. Many factors can influence the User Satisfaction,
such as the Usability, the Documentation Coverage, the Performance, the Availability, the
Dependability and so on.
Examples: --
C187 User Quality Parameter
Definition: A Quality Parameter that concerns an aspect of the User Domain main concept.
Relationships:
• User Quality Parameter <isa> Quality Parameter
• User Behaviour <isa> User Quality Parameter
• User Activeness <isa> User Quality Parameter
Rationale: This is a family of Quality Parameters, reflecting the variety of facets that
characterize the quality of the User Domain, in particular Actors, of a Digital Library.
Examples: --
C188 User Activeness
Definition: A User Quality Parameter that reflects how much an Actor is active and interacts
with a Digital Library.
Relationships:
• User Activeness <isa> User Quality Parameter
Rationale: This parameter concerns whether and how much an Actor is active with respect to
the Content and Functionality offered by a Digital Library.
Examples: Factors that influence this parameter are, for example, whether an Actor
frequently contributes his own Content to the Digital Library or whether an Actor often
participates in discussions with other Actors, perhaps by using Annotations.
C189 User Behaviour
Definition: A User Quality Parameter that reflects how an Actor behaves and interacts with a
Digital Library.
Relationships:
• User Behaviour <isa> User Quality Parameter
Rationale: This parameter concerns whether and how much an Actor abides by the Policies
and regulations of a Digital Library.
Examples: Factors that influence this parameter are, for example, if an Actor respects the
copyright on the Resources of a Digital Library or if he makes un-authorized copies of such
material.
Rationale: The presence of accurate logs is crucial for understanding, analysing, debugging,
and improving the functioning of a Digital Library System.
Furthermore, log analysis can be an effective way of understanding the Actor behaviour and
personalize the Digital Library System accordingly; therefore logs can be a useful input for
the Personalize functions and for creating Actor Profiles.
Examples: There are various standards for creating logs. For example, in the case of Web,
there is W3C Extended Log Format [99].
C199 Maintenance Performance
Definition: The Architecture Quality Parameter concerning the design and implementation of
software and hardware maintenance plans for Architectural Components.
Relationships:
• Maintenance Performance <isa> Architecture Quality Parameter
• Maintenance Performance <affectedBy> Change Management Policy
Rationale: Maintenance Performance concerns the design of plans for keeping Architectural
Components updated with research and technological advances.
Change Management Policy may affect Maintenance Performance, since it regulates the
change process in a Digital Library.
It may influence Sustainability, since it involves keeping the current system properly
functioning and evolving it for facing future technological developments.
Examples: A maintenance plan may concern programmed hardware updates, controlled
migration towards new software and hardware environments, and so on.
C200 Redundancy
Definition: The Architecture Quality Parameter measuring the degree of (partial) duplication
of System Architecture Components to decrease the probability of a system failure.
Relationships
• Redundancy <isa> Architecture Quality Parameter
Rationale: A redundant architecture helps in improving the overall performances of a system
and may improve the Availability, Dependability and Robustness of a Digital Library System.
Examples: --
C201 Architecture Domain
Definition: One of the six main concepts characterising the digital library universe. It
represents the various aspects related to the software systems concretely realising the digital
library universe.
Relationship
• Digital Library <definedBy> Architecture Domain
• Digital Library System <definedBy> Architecture Domain
• Digital Library Management System <definedBy> Architecture Domain
• Architecture Domain <consistOf> Architectural Component
Rationale: The Architecture Domain captures concepts and relationships characterising the
two software systems playing an active role in the DL universe, i.e. DLSs and DLMSs.
Unfortunately, the importance of this foundational concept has been largely underestimated in
the past. The importance of its domain and its modeling is described in Section II.2.7.
Examples: --
C206 Interface
Definition: A Software Architecture Component representing a set of methods and parameters
implemented by an Architectural Component. The client of such an Architectural Component
may rely on them while interacting with it.
Relationship
• Interface <isa> Software Architecture Component
• Architectural Component <has> Interface
• Framework Specification <prescribe> Interface
• Component Profile <profile> Interface
Rationale: The Interface encapsulates knowledge about the component, i.e. the rest of the
system can use the component according to the patterns enabled by the interface(s).
Examples:
• OAI-PMH [135] prescribed the Interface an Architectural Component acting as OAI
compliant data provider [134] must implement in order to serve an Architectural
Component willing to act as OAI application providers
C207 Framework Specification
Definition: The Software Architecture Component prescribing (<prescribe>) the set of
Interfaces and protocols an Architectural Component should conform to (<conformTo>) in
order to interact with the other Architectural Components of the same system by design.
Relationship
• Framework Specification <isa> Software Architecture Component
• Architectural Component <conformTo> Framework Specification
• Framework Specification <prescribe> Interface
Rationale: The notion of Framework Specification is needed to capture to operational context
an Architectural Component has been designed to operate in.
Examples:
• Enterprise JavaBeans
• Component Object Model
C208 System Architecture Component
Definition An Architectural Component contributing to implement the System Architecture of
a system.
Relationship
• System Architecture Component <isa> Architectural Component
• System Architecture Component <isa> Resource (inherited from Architectural
Component)
• System Architecture Component <yield> Function (inherited from Architectural
Component)
• System Architecture Component <hasQuality> Quality Parameter (inherited from
Resource)
• System Architecture Component <regulatedBy> Policy (inherited from Resource)
• System Architecture Component <hasProfile> Component Profile (inherited from
Architectural Component)
Examples:
• The server equipped with the bunch of software needed to host and run the Software
Component implementing the user interface of the DELOS Digital Library
(http://www.delos.info)
C211 Software Architecture
Definition: The set of Software Architecture Components organised to form a system.
Relationship
• Software Architecture <consistOf> Software Architecture Component
Rationale: Each software system is characterised by a set of software pieces organised in a
structure making them capable to work together. This organised set of software is the
Software Architecture. To help software engineers in designing their systems a set of well-
proven generic scheme for solution of recurring design problems have been identified, i.e.
software architecture patterns [39]. Patterns capture existing, well-proven experience in
software development and help to promote good design practise. The Reference Architecture
envisaged in Section I.5 and constituting an important part of the Digital Library development
framework is a pattern for Digital Library Systems. Similarly to patterns, it is important to
recall that many Reference Architectures can be designed, each dealing with a specific and
recurring problem in designing or implementing DLSs. Moreover, different Reference
Architectures can be used to construct DLSs with specific properties.
Examples:
• Client-Server architecture
• Service-oriented Architecture
C212 System Architecture
Definition: The set of System Architecture Components organised to form a system.
Relationship
• System Architecture <consistOf> System Architecture Component
Rationale: Each software system is characterised by the set of its constituents. This
Reference Model classifies the constituents of a software system along two dimensions, the
one of the Software Architecture and the one of the System Architecture. The System
Architecture, being an architecture is an organised set of constituents. In this case constituents
are System Architecture Components, namely Running Instances and Hosting Nodes. Because
of (i) the strong relations between Running Instances and Software Components, i.e., a
Running Component is the result of the deployment of a Software Component, and (ii) the fact
that Software Components are the main constituents of the Software Architecture of the
system, there is a strong relation between Software Architecture and System Architecture. A
System Architecture is one of the possible instances that are obtainable according to the
Software Architecture of the system in use. It is well known that by exploiting a software
system developed according to a monolithic application pattern it is not possible to realise a
system having a distributed System Architecture. The more flexible is the Software
Architecture a system adopt, the larger will be the potential range of application scenarios it
can be successfully exploited.
Examples:
• The set of servers and services realising the DELOS Digital Library
(http://www.delos.info)
C213 Purpose
Definition: The motivation characterizing the <associatedWith> relationship.
Relationships:
• Resource <associatedWith> Resource to a certain Purpose
Rationale: The <associatedWith> relation is one of the powerful ones allowing to build
compound Resources, i.e., Rersources obtained by combining existing constituent Resources
as to form a new knowledge bundle having a value added with respect to the single Resources
when considered as single island of information. Various kind of associations are possible,
and this diversity is captured by the Purpose concept attached to each instance of the
<associatedWith> relation.
Examples:
An Information Object representing an experiment (itself composed of various Information
Objects representing, e.g., the dataset the experiment is carried on, the dataset representing the
outcomes, the description of the procedure adopted) is <associatedWith> the Information
Object representing the scientific publication in an outstanding Journal of the filed with the
<Purpose> of scholarly dissemination.
C214 Region
Definition: A contiguous portion of a given Resource with the desired degree of granularity
identified in order to anchor a given Annotation to it.
Relationships:
• Resource <hasAnnotation> Annotation about a Region
Rationale: The idea of “contiguous portion” of a Resource resembles and complies with the
concept of segment introduced in Navarro et Al. [156]. The granularity of such a kind of
“identifier” can vary according to the meaningful ways of locating a part of a Resource, which
depend on the actual specialisation of the Resource we are dealing with. As a consequence,
we can have Regions which anchor an Annotation to the whole Resource, as well as, Regions
able to anchor an Annotation to a specific part of a Resource.
Examples: A piece of text, e.g. a paragraph, of a Information Object representing this volume
is a Region an Annotation can be attached to.
C215 Digital Library
Definition: An organization, which might be virtual, that comprehensively collects, manages,
and preserves for the long term rich Information Objects, and offers to its Actors specialized
Functions on those Information Objects, of measurable quality, expressed by Quality
Parameters, and according to codified Policies.
Relationship
• Digital Library <manage> Resource
• Digital Library <manage> Information Object
• Digital Library <serve> Actor
• Digital Library <offer> Function
• Digital Library <agreeWith> Policy
• Digital Library <tender> Quality Parameter
• Digital Library System <support> Digital Library
• Digital Library is <definedBy> Resource Domain
• The set of servers, services, and software realising The European Library
(http://www.theeuropeanlibrary.org)
• The set of servers, services, and software realising the National Science Digital Library
(http://nsdl.org)
C217 Digital Library Management System
Definition: A generic software system that provides the appropriate software infrastructure
both (i) to produce and administer a Digital Library System incorporating the suite of
Functions considered foundational for Digital Libraries, and (ii) to integrate additional
software components offering more refined, specialized, or advanced functionality.
Relationship:
• Digital Library Management System <deploy> Digital Library System
• Digital Library Management System <extend> Digital Library System
• Digital Library Management System is <definedBy> Resource Domain
• Digital Library Management System is <definedBy> Content Domain
• Digital Library Management System is <definedBy> User Domain
• Digital Library Management System is <definedBy> Functionality Domain
• Digital Library Management System is <definedBy> Policy Domain
• Digital Library Management System is <definedBy> Quality Domain
• Digital Library Management System is <definedBy> Architecture Domain
Rationale: This Reference Model introduces three notions of systems (cf. Sec. I.2) active in
the universe, i.e. the Digital Library, the Digital Library System, and the Digital Library
Management System. The Digital Library Management System (DLMS) is the system that
provides DL Designers, DL System Administrators, and DL Application Developers with
Functions supporting their tasks (cf. Sec. I.4). Depending on the set of Functions DLMS
provide Actors with, different types of such systems can be implemented (cf. Sec. I.2).
Examples:
• OpenDLib (http://www.opendlib.com): the DLMS used to create and maintain the DELOS
Digital Library (http://www.delos.info).
• DILIGENT [62]: a prototypical DLMS capable to deploy Digital Library Systems by
relying on a set of Resources ranging from Software Components to Hosting Nodes
dynamically gathered through Grid technologies.
• The DelosDLMS [175]: a DLMS built by integrating software and services developed by
DELOS partners.
20
“Classifiers”, i.e. items added to the hierarchy for organisational purposes are marked [in
squared brackets].
. . R36 retrieve
. . . R37 return
. . R38 produce
. . R39 issue
. [ Policy Relations ]
. . R6 regulatedBy (isa Resource Relation)
. . R40 govern
. . . R41 prescribe
. . R42 antonymOf
. . R43 influence
. [ Quality Relations ]
. . R44 expressAssessment
. . R45 evaluatedBy
. . R46 measuredBy
. . R47 affectedBy
. . R48 accordTo
. [ Architecture Relations ]
. . R49 implement
. . . R50 realisedBy
. . . R51 support
. . . R52 hostedBy
. . R14 composedBy (isa hasPart)
. . R16 use (isa associatedWith)
. . R17 conflictWith (isa associatedWith)
. . R10 hasProfile (isa hasMetadata)
. . R4 conformTo (isa hasFormat)
. . . R41 prescribe (isa govern)
. . R25 profile (isa belongTo)
R9 modelledBy
Definition: The relation connecting Actors to Actor Profiles representing them.
Rationale: This is a specialisation of the <hasMetadata>. An Actor may have many Actor
Profiles. An Actor Profile must be associated to one Actor.
Examples:
R10 hasProfile
Definition: The relation connecting Architectural Components to Component Profiles
representing them.
Rationale: This is a specialisation of the <hasMetadata>. An Architectural Component can
be associated with Component Profiles. A Component Profile must be associated with one
Architectural Component.
Examples:
R11 hasAnnotation
Definition: The relation connecting Resources to Information Objects to add an interpretative
value to a certain Region.
Rationale: This relation is analogous to <hasMetadata>. Annotations are sometimes
modelled as concepts, however they are more clearly seen as roles that information objects
play to resources in specific contexts. Hence annotation (cf. Sec. III.3 C14) is defined as the
range of the <hasAnnotation> relation. Happily, this choice settles the long-standing issue
whether annotations are to be considered as objects or as metadata.
Examples: --
R12 expressedBy
Definition: The relation connecting Resources to Information Objects materialising them.
Rationale: This relation has been introduced to capture the materialisation of otherwise
abstract Resources. It is mainly intended for the materialisations of Resources like Policy and
Quality Parameter but can be applied to any type of Resource.
A Resource can be associated to many Information Objects materialising it in different ways.
A materialising Information Object must be associated with one Resource.
Examples: --
R13 hasPart
Definition: The relation connecting Resources to their constituent Resources.
Rationale: The relation where a Resource “child” is a subset or part of the “parent” Resource.
This “part of” association may have two different natures: the aggregative and the
compositional one. In the aggregative nature the single parts stand by their selves and may be
constituents of any number of Resources. In the case of compositional nature the whole
strongly owns its parts, i.e. if the whole resource is copied or deleted, its parts are copied or
deleted with it.
Examples:
• A book has part the preface, the chapters, the bibliography
R14 composedBy
Definition: The relation connecting Architectural Components to constituent Architectural
Components.
Rationale: This is the specialisation of the <hasPart> relation in the case of Architectural
Components. Also in this case the relation can implement the aggregative and the
compositional nature of the “part of”.
An Architectural Component can be composed by many Architectural Components. The same
Architectural Component can be a component of many Architectural Components.
Examples: --
R15 associatedWith
Definition: The relation connecting a Resource to the Resources which are linked to the
former according to a certain Purpose.
Rationale: In addition to the explicitly identified pool of relations connecting Resources, this
relation allows to specify cross-resource links with respect to a well-known Purpose.
No constraints about the cardinality of this relation are established, i.e. a Resource may be
connected to zero or more Resources through the <associatedWith> with a certain Purpose; a
Resource may or may appear as the second term of a <associatedWith> relationship with a
certain Purpose.
Examples: --
R16 use
Definition: The relation connecting Architectural Components to Architectural Components
they use.
Rationale: Architectural Components are the constituents of the architectures of all the digital
library system. Thus it is supposed that the system follows the component-oriented approach.
The <use> relations capture the
Examples: --
R17 conflictWith
Definition: The relation connecting Architectural Components to Resources they realises.
Rationale: --
Examples: --
R18 invokes
Definition: The relation connecting Running Components to Running Components they use to
accomplish their task.
Rationale: --
Examples: --
R19 belongTo
Definition: The relation connecting Resources to the Resource Sets in whose extension they
belong. A specialisation of it is the relation connecting an Actor to a Group which defines
which user group an actor belongs to.
Rationale: A Resource may be a member of any number of Resource Set extensions and, vice
versa, the extension of a Resource Set may include any (finite) number of resources.
Examples:
• An Information Object belongs to a Collection
• An Actor belongs to a Group
R20 hasEdition
Definition: The relation connecting Information Objects to the Information Objects that
realise them along the time dimension.
Rationale: In “classic” Digital Library models, Editions represent the different states of an
Information Object during its lifetime, i.e. they play the role usually assigned to versions.
Versioning usually makes up a tree, because an object may be the version of at most one other
object. However, in the digital library world a more liberal approach may be appropriate
allowing an Information Object to be the edition of possibly many different Information
Objects. The resulting structure will be a directed graph, which must be acyclic to avoid
unintuitive situations.
Examples: An Information Object representing a study:
• <hasEdition> another Information Object representing the draft version of such a study;
• <hasEdition. another Information Object representing the “submitted version” of such a
study;
• >hasEdition> another Information Object representing the “version published in the
conference proceedings” with colour images;
R21 hasView
Definition: The relation connecting Information Objects to the Information Objects which are
Views of them.
Rationale: The concept of View captured by this relation fits very well with those used in the
database world. In this context, a view is a virtual or logical table expressed as a query
providing a new organisational unit to support some application. Similarly, Information
Object Views are introduced to provide multiple presentations of the information
represented/captured by the Information Object that may result useful in specific application
contexts.
The same Information Object may have different Information Objects linked through the
<hasView> relation. Vice versa, an Information Object may or may not be a view, that is the
second term of a <hasView> relationship.
Examples:
• An Information Object representing a data stream of an environmental sensor
o <hasView> the Information Object consisting of the raw data, i.e. a series of numerical
values measured by the sensor
o <hasView> the Information Object consisting of a picture reporting the graph of the
evolution of the values measured by the sensor along the time.
• An Information Object representing the outcomes of a workshop
o <hasView> the Information Object representing the “full view” and containing a
preface prepared by the conference chair and the whole set of papers accepted and
organised thematically;
o <hasView> the Information Object representing the “handbook view” and containing
the conference program and the slides of each lecturer accompanied with the abstract
of the papers organised per session, and
o <hasView> the Information Object representing the “informative view” and reporting
the goal of the workshop and the title list of the accepted papers together with the
associated abstract.
R22 hasManifestation
Definition: The relation associating Information Objects to the Information Objects
representing their physical embodiment.
Rationale:
While Edition and View concepts deal with the intellectual and logical organisation of
Information Objects, the Manifestation concept captured by the <hasManifestation> relation
deals with the physical presentation of objects.
Examples:
• The Information Object representing a conference paper <hasManifestation> the PDF file
or the Microsoft Word file embodying it.
• A lecture Information Object <hasManifestation> the MPEG file containing the video
recording of the event.
• A sensor Information Object <hasManifestation> the file containing the raw data captured
by the sensor.
R23 hasIntension
Definition: The relation connecting Collection to the Query describing the criterion underlying the
Collection.
Rationale: : In logic, the intension of an expression is its sense, as distinguished by the reference (or
denotation) of the expression, called the extension of the expression. This distinction was firstly made
by G. Frege, for whom the sense of an expression corresponded to what we intuitively think as the
meaning of the expression. R. Carnap later suggested that the sense of an expression is a function
which gives, for each state of affairs, the extension of the expression. S. Kripke, upon defining a
semantics for modal logic, finally established the notion of possible world as state of affairs [63].
Davidson argued that giving the meaning of a sentence is equivalent to stating its truth conditions.
The intention of a collection can thus be understood as a statement of what must be true of an object
for it to be a member of the collection.
Examples: --
R24 hasExtension
Definition: The relation connecting Collections to the Resource Sets representing the Information
Objects belonging to them.
Rationale: In logic, the extension of an expression is its denotation. For a proposition, this is the truth
value in the considered interpretation; the extension of a predicate is the set of objects that are denoted
by the predicate in the considered interpretation.
Examples: --
R25 profile
Definition: The relation connecting Component Profiles to the Resources they describe. Component
Profiles should describe (at least) Functions, Policies, Quality Parameters, and Interfaces inherent to
Architectural Components they are associated with via the <hasProfile> relation.
Rationale: Component Profile is the Metadata associated with each Architectural Component for its
management. The <profile> relation captures the aspects expected to be captured by this kind of
profile.
Examples:
• The Functions yielded by the Architectural Component is a typical information expected to be
included in the Component Profile.
• The Quality Parameters guaranteed by the Architectural Component is a typical information
expected to be included in the Component Profile.
R26 perform
Definition: The relation connecting Actors to Functions they use to do accomplish their
Digital Library activities.
Rationale: Functions has no meaning by themselves if no Actor is executing them. This
relation is fundamental to a DL, as it expresses the interaction of the DL with the Actors
through specific Functions in order to accomplish their goals.
Examples:
R27 play
Definition: The relation connecting an Actor to a Role which defines the Role(s) of the Actor.
Rationale: DL Actors may play different Roles in the DL, for example be at the same time
Content Creator and Content Consumer.
Examples: --
R28 interactWith
Definition: The relation connecting Functions to Functions that expresses the interaction
between them.
Rationale: This Function is fundamental for the modelling of the workflow of execution for
the Functions. It defines an order between them in order to clarify which Function follows the
current one.
Examples:
R29 influencedBy
Definition: The relation connecting Functions to Actor Profiles which expresses that
functions are influenced by specific user characteristics.
Rationale: This relation is very important for personalization, as it expresses that
functionality is related to and influenced by the Actor Profile of the Actor executing it,
adapting thus to the Actor specific needs.
Examples:
R30 actOn
Definition: The relation connecting Functions to Resources they operate on.
Rationale: This relation expresses the connection between specific Functions and the
Resources they interact with, either to manage or access them. A Function in most cases
produces a result to be presented to the Actor, it represents an action performed on one of the
DL constituents, which are not only primary Information Objects, but also Actor profiles,
Policies, etc.
Examples:
R31 create
Definition: The relation connecting the Create Functions to Resources they create.
Rationale: This connection expresses the relation of the creation of resources to the resource
created by the function. Note that in this case the new Resource is not actually inserted in the
library, until a Submit Function has been performed.
Examples:
R32 createAnnotation
Definition: The relation connecting Annotate Functions to Information Objects they create.
Rationale: This relation expresses the relation of the creation process of an annotation with
its end result.
Examples:
R33 createVersion
Definition: The relation connecting Author Collaboratively Function to Information Objects
they create. These Information Objects are linked to the originating Information Objects via
the <hasEdition> relation.
Rationale: This relation expresses the fact that a by-product of collaborating authoring is
different versions of the authored Information Object.
Examples:
R34 createView
Definition: The relation connecting Convert to Different Format Functions to Information
Objects they create. These Information Objects are linked to the originating Information
Objects via the <hasView> relation.
Rationale: This relation records the fact that the conversion of an Information Object to a
different format creates another view of it. An example of this is the conversion of a Word
document to pdf.
Examples:
R35 createManifestation
Definition: The relation connecting Extract and Physically Convert Functions to Information
Objects they create. These Information Objects are linked to the originating Information
Objects via the <hasManifestation> relation.
Rationale: In this case the primary Information Object itself is transformed and a new
manifestation is created.
Examples:
R36 retrieve
Definition: The relation connecting Access Resource Functions to Resources they find. A
specialisation of this relation connect Find Collaborator Functions to Actors they find.
Another specialization is the relation return which connects the Function Discover and Result
Set.
Rationale: This Function connects a retrieval function to the retrieved result.
Examples:
R37 return
Definition: The relation connecting Discover Functions to Result Sets they find. It is a
specialization of the retrieve relation connecting Access Resource Functions to Resources.
Rationale: This Function connects a discover function to the result set it returns.
Examples:
R38 produce
Definition: The relation connecting Query to Result Sets they return.
Rationale: When a Query is performed as a result of a Search Function, it produces a result
set.
Examples:
R39 issue
Definition: The relation connecting Search Functions to the Queries they use to retrieve
results.
Rationale: In order for the Function to retrieve the results the Actor has requested, it has to
issue a Query to a Collection and retrieve a result set.
Examples:
R40 govern
Definition: The relation connecting Policies to the Resources they control/govern.
Rationale: Each Policy to be effectively implemented must be applied to Resources. This
relation captures those Resources each Policy is designed to influence the actions and
conduct.
Examples: Digital Rights Management Policy governs Functions, while Digital Rights
govern Information Objects.
R41 prescribe
Definition: The relation connecting Framework Specifications to the Interfaces they state as a
rule that should be carried out by Architectural Components that <conformTo> it.
Rationale: Framework Specification is the Software Architecture Component that describes
the design of the set of Software Architecture Components planned to form the Software
Architecture of a system. By establishing the Interfaces each Software Architecture
Component (actually a Software Component) is expected to implement it is possible to
guarantee by design that the set of Software Architecture Components forming a Software
Architecture will properly work collaboratively as to form a whole.
Examples:
• A Framework Specification may <prescribe> the publish/subscribe Interface each
Software Component of a system must implement in order to conform to the
publish/subscribe mechanism planned for such a system.
R42 antonymOf
Definition: The relation connecting Policy to Policy with opposite meaning.
Rationale: This relation is used when we have a set of two Policy (in general Resources) with
opposite meaning. It is introduced in order to facilitate the understanding of bi-polar sets of
concepts.
Examples: Extrinsic policy and Intrinsic policy form a pair where each concept is
<antonymOf> the other concept.
R43 influence
Definition: The relation connecting Quality Parameter to Policy they affect.
Rationale: This reference model does not present the digital library as a static entity, but also
highlights the processes within the functioning of a digital library. One important aspect is
how decisions for applying specific Policies could be taken within the DL. This relation
captures the cases in which the decision is based on Quality Parameters.
Examples: The value of Security Enforcement Quality Parameter supported by a Digital
Library System will influence the Digital Right Management Policy.
R44 expressAssessment
Definition: The relation connecting Quality Parameters to the Actors who are expressing an
assessment about a Resource.
Rationale: See Quality Parameter, Actor, and Resource.
Examples: See Quality Parameter, Actor, and Resource.
R45 evaluatedBy
Definition: The relation connecting Quality Parameters to the Measures according to which
they are evaluated.
Rationale: See Quality Parameter and Measure.
Examples: See Quality Parameter and Measure.
R46 measuredBy
Definition: The relation connecting Quality Parameters to the Measurements which assign
them a value.
Rationale: See Quality Parameter and Measurement.
Examples: See Quality Parameter and Measurement.
R47 affectedBy
Definition: The relation connecting Quality Parameters to other Resources which influence
their determination.
Rationale: See Quality Parameter and Resource.
Examples: See Quality Parameter and Resource.
R48 accordTo
Definition: The relation connecting Measurements to the Measures which define how they
have to be obtained.
Rationale: See Quality Parameter, Measure, and Measurement.
Examples: See Quality Parameter, Measure, and Measurement.
R49 implement
Definition: The relation connecting Architectural Components to the Resources they realises.
Rationale: --
Examples: --
R50 realisedBy
Definition: The relation connecting Software Components to the Running Components
realising them.
Rationale: --
Examples: --
R51 support
Definition: The relation connecting Application Frameworks to the Running Components
they support the operation.
Rationale: --
Examples: --
R52 hostedBy
Definition: The relation connecting Running Components to the Hosting Nodes physically
hosting them.
Rationale: --
Examples: --
Conclusions
This document has presented the DELOS Reference Model for Digital Libraries. It consists of
separate parts that illustrate the DELOS model from different perspectives and at different
level of abstractions. This structure has been introduced to accommodate the needs of the
many different players of the digital library universe that are interested in understanding the
digital library systems at different level of details.
The model presented is the result of a three years effort aimed at contributing to the ambitious
process of laying foundations for digital libraries as a whole. Research on “digital libraries”
addresses many different areas. The lack of any agreement on the foundations for this broad
research field has led to a plethora of systems, methodologies and results that are difficult to
combine and reuse to produce enhanced outcomes.
The model illustrated draws upon the understanding of Digital Library systems acquired by
several European research groups active in the Digital Library field for many years, both
within the DELOS Network of Excellence and outside, as well as by other groups around the
world. In such an aspect, it has to be intended as a collective effort made by the digital library
community to agree on a common ground. This is meant to be useful not only for current
activities but also as a springboard for future work.
The presented model is an abstract framework for understanding significant relationships
among the entities of the digital library universe, and for the development of consistent
standards or specifications supporting the different elements of this universe. It aims at
providing a common semantics that can be used unambiguously across and between different
application areas both to explain and organise existing digital library systems and to support
the evolution of research and developments in this area.
Because of the broad coverage of the digital library field, the heterogeneity of the involved
actors, and the lack of any previous agreement on the foundations of the field, the DELOS
Reference Model has to be considered as a living document shared by the digital library
community. For this reason, this document is made available in its subsequent versions, each
taking advantage from the previous one and from the public comments received.
The framework introduced so far does not aim at covering every specific aspects of the digital
library universe. Rather its objective is to provide a core context representing the main aspects
of these systems. Other specific aspects can easily be modelled by relying on this core part
and by introducing more detailed concepts and relationships. We expect that in the future
many more focussed, fine-grained models, developed by other communities will be
progressively integrated in the current model thus creating an increasingly richer framework
capable of capturing more and more aspects of the digital library universe.
Figure A-5. Functionality Domain Concept Map: Access Resource Functions (A4 format)
A.6. Functionality Domain Concept Map: Specializations of the Manage Resource Function
Figure A-6. Functionality Domain Concept Map: Specializations of the Manage Resource Function (A4 format)
Figure A-7. Functionality Domain Concept Map: General Manage Resource Functions (A4 format)
Figure A-8. Functionality Domain Concept Map: Manage Information Object Functions (A4 format)
Figure A-9. Functionality Domain Concept Map: Manage Actor Functions (A4 format)
Figure A-10. Functionality Domain Concept Map: Collaborate Functions (A4 format)
Figure A-11. Functionality Domain Concept Map: Manage DL Functions (A4 format)
A.12. Functionality Domain Concept Map: Manage & Configure DLS Functions
Figure A-12. Functionality Domain Concept Map: Manage & Configure DLS Functions (A4 format)
Figure A-14. Policy Domain Concept Map: Policies' Hierarchy (A4 format)
Bibliography
[1] Abiteboul, S.; Agrawal, R.; Bernestein, P.; Carey, M.; Ceri, S.; Croft, B.; DeWitt, D.;
Franklin, M.; Garcia-Molina, H.; Gawlick, D.; Gray, J.; Haas, L.; Halevy, A.;
Hellerstein, J.; Ioannidis, Y.; Kersten, M.; Pazzani, M.; Lesk, M., Maier, D.; Naughton,
J.; Schek, H.-J.; Sellis, T.; Silberschatz, A.; Stonebraker, M.; Snodgrass, R.; Ullman,
J.D.; Weikum, G.; Widom, J.; Zdonik, S. The Lowell Database Research Self-
Assessment. Communications of the ACM (CACM), 48(5):111--118, 2005.
[2] Agosti, M.; Albrechtsen, H.; Ferro, N.; Frommholz, I.; Hansen, P.; Orio, N.; Panizzi, E.;
Pejtersen, A. M.; Thiel, U. DiLAS: a Digital Library Annotation Service. In Proceedings
of International Workshop on Annotation for Collaboration – Methods, Tools, and
Practices (IWAC 2005), 91-101, 2005
[3] Agosti, M.; Ferro, N. A System Architecture as a Support to a Flexible Annotation
Service, volume 3664 of Lecture Notes In Computer Science, pages 147–166. Springer-
Verlag, Berlin; Heidelberg. Peer-to-Peer, Grid, and Service-Orientation in Digital
Library Architectures: 6th Thematic Workshop of the EU Network of Excellence
DELOS. Revised Selected Papers, 2005
[4] Agosti, M.; Ferro, N. Annotations as Context for Searching Documents. volume 3507 of
Lecture Notes In Computer Science, pages 155–170. Springer-Verlag, Berlin;
Heidelberg, Proceedings 5th International Conference on Conceptions of Library and
Information Science – Context: nature, impact and role (Colis 5), 2005.
[5] Agosti, M.; Ferro, N.; Frommholz, I.; Thiel, U. Annotations in Digital Libraries and
Collaboratories: Facets, Models and Usage, volume 3232 of Lecture Notes In
Computer Science, pages 244–255. Springer-Verlag, Berlin; Heidelberg. Research and
Advanced Technology for Digital Libraries: 8th European Conference, ECDL 2004.
Proceedings.
[6] Agosti, M.; Ferro, N.; Orio, N. Annotating Illuminated Manuscripts: an Effective Tool
for Research and Education. In Proceedings of the 5th ACM/IEEE 2005 Joint
Conference on Digital Libraries (JCDL 2005), 121-130, 2005.
[7] Alexandria Digital Library http://www.alexandria.ucsb.edu/
[8] Alves M., Viegas Damásio C., Olmedilla D., Nejdl W. A distributed tabling algorithm
for rule based policy systems. In 7th IEEE International Workshop on Policies for
Distributed Systems and Networks (POLICY 2006), London, Ontario, Canada, June
2006. IEEE Computer Society.
[9] Andrews J., Law D., Digital Libraries: Policy, Planning and Practice, Ashgate, 2004,
ISBN 0 7546 3448 5, 283 pages.
[10] Antoniou G., Baldoni M., Bonatti P., Nejdl W., Olmedilla D. Rule-based policy
specification. In Ting Yu and Sushil Jajodia, editors, Secure Data Management in
Decentralized Systems. Springer, 2007 (to appear).
[11] ARTISTE An Integrated Art Analysis and Navigation Environment
http://www.ecs.soton.ac.uk/~km/projs/artiste/
[12] Avancini, H.; Candela, L.; Straccia, U. Recommenders in a Personalized, Collaborative
Digital Library Environment. Journal of Intelligent Information System, 28(3), 253-283,
2007.
[13] Ackerman, M. S. Providing Social Interaction in the Digital Library. In Proceedings of
the First Annual Conference on the Theory and Practice of Digital Libraries, 1994
[14] Arkin, A.; Askary, S.; Bloch, B.; Curbera, F.; Goland, Y.; Kartha, N.; Liu, C.K.; Mehta,
V.; Thatte, S.; Yendluri, P.; Yiu, A.; Alves, A. Web Services Business Process
Execution Language Version 2.0. OASIS, 2005
[15] Baader, F.; Calvanese, D.; McGuinness, D.; Nardi, D.; Patel-Schneider, P. F. The
Description Logic Handbook: Theory, Implementation and Applications. Cambridge
University Press, 2002. ISBN 0521781760
[16] Banwell, L.; Ray, K.; Coulson, G.; Urquhart, C.; Lonsdale, R.; Armstrong, C.; Thomas,
R.; Spink, S.; Yeoman, A.; Fenton, R.; Rowley, J. (2004). The JISC User Behaviour
Monitoring and Evaluation Framework. Journal of Documentation 60(3), pages 302–
320.
[17] Batini, C.; Scannapieco, M. (2006). Data Quality. Springer-Verlag, Berlin, Heidelgerb,
Germany.
[18] Beall, J. Metadata and Data Quality Problems in the Digital Library. Journal of Digital
Information, Vol. 6, Issue 3, Article No. 355, 2005-06-12.
[19] Becker M., Sewell P. Cassandra: Distributed access control policies with tunable
expressiveness. In 5th IEEE International Workshop on Policies for Distributed Systems
and Networks (POLICY 2004), pp. 159-168, Yorktown Heights, NY, USA, June 2004.
IEEE Computer Society.
[20] Belkin, N.J. Understanding and Supporting Multiple Information Seeking Behaviors in
a Single Interface Framework. In Proceeding of the Eighth DELOS Workshop: User
Interfaces in Digital Libraries, pages 11–18. European Research Consortium for
Informatics and Mathematics, 1999.
[21] Belkin, N.J.; Oddy, R.N.; Brooks, H.M. ASK for Information Retrieval: part 1.
Background and Theory. Journal of Documentation 38(2), pages 61–71, 1982
[22] Besek, J. M. Copyright issues relevant to the creation of a digital archive: A
preliminary assessment. Council on Library and Information Resources, Library of
Congress. (January 2003) http://www.clir.org/pubs/reports/pub112/pub112.pdf
[23] Bhagwat, D.; Chiticariu, L.; Tan, W.-C.; and Vijayvargiya, G. An Annotation
Management System for Relational Databases. In M. A. Nascimento, M. T. Özsu, D.
Kossmann, R. J. Miller, J. A. Blakeley, and K. B. Schiefer, editors, Proc. 30th
International Conference on Very Large Data Bases (VLDB 2004), pages 900–911.
Morgan Kaufmann, 2004.
[24] Bonatti P. A., Shahmehri N., Duma C., Olmedilla D., Nejdl W., Baldoni M., Baroglio
C., Martelli A., Patti V., Coraggio P., Antoniou G., Peer J., Fuchs N. Rule-based policy
specification: State of the art and future work. Technical report, Working Group I2, EU
NoE REWERSE, August 2004. http://rewerse.net/deliverables/i2-d1.pdf
[25] Bonatti P., Olmedilla D., Peer J. Advanced policy explanations. In 17th European
Conference on Artificial Intelligence (ECAI 2006), Riva del Garda, Italy, August 2006.
IOS Press.
[26] Bonatti P.A., De Capitani di Vimercati S., Samarati P. An algebra for composing access
control policies. ACM Trans. Inf. Syst. Secur., 5(1):1-35, 2002.
[27] Bonatti P.A., Duma C., Fuchs N., Nejdl W., Olmedilla D., Peer J., Shahmehri N..
Semantic web policies - a discussion of requirements and research issues. In 3rd
European Semantic Web Conference (ESWC), volume 4011 of Lecture Notes in
Computer Science, Budva, Montenegro, June 2006. Springer.
[28] Bonatti P.A., Olmedilla D. Policy language specification. Technical report, Working
Group I2, EU NoE REWERSE, February 2005. http://rewerse.net/deliverables/m12/i2-
d2.pdf.
[29] Booch, Grady; Rumbaugh, James; Jacobson, Ivar. The Unified Modeling Language
User Guide. Addison Wesley Longman. ISBN 0321267974. 2005
[30] Booth, D.; Haas, H.; McCabe, F.; Newcomer, E.; Champion, M.; Ferris, C.; Orchard, D.
Web Service Architecture. W3C Working Group Note, February 2004.
http://www.w3.org/TR/ws-arch/ (last visited February 21, 2007)
[31] Borbinha, J.; Kunze, J.; Spinazzé, A.; Mutschke, P.; Lieder, H.-J.; Mabe, M.; Dixson,
L.; Besser, H.; Dean, B.; Cathro, W. Reference models for digital libraries: actors and
roles. International Journal of Digital Libraries, 5(4):325-330, August 2005.
[32] Borgman C.L. What are digital libraries? Competing visions. Information processing
and Management, 35(3):227-243, 1999.
[33] Bottoni, P.; Civica, R.; Levialdi, S.; Orso, L.; Panizzi, E.; Trinchese, R. MADCOW: a
Multimedia Digital Annotation System. In Proceedings Working Conference on
Advanced Visual Interfaces (AVI 2004), 55–62, 2004.
[34] Börner K. Extracting and Visualizing Semantic Structures in Retrieval Results for
Browsing. In Proceedings of the fifth ACM conference on Digital libraries, San
Antonio, Texas, United States, 2000.
[35] Börner, K.; Chen C. Visual Interfaces to Digital Libraries. Lecture Notes In Computer
Science, Springer, 2003
[36] Bradner, S. Key words for use in RFCs to Indicate Requirements Levels. RFC 2119,
IETF, 1997
[37] BRICKS Integrated Project: Building Resources for Integrated Cultural Knowledge
Services http://www.brickscommunity.org (last visited February 21, 2007)
[38] Bruce, T.R.; Hillman, D.I. The Continuum of Metadata Quality, In: Hillman D. &
Westbrooks E. L. (eds.) Metadata in Practice. Chicago: American Library Association
(2004).
[39] Buschmann F., Meunier R., Rohnert H., Sommerlad P., Stal M. Pattern-Oriented
Software Architecture. John Wiley & Sons ISBN 0-471-95869-7, 1996
[40] Callan, J.P.; Lu, Z.;Croft, W.B. Searching Distributed Collections with Inference
Networks. In E.A. Fox, P. Ingwersen, and R. Fidel, editors, Proceedings of the 18th
Annual International ACM SIGIR Conference on Research and Development in
Information Retrieval, pages 21-28, Seattle, Washington, 1995. ACM Press.
[41] Candela, L.; Castelli, D.; Fuhr, N.; Ioannidis, Y.; Klas, C.-P.; Pagano, P.; Ross, S.;
Saidis, C.; Schek, H.-J.; Schuldt, H.; Springmann, M. Current Digital Library Systems:
User Requirements vs Provided Functionality. DELOS Deliverable D1.4.1, March
2006.
[42] Candela, L.; Castelli, D.; Ioannidis, Y.; Koutrika, G.; Pagano, P.; Ross, S.; Schek, H.-J.;
Schuldt, H. The Digital Library Manifesto. DELOS, 2006 ISBN 2-912335-24-8
[43] Candela, L.; Castelli, D.; Pagano, P.; Simi, M. OpenDLibG: Extending OpenDLib by
exploiting a gLite Grid Infrastructure. In Research and Advanced Technology for
Digital Libraries, 10th European Conference on Digital Libraries (ECDL 2006),
September 2006.
[44] Candela, L.; Castelli, D.; Pagano, P. Digital Library Reference Model. Project Report
2006-PR-02, Istituto di Scienza e Tecnologie dell’Informazione “A. Faedo”, CNR,
January 2006.
[45] Candela, L.; Castelli, D.; Pagano, P.; Simi, M. From Heterogeneous Information Spaces
to Virtual Documents. In Digital Libraries: Implementing Strategies and Sharing
Experiences, 8th International Conference on Asian Digital Libraries (ICADL 2005),
December 2005.
[46] Candela, L.; Castelli, D.; Pagano, P. A Service for Supporting Virtual Views of Large
Heterogeneous Digital Libraries. In Research and Advanced Technology for Digital
Libraries, 7th European Conference on Digital Libraries (ECDL 2003), August 2003.
[84] Gavriloaie R., Nejdl W., Olmedilla D., Seamons K., Winslett M. No registration
needed: How to use declarative policies and negotiation to access sensitive resources on
the semantic web. In 1st European Semantic Web Symposium (ESWS 2004), volume
3053 of Lecture Notes in Computer Science, pp. 342-356, Heraklion, Crete, Greece,
May 2004. Springer.
[85] Ghezzi, C.; Jazayeri, M.; Mandrioli, D. Fundamentals of Software Engineering (2nd
Ed.). Upper Saddle River, N.J., USA, Prentice Hall, 2003.
[86] Ginsburg, J. C. Copyright and Control Over New Technologies of Dissemination.
Columbia Law Review 101 (November 2001) : 1613-1647.
[87] Gladney, H.M. Principles for Digital Preservation. Communication of the ACM
49(2):111-116, February 2006.
[88] Gonçalves M.A. Streams, Structures, Spaces, Scenarios, and Societies (5S): A Formal
Model for Digital Library Framework and Its Applications. PhD thesis, Virginia
Polytechnic Institute and State University, November 2004.
[89] Gonçalves, M.A.; Fox, E.A. 5SL – A Language for Declarative Specification and
Generation of Digital Libraries. In Proceedings of the 2nd ACM/IEEE-CS Joint
Conference on Digital Libraries (JCDL‘02), Portland, Oregon, USA, 2002.
[90] Gonçalves, M.A.; Fox, E.A.; Watson, L.T.; Kipp, N.A. Streams, Structures, Spaces,
Scenarios, Societies (5S): A Formal Model for Digital Libraries. ACM Transactions of
Information Systems (TOIS), 22(2):270-312, 2004.
[91] Griffin, S.; Peters, C.; Thanos, C. Towards the new-generation digital libraries:
recommendations of the NSF/EU-DELOS working groups Guest editor introduction.
International Journal of Digital Libraries, 5(4):253-254, August 2005.
[92] Groth, D. P. and Streefkerk, K. Provenance and Annotation for Visual Exploration
Systems. IEEE Transactions On Visualization And Computer Graphics, 12(6):1500–
1510, November/December 2006.
[93] Guarino N., “Formal Ontology and Information Systems”, 1st International Conference
on Formal Ontology in Information Systems (FOIS’98), 3-15, Trento, June 1998.
[94] Gudgin, M.; Hadley, M.; Mendelsohn, N.; Moreau, J-J.; Nielsen, H. SOAP Version 1.2
Part 0: Primer. W3C Recommendation. http://www.w3.org/TR/soap12-part0/ (last
visited February 21, 2007)
[95] Gudgin, M.; Hadley, M.; Mendelsohn, N.; Moreau, J-J.; Nielsen, H. SOAP Version 1.2
Part 1: Messaging Framework. W3C Recommendation. http://www.w3.org/TR/soap12-
part1/ (last visited February 21, 2007)
[96] Guenther, R. PREMIS - Preservation Metadata Implementation Strategies Update 2:
Core Elements for Metadata to Support Digital Preservation. RLG DigiNews, 8(6),
December 15 2004. Retrieved December 22, 2004, from
http://www.rlg.org/en/page.php?Page_ID=20492#article2 (2004).
[97] Guercio, M., Lograno L., and Battistelli A.. Legislation, Rules and Policies for the
Preservation of Digital Resources, A SURVEY, 45 pp.
http://www.erpanet.org/events/workgroup/documents/Regulations_Policy%20_Dossier
_English%20version.pdf. (2003).
[98] Guy, M., Powell, A. and Day, M., Improving the quality of metadata in Eprint archives,
Ariadne, 2004, Issue 38, http://www.ariadne.ac.uk/issue38/guy/.
[99] Hallam-Baker P. M. and Behlendorf B. Extended Log File Format –W3CWorking Draft
WD-logfile-960323. http://www.w3.org/TR/WD-logfile.html, March 1996.
[100] Hartson, R.H.; Shivakumar, P.; Perez-Quinones, M.A. (2004). Usability Inspection of
Digital libraries: a Case Study. International Journal on Digital Libraries 4(2), pages
108–123.
[101] Harvey, R., Preserving Digital Materials. Munich: K.G. Saur, 246 pp. (2005).
[102] Heery, R.; Patel, M. Application profiles: mixing and matching metadata schemas.
Ariadne Issue 25, September 2000.
[103] Hodge, G., Frangakis E. Digital Preservation and Permanent Access to Scientific
Information; The State of the Practice: A report sponsored by The International Council
for Scientific and Technical Information and CENDI.
http://www.cendi.gov/publications/04-3dig_preserv.pdf. (2004).
[104] Huang, Z.; Chung, W.; Ong, T.-H.; Chen H. A graph-based recommender system for
digital library. In proceedings of the 2nd ACM/IEEE-CS joint conference on Digital
libraries (JCDL '02), 2002.
[105] IEEE Recommended Practice for Architectural Description, IEEE Std P1471, 2000
[106] IEEE Std 830-1998 IEEE recommended practice for software requirements
specifications. 20 Oct 1998
[107] IFLA Study Group on the Functional Requirements for Bibliographic Records.
Functional Requirements for Bibliographic Records: Final Report. September 1997.
http://www.ifla.org/VII/s13/frbr/frbr.htm (last visited February 21, 2007)
[108] Ingersoll, P.; Culshaw, J. Managing Information Technology. Libraries Unlimited,
2004.
[109] InterPARES Strategy Task Force Report: An Intellectual Framework for Policies,
Strategies, and Standards. In: The Long-Term Preservation of Authentic Electronic
Records: Findings of the InterPARES Project.
http://www.interpares.org/book/index.htm (2005).
[110] Ioannidis Y. (ed.) Digital Libraries: Future Directions for a European Research
Programme. DELOS Brainstorming Report, San Cassiano, Italy, June 2001.
[111] Ioannidis Y. Digital libraries at a crossroads. International Journal of Digital Libraries,
5(4):255-265, August 2005.
[112] Ioannidis, Y.; Maier, D.; Abiteboul, S.; Buneman, P.; Davidson, S.; Fox, E.; Halevy, A.;
Knoblock, C.; Rabitti, F.; Schek, H.; Weikum, G. Digital library information-
technology infrastructures. International Journal of Digital Libraries, 5(4):266-274,
August 2005.
[113] ISO 21127:2006 Information and documentation – A reference ontology for the
interchange of cultural heritage information. December 2006
[114] Jacobs, Ian; Walsh, N. (Eds.) Architecture of the World Wide Web, Volume One. W3C
Recommendation. http://www.w3.org/TR/webarch/
[115] Johnson, R.E.; Foote B. Designing reusable classes. Journal of object-oriented
programming, 1(2):22-35, 1988.
[116] Jones S. Graphical Query Specification and Dynamic Result Previews for a Digital
Library. In Proceedings of 11th Annual ACM Symposium on User Interface Software
and Technology. San Francisco, California, United States. 1998.
[117] Jones, S.; Cunningham, S.J.; McNab, R.; Boddie, S. A Transaction Log Analysis of a
Digital Library. International Journal on Digital Libraries 3(2), pages 152–169.
[118] Kagal L., Finin T., Joshi A. A policy based approach to security for the semantic web.
In 2nd International Semantic Web Conference (ISWC), vol. 2870 of Lecture Notes in
Computer Science, pp. 402-418, Sanibel Island, FL, USA, October 2003. Springer.
[119] Kagal L., Finin T., Joshi A. A policy language for a pervasive computing environment.
In 4th IEEE International Workshop on Policies for Distributed Systems and Networks
(POLICY), pages 63-, Lake Como, Italy, June 2003. IEEE Computer Society.
[120] Kagal L., Paolucci M., Srinivasan N., Denker G., Finin T.W., Sycara K.P. Authorization
and privacy for semantic web services. IEEE Intelligent Systems, 19(4):50-56, 2004.
[121] Kahan, J.; Koivunen, M.-R. Annotea: an open RDF infrastructure for shared Web
annotations. Proceedings of the 10th International Conference on World Wide Web
(WWW 2001), 623–632, 2001.
[122] Kahle, B. Jackson, M., Prelinger, R. Public access to digital materials. D-Lib Magazine,
7 (10), 2001.
[123] Kaushik S., Ammann P., Wijesekera D., Winsborough W., Ritchey R.. A policy driven
approach to email services. In 5th IEEE International Workshop on Policies for
Distributed Systems and Networks (POLICY), p. 169-, Yorktown Heights, NY, USA,
June 2004. IEEE Computer Society.
[124] Kaushik, S., Winsborough, W., Wijesekera, D., and Ammann, P. Email feedback: a
policy-based approach to overcoming false positives. In Proceedings of the 2005 ACM
Workshop on Formal Methods in Security Engineering (Fairfax, VA, USA, November
11, 2005). FMSE '05. ACM, New York, NY, pp. 73-82. (2005).
[125] Ke, H.-R.; Kwakkelaar, R.; Tai, Y.-M.; Chen, L.-C. (2002). Exploring Behavior of E-
journal Users in Science and Technology: Transaction Log Analysis of Elsevier’s
ScienceDirect OnSite in Taiwan. Library and Information Science Research 24, pages
265–291.
[126] Kelly, B.; Guy, M.; James, H. Developing A Quality Culture For Digital Library
Programmes. Proceedings of the EUNIS 2003 Conference. Amsterdam, Holland, 2003.
[127] Kengeri, R.; Fox, E.A.; Reddy, H.P.; Harley, H.D.; Seals, C.D. (1999). Usability Study
of Digital Libraries: ACM, IEEE-CS, NCSTRL, NDLTD. International Journal on
Digital Libraries 2(2), pages 157–169.
[128] Kerry, A. Digital Preservation: The Research and Development Agenda in Australia.
21 pp. http://www.swinburne.edu.au/lib/digcontforum/DigitalContinuity_AKerry.pdf
(2001).
[129] Kolari P., Ding L., Ganjugunte S., Joshi A., Finin T., Kagal L.. Enhancing web privacy
protection through declarative policies. In 6th IEEE International Workshop on Policies
for Distributed Systems and Networks (POLICY), pp. 57-66, Stockholm, Sweden, June
2005. IEEE Computer Society.
[130] Kuhlthau, C. C. (1991). Inside the Search Process: Information Seeking from the User’s
Perspective. Journal of the American Society for Information Science 42(5), pages 361–
371.
[131] Kuny, T.; Cleveland, G. The Digital Library: Myths and Challenges. In 62nd IFLA
General Conference, August 1996.
[132] Kyrillidou, M.; Giersch, S. Developing the DigiQUAL Protocol for Digital Library
Evaluation. Proceedings of the 5th ACM/IEEE-CS joint conference on Digital libraries.
Denver, CO, USA, pages 172-173, 2005.
[133] Lagoze, C.; Payette, S.; Shin, E.; Wilper, C. Fedora: an architecture for complex
objects and their relationships. International Journal of Digital Libraries, 6, 2006.
[134] Lagoze, C. and Van de Sompel, H. 2001. The open archives initiative: building a low-
barrier interoperability framework. In Proceedings of the 1st ACM/IEEE-CS Joint
Conference on Digital Libraries (Roanoke, Virginia, United States). JCDL '01. ACM,
New York, NY, 54-62. DOI= http://doi.acm.org/10.1145/379437.379449
[135] Lagoze, C.; Van de Sompel, H.; Nelson, M.; Warner, S. The Open Archives Initiative
Protocol for Metadata Harvesting
http://www.openarchives.org/OAI/openarchivesprotocol.html (last visited February 21,
2007)
[136] Lannom, L. Handle System Overview. In Proceedings of the 66th IFLA Council and
General Conference, Jerusalem, Israel, August 2000.
[137] Lavoie, B. The Incentives to Preserve Digital Materials: roles, scenarios and economic
decision-making. http://www.oclc.org/research/projects/digipres/incentives-dp.pdf.
(2003).
[138] Lavoie, B., and Dempsey L. Thirteen ways of looking at digital preservation. DLib
Magazine 10 (7/8). http://www.dlib.org/dlib/july04/lavoie/07lavoie.html. (2004).
[139] Lavoie, B.; Henry, G.; Dempsey, L. A Service Framework for Libraries. D-Lib
Magazine, 11(7/8), July/August 2006.
[140] Lesk, M. Expanding Digital Library Research: Media, Genre, Place and Subjects. In
Proceedings on the International Symposium on Digital Library 1999: ISDL’99, pages
51-57, Tsukuba, Ibaraki, Japan, September 1999.
[141] Library of Congress. MARC Standards. http://www.loc.gov/marc/ (last visited February
21, 2007)
[142] Library of Congress. METS Metadata Encoding & Transmission Standard.
http://www.loc.gov/standards/mets/ (last visited February 21, 2007)
[143] Lomow, G.; Newcomer, E. Understanding SOA with Web Services. Addison Wesley
Professional, 2005.
[144] Lord, P., and Macdonald A. e-Science Curation Report. Data curation for e-science in
the UK: an audit to establish requirements for future curation and provision. The JISC
Committee for the Support of Research (JCSR).
http://www.jisc.ac.uk/uploaded_documents/e-ScienceReportFinal.pdf. (2003).
[145] Lord, P., Macdonald A., Lyon L., and David Giaretta. From Data Deluge to Data
Curation. www.dcc.ac.uk/docs/AHM-150-rev-ll-dg.doc. (2005).
[146] Lynch C. Canonicalization: A Fundamental Tool to Facilitate Preservation and
Management of Digital Information. D-Lib Magazine 5(9), September 1999.
[147] MacKenzie, M.; Laskey, K.; McCabe, F.; Brown, P.; Metz, R. Reference Model for
Service Oriented Architecture. OASIS Committee Draft 1.0, February 2006.
[148] Marsh J. XML Base. W3C Recommendation. http://www.w3.org/TR/xmlbase/ (last
visited February 21, 2007)
[149] Maximilien, E.M.; Singh, M.P. A Framework and Ontology for Dynamic Web Services
Selection. IEEE Internet Computing 8(5):84-93, September-October 2004
[150] McClure, C. R. and Bertot, J.C. 2001. Evaluating Networked Information Services:
Techniques, Policy, and Issues. Medford, N.J.:Information Today.
[151] McMillan, G. The NDLTD, and Issues of Long Term Preservation and Archiving. IT'S
ABOUT TIME! Paper read at Next Steps: Electronic Theses and Dissertations
Worldwide ETD 2003, at Humboldt University, Berlin, Germany. http://edoc.hu-
berlin.de/etd2003/mcmillan-gail/PDF/index.pdf (2003).
[152] Melnik, S., Garcia-Molina, H., Paepcke, A. A mediation infrastructure for digital library
services. Proceedings of the fifth ACM conference on Digital libraries, p.123-132, June
02-07, 2000, San Antonio, Texas, United States
[153] Menell, P. S. Envisioning Copyright Law’s Digital Future. New York Law School Law
Review 46 (2002/2003), pp. 63-199.
[154] Metrics for Metadata website, http://ariadne.cti.espol.edu.ec/M4M/
[155] Mullen, S.; Crawford, M.; Lorch, M.; Skow, D. Site Requirements for Grid
Authentication, Authorization and Accounting. GGF Document Series, Document
Number GFD-I.032, October 2004.
[156] Navarro, G.; Baeza-Yates, R. Proximal Nodes: A Model to Query Document Databases
by Content and Structure. ACM Transactions on Information Systems (TOIS),
15(4):400–435, October 1997.
[157] Nejdl W., Olmedilla D., Winslett M., Zhang C. Ontology-based policy specification and
management. In 2nd European Semantic Web Conference (ESWC), volume 3532 of
Lecture Notes in Computer Science, pp. 290-302, Heraklion, Crete, Greece, May 2005.
Springer.
[158] Novak, J.D.; Gowin, D.B. Learning How to Learn. Cambridge University Press, 1984.
[159] Novak, J.D.; Cañas, A.J. The Theory Underlying Concept Maps and How to Construct
Them, Technical Report IHMC CmapTools 2006-01, Florida Institute for Human and
Machine Cognition, 2006.
[160] NSF/DELOS Working Group. Invest to Save. Report and Recommendations of the
NSF/DELOS Working Group on Digital Archiving and Preservation.
http://eprints.erpanet.org/archive/00000048/01/Digitalarchiving.pdf (2003).
[161] Nuseibeh, B.; and Easterbrook, S. Requirements Engineering: A Roadmap. In A.
Finkelstein (Ed.), The Future of Software Engineering, 22nd International Conference on
Software Engineering ICSE 2000, Limerick, Ireland, June 2000
[162] Oberle, D.; Lamparter, S.; Grimm, S.; Vrandecic, D.; Staab, S.; Gangemi, A. Towards
Ontologies for Formalizing Modularization and Communication in Large Software
Systems. Journal of Applied Ontology, 2006.
[163] OCLC/RLG PREMIS Working Group Implementing Preservation Repositories for
Digital Materials: Current Practice and Emerging Trends in the Cultural Heritage
Community. Report by the joint OCLC/RLG Working Group Preservation Metadata:
Implementation Strategies (PREMIS). Dublin, O.: OCLC Online Computer Library
Center, Inc. http://www.oclc.org/research/projects/pmwg/surveyreport.pdf (2004).
[164] Paepcke, A., Chang, C. K., Winograd, T., and García-Molina, H. 1998. Interoperability
for digital libraries worldwide. Communications of the ACM 41, 4 (Apr. 1998), 33-42.
DOI= http://doi.acm.org/10.1145/273035.273044
[165] Payette, S.D.; Rieger, O.Y. Supporting Scholarly Inquiry: Incorporating Users in the
Design of the Digital Library. The Journal of Academic Librarianship 24(2), pages 121–
129, 1998.
[166] PREMIS Working Group. Data Dictionary for Preservation metadata: Final Report of
the PREMIS Working Group. May 2005.
[167] Rieger O. Preservation in the Age of Large-Scale Digitization. Washington, DC:
Council on Library and Information Resources.
http://www.clir.org/activities/details/lsdi.pdf (2007).
[168] Ross, S. Digital Preservation, Archival Science and Methodological Foundations for
Digital Libraries. In Proc. ECDL 2007, Budapest.
http://eprints.erpanet.org/131/01/Keynote_ECDL2007_SROSS.pdf (2007).
[169] Ross, S.; Hedstrom, M. Preservation research and sustainable digital libraries.
International Journal of Digital Libraries, 5(4):317-324, August 2005.
[170] Rousseau, G.K.; Rogers, W.; Mead, S.E.; Sit, R A.; RousseauJamieson, B.A. Assessing
the Usability of On-line Library Systems. Behaviour and Information Technology 17(5),
pages 274–281, 1998.
[171] Salton, G.; McGill, M.J. Introduction to Modern Information Retrieval. McGraw-Hill,
New York, NY, 1983.
[172] Saracevic, T. Evaluation of Digital Libraries: an Overview. In: Agosti, M.; Fuhr, N.
(eds.): Notes of the DELOS WP7 Workshop on the Evaluation of Digital Libraries.
Padua, Italy, 2004 http://dlib.ionio.gr/wp7/workshop2004_program.html (last visited
February 21, 2007)
[173] Sfakakis, M.; Kapidakis, S. User Behavior Tendencies on Data Collections in a Digital
Library, volume 2458 of Lecture Notes In Computer Science, pages 550–559. Springer-
Verlag, Berlin: Heidelberg. Research and Advanced Technology for Digital Libraries:
6th European Conference, ECDL 2002, Rome, Italy, September 16-18, 2002.
Proceedings.
[174] Sharp, H., Finkelstein, A. & Galal, G. (1999). Stakeholder Identification in the
Requirements Engineering Process. Workshop on Requirements Engineering Processes
(REP'99) - DEXA'99, Florence, Italy, 1-3 September 1999, pp. 387-391
[175] Schek, H.-J.; Schuldt, H. DelosDLMS – Infrastructure for the Next Generation of
Digital Library Management Systems. ERCIM News, No. 66, July 2006.
[176] Shen, R. Applying the 5S Framework To Integrating Digital Libraries. Virginia
Polytechnic Institute and State University, PhD Thesis in Computer Science and
Applications, Blacksburg, Virginia, USA, 2006.
[177] Sheneiderman, B.; Feldman, D. Visualizing Digital Library Search Results with
Categorical and Hierarchical Axes. Fifth ACM International Conference on Digital
Libraries. San Antonio, Texas, United States, 2000.
[178] Shoshani, A. Storage Resource Managers: Middleware Components for Grid Storage.
In Proceedings of the Nineteenth IEEE Symposium on Mass Storage Systems (MSS
'02), 2002.
[179] Shoshani, A.; Sim, A.; Gu, J. Storage Resource Managers. In Grid Resource
Management: State of the Art and Future Trends, Chapter 20, 2003.
[180] Sobel, L. S. DRM as an Enabler of Business Models: ISPs as Digital Retailers. Berkeley
Technology Law Journal 18 (Spring 2003) : 667-695.
[181] Soergel, D. A Framework for Digital Library Research. DLib Magazine, 8(12),
December 2002.
[182] Soergel, D.; Christensen-Dalsgaard, B.; Ioannidis, Y.; Schuldt, H. (Eds.)
Recommendations and Observations for a European Digital Library (EDL). 4th DELOS
Brainstorming Workshop Report, Juan-le-Pins, France, 5-6 December 2005. ISBN 2-
912335-19-1
[183] Smeaton, A.F.; Callan, J. Personalisation and recommender systems in digital libraries.
International Journal of Digital Libraries, 5(4):299-308, August 2005.
[184] Snowdon, D.N.; Churchill, E. F.; Frecon, E. (Eds.) Inhabited Information Spaces -
Living with your Data. Springer, London 2004.
[185] Strodl, S., Becker, C., Neumayer, R., Rauber, A. How to Choose a Digital Preservation
Strategy: Evaluating a Preservation Planning Procedure. Proc. of the 2007 Conference
on Digital Libraries. Vancouver, BC, Canada, JCDL '07. ACM Press, New York, NY,
29-38 (2007).
[186] Sutcliffe, A.G.; Ennis, M.; Hu, J. Evaluating the Effectiveness of Visual User Interfaces
for Information Retrieval. International Journal of Human-Computer Studies 53, pages
741–763, 2000.
[187] Suzan, K. D. Tapping to the Beat of a Digital Drummer: Fine Tuning U.S. Copyright
Law for Music Distribution on the Internet. Albany Law Review 59 (1995) : 789-829.
[188] Tansley, R.; Bass, M.; Smith, M. DSpace as an Open Archival Information System:
Current Status and Future Directions. In Research and Advanced Technology for
Digital Libraries, 7th European Conference of Digital Libraries (ECDL 2003), August
2003.
[189] Tedd, L. A., Large, A. Digital libraries: principles and practice in a global environment.
Germany: K.G. Saur, 2005.
[190] The CIDOC Conceptual Reference Model. http://cidoc.ics.forth.gr (last visited February
21, 2007)
[191] The International DOI Foundation. The DOI Handbook. Edition 4.2.0, doi:10.1000/186,
February 2005.
[192] Thong, J.Y.L.; Hong, W.; Tam, K. Understanding User Acceptance of Digital
Libraries: What Are the Roles of Interface Characteristics, Organizational Context, and
Individual Differences? International Journal of Human-Computer Studies 57(3), pages
215–242, 2002.
[193] Tonti G., Bradshaw J., Jeffers R., Montanari R., Suri N., Uszok A. Semantic web
languages for policy representation and reasoning: A comparison of kaos, rei, and
ponder. In 2nd International Semantic Web Conference (ISWC), volume 2870 of
Lecture Notes in Computer Science, pp. 419-437, Sanibel Island, FL, USA, October
2003. Springer.
[194] Traw J. Library Web Site Policies (Clip Notes), American Library Association (May
2000). 98 pp., ISBN-10: 0838980880, ISBN-13: 978-0838980880
[195] Tsakonas, G.; Kapidakis, S.; Papatheodorou, C. Evaluation of User Interaction in
Digital Libraries. In: Agosti, M.; Fuhr, N. (Eds.): Notes of the DELOS WP7 Workshop
on the Evaluation of Digital Libraries. Padua, Italy, 2004.
http://dlib.ionio.gr/wp7/workshop2004_program.html (last visited February 21, 2007)
[196] Uszok A., Bradshaw J., Jeffers R. Kaos: A policy and domain services framework for
grid computing and semantic web services. In iTrust, vol. 2995, Lecture Notes in
Computer Science, pp. 16-26, Oxford, UK, April 2004. Springer.
[197] Van de Sompel, H., Lagoze, C., Bekaert, J., Liu, X., Payette, S., Warner, S. (2006). An
Interoperable Fabric for Scholarly Value Chains. D-Lib Magazine, 12(10). October
2006. doi:10.1045/october2006-vandesompel
[198] Waller R. (ed.) DELOS Clusters to Contribute to a Joint Prototype. DELOS Newsletter
#5.
http://www.delos.info/index.php?option=com_content&task=view&id=356&Itemid=13
3 (last visited February 21, 2007)
[199] Walter, V. A. Becoming digital: Policy implications for library and youth services.
Library Trends, 45(4): pp. 585-601, 1997.
[200] Weiser, P. J. The Internet, Innovation, and Intellectual Property Policy. Columbia Law
Review 103: pp. 534-613, 2003.
[201] Wiederhold G. Mediators in the Architecture of Future Information Systems. Computer,
vol. 25, no. 3, pp. 38-49, 1992.
[202] Wiederhold, G.; Wegner, P.; Ceri, S. Toward Megaprogramming. Communication of
the ACM, 38(11):89-99, November 1992.
[203] Wilson T. D. Information Behaviour: An Interdisciplinary Perspective. Information
Processing and Management 33(4), pages 551–572, 1997.
[204] Wilson T. D. Exploring Models of Information Behaviour: the Uncertainty Project.
(35), pages 893–849, 1999.
[205] Witten, I.H.; Bainbridge, D.; Boddie, S.J. Power to the People: End-user Building of
Digital Library Collections. ACM Press, 94-103, 2001.
[206] Wormell, I. (Editor). Information quality: Definitions and dimensions. Los Angeles,
CA, USA, Taylor Graham, 1990.
[207] Zachman, A. J. A framework for information systems architecture. IBM Systems
Journal, Vol 26, No 3. 1987.