Final Fundamentals of Metadata (1)
Final Fundamentals of Metadata (1)
By
Victoria Sokari
University Library, Bayero University, Kano
[email protected]
Introduction
In the time past, cataloguing was done using standards commonly ISBD, AACR2 and later the
MARC 21 in libraries. However, libraries and other cultural institutions of today have been
required to make a drastic move and change their approaches. This is as a result of the fact that
those standards that have been used for decades have come under increasing pressure to either
adapt new circumstances or to give way entirely to different standards, because of their shortfalls
in serving current purposes and circumstances (ALA, 2012).
One of the inevitable reasons for this change/transformation in the words of Coyle (2010) is that
the library data, despite being saved and accessed via computers, is designed mostly for use and
consumption by humans. Non-the-less, it has been observed that, people are not the only users of
data produced in the name of bibliographic control. Machine applications also interact with those
data in a variety of ways (Library of Congress, 2008). More so, the drastic technological
advancement may set the need: for system migration; to update to a different schema; to
disseminate metadata to an aggregator (e.g. library); to adopt change to an input standard; to
entirely refresh to philosophy. In addition, technological advancement has resulted in great
change in how users seek for information for instance, user research methods and their
expectations. The aforementioned reasons give way for the need for web sharable data.
Unfortunately, library data is not integrated with the web as most of the data is encoded in
natural language rather than as data (Baker, et al. 2011).
The older standards (like AACR2, MARC 21) are too library centric and not sufficient enough to
cater for the present and future needs of the web era. For this reason, the World Wide Web
Consortium (W3C) provides a way forward which is to build the Semantic Web using the Linked
Data principles to implement standards that seems to fit well with the legacy metadata produced
and maintained by libraries and other cultural institutions (ALA, 2012). In fact, we are facing a
chaotic atmosphere in Libraries at the moment as these changes take a toll on us. However, we
are not relenting as we must stand on our toes and work rigorously to make the necessary
changes that are required to keep us in the business of information service delivery.
Describing resources
Community supported standards for structure and input
Scale
Cataloging typically involves consideration/creation of description on a per item basis
Metadata typically involves consideration/creation of a description on a bulk scale
Metadata work also seeks to transform existing metadata
Diversity
Cataloging is widely understood to entail input governed by AACR/AACR2/RDA
into the MARC format
Metadata entails input governed by any number of standards into a variety of schema
Cataloging involves describing resources, typically on an item-by-item basis, to
conform to a known universe of description and discovery e.g., MARC in an
ILS/OPAC
Metadata frequently requires creating the universe of description and discovery to
approach description at bulk scale e.g. any schema in a variety of management
systems/discovery layers
Evaluation of schema recalls FRBR user tasks: find, identify, select, obtain
This approach is philosophically aligned with a linked data world, wherein the
metadata may be repurposed and repackaged in many contexts
Types of metadata
Administrative
Descriptive
Access/Use
Preservation
Structural
Technical
Descriptive • Information provided for the discoverability of the resource • Examples: Title;
Date; Subjects
Access/use • Information pertinent to finding and accessing the resource • Examples: Conditions
governing access (e.g., a collection is restricted); Conditions governing use (e.g., copyright
restrictions may apply)
Technical • Details of the digital object and its creation • Examples: Size; File type; Date
digitized
Standards
Schema: Metadata schemes (also called schema) are sets of metadata elements designed for a
specific purpose, such as describing a particular type of information resource. The definition or
meaning of the elements themselves is known as the semantics of the scheme. The values given
to metadata elements are the content. ‐NISO “Understanding Metadata”, 2004
Input/Content Standards
Pros
Responsibility for maintenance and documentation is spread across a larger group
External perspectives on proposals can lead to stronger end product that is usable
across a larger group
Supports interoperability across the community
Cons
Customization may be limited or more arduous to enact
Local Creation & Maintenance
Pros
Cons
Select schema
Select input standard
Select controlled vocabularies
Creation:
Management Systems
CSV file
Comma Separated Values: a comma is used as a delimiter to identify where a
machine should parse values into separate fields
Format in use with computers since 1972.
Relatively software agnostic: created in a plain text file, with each line
representing a data record, with each field within the record separated by a
comma
Common data exchange format; many software applications can export/import
CSV
Best used for very simple data that is highly consistent and does not contain
commas within the input value
Spreadsheet
Computer software initially implemented in 1962 that allows a user to create and
store data in an interactive table form.
Relatively software agnostic: most computers are loaded with a basic spreadsheet
program, and there are web based products as well.
Data is separated into columns and rows, with an expectation of consistency in
type of input between columns or rows. Data can be sorted in columns to bring
together identical input
XML editor
Computer software created to facilitate creation of well formed XML
Extensible Markup Language is derived from Standardized General Markup
Language (SGML) that was created in the mid to late 1990s.
XML editors typically require the user to read and create native XML syntax
When a valid schema is named at the beginning of XML document, many XML
editors can provide real-time feedback regarding any structural issues with the
metadata
Google Forms
Conceived as a cloud based survey application integrated with Google
spreadsheet, released in 2007
Includes options such as drop down menus, radio buttons and checkboxes;
does not require user knowledge of metadata schema and can be used to
avoid typos and other common consistency errors
Responses populate a Google spreadsheet
Transformation
What is transformation?
Goals of transformation
A system migration
Update to schema in use
Send to an aggregator
Change to input standard
Refresh to philosophy
User research methods change
User expectations change
Transformation examples
MarcEdit
Stylesheets (XSLT – Extensible Stylesheet Language Transformations)
Caveat emptor: metadata transformations are unlikely to be lossless. Always read the
fine print on crosswalk documentation.
Practical Lessons
Practical steps for creating metadata involve:
selecting a schema, input standard and controlled vocabularies
determining a method of metadata creation, using a management system or independent
tools
adequately document context, criteria and methodology.
MARC 21 http://www.loc.gov/marc/bibliographic/
MODS http://www.loc.gov/standards/mods/mods-outline-3-6.html
VRACore http://www.loc.gov/standards/vracore/VRA_Core4_Element_Description.pdf
Conclusion
To create metadata in a linked data environment, cataloguers, especially in Nigeria are expected
to re-tool and re-skill on the new ways as evidenced in the tutorials covered. There are numerous
things to learn and unlearn. We need to brush up on our meta-competencies now more than ever,
because that is what is required in order to have a competitive edge in the information world.
References:
American Library Association. (2012) "Transforming Library Metadata into Linked Library
Data" .http://www.ala.org/alcts/resources/org/cat/research/linked-data (Accessed October
16, 2021) Document ID: fc4b530d-00bb-4aef-b6d4-21d40f4ae19c
NISO (2004). Understanding Metadata. Bethesda, MD: NISO Press, National Information
Standards Organization. Accessed on October 16, 2021
http://www.niso.org/publications/press/UnderstandingMetadata.pdf
Exercise
1) Select what characteristic(s) cataloging and metadata have in common:
Select one:
a. Each has a commitment to standards geared toward interoperability
b. Usually resources are available to a diverse set of users, excepting special libraries
c. Each strongly values a community of practice
d. All of the above
2) True or false: cataloging and metadata are similar in that each involves describing resources
using community supported standards for structure and input.
Select one:
True
False
7) Metadata schema:
Select one:
a. Define a set of elements within a context
b. Define rules for machine validation
c. Provide structure for presentation
d. All of the above
9) Input standards provide for consistence use of terminology and define the minimum standard
for adequate description.
Select one:
True
False
13) Which of the following is an advantage to local creation and maintenance of metadata
standards?
Select one:
a. Organizations bear full responsibility for workload and documentation
b. Ability exists for customization when desired
c. Absence of external perspectives and feedback
d. Limited interoperability
14) Which of the following are advantages to community creation and maintenance of metadata
standards? Select one:
a. Supports interoperability
b. Responsibility for maintenance and documentation is spread across a larger group
c. External perspectives on proposals can lead to stronger end product that is usable across a
larger group
d. All of the above