SlideShare a Scribd company logo
The ARK Alliance:
20 years
850 institutions
8.2 billion persistent identifiers
John Kunze, California Digital Library
University of California Office of the President
October 2021
Digital preservation means
Long term protection for digital resources
● from human error, natural disaster, legal challenge, deliberate attack,
social upheaval, bankruptcy, etc.
Long term access to those resources from unbroken links
● with persistent identifiers (PIDs), also known as permalinks
Why persistent identifiers?
Because of “link rot” (broken references, 404 Not Found)
● Reliable, unbroken web links (URLs) are rare
● The average URL lifetime is only 100 days
But why not just search when you need a link?
● Because scholars and researchers take years to find their object references
Common types of persistent identifiers
● PURL, Handle, URN, DOI, ARK
A labelled URL with a globally unique identity inside it
https://n2t.net/ark:/12345/fk1234
makes ARK
actionable
(the resolver)
core globally unique
identity (independent
of web and hostname)
What is an ARK (Archival Resource Key)?
ARK anatomy
https://example.org/ark:/12345/x54xz321/s3/f8.05v.tiff
_________________/ __/ ___/ ______/____/_______/
| | | | | |
| ARK Label | | Sub-parts Variants
| | |
Name Mapping Authority (NMA) | Assigned Name
|
Name Assigning Authority Number (NAAN)
Why ARKs?
Major causes of broken links, and some features PURL Handle URN DOI ARK
Prevents fire, war, flood, attack, bankruptcy, ... No No No No No
Prevents human error No No No No No
Guarantees your links, or fixes them for you No No No No No
Decentralized admin plus inferenceable syntax No No No No Yes
Flexible metadata and persistence statements No No No No Yes
Identifiers extensible during resolution Yes No Yes No Yes
Free, non-paywalled, in unlimited numbers Yes No Yes No Yes
Who is using ARKs?
University of California Berkeley
Smithsonian National Museum
National Library of France
University of Chicago
Musée du Louvre
Family Search
British Library
Google
Internet Archive
Caltech Archives
Hawaii State Archives
French National Archives
Rockefeller Archive Center
Library and Archives Canada
Archives de la Ville de Genève
Silent Film Sound & Music Archive
• Libraries, data centers, archives, museums,
publishers, government agencies, and vendors
• Example institutions:
What are ARKs used for?
● genealogical records (8 billion FamilySearch)
● publisher content (100 million Portico)
● scientific datasets and records (22 million INIST)
● scanned books and texts 30 million Internet Archive)
● bibliographic records (15 million BnF main catalog)
● museum specimens (15 million Smithsonian Institution)
● public health documents (15 million UCSF IDL)
● historical documents (21 million CDL, 5 million BnF Gallica)
● historical authors and scholars (4 million SNAC)
● fine art museum collections (483,000 Louvre)
● vocabulary terms (9,000 Periodo, YAMZ)
ARK Alliance: 850 institutions and
8.2 billion ARKs in 20 years
Home of the ARK Alliance
arks.org
Join one of our working groups: info@arks.org
Get started with ARKs by filling out:
n2t.net/e/naan_request
Stay in touch:
● Twitter: @arks_org
● Email forum (English): groups.google.com/group/arks-forum
● Email forum (French): framalistes.org/sympa/info/arks-forum-fr
The ARK Alliance

More Related Content

PPTX
Finding Aids Slides
jennifer whitlock
 
PDF
Arch 192 New York
Michele Laing
 
PDF
Claudia Marinica - Supporting Semantic Interoperability in Conservation-Resto...
ariadnenetwork
 
PPT
New Collections Management
Nicholas Poole
 
PPTX
Documentation
Virag Sontakke
 
PDF
Provenance in Databases and Scientific Workflows: Part I
Bertram Ludäscher
 
PDF
How the Web of Data Will be Won
Jeni Tennison
 
PDF
Ruest and Milligan - The Great WARC Adventure
Ian Milligan
 
Finding Aids Slides
jennifer whitlock
 
Arch 192 New York
Michele Laing
 
Claudia Marinica - Supporting Semantic Interoperability in Conservation-Resto...
ariadnenetwork
 
New Collections Management
Nicholas Poole
 
Documentation
Virag Sontakke
 
Provenance in Databases and Scientific Workflows: Part I
Bertram Ludäscher
 
How the Web of Data Will be Won
Jeni Tennison
 
Ruest and Milligan - The Great WARC Adventure
Ian Milligan
 

Similar to The ARK Alliance: 20 years, 850 institutions, 8.2 billion persistent identifiers, 2021-10-22 (20)

PDF
DCMI ARK Tutorial 2024.10.20, slides and notes, 120 mins.pdf
John Kunze
 
PDF
The ARK Identifier Scheme at Ten Years Old
John Kunze
 
PPTX
Names, Things, and Open Identifier Infrastructure: N2T and ARKs
John Kunze
 
PDF
E-ARK-iPRES2016-Bern-October-2016
Sven Schlarb
 
PPTX
ARK identifiers: lessons learnt at BnF: paths forward
John Kunze
 
PDF
Slides anu talkwebarchivingaug2012
Roxanne Missingham
 
PDF
The web is a mess: how I learnt to stop worrying and love web archiving. Kris...
Biblioteca Nacional de España
 
PPTX
Collaboration and Cash: Web Archiving Incentive Awards
Anna Perricci
 
PPTX
Building Archivable Websites
nullhandle
 
PPTX
Making the Black Hole Gray: Web Archiving Art Resources at New York Art Resou...
The Frick Collection
 
PDF
E-ARK: Open Data Mining for Government Archives
Danube University Krems, Centre for E-Governance
 
PDF
Web Archiving: A Brief Introduction
Sawood Alam
 
PDF
Integrating web archiving in preservation workflows. Louise Fauduet, Clément ...
Biblioteca Nacional de España
 
PPTX
AtoM, Authenticity, and the Chain of Custody
Artefactual Systems - AtoM
 
PDF
Internet content as research data
National Library of Australia
 
PPTX
Uniform Resource Locator (URL), PURL.pptx
DrIrfanulHaqAkhoon
 
PPT
The development of web archiving 3
Essam Obaid
 
PDF
Digital Archives on a Dime
Jason Henderson
 
PDF
Archival Technologies
Cliff Landis
 
PDF
Archival Technologies 2014
Cliff Landis
 
DCMI ARK Tutorial 2024.10.20, slides and notes, 120 mins.pdf
John Kunze
 
The ARK Identifier Scheme at Ten Years Old
John Kunze
 
Names, Things, and Open Identifier Infrastructure: N2T and ARKs
John Kunze
 
E-ARK-iPRES2016-Bern-October-2016
Sven Schlarb
 
ARK identifiers: lessons learnt at BnF: paths forward
John Kunze
 
Slides anu talkwebarchivingaug2012
Roxanne Missingham
 
The web is a mess: how I learnt to stop worrying and love web archiving. Kris...
Biblioteca Nacional de España
 
Collaboration and Cash: Web Archiving Incentive Awards
Anna Perricci
 
Building Archivable Websites
nullhandle
 
Making the Black Hole Gray: Web Archiving Art Resources at New York Art Resou...
The Frick Collection
 
E-ARK: Open Data Mining for Government Archives
Danube University Krems, Centre for E-Governance
 
Web Archiving: A Brief Introduction
Sawood Alam
 
Integrating web archiving in preservation workflows. Louise Fauduet, Clément ...
Biblioteca Nacional de España
 
AtoM, Authenticity, and the Chain of Custody
Artefactual Systems - AtoM
 
Internet content as research data
National Library of Australia
 
Uniform Resource Locator (URL), PURL.pptx
DrIrfanulHaqAkhoon
 
The development of web archiving 3
Essam Obaid
 
Digital Archives on a Dime
Jason Henderson
 
Archival Technologies
Cliff Landis
 
Archival Technologies 2014
Cliff Landis
 
Ad

More from John Kunze (20)

PPTX
The YAMZ Metadictionary
John Kunze
 
PPTX
YAMZ Metadata Vocabulary Builder
John Kunze
 
PDF
EZID and N2T at CDL
John Kunze
 
PDF
YAMZ.net: better, faster, cheaper taxonomy building
John Kunze
 
PDF
A Vocabulary for Persistence
John Kunze
 
PDF
Identifiers obey Resolvers not Schemes
John Kunze
 
PPTX
YAMZ: a cross-domain crowd-sourced metadata vocabulary
John Kunze
 
PPTX
DataONE Preservation and Metadata Working Group Report 2014
John Kunze
 
PPTX
Selected Bash shell tricks from Camp CDL breakout group
John Kunze
 
PDF
Annotating Research Datasets
John Kunze
 
PPTX
The Data Management Ecosystem
John Kunze
 
PPTX
Library Tools Supporting Data-Rich Research
John Kunze
 
PPTX
Big Data's Long Tail
John Kunze
 
PPTX
Pamwg 2012ahm
John Kunze
 
PPTX
Scalable Identifiers for Natural History Collections
John Kunze
 
PDF
Future-Proofing the Web: What We Can Do Today
John Kunze
 
PDF
Supporting Data-Rich Research on Many Fronts
John Kunze
 
PDF
New Metaphors: Data Papers and Data Citations
John Kunze
 
PDF
Pairtrees for object storage
John Kunze
 
PDF
The BagIt file package format
John Kunze
 
The YAMZ Metadictionary
John Kunze
 
YAMZ Metadata Vocabulary Builder
John Kunze
 
EZID and N2T at CDL
John Kunze
 
YAMZ.net: better, faster, cheaper taxonomy building
John Kunze
 
A Vocabulary for Persistence
John Kunze
 
Identifiers obey Resolvers not Schemes
John Kunze
 
YAMZ: a cross-domain crowd-sourced metadata vocabulary
John Kunze
 
DataONE Preservation and Metadata Working Group Report 2014
John Kunze
 
Selected Bash shell tricks from Camp CDL breakout group
John Kunze
 
Annotating Research Datasets
John Kunze
 
The Data Management Ecosystem
John Kunze
 
Library Tools Supporting Data-Rich Research
John Kunze
 
Big Data's Long Tail
John Kunze
 
Pamwg 2012ahm
John Kunze
 
Scalable Identifiers for Natural History Collections
John Kunze
 
Future-Proofing the Web: What We Can Do Today
John Kunze
 
Supporting Data-Rich Research on Many Fronts
John Kunze
 
New Metaphors: Data Papers and Data Citations
John Kunze
 
Pairtrees for object storage
John Kunze
 
The BagIt file package format
John Kunze
 
Ad

Recently uploaded (20)

PPTX
Google SGE SEO: 5 Critical Changes That Could Wreck Your Rankings in 2025
Reversed Out Creative
 
PPTX
nagasai stick diagrams in very large scale integratiom.pptx
manunagapaul
 
PDF
APNIC Update, presented at PHNOG 2025 by Shane Hermoso
APNIC
 
PPTX
dns domain name system history work.pptx
MUHAMMADKAVISHSHABAN
 
PPTX
Generics jehfkhkshfhskjghkshhhhlshluhueheuhuhhlhkhk.pptx
yashpavasiya892
 
PPTX
Unlocking Hope : How Crypto Recovery Services Can Reclaim Your Lost Funds
lionsgate network
 
PDF
Centralized Business Email Management_ How Admin Controls Boost Efficiency & ...
XgenPlus Technologies
 
PPTX
ENCOR_Chapter_10 - OSPFv3 Attribution.pptx
nshg93
 
PPTX
B2B_Ecommerce_Internship_Simranpreet.pptx
LipakshiJindal
 
PPTX
SEO Trends in 2025 | B3AITS - Bow & 3 Arrows IT Solutions
B3AITS - Bow & 3 Arrows IT Solutions
 
PPTX
The Latest Scam Shocking the USA in 2025.pptx
onlinescamreport4
 
PDF
Project English Paja Jara Alejandro.jpdf
AlejandroAlonsoPajaJ
 
PPTX
Crypto Recovery California Services.pptx
lionsgate network
 
PPTX
ppt lighfrsefsefesfesfsefsefsefsefserrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrt.pptx
atharvawafgaonkar
 
PPTX
Microsoft PowerPoint Student PPT slides.pptx
Garleys Putin
 
PPT
Introduction to dns domain name syst.ppt
MUHAMMADKAVISHSHABAN
 
PPTX
Different Generation Of Computers .pptx
divcoder9507
 
PPTX
LESSON-2-Roles-of-ICT-in-Teaching-for-learning_123922 (1).pptx
renavieramopiquero
 
PPTX
Black Yellow Modern Minimalist Elegant Presentation.pptx
nothisispatrickduhh
 
PPTX
how many elements are less than or equal to a mid value and adjusts the searc...
kokiyon104
 
Google SGE SEO: 5 Critical Changes That Could Wreck Your Rankings in 2025
Reversed Out Creative
 
nagasai stick diagrams in very large scale integratiom.pptx
manunagapaul
 
APNIC Update, presented at PHNOG 2025 by Shane Hermoso
APNIC
 
dns domain name system history work.pptx
MUHAMMADKAVISHSHABAN
 
Generics jehfkhkshfhskjghkshhhhlshluhueheuhuhhlhkhk.pptx
yashpavasiya892
 
Unlocking Hope : How Crypto Recovery Services Can Reclaim Your Lost Funds
lionsgate network
 
Centralized Business Email Management_ How Admin Controls Boost Efficiency & ...
XgenPlus Technologies
 
ENCOR_Chapter_10 - OSPFv3 Attribution.pptx
nshg93
 
B2B_Ecommerce_Internship_Simranpreet.pptx
LipakshiJindal
 
SEO Trends in 2025 | B3AITS - Bow & 3 Arrows IT Solutions
B3AITS - Bow & 3 Arrows IT Solutions
 
The Latest Scam Shocking the USA in 2025.pptx
onlinescamreport4
 
Project English Paja Jara Alejandro.jpdf
AlejandroAlonsoPajaJ
 
Crypto Recovery California Services.pptx
lionsgate network
 
ppt lighfrsefsefesfesfsefsefsefsefserrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrt.pptx
atharvawafgaonkar
 
Microsoft PowerPoint Student PPT slides.pptx
Garleys Putin
 
Introduction to dns domain name syst.ppt
MUHAMMADKAVISHSHABAN
 
Different Generation Of Computers .pptx
divcoder9507
 
LESSON-2-Roles-of-ICT-in-Teaching-for-learning_123922 (1).pptx
renavieramopiquero
 
Black Yellow Modern Minimalist Elegant Presentation.pptx
nothisispatrickduhh
 
how many elements are less than or equal to a mid value and adjusts the searc...
kokiyon104
 

The ARK Alliance: 20 years, 850 institutions, 8.2 billion persistent identifiers, 2021-10-22

  • 1. The ARK Alliance: 20 years 850 institutions 8.2 billion persistent identifiers John Kunze, California Digital Library University of California Office of the President October 2021
  • 2. Digital preservation means Long term protection for digital resources ● from human error, natural disaster, legal challenge, deliberate attack, social upheaval, bankruptcy, etc. Long term access to those resources from unbroken links ● with persistent identifiers (PIDs), also known as permalinks
  • 3. Why persistent identifiers? Because of “link rot” (broken references, 404 Not Found) ● Reliable, unbroken web links (URLs) are rare ● The average URL lifetime is only 100 days But why not just search when you need a link? ● Because scholars and researchers take years to find their object references Common types of persistent identifiers ● PURL, Handle, URN, DOI, ARK
  • 4. A labelled URL with a globally unique identity inside it https://n2t.net/ark:/12345/fk1234 makes ARK actionable (the resolver) core globally unique identity (independent of web and hostname) What is an ARK (Archival Resource Key)?
  • 5. ARK anatomy https://example.org/ark:/12345/x54xz321/s3/f8.05v.tiff _________________/ __/ ___/ ______/____/_______/ | | | | | | | ARK Label | | Sub-parts Variants | | | Name Mapping Authority (NMA) | Assigned Name | Name Assigning Authority Number (NAAN)
  • 6. Why ARKs? Major causes of broken links, and some features PURL Handle URN DOI ARK Prevents fire, war, flood, attack, bankruptcy, ... No No No No No Prevents human error No No No No No Guarantees your links, or fixes them for you No No No No No Decentralized admin plus inferenceable syntax No No No No Yes Flexible metadata and persistence statements No No No No Yes Identifiers extensible during resolution Yes No Yes No Yes Free, non-paywalled, in unlimited numbers Yes No Yes No Yes
  • 7. Who is using ARKs? University of California Berkeley Smithsonian National Museum National Library of France University of Chicago Musée du Louvre Family Search British Library Google Internet Archive Caltech Archives Hawaii State Archives French National Archives Rockefeller Archive Center Library and Archives Canada Archives de la Ville de Genève Silent Film Sound & Music Archive • Libraries, data centers, archives, museums, publishers, government agencies, and vendors • Example institutions:
  • 8. What are ARKs used for? ● genealogical records (8 billion FamilySearch) ● publisher content (100 million Portico) ● scientific datasets and records (22 million INIST) ● scanned books and texts 30 million Internet Archive) ● bibliographic records (15 million BnF main catalog) ● museum specimens (15 million Smithsonian Institution) ● public health documents (15 million UCSF IDL) ● historical documents (21 million CDL, 5 million BnF Gallica) ● historical authors and scholars (4 million SNAC) ● fine art museum collections (483,000 Louvre) ● vocabulary terms (9,000 Periodo, YAMZ)
  • 9. ARK Alliance: 850 institutions and 8.2 billion ARKs in 20 years
  • 10. Home of the ARK Alliance arks.org Join one of our working groups: [email protected] Get started with ARKs by filling out: n2t.net/e/naan_request Stay in touch: ● Twitter: @arks_org ● Email forum (English): groups.google.com/group/arks-forum ● Email forum (French): framalistes.org/sympa/info/arks-forum-fr The ARK Alliance