TOWARDS A WEB-ENABLED GEO-SAMPLE WEB: AN OPEN SOURCE
RESOURCE REGISTRATION AND MANAGEMENT SYSTEM FOR CONNECTING
GEO-SAMPLES TO THE WEB
Anusuriya Devaraju (1), Jens Klump (1), Victor Tey (1), Simon Cox (2) and Ryan Fraser (1)
(1) Mineral Resources, (2) Land and Water
International Conference for Free and Open Source Software for Geospatial (FOSS4G 2017), August 14-19, 2017, Boston, MA.
Presentation Outline
• Challenges in sample identification and discovery
• International Geo Sample Number (IGSN)
• IGSN Implementation in CSIRO
• Applications
• Lessons Learned and Conclusions
Challenges
• Inconsistent sample cataloguing practices.
• The lack of online catalogues for discovering physical samples.
The EarthChem data portal returns 71 samples called ‘A-1’.
Different names represent the same rock sample in PetDB (Petrological Database).
Towards a Web-enabled Geo-sample Web: An Open Source Resource Registration and Management System for Connecting Geo-Samples to the Web | Anusuriya Devaraju et al.
Examples
International Geo Sample Number (IGSN)
• Globally unique, persistent identifiers for physical samples (e.g., rocks, water, bio specimens), sample collections, and sampling features (e.g., boreholes).
• IGSN ensures the preservation of, and access to, sample metadata.
Example: IGSN:GRO000076, http://hdl.handle.net/10273/GRO000076 (Fraser River water sample; image source: @igsn_info)
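The example above pairs an IGSN with a handle URL. A minimal sketch of that mapping follows, assuming every IGSN resolves under the same handle prefix (10273) seen in the slide's example; the function name is illustrative and not part of any IGSN library.

```python
HANDLE_PROXY = "http://hdl.handle.net"
IGSN_HANDLE_PREFIX = "10273"  # assumption: prefix copied from the example URL on the slide


def igsn_to_handle_url(igsn: str) -> str:
    """Turn 'IGSN:GRO000076' (or plain 'GRO000076') into a handle URL."""
    code = igsn.strip()
    # Accept an optional, case-insensitive 'IGSN:' label in front of the code.
    if code.upper().startswith("IGSN:"):
        code = code[5:].strip()
    if not code:
        raise ValueError("empty IGSN")
    return f"{HANDLE_PROXY}/{IGSN_HANDLE_PREFIX}/{code}"


if __name__ == "__main__":
    print(igsn_to_handle_url("IGSN:GRO000076"))
```

The function only builds the URL; whether the handle actually resolves depends on the registration being live.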
IGSN System Overview
• Clients send sample registrations to an allocating agent; the allocating agent forwards the registrations to the IGSN e.V.
• The Australian IGSN allocating agents are CSIRO, Geoscience Australia, and Curtin University.
(Figure: Hierarchical architecture of the IGSN)
• Clients (e.g., individual users and laboratories) send IGSN registration requests to the Allocating Agent (e.g., CSIRO’s IGSN Registration Service). These requests are specified based on the description schema developed by the agent.
• The Allocating Agent sends IGSN registration requests to the IGSN Implementation Organization (IGSN e.V.), which runs the International IGSN Registration Service. These requests are specified based on the registration schema developed by the IGSN e.V.
• The IGSN e.V. is connected to the Handle.net Registry.
• The Allocating Agent acts as an OAI Data Provider. The National IGSN Sample Catalogue acts as an OAI Service Provider: it sends an HTTP request (OAI verb) and receives an HTTP response (valid XML).
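The harvesting exchange in the figure (an HTTP request carrying an OAI verb, answered with valid XML) can be sketched as below. This is an illustration under stated assumptions, not the catalogue's actual harvester: the six verbs and the `oai_dc` metadata prefix come from the OAI-PMH 2.0 protocol, while the base URL `https://example.org/oai` and the response body are made up for the example.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

OAI_NS = "{http://www.openarchives.org/OAI/2.0/}"
OAI_VERBS = {"Identify", "ListMetadataFormats", "ListSets",
             "ListIdentifiers", "ListRecords", "GetRecord"}

# Made-up response, shaped like an OAI-PMH ListIdentifiers answer.
SAMPLE_RESPONSE = """
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListIdentifiers>
    <header>
      <identifier>oai:example.org:CSRWASC00630</identifier>
      <datestamp>2017-05-31</datestamp>
    </header>
  </ListIdentifiers>
</OAI-PMH>
"""


def build_oai_request(base_url: str, verb: str, **params: str) -> str:
    """Build the GET URL a service provider sends to a data provider."""
    if verb not in OAI_VERBS:
        raise ValueError(f"unknown OAI-PMH verb: {verb}")
    return f"{base_url}?{urlencode({'verb': verb, **params})}"


def record_identifiers(response_xml: str) -> list[str]:
    """Extract the header identifiers from an OAI-PMH XML response."""
    root = ET.fromstring(response_xml)
    return [el.text for el in root.iter(f"{OAI_NS}identifier")]


if __name__ == "__main__":
    print(build_oai_request("https://example.org/oai", "ListIdentifiers",
                            metadataPrefix="oai_dc"))
    print(record_identifiers(SAMPLE_RESPONSE))
```

The sketch stops short of any network call so that it runs offline; a real harvester would also need to follow resumption tokens for long result lists.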
Metadata Schemas
(Figure: the IGSN-CSIRO Description Schema and the IGSN Registration Schema.)
IGSN Implementation in CSIRO
• CSIRO became a member of IGSN in 2013, and the use of IGSN was initiated by
CSIRO Mineral Resources.
• IGSNs are currently used for the following sample data repositories:

Sample Data Repositories | Material Types | Registrations (as at 31.05.2017)
Repository of the Australian Resources Research Centre (ARRC) | rock, mineral, soil | 25652
Capricorn Distal Footprints | rock, vegetation, water, regolith | 4232
Reflectance Spectra Reference Libraries | mineral, rock, synthetic material | 94

• Future use cases are the National Soil Archive and the Australian National Insect Collection.
Identifier Governance in CSIRO
• Different namespace strategies may be used by other allocating agents.
• Following the IGSN Technical Specification, we established rules for allocating
IGSNs in different parts of the organisation.
Example: IGSN: CSRWASC00630. The identifier has three parts, in order:
1. Allocating agent namespace (‘CS’ represents CSIRO)
2. Sub-namespace of a client
3. Sample code specified by the client
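The three-part structure can be sketched as follows. The slide does not say where the sub-namespace ends and the sample code begins in CSRWASC00630, so the sub-namespace 'RWA' used here is purely hypothetical, as are the function names. Because the boundary cannot be read off the string alone, the split relies on a set of registered sub-namespaces.

```python
from typing import NamedTuple

AGENT_NAMESPACE = "CS"  # from the slide: 'CS' represents CSIRO


class IgsnParts(NamedTuple):
    agent: str
    sub_namespace: str
    sample_code: str


def compose_igsn(sub_namespace: str, sample_code: str) -> str:
    """Concatenate agent namespace, client sub-namespace and sample code."""
    return f"{AGENT_NAMESPACE}{sub_namespace}{sample_code}".upper()


def split_igsn(igsn: str, registered_sub_namespaces: set[str]) -> IgsnParts:
    """Split an IGSN using the known sub-namespaces (longest match first)."""
    igsn = igsn.upper()
    if not igsn.startswith(AGENT_NAMESPACE):
        raise ValueError(f"{igsn} is not in the {AGENT_NAMESPACE} namespace")
    rest = igsn[len(AGENT_NAMESPACE):]
    # Longest first, so 'RWA' wins over 'RW' when both are registered.
    for sub in sorted(registered_sub_namespaces, key=len, reverse=True):
        if rest.startswith(sub) and len(rest) > len(sub):
            return IgsnParts(AGENT_NAMESPACE, sub, rest[len(sub):])
    raise ValueError(f"no registered sub-namespace matches {igsn}")


if __name__ == "__main__":
    # 'RWA' and 'RW' are hypothetical sub-namespaces for illustration only.
    print(compose_igsn("RWA", "SC00630"))
    print(split_igsn("CSRWASC00630", {"RW", "RWA"}))
```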
System Implementation
A Web Front-End for Sample Registrations
• Prefix and user registration can be performed via the user interface from the Registrant menu option.
• Individual users may register their samples through the GUI.
CSIRO-IGSN Description Metadata Schema
• Built upon the DataCite Metadata Schema (v4.0).
• An XML Schema that represents the core concepts of physical samples:
  • Sample identification, e.g., identifier, alternate identifiers, title, visibility, material, specimen type.
  • Sampling activity, e.g., location, time, purpose, method.
  • Sample curation, e.g., curator, curation location and time.
  • Related resources, e.g., publications, datasets.
• Supports batch sample registrations and has minimal restrictions on mandatory
metadata elements.
• Captures the provenance of sample curation, and the relations between
samples and related resources.
• Flexible in terms of representing spatial and temporal information.
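To make the grouping of concepts concrete, here is a rough sketch of a batch of sample descriptions serialised to XML. The element names (`samples`, `sample`, `materialType`, `relatedResource` and so on) are invented for illustration and are not the element names of the CSIRO-IGSN Description Schema; the titles, the second identifier and the DOI are placeholders. Only the identifier and title are mandatory here, which mirrors the "minimal restrictions on mandatory metadata elements" point in spirit only.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class SampleRecord:
    identifier: str                       # sample identification
    title: str
    material_types: list[str] = field(default_factory=list)
    sampling_method: Optional[str] = None  # sampling activity
    curator: Optional[str] = None          # sample curation
    # (relation type, target identifier) pairs for related resources
    related_resources: list[tuple[str, str]] = field(default_factory=list)


def to_element(rec: SampleRecord) -> ET.Element:
    """Serialise one record; optional fields are left out when empty."""
    el = ET.Element("sample")
    ET.SubElement(el, "identifier").text = rec.identifier
    ET.SubElement(el, "title").text = rec.title
    for material in rec.material_types:
        ET.SubElement(el, "materialType").text = material
    if rec.sampling_method:
        ET.SubElement(el, "samplingMethod").text = rec.sampling_method
    if rec.curator:
        ET.SubElement(el, "curator").text = rec.curator
    for relation, target in rec.related_resources:
        ET.SubElement(el, "relatedResource", relationType=relation).text = target
    return el


def batch_to_xml(records: list[SampleRecord]) -> str:
    """Wrap several records in one document (a batch registration)."""
    root = ET.Element("samples")
    for rec in records:
        root.append(to_element(rec))
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    batch = [
        SampleRecord("CSRWASC00630", "Placeholder title", ["rock"],
                     related_resources=[("hasDocument", "doi:10.0000/placeholder")]),
        SampleRecord("CSRWASC00631", "Second placeholder"),  # hypothetical IGSN
    ]
    print(batch_to_xml(batch))
```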
SKOS-based Controlled Vocabularies
Vocabularies | Examples | Service | Provider
Material types | soil, rock, vegetation | Master Controlled Vocabulary Registry for ODM2 | CUAHSI
Specimen types | thin section, grab, dredge, cuttings | Master Controlled Vocabulary Registry for ODM2 | CUAHSI
Nil-reason types | missing, unknown, withheld | OGC definitions of nil reasons | CSIRO
Contributor types | originator, custodian, point of contact | Linked Data Registry | CSIRO
*Registration types | physical sample, sample collection, sampling features | ANDS Research Vocabularies Australia | CSIRO & Geoscience Australia
*Identifier types | DOI, IGSN, LSID, ORCID | ANDS Research Vocabularies Australia | CSIRO & Geoscience Australia
*Relation types | isDerivedFrom, hasDocument, hasDigitalRepresentation | ANDS Research Vocabularies Australia | CSIRO & Geoscience Australia
A list of existing and newly developed SKOS vocabularies. New vocabularies are indicated by asterisks (*).
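A controlled vocabulary is useful mainly because metadata values can be checked against it. The sketch below shows that check with the example terms from the table hard-coded. These are only the slide's examples, not the complete vocabularies, and the field names are hypothetical; a real client would presumably fetch the terms from the vocabulary services rather than embed them.

```python
# Example terms copied from the table; NOT the complete vocabularies.
VOCABULARIES = {
    "materialType": {"soil", "rock", "vegetation"},
    "specimenType": {"thin section", "grab", "dredge", "cuttings"},
    "nilReason": {"missing", "unknown", "withheld"},
    "identifierType": {"DOI", "IGSN", "LSID", "ORCID"},
    "relationType": {"isDerivedFrom", "hasDocument", "hasDigitalRepresentation"},
}


def invalid_terms(record: dict[str, str]) -> dict[str, str]:
    """Return the fields whose value is not in the matching vocabulary.

    Fields without a vocabulary (e.g. a free-text title) are ignored.
    """
    return {
        name: value
        for name, value in record.items()
        if name in VOCABULARIES and value not in VOCABULARIES[name]
    }


if __name__ == "__main__":
    record = {"title": "free text", "materialType": "rock", "specimenType": "core"}
    print(invalid_terms(record))
```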
CSIRO-IGSN Allocating Agent Service
• Sample registrations must be specified based on the CSIRO-IGSN Description Schema.
• A RESTful web service that enables clients to:
  • Register a sub-namespace
  • Get all registered sub-namespaces
  • Register samples
  • Get the metadata of a sample or a list of samples
  • Mark a sample's metadata as deprecated
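The five operations can be modelled in memory to show how they relate. This is a toy model only: the class and method names are invented, no HTTP is involved, and nothing here reflects the real service's endpoints or payloads. One behaviour it illustrates is that deprecation flags a record rather than deleting it, which is what one would expect for persistent identifiers.

```python
class AllocatingAgentModel:
    """In-memory stand-in for the operations listed above (hypothetical API)."""

    def __init__(self, agent_namespace: str = "CS") -> None:
        self.agent_namespace = agent_namespace
        self._sub_namespaces: set[str] = set()
        self._samples: dict[str, dict] = {}

    def register_sub_namespace(self, sub_namespace: str) -> None:
        self._sub_namespaces.add(sub_namespace.upper())

    def list_sub_namespaces(self) -> list[str]:
        return sorted(self._sub_namespaces)

    def register_sample(self, sub_namespace: str, sample_code: str,
                        metadata: dict) -> str:
        sub = sub_namespace.upper()
        if sub not in self._sub_namespaces:
            raise KeyError(f"sub-namespace {sub} is not registered")
        igsn = f"{self.agent_namespace}{sub}{sample_code.upper()}"
        if igsn in self._samples:
            raise ValueError(f"{igsn} is already registered")
        self._samples[igsn] = {"metadata": dict(metadata), "deprecated": False}
        return igsn

    def get_metadata(self, *igsns: str) -> dict[str, dict]:
        """Look up one sample or several at once."""
        return {igsn: self._samples[igsn] for igsn in igsns}

    def deprecate(self, igsn: str) -> None:
        # The record stays retrievable; only its status changes.
        self._samples[igsn]["deprecated"] = True


if __name__ == "__main__":
    service = AllocatingAgentModel()
    service.register_sub_namespace("RWA")  # hypothetical sub-namespace
    igsn = service.register_sample("RWA", "SC00630", {"title": "placeholder"})
    service.deprecate(igsn)
    print(service.get_metadata(igsn))
```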
Applications
Tracking samples from the field to the sample repository.
IGSN pre-allocation in mobile field data acquisition systems.
Images courtesy of Reid, N. (2015), Golodoniuc, P. (2017)
IGSN Applications
• Smart labels for sample inventory management.
• Discover samples and collections from other repositories through a common Australia Geo Sample Portal.
IGSN Applications
Reference samples by using IGSNs in publications, e.g., reports, articles, and grant proposals.
Publication URL: http://dx.doi.org/10.1016/j.gca.2013.08.001
IGSN Applications
Cross-reference between the registered samples and their related resources (sub-samples, datasets, publications).
(Figure: a specimen (IGSN) cross-referenced with a spectrum (DOI) and a publication (DOI).)
Lessons Learned
• We ensure the global uniqueness of registered identifiers within the organization through IGSN namespace governance.
• Integration of identifiers into new and existing systems:
  • New sampling campaigns: adopt IGSN at an early stage of the sampling activity.
  • Existing sample curation systems: prepend the IGSN namespace to the local sample identifiers.
• What can be identified with IGSNs? We used IGSNs to identify not only physical samples but also sample collections and sampling features.
• Outreach, e.g., talks, meetings, and workshops.
• Documentation and public repositories.
• National collaboration, i.e., a joint project with the other allocating agents in Australia (e.g., Geoscience Australia and Curtin University).
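The advice for existing curation systems, prepending the IGSN namespace to local identifiers, can be sketched as a small migration step. The normalisation rule (keep only letters and digits, upper-case them) and the namespace 'CSRWA' are assumptions made for this example, not rules taken from the IGSN Technical Specification. The collision check matters because two local names that differ only in punctuation would otherwise end up sharing one identifier.

```python
import re


def to_igsn(local_id: str, namespace: str) -> str:
    """Prepend the namespace to a normalised local identifier."""
    # Assumption: keep only ASCII letters and digits, then upper-case.
    code = re.sub(r"[^A-Za-z0-9]", "", local_id).upper()
    if not code:
        raise ValueError(f"{local_id!r} has no usable characters")
    return f"{namespace}{code}"


def migrate(local_ids: list[str], namespace: str) -> dict[str, str]:
    """Map each local identifier to an IGSN, refusing collisions."""
    mapping: dict[str, str] = {}
    owner: dict[str, str] = {}  # IGSN -> local identifier that claimed it
    for local_id in local_ids:
        igsn = to_igsn(local_id, namespace)
        if igsn in owner and owner[igsn] != local_id:
            raise ValueError(
                f"{local_id!r} and {owner[igsn]!r} both map to {igsn}")
        owner[igsn] = local_id
        mapping[local_id] = igsn
    return mapping


if __name__ == "__main__":
    # 'CSRWA' is a hypothetical agent + client namespace.
    print(migrate(["A-1", "B 2"], "CSRWA"))
```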
Conclusions
• We described the implementation of IGSN to identify and publish physical
samples and sample collections in CSIRO.
• This serves as a working example for implementing IGSN in a large organization.
• Benefits of implementing IGSN in CSIRO:
  • Identify samples unambiguously and discover them easily.
  • Avoid duplicate sampling activities.
  • Promote re-use of the samples for new purposes.
• The use of IGSN in CSIRO may be expanded to other collections and operated as
a central service by Information Management and Technology (IM&T) as part of
CSIRO’s National Facilities and Collections.
Related Links
Components | Link
CSIRO Allocating Agent Service | https://igsn.csiro.au/igsn30/api
Description Metadata Schema | https://igsn.csiro.au/schemas/3.0/
Metadata Store (source) | https://github.com/AuScope/igsn30/tree/master/sql
CSIRO-IGSN OAI-PMH Data Provider | https://igsn.csiro.au/igsn30/api/service/30/oai
National IGSN Web Portal | http://igsn.org.au
OAI-PMH Harvester and National IGSN Web Portal (source repositories) | https://github.com/AuScope/NatPortalIGSN
Sample Registration and Management GUI | https://igsn.csiro.au/igsn30
A list of the components developed and their links.
Mineral Resources
Anusuriya Devaraju
Postdoctoral Fellow
E anusuriya.devaraju@csiro.au
Mineral Resources
Jens Klump
Science Leader Earth Science Informatics
E jens.klump@csiro.au
Mineral Resources
Victor Tey
Senior Software Engineer
E victor.tey@csiro.au
Land and Water
Simon Cox
Chief Research Scientist
E simon.cox@csiro.au
Mineral Resources
Ryan Fraser
Research Manager
E ryan.fraser@csiro.au
MINERAL RESOURCES
