Research Ideas and Outcomes :
Conference Abstract
|
Corresponding author: Sharif Islam (sharif.islam@naturalis.nl)
Received: 22 Aug 2022 | Published: 25 Aug 2022
© 2022 Sharif Islam, Claus Weiland, Wouter Addink
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Islam S, Weiland C, Addink W (2022) From data pipelines to FAIR data infrastructures: A vision for the new horizons of bio- and geodiversity data for scientific research. Research Ideas and Outcomes 8: e93816. https://doi.org/10.3897/rio.8.e93816
|
Natural science collections are vast repositories of bio- and geodiversity specimens. These collections, originating from natural history cabinets or expeditions, are increasingly becoming unparalleled sources of data facilitating multidisciplinary research (
The scope and practice of specimen gathering are also evolving. The term extended specimen was coined to refer to the specimen and associated data extending beyond the singular physical object to other physical or digital entities such as chemical composition, genetic sequence data or species data. Thus the specimen becomes an interconnected network of data resources that have incredible potential to enhance integrative and data-driven research (
Thus the data elements are not just records or rows in a database or data pipelines going from one repository to another. They have the potential to become self-describing digital artefacts that can revolutionise how machines interpret and work with specimen data. Within this context, the Distributed System of Scientific Collections (DiSSCo), a new European Research Infrastructure for natural science collections, envisions an infrastructure based on FAIR Digital Objects (FDO) that can unify more than 170 European natural science collections under common and FAIR-compliant (Findable, Accessible, Interoperable, Reusable) (
The main purpose of this talk is to explain the vision of how FAIR and FDO can create a data infrastructure that can not only take advantage of existing databases and repositories but at the same time provide support for innovative services such as AI and digital twinning. With scientific use cases in mind, the talk will highlight a few key FAIR and FDO components (persistent identifiers, metadata, ontologies) within the collaborative modelling activity of Digital Specimen specification. These components provide the template for specifying how a Digital Specimen should look so DiSSCo can build a FAIR service ecosystem based on FDOs (
FAIR data infrastuctures, biodiveristy data, interdisciplinarity, digital specimen, digital twinning, FAIR Digital Objects, DiSSCo
Sharif Islam
First International Conference on FAIR Digital Objects, presentation
H2020-INFRADEV-2019-2020 – Grant Agreement No. 871043
DiSSCo Prepare