Research Ideas and Outcomes :
Project Report
|
Corresponding author: Niels Raes (nlbif@naturalis.nl)
Received: 22 Dec 2020 | Published: 30 Dec 2020
© 2020 Niels Raes, Ana Casino, Hilary Goodson, Sharif Islam, Dimitrios Koureas, Edmund Schiller, Leif Schulman, Laura Tilley, Tim Robertson
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Raes N, Casino A, Goodson H, Islam S, Koureas D, Schiller EK, Schulman L, Tilley L, Robertson T (2020) White paper on the alignment and interoperability between the Distributed System of Scientific Collections (DiSSCo) and EU infrastructures - The case of the European Environment Agency (EEA). Research Ideas and Outcomes 6: e62361. https://doi.org/10.3897/rio.6.e62361
|
The Distributed System of Scientific Collections (DiSSCo) Research Infrastructure (RI) is presently in its preparatory phase. DiSSCo is developing a new distributed RI to operate as a one-stop-shop for the envisaged European Natural Science Collection (NSC) and all its derived information. Through mass digitisation, DiSSCo will transform the fragmented landscape of NSCs, including an estimated 1.5 billion specimens, into an integrated knowledge base that will provide interconnected evidence of the natural world. Data derived from European NSCs underpin countless discoveries and innovations, including tens of thousands of scholarly publications and official reports annually (supporting legislative and regulatory processes on sustainability, environmental change, land use, societal infrastructure, health, food, security, etc.); base-line biodiversity data; inventions and products essential to bio-economy; databases, maps and descriptions of scientific observations; educational material for students; and instructive and informative resources for the public. To expand the user community, DiSSCo will strengthen capacity building across Europe for maximum engagement of stakeholders in the biodiversity-related field and beyond, including industry and the private sector, but also policy-driving entities. Hence, it is opportune to reach out to relevant stakeholders in the European environmental policy domain represented by the European Environment Agency (EEA). The EEA aims to support sustainable development by helping to achieve significant and measurable improvement in Europe's environment, through the provision of timely, targeted, relevant and reliable information to policy-making agents and the public. The EEA provides information through the European Environment Information and Observation System (Eionet). The aim of this white paper is to open the discussion between DiSSCo and the EEA and identify the common service interests that are relevant for the European environmental policy domain. The first section describes the significance of (digital) Natural Science Collections (NHCs). Section two describes the DiSSCo programme with all DiSSCo aligned projects. Section three provides background information on the EEA and the biodiversity infrastructures that are developed and maintained by the EEA. The fourth section illustrates a number of use cases where the DiSSCo consortium sees opportunities for interaction between the DiSSCo RI and the Eionet portal of the EEA. Opening the discussion with the EEA in this phase of maturity of DiSSCo will ensure that the infrastructural design of DiSSCo and the development of e-Services accommodate the present and future needs of the EEA and assure data interoperability between the two infrastructures.
The aim of this white paper is to present benefits from identifying the common service interests of DiSSCo and the EEA. A brief introduction to natural science collections as well as the two actors is given to facilitate the understanding of the needs and possibilities in the alignment of DiSSCo with the EEA.
Science policy interface, DiSSCo, EEA, interoperability, Research Infrastructure
European natural science collections (NSCs) are an integral and important part of the global natural and cultural capital. They include an estimated 1.5 billion animals, plants, fossils, rocks, minerals and meteorites, which account for 55% of the NSCs globally (
The last two decades have seen a rapid growth in the digitisation of NSCs, as well as the mobilisation of existing digital NSC data stored in local collection management systems of Natural History Museums (NHMs) and herbaria to the public domain (
Historically, the primary function of NSCs has been a record of biological and geological diversity and as references for taxonomists and geologists (
Together with the ongoing digitisation of NSC specimens and the growing body of digital specimen information, new techniques emerge that automate the extraction of morphological traits and measurements of size (especially for plants as herbarium vouchers are more easy to digitised than others NSC groups of organisms;
The richness of information captured by NSCs makes them stand out in quality and potential from human and machine observations on biodiversity in space and time shared through data portals like GBIF, and urge the need for mass digitisation of NSCs. The potential of NSCs by persistently linking all derived information to the physical specimens (from where the information was obtained), along with the recognition of the urgent need for mass digitisation of NSCs (
DiSSCo*
The DiSSCo consortium represents the largest ever formal agreement between Natural History Museums (NHMs), botanical gardens and collection-holding universities from Europe, currently representing 120 institutions from 21 countries. The DiSSCo consortium is still growing and open for new institutes to join by signing the Memorandum of Understanding (MoU) of DiSSCo. Collectively the DiSSCo consortium hosts 80% of the world's known bio- and geodiversity represented by the 1.5 billion specimens that are preserved in its NSCs. By bringing together NSCs institutions at this scale and combining earlier investments in data interoperability practices with technological advancements in digitization, cloud services and semantic linking of all derived data, DiSSCo makes all data from NSCs available as one virtual data cloud in association with a wide range of end-user services. These include finding data, accessing data, using data, and improving and updating data (Hardisty et al. 2020). DiSSCo connects historical NSC data with data emerging from new techniques including DNA barcodes, partial and whole genome sequences, proteomics and metabolomics data, chemical data, (morphological) trait data, phenological data and imaging data (Computer-assisted Tomography (CT), Synchrotron, etc.) to name but a few (See Section 1;
Although the final product upon completion will be the deployment of the DiSSCo RI, at present DiSSCo is a programme of strategically aligned EU-funded projects with collaborations from relevant organisations, such as CETAF*
DiSSCo started in 2014 with the DiSSCo design study, which resulted in the submission of DiSSCo to the ESFRI roadmap where it was accepted in 2018*
During the same period, CoL and GBIF embarked on the CoL+ project which develops a new IT architecture to compile the most comprehensive and authoritative global index of species currently available, and which is going to serve as the backbone taxonomy for GBIF, DiSSCo, and others, among which the EEA.
In 2019, the SYNTHESYS+*
The next project under the DiSSCo programme is the MOBILISE*
After acceptance of DiSSCo on the ESFRI roadmap in 2018, the DiSSCo Prepare project*
Under the aligned projects and in the initial phase of the DiSSCo Prepare project, major advances have been made regarding a) the definition of digital specimens through the openDS standard (
A key aspect of the DiSSCo Prepare project is the identification of stakeholders that may envisage data from national history collections as a foundational layer of bio and geo-diversity reference data in the environmental domain. The European Environment Agency (EEA) is identified as one of the main European advisory bodies and stakeholders for the DiSSCo RI in terms of providing assessments, tooling up policy-makers, and improving the analysis, development, adoption, and implementation of European policies. The European environmental policies are outlined in the recently published EC biodiversity strategy 2030 (
The European Environment Agency*
Eionet*
ETCs*
From the above seven ETCs that provide input to the EEA, the ETC on Biological Diversity (ETC/BD) is most closely related to the activities the DiSSCo RI and data it will be serving. The ETC/BD has the tasks of:
a) assisting the EEA in reporting on Europe's environment by addressing the state and trends of biodiversity in Europe,b) providing relevant information to support the implementation of environmental and sustainable development policies in Europe in particular for EU nature and biodiversity policies (DG Environment: Nature and Biodiversity*
The activities of the ETC/BD are instrumental for the implementation of the EU nature directives, Natura 2000 areas and protected areas in general via the common database on designated areas (CDDA) and the EMERALD network (areas beyond EU27), and all related data flows and reporting formats. The ETC/BD is responsible for the reporting cycle of the Nature Directives including its assessment resulting in the ‘State of Nature reports’ (
BISE - Biodiversity System for Europe - is a single entry point for data and information on biodiversity supporting the implementation of the EU biodiversity strategy (
EUNIS - European Nature Information System - brings together European data from several databases and organisations into three interlinked modules on:
a) sites,b) species, andc) habitat types.
EUNIS is a reference information system for anyone working in ecology and conservation, or those with an interest in the natural world. It is also used for assistance to the Natura 2000 process (EU Nature Directives including the EU Birds and Habitats Directives) and is coordinated with the related EMERALD Network of the Bern Convention (
This white paper serves the purpose to characterise DiSSCo and the EEA to facilitate mutual understanding and identify connections between the data portal Eionet and all its components and DiSSCo, an integrated knowledge base that provides interconnected hard evidence of the natural world. As DiSSCo represents the largest ever formal agreement between natural history museums, botanic gardens and collection-holding universities in the world, it is an ideal platform for transdisciplinary interaction and dialogue that can provide input to the European community to make informed decisions about improving the environment, integrating environmental considerations into economic policies and moving towards sustainability. DiSSCo’s goals and vision also should align with broader regional and global strategic initiatives such as the recently published European Biodiversity Strategy 2030 (
DiSSCo, at its present level of maturity, has to make design and infrastructural decisions to facilitate the data infrastructure where all information generated through mass-digitisation of the European NSCs will be findable, accessible, interoperable and reusable (FAIR). As DiSSCo is still in the preparatory phase, modifications and adjustments to the infrastructural design are still possible to assure that the future needs of the European environmental policy domain are accommodated. Below we provide examples for a number of scientific domains with high policy relevance to the EEA. By providing these examples we aim to open further discussions and identify additional use cases of the DiSSCo infrastructure that can support the aims of the European Union set out in the European Biodiversity Strategy 2030 and its environmental policy.
Both EUNIS and DiSSCo (along with GBIF, Encyclopedia of Life (EOL), Biodiversity Heritage Library (BHL), LifeWatch, and the Barcode of Life) aim to adopt the new Catalogue of Life (CoL) as their backbone taxonomy. Through the shared use of the taxonomic name identifiers, EUNIS will gain access to a wealth of information related to specimens stored in the European NSCs. In figure 3, the CoL ID is depicted in red. The example of a (simple) Digital Specimen demonstrates how linked information on the type specimen (A single specimen selected by a scientist to define in future what distinguishes e.g. a certain species from all other), synonyms, spatial distribution, environmental conditions, phenology, morphology and traits (from images or measurements stored in linked data infrastructures like TRY (
Linking EUNIS with DiSSCo through CoL provides hard evidence on the past and present distributions of species that are protected under the European Nature directives and potentially on the spread of Invasive Alien Species (IAS). Combination of specimen data with spatial environmental data through niche modelling techniques will allow assessing the invasive potential of IAS or the chance of survival in certain habitats under future climate change conditions. The analysis of phenological information might indicate their suitability as a nectar source for the pollinator community of crops, e.g. their suitability for use in flower borders of pollinator-dependent crop lands (
To assess the effectiveness of the EU nature restoration plan outlined in the EU biodiversity strategy 2030 (
Restoration action may involve planting or releasing new individuals. The choice on which genetic resources are used for this purpose may define the restoration success. The collections provide information on the environmental boundaries or ecological niche conditions under which species occur through their georeferences. In anticipation of predicted effects of climate change, genetic material should be obtained from populations that are optimally adapted to (near) future climatic conditions in the area of restoration. This is especially relevant for sessile and long-lived organisms like trees (
The use of environmental DNA or eDNA in future biodiversity monitoring is expected to rapidly increase, especially for aquatic and soil communities (
With the ongoing changes in the global climate, it is likely that species will shift their distribution range to track suitable climatic conditions. Predicting range shifts of species under climate change requires understanding of the bioclimatic conditions and biotic interactions that contribute to the success of the species. Information from NSC specimens is instrumental to assess the required conditions through niche modelling or species distribution modelling. Individuals at the trailing edges of their distribution are likely to disappear from local communities. With that they may leave gaps in the functional trait distribution of these communities. Information on the traits that these species represent might allow selecting other species adapted to climate change, which can fill this gap and restore the ecosystem services provided by this local community (
DiSSCo is built on three pillars, 1) development of the IT architecture that accommodates the storage and linking of all derived information, 2) mass digitisation of the European NSCs, and 3) capacitation of the user community of the DiSSCo-RI. Opening the communication channels with the EEA in this phase of development of the DiSSCo RI is instrumental to accommodate the future use of the DiSSCo RI and its service catalogue by users from the environmental policy domain. Tailor-made documentation and helpdesk support on the DiSSCo RI requires information about the background knowledge and IT skills of users in the environmental policy domain represented by the EEA. We hope that this white paper contributes to open the discussion to identify the needs of users in the environmental policy domain so that the infrastructural design of the DiSSCo RI can accomodate future user demands.
Horizon 2020
H2020-INFRAIA-2018-2020 | SYNTHESYS PLUS | Grant Agreement Number 823827
None