Research Ideas and Outcomes :
Conference Abstract
|
Corresponding author: Paul De Geest (paul.degeest@psb.ugent.be), Frederik Coppens (frcop@psb.vib-ugent.be), Stian Soiland-Reyes (soiland-reyes@manchester.ac.uk), Ignacio Eguinoa (ignacio.eguinoa@psb.vib-ugent.be), Simone Leo (simone.leo@crs4.it)
Received: 21 Sep 2022 | Published: 12 Oct 2022
© 2022 Paul De Geest, Frederik Coppens, Stian Soiland-Reyes, Ignacio Eguinoa, Simone Leo
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
De Geest P, Coppens F, Soiland-Reyes S, Eguinoa I, Leo S (2022) Enhancing RDM in Galaxy by integrating RO-Crate. Research Ideas and Outcomes 8: e95164. https://doi.org/10.3897/rio.8.e95164
|
We introduce how the Galaxy research environment (
RO-Crate (
The RO-Crate community brings together practitioners from very different backgrounds, and with different motivations and use cases. Among the core target users are:
Given the wide applicability of RO-Crate and the lack of practical implementations of FDOs, ELIXIR (
Tools have been developed towards aiding the previously mentioned use cases and increasing the general usability of RO-Crates by providing a user-friendly (programmatic) interface for consumption and production of RO-Crates through programmatic libraries for consuming/producing RO-Crates (ro-crate-py
The Galaxy project provides a research environment with data analysis and data management functionalities as a multi user platform, aiming to make computational biology accessible to research scientists that do not have computer programming or systems administration experience. As such, it stores not just analysis related data but also the complete analytical workflow, including its metadata. The internal data model involves the history entity, including all steps performed in a specific analysis, and the workflow entity, defining the structure of an analytical pipeline. From the start, Galaxy aims to enable reproducible analyses by providing capabilities to export (and import) all the analysis history details and workflow data and metadata in a FAIR way. As such it helps its users with the daily research data management. The Galaxy community is continuously improving and adding features, the integration of the FAIR Digital Object principles is a natural next step in this.
To be able to support these FDOs, Galaxy leverages the RO-Crate Python client library (
The adoption of RO-crate by Galaxy allows a standardised exchange of FDOs with other platforms in the ELIXIR Tools ecosystem, such as WorkflowHub and LifeMonitor. Integrating RO-Crate deeply into Galaxy and offering import and export options of various Galaxy objects such as Research Objects allows for increased standardisation, improved Research Data Management (RDM) functionalities, smoother user experience (UX) as well as improved interoperability with other systems. The integration in a platform used by biologists to do data intensive analysis, facilitates the publication of workflows and workflow invocations for all skill levels and democratises the ability to perform Open Science.
FAIR digital object, workflow, exchange format, packaging format, metadata description, FAIR, reproducible data analysis
Paul De Geest
First International Conference on FAIR Digital Objects, poster