Emerging FAIR Ecosystem(s): A Practical Perspective

There is broad acceptance that the reuse of FAIR data (Wilkinson et al. 2016) is desirable, and considerable interest and energy are being devoted to its realization, but many questions remain on the part of prospective implementers. Fundamental to explaining how best to implement FAIR is an overview of how the many FAIR models (and the technologies that support them) fit together into a coherent "FAIR ecosystem". How do FAIR Implementation Profiles (FIPs) (Schultes et al. 2020) relate to FAIR Data Points (FDPs) (Bonino da Silva Santo et al. 2022), and how do these relate to the concept of FAIR Digital Objects (FDOs) (Hudson et al. 2018)? What is their relationship to other diverse FAIR resources and digital assets (metadata, datasets, repositories, and the complex web of services that run them)? How are these novel and legacy systems intended to interoperate? These questions are often encountered by those involved in the growing number of projects looking at FAIR implementation (ENVRI-FAIR 2019, CODATA 2022, SciDataCon 2022, Seoul Korea 2022b, Health-Holland Project 2021, Health-Holland Project 2022, Swiss Personalized Health Network 2022, Maxwell 2021, VODAN 2020). Such an overview could also inform the development of specifications for the different models involved in a FAIR ecosystem, such as FIPs, FDPs, and the machine-actionable description of digital resources (data and services). As we move toward the specification of the many components of the FAIR ecosystem, it seems only common sense to have an agreed roadmap.

This presentation looks at questions arising from FAIR implementation across various settings, and proposes a view of the overall ecosystem which could be agreed and communicated to prospective implementers. It suggests how the various artefacts being discussed in the FAIR community today (FIPs, FDPs, FDOs, and other digital assets) relate to one another, and looks at how these can be connected to the business layer to support the development of services and applications within the envisioned FAIR ecosystem. Notably, this includes how the Cross-Domain Interoperability Framework (CDIF) being developed through the WorldFAIR project can connect to the underlying FAIR ecosystem in practical terms (Weise et al. 2022).
The presentation will address high-level considerations around the major technology components of a FAIR ecosystem, their roles within a range of common user scenarios (which often involve unavoidable legacy technology), and their relationship to each other and to the set of models needed to provide practical services for FAIR interoperation at the business level.
Three basic scenarios are examined in order to understand the practical requirements of different communities. The first scenario is one which has received a good deal of attention during initial efforts to implement the FAIR principles: a domain or user community without a strong pre-existing culture of data sharing and reuse wishes to become FAIR. The second scenario is one where a community with a strong existing culture of data sharing and reuse is looking to integrate its current approaches with those advocated by the FAIR community. The third scenario looks at FAIR from the "industrial" perspective of the implementer of FAIR services: how does FAIR provide the kind of market which is needed to support full-scale services and application development?
Each of these scenarios provides a valid but different view of what it will take to implement FAIR in practical terms. In order to understand them, we can draw parallels with large-scale data-sharing efforts in other communities. The Internet and the Web can be understood as a useful example of how vision, standards, and implementation combine to provide successful infrastructure at this scale. Indeed, the concept of Digital Objects (and now FAIR Digital Objects) has its roots in this analogy (Kahn and Wilensky 2006). Other, smaller examples also exist, which focus more specifically on the exchange of data and metadata: for example, the Statistical Data and Metadata Exchange (SDMX) Initiative (SDMX community 2022) or the emerging Cross-Domain Interoperability Framework (SciDataCon 2022, Seoul Korea 2022a). Although implemented within targeted communities, these efforts exchange a wide range of data and metadata not entirely dissimilar to what is envisioned in FAIR. There is currently no single exact parallel for FAIR ecosystems, but there are examples from which we can learn in terms of making large-scale data reuse a practical reality.
Core to these is a vision of all of the component pieces, and of how they can act in concert to provide a scalable infrastructure which will address the needs of the many different communities of users. Such a common vision may be implicitly agreed among those working on FAIR implementation today, but in the interests of clear communication, it is time to document it, and in keeping with FAIR, this documentation should itself be machine-actionable. As we move toward the specification of the many components of the FAIR ecosystem, it seems only common sense to have an agreed roadmap.

Presented at
First International Conference on FAIR Digital Objects, presentation