Research Ideas and Outcomes :
Forum Paper
|
Corresponding author: Kate Ingenloff (kathryn.ingenloff@gmail.com)
Academic editor: Christoph Wohner
Received: 16 Apr 2024 | Accepted: 29 May 2024 | Published: 17 Jun 2024
© 2024 Kate Ingenloff, Syrine Ben Aziza, Claus Weiland, Nikoletta Nikolova, Hans-Hermann Thulke, Martin Lange, Adam Reichold, Dmitry Schigel
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Ingenloff K, Ben Aziza S, Weiland C, Nikolova N, Thulke H-H, Lange M, Reichold A, Schigel D (2024) Prototype Biodiversity Digital Twin: Disease Outbreaks. Research Ideas and Outcomes 10: e125521. https://doi.org/10.3897/rio.10.e125521
|
|
African swine fever is a transmissible virus impacting wild and domestic swine populations. In Europe, it is non-native and the recently introduced genotype affects wild boar populations with occasional outbreaks in domestic pigs. The ability to predict short-term spatial dynamics of this disease will greatly improve our ability to control and limit future spread of the virus. The BioDT disease outbreaks prototype, currently in development, implements an individual-based landscape-level mechanistic model as a digital twin aimed at providing informed support for management decisions in response to the spread of African swine fever in European wild boar populations.
wild boar, Sus scrofa, African swine fever, ecological modelling, outbreak management, mechanistic model
African swine fever (ASF) is a transmittable lethal infection of wild boar and domestic swine (Sus scrofa) caused by a DNA virus of the genus Asfiviru (family Asfarviridae;
First identified in Kenya in 1921 (
The BioDT disease outbreaks prototype digital twin (pDT) aims to establish a predictive modelling tool that can be used in acute ASF outbreaks by decision-making bodies (https://biodt.eu/use-cases/disease-outbreaks). It implements an established individual-based, spatially-explicit mechanistic model at the landscape scale to simulate the spread and potential control of ASF in European wild boar populations (
The purpose of this pDT is to provide information for data-driven responses in managing the spread of African swine fever in European wild boar populations. With each reported detection of ASF, the model is updated and rerun to produce more robust predictions of infection risk and the effectiveness of spatially-explicit control measures permitting routine update of the most suitable virus containment decisions during an ASF outbreak. Although there are multiple potential stakeholders and user groups, the initial implementation of this BioDT prototype digital twin focuses on public health decision-makers and people involved in decision support.
The wild boar-African swine fever model requires two types of data: static and dynamic (Table
Overview of static and dynamic data, known data sources and anticipated data formats.
Static/Dynamic |
Data |
Data source(s) |
Data format(s) |
Static |
Habitat structure model |
ENetWild, User-provided |
Ascii; GeoTIFF; Raster |
Dynamic |
Wild boar geolocation information |
ENetWild, User-provided |
Text file; Comma-separated file |
Dynamic |
Wild boar infection status |
User-provided |
Text file; Comma-separated file |
Dynamic |
ASF treatment scenarios (barriers) |
User-provided |
Shapefile |
Static data consist of habitat or landscape structure data. These data describe the landscape on which the model simulations take place. Provision of a habitat structure data layer triggers the first run of a wild boar–ASF model simluation. Currently, the static data layer must be created by the user or obtained from ENetWild (
Dynamic data can be supplied to the model any time new information becomes available, triggering the model to rerun and return updated results based on the new information. These data include wild boar geolocation and ASF infection information as well as virus control scenarios. Geolocation data include the location (as GPS coordinates), time and date a boar (or group of wild boar) was identified. These data may also supply the model with an animal’s infection status (is the animal healthy, infected, or recovered?), as well as other demographic information about an animal, such as its age, sex, and vitality (whether or not the animal is alive or dead). Proposed virus control measures or treatment scenarios are location-specific measures designed to halt or contain the infection including barriers designed to separate infected from non-infected individuals, local depopulation efforts and carcass retrieval. A user will be able to supply the model with multiple proposed control measures to identify the best control scenario. In an ideal scenario, wild boar geolocation data and ASF infection reports would be directly updated from partnered research institutes (RI), such as the Global Biodiversity Information Facility (GBIF); however, presently, all data must be supplied to the model by the user.
Fig.
The model was designed using a modular format (known as a modularised entity component system or ECS). This structure ensures that all model sub-modules have the same structure and interface. The core code of each module is written in the Rust programming language (https://www.rust-lang.org/); and each module has a 'wrapper' written in Python (https://www.python.org/). This setup allows for a fully customisable model module configuration that can be adapted to suit each modeller’s needs. With the assistance of H.-H. Thulke and M. Lange, the pDT team selected a default configuration for use in the initial pDT implementation.
At the close of the second year of the BioDT project, development of the wild boar-ASF pDT is well underway, but not yet complete (see Table
Overview of the wild boar-ASF protoype digital twin development tasks and their current status.
Category |
Task |
Status |
General |
Obtain minimum working model module configuration and test data from project collaborators |
Complete |
Data |
Fair data object creation |
Incomplete |
Model outputs |
Select standardiszed suite of static and dynamic model outputs |
Not started |
Model – local environment |
Exploration and testing of test data and minimum working model modular components in the local environment (model workflow) |
Complete |
Model development: automation of treatment data ingestion/manipulation |
Not started |
|
Model development: code adaptation to provide standardised model outputs |
Not started |
|
Model – LUMI |
Model code made available to LUMI contacts |
Complete |
Model deployment on LUMI supercomputing environment |
Not started |
|
Model automation, parallelisation, and testing on LUMI |
Not started |
|
pDT user access |
User Interface storyline drafting |
Complete |
User Interface design and testing |
Incomplete |
|
User-specific, version-controlled storage |
Incomplete |
Ensuring that project data and model outputs are FAIR (Findable, Accessible, Interoperable and Reusable) is a high priority for the BioDT initiative as FAIR principles are critical for enabling autonomous or semi-autonomous navigation and processing of data in large integrated data spaces by machines (
The wild boar-ASF model is not computationally intensive. The initial model implementation runs easily on a processor Intel(R) Core(TM) i7-8665U CPU @ 1.90GHz, 2112 Mhz, 4 Core with 8 Logical Processors running Windows version 10.0.19045 Build 19045. The simulation's current computational profile features an overall execution time of 8.07 seconds and a granularity of 15.51 milliseconds per simulation tick over a span of 520 ticks. The spatial range is bounded by [4_506_779, 3_052_929, 4_855_174, 3_353_689] with a resolution of 2000 m. Initial agents are defined with a release factor of 5.0 and the mutation probability is set to 1e-2. These metrics suggest that the current model instance is less computationally demanding than what is expected in large-scale deployments, which typically involve higher resolutions and more extensive agent-based modelling. However, when the model is applied to continental-scale problems with larger datasets, higher agent counts and finer spatial resolutions, the existing architecture may limit performance efficiency of large multi-repetition experiments.
Introducing parallelisation when the model is migrated to the LUMI supercomputing environment will be essential as a scalability measure, as well as a means of enhancing computational throughput. Should a user wish to explore multiple virus barrier scenarios at the same time, parallel processing will enable multiple, independent model simulations to run simultaneously, reducing overall runtime and making efficient use of computational resources. This will be particularly beneficial during acute ASF outbreaks when multiple intervention strategies must be rapidly evaluated.
A user interface (UI) for the pDT is currently in development with project collaborators at CSC - IT Center for Science LT (https://www.csc.fi) and VSB - Technical University of Ostrava (https://www.vsb.cz). Individual users will have their own password-protected account and the UI will facilitate three user capabilities (see Fig.
There are no plans at the moment for any connection with the Destination Earth (DestinE, https://digital-strategy.ec.europa.eu/en/policies/destination-earth) or the European Open Science Cloud (EOSC, https://eosc.eu/eosc-about/), nor is the pDT supported by any of the project biodiversity research institutes (RI). Additionally, the sustainability of the project remains in question. With only one year of support left under the broader BioDT project, the wild boar-ASF digital twin will require funding and support to keep it running. Support will be necessary to cover the technical infrastructure of the project and provide a means of funding on-going model development and implementing improvements to the user interface.
The wild boar-ASF prototype digital twin aims to address an ongoing need in tracking and responding to wildlife disease outbreaks, such as African swine fever. The ability to predict short-term spatial dynamics of ASF will improve efforts to control and limit potential spread. However, the initial implementation of this digital twin only simulates ASF within the wild boar population and some aspects of model development remain incomplete. The major barriers to progress are a lack of adequate funding streams to support long-term pDT infrastructure and model development and a lack of openly available disease host and infection data. Despite this, the project still hopes to eventually expand the model to simulate and assess the risk of ASF spillover from wild boar to domestic pig populations.
Spillover of ASF infection into domestic pig populations has significant economic and food-security impacts. The ability to model this potential on-demand and in real-time would minimise risk of transmission and protect the livelihoods of ranchers supplying pork and other pig products. In addition to requiring on-going support for model development and maintenance as information about wild boar-domestic swine-ASF improves, implementation of a full spillover risk model as a digital twin will present a new suite of challenges including significantly increased computational demand (the model linking the wild boar-ASF and domestic swine-ASF models would need to simulate feedback interactions) and a need for much greater access to near real-time wild boar, ASF infection, and domestic pig data via project RIs.
While the wild boar-ASF model is fully documented and well-established as a model-based policy support tool, the process of adapting it to a digital twin highlighted the need for greater generalisation of the model to effectively adapt it into an independent forecasting tool. Unfortunately, some of these model development needs (specifically automation of barrier data ingestion and standardisation of the model output data packet) are currently on hold as project resources for the modelling team are capped for the year. The pDT team hopes to resolve this before long so that model development can continue. In the meantime, development of the user interface and testing and implementation of the model in LUMI will continue. The team will also continue to address the dearth of available wild boar geolocation and ASF infection data in project RIs.
This study has received funding from the European Union's Horizon Europe Research and Innovation Programme under grant agreement No 101057437 (BioDT project, https://doi.org/10.3030/101057437). Views and opinions expressed are those of the author(s) only and do not necessarily reflect those of the European Union or the European Commission. Neither the European Union nor the European Commission can be held responsible for them.