Corresponding author: Lyubomir Penev (
Academic editor:
The EU BON project will build a substantial part of the
The main purpose of
The main purpose of
We surveyed the copyright and usage licenses used by the potential suppliers of data to the EU BON portal listed in the Annex to milestone MS241
1. EU BON data sharing agreement (MS971).
2. The paper of on the legal framework for biodiversity information in Europe (
3. Analysis of IPR policies of the EU BON data suppliers (Annex of milestone MS241).
4. The Bouchout declaration principles, see the website (
5. A RECODE project
6. Pensoft's
7. Official data polices statements and documents of major funders and research organizations (e.g. Horizon2020, National Science Foundation (NSF) and National Institutes of Health (NIH) of the USA, and others).
8. Other sources, cited within the document.
Biodiversity data and information provide important knowledge for many biological, geological, and environmental research disciplines as well as for the development of policies relating to the natural environment and the management of natural resources. Digital information management systems can bring together the wealth of information and the legacy of over 260 years of biological observations now dispersed in a myriad of different documents, institutions, and locations. As the signatories of the
This standpoint is accepted by most public and scientific authorities. In 2007, the Organisation for Economic Co-operation and Development (OECD) published “
At present, the access and reuse of biodiversity data is hampered by an array of technical, economic, sociological, legal and other factors. Considerable quantities of biodiversity information are detained in legacy literature that is not accessible for technical reasons. Many compilers of biodiversity content act as if or claim that they hold intellectual property rights over their data and information. Open and free access to biodiversity data and information requires that we overcome these obstacles. It is to that end that EU BON has elaborated its data policy.
Biodiversity data can be mobilized from three different major sources of information:
Raw data from observations/collections published via data aggregators and citizen science platforms
Unlocking the printed legacy literature through conversion to a digital format, retrospective markup, and/or text and data mining
Prospective markup of new publications
These three sources of biodiversity data each need appropriate policies and guidelines to incentivise data providers and custodian to publish the data. In spite of the diversity of specific national, institutional, domain-specific, and individual requirements and expectations regarding copyright and norms accepted and used across countries, we can formulate a few
Strategic goals for biodiversity data mobilization and publication:
Promote the understanding that primary biodiversity data are facts and therefore NOT a subject of copyright; they belong to the public domain, independent of their source;
We should require explicit statements that clearly place biodiversity data in the public domain, by applying a standardized waiver for any eventual copyright or database protection right, for example
To the maximum possible extent , we should render printed materials, PDFs, and other non-machine-actionable biodiversity data and narratives, into machine-readable and harvestable formats.
No intellectual property rights apply to information or data. “Intellectual property rights” are a group of legal instruments that exist in many countries and are applied to precise immaterial goods in a precise context. In member countries of the EU, “intellectual property rights” refer mainly to copyright (conceived in relation to creative works of art and literature), neighboring rights (relating to performances, phonograms and broadcasts), patent rights (relating to inventions), industrial designs, trademarks and databases. The concept of intellectual property rights applies only to goods that are precisely defined: Where there is no law stipulating explicitly the protection of a specified class of immaterial items, no intellectual property rights exist.
Data and information in general, or biodiversity data in particular, are not protected immaterial goods. Consequently, there can be no intellectual property right on biodiversity data as such. A legal protection can only exist if the biodiversity data qualify as one of the protected immaterial goods. In practice, this can occur where collections of biodiversity data qualify as a “work” in the meaning of copyright or as a “database” in the meaning of EU database protection.
Copyright can be applied to works that are original, individual, new creations with respect to the form of the presentation. It does not cover ideas, procedures, systems nor content. Scientific data present facts in standardized forms that have been agreed by the respective scientific community. As they are not creative in form, scientific data in general as well as their metadata do not qualify as works. This is also valid for numerous biodiversity data presented as images because they present facts according to standardized, preconceived conventions.
On the other hand, copyright protection can apply to a collection of biodiversity data if it constitutes, by reason of the selection or arrangement of their contents, an intellectual creation with an individual character. The more systematic a collection of data is, and the more consistent with agreed standards and conventions, the less individual it is in the meaning of copyright, and the less likely copyright protection will apply. Consequently, collections of biodiversity data will be protected by copyright only in a very small minority of cases. Nevertheless, in these few cases, copyright may constitute a barrier to the free exchange of biodiversity data.
European copyright legislators are well aware of this impediment to data exchange. The
The EU Database protection is not part of copyright but is a
As illustrated in a recently published review (
Copyright as well as database protection are part of “private law”, which is applied only on demand by the owner of the rights. Even if there is an intellectual property right with respect to a particular collection of biodiversity data, the owner is entitled to renounce their claim to those rights. This principle of private law is the basis of the common-sense phrase: “Where there is no plaintiff, there is no judge”.
The reluctance of researchers and publishers to distribute and exchange their data and information openly has economic, scientific, or sociological reasons (
One factor that may change this reluctant attitude is to develop measures that ensure that all who create, organise or mobilise data are fully credited for their contributions (
Biodiversity data and information should not be treated as commercial goods, but as a common resource for the whole human society. From this perspective, scientific publications need to be made openly available, as soon after publication and as freely as possible. Researchers should be able to communicate their results with minimum time delay and at minimum cost. Restrictions to open availability should only be applied if based on specific justifications, such as to protect security, endangered species, or to protect the privacy of individuals.
The main objective of EU BON is to build a substantial part of the Group on Earth Observation’s Biodiversity Observation Network (GEO BON). EU BON’s deliverables include a comprehensive „European Biodiversity Portal“ for all stakeholder communities, strategies for a global implementation of GEO BON and support of the Intergovernmental Platform on Biodiversity and Ecosystem Services (IPBES). In that perspective, EU BON recommends to all members, associated persons and institutions as well as to other stakeholders of biodiversity information to contribute to the following data policy:
1. Legislators
The EU should revise the Directive 2001/29/EC by declaring that the provision of a copyright exception for scientific research is compulsory for all member states. The new regulations should not refer to commercial or non-commercial scientific research, as this distinction is neither useful nor applicable in practice. Nor should they refer to the place from where, nor the technical mode how, works are accessed, as such restrictions hamper the research process.
The EU should revise the Directive 96/9/EC by declaring that the re-use of protected databases for scientific research is authorised by a compulsory exception to database rights.
Member states of the EU or the EEA should introduce or, where it already exists, extend a copyright exception for the use of works for scientific research. This exception should not refer to commercial or non-commercial scientific research, as this distinction is neither useful nor applicable in practice. Nor should it refer to the place from where, nor the technical mode how, works are accessed, as such restrictions hamper the research process.
Member states of the EU or the EEA should introduce or, where it already exists, extend an exception of database protection for the re-use of databases for scientific research.
2. Researchers
Researchers should refrain from asserting intellectual property rights for biodiversity data and information collected and/or published by them. By default, all content referring to biodiversity information should be openly accessible.
As far as material produced by researchers is protected by copyright or by database rights, the right owner should make these works or databases freely accessible and reusable by publishing them under a
Publicly funded research institutions should refrain from asserting intellectual property rights for biodiversity data and information collected and/or published by them. By default, all content referring to biodiversity information should be openly accessible.
Publicly funded institutions should encourage re-use of biodiversity data and information for research purposes with a requirement for attribution of the source, but should impose no other requirements on re-use.
As far as material owned by publicly funded institutions is protected by copyright or by database rights, the institutions should dedicate these works or databases to the public domain by publishing them under
3. Data aggregators
Encourage data suppliers and partner nodes to publish their data under
Ensure that data are stored in a versioned and time-stamped manner.
Provide data citation mechanisms (
Develop mechanisms to identify and cite arbitrary views of data, from a single record to an entire data set in a precise, machine-actionable manner, that are stable across different technologies and technological changes.
Allow to cite and retrieve that data as it existed at a certain point in time, whether the database is static or highly dynamic.
Identify data sets by storing and assigning persistent identifiers (PIDs) to times-tamped queries that can be re-executed against the time-stamped data store.
4. Funding Agencies
Whenever possible, funders should support and require use of the most liberal data use and re-use licenses, particularly by putting data into the public domain through the
Develop policies to require funded researchers to make the data underpinning scientific publications available in machine readable formats in public repositories at the time of initial publication.
Support and require enhancement to the maximum possible extent of machine-readability of both data and associated metadata.
Ensure and require that data management plans (DMPs) to include clear statements and a work plan for archiving and sharing research data. The DMPs should include: descriptions of data to be produced in the proposed study, any data standards used, mechanisms for providing access to and sharing of data (including provisions for protection of privacy, confidentiality, security, intellectual property, or other rights), provisions for data reuse and redistribution, and plans for archiving and long-term preservation of the data. As the technical infrastructure available to research will continue to evolve, so we expect DMPs to evolve.
Support and encourage the use of established public repositories and community based standards. Funders should encourage all funded researchers to make use of existing data standards relevant to their research community, such as standards for collecting and representing data and information describing the data set (i.e. metadata), as well as promote the interoperability of digital data in and across public repositories.
Develop approaches and support technologies to ensure the discoverability of data to make them findable, accessible, and citable. Funders should support also the development of data discovery indexes to provide a mechanism to enhance discoverability and facilitate appropriate attribution to those responsible for the dataset and link the citations to associated publications.
Explore the development of a data commons, a shared space for research output including data, software and a narrative that follows the FAIR principles of Find, Access, Interoperate and Reuse.
5. Publishers
Allow authors to retain the copyright to their publications and to make it available under the terms of the
Use the
Other data publishing licenses may be allowed as exceptions (subject to approval by the editor on a case-by-case basis) and should be justified with a written statement from the author that will be published with the article.
Authors should be required to share all data, code or protocols underlying the research reported in their articles. Exceptions could be permitted, but have to be justified in a written public statement accompanying the article.
Datasets and software should be deposited and permanently archived in appropriate, trusted, general, or domain-specific repositories (please consult
Small bodies of data may also be published as data files or packages supplementary to research articles, however, the authors should favour deposition in data repositories.
Ensure availability of both data and narrative in harvestable machine-readable formats (for example JATS XML), including article and dataset metadata.
Increase the proportion of machine-readable content within the narrative and data to the maximum extent possible at the current level of technology development.
We here include reference to other documents that are relevant to the EU BON data policies and summaries of their content.
On the basis of data policy principles, EU BON has agreed upon a data-sharing policy, which is binding for providers of data to the EU BON portal, EU BON partners, associated persons and institutions, and for all users of its portal. The report is part of the process to develop the legal framework on which this data sharing policy will be based.
The legal framework for data publishing and dissemination applicable to EU BON is realized in the form of the EU BON Data Sharing Agreement. By asking data providers to refrain from claiming intellectual property rights, it makes sure that no such rights are applied to data within the EU BON network. For data under national or international security restrictions or under time embargos, EU BON provides for a special category of “sensitive data”. Such data are kept separately from other data and are made available only upon special justification. Finally, EU BON does not assert any intellectual property rights for itself; it dedicates all collections of data that might qualify as works in the meaning of copyright to the public domain or publishes them under a Creative Commons (CC-BY) 4.0 license.
Intellectual property rights on biodiversity data?
Other legal aspects referring to biodiversity data
The EU BON Data Sharing Agreement
The Data Sharing Agreement sets out the policy of EU BON on the sharing and use of data available in the EU BON portal. The document refers to EC policies (
Background or guiding principles
Obligations and guarantees for data providers
Obligations and guarantees for EU BON
Obligations and guarantees for users
Contribution to GEO BON
The paper by
The request for open access
Why copyright can hamper the exchange of biodiversity knowledge
The European database protection
The importance of data use agreements
Examples of national regulations
Several specific components and sub-networks have already been identified for integration in EU BON. In addition to GBIF, these include, for example, the broader DataONE network (including the Long Term Ecological Research program (LTER), the International Long Term Ecological Research networks (ILTER), the Knowledge Network for Biocomplexity (KNB), and the Dryad digital repository), the database of the EU-wide monitoring methods and systems of surveillance for species and habitats of community interest (DaEuMon) and theDrupal Ecological Information System (DEIMS). The registry lists network resources that are expected to be connected to the EU BON registry. For each entity, the access protocol, metadata standard and any accessor requirements are noted.
The
So far, the Bouchout Declaration has been signed by more than 90 organizations and more than 200 individuals.
The EU
Deliverable (D5.1) on
Summary booklet on
RECODE provides:
1. Ten overarching recommendations:
Develop aligned and comprehensive policies for open access to research data;
Ensure appropriate funding for open access to research data;
Develop policies and initiatives that offer researchers rewards for open access to high quality data;
Identify key stakeholders and relevant networks and foster collaborative work for a sustainable ecosystem for open access to research data;
Plan for the long-term, sustainable curation and preservation of open access data;
Develop comprehensive and collaborative technical and infrastructure solutions that afford open access to and long-term preservation of high-quality research data;
Develop technical and scientific quality standards for research data;
Require the use of harmonized open licensing frameworks;
Systematically address legal and ethical issues arising from open access to research data; and
Support the transition to open research data through curriculum-development and training
2. Stakeholder-specific recommendations for funders, research institutions, data managers, publishers
3. Practical guides for these groups, including: (i) Preparing and implementing a policy; (ii) policy content; (iii) practical checklist for the specific group. At the end, they provide a long list of resources, including funder policies, EC policies for Open Access, publisher policies etc.
This is an extensive document that provides the basis for the data publishing practices in Pensoft's journals and can be used by other publishers when appropriate (
Data Publishing Policies
Data Publishing in a Nutshell
Why Publish Data
Data Publishing Licenses
How to Publish Data
Open Data Repositories
How to Cite Data in Pensoft Journal Articles
Guidelines for Authors
Data Published within Supplementary Information Files
What is a “Data Paper”
Data Papers Describing Primary Biodiversity Data
Generation of Data Paper manuscripts using the GBIF Integrated Publishing Toolkit (IPT)
Data Papers Describing Ecological and Environmental Data
Data Papers Describing Genome Data
Barcode Data Release Papers
Data Papers Describing Software Tools
Guidelines for Reviewers of Data Papers
Quality of the Manuscript
Quality of the Data
Consistency between Manuscript and Data
Bouchout declaration (
MS841: Biodiversity data publishing legal framework report
Data type: PDF document
Brief description: EU BON milestone report MS841
File: oo_80980.pdf
MS971: Data sharing agreement
Data type: PDF document
Brief description: EU BON milestone report MS971
File: oo_80981.pdf
MS241: Specification for registry and metadata catalogue
Data type: PDF document
Brief description: EU BON milestone report MS241
File: oo_80982.pdf