Research Ideas and Outcomes : Case Study
|
Corresponding author: Cameron Neylon (cn@cameronneylon.net)
Received: 17 Oct 2017 | Published: 19 Oct 2017
© 2017 Cameron Neylon
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation: Neylon C (2017) Case Study: Tobacco Economics Control Project. Research Ideas and Outcomes 3: e21703. https://doi.org/10.3897/rio.3.e21703
|
The Tobacco Control Economics Project is a project that seeks to gather evidence on tobacco use and economics in southern Africa. It is a project of the University of Cape Town with support from the DataFirst repository based at the University of Cape Town. Its aim is to gather data that already exists, sometimes in digital form, frequently in offline records or in some cases paper records, and bring them together as an open resource.
The project faces challenges of data gathering as well as permissions. Frequently data is or should be “available” in some form but control over the data is relinquished only unreluctantly. In many cases the legal standing of data is unclear. Many of the challenges relating to the bringing together of the data involve ascertaining what the legal standing of a dataset is or gaining permissions for its re-use.
DataFirst is a longstanding data sharing infrastructure with professional and experienced data management staff. Challenges of ensuring continued funding and maintenance are similar to those of data infrastructures globally. The infrastructure meets international standards and provides leadership to other services and platforms in this space.
research data, data management, open data, tobacco, health data, economic data, South Africa, Southern Africa
DataFirst is the site with the most previous experience and best existing infrastructure amongst the case studies. The participants work within a strong culture of data sharing and best practice, including training.
The project contact has extensive experience of data management, data management planning, and best practice in data handling. DataFirst is a world-class facility for data management. Therefore this contributing project represented this most experienced and expert part of the pilot project. A series of versions are available in the project data package (
A logistical oddity is that the main contact had not previously carried produced a Data Management Plan for a project. This was because they were generally responsible for executing an existing plan or advising on their development. Similarly to the Brazilian Virtual Herbarium the infrastructure nature of DataFirst meant that standardised approaches are not always appropriate.
There were some technical issues involved in the use of the DMPAssistant tool and the project contact elected to use the UK Digital Curation Centre DMP Online Tool instead. The technical issues appeared to be to do with authentication rather than network access or bandwidth so are probably not serious. Network access did not seem to be a major issue for South African projects in contrast with other African projects.
The development of the DMP was a useful exercise in surfacing issues to do with rights in data. Several project partners who held relevant data sets had previously stated they would contribute them to the project. However the DMP exercise led to the issue of rights being raised and clarification being sought. It is worth noting that this is not an explicit part of the DMP rubric but nonetheless the planning process was fruitful in providing a structure within which issues were surfaced.
DataFirst is a long standing infrastructure built in the context of UCT and South Africa. It is therefore well situated to operate effectively in this context. The challenges of gathering data from other sources within Southern Africa are varied, however the project has been structured specifically to tackle this.
In general, the experience of the main contact with systems such as those being deployed here meant that they were familiar and comfortable with the systems and questions. Nonetheless as noted an authentication problem lead them to use the DCC tool rather than that provided by Portage. Small technical challenges can be problematic and the availability, and awareness, of an alternative tool potentially saved a significant amount of time. Online/offline capability is therefore useful, though not critical in this case.
The main challenge that emerged for data sharing for the TCEP was a lack of clarity around permissions for data use. The DMP process was helpful in driving an explicit discussion of permissions status amongst the project leads. Datasets that were assumed to be usable were discovered to have either significant restrictions or to have no explicit permissions at all.
In terms of issues specific to developing and transitional nations there was less practical awareness at government level of Open Government Data. Control over access and limiting permissions persisted even in some cases where policy implied a requirement for openness. This is by no means restricted to developing and transitional nations, however it may be more prevalent and therefore more important as a limiting factor in these contexts.
As a data infrastructure DataFirst provides an excellent platform for data sharing and faces many of the same sustainability challenges as other infrastructures. Long term sustainability is not guaranteed and much funding is on a project basis, although it notes the ongoing support of its parent institution (
DataFirst has as an implicit goal the delivering of training and capacity that supports a culture of greater data sharing. In this sense policy implementation can be a support for existing activity as it strengthens the motivation of external grantees to engage with the infrastructure. Where data management planning, data deposition, or sharing are mandated there will be a motivation for local researchers to engage with local provision.
Policy in the context of DataFirst would support local action and be compatible with other policies that user groups might be subject to. Provided that policy imposition is associated with both information on support that DataFirst or similar groups can provide, and that DataFirst has the capacity to deliver that support, there is an opportunity to support culture change. The key issue is capacity and compatibility of policy frameworks.
Exploring the opportunities and challenges of implementing open research strategies within development institutions (