Research Ideas and Outcomes : Data Management Plan
Corresponding author: Reem Wael (reem.wael@harassmap.org)
Received: 18 Jul 2017 | Published: 18 Jul 2017
© 2017 Reem Wael
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation: Wael R (2017) Data Management Plan: HarassMap. Research Ideas and Outcomes 3: e15133. https://doi.org/10.3897/rio.3.e15133
HarassMap is an Egyptian organisation that works to create an environment where sexual harassment is not tolerated, and where individuals and institutions take action against it. For this project, the team cleaned up and organised three main types of data and made them openly available for the public to access and use through a web portal: social media data, crowdsourced data, and research data.
The social media data was collected retrospectively from our Facebook page during the project period and covers the period 2010-2016. The crowdsourced data and the research data were cleaned and organised to make sure they are usable by the public, but were otherwise kept in their raw format. During the collection and organisation period, we also removed all personal identifiers from the data to ensure anonymity and confidentiality, and prepared a description of each dataset to help the public understand how the data was collected and how it can and cannot be used.
The data is stored online on a web portal that we built together with a web developer during the project period. On the web portal, the data is available for the public to view, search, and download for research or other purposes. The data is also backed up on a hard drive and in the cloud. The web portal and the HarassMap open data will be advertised on our website, and the direct link will be shared with our contacts and with others who approach us with an interest in our data.
data management plan, sexual harassment, Egypt, crowd sourcing, dmp, research data management
We collect different types of data. We will record numeric, audio, PDF, text, and image data. The data includes crowdsourced reports that we receive online; reports, comments, and messages that we receive on social media; and field data from research projects (for example, interview transcripts).
The data will be in different formats, such as XLS, docx, PDF/A, sav, and MP3. These formats are easy to re-use as long as researchers are able to work with the corresponding software. In addition, the data will be available and accessible to the user.
Initially, we will add the final version of each type of data. There will be three other copies of the data, one of them stored off site (on an external hard disk), with access given to certain staff members. In addition, the data will be grouped based on the nature of the data as follows:
There will also be text descriptions with information about how each dataset is organised.
Some types of data need documentation to be usable by other researchers, while others do not because their content is clear from the title. For data that requires documentation, such as field data about sexual harassment, the documentation must include: the research methodology used, sample size, variable definitions, assumptions made, the format and file type of the data, a description of the data capture and collection methods, and an explanation of the data coding and analysis performed (including syntax files, if available).
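The documentation items listed above could be captured in a small structured record so that gaps are easy to spot. The following is a minimal sketch in Python, not the format HarassMap actually uses: the class name, field names, and all sample values are hypothetical placeholders.

```python
import json
from dataclasses import dataclass, field, asdict


@dataclass
class DatasetDescription:
    """Hypothetical documentation record; field names are illustrative only."""
    title: str
    methodology: str
    sample_size: int
    file_format: str
    collection_method: str
    variable_definitions: dict = field(default_factory=dict)
    assumptions: list = field(default_factory=list)
    coding_notes: str = ""

    def missing_fields(self):
        """Return the names of fields that are still empty."""
        return [name for name, value in asdict(self).items()
                if value in ("", None, {}, [])]


# Placeholder values for illustration; these are not HarassMap data.
desc = DatasetDescription(
    title="Example field study",
    methodology="Semi-structured interviews",
    sample_size=30,
    file_format="docx",
    collection_method="In-person interviews, transcribed",
    variable_definitions={"location": "Place where the interview was held"},
)

print(json.dumps(asdict(desc), indent=2))  # description that can sit beside the data file
print(desc.missing_fields())               # fields that still need to be written
```

Storing the description as JSON next to each dataset keeps it readable by both people and software, and the `missing_fields` check gives interns a simple way to confirm that a description is complete before publication.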
The crowdsourced data will also require explanation of how the data was collected and how it can and cannot be used.
HarassMap will start collecting data retrospectively. For the last five years we have been receiving data but not collecting it. For this project, we will start by collecting the incoming data and, at the same time, document data received in the past, moving chronologically. The aim is to document as much data as we can in the next 6 months, the duration of the project. To make sure that the documentation is created or captured consistently, HarassMap will use part of the funds allocated through this project to recruit 2-3 interns to collect and clean data during the data collection phase. They will work closely with the staff of the HarassMap units from which the data will be collected.
Currently we are not using any.
We will store the data on a server for long-term storage (years). The exact details of cost and space will be determined once we hire a developer on a consultancy basis for this project.
The data will be saved on the server, on external hard disks, and in the cloud. HarassMap does not have a technology expert on board. We will therefore use funds available from this grant to hire a consultant who can help us set this system up and maintain it for the duration of the project. After the end of the project, we will allocate the maintenance fee from another grant as soon as possible.
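When the same data lives on a server, an external disk, and a cloud service, one common way to confirm that the copies still match is to compare file checksums. The sketch below assumes SHA-256 and uses temporary files as stand-ins for the three locations; the function names and file names are hypothetical and do not describe HarassMap's actual setup.

```python
import hashlib
import tempfile
from pathlib import Path


def sha256_of(path):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def mismatched_copies(original, copies):
    """Return the backup paths whose contents differ from the original."""
    expected = sha256_of(original)
    return [copy for copy in copies if sha256_of(copy) != expected]


# Temporary files stand in for the server, disk, and cloud copies (assumption).
with tempfile.TemporaryDirectory() as tmp:
    original = Path(tmp) / "server.csv"
    good = Path(tmp) / "disk.csv"
    bad = Path(tmp) / "cloud.csv"
    original.write_text("id,report\n1,example\n")
    good.write_text("id,report\n1,example\n")
    bad.write_text("id,report\n1,exampl\n")  # simulated corruption
    damaged = [p.name for p in mismatched_copies(original, [good, bad])]

print(damaged)
```

Running a check like this on a schedule would flag a damaged or outdated backup before it is needed for recovery.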
Each team and its collaborators can access the data over the internet through a web application. Each team will have permission to edit the parts relevant to it.
It will be saved on the server and available to the public on the open database/platform (which will be kept online and running as long as we have funds for it).
Each project file will be stored on the server in its own format and will be linked programmatically to its data in the database. The data will remain stored on the server for as long as we have the server.
We will use the application to manage accounts: each team or section manager can create, update, modify, and remove the members of their team.
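The account rules described here (managers maintain their own team, and each team edits only its own part of the data) can be illustrated with a small permission check. This is a sketch under assumptions, not the portal's implementation: the `Team` class, the function names, and the user names are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Team:
    """Hypothetical team record; not taken from the HarassMap application."""
    name: str
    manager: str
    members: set = field(default_factory=set)


def require_manager(team, acting_user):
    """Raise PermissionError unless acting_user manages the team."""
    if acting_user != team.manager:
        raise PermissionError(f"{acting_user} does not manage {team.name}")


def add_member(team, acting_user, member):
    """Add a member to the team; only the team's manager may do this."""
    require_manager(team, acting_user)
    team.members.add(member)


def remove_member(team, acting_user, member):
    """Remove a member from the team; only the team's manager may do this."""
    require_manager(team, acting_user)
    team.members.discard(member)


def can_edit(team, user, section):
    """Assumed rule: users may edit only the section owned by their own team."""
    return section == team.name and (user == team.manager or user in team.members)


research = Team(name="research", manager="manager_a")
add_member(research, "manager_a", "intern_1")
print(can_edit(research, "intern_1", "research"))   # True
print(can_edit(research, "intern_1", "marketing"))  # False
```

Keeping the manager check in one helper means every account operation enforces the same rule, which makes the behaviour easier to audit.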
We will primarily share raw data from a research study that we conducted in 2011-2013, from crowdsourced map reports, and from social media.
HarassMap has a default copyright policy (Creative Commons): http://harassmap.org/en/copyright/
HarassMap will announce this on our website, and we will also keep a permanent icon on the website indicating to users that they can access our data. Even without an announcement, HarassMap already gets many requests from researchers about sexual harassment.
We will also utilize our contacts with academics in Egypt, UK, and the US to ensure that they know about our open data.
Four HarassMap staff members are involved in this project: the Director, the Marketing and Communications Unit Head, a Researcher, and the Admin and HR Manager. Additionally, we will use the project budget to offer paid internships to collect some of the data and to hire a developer as a consultant.
HarassMap has three different staff members to oversee the project and therefore the absence of any of them will not affect the implementation of the project.
Person to collect data from social media: 9000 EGP for interns to document and clean information
Developer: $2590
Database upkeep (hosting): Will be determined by the end of June 2017.
The database backend will be accessible only with a user name and password. The data that is accessible to the public will be checked to ensure anonymity and confidentiality.
Our data is not generally sensitive but sometimes we get reports with names or numbers and in these cases we only publish the reports after we remove this information. The ‘sensitive’ information will not be available to the public.
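A first automated pass over report text can mask the most obvious identifiers before a person reviews it. The sketch below is an assumption about how such a pass might look, not the procedure HarassMap used: the patterns, placeholders, and the `known_names` list are hypothetical, and the sample sentence is invented.

```python
import re

# Rough patterns (assumptions): they can over-match, for example on a year
# range such as 2010-2016, so a manual review is still needed afterwards.
PHONE = re.compile(r"\+?\d[\d\s\-]{6,}\d")
EMAIL = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")


def redact(text, known_names=()):
    """Mask e-mail addresses, long digit runs, and any listed names."""
    text = EMAIL.sub("[EMAIL]", text)
    text = PHONE.sub("[NUMBER]", text)
    for name in known_names:
        text = re.sub(re.escape(name), "[NAME]", text, flags=re.IGNORECASE)
    return text


# Invented sample text; not a real report.
sample = "Call Ahmed on 0100 123 4567 or write to ahmed@example.org."
print(redact(sample, known_names=["Ahmed"]))
```

Names cannot be found reliably by pattern alone, which is why this sketch takes them as an explicit list; in practice a reviewer would still read each report before it is published.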
We are also unable to share any information that would put HarassMap staff at risk, such as reports that defame a person or a place that could then file a defamation claim against HarassMap.
We will prepare an intellectual property rights agreement that researchers will have to accept before they are given access to the data.