Skip to Main Content
It looks like you're using Internet Explorer 11 or older. This website works best with modern browsers such as the latest versions of Chrome, Firefox, Safari, and Edge. If you continue with this browser, you may see unexpected results.

Library Research Support: Open Research: Managing research data during a project

This guide is intended to provide advice and support on open access research, including guidance around Durham Research Online (DRO), open access publishing, research data management and related topics.

Choose the best storage solution

The University provides several different storage solutions for your research data.  The main benefit of University storage is that your research data will be backed up regularly in different locations so you are unlikely to lose your data.  If you keep your data on your own computer only then you risk losing some or all of it.  Your computer could develop a hardware fault.  Your computer could get damaged for all sorts of reasons.  Even backing up your data on an external hard drive or on data CDs is often inadequate.  A researcher recently lost all their research data because their laptop and all their backup data CDs were stolen.  Storage provided by the University is the best option.

CIS has created a Storage Options Tool to help you choose the most appropriate storage solution for your project. Please give it a try.

 

 

Protect sensitive research data

Assume you are a researcher managing personal or senstive research data and you have ethics approval for your project from your department or faculty.  Also assume you are familiar with the University's Ethics policy and its Data Protection policy.

You will need to protect your research data from unauthorised access.  The University recommends storing personal or senstive research data on OneDrvie for Business or its Sharepoint platform.  Thes two storage solutions are highly secure and use highly sophisticated data encryption algorithms.

Manage personal or senstive research data with care.  Avoid making too many copies of your research data.  Destroy copies you do not need.  Ideally, keep one copy only but use a storage solution which is hightly resilient, meaning you are highly unlikely to lose your data.  OneDrive for Business and Sharepoint are highly resilient.  

Anonymise your research data as soon as possible and destroy the original data if you do not need it.  The UK Anonymisation Network provides comprehensive guidance on data anonymisation.

The Information Commissioner's Office provides excellent guidance on all aspects of GDPR and data protection.

Work reproducibly

This is a big topic.  The University is beginning to assist individual researchers and departments with improving reproducible research practices.  Many of our researchers already work reproducibly.  Advanced Research Computing lead in this area but the RDM Team can assist with some aspects of this broad topic.

In his paper, Five selfish reasons to work reproducibly, Florian Markowetz identifies four levels of reproducibility:

  1. Avoid beginner's mistakes.  This is the lowest level of reproducibility.
    1. Keep files organised
    2. Name files in a meaningful way
    3. Avoid scattering files
  2. Computational reproducibility.  These tools enable you to iterate through your research data until you discover a pattern or an anomaly or something else significant.
    1. Use scripting tools: Python, R project, Perl ...
    2. Use notebook tools: IPython, Jupyter, knitr ...
  3. Software version control systems (e.g., GitMercurialSubversion).  These tools manage and track different versions of files in a project.
  4. The highest level of reproducibility is containerisation (e.g., Docker, Kubernetes).  These tools enable you to create and manage files in an environment called a container.  If another person executes the same container on their own machine, they should get the same results.  This is an advanced topic.