Discover our blogs

Aerospace | Cranfield University


Agrifood | Cranfield University


Alumni | Cranfield University


Careers | Cranfield University


Careers | Cranfield University

Defence and Security

Design | Cranfield University


Energy and Power | Cranfield University

Energy and Sustainability

Environment | Cranfield University


Forensics | Cranfield University


Libraries | Cranfield University


Libraries | Cranfield University

Manufacturing and Materials

Libraries | Cranfield University

School of Management

Libraries | Cranfield University

Transport Systems

Water | Cranfield University


Homepage / Research data – what to keep?

Research data – what to keep?


Deciding what research data to keep, and why, has become a more significant focus in recent years as the volume and diversity of data outputs have grown.

The What to Keep study was commissioned by Jisc and undertaken from May 2018 to January 2019.

Among the key findings from the study are:

  • The main drivers for what to keep are research integrity and reproducibility (the availability of the data supporting the findings in research); and the potential for reuse (availability of data for sharing with other users).
  • Research grant terms and other legal requirements (e.g. for clinical trials data) can specify a minimum term for which research data must be kept and at a basic level that sets one simple retention criterion. However, as these dates begin to expire an increasing number of datasets will need review and potentially more complex appraisal decisions made on whether they are retained.
  • It is essential to consider not only what and why to keep data, but for how long to keep it, where to keep it, and increasingly how to keep it in ways that reflects its potential value, cost, and available funding.
  • For funders from all disciplines, including UK Research and Innovation (UKRI), the optimal research data to keep are:
    • Data which support primary research findings, e.g. are necessary to reproduce or query those findings
    • Data that is of obvious long term value e.g. longitudinal studies
    • Data which is subject to legal requirements
    • Data with short term value for one purpose or set of users, but which can also have long term value for other purposes or users.

Some questions remain around what to keep in relation to instrumentation data, outputs from models and simulations, serendipity and “Curated Databases”.

Regarding supplementary data and materials, we should keep metadata, some software/algorithms/codes supporting data reproduction or interpretation, and physical materials.

The CESSDA SaW Cost Benefit Advocacy Toolkit provides valuable tools for thinking about future cost and benefits of research data.


Photo by Adam Nowakowski on Unsplash

Greg Simpson

Written By: Cranfield University

Categories & Tags:

Leave a comment on this post:

Sign up for more information about studying master’s and research degrees at Cranfield

Sign up now
Go to Top