Spring clean your research data: how to choose what to preserve
22/11/2016

When your research is complete, you will probably need to deposit your research data in a repository – but how much should you deposit? All of it or just the subset referenced in any articles? The raw or derived data? Selection and appraisal is a key step but the decision-making isn’t always straightforward.
Why not keep everything?
Whilst the cost of storage is ever decreasing, it is not insignificant. Research data is growing rapidly and must be stored for 10+ years in many cases (as required by RCUK), so storage, backup, and preservation soon become expensive. Discovery is also made more difficult when there is a large amount of data and the datasets are less streamlined. And of course, it takes time and effort to prepare data for long-term usability (e.g. adding any necessary documentation), so it’s important that this work is worthwhile; unfortunately, none of us has unlimited time at our disposal.
How to evaluate which data to keep
The main criteria to consider are:
- Legal/policy requirements: do you have a contractual obligation to preserve the data? E.g. was the research RCUK-funded and the data used in a publication that requires data to be made available?
- Uniqueness and reproducibility: is the data unique? Would it be hard/impossible/expensive to reproduce?
- Special value: is the data scientifically or culturally significant? E.g. does it represent a landmark discovery or new techniques, or can you see it aligning strongly with research trends?
- Re-use potential: is it reliable, well-documented data that is likely to be of broad interest with high re-use potential?
It may be useful to evaluate these whilst considering the potential type of re-use of your dataset, be it verification of findings and further analysis (in line with funder aims for data retention), building academic reputation, community resource development, learning and teaching, or private use by your future self.
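If you have many datasets to appraise, it can help to record your answers to the four criteria above in a consistent way. The following is only a minimal sketch in Python, not an official tool: the class name, field names, and the keep/review rule are all assumptions made for illustration, and the real decision should follow the full checklist and your funder's requirements.

```python
from dataclasses import dataclass


@dataclass
class DatasetAppraisal:
    """Yes/no answers to the four selection criteria for one dataset.

    All names here are hypothetical; they simply mirror the criteria list.
    """
    name: str
    legal_or_policy_requirement: bool
    unique_or_hard_to_reproduce: bool
    special_value: bool
    high_reuse_potential: bool


# Human-readable labels for each criterion field.
CRITERIA_LABELS = {
    "legal_or_policy_requirement": "Legal/policy requirements",
    "unique_or_hard_to_reproduce": "Uniqueness and reproducibility",
    "special_value": "Special value",
    "high_reuse_potential": "Re-use potential",
}


def criteria_met(appraisal):
    """Return the labels of every criterion answered 'yes'."""
    return [
        label
        for field_name, label in CRITERIA_LABELS.items()
        if getattr(appraisal, field_name)
    ]


def suggest_action(appraisal):
    """Suggest a next step (an assumed rule, not a policy).

    - a legal/policy obligation means the data is kept;
    - any other criterion met means the dataset deserves a closer look;
    - no criteria met means it is a candidate for not depositing.
    """
    if appraisal.legal_or_policy_requirement:
        return "keep"
    if criteria_met(appraisal):
        return "review"
    return "consider not depositing"


raw_logs = DatasetAppraisal(
    name="raw sensor logs",
    legal_or_policy_requirement=False,
    unique_or_hard_to_reproduce=True,
    special_value=False,
    high_reuse_potential=False,
)
print(suggest_action(raw_logs))  # review
print(criteria_met(raw_logs))    # ['Uniqueness and reproducibility']
```

Keeping the answers as simple yes/no fields means the reasoning behind each decision stays visible, which is useful if you need to justify later why a dataset was or was not deposited.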
Plan ahead!
An up-to-date data management plan helps the selection process, as it will already have covered many of these issues. It also ensures that sufficient metadata and documentation are prepared or in progress, so the data can be deposited and re-used relatively easily, minimising the additional effort needed.
You can see a full checklist that guides you through the decision-making process in this selection and appraisal PDF (internal-only). Do get in touch at researchdata@cranfield.ac.uk if you want to discuss this further.
Public domain image from Pixabay.com