Five reasons to reconsider submitting primary data to your journal publisher
06/06/2017

When you submit a paper for publication, you often provide supplementary information including the data used in the research. It’s important to make this data available for the paper’s readers, to provide the evidence for your findings and allow them to better understand your research and build on it. But often it is not appropriate to submit this data as supplementary information files to the publisher’s website – the data should be submitted to a data repository (such as CORD). Why?
- Supplementary information is usually made available in pdf form, which makes the data pretty non-editable. Have you ever read a paper and wanted to reuse their figures, and ended up tracing a graph or typing out a table that wouldn’t copy and paste from the pdf? This is one reason it is much better to provide the data in a reusable, editable format (such as csv, txt, rtf, or even xlsx if necessary) on a data repository.
- It is often derived data that is used as supplementary information, such as graphs and charts reporting selected values. However, it is the full primary dataset that is most useful to others and that allows your results to be properly validated. Graphs are certainly useful in demonstrating findings, and appropriate to include in your publication, but the full raw data should be shared via a data repository.
- Indeed, it is the full dataset that is most useful to you yourself in future, and by depositing it to a data repository, you’re assured of its long-term preservation and retrievability. Where the repository has archival storage and carries out digital preservation (both of which we are working hard to implement with CORD), you’ve got a much better chance of accessing and reusing this data should you need to in ten years’ time.
- On a similar note, a repository requires a licence to be assigned to the data, and then it is reusable according to those terms. The situation is less clear for supplementary information, so only publishing it there might reduce its reuse value. Is the article open access and is the supplementary information equally open access? Does it have a licence or did the publisher request transferral of copyright, and do they allow others to reuse and redistribute this data?
- The data may even be more interesting than the paper! Perhaps there are other uses for the data and researchers may want to cite your data, though they did not use your article. On a repository, your data gets a DOI and is citable with metrics available for its use (CORD gives you view and download figures, citation counts, and altmetrics).
So whilst we’re not advising you to stop using supplementary information, it is good to consider what data you have used in your article, and whether the most appropriate place for it is actually a data repository. Of course, don’t forget to link to it from the article in your data access statement (internal link) or references.
Image: Dice five, CC-BY-NC-SA 2.0
Categories & Tags:
Leave a comment on this post:
You might also like…
How do I access the full-text of Harvard Business Review (HBR)?
This is one of the most frequently asked questions in the School of Management Library, presumably because HBR is such a key management journal and is renowned worldwide. The short answer is via EBSCO Business ...
Want to find out more about data documentation?: Workshop on 1 June
Data documentation is essential to make sure that well-organised and well-documented research data can be produced from our research projects. It ensures that your data will be understood and interpreted by any user. It will ...
Working on your internship report
Instead of producing a traditional thesis, as covered in our earlier post, some students in the School of Management - and perhaps some in other Cranfield Schools too - will embark upon on an internship ...
Embracing a Marketing and Leadership MSc Apprenticeship.
A Q&A with Faizah Azeem. Why did you decide to undertake a postgraduate apprenticeship in Marketing and Leadership? Cranfield delivers a unique programme that sits well with professionals wanting to develop themselves into expert ...
What is ‘Digital Forensic Science’?
Despite being a fundamental tool for many organisations and criminal justice systems around the world, arguably digital forensic science as a discipline does not always get the recognition it deserves in media broadcasts. Therefore, public ...
Pandemic PhD to prospects PhD
Hello, my name is Danni and I’m a third-year PhD student in the School of Water, Energy, and Environment specifically within the Soil, Agrifood and Biosciences department. My PhD focuses on how biostimulants (seaweeds, ...