Credit where credit’s due – citing datasets and data papers
24/08/2016
Why would you cite datasets?
Citing datasets ensures credit is given to their creators and that data impact is tracked, as data is an important research output in its own right. If you use other people’s datasets, you should cite them as you would cite their papers. If you publish your own datasets and they underpin an article, you should cite the dataset in its own right, with an in-text pointer to an entry in the reference list. You may already be including data access statements to meet funder requirements, but full citations are still valuable: data access statements don’t treat data as a first-class record of research and don’t give due credit to the creators, especially where they’re different from the article authors.
How do you cite datasets?
As with other types of citation, advice on the exact elements to include can vary by style. The key elements are the identifier (see our post on DOIs), creator(s), title, date, and publisher. An example reference in Harvard-Cranfield would look like this:
Partridge, M.C. (2014) Spectra from LPG sensor repeat testing with 100ppm toluene. Cranfield University. 10.6084/m9.figshare.1004753.v1
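If you build reference lists programmatically, the key elements above can be assembled with a few lines of code. The following is a minimal sketch, not an official Harvard-Cranfield formatter: the `DatasetCitation` class and `format_citation` function are hypothetical names, and the way multiple creators are joined is an assumption, so check the Library's guide for the exact rules.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class DatasetCitation:
    """Key elements of a dataset reference (hypothetical helper class)."""
    creators: List[str]   # e.g. ["Partridge, M.C."]
    year: int             # publication date, reduced to the year
    title: str
    publisher: str
    identifier: str       # the DOI, exactly as given by the repository


def format_citation(c: DatasetCitation) -> str:
    """Lay out the elements in the order used in the example above."""
    if not c.creators:
        raise ValueError("a dataset citation needs at least one creator")
    # Assumption: several creators are joined as "A, B and C".
    if len(c.creators) > 1:
        names = ", ".join(c.creators[:-1]) + " and " + c.creators[-1]
    else:
        names = c.creators[0]
    return f"{names} ({c.year}) {c.title}. {c.publisher}. {c.identifier}"


citation = DatasetCitation(
    creators=["Partridge, M.C."],
    year=2014,
    title="Spectra from LPG sensor repeat testing with 100ppm toluene",
    publisher="Cranfield University",
    identifier="10.6084/m9.figshare.1004753.v1",
)
print(format_citation(citation))
```

Run as written, this prints the same reference as the example above. In practice, exporting the record to a reference manager is usually less error-prone than hand-rolled formatting.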
In CORD, our data repository, each item page offers several citation options: links to export the record directly to Mendeley or other reference managers, and a ‘cite’ tab that provides a cut-and-paste citation in a generic format. You may find the Library’s Quick guide to the author-date referencing style (internal pdf) useful, including its tips for referencing with Mendeley.
How do data papers fit in?
A data paper is a paper that describes a dataset rather than drawing conclusions from it, and is usually published in a dedicated data journal, such as Nature’s Scientific Data. (This list of data journals may be helpful.)
Some researchers write data papers to promote their datasets and to make it easier for others to credit them, as academics are more familiar with citing papers than datasets, although this is likely to change over time. If you write a data paper, a citation of the data paper counts as a citation of the dataset, so all uses of your data are acknowledged.
For more information, see this great UK Data Service pdf on data citation.
Image: thanks merci by infrogmation at https://www.flickr.com/photos/infrogmation/4136308490/, CC-BY 2.0.