Open Data citation advantage

2 Aug
2010

Quite recently my colleagues published a paper on genome-wide model of translation (in PLoS Computational Biology). They have used a number of different data, although in one case the data wasn’t available in the text or as a supplementary material – they needed to ask the other group for certain raw numbers from their experiment on ribosome profiling in yeast (published in Science). Data was shared promptly, so my colleagues could finish the project. They have cited original paper, plus they’ve expressed their gratitude mentioning data sharing in acknowledgements.

Because sharing data resulted in a citation, I wonder how long will it take for Open Data advocates to start using this “open data citation advantage” as an argument for sharing data?

Don’t get me wrong – I’m all for Open Foo, but I do get frustrated when “citation advantage” becomes a major or even the only point of going open. It’s obscuring debate on Open Foo and limiting it to the aspects only (some) scientists care about.

Must read: advantage, schmantage post by Bill Hooker on OA citation advantage.

  • August 2, 2010 at 7:43 pm Heather Piwowar
    Hi Pawel. The argument has indeed been started ;) http://www.plosone.org/article/info:doi/10.1371/journal.pone.0000308 (see also "will you be scooped or will you be famous" http://www.prio.no/Research-and-Publications/Publication/?oid=55406 ) fwiw, I think the citation argument can be useful in some circles, for a limited time. But I agree that it should not be the only argument, nor is it the most important. Also fwiw, my guess is the citation advantage in most areas is below the 70% I found, at which point it starts to be a pretty weak argument anyway.
  • August 3, 2010 at 6:24 am Pawel Szczesny
    Oh, how cool, thank you Heather. I've missed that one (and failed again to notice that there are important things that weren't published last week ;) ). Interesting point about field-specificity. Do you have the code available? You could crowdsource reanalysing citation advantage in other areas (I can dig in genomics).
  • August 3, 2010 at 6:21 pm Heather Piwowar
    Agreed, extending to other areas would/could be very interesting. The hard part is getting the baseline "datasets created" articles, then annotating them for whether data shared or not. We have a start in evolutionary biology thanks to DataONE intern Sarah Judson http://www.openwetware.org/wiki/User:Sarah_Judson/Notebook/DataOne_DataCitationPractices and I've got 11k gene expression microarray articles (25% have shared data) ready to gather citation stats for, just need the hours. Or someone else who wants to work on it too... code and data are avail! :) http://tinyurl.com/39y963b
  • August 3, 2010 at 6:27 pm Heather Piwowar
    btw Pawel, now I'm curious.... you think the citation advantage argument is counterproductive? Or you want to explore it? Or both??? I could understand both. People contain multitudes :)
  • August 3, 2010 at 6:45 pm Pawel Szczesny
    While I don't think the argument is absolutely counterproductive, I might shoot the lecturer after OD advocacy talk that mentions _only_ citation advantage (such talks happen in OA advocacy). But seriously I'm interested in exploring the argument however not because of advocacy, but rather I'm curious about field-specific nuances and sharing culture/sharing policies. Thanks for the links - I'll have a look.
  • August 3, 2010 at 10:09 pm Heather Piwowar
    Gathering citation data is pretty easy from Scopus through PubMed IDs... let me know if you want more info on that or on my data and code. Three more refs on sharing policies, fyi, are DataONE intern Nic Weber in evo bio: http://www.openwetware.org/wiki/DataONE:Notebook/Data_Citation_and_Sharing_Policy my 2008 ELPUB paper on microarray policies, and Kate McCain's 1995 paper on Mandating Sharing. We are due for an across-the-board look, it is true!

Add a comment on FriendFeed




2 Responses to Open Data citation advantage

Avatar

Tweets that mention ? Open Data citation advantage Circle of Complexity -- Topsy.com

August 3rd, 2010 at 1:26 am

[...] This post was mentioned on Twitter by Jonathan Eisen, MGED Society. MGED Society said: RT @phylogenomics: Liked "Open Data citation advantage" http://ff.im/-oFeqL [...]

Avatar

Notional Slurry » links for 2010-08-10

August 11th, 2010 at 6:03 am

[...] ? Open Data citation advantage Circle of Complexity "Because sharing data resulted in a citation, I wonder how long will it take for Open Data advocates to start using this ?open data citation advantage? as an argument for sharing data?" (tags: citation-etiquette economics open-access open-science open-data social-engineering academic-culture) [...]

Comment Form

top