Data Production and Release: We Need to Change Incentives

By mikethemadbiologist on May 11, 2010.

ScienceBlogling Revere calls for an open data policy for federally-funded research (italics mine):

We've inveighed often here about the shameful practice that many senior and well-respected flu scientists have of keeping their sequences private until they publish -- if they publish using them. If not, no one gets to see them, even if we paid with tax money to collect them. The motives are often unselfish -- a senior scientist trying to protect post-docs or grad students from being scooped. Very Old School. This is the 21st century. We have our own students and we take mentoring very seriously. And one of the things we teach them is that if they have information of importance to public health, then it is to be made public. You don't make any deals with anyone that you will keep it confidential. Period. And you don't keep hold of it on your own initiative, either. Influenza virus sequences are matters of public health importance. If you are worried your career or the career of your students or post docs will be harmed by releasing them as soon as practicable, then you are in the wrong field. Choose a field or a virus where it doesn't matter. But keeping those sequences private is part of the "culture of the discipline." And it needs to change.

I've discussed this before, and I agree that data needs to be free. But what we need to ensure is that the incentive structure for scientists accurately reflects this policy. In the real world, scientists are not typically not rewarded for producing data--not directly anyway*. We are rewarded for producing publications, which lead to funding, which in turn leads to more publications, and so on**. For most academic researchers, the publication is vital for career reasons.

Revere offers a solution I like (italics mine):

As an epidemiologist it can take me years of hard work to collect data. I want to use that data and reap its benefits, both for public health and for me personally and my students and post docs. That doesn't mean I get to hoard them. It means that I have to use them in a timely way. I have an advantage over everyone else because I know the data better than they do and I have it before they do. But I don't have any ownership rights over it. If someone else can use my work, that's what science is all about. Making it available and accessible should be part of the culture of my discipline. It isn't, sad to say. But what should also be part of the culture is that if I use someone else's data (or vice versa), data made accessible to me by virtue of a granting agency's policies, I should give full credit to those who collected it and that credit should count in terms of academic appointments and promotions.

Exactly! Data generators should be included as co-authors, at least within a certain time period after data release--the length of that window might have be to discipline-dependent. We have to create incentives, or at least, remove disincentives for sharing data more rapidly. Many fields would benefit from this.

*I work at an institution where our funding is primarily predicated on data generation, not publication (although publication is good!).

**What is this 'teaching' thing you speak of?[/snark]

More like this

I fully agree. In linguistics, we use what are known as corpora. (Computers are finally able to let us dive into vast quantities of language and not go insane.) The major corpora of English are the BNC and COCA, both of which are available freely online.

If such corpora were not freely and easily available, linguistics would surely be suffering.

Advertisment

Donate

ScienceBlogs is where scientists communicate directly with the public. We are part of Science 2.0, a science education nonprofit operating under Section 501(c)(3) of the Internal Revenue Code. Please make a tax-deductible donation if you value independent science communication, collaboration, participation, and open access.

You can also shop using Amazon Smile and though you pay nothing more we get a tiny something.

Science 2.0

Science Codex

More by this author

Program Announcement: I'm Moving

September 1, 2011

I've dropped some hints in the past that my relationship with ScienceBlogs would be...altered. Well, I've decided to leave. Mostly, it had to do with the issue of pseudonymity, although I'm very excited to hang out my own shingle once again. I don't want to rehash the issue of pseudonymity,…

Note to Unions: This Is Not How You Build a Coalition

September 1, 2011

The old saw that 'we hang together or we get hung separately' is a perfect description of how the left has disintegrated into irrelevance. Too often, groups will focus on modest gains for their own narrow constituency, while selling out other allies. Over the long term, each component of the…

Links 8/31/11

August 31, 2011

Links for you. Science: Underground river 'Rio Hamza' discovered 4km beneath the Amazon What do accommodationists do about creationist politicians? I've Been Told You Can Get Flu From the Flu Shot: False! Federal Work Suspension of Leading Arctic Scientist Ended as Investigation of His…

Meet the New New Math, Same As the Old New Math? What We Can Learn from Finland

August 31, 2011

Recently, The New York Times published an op-ed calling for curricular changes in K-12 math education: Today, American high schools offer a sequence of algebra, geometry, more algebra, pre-calculus and calculus (or a "reform" version in which these topics are interwoven). This has been codified by…

Links 8/30/11

August 30, 2011

Links for you. Another Scientist Calls Out Sen. Coburn's Misleading, Juvenile "Report" XMRV: ITS EVERYWHERE! UUUUUGH! ITS IN MY RACCOON WOUNDS! AND MY QIAGEN COLUMNS! Coulter Goes All Science-y in Bid to Disprove Evolution Yet another bad day for the anti-vaccine movement 2011 Antibiotics: Killing…