Wednesday, 21 April 2010

A minimum data sharing policy?

To follow up on the post about the 12 steps to creating a new data policy, here is a first set of thought of what a high-level template for a data sharing policy might look like.

All data policies provide guidelines for adhering to the principle that data sharing is a good thing. While the particular rules of sharing might be different for a variety of reasons, the general tenets are largely the same.

Might it be possible to encapsulate these basic tenets into a terse minimum data sharing policy that could fit many (all?) situations?

This was in fact one of the recommendation of our Omics data sharing paper:

"We recommend that a single, brief, high-level consensus guideline serve as a template for policy documents at the funder, community, and project levels. At its heart should be the public and timely release of data. It should be based on the principle that funders and the research community must work together to develop best practice."

If such a high level general policy is possible it could contain simple content like this:

A minimum data policy

As an initiative/project/research scientist you will:

- continuously seek to comply with best practices in data sharing
- comply with relevant policies of funders
- strive to share your data with your colleagues and the wider world
- comply with relevant reporting standards
- cite any and all use of public data
- support Open Access publishing and Open Source software solutions
- cost appropriate resources into your grants to cover basic
data management and data sharing best practices
- participate in community efforts when you have opinions,
expertise to contribute
- work with funding agencies to help support you (the community) in complying with this policy


Of course, authors would then add relevant details to these basic statements to make their specific stance explicit.

Right now these statements combine both principles (data sharing should be part of the fabric of research) and practices (use standards as far as possible).

While a data policy based just on practices could easily be distilled from such a general guideline, policies that included strong principles would best start to bring about real-world data sharing.

2 comments:

  1. Once we've chewed on that list for a bit, perhaps we could put this forward as the 'Biosharing Pledge'? Could perhaps make a little icon for people to put on their lab pages that links back to the above post (or some version of it) online -- like the 'Built with X'-style things one often encounters.

    ReplyDelete
  2. Last year, the Data Conservancy (one of the US NSF Data Net projects) sponsored a life sciences workshop in which a straw-man, over-arching data policy was developed. The workshop was attended by representatives from academia, government and the private sector and included life sciences, computer sciences and information sciences disciplines. The following are the key issues that were developed.

    1. Scientists have the right to first use of data they produce
    2. Data providers should have a choice of licenses
    3. Repositories must provide access to data they hold
    4. Scientists must receive attribution when data they produce are used
    5. Open formats should be used
    6. Existing standards should be used where available
    7. Tools and formats used should be free and open-source
    8. Data should be collected with interoperability in mind
    9. Some data are valuable and should be preserved over time

    The above can be embellished to create more granular data policies that meet the needs of the Life Sciences sub-domains.

    Many of these issues intersect with the minimum policies listed in this blog post. I would like to use biosharing as a platform for developing a minimum data sharing policy for the life sciences and encourage feedback from the community concerning the adequacy of these points.

    ReplyDelete

Sociable