"The lack of standards, for instance, confounds many a researcher seeking to harness the diversity of knowledge now available on any chosen topic. All credit, then, to those in the vanguard of interoperability."
Coping with Big Data requires Big Standards - big in the latter sense, meaning widely adopted, not extensive, standards. In fact, a common approach is to define a 'minimum' standard.
BioStandards: The three pillars of reporting standards are scope, semantics and syntax.
A large part of ensuring that data can be shared is defining common ways to do so. Reporting standards are composed of three aspects - scope, semantics and syntax. In other words, checklists, controlled vocabularies and ontologies and formats. In yet more words, checklists are text documents that describe the scope of a particular area of knowledge to be captured in a standardized way, cvs and ontologies are shared vocabularies that help semantically express that knowledge and file formats define a syntax that can be used to physically capture and exchange that information.
Reporting Standards allow sharing of data and promote interoperability
There are an increasing number of "minimum information" checklist projects in biology. This projects are defining essential - or minimal - descriptors for reporting different types of data. These communities have self-organized into a community called MIBBI - this stands for "Minimum Information about a Biological or Biomedical Investigation".
MIBBI: http://mibbi.org
A similar and more mature community exists for ontology development. The OBO Foundry now contains over 60 community-driven ontology development projects.
OBO Foundry: http://www.obofoundry.org/
Likewise there are an increasing number of format projects but they have yet to come together into a joined-up community. We have therefore simple started to collect up projects here on this page that we, for now, call the BioFormat community.
