STAR/mmCIF: an ontology for macromolecular structure

Bioinformatics. 2000 Feb;16(2):159-68. doi: 10.1093/bioinformatics/16.2.159.

Abstract

Motivation: Crystallographers were motivated 10 years ago to develop a simple and consistent data representation for the exchange and archiving of data associated with the crystallographic experiment and the final structure. As this process evolved (and the data grew at near exponential rates) came the recognition that this representation should also facilitate the automated management of the data and, with the aid of additional software for verification and validation, provide improved consistency and accuracy and hence improved scientific inquiry. This realization led to a new Dictionary Definition Language (DDL) and an extensive dictionary based on this DDL for describing macromolecular structure. In broad terms this could be considered an ontology. An important feature in the development of the ontology was the endorsement and ongoing maintenance and support of the International Union of Crystallography (IUCr). While the description of macromolecular structure and the x-ray crystallographic experiment used to derive it represent explicit data, the ontology is extensible and applicable to other less well-characterized data domains.

Results: Details of the DDL, the dictionaries that have been developed, and software for reading and using this ontology are presented.

Availability: Extensive documentation, software tools and the DDL and dictionaries are available from http://ndbserver.rutgers.edu/mmcif and associated mirror sites.

Contact: Bourne: bourne@sdsc.edu and Westbrook:jwest@rcsp.rutgers.edu

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Crystallography, X-Ray
  • Databases, Factual*
  • Humans
  • Information Storage and Retrieval*
  • Macromolecular Substances
  • Molecular Structure
  • Proteins / analysis
  • Software

Substances

  • Macromolecular Substances
  • Proteins