A survey of metabolic databases emphasizing the MetaCyc family

Peter D Karp; Ron Caspi

doi:10.1007/s00204-011-0705-2

A survey of metabolic databases emphasizing the MetaCyc family

Arch Toxicol. 2011 Sep;85(9):1015-33. doi: 10.1007/s00204-011-0705-2. Epub 2011 Apr 27.

Authors

Peter D Karp¹, Ron Caspi

Affiliation

¹ Bioinformatics Research Group, SRI International, 333 Ravenswood Ave, Menlo Park, CA, 94025, USA. pkarp@ai.sri.com

Abstract

Thanks to the confluence of genome sequencing and bioinformatics, the number of metabolic databases has expanded from a handful in the mid-1990s to several thousand today. These databases lie within distinct families that have common ancestry and common attributes. The main families are the MetaCyc, KEGG, Reactome, Model SEED, and BiGG families. We survey these database families, as well as important individual metabolic databases, including multiple human metabolic databases. The MetaCyc family is described in particular detail. It contains well over 1,000 databases, including highly curated databases for Escherichia coli, Saccharomyces cerevisiae, Mus musculus, and Arabidopsis thaliana. These databases are available through a number of web sites that offer a range of software tools for querying and visualizing metabolic networks. These web sites also provide multiple tools for analysis of gene expression and metabolomics data, including visualization of those datasets on metabolic network diagrams and over-representation analysis of gene sets and metabolite sets.

Publication types

Research Support, N.I.H., Extramural
Review

MeSH terms

Computational Biology* / classification
Computational Biology* / standards
Databases, Factual* / classification
Databases, Factual* / standards
Databases, Genetic
Enzymes / metabolism
Information Storage and Retrieval / methods
Internet
Metabolic Networks and Pathways*
Software
User-Computer Interface

Substances

Enzymes

Abstract

Publication types

MeSH terms

Substances

Grants and funding