Biological consequences of ancient gene acquisition and duplication in the large genome of Candidatus Solibacter usitatus Ellin6076

PLoS One. 2011;6(9):e24882. doi: 10.1371/journal.pone.0024882. Epub 2011 Sep 15.

Abstract

Members of the bacterial phylum Acidobacteria are widespread in soils and sediments worldwide and are abundant in many soils. Acidobacteria are challenging to culture in vitro, and many basic features of their biology and functional roles in the soil have not been determined. Candidatus Solibacter usitatus strain Ellin6076 has a 9.9 Mb genome that is approximately 2-5 times as large as the other sequenced Acidobacteria genomes. Bacterial genome sizes typically range from 0.5 to 10 Mb and are influenced by gene duplication, horizontal gene transfer, gene loss and other evolutionary processes. Our comparative genome analyses indicate that the large Ellin6076 genome arose through horizontal gene transfer via ancient bacteriophage and/or plasmid-mediated transduction, and through widespread small-scale gene duplications, resulting in an increased number of paralogs. Low amino acid sequence identities among functional group members, and the lack of conserved gene order and orientation in regions containing similar groups of paralogs, suggest that most of the paralogs are not the result of recent duplication events. The genome sizes of additional cultured Acidobacteria strains were estimated using pulsed-field gel electrophoresis to determine the prevalence of the large genome trait within the phylum. Members of subdivision 3 had larger genomes than those of subdivision 1, but none were as large as the Ellin6076 genome. The large genome of Ellin6076 may not be typical of the phylum, and it encodes traits that could provide a selective metabolic, defensive and regulatory advantage in the soil environment.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Acidobacteria / genetics*
  • Codon / genetics
  • Gene Duplication / genetics*
  • Genes, Bacterial / genetics
  • Genome Size / genetics
  • Genome, Bacterial / genetics*
  • Likelihood Functions
  • Phylogeny
  • RNA, Ribosomal, 16S / genetics
  • Selection, Genetic
  • Sequence Homology, Nucleic Acid

Substances

  • Codon
  • RNA, Ribosomal, 16S