The distinction of CPR bacteria from other bacteria based on protein family content

Nat Commun. 2019 Sep 13;10(1):4173. doi: 10.1038/s41467-019-12171-z.

Abstract

Candidate phyla radiation (CPR) bacteria separate phylogenetically from other bacteria, but the organismal distribution of their protein families remains unclear. Here, we leveraged sequences from thousands of uncultivated organisms and identified protein families that co-occur in genomes and are thus likely foundational for lineage capacities. Protein family presence/absence patterns cluster CPR bacteria together, and away from all other bacteria and archaea, partly due to proteins without recognizable homology to proteins in other bacteria. Some of these proteins are likely involved in cell-cell interactions and potentially important for episymbiotic lifestyles. The diversity of protein family combinations in CPR may exceed that of all other bacteria. Over the bacterial tree, protein family presence/absence patterns broadly recapitulate phylogenetic structure, suggesting persistence of core sets of proteins since lineage divergence. The CPR could have arisen in an episode of dramatic but heterogeneous genome reduction or from a protogenote community and co-evolved with other bacteria.
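
The idea of grouping genomes by protein family presence/absence can be illustrated with a small sketch. The abstract does not state which distance measure or clustering procedure was used, so the Jaccard distance and average-linkage merging below are stand-in assumptions, and the genome names and family IDs are invented toy data rather than results from the study.

```python
from itertools import combinations


def jaccard_distance(a, b):
    """Return 1 - |A & B| / |A | B| for two sets of protein family IDs."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def average_linkage_clusters(profiles, k):
    """Repeatedly merge the two clusters with the smallest average
    pairwise Jaccard distance until only k clusters remain."""
    clusters = [[name] for name in profiles]
    while len(clusters) > k:
        best = None
        for i, j in combinations(range(len(clusters)), 2):
            dists = [
                jaccard_distance(profiles[x], profiles[y])
                for x in clusters[i]
                for y in clusters[j]
            ]
            avg = sum(dists) / len(dists)
            if best is None or avg < best[0]:
                best = (avg, i, j)
        _, i, j = best
        # i < j always holds, so deleting index j leaves index i valid.
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return [sorted(c) for c in clusters]


# Hypothetical presence/absence profiles: genome -> set of families present.
profiles = {
    "cpr_1": {"fam01", "fam02", "fam03", "fam10"},
    "cpr_2": {"fam01", "fam02", "fam03", "fam11"},
    "cpr_3": {"fam01", "fam02", "fam10", "fam11"},
    "other_1": {"fam01", "fam20", "fam21", "fam22", "fam23"},
    "other_2": {"fam01", "fam20", "fam21", "fam22", "fam24"},
}

clusters = average_linkage_clusters(profiles, k=2)
print(clusters)
```

In this toy input the three "cpr" genomes share most of their families with each other and only one family with the "other" genomes, so they end up in one cluster. Real analyses of this kind involve thousands of genomes and families and would normally rely on an established clustering library.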

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Bacteria / classification*
  • Bacteria / genetics
  • Bacteria / metabolism*
  • Bacterial Proteins / genetics
  • Bacterial Proteins / metabolism
  • Genome, Bacterial / genetics
  • Metagenomics
  • Phylogeny

Substances

  • Bacterial Proteins