Identification of Ohnolog Genes Originating from Whole Genome Duplication in Early Vertebrates, Based on Synteny Comparison across Multiple Genomes

PLoS Comput Biol. 2015 Jul 16;11(7):e1004394. doi: 10.1371/journal.pcbi.1004394. eCollection 2015 Jul.

Abstract

Whole genome duplications (WGD) have now been firmly established in all major eukaryotic kingdoms. In particular, all vertebrates descend from two rounds of WGDs, that occurred in their jawless ancestor some 500 MY ago. Paralogs retained from WGD, also coined 'ohnologs' after Susumu Ohno, have been shown to be typically associated with development, signaling and gene regulation. Ohnologs, which amount to about 20 to 35% of genes in the human genome, have also been shown to be prone to dominant deleterious mutations and frequently implicated in cancer and genetic diseases. Hence, identifying ohnologs is central to better understand the evolution of vertebrates and their susceptibility to genetic diseases. Early computational analyses to identify vertebrate ohnologs relied on content-based synteny comparisons between the human genome and a single invertebrate outgroup genome or within the human genome itself. These approaches are thus limited by lineage specific rearrangements in individual genomes. We report, in this study, the identification of vertebrate ohnologs based on the quantitative assessment and integration of synteny conservation between six amniote vertebrates and six invertebrate outgroups. Such a synteny comparison across multiple genomes is shown to enhance the statistical power of ohnolog identification in vertebrates compared to earlier approaches, by overcoming lineage specific genome rearrangements. Ohnolog gene families can be browsed and downloaded for three statistical confidence levels or recompiled for specific, user-defined, significance criteria at http://ohnologs.curie.fr/. In the light of the importance of WGD on the genetic makeup of vertebrates, our analysis provides a useful resource for researchers interested in gaining further insights on vertebrate evolution and genetic diseases.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Chromosome Mapping / methods
  • Gene Dosage / genetics
  • Gene Duplication / genetics*
  • Genetic Linkage / genetics*
  • Genome / genetics*
  • Humans
  • Sequence Homology, Amino Acid*
  • Species Specificity
  • Synteny / genetics*
  • Vertebrates / genetics*

Grants and funding

PPS acknowledges a PhD fellowship from Erasmus Mundus (UPMC) and La Ligue Contre le Cancer. HI acknowledges funding from Foundation Pierre-Gilles de Gennes, grant FPGG025. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.