Comparative genomic analysis of human and chimpanzee indicates a key role for indels in primate evolution

J Mol Evol. 2006 Nov;63(5):682-90. doi: 10.1007/s00239-006-0045-7. Epub 2006 Oct 29.

Abstract

Sequence comparison of humans and chimpanzees is of interest to understand the mechanisms behind primate evolution. Here we present an independent analysis of human chromosome 21 and the high-quality BAC clone sequences of the homologous chimpanzee chromosome 22. In contrast to previous studies, we have used global alignment methods and Ensembl predictions of protein coding genes (n = 224) for the analysis. Divergence due to insertions and deletions (indels) along with substitutions was examined separately for different genomic features (coding, noncoding genic, and intergenic sequence). The major part of the genomic divergence could be attributed to indels (5.07%), while the nucleotide divergence was estimated as 1.52%. Thus the total divergence was estimated as 6.58%. When excluding repeats and low-complexity DNA the total divergence decreased to 2.37%. The chromosomal distribution of nucleotide substitutions and indel events was significantly correlated. To further examine the role of indels in primate evolution we focused on coding sequences. Indels were found within the coding sequence of 13% of the genes and approximately half of the indels have not been reported previously. In 5% of the chimpanzee genes, indels or substitutions caused premature stop codons that rendered the affected transcripts nonfunctional. Taken together, our findings demonstrate that indels comprise the majority of the genomic divergence. Furthermore, indels occur frequently in coding sequences. Our results thereby support the hypothesis that indels may have a key role in primate evolution.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Base Sequence
  • Chromosome Mapping
  • DNA, Intergenic / genetics
  • Evolution, Molecular*
  • Genetic Variation
  • Genome / genetics*
  • Humans
  • Molecular Sequence Data
  • Mutagenesis, Insertional*
  • Open Reading Frames / genetics
  • Pan troglodytes
  • Primates / genetics*
  • Sequence Alignment
  • Sequence Analysis, DNA
  • Sequence Deletion*

Substances

  • DNA, Intergenic