OrthoMaM v8: A Database of Orthologous Exons and Coding Sequences for Comparative Genomics in Mammals

Douzery, Emmanuel J. P.; Scornavacca, Celine; Romiguier, Jonathan; Belkhir, Khalid; Galtier, Nicolas; Delsuc, Frédéric; Ranwez, Vincent

doi:10.1093/molbev/msu132

Comparative genomic studies extensively rely on alignments of orthologous sequences. Yet, selecting, gathering, and aligning orthologous exons and protein-coding sequences (CDS) that are relevant for a given evolutionary analysis can be a difficult and time-consuming task. In this context, we developed OrthoMaM, a database of ORTHOlogous MAmmalian Markers describing the evolutionary dynamics of orthologous genes in mammalian genomes using a phylogenetic framework. Since its first release in 2007, OrthoMaM has regularly evolved, not only to include newly available genomes but also to incorporate up-to-date software in its analytic pipeline. This eighth release integrates the 40 complete mammalian genomes available in Ensembl v73 and provides alignments, phylogenies, evolutionary descriptor information, and functional annotations for 13,404 single-copy orthologous CDS and 6,953 long exons. The graphical interface allows to easily explore OrthoMaM to identify markers with specific characteristics (e.g., taxa availability, alignment size, %G+C, evolutionary rate, chromosome location). It hence provides an efficient solution to sample preprocessed markers adapted to user-specific needs. OrthoMaM has proven to be a valuable resource for researchers interested in mammalian phylogenomics, evolutionary genomics, and has served as a source of benchmark empirical data sets in several methodological studies. OrthoMaM is available for browsing, query and complete or filtered downloads at http://www.orthomam.univ-montp2.fr/.

orthologous sequences, mammals, coding sequences, phylogenomics, comparative genomics

Introduction

Orthologous protein-coding sequences (CDS) are of great interest to study patterns of organismal evolution (species phylogenies) and genomic processes (molecular evolution). The wide use of exons and CDS in phylogenomics and comparative genomics is facilitated by the existence of several independent databases of orthologs (Alexeyenko et al. 2006), each with their pros and cons. Some are generalist—for example, COG/KOG (Tatusov et al. 2003), HOGENOM (Dufayard et al. 2005), and InParanoid (Östlund et al. 2010), some are taxonomically specialized—for example, OPTIC (Heger and Ponting 2008) for vertebrates, INVHOGEN (Paulsen and von Haeseler 2006) for nonvertebrates, EvolMarkers (Li et al. 2012) for metazoans, FUNYBASE for fungi (Marthey et al. 2008), GreenPhylDB (Conte et al. 2008) for plants, HOBACGEN (Perriere et al. 2000) for bacteria, and some are built on functional information, such as OrthoDisease (O’Brien et al. 2004). In particular taxonomic groups, researchers have identified potentially useful phylogenetic DNA markers from complete genomes and have validated their use in nonmodel species such as primates (Horvath et al. 2008), actinopterygian fishes (Li et al. 2007), or rosids (Duarte et al. 2010). However, these databases generally do not provide end-users with key parameters describing the evolutionary pattern of orthologs, and orientating the choice of the molecular markers to be studied from the viewpoint of phylogenomic and molecular evolution. Also, few of them provide high-quality nucleotide and amino acid alignments preserving the key underlying codon structure.

OrthoMaM (Ranwez et al. 2007) is a database of ORTHOlogous MAmmalian coding sequence Markers, which helps filling these gaps. It provides high-quality codon alignments of exon and CDS markers associated with a detailed characterization of their evolutionary dynamics in terms of phylogenetic signal, base composition, substitution rate, and chromosome location. Moreover, OrthoMaM focuses only on one-to-one orthologs identified by Ensembl (Flicek et al. 2014), that is, sequences for which no duplication is detected since the last common ancestor of the corresponding species. Indeed, as one-to-one orthologs are unaffected by complex intragenomic processes such as gene duplication or gene loss, the differences in their sequences are ensured to have occurred through common descent and therefore reflect the divergence between species.

Database Overview and Improvements

Mammalia is among the first animal taxa with many complete genomes available and has been extensively used to define most of the gold-standard methods in phylogenomic and molecular evolution studies. Based on the 12 mammalian genomes available in Ensembl v41, the first version of OrthoMaM was released in July 2007 and contained 3,170 exons (Ranwez et al. 2007).

Several major improvements have been made since then. In the current version (OrthoMaM version 8, October 2013, based on Ensembl v73), the database includes 6,953 exons and covers 40 mammalian species. In addition to exons, full orthologous CDS are now available. Queries have been made more flexible and can be performed taxonomically. Results can be dynamically sorted according to key descriptors, for example, number of orthologs, alignment length, α parameter of the among-site substitution rate heterogeneity, and G+C nucleotide composition on third codon positions (%GC3). The latter statistics has recently been connected to the performance of CDS as phylogenetic markers (Romiguier et al. 2013). Nucleotide and amino acid alignments, maximum likelihood (ML) gene trees, and detailed marker information can be downloaded for all exons and CDS. To improve readability, the phylogenetic tree of each marker is colored according to the major mammalian clades using the APE package (Paradis 2006). We also enriched the information associated with each marker by linking exons to their corresponding CDS and including functional annotations (gene ontology concepts) graphically displayed thanks to OntoFocus (Ranwez et al. 2012) and Graphviz (Ellson et al. 2002). Figure 1 displays screenshots associated with a given query on the OrthoMaM website.

Fig. 1.

Open in new tab Download slide

Screenshots from the OrthoMaM website. Here, we searched for CDS with 15–40 mammals, a relative evolutionary rate between 0.5 and 3, an α parameter of the Γ distribution ranging from 1 to 1.5, and a GC3 between 22% and 35%. We got 23 target CDS and focused on the LRRC63 marker. We then visualized the evolutionary dynamics parameters, the first 80 sites of the DNA alignment, and the corresponding phylogenetic tree.

The current OrthoMaM release contains a total of 13,404 CDS markers covering half of the known mammalian genes and providing a uniform representativity along chromosomes (fig. 2a). However, the number of available CDS widely varies among species, mainly because of the uneven sequencing coverage of the corresponding genomes. Figure 2b provides the phylogeny of the 40 species represented in OrthoMaM together with the number of CDS available for the different species and clades. For example, 973 CDS markers share the full set of 36 placental mammals of OrthoMaM, and 5,806 CDS markers share the full subset of 10 primates.

Fig. 2.

Open in new tab Download slide

(a) Genomic distribution of OrthoMaM CDS along human chromosome 1. The ideogram for human chromosome 1 is provided together with the distribution of OrthoMaM CDS (dark red bars to the right). The distributions of Ensembl predicted genes (white bars) and database-known genes (red bars) are also indicated (centre). (b) The phylogeny of the 40 species present in OrthoMaM. For each species, we provide the number of available CDS. For each node, we also indicate the number of CDS markers containing all species of the corresponding clade.

The OrthoMaM Pipeline

Identification of Orthologous Sequences

We start by using Ensembl annotations (Flicek et al. 2014) to identify one-to-one orthologous genes among pairs of three high-coverage reference species (Homo–Mus, Homo–Canis, and Mus–Canis). We then enrich each of those clusters of one-to-one orthologs by adding sequences of additional mammals that are annotated as one-to-one orthologs to the human gene (Ranwez et al. 2007). Note that the chromosomal distribution of OrthoMaM human genes basically mirrors the distribution of the full set of Ensembl human genes (fig. 2a), which is to be expected from an unbiased database.

Those clusters of one-to-one orthologous genes are turned into clusters of one-to-one orthologous CDS by selecting the longest transcript of each gene. We choose to consider the longest sequence as this is the one used by Ensembl to define the orthology relationships among genes, and this will maximize the evolutionary information to be analyzed.

The one-to-one orthologous exon clusters are not provided by Ensembl. Their identification is complicated by alternative splicing and by the variability in number and length of exons of a given gene across species. We tackle those problems by relying on the alignments of the one-to-one orthologous CDS to infer one-to-one orthology among their exons. Each human exon annotated by Ensembl initiates a one-to-one orthologous exon cluster. Exons from additional species are added to this cluster if they share a number of identical amino acids greater than half the length of the CDS alignment restricted to the candidate exon and the human one. This similarity threshold ensures that no more than one exon from a given species will be included in the predicted set of orthologs. Initial exon alignments longer than 400 sites are selected as our evolutionary marker descriptors are not meaningful enough for shorter sequences. Clusters with less than four sequences are discarded for the same reason.

Alignments and Trees

CDS and exon sequences are aligned at the codon level in two steps. First, the translated amino acids are aligned using MAFFT (Katoh et al. 2005) and gaps are reported onto the nucleotide sequences. This alignment is then refined using MACSE (Ranwez et al. 2011) to obtain a final codon alignment unaffected by frameshifts, misassemblies, and sequencing errors. Nucleotide and amino acid alignments are then filtered to remove spurious sequences and/or codons using trimAl (Capella-Gutiérrez et al. 2009). The filtering is conducted under the “automated1” option, which has been specifically designed to clean alignments before conducting ML phylogenetic inference. This step can yield final alignments shorter than 400 sites though the average length is far higher for both exons (956 sites) and CDS (1,850 sites). To ensure data traceability, each sequence is linked to the corresponding Ensembl CDS/exon. Moreover, each OrthoMaM alignment is available for download before and after filtering. All previous releases of OrthoMaM also remain available through the website.

The ML tree is identified for each marker by analyzing codon alignments with RAxML (Stamatakis 2006) under the general time reversible (GTR)+Γ model (Yang 1996). We acknowledge that using the proper model of sequence evolution is vital in probabilistic inference. However, we here used the same model for all CDS and exons because 1) it warrants a fair comparison among all markers of the database, 2) it is the one that best fits the majority of the markers (Ranwez et al. 2007), 3) the GTR exchangeability matrix is the only one available at the nucleotide level in RAxML, and 4) the parameter-rich GTR+Γ model is more likely to introduce increased variance rather than bias in the estimates (Lemmon and Moriarty 2004).

All parameters describing the evolutionary dynamics of exons and CDS are gathered by running PAUP* (Swofford 2003) on the ML tree inferred by RAxML. Branch lengths of ML phylograms are also examined, and if some exceed the unrealistic value of two substitutions per site, the corresponding alignment is excluded from OrthoMaM. This phylogenetic-based filter enables to detect and remove markers that likely contain misaligned sequences, misspecified open reading frames, or misannotated paralogs.

Database Updates and Scalability

OrthoMaM is regularly updated and its pipeline is constantly optimized to keep pace with the ever increasing number of available genomes and software developments in the field. Orthology annotation and sequences are now retrieved using the BioMart facilities, which allow massive retrieval of Ensembl data (Flicek et al. 2014). Those data are processed by home made Java tools to identify clusters of one-to-one orthologous CDS/exons. Phylogenetic analyses rely on shell scripts to chain up-to-date software. The website is based on a php/mysql database for query facilities and on XML/XSLT for exchange and graphic representation of marker details. All analyses are performed on the computing cluster of the Montpellier Bioinformatics Biodiversity (MBB) platform.

Query Options

There are three entry points in OrthoMaM. First, exon and CDS sections can be graphically browsed using a clickable phylogeny and ideograms of human chromosomes. Second, markers can be queried according to several of their key characteristics, including: minimal alignment length, number of sequences, mandatory species, base composition (%GC3), relative evolutionary rate of the marker, Ensembl gene identifier or HUGO gene symbol (see fig. 1). Third, a BLAST (Altschul et al. 1990) similarity search can be run to find OrthoMaM markers related to a given request.

Examples of Contributions

OrthoMaM has proven its usefulness in several phylogenomic and comparative genomic studies. We briefly list some of them to illustrate the broad spectrum of analyses facilitated by OrthoMaM. Our database has been used for developing new markers in multigene phylogenetic studies (Zhou et al. 2011; Hassanin et al. 2013) and also as a source of large-scale molecular data in phylogenomic (Parker et al. 2013; Romiguier et al. 2013), molecular dating (Schrago and Voloch 2013), and evolutionary genomic (Galtier et al. 2009; Romiguier et al. 2010; Rorick and Wagner 2011; Lartillot 2013) analyses. The high-quality codon alignments have also been utilized as benchmark empirical data sets for testing new analytical methods (Egan et al. 2008; López-Giráldez and Townsend 2011; Li and Drummond 2012; Wu et al. 2013) and for detecting footprints of purifying or positive selection (Jobson et al. 2010; Laguette et al. 2012). Finally, the inferred ML gene trees have served for assessing the performance of supertree methods (Scornavacca et al. 2008; Ranwez et al. 2010). With the ongoing pace of mammalian genome sequencing, we envision an enhanced potential for the uses of OrthoMaM in comparative genomic studies aiming at understanding the evolutionary dynamics of protein-coding genes.

Future Prospects

The primary aim of OrthoMaM is to provide high-quality genome scale alignments and phylogenetic analysis for one-to-one orthologous exons and CDS among mammals. Its analysis pipeline strategy has been adapted to cope with the increasing number of mammalian genomes that will be released in the upcoming years. This bioinformatic pipeline is constantly improved and we are currently testing the possibility of relying on codon-based phylogenetic inference using codon-phyML (Gil et al. 2013) and including in future releases per branch dN/dS estimations using mapNH (Romiguier et al. 2012). Moreover, we are considering possible solutions to filter only parts of a sequence in order to further improve the quality of our codon alignments with respect to potential exon annotation errors in CDS. We are also evaluating the relevance of expanding the database toward noncoding markers, such as intronic, untranslated, and regulatory regions.

Acknowledgments

This work was supported by the Montpellier Bioinformatics Biodiversity platform, the Agence Nationale de la Recherche “Investissements d’avenir/Bioinformatique” (ANR-10-BINF-01-02 “Ancestrome”), and the European Research Council (“PopPhyl”: Population phylogenomics). The authors thank two reviewers for their comments on the manuscript. This publication is contribution 2014-022 of the Institut des Sciences de l’Evolution de Montpellier (UMR 5554—CNRS-IRD-UM2).

References

Alexeyenko

A

Lindberg

J

Pérez-Bercoff

A

Sonnhammer

E

Overview and comparison of ortholog databases

Drug Discov Today Technol.

2006

3

137

143

Google Scholar

Crossref

PubMed

WorldCat

Altschul

S

Gish

W

Miller

W

Myers

E

Lipman

D

Basic local alignment search tool

J Mol Biol.

1990

215

403

410

Google Scholar

Crossref

PubMed

WorldCat

Capella-Gutiérrez

S

Silla-Martínez

JM

Gabaldón

T

trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses

Bioinformatics

2009

25

15

1972

1973

Google Scholar

Crossref

PubMed

WorldCat

Conte

MG

Gaillard

S

Lanau

N

Rouard

M

Périn

C

GreenPhylDB: a database for plant comparative genomics

Nucleic Acids Res.

2008

36

D991

D998

Google Scholar

Crossref

PubMed

WorldCat

Duarte

J

Wall

P

Edger

P

Landherr

L

Ma

H

Pires

J

Leebens-Mack

J

dePamphilis

C

Identification of shared single copy nuclear genes in Arabidopsis, Populus, Vitis and Oryza and their phylogenetic utility across various taxonomic levels

BMC Evol Biol.

2010

10

61

Google Scholar

Crossref

PubMed

WorldCat

Dufayard

JF

Duret

L

Penel

S

Gouy

M

Rechenmann

F

Perriere

G

Tree pattern matching in phylogenetic trees: automatic search for orthologs or paralogs in homologous gene sequence databases

Bioinformatics

2005

21

2596

2603

Google Scholar

Crossref

PubMed

WorldCat

Egan

A

Mahurkar

A

Crabtree

J

Badger

JH

Carlton

JM

Silva

JC

IDEA: interactive display for evolutionary analyses

BMC Bioinformatics

2008

9

524

Google Scholar

Crossref

PubMed

WorldCat

Ellson

J

Gansner

E

Koutsofios

L

North

SC

Woodhull

G

Graphviz—open source graph drawing tools

Graph Drawing

2002

2265

483

484

Google Scholar

OpenURL Placeholder Text

WorldCat

Flicek

P

Amode

MR

Barrell

D

Beal

K

Billis

K

Brent

S

Carvalho-Silva

D

Clapham

P

Coates

G

Fitzgerald

S

et al.

Ensembl 2014

Nucleic Acids Res.

2014

42

Database issue

D749

D755

Google Scholar

Crossref

PubMed

WorldCat

Galtier

N

Duret

L

Glémin

S

Ranwez

V

GC-biased gene conversion promotes the fixation of deleterious amino acid changes in primates

Trends Genet.

2009

25

1

5

Google Scholar

Crossref

PubMed

WorldCat

Gil

M

Zanetti

MS

Zoller

S

Anisimova

M

Codonphyml: fast maximum likelihood phylogeny estimation under codon substitution models

Mol Biol Evol.

2013

30

6

1270

1280

Google Scholar

Crossref

PubMed

WorldCat

Hassanin

A

An

J

Ropiquet

A

Nguyen

TT

Couloux

A

Combining multiple autosomal introns for studying shallow phylogeny and taxonomy of Laurasiatherian mammals: application to the tribe Bovini (Cetartiodactyla, Bovidae)

Mol Phylogenet Evol.

2013

66

3

766

775

Google Scholar

Crossref

PubMed

WorldCat

Heger

A

Ponting

CP

OPTIC: orthologous and paralogous transcripts in clades

Nucleic Acids Res.

2008

36

D267

D270

Google Scholar

Crossref

PubMed

WorldCat

Horvath

JE

Weisrock

DW

Embry

SL

Fiorentino

I

Balhoff

JP

Kappeler

P

Wray

GA

Willard

HF

Yoder

AD

Development and application of a phylogenomic toolkit: resolving the evolutionary history of Madagascar’s lemurs

Genome Res.

2008

18

489

499

Google Scholar

Crossref

PubMed

WorldCat

Jobson

R

Nabholz

B

Galtier

N

An evolutionary genome scan for longevity-related natural selection in mammals

Mol Biol Evol.

2010

27

840

847

Google Scholar

Crossref

PubMed

WorldCat

Katoh

K

Kuma

K

Toh

H

Miyata

T

MAFFT version 5: improvement in accuracy of multiple sequence alignment

Nucleic Acids Res.

2005

33

511

518

Google Scholar

Crossref

PubMed

WorldCat

Laguette

N

Rahm

N

Sobhian

B

Chable-Bessia

C

Munch

J

Snoeck

J

Sauter

D

Switzer

WM

Heneine

W

Kirchhoff

F

et al.

Evolutionary and functional analyses of the interaction between the myeloid restriction factor SAMHD1 and the lentiviral Vpx protein

Cell Host Microbe

2012

11

205

217

Google Scholar

Crossref

PubMed

WorldCat

Lartillot

N

Phylogenetic patterns of gc-biased gene conversion in placental mammals and the evolutionary dynamics of recombination landscapes

Mol Biol Evol.

2013

30

3

489

502

Google Scholar

Crossref

PubMed

WorldCat

Lemmon

A

Moriarty

E

The importance of proper model assumption in Bayesian phylogenetics

Syst Biol.

2004

53

265

277

Google Scholar

Crossref

PubMed

WorldCat

Li

C

Riethoven

J-J

Naylor

G

Evolmarkers: a database for mining exon and intron markers for evolution, ecology and conservation studies

Mol Ecol Res.

2012

12

967

971

Google Scholar

Crossref

WorldCat

Li

CH

Orti

G

Zhang

G

Lu

GQ

A practical approach to phylogenomics: the phylogeny of ray-finned fish (Actinopterygii) as a case study

BMC Evol Biol.

2007

7

44

Google Scholar

Crossref

PubMed

WorldCat

Li

WLS

Drummond

AJ

Model averaging and bayes factor calculation of relaxed molecular clocks in Bayesian phylogenetics

Mol Biol Evol.

2012

29

2

751

761

Google Scholar

Crossref

PubMed

WorldCat

López-Giráldez

F

Townsend

JP

Phydesign: an online application for profiling phylogenetic informativeness

BMC Evol Biol.

2011

11

1

152

Google Scholar

Crossref

PubMed

WorldCat

Marthey

S

Aguileta

G

Rodolphe

F

Gendrault

A

Giraud

T

Fournier

E

Lopez-Villavicencio

M

Gautier

A

Lebrun

M-H

Chiapello

H

FUNYBASE: a FUNgal phYlogenomic dataBASE

BMC Bioinformatics

2008

9

456

Google Scholar

Crossref

PubMed

WorldCat

O’Brien

K

Westerlund

I

Sonnhammer

E

OrthoDisease: a database of human disease orthologs

Hum Mutat.

2004

24

112

119

Google Scholar

Crossref

PubMed

WorldCat

Östlund

G

Schmitt

T

Forslund

K

Köstler

T

Messina

D

Roopra

S

Frings

O

Sonnhammer

E

InParanoid 7: new algorithms and tools for eukaryotic orthology analysis

Nucleic Acids Res.

2010

38

D196

D203

Google Scholar

Crossref

PubMed

WorldCat

Paradis

E

Analysis of phylogenetics and evolution with R. Use R! series

2006

New York

Springer Science+Business Media

Google Scholar

Google Preview

OpenURL Placeholder Text

WorldCat

Parker

J

Tsagkogeorga

G

Cotton

JA

Liu

Y

Provero

P

Stupka

E

Rossiter

SJ

Genome-wide signatures of convergent evolution in echolocating mammals

Nature

2013

502

7470

228

231

Google Scholar

Crossref

PubMed

WorldCat

Paulsen

I

von Haeseler

A

INVHOGEN: a database of homologous invertebrate genes

Nucleic Acids Res.

2006

34

Database issue

D349

D353

Google Scholar

Crossref

PubMed

WorldCat

Perriere

G

Duret

L

Gouy

M

HOBACGEN: database system for comparative genomics in bacteria

Genome Res.

2000

10

379

385

Google Scholar

Crossref

PubMed

WorldCat

Ranwez

V

Criscuolo

A

Douzery

EJ

Supertriplets: a triplet-based supertree approach to phylogenomics

Bioinformatics

2010

26

12

i115

i123

Google Scholar

Crossref

PubMed

WorldCat

Ranwez

V

Delsuc

F

Ranwez

S

Belkhir

K

Tilak

M

Douzery

EJP

OrthoMaM: a database of orthologous genomic markers for placental mammal phylogenetics

BMC Evol Biol.

2007

7

241

Google Scholar

Crossref

PubMed

WorldCat

Ranwez

V

Harispe

S

Delsuc

F

Douzery

E

MACSE: multiple alignment of coding sequences accounting for frameshifts and stop codons

PLoS One

2011

6

9

e22594

Google Scholar

Crossref

PubMed

WorldCat

Ranwez

V

Ranwez

S

Janaqi

S

Subontology extraction using hyponym and hypernym closure on is-a directed acyclic graphs

IEEE Trans Knowl Data Eng.

2012

24

12

2288

2300

Google Scholar

Crossref

WorldCat

Romiguier

J

Figuet

E

Galtier

N

Douzery

E

Boussau

B

Dutheil

J

Ranwez

V

Fast and robust characterization of time-heterogeneous sequence evolutionary processes using substitution mapping

PLoS One

2012

7

3

e33852

Google Scholar

Crossref

PubMed

WorldCat

Romiguier

J

Ranwez

V

Delsuc

F

Galtier

N

Douzery

EJ

Less is more in mammalian phylogenomics: AT-rich genes minimize tree conflicts and unravel the root of placental mammals

Mol Biol Evol.

2013

30

9

2134

2144

Google Scholar

Crossref

PubMed

WorldCat

Romiguier

J

Ranwez

V

Douzery

E

Galtier

N

Contrasting GC-content dynamics across 33 mammalian genomes: relationship with life-history traits and chromosome sizes

Genome Res.

2010

20

1001

1009

Google Scholar

Crossref

PubMed

WorldCat

Rorick

MM

Wagner

GP

Protein structural modularity and robustness are associated with evolvability

Genome Biol Evol.

2011

3

456

Google Scholar

Crossref

PubMed

WorldCat

Schrago

C

Voloch

C

The precision of the hominid timescale estimated by relaxed clock methods

J Evol Biol.

2013

26

4

746

755

Google Scholar

Crossref

PubMed

WorldCat

Scornavacca

C

Berry

V

Lefort

V

Douzery

EJ

Ranwez

V

PhySIC_IST: cleaning source trees to infer more informative supertrees

BMC Bioinformatics

2008

9

1

413

Google Scholar

Crossref

PubMed

WorldCat

Stamatakis

A

RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models

Bioinformatics

2006

22

21

2688

2690

Google Scholar

Crossref

PubMed

WorldCat

Swofford

DL

PAUP*: phylogenetic analysis using parsimony (* and other methods). Version 4.0b10

2003

Sunderland (MA)

Sinauer Associates

Google Scholar

Google Preview

OpenURL Placeholder Text

WorldCat

Tatusov

R

Fedorova

N

Jackson

J

Jacobs

A

Kiryutin

B

Koonin

E

Krylov

D

Mazumder

R

Mekhedov

S

Nikolskaya

A

et al.

The COG database: an updated version includes eukaryotes

BMC Bioinformatics

2003

4

41

Google Scholar

Crossref

PubMed

WorldCat

Wu

C-H

Suchard

MA

Drummond

AJ

Bayesian selection of nucleotide substitution models and their site assignments

Mol Biol Evol.

2013

30

3

669

688

Google Scholar

Crossref

PubMed

WorldCat

Yang

Z

Among-site rate variation and its impact on phylogenetic analyses

Trends Ecol Evol.

1996

11

9

367

372

Google Scholar

Crossref

PubMed

WorldCat

Zhou

X

Xu

S

Zhang

P

Yang

G

Developing a series of conservative anchor markers and their application to phylogenomics of laurasiatherian mammals

Mol Ecol Res.

2011

11

1

134

140

Google Scholar

Crossref

WorldCat

Author notes

Associate editor: Xun Gu

Download all slides

Month:	Total Views:
January 2017	3
February 2017	12
March 2017	23
April 2017	11
May 2017	19
June 2017	11
July 2017	14
August 2017	12
September 2017	10
October 2017	9
November 2017	26
December 2017	33
January 2018	39
February 2018	33
March 2018	50
April 2018	28
May 2018	44
June 2018	37
July 2018	41
August 2018	35
September 2018	28
October 2018	43
November 2018	44
December 2018	40
January 2019	31
February 2019	31
March 2019	46
April 2019	41
May 2019	46
June 2019	43
July 2019	35
August 2019	47
September 2019	47
October 2019	18
November 2019	41
December 2019	22
January 2020	32
February 2020	24
March 2020	26
April 2020	25
May 2020	19
June 2020	30
July 2020	13
August 2020	27
September 2020	27
October 2020	27
November 2020	24
December 2020	30
January 2021	30
February 2021	25
March 2021	16
April 2021	25
May 2021	12
June 2021	17
July 2021	9
August 2021	15
September 2021	14
October 2021	26
November 2021	32
December 2021	16
January 2022	29
February 2022	13
March 2022	23
April 2022	16
May 2022	33
June 2022	27
July 2022	26
August 2022	32
September 2022	30
October 2022	33
November 2022	25
December 2022	20
January 2023	21
February 2023	19
March 2023	14
April 2023	31
May 2023	27
June 2023	6
July 2023	33
August 2023	21
September 2023	8
October 2023	24
November 2023	21
December 2023	28
January 2024	27
February 2024	23
March 2024	18
April 2024	22

Article Contents

OrthoMaM v8: A Database of Orthologous Exons and Coding Sequences for Comparative Genomics in Mammals

Introduction

Database Overview and Improvements

The OrthoMaM Pipeline

Identification of Orthologous Sequences

Alignments and Trees

Database Updates and Scalability

Query Options

Examples of Contributions

Future Prospects

Acknowledgments

References

Author notes

Citations

Views

Altmetric

Email alerts

Email alerts

Citing articles via

Latest

Most Read

Most Cited

Article Contents

OrthoMaM v8: A Database of Orthologous Exons and Coding Sequences for Comparative Genomics in Mammals

Introduction

Database Overview and Improvements

The OrthoMaM Pipeline

Identification of Orthologous Sequences

Alignments and Trees

Database Updates and Scalability

Query Options

Examples of Contributions

Future Prospects

Acknowledgments

References

Author notes

Citations

Views

Altmetric

Email alerts

Email alerts

Citing articles via

Latest

Most Read

Most Cited

This Feature Is Available To Subscribers Only