INTRODUCTION
There has been a long-standing debate about the tempo and mode of trait evolution on the macroevolutionary time scale. The gradualism theory states that evolution occurs gradually by small changes that accumulate over a long period of time (
1). The pulsed evolution theory, on the other hand, argues that evolution mostly proceeds in bursts of larger changes (jumps) separated by long periods of stasis or slow evolution (
1–
3). Two types of jumps have been proposed in pulsed evolution. Simpson’s quantum evolution theory postulates that jumps happen when lineages shift into new adaptive zones and these jumps play an important role in the origination of higher taxa (
2), while Eldredge and Gould’s later punctuated equilibrium theory focuses exclusively on jumps associated with speciation (
1). Conceptually, these two types of jumps exist side by side but differ in their frequencies and magnitudes. Studies of animal fossil records support the punctuated equilibrium theory (
4,
5), and more recent phylogenetic comparative studies of vertebrate body size (
6–
8) also provide evidence for quantum evolution. Together, they show that evolution is composed of not only slow and gradual changes but also instant jumps on the macroevolution time scale.
Analogous studies in bacteria and archaea, the most ancient and diverse forms of life on Earth, are lacking, largely because of the scarcity of fossil records and well-measured quantitative phenotypic traits in microbes. Fortunately, the phenotypic evolution of microbial species can be reconstructed from extant genome sequences. Several genomic features are highly correlated with the microbial life strategy. For example, the GC% (% guanine and cytosine content) of the ribosomal RNA (rRNA) gene is correlated with the optimal growth temperature of bacteria and archaea (
9). According to the genome streamlining theory, genome size, genomic GC%, and nitrogen content of proteins all evolve in response to changes in nutrient levels in the environment (
10,
11). These genomic features can be accurately determined from thousands of complete genomes now available that represent a broad range of closely and distantly related lineages, making it possible to study the tempo and mode of trait evolution in microbes over a broad spectrum of macroevolution time scales.
Long-term experimental evolution has shown evidence of pulsed evolution in
Escherichia coli cell size (
12). However, on the macroevolutionary time scale, the role of pulsed evolution in microbial trait evolution remains largely unknown. Although it is well known that there are large trait changes between bacterial clades (e.g., the genomic GC% of high-GC versus low-GC Gram positives, and AT (adenine-thymine)-rich obligate intracellular bacteria versus their free-living relatives) (
13,
14), it is unclear whether these large trait changes arose gradually or rapidly by jumps during the time the clades diverged from each other. Compared to animals and plants, bacteria and archaea reproduce asexually and have relatively large population sizes, high dispersal rates, and short generation times. Another salient feature unique to microbes is that their genomes can often leap by large-scale horizontal gene transfers (HGTs) (
15), which obviously will have an impact on the tempo and mode of evolution. Given these unique features, a central question is whether the tempo and mode of microbial trait evolution are similar to those of eukaryotes and whether pulsed evolution is a universal theme across the tree of life, and, if so, to what extent pulsed evolution contributes to microbial trait evolution.
RESULTS
Gradual evolution does not explain microbial genomic trait evolution
We downloaded 10,616 and 263 complete bacterial and archaeal genomes, respectively, from the National Center for Biotechnology Information (NCBI) RefSeq database, from which we reconstructed genome trees and selected 6668 and 247 representative genomes that passed quality control (Materials and Methods). For each representative genome, we calculated four genomic traits [genomic GC%, rRNA GC%, genome size, and the average nitrogen atoms per residual side chain (N-ARSC)], all of which showing strong phylogenetic signals (Pagel’s λ > 0.99). Using phylogenetically independent contrast (PIC) of tip pairs, we found that, although the four genomic traits are significantly correlated as previously reported (
10,
16), the correlation appears to be weak (fig. S1), with the proportion of variance explained (PVE) by the other traits being less than 13.5% for all four traits. Therefore, to capture the possible variation in the tempo and mode of evolution, we chose to test each of the four traits separately. Notably, the PIC distributions of these traits in bacteria drastically deviate from the normal distribution expected by the Brownian motion (BM) model of gradual evolution (
Fig. 1, A to D; two-sided Kolmogorov-Smirnov test,
P < 0.001 for all four traits). Specifically, all PIC distributions exhibit a strong leptokurtic (heavy-tailed) pattern with a positive excess kurtosis ranging from 5.79 to 13.47, indicating that extremely rapid large trait changes (pulses) occur more frequently than expected by the BM model. For archaea, the PIC distribution also deviates from the normal expectation (fig. S2, A to D; two-sided Kolmogorov-Smirnov test,
P < 0.001,
P = 0.024,
P = 0.018, and
P = 0.155 for rRNA GC%, genomic GC%, genome size, and N-ARSC, respectively), with the excess kurtosis ranging from 1.47 to 7.58. Although inconsistent with BM, such a heavy-tailed pattern can be explained by pulsed evolution. Extremely rapid large trait changes (|PIC| > 3) take place more frequently than expected by the normal distribution (0.27%) throughout the bacterial evolutionary history (fig. S3), suggesting repeated episodes of pulsed trait evolution.
Modeling microbial genomic trait evolution
When plotted against the branch length, the trait changes between two sister nodes in the bacteria phylogeny display a “blunderbuss pattern” (
Fig. 1, E to H). It starts with a period of stationary fluctuations where trait changes are bounded and the variance does not accumulate over time. Segmented linear regression analysis indicates that this phase of stasis lasts until the branch length reaches 0.001 substitutions per site for rRNA GC% (fig. S4). On longer time scales, the stasis yields to a pattern of increasing divergence over time. The archaeal traits display similar patterns (fig. S2, E to H). This blunderbuss pattern, first observed in the evolution of vertebrate body size, is a signature of pulsed evolution (
6). For rRNA GC%, we observed a second spike in the trait divergence rate at 0.025 substitutions per site (fig. S4), indicating a change of evolution tempo at this point.
To formally test whether pulsed evolution explains the patterns, we model the trait change between two sister nodes using the Levy process (
8). More specifically, we model the trait change as the sum of three independent stochastic variables: pulsed evolution, gradual evolution, and time-independent trait variation. We assume that pulsed evolution occurs at a constant rate relative to the molecular divergence and the jump size follows a normal distribution with a mean of zero. As a result, the pulsed evolution is modeled as a compound Poisson process with normal jumps, with parameters λ and σ
2 denoting the frequency (number of expected jumps per lineage per unit branch length) and the magnitude (variance of trait change) of the jumps, respectively. We model gradual evolution using the classic BM model with a single parameter
denoting the rate of the gradual trait change. Meanwhile, we observed trait variation between genomes separated by zero branch length, indicating the presence of time-independent variation in our phylogeny. Because this variation follows a leptokurtic distribution, we model the time-independent variation with the Laplace distribution with one single parameter ε denoting its variance for simplicity and convenience. It should be noted that a jump in the genomic trait may be coupled with an increase in the molecular divergence rate, especially for those traits affecting protein sequences (e.g., genomic GC% and N-ARSC). However, such correlation between the molecular branch length and the trait change will only reduce the signal of pulsed evolution, as the increased branch length provides greater power for gradual evolution to explain the trait variation (fig. S5).
The changing tempos revealed by segmented linear regression suggest that one Poisson process may not adequately describe the patterns of pulsed evolution, prompting us to add multiple Poisson processes (with different jump magnitudes) to our modeling. Therefore, using the framework described above, we tested seven different models (
Table 1). The BM model delineates gradual evolution with a constant rate, while the VRG (variable rate model with Gamma distribution) model describes gradual evolution with continuous variable rates. The PE1, PE2, and PE3 models describe pulsed evolution with one, two, or three Poisson processes, respectively. The PE(
n) + BM models represent trait evolution with both pulsed and gradual evolution. Details of these models and the maximum likelihood (ML) framework are provided in Supplementary Text. Using simulated data, we show that our ML framework can distinguish the BM, continuous variable rate, and pulsed evolution models (table S1) and capture the frequency and magnitude of jumps (table S2). Among pulsed evolution models, our ML framework can distinguish models with one Poisson process from those with more than one Poisson process, but it favors models with a BM component and tends to underestimate the contribution of pulsed evolution (table S1). This is because it is difficult to distinguish between frequent jumps and gradual evolution on long branches (fig. S5).
Microbial trait evolution is dominated by frequent and rare jumps
For the four traits that we have examined in bacteria and archaea, the best model is always the one with a pulsed evolution component. The relative support for the gradual evolution models is marginal, with Akaike information criterion (AIC) weights for the BM model < 0.5% and those for the VRG model < 2.1% (
Table 2). Both the pulsed evolution and VRG models fit the overall PIC distributions better than the BM (
Fig. 1, A to D, and fig. S2, A to D). The pulsed evolution model also fits the patterns of genomic GC% and rRNA GC% changes at different branch lengths better than the BM and VRG models (
Fig. 1, E and F, and fig. S2, E and F). The strong support by AIC and improved fit to the PIC pattern across different branch lengths suggest that pulsed evolution is present in both bacterial and archaeal genomic traits. To test the prevalence of pulsed evolution in bacteria, we separately fitted our models in 17 bacterial families that each contained at least 100 genomes. We found that trait evolution in 76.5, 88.2, 52.9, and 23.5% of tested families were best explained by a model with a pulsed evolution component (PE1, PE1 + BM, PE2, PE2 + BM, or PE3) for rRNA GC%, genomic GC%, genome size, and N-ARSC, respectively, indicating that pulsed evolution is prevalent in bacteria (table S3). To examine the effect of sample size on the power of detecting pulsed evolution, we simulated trait evolution under the pulsed evolution model and conducted model selection on the simulated data. Our simulation shows that, when the number of genomes decreases, the power to detect pulsed evolution also decreases (table S4), suggesting that we might have underestimated the prevalence of pulsed evolution in the 17 bacterial families. We did not test the prevalence of pulsed evolution in archaea because of the insufficient number of archaeal genomes at the lower taxonomic levels.
Using parameters of the best models (
Table 3), we estimated the relative contribution of each compound Poisson process. The variable ε represents the trait variance in the initial stasis phase (
Fig. 1). Its estimated value approximates the intraspecific trait variation between genomes with identical marker gene alignments (i.e., zero branch length) and therefore is used as the baseline. The jumps vary greatly in their frequencies and magnitudes but can be roughly classified into two types: small and frequent, or large and rare (
Table 3). For example, for rRNA GC%, rare jumps (1.96 jumps per lineage per unit branch length) are extremely large in magnitude, as the SD of trait change introduced by one rare jump is approximately 60 times of
, or roughly equivalent to that introduced by 700 million years (0.35 substitutions per site) of gradual evolution under the BM model, and approximately corresponds to 5.8°C change in the optimal growth temperature. In comparison, the frequent jumps (118 jumps per lineage per unit branch length) are 60 times more frequent, but their sizes are only about three times of
. In terms of the absolute change in the trait value, a rare jump between two closely related genomes can cause an rRNA stem GC% change of up to 1.8%, while under the gradual BM model, the expected change in rRNA stem GC% will only be about 0.1%. Overall, rare jumps predominate in trait evolution as they contribute more than 74% of variation in each trait over the whole phylogeny. Similarly, pulsed evolution also predominates in archaea as the PE1 and PE2 models are the best models in all archaeal traits (
Table 2). Because of the limited number of archaeal genomes, we cannot robustly estimate the parameters of each jump process in archaea.
To evaluate the effect of the tree topology on our model fitting, we fitted models on the genome tree of the family Enterobacteriaceae (with 748 genomes) made using either FastTree or RAxML. We found that the fitted model parameters are highly similar using these two trees (table S5). We also validated our fitted model with the bacterial phylogeny downloaded from Genome Taxonomy Database (GTDB) (
17) and found that pulsed evolution models with at least two Poisson processes are also the best models (table S6). Because all the models tested in this study are time reversible, rooting of the phylogeny at different points does not affect the results.
Rare jumps are correlated between traits and with cladogenesis in bacteria
We use simulation to test whether we can detect specific jumps along the phylogeny using the posterior probability estimated under pulsed evolution models. Because pulsed evolution models with at least two Poisson processes are the best models for bacterial genomic trait evolution, we simulated trait evolution under the PE2 model. We found that posterior probability calculated under the pulsed evolution model can predict both frequent and rare jumps very well, with the receiver operating characteristic (ROC) curves having an average area under curve (AUC) of 0.97 for both frequent and rare jumps (fig. S6). Using a posterior probability cutoff of 0.9, we can achieve a specificity of 0.99 for detecting both frequent and rare jumps. In addition, we can detect rare jumps on short branches (defined as branches with a prior probability of having at least one jump < 0.1) with a sensitivity of 0.6. As a comparison, we also tested the performance of BayesTraits, which assumes the continuous variable rate model VRG. BayesTraits failed to accurately capture the frequent and rare jumps, yielding an average AUC below 0.70 (fig. S6). This shows that the continuous variable rate model is inadequate to capture the mode of pulsed evolution.
Using a posterior probability cutoff of 0.9, we mapped the rare jumps of the genomic traits onto the bacterial phylogeny. We found that jumps occurred throughout the phylogeny (
Fig. 2), again indicating that pulsed evolution is prevalent in bacterial evolution history. We detect recent rare jumps in genome size that happen on short branches separating recently diverged species. These jumps correspond to events of very recent genome reduction and expansion (table S7). We also detect rare jumps in more ancient branches. Some of the rare jumps are associated with known key evolutionary adaptations, validating our predictions. For instance, a classic example of adaptation to endosymbiosis occurred within the family Enterobacteriaceae, in the lineage leading to a clade of insect endosymbionts that includes the genera
Buchnera,
Wigglesworthia, and
Candidatus Blochmannia (
18). Our model detects large rare jumps at the base of the clade in the genome size and genomic GC% (posterior probability > 0.9;
Fig. 2). The recently described order
Candidatus Nanopelagicales within the phylum Actinobacteria makes up the most abundant free-living bacteria in fresh water. Nanopelagicales have adapted to live in the nutrient-poor environment by streamlining their genomes (
19). Compared to its high-GC Gram-positive sister clade, Nanopelagicales has markedly reduced genome size (~1.4 Mbp) and genomic GC% (~48%). We detect large rare jumps in all genomic traits at the base of the order with extremely high confidence (posterior probability >0.99), suggesting that the genomic streamlining process happened not gradually but by jumps. Similar patterns have also been observed in the branch leading to the most abundant free-living marine bacteria Pelagibacterales and the intracellular bacteria Rickettsiales and Holosporales. Our model also predicts large rare jumps at higher taxonomic levels such as those at the base branch leading to the α-, β-, γ-, and δ-proteobacteria (posterior probability > 0.96) and the branch that separates the γ-proteobacteria from the rest of the proteobacteria (posterior probability > 0.99). Our results suggest that these key evolutionary adaptations evolved in rapid bursts instead of through slow divergence of species over prolonged periods of time as proposed by the gradual evolution model.
We tested the pairwise correlation of rare jumps’ occurrence between traits using the posterior probability of jumps. We find that rare jumps’ occurrence is significantly positively correlated between all pairs of traits (P < 0.001; table S8), except between genome size and N-ARSC (P = 0.060; table S8). Because frequent jumps are saturated in most part of the phylogeny, we do not have enough power to test the correlation of frequent jumps’ occurrence between traits.
Next, we tested whether jumps are correlated with cladogenesis in bacteria by comparing the predicted frequency of jumps to the expected frequency, for which we assume no correlation of jumps with cladogenesis (the null hypothesis). For example, if jumps happen significantly more frequently between two congener sister nodes than expected, then jumps are considered correlated with the speciation event (cladogenesis at the species level). For frequent jumps, simulations indicated that we lacked the statistical power to reject the null hypothesis at every taxonomic level, and therefore, we excluded them from this analysis. For rare and super rare jumps, we tested their correlation with cladogenesis from the species to order levels. We found that rare and super rare jumps occur more frequently than expected for all traits at the genus, family, and order levels, except for N-ARSC at the genus level (
Table 4). This increase in frequency is significant for rRNA GC%, genomic GC%, and genome size at the genus and family levels (
P ≤ 0.050) and for rRNA and genomic GC% at the order level (
P < 0.001). We found that rare and super rare jumps happen less frequently than expected for all traits at the species level, although it is significant only for ribosomal GC% (
P < 0.001). Our results suggest that rare and super rare jumps are correlated with cladogenesis at higher taxonomic ranks.
Jumps in bacterial traits can be mediated by HGT
HGT is one of the most important processes in bacterial and archaeal evolution. To determine whether the jumps identified in this study could be due to HGT, we manually examined the top candidates of rare jumps in rRNA GC% (posterior probability > 0.9 and prior probability < 0.1) and found clear evidence of HGT. For example, we detected a rare, large jump between two strains of obligate endosymbiont of whiteflies Candidatus Portiera aleyrodidarum. Figure S7 shows that strain China 1 acquired nearly one-third of its 16S and 23S rRNA genes from the lineage of Candidatus Hamiltonella defensa, which is also an endosymbiont in whiteflies but belongs to a different order. GC% of the acquired 16S rRNA fragment increased from 46 to 52%, while GC% of the acquired 23S rRNA fragments increased from 44 to 51%, leading to a jump in the rRNA stem GC%.
DISCUSSION
The central question that we try to address in this study is the tempo and mode of microbial trait evolution: whether the traits evolve mainly by gradual or pulsed evolution. Large trait differences between bacterial lineages are well known (
13,
14), but it is less clear whether these large trait changes arose gradually over time or rapidly by jumps (the mode). Using an ML framework, we explicitly test the mode of genomic trait evolution in bacteria and archaea, and we show that pulsed evolution explains the patterns significantly better than both the constant and variable rate gradual evolution models. Our analysis suggests that pulsed evolution is not only present but also prevalent and dominant in microbial genomic trait evolution.
Microbes are known for rapid evolution. Why are these genomic traits constrained for millions of years before they diverge? The stasis at the species level can be explained by stabilizing selection that eliminates variants falling outside of a stable niche (
20). Alternatively, it can be maintained by gene flow, as suggested by Futuyma’s ephemeral divergence theory (
21). Futuyma proposes that novel adaptive trait variation arises frequently in local populations, but the spatial and temporal mosaic nature of niches prevents such local adaptations from spreading to the entire species because they are wiped out by the gene flow from the prevailing intervening ancestral populations. As a result, trait changes perish and do not accumulate over time, resulting in stationary fluctuations, until speciation interrupts the gene flow. Although reproducing asexually, microbes do exchange genes through homologous recombination, and there is evidence that gene flow plays a critical role in bacterial speciation at least under certain conditions (
22–
24). The transient trait variation in the initial stasis phase when jumps are absent (the ε term in our model) approximately matches the intraspecific trait variation.
At longer time scales or higher taxonomic levels, trait evolution can be constrained through stabilizing selection exerted by the adaptive zone (
25), defined as a set of ecological niches to which a group of species is adapted (
2). This will generate the pattern of phylogenetic conservatism where organisms in a clade tend to have similar traits (synapomorphy) and occupy similar habitats. Accordingly, both genome analyses and ecological studies support that ecological coherence exists at higher taxonomic levels in bacteria (
26). For example, different bacterial clades have their unique set of genes (
27), and analysis of thousands of cultured microbial strains showed that strains related at the genus, family, or order level occupy the same habitat more frequently than expected by chance (
28).
Using simulation, we show that our ML framework can detect jumps with extremely high specificity (0.99) when a posterior probability cutoff of 0.9 is used to predict jumps. On the other hand, the continuous variable rate model struggles to capture the mode of pulsed evolution and performs poorly when benchmarked using the ROC curve. We detected two types of jumps in one dataset: small frequent jumps and large rare jumps. This is possible because the large bacterial dataset spans a wide range of macroevolutionary time scales. For example, the bacterial genome tree in our study has a total branch length of 442.9 substitutions per site. For super rare jumps (e.g., genome size jumps with a rate of 0.17 jump per lineage per unit branch length), it is estimated that there are still 75 events in the entire phylogeny. On the other hand, the resolution of our bacterial genome tree is 5 × 10
−5 substitutions per site, meaning that we can detect jumps that occur as frequently as 20,000 jumps per lineage per unit branch length on average. The large difference in the frequency and size of the jumps suggests that they represent different kinds of evolutionary events. Although our modeling does not stipulate the coupling of cladogenesis and pulsed evolution (as in the classical punctuated equilibrium theory), the rate of the frequent jumps in bacteria (115 to 634 jumps per lineage per unit branch length or 0.06 to 0.32 jumps per lineage per million years) approximates the recently estimated bacterial speciation rate (0.03 to 0.05 speciation per lineage per million years) where species is defined as having 99% identical 16
S rRNAs (
29), suggesting that frequent jumps and speciation events may be correlated. Two features of the rare jumps fit the description of quantum evolution. First, the rare jumps are fairly large in magnitude, most likely resulting from shifting between major adaptive zones. Second, our test shows that rare jumps happen less frequently than expected at the species level but significantly more frequently than expected at higher taxonomic levels (genus, family, and order), suggesting that there is a correlation between rare jumps and the origination of higher taxa. Furthermore, some of the predicted rare jumps coincide with known major evolutionary adaptations in bacterial evolution history. A key insight from this observation is that major evolutionary adaptations in bacteria and the origination of major bacterial lineages may happen in quick bursts (quantum evolution) instead of through slow divergence of species over prolonged periods of time (gradual evolution) (
30), which is consistent with earlier findings of rapid expansion of major microbial lineages (
31,
32). We found that rare jumps of the four traits tend to happen together.
Microbial genomes are highly dynamic (
33,
34). They can change by mutation, gene loss, gene duplication, and HGT. Whatever the mechanism is, our study suggests that large changes happen in episodes of bursts rather than gradually and slowly. These large changes are not due to the simple gain and loss of plasmids, as we have excluded plasmids in our study. Chromosomes are in constant exchange with phages, plasmids, and other mobile elements and can change by “quantum leaps” in the form of genomic islands (
15). It is also well known that rRNA genes can be horizontally transferred (
35), which we show in this study can result in an instant jump in rRNA GC%. In contrast to animals and plants, HGT plays an important role in the evolution of bacteria and archaea, therefore providing an additional avenue for microbial pulsed evolution. It is worth pointing out that jumps in our model represent trait changes that persist over time, not the processes that drive the changes. The rarity of detected jumps does not mean that the evolutionary processes (e.g., selection and population bottleneck) that drive the jumps are rare. It merely means that the success rate of such jumps is low. The rarity of jumps can result from adaptation to a large environmental shift that happens infrequently, or it can be a manifestation of multiple frequent small jumps occurring in quick succession, which is also rare.
In conclusion, our modeling of phylogenetic comparative data shows that pulsed evolution is both prevalent and dominant in bacterial and archaeal genomic traits evolution. The signatures of pulsed evolution detected in this study are consistent with both the punctuated equilibrium and quantum evolution theories. More broadly, our results suggest that pulsed evolution is the rule rather than the exception across the tree of life, despite the drastically different population genetic properties of micro- and macroorganisms.
Acknowledgment
Funding: The authors acknowledge that they received no funding in support of this research.
Author contributions: Y.G. and M.W. conceived the study, designed the analyses, and wrote the manuscript. Y.G. wrote the codes and performed the data analysis.
Competing interests: The authors declare that they have no competing interests.
Data and materials availability: All data and scripts needed to evaluate the conclusions in the paper are present in the paper and/or the Supplementary Materials.