Why are some traits and trait combinations exceptionally common across the tree of life, whereas others are vanishingly rare? The distribution of trait diversity across a clade at any time depends on the ancestral state of the clade, the rate at which new phenotypes evolve, the differences in speciation and extinction rates across lineages, and whether an equilibrium has been reached. Here we examine the role of transition rates, differential diversification (speciation minus extinction) and non-equilibrium dynamics on the evolutionary history of angiosperms, a clade well known for the abundance of some trait combinations and the rarity of others. Our analysis reveals that three character states (corolla present, bilateral symmetry, reduced stamen number) act synergistically as a key innovation, doubling diversification rates for lineages in which this combination occurs. However, this combination is currently less common than predicted at equilibrium because the individual characters evolve infrequently. Simulations suggest that angiosperms will remain far from the equilibrium frequencies of character states well into the future. Such non-equilibrium dynamics may be common when major innovations evolve rarely, allowing lineages with ancestral forms to persist, and even outnumber those with diversification-enhancing states, for tens of millions of years.

1. Introduction

Disparity in the numbers of taxa with different traits is a hallmark of biodiversity. Explanations for the distribution of phenotypes across the Tree of Life variously emphasize evolutionary processes acting both above and below the species level [1]. For example, as a result of intraspecific processes such as mutation, selection and genetic drift, some traits may be gained more often than lost, and the ensuing directionality in transitions should lead to more taxa having those traits [2–4]. Above the species level, differential speciation and/or extinction across lineages can also shape the distribution of phenotypic diversity [5,6]. For instance, traits that act as key innovations are expected to become common across the tree [7], whereas those that act as evolutionary dead-ends will continually be pruned from the tree [8]. Together, these long-term rates of character change, speciation and extinction establish an expected equilibrium frequency of different phenotypes [9]. However, because the effects of these underlying processes accumulate gradually over long evolutionary timescales, non-equilibrium dynamics may also be important. For instance, even traits leading to rapid lineage diversification may be rare in a clade if they evolved only recently. Despite the growth of comparative methods for the analysis of trait evolution and diversification, the extent of non-equilibrium dynamics in natural systems remains largely unexplored [10].

Here we examine the interplay of trait transition rates, differential diversification and non-equilibrium dynamics in the history of angiosperms. This approximately 140 million year old clade [11] comprises at least 250 000 species known for both their amazing variety of forms and the markedly uneven distribution of this phenotypic variation across taxa. For example, some floral morphologies appear in many families and species, whereas other forms characterize only a few taxa [12]. Stebbins [13] attributed the abundance of taxa with particular floral phenotypes to directional transitions generated by natural selection, and moreover, he suggested that variation in functional interactions among traits, rather than in the traits themselves, leads to differences in fitness. However, a skewed distribution of floral forms with some common and others rare could also arise from differential diversification, for example, if certain forms promote speciation more than others. This differential diversification could be linked to individual floral traits, e.g. floral symmetry [14], or, if functional interactions are key, to combinations of characters, e.g. stamen number and symmetry together [15]. Distinguishing between these possible explanations for the distribution of floral morphologies across angiosperms requires simultaneous estimation of lineage diversification and trait transitions [2,9]. Although previous studies have examined the patterns of trait evolution across the angiosperms [4,14,16,17], none have quantified the relative importance of directional trait transitions and differential diversification in the present-day distribution of floral traits and of trait combinations, or assessed whether this distribution represents an evolutionary equilibrium.

We focus on the evolution of a suite of six floral traits, including aspects of merosity, fusion and symmetry, that were originally considered by Stebbins [13] in his angiosperm-wide analysis. These features vary dramatically across taxa [18,19] and are strongly associated with floral function [20,21]. Given their role in interactions with pollinators, these traits are probably under strong selection, with adaptive changes spreading to fixation whenever they arise. In addition, differences in the interaction between flowers and pollinators may lead to reproductive isolation and diversification [22,23]. Plant–pollinator interactions often generate selection for suites of floral traits [24], supporting Stebbins' notion that trait combinations are key to understanding plant adaptation and speciation. Collectively, these prior studies suggest that directional transitions and differential diversification acting on interacting suites of floral traits may have played a significant role during the angiosperm radiation. To test this question, we amassed trait data and phylogenetic information for a large and random sample of angiosperms and developed scripts for model fitting with multiple characters and character combinations with existing software. With this approach, we consider to what extent the present-day frequency of the six target traits is owing to directional transitions and differential diversification. We also ask if directionality and differential diversification hinge on single character states or the states of multiple characters. Finally, using stochastic simulations based on these results, we examine whether the distribution of angiosperm diversity has reached equilibrium, i.e. whether the frequencies of taxa with different floral trait combinations match predictions based on rates of trait transitions and species diversification. While evolutionary lags are predicted to shape the course of adaptive radiations [10], it is unknown if such non-equilibrium dynamics are important on such a deep evolutionary timescale.

2. Methods

(a) Phylogenetic inference

Five hundred angiosperm species were selected from GenBank for inclusion in the dataset. Available software for joint estimation of transition rates and state-dependent diversification, Diversitree [25], allows for incomplete taxon sampling (i.e. not all extant species present in the phylogeny), but the subsampling is assumed to be random across clades (i.e. clades are sampled in proportion to their species richness, though unequal sampling based on traits can be incorporated) [26]. We thus implemented a stratified two-step procedure to create a random sample of species that satisfies this criterion. The stratified random sampling procedure first calculated the expected number of species sampled from each angiosperm family given the size of the family; larger families would accordingly be represented by more species. If the expected number was less than one (families of fewer than 500 species), we used dynamic rounding to convert the fraction to an integer. That is, if a random variate between 0 and 1 was larger than the expected fraction, one species was chosen, and otherwise, none was selected. Second, among the species in GenBank with sufficient sequence data (minimally ITS), we randomly selected the expected number of species from each family. For example, a family represented by 1% of all extant angiosperms would comprise 1% (or 5 species) of our 500 species dataset, and we randomly chose five species from the family among those on GenBank that minimally had ITS sequences. We compiled sequences for six other loci in addition to ITS (rbcL, matK, 28S, 18S, trnK, atpB) and built our final supermatrix with the PHLAWD pipeline [27]. We chose a set of 28 additional angiosperms (e.g. Amborella, Austrobaileya) for localizing fossil calibrations (electronic supplementary material, table S1), and we selected 44 gymnosperms to serve as outgroups. We inferred the phylogeny in RAxML v. 7.2.3 [28] using a GTR + gamma model of sequence evolution, partitioned by gene, and a family-level constraint tree based on well-established relationships (http://www.mobot.org/MOBOT/research/APweb/, accessed on 8 January 2009). We employed a constraint tree, because the goal of this analysis was not to re-infer angiosperm relationships [29–31], but to obtain relative branch length estimates for the sampled tips. The RAxML tree was time-calibrated using penalized likelihood in TreePL [32] with 40 fossil calibration points (electronic supplementary material, table S1). The outgroups and additional angiosperms were pruned from the tree before analysis to maintain the random sampling design.

(b) Character selection and scoring

Based on Stebbins' 1951 analysis of angiosperm diversity [13], we targeted six binary floral characters: corolla (petals) presence, fusion of the perianth (calyx and corolla or tepals), flower symmetry, stamen number relative to perianth parts, carpel fusion and ovary position. Changes in perianth morphology and relative stamen number alter interactions with pollinators and thereby affect the accuracy and efficiency of pollen transfer [20,21,33,34]. Carpel fusion influences the intensity of competition among pollen tubes for access to ovules and allows more even distribution of pollen tubes among carpels, which can enhance offspring quality and quantity [35,36]. Finally, inferior ovaries may protect ovules from probing insects [37] and facilitate coevolution with ovipositing insects [38] relative to superior ovaries. We scored all 500 species in the phylogeny for the six characters based on floras, monographs and species descriptions (e.g. [39,40], see the electronic supplementary material, table S2 for more detail). The final dataset (phylogeny and character matrix; Dryad http://dx.doi.org/10.5061/dryad.486c0) was reduced to 464 angiosperm species, because some states could not be scored for some taxa (e.g. perianth symmetry cannot be scored for species without a perianth).

(c) Joint analyses of transition and diversification rates

We employed SSE methods [9,26] for jointly estimating transition rates among character states, character state combinations and diversification rates within those combinations. Multistate SSE methods (MuSSE [25]) can handle a single multistate character (with states 0, 1, 2, etc.) or a set of binary characters. The latter can be done by converting the set of binary characters to a single multistate character. For example, for a set of three characters, state combination 000 becomes state 0, combination 001 becomes state 1, 010 becomes state 2 and so forth, giving eight states in total (figure 1). In our case, with six binary floral characters, there are 2⁶ (or 64) possible combinations, which can be coded as a single character with 64 states. The full MuSSE for a 64-state character would have 512 free parameters (384 transition rates, 64 speciation rates and 64 extinction rates), which would be challenging to estimate, even with a complete sampling of all angiosperms. Therefore, we created a set of scripts to fit a series of simplified models that essentially group the states (and thus the rates) while still allowing us to test hypotheses about directional transition and/or differential diversification (see full workflow in the associated Dryad package: http://dx.doi.org/10.5061/dryad.486c0). As an example, returning to the three character network in figure 1, we could group the states into two sets (000 + 001 versus the rest) and ask how having a zero for the first two character states affects diversification rates. We would estimate just two pairs of state-dependent speciation and extinction rates (one for the A set and one for B set) instead of the eight required for the full model (not shown in figure 1). We also need to estimate transition rates, but could assume that transitions within each set has a single rate (q_AA or q_BB), and there is just one rate for moving between the sets (q_AB; figure 1). Thus, we have three rate parameters to estimate instead of the 12 in the original full model. We could relax these constraints in order to test for directional transitions, i.e. allow transitions from A into B to occur at a different rate than B into A, which would add one additional parameter. This approach of grouping states into sets results in a tremendous reduction in the number of model parameters as the state space grows. Even with the most complex model, having directional transitions between A and B and different rates of diversification in A and B, we have only eight parameters to estimate (two speciation rates, two extinction rates, four transition rates) compared with the 512 in the full model.

Figure 1. Simplification of MuSSE model by partitioning into sets. The diagram depicts a network of eight states, which are combinations of three binary (0/1) characters. Thus, ‘000’ is the combination with state 0 for each of the three characters. Each combination is connected to three other combinations that differ by just one state. A full MuSSE analysis would involve estimating transition rates (black arrows) along each of the edges of this network as well as speciation and extinction rates (not shown) for each combination. We simplified this network by dividing it into two sets (e.g. the black ‘A’ set and the grey ‘B’ set in the middle). We can then consider rates of transition within the sets (q_AA, q_BB) and between the sets (here, q_AB = q_BA). This can be represented as a matrix (right), and if we assume a single rate within each set plus a rate of moving between sets, the model includes just three transition rate parameters. We could also assume a single speciation and an extinction rate for each set (not shown), which would bring the model to seven parameters. These assumptions can be relaxed to create additional more complex models, but which are still less parameter-rich than the full model.

In order to test for directional transitions and differential diversification associated with each character state and character state combination, we conducted this subsetting procedure across all possible bipartitions (division into two sets) of the network of 64 combinations. For individual characters, this division split the network into two equal halves, namely all the combinations with a zero for the character and all of those with a one (0***** versus 1***** where the * may be any state for any of the other characters). Bipartitions can also be as small as a single combination versus the rest (e.g. 000111 versus all other combinations). Through this procedure, then, we examine all characters individually and in all possible combinations with the other characters. For each of these bipartitions, we then tested a set of models that varied in constraints on transition and diversification rates. In total, we considered 30 models, all combinations of five transition rate models (electronic supplementary material, table S3) and six diversification models (electronic supplementary material, table S4), for each bipartition. We used maximum-likelihood MuSSE functions implemented in the R package Diversitree [25] to estimate diversification and transition rates for each model, to calculate the Akaike information criterion (AIC) scores, and to infer ancestral states across the phylogeny. Rather than assume an equilibrium root state or otherwise infer it, we assumed that lineages began with the ancestral states for each character (electronic supplementary material, table S2), which follows the coding by Stebbins' [13] and more recent comparative studies [41,42]. The version of Diversitree used allows specification of sampling by state: absent other information, we assumed the sampling within each state was the same (464 out of 250 000 angiosperm species). Note that in grouping states into sets, we did not alter the underlying data (the six characters), so likelihoods are comparable between all models and bipartitions.

(d) Stochastic simulations

We conducted two sets of simulation analyses to interrogate the findings from maximum-likelihood model fitting. The first set of simulations assessed how the level of sampling (464 of an estimated 250 000 angiosperms) might affect the reliability of our conclusions. To do this, we conducted parametric bootstrapping under the inferred model. We used model-averaged values for speciation, extinction and transition rates, weighted by the AIC scores of the models, from the MuSSE analysis. We simulated phylogenies under this model using the SimulateHisse function in the R package HiSSE [43], starting with a single lineage and proceeding for 136 million years (the approximate crown group age of angiosperms based on recent analyses and pollen fossils [44,45]). At every instant of time, any species could speciate, go extinct or change state. This procedure made trees of thousands to hundreds of thousands of species, which we then randomly pruned to 464 species. We re-estimated the 30 diversification and transition models that were fitted to the original dataset, focusing on a set of combinations that were found to have higher diversification in our MuSSE analysis. Finally, we compared the model-averaged rates from these simulated replicates with those estimated from the original data.

The second set of stochastic simulations estimated the future approach towards an equilibrium distribution of trait combination frequencies based on the inferred transition and diversification rates. The SSE methods do not necessarily assume equilibrium, and thus the observed distribution of current states may differ from that predicted under the best model at equilibrium [8,46]. We again used the model-averaged rates from the original analysis and simulated evolution for 151 million years from the original ancestral state (i.e. 15 million years into the future). In these simulations, we tracked the proportion of species with each trait combination at 1 million year intervals, but not the tree structure itself, unlike the first simulation (this makes the computation far more efficient). To test for non-equilibrium, we compared the proportions of taxa with each combination of the three key characters (corolla presence, floral symmetry and stamen number; see Results) across the simulations to the proportions expected at equilibrium. We calculated these equilibrium values from the model-averaged diversification and transition rates following Maddison et al. [9] using stationary.freq.classe() in Diversitree. The scripts for running both of these sets of simulations are available as part of the Dryad package containing the dataset.

3. Results

(a) Distribution of trait combinations

Our sample of 464 species was spread across 134 angiosperm families. Across these species, the character–state combinations exhibited a wide range of frequencies (electronic supplementary material, figure S1). Thirty-one, or 48%, of the 64 possible combinations are not present in any sampled species, suggesting that much of phenotype space is unoccupied. Within the occupied space, over half of all sampled species exhibit the five most common combinations, which all share the presence of a corolla, few stamens, and fused carpels. This skewed distribution of character combinations contrasts sharply with the distribution obtained by randomly sampling the combination space (grey lines, electronic supplementary material, figure S1). The concentration of angiosperm diversity in certain regions of floral phenotype space has been considered qualitatively and even quantitatively in previous studies [12,13], but this is, to our knowledge, the first demonstration based on a taxonomically stratified random sample across the angiosperms at the species level. We expect that the remaining sampling bias is due to which species within each family are found in GenBank and available for scoring.

(b) Differential transition and diversification rates across character combinations

We examined a large number of models to identify character–state combinations associated with directional transitions and/or differential diversification. Given the 728 (or 3⁶ – 1) possible bipartitions of the 64-state network into sets of character–state combinations and the 30 diversification/transition models, the total possible number of models to examine is 21 840. However, many character–state combinations are not represented by any species, and thus could not constitute a viable set, leaving 19 657 models. Despite this large model space, over 99% of the Akaike weight was concentrated in just 10 models. All 10 of these models incorporate differential diversification and include one or more of the three character states: corolla present, bilateral symmetry and reduced stamen number (table 1). According to the best-fitting model (lowest AIC score), lineages with this combination of characters diversified roughly twice as fast as lineages with any other character–state combination (table 1, compare r_F with r_N). In contrast to the diversification rates, which varied based on floral state combinations, we found little evidence for strongly asymmetric trait transition rates that could lead to biases in state frequencies. The best-fitting model for the set comprising bilaterally symmetrical corollas with reduced stamen number included equal transition rates between the sets (table 1). Furthermore, models with differential transition rates accounted for only 29.8% of the AIC weight across all of the analyses. Thus, differences in gain and loss rates (directional transitions) appear to have had a limited influence on the overall distribution of trait combinations across angiosperms.

Table 1.Top 10 models (ranked by Akaike weight) from the maximum-likelihood analysis of focal areas. (The focal area is described by a string in which 0 and 1 indicate the ancestral or derived states, respectively, for each of the six characters (electronic supplementary material, table S1) and ‘asterisk’ indicates that the state for that character could be either 0 or 1. The only characters identified by the models as part of focal area are corolla presence (character 1: 0, present; 1 ,absent), floral symmetry (character 3: 0, radial; 1, bilateral) and stamen number (character 4: 0, more than twice as many as merosity; 1, equal or fewer). For example, the focal area in the top model (first row, 0*11**) comprises all taxa with corolla present, bilateral symmetry and few stamens, regardless of the states of the other three characters. Model parameters include transition rates within and between focal (F) and non-focal (N) areas (q_FF, q_NN, q_FN, q_NF) and diversification parameters (r_F, r_N). Complete description of models for transitions and diversification are presented in tables S3 and S4).)

Collapse
Akaike weight	cumulative Akaike weight	focal	diversification	r_F	r_N	transitions	q_FN	q_NF	q_NN	q_FF
0.274	0.274	011*	free	0.064	0.038	equal	0.001	0.001	0.001	0.001
0.245	0.519	1*	free	0.057	0.037	free	0.003	0.001	0.002	0.000
0.202	0.721	1*	free	0.052	0.036	equal	0.001	0.001	0.001	0.001
0.128	0.849	11	free	0.059	0.038	equal	0.001	0.001	0.001	0.001
0.090	0.939	01**	free	0.058	0.038	equal	0.001	0.001	0.001	0.001
0.024	0.963	1*	free	0.055	0.034	outflow	0.003	0.001	0.001	0.001
0.014	0.977	011*	free	0.060	0.035	outflow	0.001	0.001	0.001	0.001
0.009	0.985	1*	two/one	0.068	0.030	free	0.005	0.001	0.002	0.000
0.003	0.988	01**	free	0.059	0.035	outflow	0.002	0.001	0.001	0.001
0.003	0.991	011*	free	0.050	0.037	free	0.001	0.003	0.002	0.000

Despite the weak direct effect of differences in gain and loss rates on the frequencies of trait combinations, the low overall magnitude of all transition rates compared with diversification rates (table 1) could profoundly affect the distribution of trait combinations through non-equilibrium dynamics. Given the estimated rates, we calculate that, at equilibrium, 90% of angiosperms should possess the key set of states associated with higher diversification, namely bilaterally symmetrical corollas and few stamens. By contrast, only 38% of the species in our sample belong to that set. This marked difference between the equilibrium expectation and the observed frequency could arise because rare transitions between character states limit the potential for lineages to assemble the three elements of the key combination. Comparison of the transition rates and diversification rates (table 1) is consistent with this explanation. In the best-fitting model, the diversification rate of lineages with the ancestral state is 38 times higher than the rate of transitions away from that state, suggesting an average of 38 net diversification (speciation minus extinction) events for every transition event. The ratios are similar for other character state combinations. These low transition rates relative to diversification rates are also consistent with large clades sharing the same state (figure 2a) and with a higher number of species retaining the ancestral state than predicted at equilibrium (corolla present, radial symmetry, many stamens; figure 2b).

Figure 2. Non-equilibrium dynamics and the evolution of the floral trait combinations. (a) Maximum-likelihood reconstructions of the history of the three key character combination using model-averaged rates from the MuSSE analysis. Repeated origins of the key combination are shown in red. All other combinations are shown in black or grey lines, with the latter indicating uncertainty in the reconstruction of the subtending node. (b) The proportion of taxa with each possible combination of states for the three characters that comprise the key combination (corolla presence/absence, symmetry, and relative stamen number) in the observed (empirical) dataset (grey), in stochastic simulations to the present (blue), in simulations extending 15 Myr into the future (green), and at equilibrium (black). Ancestral states are shown in black and derived states in purple (see also the electronic supplementary material, figure S1), and the key combination is highlighted in yellow. (c) Mean (±95% CI) time of appearance of each of the eight trait combinations defined in (b) during the course of simulations. Flower cartoons show states of perianth (white, corolla; green, calyx), symmetry and stamen number.

(c) Stochastic simulations of angiosperm evolution

Our simulations illustrate that the preceding conclusions about directional transitions and state-dependent diversification are probably not an artefact of sampling intensity. Using model-averaged rates from the likelihood model fitting, we completed 136 million year simulations for 22 of the 50 lineages that we initiated. The resulting 22 trees contained from 2673 to 186 550 species (owing to stochasticity), which were randomly subsampled to 464 species (the size of our original dataset) before model fitting. As in the original dataset, the simulations supported elevated diversification for lineages with the key set of states (bilaterally symmetric corollas with reduced stamens, electronic supplementary material, figure S2). For the three key characters, diversification rates were, on average, 30% higher in the set with bilaterally symmetric corollas with reduced stamens than in the other set across all of the simulated datasets (electronic supplementary material, table S5). Moreover, the original rates fell within the range of the simulations for both sets of diversification and transition rate parameters (electronic supplementary material, figure S2). Thus, detection of elevated diversification in lineages with bilaterally symmetric corollas and reduced stamens is robust to the sampling intensity employed in our analysis.

An independent set of stochastic simulations explored the consequences of relatively low transition rates for non-equilibrium dynamics. Estimates of the equilibrium frequency across trait combinations predict that most angiosperm species should eventually possess bilaterally symmetric corollas with reduced stamens, compared with the 38% at present. Our stochastic simulations showed an even lower proportion of species in this set after 136 million years of evolution (7%, 95% confidence interval (CI) 0–35%: figure 2b). The wide CIs around the mean from these simulations reflects the stochastic and contingent nature of evolutionary processes [47]; a variety of outcomes are possible even when the starting point and parameters are the same. Despite the range of outcomes from the simulations, none of the simulations produced the high proportion of species with the key set expected at equilibrium (90%). At the same time, the proportion of species with the ancestral state (radially symmetric corollas with many stamens) is much higher in the simulations (53%, 95% CI: 3–84%) than predicted at equilibrium (0.2%), consistent with a long delay in evolving away from the ancestral state. This gap between the equilibrium expectations and the simulated proportions (as well as the observed proportions) supports the hypothesis that low transition rates relative to diversification rates have limited the rate of approach to equilibrium. Continuation of the simulations for an additional 15 million years to extrapolate future angiosperm evolution towards the equilibrium raised the frequency of lineages with the key set of character states to a mean of 10% (95% CI: 0–46%), but it still remained far below the expected equilibrium.

4. Discussion

Given the critical role of flowers in sexual reproduction, their morphology should be subject to strong selection [23,24]. Indeed, Stebbins [13] largely attributed the highly skewed pattern of floral trait combinations across the angiosperms (electronic supplementary material, figure S1) to strong natural selection leading to the fixation of particular combinations of characters. Over time, these directional transitions towards adaptive combinations would lead to the prevalence of some phenotypes and the rarity or complete absence of others. He recognized that genetic factors, such as pleiotropy [48] or genetic architectures which favour loss over gain [49], could also contribute to evolutionary trends in angiosperm morphology. Subsequent studies have reported directional trends for the evolution of many complex traits [50,51], including floral traits such as symmetry and colour [4,46] as well as the ability of natural selection to act on multiple floral trait combinations [52]. However, our study finds that directionality in rates of evolution has had a limited impact in determining the overall frequency of phenotypes for the six traits among angiosperms. The best-fitting model assumed equal rates of gain and loss of character states, and over all models examined, models with biased transitions received less than 30% of the AIC weight. This contrast between our study and previous studies that highlighted directionality is probably owing to our application of SSE methods to a random sample of angiosperms. These methods allow for the possibility that differences in diversification rates, in addition to transition rate biases, could contribute to patterns of trait variation in extant taxa [2], and thus do not presuppose a conclusion of directional transitions to account for unequal numbers of taxa in different states [3].

In contrast to the limited effect of directional transitions, our analysis identified differential diversification as an important component of the best-fitting models. Specifically, increased diversification is strongly associated with three key character states: corolla present, bilateral symmetry and few stamens. These floral features facilitate animal pollination, and their well-studied effects on plant–pollinator interactions are consistent with a role in shaping patterns of diversification. Bilateral floral symmetry is frequently proposed as a key innovation [14,53] and in our models, it affects diversification rates most consistently, occurring in all of the top 10 models. Critically, however, symmetry does not operate in isolation to influence diversification. The inclusion of two additional traits, corolla presence and stamen number, in the top models demonstrates that the positive effect of symmetry hinges on the state of other floral characters. Ecological studies show that both bilateral symmetry and reduced stamen number increase precision of pollen placement and facilitate specialization for particular pollinators [54–56]. Bilateral floral symmetry modulates pollen placement by constraining the approach of pollinators and positioning them with respect to the stamens [55]. Reductions in stamen number increase precision by promoting correspondence between the positions of pollen placement and stigma contact [54,56–58]. The inclusion of corolla presence among the key features suggests that bilateral symmetry is most important in flowers that have a perianth which includes a corolla, or that having a corolla provides greater flexibility of function for the calyx. Together, the potential for both more specialized and more precise interactions with pollinators may increase opportunities for prezygotic isolation and thus speciation in lineages with bilaterally symmetrical corollas and few stamens [14,23,59,60], although lower extinction rates may also play a role [60]. The many origins of this combination across the phylogeny (figure 2a) suggest that its association with diversification is not spurious [61,62], or limited to a single major clade, but instead that it is a key innovation that has arisen convergently in many angiosperm lineages.

If increased diversification owing to the key set of character states fully explained the pattern of trait disparity fitted by the models, the trait combination with greatest diversification rate should be most common, as is expected at equilibrium. By contrast, we estimate that this set is found in less than 40% of extant species. Our stochastic simulations suggest that this discrepancy is owing to the long time required for the evolutionary assembly of all three character states. In these simulations, a median of 11 million years (95% CI 2.0–37.4) elapsed before fixation of any trait combination other than the ancestral combination, and 71.5 million years (33.3–109.35) elapsed before the first origin of the three character combination with the highest diversification rate (figure 2a,c). These intervals are concordant with the fossil record, as flowers with reduced stamen number appeared within 20 million years after the origin of angiosperms, followed by bilaterally symmetric flowers roughly 44 million years later [63]. The number of angiosperms with the ancestral states continues to decline towards the present, whereas the number of taxa with the fastest diversifying combination of derived states continues to rise (figure 2b). Our simulations suggest that the approach to the equilibrium predicted by the model will be exceptionally gradual, as an additional 15 million years of evolution produced only a 1.5% closer approximation of the equilibrium frequencies. Thus, although the possession of bilaterally symmetrical corollas and few stamens clearly promotes diversification once it has evolved, the relative rarity of necessary transitions causes a lingering influence of the initial state of the common ancestor on current trait diversity. Given the slow pace of the approach to equilibrium, major global events (such as mass extinctions) that could alter the course of angiosperm evolution are likely to occur before the predicted frequencies are reached.

This inferred history of six functional floral traits reveals two general features that probably dominate the dynamics of trait evolution in many clades of organisms, and hence strongly influence the distribution of contemporary diversity. First, traits and trait combinations that experience low transition rates are likely to remain far from the expected equilibrium frequencies over long evolutionary timespans. This means that even character states that strongly promote lineage diversification may not be associated with high species richness at any instant, underscoring the importance of the timing of the ‘arrival of the fittest’ [64]. Second, many traits, like the features of floral organs, interact in organismal function and their effects, whether on the fitness of individuals in populations or on the success of lineages, depend on the state of other traits. Incorporating such context-dependence introduces additional complexity into studies of macroevolutionary patterns. Nevertheless, consideration of trait interactions in an organismal and functional framework will lead to a richer and more realistic picture of the evolutionary process and a stronger connection with microevolutionary studies, which have long emphasized synergistic effects [65,66].

As researchers move towards diversification analyses of multiple characters [67,68], we anticipate increasing interest in using approaches such as ours that directly address the potentially synergistic interactions between characters [15]. In this context, it is important to discuss the potential limitations of our analysis and of SSE methods in general. First, we assumed the tree and states were known with certainty, though relaxing these assumptions are possible. Second, SSE methods have been found to choose a trait-dependent diversification model when traits are simulated under a diversification-independent model and the tree comes from an unknown empirical process [69]. This reflects a well-known statistical issue: rejecting a null does not mean that the alternate model is correct. Here, we used multimodel inference rather than null hypothesis rejection, with a focus on the parameter values. This incorporates uncertainty in the models and focuses on the biological processes, rather than rejecting an unrealistic null. However, even though we used a variety of models, they were individually simple: there was some heterogeneity in process by state, but, otherwise, the same rates were used through time and across taxa. Future extensions of this work could examine additional sources of heterogeneity (e.g. variation in rates across time periods or across clades) [70], although such an increase in model complexity would require not only new methods, but also greater taxon sampling. The application of simulation approaches, as in this study and others [71], will be crucial for determining the power to estimate models and test macroevolutionary hypotheses.

Data accessibility

Data and scripts are available in Dryad at http://dx.doi.org/10.5061/dryad.486c0. Correspondence and requests for materials should be addressed to B.C.O. ([email protected]) or S.D.S ([email protected]).

Authors' contributions

This working group was led by C.B.F. (PI), W.S.A. (co-PI) and P.K.D. (co-PI). All authors contributed to the general design of the research question and analyses. B.C.O. selected the focal species. S.D.S., W.S.A., L.D.H., C.R.H., L.C.H., L.H., A.L., S.M., S.A.S., P.F.S., C.B.F. and P.K.D. scored floral traits. S.M. provided constraint ages for the tree that S.A.S. created. B.C.O., S.D.S., C.B.F. and P.K.D. designed the detailed analyses that B.C.O. ran using scripts he wrote. S.D.S., P.K.D., C.B.F., B.C.O. and L.D.H. wrote the paper, which the co-authors edited.

Competing interests

We declare we have no competing interests.

Funding

The working group that led to this publication was supported by the National Evolutionary Synthesis Center (NESCent), NSF no. EF-0423641.

Acknowledgements

David Baum, William Friedman, Steven Johnson, Matthew Streisfeld, Matt Pennell and three anonymous reviewers provided useful feedback on the manuscript.

Footnotes

Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/, which permits unrestricted use, provided the original author and source are credited.

References

1
Barraclough TG, Humphreys AM . 2015 The evolutionary reality of species and higher taxa in plants: a survey of post-modern opinion and evidence. New Phytol. 207, 291–296. (doi:10.1111/nph.13232) Crossref, PubMed, ISI, Google Scholar
2
Maddison WP . 2006 Confounding asymmetries in evolutionary diversification and character change. Evolution 60, 1743–1746. (doi:10.1111/j.0014-3820.2006.tb00517.x) Crossref, PubMed, ISI, Google Scholar
3
Nosil P, Mooers AO . 2005 Testing hypotheses about ecological specialization using phylogenetic trees. Evolution 59, 2256–2263. (doi:10.1111/j.0014-3820.2005.tb00933.x) Crossref, PubMed, ISI, Google Scholar
4
Ree RH, Donoghue MJ . 1999 Inferring rates of change in flower symmetry in asterid angiosperms. Syst. Biol. 48, 633–641. (doi:10.1080/106351599260201) Crossref, ISI, Google Scholar
5
Rabosky DL, McCune AR . 2010 Reinventing species selection with molecular phylogenies. Trends Ecol. Evol. 25, 68–74. (doi:10.1016/j.tree.2009.07.002) Crossref, PubMed, ISI, Google Scholar
6
Jablonski D . 2008 Species selection: theory and data. Annu. Rev. Ecol. Evol. Syst. 39, 501–524. (doi:10.1146/annurev.ecolsys.39.110707.173510) Crossref, ISI, Google Scholar
7
Farrell BD, Dussourd DE, Mitter C . 1991 Escalation of plant defense: do latex and resin canals spur plant diversification. Am. Nat. 138, 881–900. (doi:10.1086/285258) Crossref, ISI, Google Scholar
8
Goldberg EE, Kohn JR, Lande R, Robertson KA, Smith SA, Igic B . 2010 Species selection maintains self-incompatibility. Science 330, 493–495. (doi:10.1126/science.1194513) Crossref, PubMed, ISI, Google Scholar
9
Maddison WP, Midford PE, Otto SP . 2007 Estimating a binary character's effect on speciation and extinction. Syst. Biol. 56, 701–710. (doi:10.1080/10635150701607033) Crossref, PubMed, ISI, Google Scholar
10
Gillespie RG . 2013 Adaptive radiation: convergence and non-equilibrium. Curr. Biol. 23, R71–R74. (doi:10.1016/j.cub.2012.11.052) Crossref, PubMed, ISI, Google Scholar
11
Magallon S, Gomez-Acevedo S, Sanchez-Reyes LL, Hernandez-Hernandez T . 2015 A metacalibrated time-tree documents the early rise of flowering plant phylogenetic diversity. New Phytol. 207, 437–453. (doi:10.1111/nph.13264) Crossref, PubMed, ISI, Google Scholar
12
Chartier M, Jabbour F, Gerber S, Mitteroecker P, Sauquet H, von Balthazar M, Staedler Y, Crane PR, Schoenenberger J . 2014 The floral morphospace: a modern comparative approach to study angiosperm evolution. New Phytol. 204, 841–853. (doi:10.1111/nph.12969) Crossref, PubMed, ISI, Google Scholar
13
Stebbins GL . 1951 Natural selection and the differentiation of angiosperm families. Evolution 5, 299–324. (doi:10.2307/2405676) Crossref, ISI, Google Scholar
14
Sargent RD . 2004 Floral symmetry affects speciation rates in angiosperms. Proc. R. Soc. Lond. B 271, 603–608. (doi:10.1098/rspb.2003.2644) Link, ISI, Google Scholar
15
Donoghue MJ, Sanderson MJ . 2015 Confluence, synnovation, and depauperons in plant diversification. New Phytol. 207, 260–274. (doi:10.1111/nph.13367) Crossref, PubMed, ISI, Google Scholar
16
Jabbour F, Nadot S, Damerval C . 2009 Evolution of floral symmetry: a state of the art. Curr. Biol. 332, 219–231. (doi:10.1016/j.crvi.2008.07.011) Google Scholar
17
Sanderson MJ, Donoghue MJ . 1994 Shifts in diversification rate with the origin of angiosperms. Science 264, 1590–1593. (doi:10.1126/science.264.5165.1590) Crossref, PubMed, ISI, Google Scholar
18
Cronquist A . 1988 The evolution and classification of flowering plants. Bronx, NY: The New York Botanical Garden. Google Scholar
19
Takhtajan A . 1997 Diversity and classification of flowering plants. New York, NY: Columbia University Press. Google Scholar
20
Armbruster WS, Hansen TF, Pelabon C, Perez-Barrales R, Maad J . 2009 The adaptive accuracy of flowers: measurement and microevolutionary patterns. Ann. Bot.-Lond. 103, 1529–1545. (doi:10.1093/aob/mcp095) Crossref, PubMed, ISI, Google Scholar
21
Endress PK . 1982 Syncarpy and alternative modes of escaping disadvantages of apocarpy in primitive angiosperms. Taxon 31, 48–52. (doi:10.2307/1220588) Crossref, ISI, Google Scholar
22
Hodges SA, Arnold ML . 1995 Spurring plant diversification: are floral nectar spurs a key innovation? Proc. R. Soc. Lond. B 262, 343–348. (doi:10.1098/rspb.1995.0215) Link, ISI, Google Scholar
23
van der Niet T, Johnson SD . 2012 Phylogenetic evidence for pollinator-driven diversification of angiosperms. Trends Ecol. Evol. 27, 353–361. (doi:10.1016/j.tree.2012.02.002) Crossref, PubMed, ISI, Google Scholar
24
Fenster CB, Armbruster WS, Wilson P, Dudash MR, Thomson JD . 2004 Pollination syndromes and floral specialization. Annu. Rev. Ecol. Evol. Syst. 35, 375–403. (doi:10.1146/annurev.ecolsys.34.011802.132347) Crossref, ISI, Google Scholar
25
FitzJohn RG . 2012 Diversitree: comparative phylogenetic analyses of diversification in R. Methods Ecol. Evol. 3, 1084–1092. (doi:10.1111/j.2041-210X.2012.00234.x) Crossref, ISI, Google Scholar
26
FitzJohn RG, Maddison WP, Otto SP . 2009 Estimating trait-dependent speciation and extinction rates from incompletely resolved phylogenies. Syst. Biol. 58, 595–611. (doi:10.1093/sysbio/syp067) Crossref, PubMed, ISI, Google Scholar
27
Smith SA, Beaulieu JM, Donoghue MJ . 2009 Mega-phylogeny approach for comparative biology: an alternative to supertree and supermatrix approaches. BMC Evol. Biol. 9, 37. (doi:10.1186/1471-2148-9-37) Crossref, PubMed, ISI, Google Scholar
28
Stamatakis A . 2006 RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics 22, 2688–2690. (doi:10.1093/bioinformatics/btl446) Crossref, PubMed, ISI, Google Scholar
29
Chase MW et al. 1993 Phylogenetics of seed plants: an analysis of nucleotide sequences from the plastid gene rbcL. Ann. Missouri Bot. Garden 528–580. (doi:10.2307/2399846) Crossref, ISI, Google Scholar
30
Soltis DE et al. 2000 Angiosperm phylogeny inferred from 18S rDNA, rbcL, and atpB sequences. Bot. J. Linn. Soc. 133, 381–461. (doi:10.1111/j.1095-8339.2000.tb01588.x) Crossref, ISI, Google Scholar
31
Qiu Y-L et al. 1999 The earliest angiosperms: evidence from mitochondrial, plastid and nuclear genomes. Nature 402, 404–407. (doi:10.1038/46536) Crossref, PubMed, ISI, Google Scholar
32
Smith SA, O'Meara BC . 2012 TreePL: divergence time estimation using penalized likelihood for large phylogenies. Bioinformatics 28, 2689–2690. (doi:10.1093/bioinformatics/bts492) Crossref, PubMed, ISI, Google Scholar
33
Citerne H, Jabbour F, Nadot S, Damerval C . 2010 The evolution of floral symmetry. Adv. Bot. Res. 54, 85–137. (doi:10.1016/S0065-2296(10)54003-5) Crossref, ISI, Google Scholar
34
Faegri K, van der Pijl L . 1979 The principles of pollination ecology. Oxford, UK: Pergamon Press. Google Scholar
35
Mulcahy DL . 1979 The rise of the angiosperms: a genecological factor. Science 206, 20–23. (doi:10.1126/science.206.4414.20) Crossref, PubMed, ISI, Google Scholar
36
Armbruster WS, Debevec EM, Willson MF . 2002 Evolution of syncarpy in angiosperms: theoretical and phylogenetic analyses of the effects of carpel fusion on offspring quantity and quality. J. Evol. Biol. 15, 657–672. (doi:10.1046/j.1420-9101.2002.00414.x) Crossref, ISI, Google Scholar
37
Grant V . 1950 The protection of the ovules in flowering plants. Evolution 4, 179–201. (doi:10.2307/2405331) Crossref, ISI, Google Scholar
38
Thompson JN, Pellmyr O . 1992 Mutualism with pollinating seed parasites amid co-pollinators: constraints on specialization. Ecology 73, 1780–1791. (doi:10.2307/1940029) Crossref, ISI, Google Scholar
39
Kubitzki K . 2004 Flowering plants, dicotyledons: celastrales, oxidales, rosales, cornales, ericales. In Families and genera of vascular plants. (ed Kubitzki K ), p. 489. New York, NY: Springer. Google Scholar
40
Flora of North America Editorial Committee. 1993 Flora of North America north of Mexico. New York, NY: Oxford University Press. Google Scholar
41
Endress PK . 2001 The flowers in extant basal angiosperms and inferences on ancestral flowers. Int. J. Plant Sci. 162, 1111–1140. (doi:10.1086/321919) Crossref, ISI, Google Scholar
42
Endress PK, Doyle JA . 2009 Reconstructing the ancestral angiosperm flower and its initial specializations. Am. J. Bot. 96, 22–66. (doi:10.3732/ajb.0800047) Crossref, PubMed, ISI, Google Scholar
43
Beaulieu JM, O'Meara BC . In press. Detecting hidden diversification shifts in models of trait-dependent speciation and extinction. Syst. Biol. (doi:10.1093/sysbio/syw022) ISI, Google Scholar
44
Hughes NF, Mcdougall AB, Chapman JL . 1991 Exceptional new record of cretaceous Hauterivian angiospermid pollen from southern England. J. Micropaleontol. 10, 75–82. (doi:10.1144/jm.10.1.75) Crossref, Google Scholar
45
Brenner GJ . 1996 Evidence of the earliest stage of angiosperm pollen evolution: a paleoequatorial section from Israel. In Flowering plant origin, evolution, and phylogeny (eds Taylor DW, Hickey LJ ), pp. 91–115. New York, NY: Chapman and Hall. Google Scholar
46
Smith SD, Goldberg EE . 2015 Tempo and mode of flower color evolution. Am. J. Bot. 102, 1014–1025. (doi:10.3732/ajb.1500163) Crossref, PubMed, ISI, Google Scholar
47
Gould SJ . 1990 Wonderful life: the Burgess Shale and the nature of history. New York, NY: W. W. Norton & Co. Google Scholar
48
Smith SD . 2016 Pleiotropy and the evolution of floral integration. New Phytol. 209, 80–85. (doi:10.1111/nph.13583) Crossref, PubMed, ISI, Google Scholar
49
Preston JC, Martinez CC, Hileman LC . 2011 Gradual disintegration of the floral symmetry gene network is implicated in the evolution of a wind-pollination syndrome. Proc. Natl Acad. Sci. USA 108, 2343–2348. (doi:10.1073/pnas.1011361108) Crossref, PubMed, ISI, Google Scholar
50
Wiens JJ . 2001 Widespread loss of sexually selected traits: how the peacock lost its spots. Trends Ecol. Evol. 16, 517–523. (doi:10.1016/S0169-5347(01)02217-0) Crossref, ISI, Google Scholar
51
Goldberg EE, Igic B . 2008 On phylogenetic tests of irreversible evolution. Evolution 62, 2727–2741. (doi:10.1111/j.1558-5646.2008.00505.x) Crossref, PubMed, ISI, Google Scholar
52
Fenster CB, Reynolds RJ, Williams CW, Makowsky R, Dudash MR . 2015 Quantifying hummingbird preference for floral trait combinations: the role of selection on trait interactions in the evolution of pollination syndromes. Evolution 69, 1113–1127. (doi:10.1111/evo.12639) Crossref, PubMed, ISI, Google Scholar
53
Kay KM, Voelckel C, Yang JY, Hufford L, Kaska DD, Hodges SA . 2006 Floral characters and species diversification. In Ecology and evolution of flowers (eds Harder LD, Barrett SCH ), pp. 311–325. Oxford, UK: Oxford University Press. Google Scholar
54
Armbruster WS, Pelabon C, Hansen TF, Bolstad GH . 2009 Macroevolutionary patterns of pollination accuracy: a comparison of three genera. New Phytol. 183, 600–617. (doi:10.1111/j.1469-8137.2009.02930.x) Crossref, PubMed, ISI, Google Scholar
55
Neal PR, Dafni A, Giurfa M . 1998 Floral symmetry and its role in plant-pollinator systems: terminology, distribution, and hypotheses. Annu. Rev. Ecol. Syst. 29, 345–373. (doi:10.1146/annurev.ecolsys.29.1.345) Crossref, Google Scholar
56
Walker-Larsen J, Harder LD . 2000 The evolution of staminodes in angiosperms: patterns of stamen reduction, loss, and functional re-invention. Am. J. Bot. 87, 1367–1384. (doi:10.2307/2656866) Crossref, PubMed, ISI, Google Scholar
57
Darwin C . 1862 On the various contrivances by which British and foreign orchids are fertilised by insects. London, UK: John Murray. Google Scholar
58
Vogel S . 1996 Christian Konrad Sprengel's theory of the flower: the cradle of floral ecology. In Flora ecology: studies on floral evolution in animal-pollinated plants (eds Lloyd DG, Barrett SCH ), pp. 44–62. New York, NY: Chapman and Hall. Google Scholar
59
Donoghue MJ, Ree RH, Baum DA . 1998 Phylogeny and the evolution of flower symmetry in the Asteridae. Trends Plant Sci. 3, 311–317. (doi:10.1016/S1360-1385(98)01278-3) Crossref, ISI, Google Scholar
60
Armbruster WS, Muchhala N . 2009 Associations between floral specialization and species diversity: cause, effect, or correlation? Evol. Ecol. 23, 159–179. (doi:10.1007/s10682-008-9259-z) Crossref, ISI, Google Scholar
61
Maddison WP, FitzJohn RG . 2015 The unsolved challenge to phylogenetic correlation tests for categorical characters. Syst. Biol. 64, 127–136. (doi:10.1093/sysbio/syu070) Crossref, PubMed, ISI, Google Scholar
62
Ng J, Smith SD . 2014 How traits shape trees: new approaches for detecting character state-dependent lineage diversification. J. Evol. Biol. 27, 2035–2045. (doi:10.1111/jeb.12460) Crossref, PubMed, ISI, Google Scholar
63
Friis EM, Crane PR, Pedersen KR . 2011 Early flowers and angiosperm evolution. Cambridge, UK: Cambridge University Press. Crossref, Google Scholar
64
De Vries H . 1904 Species and varieties: their origin by mutation. Chicago, IL: The Open Court Publishing Company. Google Scholar
65
Phillips PC . 2008 Epistasis - the essential role of gene interactions in the structure and evolution of genetic systems. Nat. Rev. Genet. 9, 855–867. (doi:10.1038/nrg2452) Crossref, PubMed, ISI, Google Scholar
66
Wright S . 1931 Evolution in Mendelian populations. Genetics 16, 0097–0159. Crossref, PubMed, Google Scholar
67
Givnish TJ et al. 2015 Orchid phylogenomics and multiple drivers of their extraordinary diversification. Proc. R. Soc. B 282, 171–180. (doi:10.1098/rspb.2015.1553) Link, ISI, Google Scholar
68
Silvestro D, Zizka G, Schulte K . 2014 Disentangling the effects of key innovations on the diversification of Bromelioideae (Bromeliaceae). Evolution 68, 163–175. (doi:10.1111/evo.12236) Crossref, PubMed, ISI, Google Scholar
69
Rabosky DL, Goldberg EE . 2015 Model inadequacy and mistaken inferences of trait-dependent speciation. Syst. Biol. 64, 340–355. (doi:10.1093/sysbio/syu131) Crossref, PubMed, ISI, Google Scholar
70
O'Meara BC . 2012 Evolutionary inferences from phylogenies: a review of methods. Annu. Rev. Ecol. Evol. Syst. 43, 267–285. (doi:10.1146/annurev-ecolsys-110411-160331) Crossref, ISI, Google Scholar
71
Boettiger C, Coop G, Ralph P . 2012 Is your phylogeny informative? Measuring the power of comparative methods. Evolution 66, 2240–2251. (doi:10.1111/j.1558-5646.2011.01574.x) Crossref, PubMed, ISI, Google Scholar