Proceedings of the Royal Society B: Biological Sciences
You have access Research article

Hierarchical structure of ecological and non-ecological processes of differentiation shaped ongoing gastropod radiation in the Malawi Basin

Bert Van Bocxlaer

Bert Van Bocxlaer

CNRS, Université de Lille, UMR 8198 – Evo-Eco-Paléo, 59000 Lille, France

Limnology Unit, Department of Biology, Ghent University, 9000 Ghent, Belgium

Department of Animal Ecology and Systematics, Justus Liebig University, 35392 Giessen, Germany

[email protected]

Google Scholar

Find this author on PubMed

Published:https://doi.org/10.1098/rspb.2017.1494

    Abstract

    Ecological processes, non-ecological processes or a combination of both may cause reproductive isolation and speciation, but their specific roles and potentially complex interactions in evolutionary radiations remain poorly understood, which defines a central knowledge gap at the interface of microevolution and macroevolution. Here I examine genome scans in combination with phenotypic and environmental data to disentangle how ecological and non-ecological processes contributed to population differentiation and speciation in an ongoing radiation of Lanistes gastropods from the Malawi Basin. I found a remarkable hierarchical structure of differentiation mechanisms in space and time: neutral and mutation-order processes are older and occur mainly between regions, whereas more recent adaptive processes are the main driver of genetic differentiation and reproductive isolation within regions. The strongest differentiation occurs between habitats and between regions, i.e. when ecological and non-ecological processes act synergistically. The structured occurrence of these processes based on the specific geographical setting and ecological opportunities strongly influenced the potential for evolutionary radiation. The results highlight the importance of interactions between various mechanisms of differentiation in evolutionary radiations, and suggest that non-ecological processes are important in adaptive radiations, including those of cichlids. Insight into such interactions is critical to understanding large-scale patterns of organismal diversity.

    1. Introduction

    The completion of reproductive isolation between populations is a key step during speciation in sexually reproducing Eukaryota [1]. Such isolation is obtained once gene flow is fully interrupted, and it may be prezygotic and often reversible, e.g. related to geographical barriers or assortative mating [2], up to postzygotic, intrinsic and irreversible, e.g. because of Dobzhansky–Muller incompatibilities [1,3]. The processes involved determine to a large extent the strength and stability of the arising reproductive barrier(s) through time [3]. As such, insights into the nature of reproductive isolation and its genomic basis are prerequisites to understanding speciation [4,5].

    Ecological speciation, i.e. the development of reproductive isolation between populations as a direct or indirect result of adaptation to different environments, has been examined extensively [3], but non-ecological ‘drivers’ of population differentiation and speciation may be common too [4]. Prolonged geographical isolation in ecologically similar environments may lead to the fixation of alternate, incompatible mutations in allopatric populations under uniform selection (mutation-order speciation), or genetic drift could facilitate differentiation and speciation, at least when founding populations are small [4,69]. Ecological and non-ecological processes can interact in intricate ways during speciation, both in space and time [10,11], and as to their net genomic effects, i.e. their relative contributions to changes in allele frequencies [4,12]. Isolating the contributions of neutral and selective processes to population differentiation is often difficult [1,4,12,13], certainly in retrospect as it requires separating reproductive barriers that initiated and eventually completed speciation from those that evolved afterwards [4,11]. Incipient species may be exceptionally informative in this respect [13,14], and those of ongoing radiations may additionally provide important insights into the roles of various speciation mechanisms in evolutionary radiation, which represents an important knowledge gap at the interface of microevolution and macroevolution [15,16].

    Here I document mechanisms of population differentiation and speciation in an ongoing gastropod radiation from Lake Malawi. This study represents—to my knowledge—the first in depth examination of such processes in an invertebrate radiation that coincided with a renowned adaptive radiation of cichlid fishes [1719]. Specifically, I performed integrative studies on genomic and morphological data in an ecological context to examine how ecological and non-ecological processes have affected population structure in the emergent species flock. I found a remarkable hierarchical structure of older, non-ecological processes on a large geographical scale and younger, ecological processes on a regional scale. Substantial differences exist in the relative contribution of these processes to the genome-wide differentiation in each speciation event. The work illustrates that various processes act simultaneously in evolutionary radiations and that their interaction in space and time is fundamental to the prospects of radiation. Overall, genetic drift and mutation-order processes have been essential in causing genome-wide differentiation in this gastropod radiation, although, the occurrence of ecological processes may give the impression that it represents a ‘full-blown’ adaptive radiation.

    2. Study system

    Lake Malawi in the East African Rift is one of the most species-rich freshwater systems in the world [18], and its climatological and environmental history is becoming increasingly well documented [2022]. Here I focus on the ongoing Lanistes radiation in the Malawi Basin, which comprises five nominal species that exhibit differentiation at various levels of organization (electronic supplementary material, text and figure S1) [2325]. Two of these species were previously assigned to lineages occurring elsewhere in Africa based on morphological resemblance, but they are genetically distinct [26], and I refer to them as L. sp. (ovum-like) and L. sp. (ellipticus-like). They occur in the lake on shallow sandy to muddy substrates in association with shoreline vegetation, but mainly in shallow fringing pools, swamps, satellite lakes and rivers [25]. The other three species, Lanistes solidus, Lanistes nyassanus and Lanistes nasutus, are restricted to soft substrates within Lake Malawi and have presumably evolved intra-lacustrine [23,27]. The habitats of L. solidus and nyassanus overlap in water depth from approximately 1–35 m, whereas L. nasutus is rare and lives at 40–90 m depth [23,25,27,28]. Behavioural, anatomical and morphological differences among these species have been interpreted as adaptations to (i) wave action and water energy (shell morphology), (ii) the presence of predators (shell thickness, variation in growth rates, burying behaviour, development of a nocturnal lifestyle), and (iii) substrate-specific food sources (differences in radulae) [23]. For certain traits, e.g. shell morphology, specimens with intermediate characteristics exist, which was attributed to the recent (several 10 000s up to 100 000 years), still ongoing differentiation in the Lanistes radiation [23]. This timeframe of diversification is substantiated by molecular data [26] and the rising water level after severe megadroughts approximately 75–135 ka ago [20,21], which has also triggered diversification in Malawian cichlids [21,29] and viviparid gastropods [30,31].

    3. Material and methods

    (a) Sampling

    Specimens were collected in 2006, 2007 and 2008 from soft substrates in the Malawi Basin, as Lanistes does not occur on rocks [24,25] (additional sampling information is provided in the electronic supplementary material, text, tables S1 and S2). The samples contain all abovementioned species, except for L. nasutus, which is rare and presumably strongly declining in abundance [28]. Shell and tissue vouchers are stored at the Biodiversity and Systematics collection of the Justus Liebig University Giessen and the Natural History Museum London. Individuals were identified to the currently valid nominal Lanistes species of Lake Malawi following relevant literature descriptions [24,25]. Environmental conditions at collecting sites were characterized with five variables that are relevant in shaping ampullariid habitats [24], i.e. substrate composition, temperature and the concentrations of dissolved oxygen, phosphate and calcium. Habitat diversity was examined with non-metric multidimensional scaling (nmMDS) and model-based clustering using multivariate normal mixture models (see below).

    (b) Survey of genome-wide polymorphism

    Technical data on DNA extractions and amplified fragment length polymorphism (AFLP) protocols are provided in the electronic supplementary material, text. In total, 14 of the 60 primer combinations tested for selective PCR yielded clearly readable profiles (electronic supplementary material, table S3). AFLP bands were digitally captured with e-Seq v. 2.0, and automatically scored with SagaMX v. 3.2 (both from LI-COR). Scoring results for all loci were checked by eye without prior knowledge of population affiliation. Only unambiguous polymorphic bands ranging 100–350 bp were selected for subsequent analysis to mitigate the amount of potentially homoplasic bins. To estimate the quality of the generated data the entire AFLP protocol was repeated with a random subset of 10% of the specimens following ref. [32]. This test resulted in an overall reproducibility of greater than 95% (electronic supplementary material, table S4), indicating that the method is robust and the obtained dataset reliable.

    (c) Genetic analyses

    To examine the effects of selective processes, I compared the coefficients of population differentiation for all loci with those expected under neutral evolution in BayeScan v. 2.1 [33] (20 pilot runs, sampling = 100 000 generations, burn-in = 50 000 generations, thinning interval = 10). Loci that have significantly higher or lower differentiation than expected under neutral evolution were considered under selection, whereas the others were considered neutral for further analyses.

    Pairwise population differentiation was quantified with Wright's (locus-specific) FST following ref. [34] for selectively neutral loci and all loci with AFLP-SURV v. 1.0 [35]. Null allele frequencies were calculated with a Bayesian approach using a non-uniform prior distribution [36], which tolerates moderate departures from Hardy–Weinberg equilibrium [37]. Nei's genetic diversity (Hj), which is equivalent to the expected heterozygosity across loci, and the global FST were also calculated with AFLP-SURV following ref. [34].

    Patterns of molecular differentiation were identified from all markers with discriminant analysis of principal components (DAPC) using Adegenet v. 1.4 [38] in R v. 3.2.1 [39] and with Structure v. 2.3.4 [40]. DAPC was performed with the output of K-means clustering based on a Bayesian information criterion (BIC). As no prior information on the degree of gene flow among populations is available I used an admixture model with correlated allele frequencies in Structure, which allows for individuals having inherited a fraction of their genome from each of the K putative clusters. Analyses were conducted for K = 1–10, with 10 independent runs per K. Each Markov chain ran for 100 000 generations following a burn-in of 50 000 generations. The underlying genetic structure was examined with the ΔK statistic [41] in ‘Clumpak’ [42].

    (d) Morphometric analyses

    The specimens included in AFLP analyses were subjected to semilandmark analyses as described in ref. [43]. I used mainly adult specimens with well-preserved shells for molecular analyses, but in some cases juveniles or specimens with damaged shells were included to increase sampling sizes. Such specimens (n = 18; <10% of the dataset; electronic supplementary material, table S1) were omitted from morphometric analyses, because allometric differences and/or ambiguous reconstructions of shape would inflate the morphological variability of populations. Because Lanistes displays indeterminate growth (although growth is slower in adults) I further reduced the impact of allometric variation by focusing only on shape differences. Voucher images were compiled and digitized in tpsDig v. 2.16 [44] with 11 landmarks (LMs) and two curves of equally spaced semilandmarks (electronic supplementary material, figure S2). Individuals were aligned using a baseline (LMs 1 and 2) and subjected to generalized Procrustes superimposition in CoordGen6h [45]. Subsequently, semilandmarks were slid along their respective curve using the minimum Procrustes distance criterion in SemiLand6 [45]. These coordinates were imported in R, converted to a Euclidean distance matrix and subjected to nmMDS in two dimensions using Vegan v. 2.3-0 [46] and Mass v. 7.3-41 [47]. The goodness of fit of the resulting ordination was examined using Kruskal's and Clarke's rules of thumb [48,49]: stress values <5 indicate excellent ordination, those less than 15 good ordination, with limited chances of false inferences. I examined shape variation along the nmMDS axes in GeoMorph v. 2.1.6. [50], and the contribution of individual morphometric variables to the morphospace occupation with vegan's envfit function based on 1000 permutations. The morphospace occupation was subjected to model-based clustering using multivariate normal mixture models (n = 14) in Mclust v. 5.0.1 [51] without a priori classification [30]. These models make different assumptions as to the expected shape, volume and orientation of trait variance and covariance, and support is evaluated with a BIC. Statistically preferred clustering solutions may not be biologically realistic, and the reliability of the outcome depends on the representation of ‘groups’ in the dataset (discussed in ref. [30]).

    Subsequently, I estimated the degree of phenotypic differentiation, PST, as an analogue to QST [52]. If PST serves as a reasonable approximation of QST, PSTs can be compared with FSTs calculated from the neutral loci to examine the roles of genetic drift and selection in the observed differentiation in phenotypic traits. Beyond genetic factors, PST can be influenced by environmental and non-additive genetic effects, and, hence, the accuracy with which QST can be approximated by PST requires evaluation [53].

    I calculated PSTs for pairwise population comparisons directly from the morphospace occupation using the formula: Inline Formula (eqn. 3 in ref. [53]). The phenotypic variances between and within populations (Inline Formula and Inline Formula, respectively) were calculated from the mean squares generated by an analysis of variance [54], which is unbiased by sampling effects and represents a conservative way of calculating PSTs. Two parameters influence the approximation: c, the proportion of total genetic variance owing to additive genetic effects across all populations, and heritability, h2. Values for these parameters are difficult to obtain for populations from the wild. Therefore, I evaluated the sensitivity of my analyses for a wide range of scenarios for c/h2 (i.e. from 0.0 to 2.0 in steps of 0.1). Specifically, I bootstrapped the original Procrustes superimposition coordinates and repeated ordinations in morphospace and PST calculations 1000× for each scenario of c/h2. The resulting 95% confidence interval (CI) on pairwise PST comparisons between populations was compared with the 95% CI on the global FST for all neutral loci [53]. Additionally, I tested at what values of c/h2 the corresponding PST distribution significantly exceeds the average locus-specific FSTs from all neutral loci with Wilcoxon rank-sum tests [55]. Shell morphology was considered to be one multidimensional trait for these tests (although it arguably fulfils several functions; see §2).

    (e) Inference of mechanisms of differentiation

    I examined processes of geographical differentiation in the Lanistes radiation by estimating the pairwise relatedness between individuals with a kinship coefficient, Fij, for dominant markers (based on all loci) [56]. These calculations require information on the inbreeding coefficient, FI, which is not available for Lanistes from the Malawi Basin. Hence, I repeated the calculations for scenarios of Hardy–Weinberg equilibrium up to substantial inbreeding (FI = 0.0 to 0.6, respectively, in steps of 0.2). This strategy is reasonable because deviations from the real, unknown value with up to 0.2 would bias kinship estimates less than 15% [56]. Kinship coefficients were regressed on the natural logarithm of spatial distances among localities in SPAGeDi, v. 1.4 [57] for the entire basin and for individual regions. Summary statistics are based on tests with 1000 permutations and jackknife procedures.

    Genetic differentiation among populations (FST) may arise owing to selection, neutral processes or both. Given that PST reasonably approximates QST for the Lanistes dataset (see §4), I examined signals of adaptation by comparing pairwise population PSTs with locus-specific FSTs (for neutral loci), using the SPAGeDi results to account for geography. If the PST of a trait (nmMDS1) is significantly larger than the corresponding neutral FST, genetic drift alone is not sufficient to explain the observed differentiation, implying that diversifying selection occurred; if the observed PST is significantly lower than the neutral FST stabilizing selection is implied [55].

    4. Results and discussion

    (a) Environmental analysis indicates two habitat types

    nmMDS of the sampling localities in environmental space resulted in very limited stress (=2.08), and permutation tests indicate that three variables (substrate composition, temperature and dissolved oxygen) contribute significantly to the ordination (figure 1). Model-based clustering without a priori group assignments reveals that localities cluster into two main groups or benthic habitats along nmMDS1, which relate strongly to differences in environmental stability and are labelled ‘stable’ and ‘fluctuating’ in what follows (more information on the examination of habitats is provided in the electronic supplementary material, text).

    Figure 1.

    Figure 1. Non-metric multidimensional scaling (nmMDS) of the 15 sampling localities based on five environmental variables that characterize Lanistes habitats. Three of these variables, i.e. substrate composition, temperature and dissolved oxygen, contribute significantly to the environmental differentiation between localities. Model-based clustering without a priori group assignments indicates that sampling localities fall into two main groups or habitats (support of two-group model over models with 1, 3 or 4 groups: ΔBIC ≥ 1.69). Both habitats differ mainly in their degree of environmental stability, and are categorized as ‘stable’ and ‘fluctuating’. The mclust model identifiers in the inset are explained in ref. [51].

    (b) Molecular diversity and differentiation

    In total 201 AFLP loci were scored reliably for 182 Lanistes individuals from 15 populations (electronic supplementary material, table S1). BayeScan analysis indicates that 14 loci (6.97%) had α values indicative for diversifying (positive) selection (false-discovery rate (FDR) = 0.05) and these are subsequently referred to as outlier loci (electronic supplementary material, figure S3). Seven of these markers are particularly strong outliers (FDR < 0.001). As none of the examined loci show evidence for balancing selection, the other 187 loci are considered neutral. This proportion of outliers is very similar to what has been reported in studies of genome-wide differentiation associated with (local) adaptation and speciation (approx. 2–8%) [5862].

    Within-population genetic diversity ranges from 0.194 to 0.287 (for neutral markers; electronic supplementary material, table S1) and is significantly larger in the southern than in the northern region, and in stable than fluctuating habitats (Wilcoxon rank-sum test, W = 9, p = 0.029; W = 1, p < 0.001, respectively). The global FST of the dataset is 0.0975 (95% CI: 0.0313–0.1637) for neutral loci and 0.1232 (95% CI: 0.0732–0.1732) for all loci. These values are similar to those obtained in the study of speciation in Timemia and cichlids [12,63].

    Examination of patterns of molecular diversity with DAPC and k-means clustering yields the best support for four molecular groups, each with an individual assignment probability of 1.00 (figure 2a). Other models have substantially higher BIC values (ΔBIC ≥ 1.26), except for a five-group model (ΔBIC = 0.05). Analysis in Structure indicates the highest ΔK for two groups (102.3 whereas ≤ 22.4 for other scenarios; electronic supplementary material, figure S4). Although different in the ‘preferred’ number of groups, DAPC and Structure provide almost identical attributions of specimens to clusters for scenarios of 2–5 groups. Both methods support a strong genetic separation between populations from the northern and southern regions of the Malawi Basin. The four-group DAPC model additionally recognizes two groups in each region. (The two-group Structure result can thus be reconstructed directly from the four-group DAPC model by regional lumping and, as such, I use DAPC for visualization.) Individuals from the two northern groups (I and II) are regularly found in sympatry, whereas those from the southern groups (III and IV) occur almost exclusively in allopatry (figure 2b). Groups I and II are very similar in their composition of (morphologically identified) nominal species, whereas groups III and IV consist almost exclusively of L. solidus and L. nyassanus and of L. sp. (ovum-like) and L. solidus, respectively (electronic supplementary material, figure S5). The differences between groups within regions are more limited than those between regions: L. sp. (ellipticus-like) and L. nyassanus occur only in the northern and southern region, respectively. In the five-group DAPC model the abovementioned group IV is further split into two parts: one part contains the specimens from Lake Chilingali (am0815), the other part the rest. Analyses of patterns of molecular diversity are important to provide biological meaning to the results, but I did not rely on specific grouping scenarios in subsequent analyses, beyond analysing patterns by region following the strong interregional molecular differentiation.

    Figure 2.

    Figure 2. Genetic diversity and morphological disparity in the ongoing Lanistes radiation of the Malawi Basin. (a) Discriminant analysis of principal components (DAPC; n = 182) gives the best BIC support for four genetic groups, with a strong separation between the northern and southern regions of the basin. (b) Lake Malawi and surroundings with indication of sampling localities, colour-coded genetic diversity at these localities, bathymetry and shoreline substrates (after ref. [22]). In the north both genetic groups occur highly sympatric but in the south not. The dashed white outline at the centre of the rift indicates the approximate extent of the lake during Late Pleistocene low stands [20,21]. The morphospace occupation (n = 164) indicates an incomplete separation of molecular groups and of specimens from the northern and southern regions of the basin (c), but a strong separation as to habitats (d). (d) The nominal species occupy somewhat separate areas in morphospace although overlaps are substantial, especially between L. sp. (ellipticus-like) and L. sp. (ovum-like), which is inflated by allometric effects (young adults of L. sp. (ovum-like) resemble L. sp. (ellipticus-like)). Colours reflect habitats, symbols the nominal species; electronic supplementary material, figures S6 and S7 present additional representations of morphological variability.

    (c) Morphological differentiation

    Ordination of semilandmark data with nmMDS indicates a stress of 14.87, suggesting limited risk of drawing false inferences. The morphospace distribution (figure 2c,d and electronic supplementary material, figure S6) resembles a single ‘supercluster’ (ΔBIC ≥ 2.51 over model-based clustering solutions with two or more clusters), even though specimens occupy different parts of the morphospace depending on the nominal species to which they belong and the region and habitat they occupy. Substantial differences in morphospace occupation also exist between molecular groups, except for between groups I and II (electronic supplementary material, figure S7 details shape differences along both axes). If four clusters are enforced—given that the dataset comprises four valid, nominal species and following the results of the DAPC analysis—the resulting morphological groups coincide well with the nominal species (electronic supplementary material, figure S6) rather than with the molecular groups. Two-way multivariate analysis of variance (MANOVA) indicates significant differences in morphospace occupation by region (Wilks' λ = 0.81, F2,159 = 18.51, p < 0.001) and by habitat (Wilks’ λ = 0.31, F2,159 = 176.68, p < 0.001) but not by their interaction (Wilks' λ = 0.98, F2,159 = 1.84, p = 0.162). The morphospace differentiation between habitats is highly significantly larger than between regions (mean ± standard deviation: 0.0430 ± 0.0031 and 0.0419 ± 0.0035, respectively; Wilcoxon rank-sum test on 1000 bootstrap replicates: W = 404 500, p < 0.001), as can be observed by comparison of figure 2c,d.

    (d) QST can be approximated by PST

    For most of the relevant interval of c/h2 (i.e. 0.15–2.00) mean PSTs exceed the global FST from neutral loci, and the 95% CIs on PSTs do not overlap with the upper confidence limit on the global FST for c/h2 ≥ 0.45 (electronic supplementary material, figure S8a). Statistical tests indicate that PSTs significantly exceed the average locus-specific FSTs of neutral loci for values of c/h2 ≥ 0.5 (Wilcoxon rank-sum test for c/h2 = 0.5: W = 7872, p = 0.002) (electronic supplementary material, figure S8b). Therefore, the PSTs for this dataset are indicative of moderate-to-strong phenotypic divergence, and they can be used to approximate QST at c/h2 ≥ 0.5. To be conservative, I compared PSTs and average locus-specific FSTs for c/h2 = 0.5 rather than for the null assumption of c/h2 = 1.0 [53]. Population pairwise PSTs and FSTs are reported in electronic supplementary material, table S5.

    (e) Patterns of population differentiation

    The regression of kinship coefficients on spatial distance yields a significant negative correlation for the entire dataset (r2 = 0.216, p < 0.001; figure 3a), but this trend disappears when regions are analysed separately. A weak yet statistically significant regression is obtained in the north (r2 = 0.013, p = 0.023; figure 3b) but not in the south (r2 = 0.005, p = 0.237; figure 3c). These results are robust throughout the range of reasonable FI values (FI = 0.0–0.6).

    Figure 3.

    Figure 3. Mechanisms of genetic differentiation in the Lanistes radiation. (a–c) Relationship of kinship coefficients (using all loci) on spatial distances for the complete Malawi Basin (a), the northern region (b), and the southern region (c). The black curve assumes Hardy–Weinberg equilibrium, but results are robust for progressively increasing inbreeding (FI = 0.2, 0.4 and 0.6; grey curves). The significant negative regression for the entire dataset does not persist within individual regions, suggesting the existence of a strong geographical barrier to gene flow coinciding with the centre of rifting. (d–g) Comparisons of phenotypic differentiation (PST) and neutral genetic differentiation (FST). (d) Comparisons coded according to regions; confidence intervals (CIs) are only shown for pairwise population comparisons (n = 70) for which the 95% CI on PSTs and on FSTs do not overlap with the line of FSTPST equality (dotted dash line), thus indicating adaptation or stabilizing selection. (e) The same comparisons, but coded as to habitat properties. Jointly (d,e) indicate that most adaptation occurs between regions and between habitats, but to a lesser extent also within regions and habitats. A positive and significant correlation between phenotypic differentiation and neutral genetic differentiation (blue regression lines), i.e. isolation by adaptation (IBA), exists for the entire dataset (e), but IBA occurs exclusively within regions (f) not between regions (g). The FSTs associated with comparisons between regions fall in narrow zones (blue vertical bars in g) that indicate a phase of genetic admixture in the past.

    PSTs are significantly higher than the associated FSTs for 55 pairwise population comparisons (approx. 52%) and significantly lower in 15 other comparisons (approx. 14%). Most adaptation occurs between populations from different regions (figure 3d) and different habitats (figure 3e), but also within regions and habitats evidence for adaptation exists (figure 3d,e and electronic supplementary material, figure S9). A significant, positive correlation is observed between phenotypic divergence and neutral genetic differentiation for the entire dataset (r = 0.262, p = 0.007). Accounting for geographical effects (figure 3a–c), strong correlations are obtained for PSTFST comparisons within regions (south: n = 28; r = 0.596, p < 0.001; north: n = 21; r = 0.462, p = 0.035, all regional comparisons: n = 49; r = 0.488, p < 0.001; figure 3f), but not between regions (n = 56; r = −0.187; p = 0.168).

    (f) Processes of population differentiation

    Remarkable differences exist in patterns of differentiation within and between regions of the Malawi Basin. Below, I examine evidence for ecological and non-ecological mechanisms of differentiation separately before producing an integrative model of differentiation in the Lanistes radiation. Non-ecological mechanisms include neutral processes and divergence under uniform selection in separate regions. The role of neutral processes is illustrated by the significant negative correlation between gene flow (or kinship) and geographical distance for the entire dataset (figure 3a). If isolation by distance (IBD) were the main explanation for geographical differentiation, we would also expect significant negative correlations within regions and especially in the southern region given the overall larger distance between sampling localities than in the northern region. As correlations within regions are weak to non-existing, a single migration barrier explains the geographical differentiation in the Lanistes radiation from the Malawi Basin best, although weak IBD may overlay it. Geological and ecological data support this interpretation: Lanistes is restricted to shallow, soft-substrate habitats in the Malawi Basin, and, when lake levels are high, as during the last approximately 60 ka [20,21], such habitats occur only along the northernmost and southernmost shores of the basin. At the centre of rifting, border faults coincide with the shorelines of Lake Malawi, both east and west [64,65], and these steep, rocky shorelines (figure 2b) are uninhabitable for Lanistes [24,66]. In combination with the anoxic bottom waters [28] they form a strong migration barrier for molluscs. The molecular consequences of geographical separation since the water level rose are recorded in the pairwise population FSTs across the barrier (figure 3g): most of these FSTs fall in a narrow band at FST = ∼0.1, representing divergence since the last major event of genetic admixture, i.e. the last persistent low stand (figure 4). The occurrence of another narrow band at FST = ∼0.2 may provide evidence for an older period of admixture, however, all these comparisons involve satellite Lake Chilingali (am0815). The Lanistes specimens from this satellite lake form the most genetically differentiated population in the southern region (electronic supplementary material, figure S9), but they also have the lowest intrapopulational genetic diversity, which indicates that founder effects and genetic drift may have enhanced differentiation. Lake Chilingali is well separated from Lake Malawi and, despite its shallow depth (approx. 5 m), it also harbours a strongly differentiated Rhamphochromis population [67].

    Figure 4.

    Figure 4. (a) Conceptual model of genetic and morphological differentiation in the ongoing Lanistes radiation of the Malawi Basin over time. Lake Malawi changed from a shallow, green to a deep, blue lake, most recently after megadroughts approximately 75–130 ka ago [20,21], which triggered diversification in Lanistes. Solid lines represent periods during which neutral processes predominate whereas dotted lines indicate phases of elevated selection and adaptation. Per unit of time larger genetic and morphological differentiation may occur under selection than under neutral processes. As the lake level rose, differentiation in Lanistes was initiated by geographical separation of populations in the northern and southern regions. Periods of more stable, high lake levels subsequently allowed adaptation and diversification, but these periods were interrupted occasionally by environmental perturbations. Perturbations may have caused a decrease in genetic and morphological diversity, e.g. by secondary contact, hybridization and reticulate evolution (arbitrarily represented in the grey lineage) and/or the extinction of genetic lineages (arbitrarily represented in the black lineage). The extant fauna (colour-dotted lines) displays a hierarchical structure of differentiation mechanisms: older, non-ecological mechanisms (geographical differentiation and parallel evolution) occur mainly between regions, and younger, ecological mechanisms (isolation by adaptation) within regions. (b) Hypothesis of the spatio-temporal constraints of water level fluctuations in the Malawi Basin (based on ref. [20]) on the observed molecular and morphological differentiation in Lanistes, indicating the last period of major genetic admixture, the separation between regions and adaptation within regions. SEC, shoreline elevation constraints.

    Whereas molecular differentiation is strongest between regions (figure 2a), morphological differentiation occurs mainly between habitats (figure 2d), although highly significant differences in morphospace occupation occur also by region (figure 2c). Given that both habitats occur in each region, and that the common ancestor of the Lanistes radiation necessarily preferred one habitat type (adaptation within regions postdates molecular differentiation between regions (see below)), the molecular and morphological data jointly provide evidence for parallel evolution (electronic supplementary material, figure S10). Parallel evolution suggests that similar selective regimes existed in both regions and that mutation-order processes may have been influential in the differentiation between regions. Evidence for stabilizing selection is found in PSTFST comparisons between regions (red symbols in figure 3d) that correspond to comparisons between fluctuating or fluctuating and stable habitats (red and black symbols in figure 3e). This finding also suggests that at least some populations are near optima in the adaptive landscape. In recently separated lineages, as is the case for Lanistes, the probability that the same genes are involved in independent parallel adaptation between regions is high [68], which may increase the chance for rapid evolution of incompatibilities.

    Ecological processes in the ongoing Lanistes radiation are evidenced by population pairwise PSTFST comparisons with PSTs that are significantly higher than the corresponding FSTs (figure 3d–g), and the differentiation in each region of quantitatively discernible ecotypes (figure 2c,d and electronic supplementary material, figure S10) (see refs [52,69]). The strongest evidence for adaptation occurs in pairwise comparisons across regions and across habitats (red symbols in figure 3d,e), but only within regions evidence for isolation by adaptation (IBA) exists, i.e. a positive correlation between adaptive phenotypic divergence and neutral genetic differentiation [58]. IBA is observed for all neutral AFLP loci, indicating it to be a strong driver of neutral genomic differentiation within regions. Between regions adaptation is occurring, but IBA does not contribute much to the associated neutral genetic differentiation. In summary, ecological processes are responsible for most of the neutral genetic differentiation within regions, whereas neutral genome-wide differentiation between regions is mainly caused by non-ecological processes.

    (g) Spatio-temporal model of radiation

    FSTs (of neutral loci) for comparisons between regions are significantly larger than those for comparisons within regions (Wilcoxon rank-sum test: W = 610, p < 0.001; compare figure 3f,g), which indicates that the geographical separation between regions predates IBA within each region. This finding suggests that only during prolonged high lake level phases the ‘stable’ and ‘fluctuating’ habitats persisted for a sufficient period for IBA to occur. The data reveal a strong structure of differentiation processes in time and space, tightly linked to environmental change in the Malawi Basin (figure 4): as the water level rose after the last persistent low stand geographical separation occurred between regions, and only later when lake levels were high and stable, adaptation to both habitat types occurred within each region. The hierarchical structure of various differentiation mechanisms, rather than their mere presence, has promoted evolutionary radiation by allowing multiple speciation events to unfold simultaneously, resulting in additive differentiation as mechanisms interacted synergistically in space and time. Interestingly, the ‘stage model’ of diversification in Lake Malawi cichlids [17], and the role of environmentally triggered habitat isolation and reconnection as a potential ‘species pump’ [67], suggest that similar interactions of differentiation mechanisms occurred in cichlids, however, currently these models focus on adaptive processes only. Cichlids would have primarily diversified during deep-lake conditions, whereas intermittent turbid, low lake level phases reduced rocky habitats, decreased visibility and opportunities for assortative mating, and promoted hybridization and the collapse of some clades [21,7072]. Here I demonstrate that ecological and non-ecological processes of differentiation can interact in a structured way, especially in systems where geographical ranges and habitat diversity have been waxing and waning over time. The implications of my findings are probably important for cichlids and other adaptive radiations. As most of these radiations are more complex than that of Lanistes in the Malawi Basin, separating the mechanisms leading to speciation from those that followed afterwards will be more complicated, however. These complications are exemplified by cichlid radiations, for which it is often difficult to identify sister species relationships, to control for possible introgression from non-sister taxa, and to reconstruct the geographical context [73,74]. My results also suggest that as divergence progresses, interactions of various processes may cause the role of non-ecological processes to be underestimated as ecological processes are more effective in obliterating non-ecological signals, e.g. via selective sweeps or the progressive expansion of genomic islands of differentiation, than vice versa. Interactions of microevolutionary processes are critical to the potential for radiation, and documenting such interactions is therefore essential to understand the relative contribution of various processes to large-scale patterns of organismal diversity. Genomic approaches and experiments are required to test the genome-wide effects of the various processes involved and to unravel how selection acted in the differential adaptation in Lanistes.

    Ethics

    This study results from research cooperation KM/1/1.64 between Ghent University, Belgium and the Karonga Museum, Ministry of Tourism, Wildlife and Culture, Malawi. Fellowship 12N3915N of the FWO Vlaanderen received clearance from the ethical committee on animal experimentation of Ghent University.

    Data accessibility

    Datasets are deposited in Dryad at doi: http://dx.doi.org/10.5061/dryad.dn58p [75].

    Competing interests

    I have no competing interests.

    Funding

    Research was funded by postdoctoral fellowship 12N3915N of the FWO Vlaanderen and DFG grants SCHR 352/9-1, AL 1076/6-2, 7-1, 8-1.

    Acknowledgements

    I thank the Cultural and Museum Centre Karonga, Harrison Simfukwe and Friedemann Schrenk for support, Christian Albrecht, Tom Wilke and Jon Ablett for access to collections and Matthias Schrader and Silvia Nachtigall for laboratory work. Xavier Vekemans, Thomas Neubauer and an anonymous reviewer provided constructive advice. Editors: John Pandolfi and Sasha Dall.

    Footnotes

    Electronic supplementary material is available online at https://dx.doi.org/10.6084/m9.figshare.c.3854737.

    Published by the Royal Society. All rights reserved.

    References