DNA barcoding reveal patterns of species diversity among northwestern Pacific molluscs

Sun, Shao’e; Li, Qi; Kong, Lingfeng; Yu, Hong; Zheng, Xiaodong; Yu, Ruihai; Dai, Lina; Sun, Yan; Chen, Jun; Liu, Jun; Ni, Lehai; Feng, Yanwei; Yu, Zhenzhen; Zou, Shanmei; Lin, Jiping

doi:10.1038/srep33367

Download PDF

Article
Open access
Published: 19 September 2016

DNA barcoding reveal patterns of species diversity among northwestern Pacific molluscs

Scientific Reports volume 6, Article number: 33367 (2016) Cite this article

8493 Accesses
25 Citations
1 Altmetric
Metrics details

Subjects

Abstract

This study represents the first comprehensive molecular assessment of northwestern Pacific molluscs. In total, 2801 DNA barcodes belonging to 569 species from China, Japan and Korea were analyzed. An overlap between intra- and interspecific genetic distances was present in 71 species. We tested the efficacy of this library by simulating a sequence-based specimen identification scenario using Best Match (BM), Best Close Match (BCM) and All Species Barcode (ASB) criteria with three threshold values. BM approach returned 89.15% true identifications (95.27% when excluding singletons). The highest success rate of congruent identifications was obtained with BCM at 0.053 threshold. The analysis of our barcode library together with public data resulted in 582 Barcode Index Numbers (BINs), 72.2% of which was found to be concordantly with morphology-based identifications. The discrepancies were divided in two groups: sequences from different species clustered in a single BIN and conspecific sequences divided in one more BINs. In Neighbour-Joining phenogram, 2,320 (83.0%) queries fromed 355 (62.4%) species-specific barcode clusters allowing their successful identification. 33 species showed paraphyletic and haplotype sharing. 62 cases are represented by deeply diverged lineages. This study suggest an increased species diversity in this region, highlighting taxonomic revision and conservation strategy for the cryptic complexes.

Largest known madtsoiid snake from warm Eocene period of India suggests intercontinental Gondwana dispersal

Article Open access 18 April 2024

Phylogenomics and the rise of the angiosperms

Article Open access 24 April 2024

Diversity-dependent speciation and extinction in hominins

Article Open access 17 April 2024

Introduction

DNA barcoding - sequencing a standard region of the mitochondrial cytochrome c oxidase 1 gene (COI) - has become a standardized and broadly used molecular approach for specimen identification and species discrimination^1,2. Specimen identification is based on the evidence that selected DNA sequences are more variable among species than within species³. It involves assigning taxonomic names to a query sequence using a DNA reference library of taxonomically preidentified vouchers. Given this premises, the reliability of DNA barcoding is largely determined by the quality of the reference barcode libraries to which the unknown specimen is compared⁴. Generating rapid and accurate identifications of specimen with DNA barcodes can help to resolve distorted views of biodiversity⁵. DNA barcoding therefore represents a powerful tool for biodiversity assessment (species discovery), quickly sorting collections into species-like units¹.

Many criticisms to DNA barcoding have been raised in the literature for the shortcomings of experimental design and analytical procedure^6,7. For example, problems mostly occur when phylogenetic methods (e.g. neighbour joining) are used as the only analytical method, and identification success rates are not quantified⁸. However, a quantification of monophyly still remains a useful description of the data, when it was used in conjunction with other methods⁶. Thus, further barcoding studies should push forward improvements in data analysis, making more use of alternative methods. As explained by Collins and Cruickshank (2012), the sequence-based specimen identification criteria, such as ‘best close match’ criteria, make DNA barcoding a powerful tool in terms of established classification. When evaluating DNA barcoding as a biodiversity assessment tool (species discovery), a method is required that can estimate the number of species in mixed-organism sample directly from the barcode sequence data, and independently from the prior taxonomic species assignments preassigned taxonomic names (i.e. the data set used to subsequently measure consistency between the two approaches). The Barcode Index Numbers (BINs) analysis tool computed by the Barcode of Life Data system (BOLD)⁹ are able to use genetic information to generate an approximate of the number of operational taxonomic units that closely correspond to species.

The region of the northwestern Pacific comprising the countries of China, Japan, North Korea, South Korea, and Russia is characterized by distinct tectonic and geographical features, producing more than 75 percent of the marginal basins found on the Earth today¹⁰. The richest diversity of many marine taxa was found in these waters because of the complicated geological history and dramatic variations in local climates^11,12,13. Therefore, biodiversity research and conservation efforts in this area are necessary. Marine molluscs are the most diverse phylum of marine life¹⁴. However, in recent years, increasingly violent and vigorous impacts of global climate change, coastal environment deterioration and anthropogenic activities have resulted in marked decline of biodiversity, and the number of endangered marine molluscs species have been distinctly increased. Moreover, the marine molluscs present a significant challenge for morphological approaches to specimen identification because they exhibit differences in life stage, frequently have morphologically cryptic taxa, and substantial phenotypic plasticity^15,16, which hampered the conservation and management of the richest diversity of this taxa. In this sense, reliable specimen identification and biodiversity monitoring of organism in the field is quite necessary.

Many studies have validated the efficacy of DNA barcoding in specimen identification and species discovery for molluscs. Zou et al. (2011) demonstrates the effectiveness of the character-based barcoding method for specimen identification in Neogastropoda¹⁷. Aside from enabling identifications for whole specimens, barcode analysis opens up new possibilities - it can provide identifications during any stage of development. Puillandre et al. (2009b) clearly demonstrated the ability of barcodes to identify gastropod larvae, although barcode data are sparse and taxonomic coverage is biased toward shallow water species¹⁸. Teske et al. (2007) reported that the sympatric intertidal limpets (Siphonariidae) off coastal southeast Africa lacked barcode differences, suggesting they are morphotypes of a single species¹⁹. Two clams of the genus Donax showed no significant barcode variation and were found to represent one species²⁰. Barcodes have also revealed lack of genetic differentiation among some species of molluscs, given that not all morphological differences are the result of cladogenesis⁵. Several prior studies have established the value of DNA barcoding in resolving morphologically cryptic species complexes in several molluscan families^22,23. Despite the demonstrated utility of DNA barcoding in marine molluscs, these works focus either on restricted geographic areas and/or on a relatively restricted number of closely related species. No study has aimed to assemble a comprehensive barcode library for the entire Mollusca phylum of a large geographic area.

In this study, we establish a comprehensive barcode reference library for the marine molluscs of the northwestern Pacific (China, Japan and Korea), to test the efficacy of our DNA library for specimen identifications and shed new light on the northwestern Pacific molluscs diversity by employing different analytical approaches.

Results

Surveys of three countries (Fig. 1) assemblages yielded a total of 2,801 sequences for the northwestern Pacific molluscs, belonging to 91 families, 240 genera, and 569 species. The taxonomy, accession numbers and the site of collection are available at Supplementary Table S1. For most species, multiple specimens (mean = 4.9 specimens per species) were analyzed to document intraspecific variability. 182 species were represented by a single specimen, and 1 species (Cellana nigrolineata) was represented by 62 specimens. The average nucleotide frequencies for all 573 species are as follows: A = 22.97%, T = 39.41%, G = 20.96% and C = 16.66%. Mean GC content averaged 37.62% (SE = 0.06), but showed considerable variation (range 29.94–52.02%). A chi-square test of homogeneity demonstrated significant variation in nucleotide frequencies among species in each of five molluscan classes (P < 0.001). Mean nearest neighbour distances between congeneric species showed a significant (P < 0.001; R² = 0.167) positive correlation with mean GC content (Supplementary Fig. 1).

Distance summary

We observed a hierarchical increase in mean divergence according to different taxonomic levels, within species (mean = 0.97%, SE = 0.023), within congeners (mean = 18.67%, SE = 0.004), within families (mean = 22.47%, SE = 0.003), within orders (mean = 25.3%, SE = 0.002) and within classes (mean = 30.60%, SE = 0.012) (Table 1). Therefore, there was ca 19.25× more variation among congeneric species than among conspecific individuals. A regression analysis revealed that the mean interspecific divergence appeared to increase with the number of species analyzed from a genus, but the regression was not significant (Fig. 2A; P = 0.049; R² = 0.138). And the intraspecific divergence did not significantly differ with the number of individuals analyzed per species (Fig. 2B; P = 0.27; P = 0.56).

Table 1 COI genetic divergences according to different taxonomic levels within the the northwestern Pacific molluscs.

Full size table

Barcode gap analysis

We counted how often the maximum sequence divergence among individuals of a species exceeded the minimum sequence divergence from another congeneric species. These situations, which may confound barcode-based taxonomic assignments, were encountered in 70 species (12.30%) (Fig. 3, Supplementary Table S2). In these species, the maximum intraspecific variation overlaps with the NN (nearest neighbour) distance, leading to the absence of a barcode gap and in 36 case, NN distances were zero. 91 species show low distance to the NN (<=2%), but still exceeded the maximum intraspecific value.

Success of sequence-based specimen identification techniques

In the simulations, the BM approach returned 89.15% of true and 10.92% of false identifications (Table 2). When singletons were removed, false identifications decreased to 4.73%. Details of simulation results are available as Supplementary Table S3. With a threshold of 0.01, the BCM analysis provided 68.62% of true and 1.14% of false identifications. For 14.28% of the queries, the result is ambiguous (more than one equally close matches were found below the threshold of 0.01). 15.96% of the queries had no conspecific matches below the threshold of 0.01, and almost half of these (40.72%) were singletons with no conspecific sequence available. The threshold optimization method (‘threshVal’ function in SPIDER) reported a threshold between 0.0135 and 0.0260 (Supplementary Fig. 2). The average value of 0.02 was selected as the optimized threshold for the analyses. Under this threshold, the BCM approach provided 74.94% of true, 1.75% of false identifications, and the ambiguous queries were 15.42%. The remaining 7.89% queries were unidentified. The ‘localMinima’ function in SPIDER returned the threshold of 0.053 as possible transition between intra- and interspecific distances (Supplementary Fig. 3). With this threshold, the BCM approach provided 76.29% of true, 2.46% of false and 15.49% of ambiguous identifications, while 5.75% had no identification. When singletons were excluded, the false and unidentified queries decreased under each threshold. The ASB analysis returned the same results as BCM at the threshold value 0.01. While the BCM approach returned a slightly higher success rate than that of ASB approache from threshold value 0.021 and 0.053.

Table 2 Identification success based on Best Match (BM), Best Close Match (BCM) and All Species Barcodes (ASB).

Full size table

BIN discordance report and the nearest neighbour analysis

The BIN analysis included 2591 of the 2801 records and generated 582 different BINs. A number of 387 BIN clusters was found to be taxonomically concordant with other barcode data on BOLD assigned to the same species name (Supplementary Table S4). Five records was indicated as singleton, which means that this BIN only refers to one specimen (Supplementary Table S5). BIN discordance analysis returned 190 BINs as discordant respect to our prior taxonomic assignments (Supplementary Table S6). The external (incl. BOLD data) incongruence occurred at different taxonomic levels: the highest rank of conflict was found at one phylum level, followed by five at order, as well as nine family level. At the genus level, 62 BINs were found to be discordant, which means that specimens belonging to different genera of the same family were grouped together in one BIN. Finally, 113 BINs incorporate specimens of at least two congeneric species. Within BNPM data, 72.2% of BINs was found to be concordantly with morphology-based identifications. The discrepancies include two groups: (i) 45 discordant BINs caused by haplotype sharing and low between-species divergence (Table 3), and (ii) 68 species clusters were assigned to two or more BINs (Table 4).

Table 3 Cases of BIN sharing involving 42 pairs and three triplets of species of northwestern Pacific molluscs.

Full size table

Table 4 68 cases in which high intraspecific divergence led to the assignment of conspecific individuals to two or more BINs.

Full size table

The nearest neighbour (NN) of each BIN according to the data available in BOLD is available as Supplementary Table S3. The NN comparison evidenced the under-representation of mollusc species on BOLD and the need for taxonomic reassessment of some species: 65% of BINs generated by our entries had a congeneric NN, 20% had a NN from the same family or a higher taxonomic rank, and 14% had a NN represented by an unidentified specimen.

Neighbour-joining analysis

The neighbour joining (NJ) tree profile showed that sequence records for 2,320 (83.0%) queries representing 355 (62.4%) of all species formed distinct barcode clusters allowing their successful identification. 299 sequences involve 31 cases of paraphyly or shared barcodes between closely related species pairs, making their misidentification. Due to the lack of conspecific sequences in the data set, 31.9% of species are ambiguous and remain unidentified (Supplementary Fig. 4). Therefore, a large proportion of sequences (83.0%) and species (62.4%) were unambiguously distinguishable using the criterion of barcode clusters.

Thirteen of these 31 problematic cases involved species that formed paraphyletic clusters (Supplementary Fig. 4, groups highlighted in yellow; e.g., Patelloida pygmaea; Fig. 4A). For P. pygmaea, some of the taxa exhibiting deep intraspecific divergence values were recovered as paraphyletic in phylogenetic trees; nevertheless, the haplotype networks of the paraphyletic species demonstrated that no shared haplotype was found between each pair of the species (e.g., Patelloida spp.; Fig. 4B).

Members of six species pairs and two species trioes showed cases of barcode sharing, producing a mixed-species cluster in the NJ tree (Supplementary Fig. 4, framed clusters; e.g., Meretrix spp; Fig. 5A). For Meretrix spp., the sharing of COI haplotypes was found in the haplotype networks of the closely related species (Fig. 5B). Overall, all these eighteen species with undifferentiated barcodes formed only fifteen clusters in phylogenetic trees.

Deeply divergent intraspecific clusters were found within 62 of the 569 analyzed species (10.9%), indicating the occurrence of cryptic diversity (Table 5, Supplementary Fig. 4, groups highlighted in magenta). Those divergent intraspecific clusters, which correspond to divergent evolutionary lineages, were restricted to 32 of the 91 analyzed families (Table 5). The number of lineages by species varied from 2 to 4, for a total of 137 divergent lineages among 62 named species, which suggests a 13% increase in species diversity. Deeply divergent intraspecific lineages (>2%) were always (19/62) found in different geographical locations (e.g Echinolittorina vidua and Serratina capsoides Fig. 6A–D). Notably, the inflated geographical coverage changed the clustering pattern of conspecific individuals. In our data set, 3 species (Patelloida pygmaea, Thais luteostoma and Conus sanguinolentus) moved from monophyletic to paraphyletic after inclusion of additional populations. Consequently, we concentrated the study on how does the inclusion of geographically separated populations influence DNA barcoding. As expected, expansion of geographical coverage significantly increased intraspecific variation. The mean value of maximum intraspecific genetic distance increased eight-fold: from x ± S.E. = 1.02 ± 0.06% (when one population species was considered) to x ± S.E. = 8.77 ± 0.17% (when individuals from distinct populations were included).

Table 5 Summary of the northwestern Pacific molluscs taxa analyzed.

Full size table

Discussion

The study represents the first comprehensive DNA barcode database for marine molluscs from the northwestern Pacific, including the collection and analysis of 569 species. It demonstrated the ability of DNA barcoding to identify species and shed a new light on their species diversity. The mean level of intraspecific divergence of 0.97% observed in northwestern Pacific molluscs was approximately two times higher than any other marine groups thoroughly surveyed with DNA barcodes, including the following: Australian marine fishes (0.39%)²⁴, Australian decapods (0.46%)²⁵, Australian echinoderms (0.62%)²⁴, Canadian polychaetes (0.38%)²⁶. Such a high level of intraspecific divergence may be explained by the limited dispersal capabilities of molluscs, which promote lineage divergence and enhanced speciation rates²⁷.

No barcode sharing was detected among individuals of different species and a barcode gap was present for all but 70 cases. The NJ analysis demonstrated monophyletic clustering of haplotypes for 39 of these species. In the remaining 31 species, the distance to the NN was substantial (5.50–14.71%), but the level of the maximum intraspecific divergence was even higher (5.79–20.31%), producing the overlaps. The distance-based approach assumes that a species can be correctly identified when the mean distance to the most closely related species (nearest neighbor) is higher than the maximum intraspecific distance²⁸. However, growing evidence suggests that the overlap between mean intra- and interspecific genetic distances is considerably greater with larger proportions of closely related taxa^29,30. And, the extent of the barcoding gap tends to be overestimated when mean intraspecific distances are used, while smallest intraspecific distances yield more consistent results³¹. Hence, although the identification success generally declined when the overlap between intra- and interspecific distances increased, the lack of a barcoding gap is not necessarily influencing specimens identification^32,33,34. Based on this hypothesis, in our data, the extent of the barcoding gap was not considered as a necessary predictor for the identification success.

In the BM approach, 2,497 nonsingleton queries had a conspecific sequence as closest match. Best match would perform much better if it was applied to a data set from which single-sequence species have been removed. When expand this approach to the entire BOLD database, 177 of the 182 singletons were clustered in BINs with conspecific or congeneric sequences from other projects, suggesting the 182 singletons were reduced to 5 once in the BOLD database. Fifteen BINs had a nearest neighbour from the same family or a higher taxonomic group, revealing the lack of barcode data for several molluscs⁴.

In our simulations, BCM approach returned a slightly higher success rate than that of ASB approach at the threshold value 0.021 and 0.053 used for identification. The ASB criterion is more restrictive than BCM. The BCM criterion looks only at the closest match below a defined threshold, while ASB assigns a match according to all sequences under that threshold. Thus, when sequences from different species have distances values falling below the threshold, ASB criterion returned a misidentification⁴. For this data set, BCM and ASB approaches don’t outperform tree-based specimen identification. For example, the success rate of tree-based specimen identification reached 83%, whereas BCM and ASB approaches yield lower success rate (68.62–76.29%) and has a relatively high incidence of ambiguous (14.28–19.53%). This high proportion of ambiguous identification could be due to the increasing geographic scale of sampling, the chances of encountering closely related species increase, while interspecific divergence decreases significantly³⁵. With a higher chance of sequences from closely related species to fall under the threshold, more ambiguous identifications appeared.

The taxonomic reliability of DNA barcodes can be evaluated by analysing new data together with already published sequences. A taxonomic species assignment is more likely to be correct if congruent results were produced by several taxonomists. In implementing our BNPM data in the BOLD database, the majority of BINs (66.5%) was found to be taxonomically concordant with other barcode data on BOLD. For these cases, specimens analysed by at least two BOLD users were assigned the same species name and BIN. 33.4% of the BINs were found to be discordant. The highest rank of conflict was found at Phylum level at Crassostrea gigas (BIN:AAB2297). This discordance is probably to be caused by a typo or data-base error, as confusions between Mollusca and Arthropoda seem to be implausible. The conflict of seven species at order level and fourteen species at family level is unlikely to be caused misidentifications, as long as the data refers to adult specimens. The congruence problems at genus and particularly at species level can casused by misidentifications, because congeneric species usually be difficult to distinguish. Most of the discordances between our data and that already incorporated in the BIN pipeline were caused by the use of synonymies, inadequate taxonomy and misidentifications. This result highlights the need for an accurate taxonomic review of already published DNA barcode data, which will be one of the most relevant issues to increase the reliability of international barcode reference libraries like BOLD³⁶.

In this study, the NJ phenogram derived from the complete barcode data set, resulted in thirteen paraphyly species. According to our current data, all haplotypes are species-specific, so that specimens could be attributed to the correct taxon. We emphasize that cases of paraphyly may not prevent the identification of species as they share no haplotypes. Cases of paraphyly in Central Asian butterflies were also treated as identification successes because the species involved were never found to share haplotypes³⁷. Taking these cases into account, the identification success rate of DNA barcoding for northwestern Pacific molluscs specimens rises to 87.1%. However, considering only several cases involved, more sampling is required to verify the robustness of this conclusion. Simultaneously, these cases also highlight the importance of comprehensive sampling (across different populations and geographic regions) without which these species in our dataset may have appeared as reciprocally monophyletic, leading to misinterpretations of DNA barcoding performance. The BIN analysis failed to detect the concordance between identifications and genetic clustering for these paraphyly species. Seven of the species divided in more than one BIN and six cases share BINs with a nearest neighbour.

In the NJ analysis, the cases of low genetic divergence or haplotype sharing involved 18 species. In all of these species, the congeners shared the same BIN as well. In general, interspecific haplotype sharing has four possible explanations: hybridization, incomplete lineage sorting, inadequate taxonomy or misidentification^38,39. Detailed analysis of such cases can provide a better understanding of the evolutionary history of the species involved. First, the identification of marine mollusks is often difficult due to the phenotypic plasticity and environment effects. They may exhibit morphological variations in different life stage, and some species have the shell reduced or (rarely) lost⁴⁰. Most Cephalopoda species are composed of soft tissues, the measurement of which is difficult to standardize among researchers, and their growth patterns are highly responsive to environmental variables⁴¹. Thus, it may result in a lack of consensus regarding their taxonomy and lead to misidentification, producing an apparent case of haplotype sharing. Second, the pattern may also be attributed to hybridization or incomplete lineage sorting. The taxa share mtDNA haplotypes because of hybridization or incomplete lineage sorting of ancestral polymorphisms have been reported in Caenogastropoda, Mollusca, such as the sibling species of rough periwinkles, Littorina arcana and L. saxatilis⁴². However, this investigation is really sparse, and little is known regarding the other cases and further studies are needed to interpret the pattern. Thus, it seems inadequate to explain the cases of haplotype sharing encountered by us with hybridization pattern. In order to disentangle the relationships among the closely related species, more detailed studies (e.g., more detailed morphological analyses and population-level analyses with larger sample size) should be employed to these species in the future⁴³.

Detecting cryptic and potentially new species from molecular biodiversity inventories is for many classical biologists the most appealing application of DNA barcoding³⁶. Large genetic distances within traditionally recognized species accompanied by morphological, geographical and other subtle differences, have revealed cryptic species in most types of organism and habitat, from deep-sea clams to freshwater fish, and from tropical butterflies to arctic plants^{44,45,46,47,48}. The proportion of species with deeply diverged lineages (>2%) among northwestern Pacific molluscs is relatively high (ca. 10.9%), revealed a significant amount of previously unrecognized cryptic diversity. This may unsurprising, given that molluscs represent a taxonomically weak-studied group of organisms. For the 569 species analyzed, our survey flagged 137 candidate species represented by 62 named species, which suggests a 13% increase in species diversity. Perhaps, this high cryptic diversity within northwestern Pacific molluscs is unsurprising, considering the fact that molluscs are the most diverse phylum of marine life, with more than 50,000 described species, coupled with a high degree of phenotypic plasticity and a shortage of taxonomists⁴⁹. Furthermore, the marine habitats might be breeding grounds of cryptic speciation because they are the most species-rich habitats on Earth⁵⁰ and because many of those organisms are involved in specialized interspecific interactions⁴⁸. The highest proportion of cryptic diversity was found among family Cavoliniidae, Cultellidae, Haminoeidae, Pharidae, Pinnidae [an increase of 100% (1 of 1)], followed by family Batillariidae and Solenidae [an increase of 50% (1 of 2)] (Table 2). Nonetheless, 85% of all cryptic diversity occurs in the two most diversified classes, Bivalvia and Gastropoda. It appears that, just like for other components of biodiversity, the distribution of cryptic diversity among marine molluscs is not uniform, prompting several questions about possible taxonomic biases in the estimates of diversity. For example, do large, varied groups such as Bivalvia and Gastropoda hide unknown numbers of new species? However, sixteen of the 62 cryptic complexes are still represented by a single BIN each. Analysing the BNPM data set, 68 species were assigned to two or more BINs, because of the relatively high intraspecific divergences. It is worth noting that these species always has a conspecific sequence as their nearest neighbour, reflecting congruence between the simulations of sequence-based identifications scenario and independent clustering on BOLD. The presence of multiple BINs caused by divergent mitochondrial lineages for a single taxonomically identified species also gives some evidence for the existence of putative cryptic species⁵¹.

High genetic variability within a species can result from phylogeographic processes or geographically incomplete sampling^4,52. In our data, 19 species are likely to exhibit notable intraspecific diversification among lineages from different geographical regions, and in 21 cases, more than one BINs were observed among different geographically population. The historical separation of the sea basins was reported to have dramatically influenced the current genetic distribution of various marine species^53,54,55,56. This is particularly important when dealing with northwestern Pacific species, whose genetic structure was influenced by Pleistocene climatic fluctuations. During Pleistocene glaciations, three marginal seas (South China Sea, East China Sea and Japan Sea) of northwestern Pacific separated from each other owning to the declined sea level^57,58. These three marginal seas had served as separate refugia and dramatically promoted the diversification of various marine species^53,54,55,57. This geographically correlated population differentiation demonstrates that individuals from some taxa can be identified not only according to species but linked to a particular watershed⁵.

Perhaps, the genetically dissimilar taxa investigated in present study represent new species. Our calibration highlights a careful taxonomic revisionary work for these taxa, as well as the reproductive biology and ecology of the taxa involved. Because it is possible that some of the newly identified species is always accompanied by slight morphological changes that have simply been ignored, and the true number of biological species is likely to be greater than the current tally of nominal species^5,55. Therefore, the current northwestern Pacific molluscs taxonomy at the species level conceals the species diversity in some groups. A good estimate of cryptic species diversity have important implications for conservation and natural resource protection and management⁵³. Molecular evidence has revealed that species already considered endangered or threatened might be composed of cryptic species complexes that are even more rare than previously supposed^59,60. This taxonomic shift renders one already threatened species into one more evolutionary lineages, each of which is substantially more endangered than was previously considered⁶¹. Moreover, species are lost at an alarming rate and looking for reproductive isolation is time-consuming, and once lost, an evolutionary lineage can never be recovered⁶². Thus, these results indicated that our DNA-based distinct evolutionary lineages highlighted in this study should be considered prioritized conservation units that need to be taken into account in protection strategies.

Material and Methods

Sampling and collection data

COI sequence data used for this analysis came from two sources: (i) specimens were collected from the coast of China for the purposes of DNA barcoding, and (ii) public data from China, Japan and Korea in GenBank, downloaded using the Barcode of Life Database (BOLD, www.barcodinglife.org). 1156 specimens were collected from the coast of China during 2004–2014. These samples were stored in 95% ethanol and deposited as voucher specimens in Fisheries College, Ocean University of China. The species-level identification was based on morphological characteristics according to the current literature and was conducted by taxonomists specialized in this fauna. Detailed specimen data (taxonomy, collection sites, and voucher catalogue numbers) are available via BOLD’s project ‘Barcoding of Molluscs along Coastal of China’ (BMCC). The 1645 sequences which were taken from the BOLD database are available in the BOLD project ‘Barcoding of Molluscs along Coastal of China, Japan and Korea (BMCCJK). All records used for this study were tagged with the unique identifier ‘Barcoding of Northwestern Pacific Molluscs’ (BNPM). Both the map of World and the map of northwestern Pacific with Greater China, Japan, and Korea were rendered with Ocean Data View (ODV) software, version 4.7.3 (available at http://odv.awi.de)⁶³.

Molecular Data Collection

The muscle tissue of each specimen was removed and used for DNA extraction following a CTAB method that has been modified from⁶⁴ and a modification of standard phenol-chloroform procedure that has been described by⁶⁵. A partial region of mitochondrial COI gene was amplified using universal primers (LCO1490 5′-GGT CAA CAA ATC ATA AAG ATA TTG G-3′ and HCO2198 5′-TAA ACT TCA GGG TGA CCA AAA AAT CA-3′) designed by⁶⁶. For the species that were not successfully amplified by the universal COI primers, the other primers (COXAF 5′-CWA ATC AYA AAG ATA TTG GAA C-3′ and COXAR 5′-AAT ATA WAC TTC WGG GTG ACC-3′) designed by Colgan et al. (2001) were used⁶⁷. PCR was carried out in a 50-μl reaction volume containing 2 U Taq DNA polymerase (Takara), about 100 ng of template DNA, 1 μM of forward and reverse primers, 200 μM of each dNTP, 1× PCR buffer and 2 mM MgCl₂. The PCR reaction was carried out under the following conditions: 94 °C for 3 min, 35 cycles of 94 °C for 30 s, 48–52 °C for 1 min and 72 °C for 1 min, with a final extension period of 7 min at 72 °C. The amplified DNA was fractionated by electrophoresis in 1.5% low-melting-temperature agarose gels. PCR products were purified with EZ Spin Column DNA Gel Extraction Kit (Sangon BioTechnologies) following the manufacturer’ s protocol. The purified products were used as the template DNA for cycle sequencing reactions performed using BigDye Terminator Cycle Sequencing Kit (Applied Biosystems), and sequencing was conducted on an ABI PRISM 3730 (Applied Biosystems) automatic sequencer. Both DNA strands were sequenced to ensure accuracy.

DNA Barcoding Analyses

Sequences were viewed and manually edited conducting with SEQMAN software (DNA-Star 7.2.1). Sequence alignment was performed using the BOLD Management and Analysis System⁹ and Clustal X software⁶⁸. Overall data were compared using the ‘Distance Summary’ and ‘Barcode Gap Analysis’ tools on BOLD. Maximum intraspecific divergence was plotted against nearest neighbour distance to determine how often nearest neighbour distances were greater than intraspecific divergences, indicating the presence of a barcode gap. In addition, the ‘Sequence Composition’ tool on BOLD was used to examine variation in GC content among species. The Picante and VEGAN packages in Revolution R were used to perform linear regressions to determine if the number of individuals sampled within a species impacted estimates of intraspecific divergence and if the number of species sampled from a genus impacted the mean nearest neighbour distances^69,70. The boot and Hmisc packages in Revolution R were used to test whether mean nearest neighbour distance was correlated with mean GC content⁷¹.

Genetic distances were calculated with the BOLD Management and Analysis System, employing the Kimura-2-Parameter (K2P) distance metric⁷². We analysed the quality of our data set by simulating a sequence sequence-based specimen identification scenario using R (www.r-project.org) with the libraries APE⁷³ and SPIDER⁷⁴, see also refs 4, 6 and 75. Every sequence was used as a query against the entire data set of identified sequences, and a species name was assigned based on three criteria: Best Match (BM), Best Close Match (BCM) and All Species Barcode (ASB). In BM, each query sequence was found according to its closest barcode match regardless of its distance. In BCM, the query sequence was identified by the closest barcode match with a distance below a defined threshold. In ASB, we assembled for each query a list of all barcodes sorted by similarity to the query using the same threshold as for best close match. The query sequence was identified when all matches below the threshold were conspecific.

In BM, if both sequences were from the same species, the results were “true”, whereas mismatched names were counted as “false”. Several equally good best matches from different species were considered ambiguous. In BCM and ASB, all queries without barcode match below the threshold value remained unidentified. The query was considered ‘ambiguous’ when several equally good best matches were found that belonged to a minimum of two species below the threshold (in BCM) or sequences from multiple species were found below the threshold (in ASB). Queries were labelled as ‘true’ or ‘false’ according to the respective congruence or incongruence between query identifications and prior taxonomic assignments.

Three different thresholds were used in BCM and ASB criteria. The first threshold was set to 0.01, which is the standard used by BOLD’s ID engine⁹. The second threshold was generated by the function ‘threshVal’ in SPIDER which minimizes the cumulative identification failure incorporating false-positive error (no conspecific matches within threshold but conspecific samples available) and false-negative error (more than one species recorded within threshold). The third threshold was obtained from the minimum of a density plot of genetic distances, which represents the transition between intra- and interspecifc distances, which was calculated by the function ‘localMinima’ in SPIDER.

We also compared the results of our simulations with the analysis tools provided by BOLD. In particular, we analysed the Barcode Index Numbers (BINs) assigned to our sequences according to the sequence-based clustering method implemented in BOLD⁹ and the nearest neighbour to each BIN.We used BIN assignments to (i) verify a priory species identification, (ii) to identify cases of haplotype sharing between species or low levels of interspecific distances, (iii) and to get hints on cryptic diversity (species with more than one BIN). The ‘BIN Discordance Report’ analysis tool was applied to analyse our data set together with public sequences on BOLD. BINs were identified as taxonomically discordant, if species clusters shared a BIN, or those were assigned to two or more BINs. The concordant BINs mean taht the sequences provided by at least two BOLD users were assigned the same species name and BIN.

The neighbor-joining tree⁷⁶ of the whole data set was performed on the BOLD database. The number of divergent lineages within recognized species was calculated as the number of haplotypes, or clusters of haplotypes, with a mean divergence of over 2% from any other haplotypes or clusters of haplotypes.

Further phylogenetic analysis was performed on some species that represented, respectively, examples of paraphyletic clusters, cryptic diversity and distinct recognized species that potentially represent single evolutionary lineages. For those groups, we performed neighbor-joining analyses based on the K2P model using MEGA v. 5⁷⁷. Branch support was estimated by bootstrapping with 1,000 replicates. In our study, the haplotype networks of the closely related species were constructed using the default 95% connection limit in the TCS software⁷⁸.

Additional Information

How to cite this article: Sun, S. et al. DNA barcoding reveal patterns of species diversity among northwestern Pacific molluscs. Sci. Rep. 6, 33367; doi: 10.1038/srep33367 (2016).

References

Schindel, D. E. & Miller, S. E. DNA barcoding a useful tool for taxonomists. Nature 435, 17 (2005).
ADS CAS PubMed Google Scholar
Hebert, P. D. N., Cywinska, A. & Ball, S. L. Biological identifications through DNA barcodes. Proc. R. Soc. Lond. B. 270, 313–21 (2003).
CAS Google Scholar
Avise, J. C. Phylogeography: The History and Formation of Species. Harvard University Press, Cambridge, Massachusetts (2000).
Barco, A., Raupach, M. J., Laakmann, S., Neumann, H. & Knebelsberger, T. Identification of North Sea molluscs with DNA barcoding. Mol. Ecol. Resour. 16, 288–297 (2016).
CAS PubMed Google Scholar
April, J., Mayden, R. L., Hanner, R. H. & Bernatchez, L. Genetic calibration of species diversity among North America’s freshwater fishes. Proc. Natl. Acad. Sci. USA. 108, 10602–10607 (2011).
ADS CAS PubMed PubMed Central Google Scholar
Collins, R. A. & Cruickshank, R. H. The seven deadly sins of DNA barcoding. Mol. Ecol. Resour. 13, 969–975 (2012).
PubMed Google Scholar
Rubinoff, D., Cameron, S. & Will, K. A genomic perspective on the shortcomings of mitochondrial DNA for ‘barcoding’ identification. J. Hered. 97, 581–594 (2006).
CAS PubMed Google Scholar
Little, D. P. & Stevenson, D. W. A comparison of algorithms for the identification of specimens using DNA barcodes: examples from gymnosperms. Cladistics 23, 1–21(2007).
PubMed Google Scholar
Ratnasingham, S. & Hebert, P. D. N. BOLD: The Barcode of Life Data System (www.barcodinglife.org). Mol. Ecol. Notes. 7, 355–364 (2007).
CAS PubMed PubMed Central Google Scholar
Tamaki, K. & Honza, E. Global tectonics and formation of marginal basins-role of the western Pacific. Episodes, 14, 224–230 (1991).
Google Scholar
Briggs, J. C. The marine of East Indian Ocean: diversity and speciation. J. Biogeogr. 32, 1517–1522 (2005).
Google Scholar
Allen, G. R. Conservation hotspots of biodiversity and endemism for Indo-Pacific coral reef fishes. Aquat. Cons. 17, 1–6 (2007).
Google Scholar
Jensen, K. R. Biogeography of the Sacoglossa (Mollusca, Opisthobranchia). Bonn. Zool. Beitr. 55, 255–281 (2006).
Google Scholar
Appeltans, W. et al. The magnitude of global marine species diversity. Curr. Biol. 22, 2189–2202 (2012).
CAS PubMed Google Scholar
Drent, J., Luttikhuizen, P. C. & Piersma, T. Morphological dynamics in the foraging apparatus of a deposit feeding marine bivalve: phenotypic plasticity and heritable effects. Func. Ecol. 18, 349–356 (2004).
Google Scholar
Marko, P. B. & Moran, A. L. Out of sight, out of mind: high cryptic diversity obscures the identities and histories of geminate species in the marine bivalve subgenus Acar. J. Biogeogr. 36, 1861–1880 (2009).
Google Scholar
Zou, S. M., Li, Q., Kong, L. F., Yu, H. & Zheng, X. D. Comparing the usefulness of distance, monophyly and character-based DNA barcoding methods in species identification: a case study of Neogastropoda. PLoS one 6, e26619 (2011).
ADS CAS PubMed PubMed Central Google Scholar
Puillandre, N. et al. Identifying gastropod spawn from DNA barcodes: possible but not yet practicable. Mol. Ecol. Resour. 9, 1311–1321 (2009b).
CAS PubMed Google Scholar
Teske, P. R., Barker, N. P. & McQuaid, C. D. Lack of genetic differentiation among four sympatric southeast african intertidal limpets (Siphonariidae): phenotypic plasticity in a single species? J. Mollus. Stud. 73, 223–228 (2007).
Google Scholar
Carstensen, D., Laudien, J., Leese, F., Arntz, W. & Held, C. Genetic variability, shell and sperm morphology suggest that the surf clams Donax marincovichi and D. obesulus are one species. J. Mollus. Stud. 75, 381–390 (2009).
Google Scholar
Mikkelsen, N. T., Schander, C. & Willassen, E. Local scale DNA barcoding of bivalves (Mollusca): A case study. Zool. Scr. 36, 455–463 (2007).
Google Scholar
Johnson, S. B., Warén, A. & Vrijenhoek, R. C. DNA barcoding of Lepetodrilus limpets reveals cryptic species. J. Shell. Res. 27, 43–51 (2008).
Google Scholar
Zou, S., Li, Q. & Kong, L. F. Monophyly, Distance and Character–Based Multigene Barcoding Reveal Extraordinary Cryptic Diversity in Nassarius: A Complex and Dangerous Community. PLoS one 7, e47276 (2012)
ADS CAS PubMed PubMed Central Google Scholar
Ward, R. D., Holmes, B. H. & O’HARA, T. D. DNA barcoding discriminates echinoderm species. Mol. Ecol. Resour. 8, 1202–1211 (2008a).
CAS PubMed Google Scholar
Ward, R. D., Holmes, B. H., White, W. T. & Last, P. R. DNA barcoding Australasian chondrichthyans: results and potential uses in conservation. Mar. Freshwater. Res. 59, 57–71 (2008b).
CAS Google Scholar
Carr, C. M., Hardy, S. M., Brown, T. M., Macdonald, T. A. & Hebert, P. D. N. A tri-oceanic perspective: DNA barcoding reveals geographic structure and cryptic diversity in Canadian polychaetes. PLoS one 6, e22232 (2010).
ADS Google Scholar
Kappes, H. & Haase, P. Slow, but steady: dispersal of freshwater molluscs. Aquat. Sci. 74, 1–14 (2012).
Google Scholar
Aliabadian, M., Kaboli, M., Nijman, V. & Vences, M. Molecular identification of birds: performance of distance-based DNA barcoding in three genes to delimit parapatric species. PLoS ONE, 4, e4119 (2009).
ADS PubMed PubMed Central Google Scholar
Funk, D. J. & Omland, K. E. Species-level paraphyly and polyphyly: frequency, causes, and consequences, with insights from animal mitochondrial DNA. Annu. Rev. Ecol. Evol. S. 34, 397–423 (2003).
Google Scholar
Moritz, C. & Cicero, C. DNA barcoding: promise and pitfalls. PLoS Biol. 2, e354 (2004).
PubMed PubMed Central Google Scholar
Meier, R., Zhang, G. & Ali, F. The use of mean instead of smallest interspecific distances exaggerates the size of the barcoding gap and leads to misidentification. Syst. Biol. 57, 809–813 (2008).
PubMed Google Scholar
Virgilio, M., Backeljau, T., Nevado, B. & De Meyer, M. Comparative performances of DNA barcoding across insect orders. BMC bioinformatics. 11, 1 (2010).
Google Scholar
Ross, H. A., Murugan, S. & Li, W. L. S. Testing the reliability of genetic methods of species identification via simulation. Syst. Biol. 57, 216–230 (2008).
PubMed Google Scholar
Lou, M. & Golding, G. B. Assigning sequences to species in the absence of large interspecific differences. Mol. Phylogenet. Evol. 56, 187–194 (2010).
CAS PubMed Google Scholar
Bergsten, J. et al. The effect of geographical scale of sampling on DNA barcoding. Syst. Biol. 61, 851–869 (2012).
PubMed PubMed Central Google Scholar
Knebelsberger, T., Dunz, A. R., Neumann, D. & Geiger, M. F. Molecular diversity of Germany’s freshwater fishes and lampreys assessed by DNA barcoding. Mol. Ecol. Resour. 15, 562–572 (2015).
CAS PubMed Google Scholar
Lukhtanov, V. A., Sourakov, A., Zakharov, E. V. & Hebert, P. D. N. DNA barcoding Central Asian butterflies: increasing geographical dimension does not significantly reduce the success of species identification. Mol. Ecol. Resour. 9, 1302–1310 (2009).
PubMed Google Scholar
Kerr, K. C. et al. Comprehensive DNA barcode coverage of North American birds. Mol. Ecol. notes. 7, 535–543 (2007).
CAS PubMed PubMed Central Google Scholar
Ward, R. D., Hanner, R. & Hebert, P. D. N. The campaign to DNA barcode all fishes, FISH-BOL. J. Fish. Biol. 74, 329–356 (2009).
CAS PubMed Google Scholar
Colgan, D. J., Ponder, W. F., Beacham, E. & Macaranas, J. Molecular phylogenetics of Caenogastropoda (Gastropoda: Mollusca). Mol. Phylogenet. Evol. 42, 717–737 (2007).
CAS PubMed Google Scholar
Shaw, P. W., Pierce, G. J. & Boyle, P. R. Subtle population structuring within a highly vagile marine invertebrate, the veined squid Loligo forbesi, demonstrated with microsatellite DNA markers. Mol. Ecol. 8, 407–417 (1999).
CAS Google Scholar
Mikhailova, N. A., Gracheva, Y. A., Backeljau, T. & Granovitch, A. I. A potential species-specific molecular marker suggests interspecific hybridization between sibling species Littorina arcana and L. saxatilis (Mollusca, Caenogastropoda) in natural populations. Genetica, 137, 333–340 (2009).
CAS PubMed Google Scholar
Chen, W., Ma, X., Shen, Y., Mao, Y. & He, S. The fish diversity in the upper reaches of the Salween River, Nujiang River, revealed by DNA barcoding. Sci. Rep. 5, 17437 (2015).
ADS CAS PubMed PubMed Central Google Scholar
Vrijenhoek, R. C., Schutz, S. J., Gustafson, R. G. & Lutz, R. A. Cryptic species of deep sea clams (Mollusca, Bivalvia, Vesicomyidae) from hydrothermal vent and cold water seep environments. Deep-Sea Res. Part I. 41, 1171–1189 (1994).
Google Scholar
Feulner, P. G. D., Kirschbaum, F., Schugardt, C., Ketmaier, V. & Tiedemann, R. Electrophysiological and molecular genetic evidence for sympatrically occuring cryptic species in African weakly electric fishes (Teleostei: Mormyridae: Campylomormyrus). Mol. Phylogenet. Evol. 39, 198–208 (2006).
CAS PubMed Google Scholar
Hebert, P. D., Penton, E. H., Burns, J. M., Janzen, D. H. & Hallwachs, W. Ten species in one: DNA barcoding reveals cryptic species in the neotropical skipper butterfly Astraptes fulgerator. Proc. Natl. Acad. Sci. USA 101, 14812–14817 (2004).
ADS CAS PubMed PubMed Central Google Scholar
Grundt, H. H., Kjølner, S., Borgen, L., Rieseberg, L. H. & Brochmann, C. High biological species diversity in the arctic flora. Proc. Natl. Acad. Sci. USA 103, 972–975 (2006).
ADS CAS PubMed PubMed Central Google Scholar
Bickford, D. et al. Cryptic species as a window on diversity and conservation. Trends. Ecol. Evol. 22, 148–155 (2007).
PubMed Google Scholar
Bouchet, P. The magnitude of marine biodiversity. In: The exploration of marine biodiversity: scientific and technological challenges. Duarte CM. Fundacion BBVA: Bilbao, Spain, 31–64 (2006).
Willig, M. R., Kaufman, D. M. & Stevens, R. D. Latitudinal gradients of biodiversity: pattern process, scale, and synthesis. Annu. Rev. Ecol. Evol. Syst. 34, 273–309 (2003).
Google Scholar
Barco, A., Houart, R., Bonomolo, G., Crocetta, F. & Oliverio, M. Molecular data reveal cryptic lineages within the northeastern Atlantic and Mediterranean small mussel drills of the Ocinebrina edwardsii complex (Mollusca: Gastropoda: Muricidae). Zool. J. Linn. Soc. 169, 389–407 (2013).
Google Scholar
DeSalle, R., Egan, M. G. & Siddall, M. The unholy trinity: taxonomy, species delimitation and DNA barcoding. Phil. Trans. R. Soc. B: Biological Sciences, 360, 1905–1916 (2005).
CAS PubMed PubMed Central Google Scholar
Liu, J. X., Gao, T. X., Wu, S. F. & Zhang, Y. P. Pleistocene isolation in the Northwestern Pacific marginal seas and limited dispersal in a marine fish, Chelon haematocheilus (Temminck & Schlegel, 1845). Mol. Ecol. 16, 275-288 (2007).
Xu, J., Chan, T. Y., Tsang, L. M. & Chu, K. H. Phylogeography of the mitten crab Eriocheir sensu stricto in East Asia: Pleistocene isolation, population expansion and secondary contact. Mol. Phylogenet. Evol. 52, 45–56 (2009).
PubMed Google Scholar
Shen, K. N., Jamandre, W. B., Hsu, C. C., Tzeng, W. N. & Durand, J. D. Plio-Pleistocene sea level and temperature fluctuations in the northwestern Pacific promoted speciation in the globally-distributed flathead mullet Mugil cephalus. BMC Evol. Biol. 11, 83 (2011).
CAS PubMed PubMed Central Google Scholar
Liu, J., Li, Q., Kong, L. & Zheng, X. Cryptic diversity in the pen shell Atrina pectinata (Bivalvia: Pinnidae): high divergence and hybridization revealed by molecular and morphological data. Mol. Ecol. 20, 4332–4345 (2011).
PubMed Google Scholar
Ni, G., Li, Q., Kong, L. F. & Zheng, X. D. Phylogeography of bivalve Cyclina sinensis: testing the historical glaciations and Changjiang River outflow hypotheses in northwestern Pacific. PLoS one 7, e49487 (2012).
ADS CAS PubMed PubMed Central Google Scholar
Wang, P. X. Response of western Pacific marginal seas to glacial cycles: paleoceanographic and sedimentological features. Mar. Geol. 156, 5–39 (1999).
ADS Google Scholar
Bowen, B. W., Nelson, W. S. & Avise, J. C. A molecular phylogeny for marine turtles: trait mapping, rate assessment, and conservation relevance. Proc. Natl. Acad. Sci. USA 90, 5574–5577 (1993).
ADS CAS PubMed PubMed Central Google Scholar
SchÖnrogge, K. et al. When rare species become endangered: cryptic speciation in myrmecophilous hoverflies. Biol. J. Linn. Soc. 75, 291–300 (2002).
Google Scholar
Kong, L. F. & Li, Q. Genetic evidence for the existence of cryptic species in an endangered clam Coelomactra antiquata. Mar. Biol. 156, 1507–1515 (2009).
Google Scholar
Moritz, C. Strategies to protect biological diversity and the evolutionary processes that sustain it. Syst. Biol. 51, 238–254 (2002).
PubMed Google Scholar
Schlitzer, R. Interactive analysis and visualization of geoscience data with Ocean Data View. Comp. Geosci. 28, 1211–1218 (2002).
Google Scholar
Winnepenninckx, B., Backeljau, T. R. D. W. & Dewachter, R. Extraction of high-molecular-weight DNA from molluscs. Trends. Genet. 9, 407 (1993).
CAS PubMed Google Scholar
Li, Q., Park, C. & Kijima, A. Isolation and characterization of microsatellite loci in the Pacific abalone, Haliotis discus hannai. J. Shellfish. Res. 21, 811–815 (2002).
Google Scholar
Vrijenhoek, R. DNA primers for amplification of mitochondrial cytochrome c oxidase subunit I from diverse metazoan invertebrates. Mol. Mar. Biol. Biotech. 3, 294–299 (1994).
Google Scholar
Colgan, D. J., Hutchings, P. A. & Brown, S. Phylogenetic relationships within the Terebellomorpha. J. Mar. Biol. Assoc. UK. 81, 765–773 (2001).
CAS Google Scholar
Thompson, J. D., Gibson, T. J., Plewniak, F., Jeanmougin, F. & Higgins, D. G., The ClustalX windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic. Acids. Res. 25, 4876–4882 (1997).
CAS PubMed PubMed Central Google Scholar
Dixon, P. VEGAN, a package of R functions for community ecology. J. Veg. Sci. 14, 927–930 (2003).
Google Scholar
Kembel, S. W. et al. Picante: R tools for integrating phylogenies and ecology. Bioinformatics 26, 1463–1464 (2010).
CAS PubMed Google Scholar
Harrell, F. E. Miscellaneous Hmisc: Harrell miscellaneous. R package version 3.9-3 (2012).
Kimura, M. A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. J. Mol. Evol. 16, 111–120 (1980).
ADS CAS PubMed Google Scholar
Paradis, E., Claude, J. & Strimmer, K. APE: analyses of phylogenetics and evolution in R language. Bioinformatics, 20, 289–290 (2004).
CAS PubMed Google Scholar
Brown, S. D. et al. Spider: an R package for the analysis of species identity and evolution, with particular reference to DNA barcoding. Mol. Ecol. Resour. 12, 562–565 (2012).
PubMed Google Scholar
Collins, R. A. & Cruickshank, R. H. Known knowns, known unknowns, unknown unknowns and unknown knowns in DNA barcoding: a comment on Dowton et al. Syst. Biol. syu060 (2014).
Saitou, N. & Nei, M. The neighbour-joining method: a new method for reconstructing evolutionary trees. Mol. Biol. Evol. 4, 406–425 (1987).
CAS PubMed Google Scholar
Tamura, K. et al. MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol. Biol. Evol. 28, 2731–2739 (2011).
CAS PubMed PubMed Central Google Scholar
Clement, M., Posada, D. & Crandall, K. A. TCS: a computer program to estimate gene genealogies. Mol. Ecol. 9, 1657–1659 (2000).
CAS PubMed Google Scholar

Download references

Acknowledgements

This study was supported by research grants from by research grants from Fundamental Research Funds for the Central Universities, National Natural Science Foundation of China (41276138), and National Marine Public Welfare Research Program (201305005).

Author information

Authors and Affiliations

Key Laboratory of Mariculture, Ministry of Education, Ocean University of China, Qingdao, 266003, China
Shao’e Sun, Qi Li, Lingfeng Kong, Hong Yu, Xiaodong Zheng, Ruihai Yu, Lina Dai, Yan Sun, Jun Chen, Jun Liu, Lehai Ni, Yanwei Feng, Zhenzhen Yu, Shanmei Zou & Jiping Lin

Authors

Shao’e Sun

View author publications

You can also search for this author in PubMed Google Scholar
Qi Li

View author publications

You can also search for this author in PubMed Google Scholar
Lingfeng Kong

View author publications

You can also search for this author in PubMed Google Scholar
Hong Yu

View author publications

You can also search for this author in PubMed Google Scholar
Xiaodong Zheng

View author publications

You can also search for this author in PubMed Google Scholar
Ruihai Yu

View author publications

You can also search for this author in PubMed Google Scholar
Lina Dai

View author publications

You can also search for this author in PubMed Google Scholar
Yan Sun

View author publications

You can also search for this author in PubMed Google Scholar
Jun Chen

View author publications

You can also search for this author in PubMed Google Scholar
Jun Liu

View author publications

You can also search for this author in PubMed Google Scholar
Lehai Ni

View author publications

You can also search for this author in PubMed Google Scholar
Yanwei Feng

View author publications

You can also search for this author in PubMed Google Scholar
Zhenzhen Yu

View author publications

You can also search for this author in PubMed Google Scholar
Shanmei Zou

View author publications

You can also search for this author in PubMed Google Scholar
Jiping Lin

View author publications

You can also search for this author in PubMed Google Scholar

Contributions

Q.L., L.K., H.Y. and S.S. conceived and designed the experiments, X.D., R.Y., L.D., Y.S., J.C., L.N., Y.F., Z.Y., S.Z. and J.L. collected the data and performed the experiments, Q.L. and S.S. analysed the data and wrote the paper and all authors were involved in the critical review of the manuscript.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Sun, S., Li, Q., Kong, L. et al. DNA barcoding reveal patterns of species diversity among northwestern Pacific molluscs. Sci Rep 6, 33367 (2016). https://doi.org/10.1038/srep33367

Download citation

Received: 04 April 2016
Accepted: 25 August 2016
Published: 19 September 2016
DOI: https://doi.org/10.1038/srep33367

This article is cited by

Molecular characterization and phylogenetic analysis of crabs (Crustacea: Decapoda: Brachyura) based on mitochondrial COI and 16S rRNA genes
- Anindita Barua
- Tamim Afrin
- Md. Sagir Ahmed
Conservation Genetics Resources (2021)
Wireworm (Coleoptera: Elateridae) genomic analysis reveals putative cryptic species, population structure, and adaptation to pest control
- Kimberly R. Andrews
- Alida Gerritsen
- Samuel S. Hunter
Communications Biology (2020)
A newly discovered radiation of endoparasitic gastropods and their coevolution with asteroid hosts in Antarctica
- Kara K. S. Layton
- Greg W. Rouse
- Nerida G. Wilson
BMC Evolutionary Biology (2019)
Comparison of three DNA marker regions for identification of food relevant crustaceans of the order Decapoda
- Gesche Spielmann
- Jana Diedrich
- Ingrid Huber
European Food Research and Technology (2019)
DNA barcode based delineation of freshwater fishes from northern Western Ghats of India, one of the world’s biodiversity hotspots
- Tejas S. Patil
- Rahul A. Jamdade
- Dipak V. Muley
Biodiversity and Conservation (2018)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Distance summary

Barcode gap analysis

Success of sequence-based specimen identification techniques

BIN discordance report and the nearest neighbour analysis

Neighbour-joining analysis

Discussion

Material and Methods

Sampling and collection data

Molecular Data Collection

DNA Barcoding Analyses

Additional Information

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Ethics declarations

Competing interests

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links