Research Article |
Corresponding author: Florian Leese ( florian.leese@uni-due.de ) Academic editor: Ana Previšić
© 2019 Sonja Darschnik, Florian Leese, Martina Weiss, Hannah Weigand.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Darschnik S, Leese F, Weiss M, Weigand H (2019) When barcoding fails: development of diagnostic nuclear markers for the sibling caddisfly species Sericostoma personatum (Spence in Kirby & Spence, 1826) and Sericostoma flavicorne Schneider, 1845. ZooKeys 872: 57-68. https://doi.org/10.3897/zookeys.872.34278
|
The larval stages of the central European sibling caddisfly species Sericostoma personatum (Spence in Kirby and Spence, 1826) and S. flavicorne Schneider, 1845 are morphologically similar and can only be distinguished by differences in coloration in late larval instars. Identification using the mitochondrial barcoding gene, i.e., the Cytochrome c Oxidase 1, is impossible, as both species share the same highly differentiated haplotypes due to introgression. Nuclear gene markers obtained through double digest restriction site associate sequencing (ddRAD seq), however, can reliably distinguish both species, yet the method is expensive as well as time-consuming and therefore not practicable for species determination. To facilitate accurate species identification without sequencing genome-wide markers, we developed nine diagnostic nuclear RFLP markers based on ddRAD seq data. The markers were successfully tested on geographically distinct populations of the two Sericostoma species in western Germany, on known hybrids, and on another sericostomatid caddisfly species, Oecismus monedula (Hagen, 1859) that sometimes shares the habitat and can be morphologically confounded with Sericostoma. We describe a simple and fast protocol for reliable species identification of S. personatum and S. flavicorne independent of the life cycle stage of the specimens.
Freshwater biodiversity, molecular species identification, RFLP, Sericostomatidae, Trichoptera
Macroinvertebrate species are important indicators for the ecological status and water quality of freshwater ecosystems. Correct taxa lists are the basis for bioassessments and hence reliable tools for taxonomic identification of species are essential. However, reliable morphological species identification can be difficult or impossible, especially for closely-related species or early larval stages, because diagnostic characters are lacking or not yet visible. To deal with this problem, DNA-based methods have been developed for species identification, i.e., DNA barcoding (
One taxon that is typically considered in freshwater ecosystem monitoring is the caddisfly family Sericostomatidae (
While the two species are not assessed individually in current biodiversity assessments of streams because of the difficulty to distinguish them, it is known that they differ in their ecological requirements (
For identification of diagnostic RFLP markers, the ddRAD seq data from
As a first step, loci present in at least 50% of all individuals were tested for fixed single nucleotide polymorphisms (SNPs), i.e., diagnostic markers, between the two Sericostoma species. The respective scaffolds were extracted from the draft genome. For 217 loci, the sequences were checked for the presence of a palindromic restriction enzyme recognition site at the fixed, diagnostic SNPs. Three restriction enzyme motifs occurring frequently in the analyzed subsets were EcoRV (GATATC), NdeI (CATATG), and PvuII (CAGCTG), respectively. We used in-house python scripts (available on request) to check for the presence of diagnostic sites in the remaining loci with fixed SNPs. Subsequently, primers for amplification of regions containing the selected SNPs were designed with Geneious 6.0.6 (Biomatters Ltd). To enable the later multiplexing of different markers, the primers were chosen to result in (i) different fragment lengths of the undigested fragments and (ii) different fragment lengths of the digested fragments for markers cut by the same restriction enzyme.
In a final step, the primer sequences were checked for the potential amplification of multiple fragments. Hence, they were mapped against the Sericostoma genome (GenBank accession NCQO00000000.1) using the blastn megablast algorithm (
To test the reliability and performance of the potential markers, 80 Sericostoma sp. specimens from 17 locations from two different regions in Germany were investigated. Additionally, markers were tested for four Oecismus monedula specimens. A detailed list of locations is given in Table
Overview of sampling sites, number of analyzed specimens per site, DNA extraction method used (SaPr = salt precipitation and/or CH = Chelex), and morphological identification level.
Site | Stream | Coordinates [N/E] | Geographical region | Year | Number of samples | Extraction method | Identification level |
Svb | Silvertbach | 51.644583 7.230139 | North Rhine-Westphalia | 2017 | 7 | SaPr | species |
D | Diemel | 51.420806 8.808667 | North Rhine-Westphalia | 2017 | 7 | SaPr | species |
V | Volme | 51.241611 7.531167 | North Rhine-Westphalia | 2017 | 2 | SaPr | species |
1OW | Tributary Lüderbach | 50.475897 9.295219 | Hesse | 2018 | 5 + 1 O. monedula | CH | genus |
2KS | Tributary Rammholzer Wasser | 50.331768 9.619321 | Hesse | 2018 | 5 + 3 O. monedula | all samples with CH, 2 O. monedula also with SaPr | genus |
3OE | Tributary Lohrbach | 50.116317 9.462612 | Hesse | 2018 | 7 | all samples with CH, 2 Sericostoma also with SaPr | genus |
2OS | Schmale Sinn | 50.33576 9.696705 | Hesse | 2018 | 6 | all samples with CH, 2 Sericostoma also with SaPr | genus |
Han | Hannebecke | 51.30635 8.41007 | North Rhine-Westphalia | 2014 | 1 | SaPr | species |
Val | Valme | 51.31553 8.40354 | North Rhine-Westphalia | 2014 | 2 | SaPr | species |
Nie | Nier | 51.31380 8.35892 | North Rhine-Westphalia | 2014 | 4 | SaPr | species |
Bra | Brabecke | 51.29762 8.40043 | North Rhine-Westphalia | 2014 | 3 | SaPr | species |
Nes | Nesselbach | 51.17494 8.41878 | North Rhine-Westphalia | 2014 | 3 | SaPr | species |
Roe | Röhr | 51.27247 8.05820 | North Rhine-Westphalia | 2014 | 7 + 1 hybrid | SaPr | species |
Sch | Schwarze Ahe | 51.20499 7.72184 | North Rhine-Westphalia | 2014 | 3 | SaPr | species |
Kru | Krummenau | 51.07383 7.71277 | North Rhine-Westphalia | 2014 | 9 + 2 hybrids | SaPr | species |
Sor | Sorpe | 51.19883 8.42851 | North Rhine-Westphalia | 2013 | 3 | SaPr | species |
Sil | Silberbach | 51.03180 8.05244 | North Rhine-Westphalia | 2014 | 3 | SaPr | species |
DNA was extracted using two different protocols: For samples collected in 2018, tissue was taken from legs and thoracic muscle and DNA extracted following a Chelex-based extraction protocol, by incubating tissue samples in 150 µl 10% (w/v) Chelex 100 (Bio-Rad) at 95 °C for 15 minutes in total, vortexing the samples every 5 minutes. For samples from 2013, 2014, and 2017, tissue was taken from the abdomen of specimens. To avoid contamination, gut content was removed before extraction. DNA was extracted according to a salt precipitation protocol (
For RFLP marker validation, a PCR amplification was carried out for all developed primer pairs in separate reactions for 39 samples. Primers are listed in Table
List of primers with corresponding restriction enzymes and expected fragment sizes for S. flavicorne (Sf) and S. personatum (Sp).
Enzyme | Primer name | Primer sequence (5‘-3‘) | Tm [°C] | Species with restriction site | Fragment size Sf [bp] | Fragment size Sp [bp] |
EcoRV | EcoRV1_fw | GTGCTTCTGTCCTGTTATTC | 54.0 | Sp | 397 | 229 |
EcoRV1_re | TTCAAACTTGCAAAAATGCC | 54.1 | 168 | |||
EcoRV2_fw | AAAGAGGCGATTAACTTTCG | 54.0 | Sp | 542 | 165 | |
EcoRV2_re | CACATTATGAACACCACACA | 53.8 | 377 | |||
EcoRV4_fw | AATCACTAAAACTGCCAACC | 54.1 | Sp | 739 | 217 | |
EcoRV4_re | CTTGTACCCGTTATCGAGAG | 55.1 | 522 | |||
EcoRV5_fw | GAGTTCTGATCCTGTTTGTG | 54.0 | Sp | 470 | 129 | |
EcoRV5_re | TGGCCTAGCTCAATAAATGA | 54.1 | 341 | |||
NdeI | NdeI1_fw | TCTTCTGGTTCTAGGGAAAA | 54.1 | Sp | 667 | 354 |
NdeI1_re | ACGAAGACTGAACTCTCAAT | 53.2 | 313 | |||
NdeI4_fw | TCAGCATGACAGGTGAATAT | 54.1 | Sf | 302 | 440 | |
NdeI4_re | ACAAAATGAGGCAAGTGAAT | 54.0 | 138 | |||
NdeI5_fw | TGTTTGATGGATTCCTCAGA | 54.0 | Sf | 379 | 547 | |
NdeI5_re | TGCCTCTCATCCTATTGATC | 54.0 | 168 | |||
NdeI8_fw | TTATTCGCGCCATACTTTAC | 53.9 | Sf | 455 | 761 | |
NdeI8_re | ATGGTCTTACCCGTTTAGAG | 54.2 | 306 | |||
PvuII | PvuII2_fw | GCATAACCGACAATGTGTAA | 54.0 | Sf | 564 | 876 |
PvuII2_re | CTAGCTCATTTCCTTTGTGG | 54.0 | 312 |
All PCR products were digested with the respective enzymes in 30 µl reaction volumes as follows: 10 µl of unpurified PCR product, 2 µl of Green Buffer (to directly load digested PCR products on agarose gels), 1 µl of either FastDigest Eco321 (isoschizomer or EcoRV), FastDigest NdeI or FastDigest PvuII (all Thermo Scientific), and 17 µl deionized water. Incubation was conducted at 37 °C for 5 min for Eco321 and PvuII, and for 60 min for NdeI. Digested PCR products were visualized on 2% TBE agarose gels and compared to a 100 to 1000 bp ladder to determine the size of the fragments. Species were identified by comparing the resulting patterns to the expected fragment lengths (Table
Amplification success differed slightly between candidate markers. Generally, the success rate was high for all tested single and multiplex reactions (Suppl. material
A schematic overview of the band patterns predicted after the primer design for both species is shown in Fig.
When the PCR amplification was successful, enzymatic restriction and species assignment based on the resulting fragment patterns was successful for all S. flavicorne/personatum individuals, i.e., all 38 individuals (excluding hybrids) previously identified to species level by ddRAD seq (
For the four tested O. monedula specimens, PCR was successful only for four primer pairs (EcoRV1, EcoRV2, EcoRV4 and PvuII2): For the PvuII2 primer pair a product of ~550 bp was visible when amplified from two of the samples additionally extracted by salt precipitation, while no product was visible when amplified from Chelex extraction (4 individuals). EcoRV1 yielded a fragment of ~900 bp in single marker assessment that was not visible in multiplex tests. In general, bands on the agarose gels were faint in single marker assessment and barely visible in multiplex approaches. EcoRV2 yielded the same pattern after restriction as expected for S. flavicorne while EcoRV4 created the same pattern as specimens of S. personatum.
The three tested hybrids amplified for all markers but created intermediate results after restriction (Suppl. material
In this study, we developed and tested nine RFLP markers to distinguish the two sibling caddisfly species S. flavicorne and S. personatum in Germany, which cannot be identified using COI barcoding. An advantage of the RFLP approach is that no sequencing is required and thus species assignment is possible directly after PCR and a short restriction incubation, followed by simple agarose gel visualization. Since all primer pairs amplify with the same PCR settings, all markers can be tested in simultaneous reactions, with the possibility of multiplexing two primer pairs each.
Different factors can impact on the identification success of the RFLP markers. First, good amplification success for the correct amplicons in the PCR is needed to enable subsequent restriction digestion. We found high success rates of ≥ 90% for all single reactions and ≥ 80% for all multiplex reactions, with the exception of NdeI8 (only 60%). For three individuals, an alternative, shorter fragment was amplified for EcoRV5, which might be caused by a local sequence variant. This phenomenon may increase in frequency when extending the geographic range (ascertainment bias). Second, the restriction enzymes need to digest the PCR fragments reliably. While this was the case for FastDigest Eco321 (Isoschizomer of EcoRV) and FastDigest PvuII, incomplete digestion was sometimes observed for FastDigest NdeI, even though incubation time was set to 60 min as advised by the manufacturer. Finally, the DNA extraction method should not influence the reliability of the results in order to provide a robust method for application in different laboratories. The two extraction methods tested here, i.e., the salt precipitation and the Chelex approach, did not systematically impact the amplification or restriction success. Still, DNA extracted with salt precipitation showed stronger bands on the agarose gels after PCR and after restriction than Chelex extracted samples, indicating a lower yield of PCR products for Chelex extractions. Despite the lower output, the fragments were still successfully amplified. Hence, for a quick and inexpensive S. personatum/flavicorne identification, Chelex extraction is well-suited, since it is simpler, cheaper and especially much faster than most other extraction methods.
Besides the different technical aspects, we also tested if the RFLP method works for the target species across a broader geographic scale than used in the study of
As the study of
In addition to S. flavicorne/personatum, we also tested our RFLP-markers with O. monedula samples. These species can co-occur in the streams and can be difficult to distinguish morphologically. In contrast to S. personatum/flavicorne, amplification success was low for O. monedula. No fragments were amplified for several markers, but with EcoRV2 the S. flavicorne and with EcoRV4 the S. personatum amplicon was generated. Furthermore, two markers (EcoRV1 and PvuII2) generated additional fragments for O. monedula, when the DNA was extracted via salt precipitation. These in general weak bands, were almost absent when using DNA from the Chelex extraction or when multiplexing markers. Hence, scoring only individuals successfully amplified for several markers with unambiguous species identification as well as excluding specimens with bands expected only for O. monedula, allows to clearly distinguish the two Sericostoma species from O. monedula. Individuals not fulfilling these criteria cannot be directly assigned to O. monedula, especially if only few markers amplify successfully. While the specific PCR products found for O. monedula may allow an unambiguous species identification of this species with our markers, our sample size of O. monedula is too low for any validation. We currently recommend the use of COI barcoding to clearly assign them to O. monedula as for this purpose DNA barcoding works reliably. It should also be noted that the proposed approach does not work for community-based DNA assessments (DNA metabarcoding) but only for individual specimen-based approaches and thus would inquire an additional analysis step.
In summary, the markers introduced in this study are an easy-to-use, cheap, and reliable alternative to CO1 barcoding for determining the problematic sister species S. personatum and S. flavicorne. They were applied with high amplification and restriction success rates per marker in single and multiplex approaches. The latter allows to halve material costs and reaction times (
It is important to note that the specimens tested herein only come from a small part of the total species range; therefore, the proven success of identification with the markers is limited to the regions tested. Supposedly, the method will give informative results in different areas as well, which remains to be established in the future.
We thank Ana Previšić, Simon Vitecek, and Jean-Luc Gattolliat for valuable remarks and suggestions that greatly improved the manuscript. Furthermore, we thank Cristina Hartmann-Fatu and Sandra Roeser (University of Duisburg-Essen) for laboratory assistance. FL and HW are members of and supported by COST Action CA15219 (DNAqua-Net).
Supplementary Table S1
Data type: molecular data
Explanation note: Overall amplification success (%) of primer pairs, multiplex, and single reaction approaches combined.
Supplementary figures
Data type: molecular data
Explanation note: Figure S1 – three hybrid samples (H1, H2, H3) after restriction for all markers. Figure S2 – comparison of six samples (two of each S. personatum, S. flavicorne and O. monedula) with assessed with PvuII2 after restriction, salt extraction protocol on the left and Chelex extraction on the right.