Review Article |
Corresponding author: Michael J. Raupach ( raupach@snsb.de ) Academic editor: Sammy De Grave
© 2015 Michael J. Raupach, Adriana E. Radulovici.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Raupach MJ, Radulovici AE (2015) Looking back on a decade of barcoding crustaceans. ZooKeys 539: 53-81. https://doi.org/10.3897/zookeys.539.6530
|
Species identification represents a pivotal component for large-scale biodiversity studies and conservation planning but represents a challenge for many taxa when using morphological traits only. Consequently, alternative identification methods based on molecular markers have been proposed. In this context, DNA barcoding has become a popular and accepted method for the identification of unknown animals across all life stages by comparison to a reference library. In this review we examine the progress of barcoding studies for the Crustacea using the Web of Science data base from 2003 to 2014. All references were classified in terms of taxonomy covered, subject area (identification/library, genetic variability, species descriptions, phylogenetics, methods, pseudogenes/numts), habitat, geographical area, authors, journals, citations, and the use of the Barcode of Life Data Systems (BOLD). Our analysis revealed a total number of 164 barcoding studies for crustaceans with a preference for malacostracan crustaceans, in particular Decapoda, and for building reference libraries in order to identify organisms. So far, BOLD did not establish itself as a popular informatics platform among carcinologists although it offers many advantages for standardized data storage, analyses and publication.
Barcode of Life Data Systems, Crustacea , cytochrome c oxidase subunit I, DNA barcoding, mitochondrial DNA, specimen identification
The accurate diagnosis of species represents a pivotal component for many topics, including large-scale biodiversity studies and conservation planning. Traditionally, species are identified using morphological characters. This approach requires a certain level of training in observing morphology and it usually leads to a narrow specialization in identifying organisms belonging to a restricted group of taxa (e.g. a carcinologist will likely have difficulties in identifying polychaetes and the other way around). Therefore, a routine and correct morphological identification of many taxa can be challenging, time-consuming and typically requires highly trained specialists. This is especially true for larval stages, juveniles and females which are often not included in species descriptions, resulting in a quite difficult task of assigning correct species names to specimens. In many cases morphological variability and phenotypic plasticity may also complicate a correct species determination. Furthermore, we observe a decline of taxonomists that are able to identify and characterize species of many taxa (e.g.
As consequence of the rise of molecular biology in the last decades, the application of DNA sequence data represents a promising and effective alternative approach to identify specimens throughout all life stages (
Not surprisingly, DNA barcoding has been criticized from its beginning. In various cases, DNA barcoding was considered as a useless and expensive identification method (e.g.
Nevertheless, DNA barcoding has been successfully applied in a large number of taxonomic groups belonging to both invertebrates (e.g.
Within the invertebrates, the Crustacea constitute a challenging taxon for DNA barcoding. With more than 67,000 described species so far (Ahyong et al. 2013), this taxon is species-rich, morphologically diverse and ecologically important. Various crustacean species are of high economic interest (e.g. lobsters, crabs, or shrimps) and represents the basis of extensive crustacean fisheries around the world. Crustaceans can be found in all aquatic environments, and some of them successfully colonized terrestrial habitats in various degrees (e.g. talitrid amphipods, terrestrial crabs, and woodlice). However, a correct identification to the species level is not straightforward for most crustacean taxa, especially for larval and immature stages. Even as adults, numerous species are difficult to identify using morphological characters and usually require the help of taxonomists to differentiate subtle degrees of morphological variability and polymorphism within and between species. This is especially true for small deep-sea crustaceans (e.g. isopods, amphipods and tanaids), and species of the meiofauna (e.g. harpacticoid copepods).
In this review we provide an update regarding the progress of DNA barcoding in crustaceans based on descriptive statistics. Major points of the review are: taxonomic coverage, subject areas, and the use of BOLD as a major platform for the standardization of barcoding studies.
This manuscript covers research articles published between 01-01-2003 and 31-12-2014 and available in the “Web of Science” (WoS) database maintained by Thomson Reuters (http://webofknowledge.com). WoS was searched on 15-01-2015 by using “barcod*” and “crusta*” as keywords in the topic of articles hosted by all databases associated with WoS. For comparison purposes, similar searches were conducted for other arthropod taxa on the same day: Insecta (“insect*”), Chelicerata (“chelicer*”) and Myriapoda (“myriapod*”) in combination with “barcod*”. All crustacean references were individually and carefully checked for inconsistencies, in particular false positive results (e.g. articles dealing with other taxa than crustaceans) and duplications. Only publications of the type “article” were kept for further analyses. Language was not selected as filter criterion, and non-English publications with a title and abstract in English were included. Following a strict terminology for DNA barcoding (sensu
In order to verify the popularity and use of the BOLD workbench among crustacean barcoders, each article was searched for referencing BOLD and given a label: ‘YES’ or ‘NO’. If a BOLD project was mentioned by code or title, subsequent steps were followed to find particular records in BOLD and import them into a dataset: 1) search by project code/title in BOLD Workbench, 2) copy all records from that project, and 3) add records to dataset. All public records stored in BOLD and generated by crustacean barcoding studies can be retrieved by searching DS-CRST (Title: Crustacean Barcoding Studies) in BOLD or by going directly to the corresponding DOI associated with this dataset (https://doi.org/10.5883/DS-CRST). By using a project code as search term, all records of that project were imported, regardless of its history (i.e., records added or removed from a project) between the publication date and January 2015). Some articles mentioned the use of BOLD without providing a project code. In such cases, we were able to find records by the process IDs mentioned in the publication or by searching BOLD based on taxa names. However, when tracking records was not a straightforward process, we excluded those studies from our BOLD-related analyses. DS-CRST in BOLD was used for standard barcoding analyses: number of species versus number of BINs, taxon ID tree and distance summary. Geographic coordinates, where available, were exported and used to create a map in QGIS (
Additional bibliographic data were compiled for all references: publication title, first authors’ names, journal name, publication year, open-access feature, and the number of citations (as provided by WoS). The major results of our literature review are summarized graphically; a table containing all raw data is available as Suppl. material
Our search in WoS produced 243 hits associated with the terms “barcod*” and “crusta*”, 1,064 references for “barcod*” and “insect*”, 67 for “barcod*” and “chelicer*” and eight for “barcod*” and “myriapod*” (Fig.
DNA barcoding studies of the Arthropoda. Total number and percentage values of articles published with “barcod*” and insect*” (green), “crusta*” (blue), “chelicer*” (orange), or “myriapod*” (violet) as keywords in their topic and listed in the Web of Science (period covered: 2003-2014; n = 1,382). For crustaceans, the total number of articles is split into: 1) the number of articles removed from our analysis (duplications and false positives) (pie sector in light blue) and 2) the core number of articles used in this review (pie sector in dark blue). Arthropod illustrations were modified from
Subject areas of DNA barcoding studies of the Crustacea. Number of articles with “barcod*” and “crusta*” as keywords in their topic as retrieved from the Web of Science (period covered: 2003–2014; n = 164) and divided into six subject areas (from bottom to top): identification and barcode library (red), genetic variability (orange), species description (green), phylogenetics (violet), methods (blue), and numts (grey).
The taxonomic coverage of the 164 barcoding publications showed a strong preference for the Decapoda (n = 60, 36.7%), followed by the mixed taxon of “Crustacea” (n = 28, 17%), the Amphipoda (n = 21, 12.8%), Copepoda (n = 18, 11%), and Diplostraca (n = 13, 8%) (Table
Number of publications of the Crustacea using DNA barcodes. “Barcod*” and “crusta*” were used as keywords in the Web of Science (2003–2014). For comparison, the most recent species count per taxon is given in a separate column (based on
Taxon | Publications | (%) | Number of described species | |
---|---|---|---|---|
Malacostraca | Decapoda | 60 | 36.7 | 14,895 |
Amphipoda | 21 | 12.8 | 9,896 | |
Isopoda | 6 | 3.7 | 10,661 | |
Stomatopoda | 3 | 1.8 | 460 | |
Bathynellacea | 2 | 1.2 | 241 | |
Euphausiacea | 1 | 0.6 | 87 | |
Tanaidacea | 1 | 0.6 | 1,069 | |
Maxillopoda | Copepoda | 18 | 11 | 15,976 |
Cirripedia | 5 | 3 | 1,306 | |
Branchiopoda | Diplostraca | 13 | 8 | 821 |
Anostraca | 5 | 3 | 313 | |
Ostracoda | 1 | 0.6 | 7,577 | |
“Crustacea“ | 28 | 17 | n. a. | |
Total | 164 | 100 |
Our investigation also revealed that most crustacean barcoding studies focus on the identification of specimens and the expansion of reference libraries for various taxa (n = 64, 39.1%) (Table
Subject area and taxonomic rank of DNA barcoding studies of the Crustacea. Number of articles were retrieved by using “barcod*” and “crusta*” as keywords in the topic of articles hosted by the Web of Science (period covered: 2003–2014).
Identification, library | Genetic variability | Species description | Phylogenetics | Methods | numts | |
---|---|---|---|---|---|---|
Decapoda | 26 | 11 | 15 | 5 | 1 | 2 |
Amphipoda | 4 | 15 | 1 | 1 | ||
Isopoda | 2 | 3 | 1 | |||
Stomatopoda | 3 | |||||
Bathynellacea | 1 | 1 | ||||
Euphausiacea | 1 | |||||
Tanaidacea | 1 | |||||
Copepoda | 4 | 5 | 6 | 3 | ||
Cirripedia | 2 | 2 | 1 | |||
Diplostraca | 2 | 8 | 3 | |||
Anostraca | 1 | 1 | 1 | 1 | 1 | |
Ostracoda | 1 | |||||
“Crustacea” | 19 | 1 | 6 | 2 | ||
Total | 64 | 44 | 32 | 11 | 9 | 4 |
Approximately two thirds of the barcoding studies focused on the marine environment (n = 99, 60.4%) and only one third dealt with freshwater systems (n = 49, 29.8%) (Fig.
Geographic and habitat focus of the analyzed DNA barcoding studies of the Crustacea. Studies were listed in the Web of Science (period covered: 2003–2014, n = 164), with the number of publications shown on the X axis. Green bars indicate freshwater studies, dark blue bars marine studies. A black bar represents studies that were performed on a global scale. For 11 studies, no classification was possible (grey bar). Note that publications can include taxa from more than one habitat or region.
The vast majority of publications (n = 129, 78.7%) did not mention BOLD in their text (label ‘NO’ in Suppl. material
Project console for DS-CRST in BOLD. Various statistics for the current status of specimens are displayed: record count, species count, taxonomy breakdown, specimen depositories, country of collection, sequence count, flagged records count, trace count, image count. Note that BOLD is a dynamic environment and updates will be reflected on the project console.
Example for a BIN page in BOLD. The amphipod Rhachotropis aculeata (Lepechin, 1780) is registered in the BIN registry as BOLD:AAB3310. Note that BOLD is a dynamic environment and updates will be reflected on the BIN page, including BIN changes.
We found 76 different journals publishing articles dealing with DNA barcoding and crustaceans. Most studies were published in Zootaxa (n = 23, 14%), followed by the Journal of Crustacean Biology and PLOS ONE (each with n = 9, 5.5%), Molecular Ecology Resources (n = 7, 4.3%), Crustaceana and Invertebrate Systematics (each with n = 6, 3.7%). A number of 50 journals (65.8%) had only one article dealing with crustacean barcoding. Only 33 articles (20.1%) were open access as they were published in open access journals (e.g. PLOS ONE, ZooKeys) or in subscription journals where authors chose to publish their work as open-access (Suppl. material
Most cited crustacean barcoding articles per subject area. Data obtained from Web of Science based on a query with ‘barcod*’ and ‘crusta*’ as keywords in the topic of articles published between 2003 and 2014. Citations are given as the total number of citations since publication and the average number of citations per year (in brackets).
Subject area | Title | Authors | Journal | Year | Citations |
---|---|---|---|---|---|
Identification, library | Biological identifications through DNA barcodes: the case of the Crustacea | Costa FO, deWaard JR, Boutillier J, Ratnasingham S, Dooh RT, Hajibabaei M, Hebert PDN | Canadian Journal of Fisheries and Aquatic Sciences | 2007 | 165 (18.3) |
Genetic variability | DNA barcoding reveals extraordinary cryptic diversity in an amphipod genus: implications for desert spring conservation | Witt JDS, Threloff DL, Hebert PDN | Molecular Ecology | 2006 | 172 (17.2) |
Species description | A revision of the Portunus pelagicus (Linnaeus, 1758) species complex (Crustacea: Brachyura: Portunidae), with the recognition of four species | Lai JC, Ng PKL, Davie PJF | Raffles Bulletin of Zoology | 2010 | 23 (3.8) |
Phylogenetics | Systematic and evolutionary insights derived from mtDNACOI barcode diversity in the Decapoda (Crustacea: Malacostraca) | Matzen da Silva J, Creer S, dos Santos A, Costa AC, Cunha MR, Costa FO, Carvalho GR | Public Library of Science ONE | 2011 | 21 (4.2) |
Methods | Relationship between morphological taxonomy and molecular divergence within Crustacea: proposal of a molecular threshold to help species delimitation | Lefébure T, Douady CJ, Gouy M, Gibert J | Molecular Phylogenetics and Evolution | 2006 | 185 (18.5) |
numts | Many species in one: DNA barcoding overestimates the number of species when nuclear mitochondrial pseudogenes are coamplified | Song H, Buhay JE, Whiting MF, Crandall KA | Proceedings of the National Academy of Sciences of the USA | 2008 | 292 (36.5) |
During the past few years, crustaceans have become a popular target for DNA barcoding among the Arthropoda, being outnumbered only by barcoding studies of the Insecta (Fig.
Although we used a highly popular database which indexes scientific literature, we are aware that an unknown number of references are missing from our study. This is mainly caused by two reasons: 1) the term “DNA barcoding” was not used in the publication although COI sequences were applied for species identification (e.g.
A rapid investigation of the taxonomic diversity covered in the 164 barcoding publications showed the highest frequency for Malacostraca (n = 94, 57.4%), the class with the largest number of crustacean species (
In contrast to the total number of publications, which revealed a steady increase followed by a relative plateau, the trend for the six subject areas (see methods) showed large fluctuations from year to year (Fig.
Species identification based on DNA barcodes relies on the existence of reference libraries which consist of COI sequences from specimens previously identified by experts based on traditional methods (i.e., morphological characters). Consequently, many barcoding studies published so far deal with the development of comprehensive barcode libraries (e.g.
The study of intraspecific genetic variation in relation to geography has become very popular in recent decades and resulted in the formation and expansion of a new research field, namely phylogeography (
Ideally, DNA barcoding and species discovery would be seen as intertwined. Whereas the main objective of DNA barcoding is to identify unknown specimens based on reference libraries, an additional outcome is reflected in the identification of unknown genetic clusters that might represent new species. As such, DNA barcodes represent powerful diagnostic supplementary characters that accelerate and revive traditional morphological taxonomy but do not replace it (
During the last years, COI sequences combined with other mitochondrial and nuclear markers have been frequently used to reconstruct the phylogeny of various taxa of the Crustacea (e.g. Blanco-Bercial et al. 2011,
Although DNA barcoding as a molecular method for species identification has been in use for more than a decade, techniques for generating, applying, and analyzing barcode data are still being improved to guarantee an efficient workflow (e.g.
The unwanted amplification of nuclear copies of mitochondrial DNA (numts) represents a problem not only for the analyses of DNA barcodes (COI sequences) but mitochondrial genes in general (Bensasson et al. 2010,
In March 2015, the Public Data Portal of BOLD was hosting more than 80,000 DNA barcodes representing about 5,700 crustacean species (plus a large amount of unidentified specimens) and 10,000 BINs. Only 8% (6,270 records; 860 species names) were directly associated with crustacean barcoding studies (35 publications, Suppl. material
A growing database such as BOLD, which follows specific high standards for data quality, will certainly be useful for large-scale analyses in crustacean phylogeography, biogeography and biodiversity assessment and will offer support for technological advances such as high-throughput sequencing.
Our review shows that DNA barcoding has gained popularity in carcinology and that the most popular group targeted for various related topics are the malacostracan crustaceans, in particular decapods. As the main goal of DNA barcoding is to assign unknown specimens to known species, most crustacean barcoding studies were found to build or use existing reference libraries for identification purposes and this trend will surely continue and probably increase in the future. The generation of comprehensive barcode libraries will represent a challenging but also an important task, especially for some species-rich habitats (e.g. the deep sea or coral reefs), where our general knowledge about crustacean diversity, in particular species numbers, is still poor. A second objective of DNA barcoding is to accelerate species discovery, particularly in cryptic, microscopic and other organisms with complex or inaccessible morphology. We believe that more progress will be made in this direction as well.
Crustacean taxonomy seems to be slowly incorporating DNA barcoding in the field as the top journal in this field is a taxonomic journal and the most prolific first authors have a taxonomic background. However, a larger acceptance and application is highly desirable, and therefore we encourage a stronger cooperation between “classical” taxonomists and the DNA barcoding community. Moreover, the term “DNA barcode” should only be used for COI-5P` sequences (
This review is the result of the symposium ”Molecular species identification and classification in crustaceans” held during the 8th International Crustacean Congress in Frankfurt, Germany (August 18–23, 2014). We thank the organizers of the conference, in particular Prof. Dr. Michael Türkay (03.04.1948 – 09.09.2015), for the opportunity and encouragement to organize this session.
A decade of DNA barcoding of crustaceans: input file
Data type: data table
Explanation note: Raw data related to 164 publications on crustacean barcoding as retrieved from Web of Science: bibliography, citations, habitat type, geographical area, BOLD use.