Research Article |
Corresponding author: Fang Wang ( wangf0110@163.com ) Academic editor: Takumasa Kondo
© 2023 Yun-Feng Hou, Jiu-Feng Wei, Tian-You Zhao, Cai-Feng Li, Fang Wang.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Hou Y-F, Wei J-F, Zhao T-Y, Li C-F, Wang F (2023) First complete mitochondrial genome of the tribe Coccini (Hemiptera, Coccomorpha, Coccidae) and its phylogenetic implications. ZooKeys 1180: 333-354. https://doi.org/10.3897/zookeys.1180.109116
|
Soft scale insects (Hemiptera, Coccidae) are important pests of various agricultural and horticultural crops and ornamental plants. They have negative impacts on agriculture and forestry. The tribe Coccini represents one of the most ancient evolutionary lineages of soft scale insects. However, no complete Coccini mitochondrial genome (mitogenome) is available in public databases. Here, we described the complete mitogenome of Coccus hesperidum L., 1758. The 15,566 bp mitogenome of C. hesperidum had a high A+T content (83.4%) and contained a typical set of 37 genes, with 13 protein-coding genes (PCGs), 22 transfer RNA genes (tRNAs) and two ribosomal RNA genes (rRNAs). Only seven tRNAs had the typical clover-leaf secondary structure and the remaining tRNAs lacked the DHU arm, TψC arm or both. Moreover, a comparative analysis of all reported scale insect mitogenomes from GenBank database was performed. The mitogenomes of scale insects showed high similarities in base composition and A+T content. Additionally, our phylogenetic analysis confirmed the monophyly of Coccomorpha and revealed that the archaeococcoids were the most basal lineage within Coccomorpha, while Ericerus pela and Didesmococcus koreanus, belonging to Coccidae, were often mixed with Aclerdidae, making Coccidae a paraphyletic group. These findings expand the mitogenome database of scale insects and provide new insights on mitogenome evolution for future studies across different insect groups.
Coccus hesperidum, comparative mitochondrial genomics, mitogenome, phylogenetic analysis
Scale insects (Coccomorpha) belong to the suborder Sternorrhyncha and include more than 8500 species worldwide (
Coccus is the oldest genus within Coccidae, proposed by Linnaeus in 1758 with Coccus hesperidum L. as its type species; it belongs to the tribe Coccini and subfamily Coccinae (
Mitochondria are organelles involved in energy metabolism in eukaryotic cells (
Mitogenomes of Coccus or even Coccini have not been reported to date. Thus, we sequenced and analysed the detailed features of the complete mitogenome of C. hesperidum. Then, we compared the mitogenome characteristics for all reported scale insects mitogenomes. In addition, we investigated the mitogenome phylogeny of Sternorrhyncha, to assess the phylogenetic position of C. hesperidum. These findings expand the mitogenome database of scale insects and provide a significant basis for future studies of the phylogeny and evolution of Hemiptera.
Coccus hesperidum was collected from Radermachera sinica (Bignoniaceae) on 19 May 2019, in Shijiazhuang (37°59'58"N, 114°30'59"E), Hebei Province, China. Then, the scale insects were stored in absolute ethanol at -80 °C. The samples were identified, based on morphological characteristics and molecular identification. For each specimen, total DNA was extracted from the body using a QIAamp DNA Mini Kit (Qiagen, Hilden, Germany) according to the extraction protocol. The concentration and quality of DNA were determined by a NanoDrop 2000 spectrophotometer (Thermo Fisher Scientific, Waltham, MA, USA) and 1% agarose gel electrophoresis and samples were then stored at -20 °C. A genomic DNA library was constructed with 1 µg of DNA that was fragmented into 300–500 bp fragments using the Covaris ME220 Focused Ultrasonicator (Thermo Fisher Scientific). After end-repair, “A” tailing, adapter ligation, purification and PCR amplification, the fragments were sequenced using the paired-end 150 sequencing method on the Illumina NovaSeq 6000 platform by Novogene Bioinformatics Technology Co., Ltd. (Tianjin, China).
Approximately 30 Gb of clean data were obtained after filtering the raw data by removing adapter sequences and low-quality reads (quality value < 20). The complete mitogenome was assembled by Novoplasty (
The secondary structures of transfer RNA genes (tRNAs) were predicted using the ViennaRNA module (
For phylogenetic analysis, the newly-obtained mitogenome data for Coccus hesperidum in this study and 51 other representative Sternorrhyncha species from the GenBank database were sampled (Suppl. material
The complete mitogenome sequence of Coccus hesperidum was assembled into a single contig of 15,566 bp in length, including 13 PCGs, 22 tRNAs and two rRNAs, amongst which 24 genes (9 PCGs and 15 tRNAs) were encoded on the forward strand (+), while the other four PCGs, seven tRNAs and two rRNAs were on the reverse strand (-) (Table
Gene | Location | Size (bp) | Strand | Start codon | Stop codon | Anticodon | Intergenic length |
---|---|---|---|---|---|---|---|
trnM | 753–818 | 66 | + | CAU | |||
trnW | 810–870 | 61 | + | UCA | -9 | ||
COX1 | 861–2387 | 1527 | + | ATA | TAA | – | -10 |
trnL2 | 2388–2449 | 62 | + | UAA | 0 | ||
COX2 | 2450–3113 | 664 | + | ATA | T | – | 0 |
trnK | 3114–3183 | 70 | + | UUU | 0 | ||
trnD | 3180–3244 | 65 | + | GUC | -4 | ||
ATP8 | 3238–3378 | 141 | + | ATT | TAA | – | -7 |
ATP6 | 3372–3989 | 618 | + | ATG | TAA | – | -7 |
COX3 | 3991–4752 | 762 | + | ATG | TAA | – | 1 |
trnG | 4752–4809 | 58 | + | UCC | -1 | ||
ND3 | 4807–5145 | 339 | + | ATA | TAA | – | -3 |
trnA | 5146–5203 | 58 | – | UGC | 0 | ||
trnR | 5202–5249 | 48 | + | UCG | -2 | ||
trnN | 5241–5294 | 54 | + | GUU | -9 | ||
trnS1 | 5296–5343 | 48 | + | UCU | 1 | ||
trnE | 5352–5405 | 54 | + | UUC | 8 | ||
trnF | 5396–5452 | 57 | – | GAA | -10 | ||
ND5 | 5451–7055 | 1605 | – | ATT | TAA | – | -2 |
trnH | 7056–7112 | 57 | – | GUG | 0 | ||
ND4 | 7114–8388 | 1275 | – | ATT | TAA | – | 1 |
ND4L | 8401–8655 | 255 | – | ATT | TAA | – | 12 |
ND6 | 8696–9172 | 477 | + | ATT | TAG | – | 40 |
trnP | 9179–9241 | 63 | + | UGG | 6 | ||
trnQ | 9238–9295 | 58 | – | UUG | -4 | ||
trnC | 9295–9349 | 55 | – | GCA | -1 | ||
trnI | 9359–9423 | 65 | + | GAU | -10 | ||
ND2 | 9424–10,362 | 939 | + | ATT | TAA | – | 0 |
trnY | 10,373–10,425 | 53 | + | GUA | 10 | ||
trnT | 10,440–10,493 | 54 | + | UGU | 14 | ||
CYTB | 10,501–11,568 | 1068 | + | ATA | TAA | – | 7 |
trnS2 | 11,570–11,622 | 53 | + | UGA | 1 | ||
ND1 | 11,647–12,552 | 906 | – | ATT | TAA | – | 24 |
trnL1 | 12,553–12,615 | 63 | – | UAG | 0 | ||
16S rRNA | 12,616–13,789 | 1174 | – | – | 0 | ||
trnV | 13,790–13,835 | 46 | – | UAC | 0 | ||
12S rRNA | 13,836–14,447 | 612 | – | – | 0 |
The total length of 13 PCGs was 10,575 bp, accounting for 67.9% of the complete mitogenome length of Coccus hesperidum. Mostly, PCGs reside on the forward strand, except ND1, ND4, ND4L and ND5, which were on the reverse strand. The AT/GC skew values for the 13 PCGs were -0.48 to 0.26 and -0.54 to 0.66, respectively (Table
Nucleotide compositions and AT/GC skews in mitochondrial genome of Coccus hesperidum.
Gene | Nucleotide frequency | A+T(%) | AT-skew | GC-skew | |||
---|---|---|---|---|---|---|---|
A(%) | T(%) | G(%) | C(%) | ||||
COX1 | 40.0 | 35.8 | 8.8 | 15.3 | 75.8 | 0.06 | -0.27 |
COX2 | 48.9 | 29.7 | 7.8 | 13.6 | 78.6 | 0.24 | -0.27 |
COX3 | 45.0 | 37.0 | 5.4 | 12.6 | 82 | 0.10 | -0.40 |
ATP8 | 55.3 | 36.2 | 2.8 | 5.7 | 91.5 | 0.21 | -0.34 |
ATP6 | 49.8 | 34.0 | 3.7 | 12.5 | 83.8 | 0.19 | -0.54 |
ND1 | 25.3 | 56.6 | 12.1 | 6.0 | 81.9 | -0.38 | 0.34 |
ND2 | 50.7 | 34.0 | 5.0 | 10.3 | 84.7 | 0.20 | -0.35 |
ND3 | 54.3 | 32.2 | 3.8 | 9.7 | 86.5 | 0.26 | -0.44 |
ND4 | 22.2 | 62.4 | 10.2 | 5.2 | 84.6 | -0.48 | 0.32 |
ND4L | 23.9 | 64.3 | 9.8 | 2.0 | 88.2 | -0.46 | 0.66 |
ND5 | 23.5 | 60.2 | 10.0 | 6.2 | 83.7 | -0.44 | 0.23 |
ND6 | 55.3 | 32.3 | 3.1 | 9.2 | 87.6 | 0.26 | -0.50 |
CYTB | 44.1 | 34.6 | 7.4 | 13.9 | 78.7 | 0.12 | -0.31 |
22 tRNAs | 46.7 | 41.1 | 6.2 | 6.0 | 87.8 | 0.06 | 0.02 |
rrnL | 38.3 | 49.2 | 8.3 | 4.1 | 87.56 | -0.12 | 0.34 |
rrnS | 36.4 | 46.60 | 10.9 | 6.0 | 83.01 | -0.12 | 0.29 |
2 rRNAs | 37.7 | 48.3 | 9.2 | 4.8 | 86.0 | -0.14 | 0.31 |
Total | 50.9 | 32.5 | 5.8 | 10.9 | 83.4 | 0.22 | -0.31 |
AA | Codon | Count | RSCU | AA | Codon | Count | RSCU | AA | Codon | Count | RSCU | AA | Codon | Count | RSCU |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Leu | TTA | 247 | 3.696 | Ile | ATT | 324 | 1.653 | His | CAC | 14 | 0.718 | Lys | AAG | 29 | 0.267 |
Ser | TCA | 116 | 3.114 | Trp | TGA | 47 | 1.649 | Leu | TTG | 47 | 0.703 | Met | ATG | 52 | 0.202 |
Thr | ACA | 84 | 2.489 | Glu | GAA | 48 | 1.548 | Asn | AAC | 96 | 0.671 | Pro | CCG | 3 | 0.188 |
Val | GTT | 91 | 2.233 | Ala | GCT | 10 | 1.509 | Ala | GCC | 4 | 0.604 | * | TAG | 1 | 0.167 |
Pro | CCA | 34 | 2.125 | Val | GTA | 60 | 1.472 | Gly | GGG | 10 | 0.541 | Ser | AGG | 6 | 0.161 |
Gly | GGA | 38 | 2.054 | Tyr | TAT | 130 | 1.469 | Cys | TGC | 4 | 0.533 | Val | GTC | 6 | 0.147 |
Ser | TCT | 73 | 1.96 | Cys | TGT | 11 | 1.467 | Tyr | TAC | 47 | 0.531 | Val | GTG | 6 | 0.147 |
Gln | CAA | 30 | 1.935 | Ser | AGA | 53 | 1.423 | Glu | GAG | 14 | 0.452 | Gly | GGC | 2 | 0.108 |
Arg | CGA | 11 | 1.872 | Asn | AAT | 190 | 1.329 | Thr | ACC | 13 | 0.385 | Leu | CTC | 7 | 0.105 |
* | TAA | 11 | 1.833 | Pro | CCT | 21 | 1.312 | Pro | CCC | 6 | 0.375 | Thr | ACG | 3 | 0.089 |
Ala | GCA | 12 | 1.811 | Gly | GGT | 24 | 1.297 | Trp | TGG | 10 | 0.351 | Arg | CGC | 0.5 | 0.085 |
Met | ATA | 463 | 1.798 | His | CAT | 25 | 1.282 | Ser | TCC | 13 | 0.349 | Ser | TCG | 3 | 0.081 |
Lys | AAA | 188 | 1.733 | Thr | ACT | 35 | 1.037 | Ile | ATC | 68 | 0.347 | Ala | GCG | 0.5 | 0.075 |
Phe | TTT | 413 | 1.721 | Ser | AGT | 32 | 0.859 | Arg | CGG | 2 | 0.34 | Gln | CAG | 1 | 0.065 |
Arg | CGT | 10 | 1.702 | Leu | CTT | 49 | 0.733 | Asp | GAC | 9 | 0.31 | Ser | AGC | 2 | 0.054 |
Asp | GAT | 49 | 1.69 | Leu | CTA | 48 | 0.718 | Phe | TTC | 67 | 0.279 | Leu | CTG | 3 | 0.045 |
The most frequently used codons were ATA, TTT and ATT (Suppl. material
In total, 22 tRNA genes with lengths of 46 to 70 bp were detected in the mitogenome of Coccus hesperidum (Table
The two rRNA genes rrnL and rrnS were both on the reverse strand in the mitogenome of C. hesperidum. The rrnL gene was 1,174 bp with 87.56% A and T nucleotides and was located between trnL1 and trnV. The rrnS gene was 612 bp with an 83.01% A+T content and was located between trnV and trnM (Table
Mitogenomes of all reported scale insects were compared. All mitogenomes ranged in size from 12,395 bp (Drosicha corpulenta, Monophlebidae) to 17,405 bp (Albotachardina sinensis, Kerriidae), except for D. corpulenta with incomplete annotation information (only base composition and AT-skew were analysed). With respect to the base composition of the whole mitogenomes, the A+T content ranged from 81.0% (Nipponaclerda biwakoensis, Aclerdidae) to 91.1% (Matsucoccus matsumurae, Matsucoccidae), AT-skew ranged from 0.015 (Phenacoccus manihoti, Pseudococcidae) to 0.412 (Didesmococcus koreanus, Coccidae) and GC-skew ranged from -0.573 (Antecerococcus theydoni, Cerococcidae) to -0.258 (D. corpulenta), indicating that A and T were used more frequently than G and C (Table
Family | Species | Mitochondrial genome | PCGs | ||||||
---|---|---|---|---|---|---|---|---|---|
Length | A+T(%) | AT-skew | GC-skew | Length | A+T(%) | AT-skew | GC-skew | ||
Coccidae | Coccus hesperidum | 15566 bp | 83.4 | 0.221 | -0.305 | 10575 bp | 82.1 | -0.08 | -0.11 |
Coccidae | Didesmococcus koreanus | 15143 bp | 82.5 | 0.412 | -0.363 | 10599 bp | 81.9 | -0.08 | -0.10 |
Coccidae | Saissetia coffeae | 15389 bp | 84.7 | 0.190 | -0.372 | 10632 bp | 84.1 | -0.07 | -0.11 |
Coccidae | Ceroplastes floridensis | 15086 bp | 85.1 | 0.231 | -0.365 | 10647 bp | 84.5 | -0.06 | -0.09 |
Coccidae | Ceroplastes japonicus | 14979 bp | 85.2 | 0.232 | -0.369 | 9306 bp | 83.7 | 0.01 | -0.15 |
Coccidae | Parasaissetia nigra | 15632 bp | 85.9 | 0.211 | -0.338 | 10644 bp | 85.6 | -0.05 | -0.07 |
Coccidae | Ceroplastes rubens | 15387 bp | 87.5 | 0.241 | -0.312 | 10656 bp | 86.6 | -0.05 | -0.07 |
Coccidae | Ericerus pela | 16349 bp | 88.4 | 0.128 | -0.316 | 10659 bp | 87.8 | -0.06 | -0.07 |
Aclerdidae | Aclerda takahashii | 16599 bp | 84.5 | 0.124 | -0.444 | 10608 bp | 83.7 | -0.10 | -0.10 |
Aclerdidae | Nipponaclerda biwakoensis | 16675 bp | 81.0 | 0.128 | -0.394 | 10641 bp | 81.0 | -0.09 | -0.07 |
Pseudococcidae | Phenacoccus manihoti | 14965 bp | 89.3 | 0.015 | -0.327 | 10614 bp | 88.8 | -0.15 | -0.11 |
Matsucoccidae | Matsucoccus matsumurae | 15360 bp | 91.1 | 0.125 | -0.305 | 10623 bp | 90.4 | -0.12 | 0.01 |
Cerococcidae | Antecerococcus theydoni | 15552 bp | 83.6 | 0.217 | -0.573 | 10584 bp | 81.8 | -0.07 | -0.17 |
Eriococcidae | Apiomorpha munita | 15644 bp | 89.4 | 0.031 | -0.434 | 10602 bp | 88.1 | -0.08 | -0.15 |
Eriococcidae | Acanthococcus coriaceus | 16295 bp | 89.4 | 0.087 | -0.415 | 10500 bp | 88.0 | -0.10 | -0.12 |
Kerriidae | Albotachardina sinensis | 17405 bp | 90.0 | 0.177 | -0.417 | 10566 bp | 89.1 | -0.10 | -0.16 |
Monophlebidae | Drosicha corpulenta* | 12395 bp | 87.2 | 0.056 | -0.258 |
Within the scale insect mitogenomes, the total length of 13 PCGs ranged from 9,306 bp (Ceroplastes japonicus, Coccidae) to 10,659 bp (Ericerus pela, Coccidae), the A+T content and AT/GC skew of each PCG are shown in Table
The locations of two rRNAs were identical in the majority of scale insect mitogenomes, where rrnS was flanked by trnV and trnM and rrnL was flanked by trnL1 and trnV. In other scale insects, the two rRNAs were in different locations; for example, in M. matsumurae, rrnS was between trnV and trnI, in Albotachardina sinensis, rrnS was located between trnV and trnP, in Antecerococcus theydoni and Acanthococcus coriaceus, rrnL-rrnS was between trnL1 and trnQ and, in Apiomorpha munita, rrnL-rrnS was between trnV and trnY.
For each taxon, the PCGAA dataset included 3,127 amino acids and the PCG123rRNA dataset contained 11,380 bp. We obtained four phylogenetic trees with highly concordant topologies, based on the above datasets under BI and ML (Figs
Within Coccomorpha, the monophyly of the four families, Pseudococcidae, Matsucoccidae, Cerococcidae and Kerriidae could not be verified because a single species was included for each family. The two families, Eriococcidae and Aclerdidae, represented by two species, formed monophyletic clades with high support. Matsucoccidae, which belonged to the archaeococcoids, was at the most basal position within Coccomorpha. All the remaining families belonging to neococcoids were clustered into a single clade and Pseudococcidae was the basal family of the neococcoids. The phylogenetic relationships of other families belonging to neococcoids were presented as (Eriococcidae + (Kerriidae + (Cerococcidae + (Aclerdidae + Coccidae)))). Thereinto, the two families Aclerdidae and Coccidae were clustered with each other in all phylogenetic trees. However, the species in the family Coccidae did not form a separate clade and were mixed with Aclerdidae species with low to high support values in all BI and ML trees, revealing a surprising/unexpected result that Coccidae appeared as a paraphyletic group and it is necessary to include mitogenomes from more species to confirm this result. Amongst soft scale insects, species in the subfamily Ceroplastinae, represented by the congeneric species Ceroplastes rubens, C. floridensis and C. japonicus, were correctly clustered into one branch with very high support and the species Coccus hesperidum, Saissetia coffeae and Parasaissetia nigra of the subfamily Coccinae formed a separate branch in most trees.
With the development of high-throughput sequencing technology, an increasing number of insect mitogenomes have been sequenced and reported, providing useful data for systematics and evolutionary studies (
The Ka/Ks ratio is a measure of the selection pressure acting on a gene, indicating neutral selection (Ka/Ks = 1), negative or purifying selection (Ka/Ks < 1) and positive or diversifying selection (Ka/Ks > 1) (
Codon usage analyses of the scale insects in our study showed that the most frequently used codons were ATA (Met), ATT (Ile), TTT (Phe) and TTA (Leu) and the least used codons varied amongst species. Furthermore, TAA or TAG was more frequently used as stop codons in most mitogenomes of 16 scale insects, while those in some species ended with a single T. With respect to the secondary structure of tRNAs in the mitogenomes of scale insects, some tRNAs had a typical clover-leaf secondary structure, while some lacked the DHU arm or T arm, forming a truncated secondary structure. A few tRNAs did not have the DHU arm or T arm. Combined with results of previous studies (
Concerning the phylogenetic relationships within Sternorrhyncha, the sister group relationship between Aphidomorpha and Coccomorpha was strongly supported in the present study, congruent with results of previous morphological and molecular studies (
The Coccomorpha is often divided into two informal groups, archaeococcoids and neococcoids. The adult females of the former group possess abdominal spiracles, considered an ancestral feature in scale insects and have been identified as the basal lineage within Coccomorpha (
The present study is the first to determine the complete mitogenome sequence of Coccus hesperidum (tribe Coccini) by next-generation sequencing methods. The C. hesperidum mitogenome was 15,566 bp long, had a high A+T content (83.4%) and contained a typical set of 37 genes, with 13 PCGs, 22 tRNAs and two rRNAs. Only seven tRNAs had the typical cloverleaf secondary structure and the remaining tRNAs lacked the DHU arm, TψC arm or both. The mitogenomes of all reported scale insects were similar in structure, base composition and A+T content. As determined by RSCU, there was obvious bias and different coccid species preferred to use different codons; the most frequently used codons were ATA (Met), ATT (Ile), TTT (Phe) and TTA (Leu). Our phylogenetic analysis confirmed the monophyly of Coccomorpha, demonstrated that the archaeococcoids occupied the most basal position within Coccomorpha and showed that Ericerus pela and Didesmococcus koreanus, belonging to Coccidae, were mixed with Aclerdidae, such that Coccidae may form a paraphyletic group. Collectively, this study enriches the mitogenome database of scale insects and provides the basis for future phylogenetic and evolutionary analyses of scale insects.
Thanks to MogoEdit for its linguistic assistance during the preparation of this manuscript. Thanks to the anonymous reviewers whose comments helped improved the manuscript.
The authors have declared that no competing interests exist.
No ethical statement was reported.
This study was supported by the National Natural Science Foundation of China (31802001) and funded by Science and Technology Project of Hebei Education Department (BJ2020052).
Conceptualization: FW, YFH. Data curation: TYZ, CFL. Funding acquisition: FW. Methodology: JFW, YFH. Project administration: FW. Software: TYZ, YFH. Supervision: FW. Validation: JFW. Visualization: JFW, YFH. Writing - original draft: JFW, YFH. Writing - review and editing: FW, CFL.
Yun-Feng Hou https://orcid.org/0009-0008-8244-7389
Tian-You Zhao https://orcid.org/0000-0003-1378-6893
Fang Wang https://orcid.org/0000-0002-5758-7434
All of the data that support the findings of this study are available in the main text or Supplementary Information.
First complete mitochondrial genome of the tribe Coccini and its phylogenetic implications
Data type: zip
Explanation note: table S1. List of species used in the phylogenetic analysis. table S2. The most frequently used codons of protein-coding genes (PCGs) in scale insect mitogenomes. figure S1. Inferred secondary structures of 22 transfer RNA genes (tRNAs) of Coccus hesperidum. tRNAs are labelled with abbreviations for the corresponding amino acids according to the IUPAC-IUB code. table S3. A+T content (%) in mitogenomes of scale insects. figure S2. The relative synonymous codon usage (RSCU) of protein-coding genes (PCGs) in mitogenomes of scale insects.