Research Article |
|
Corresponding author: Han Xu ( hanny_90@163.com ) Corresponding author: San-an Wu ( sananwu@bjfu.edu.cn ) Academic editor: Takumasa Kondo
© 2025 Yu-ang Li, Xinyi Zheng, Han Xu, San-an Wu.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Li Y-a, Zheng X, Xu H, Wu S-a (2025) First mitogenome of the family Putoidae (Hemiptera, Coccomorpha) and its phylogenetic implications. ZooKeys 1247: 1-18. https://doi.org/10.3897/zookeys.1247.144896
|
Scale insects are significant pests impacting agriculture, forestry, and ornamental plants. They play a dual role in ecosystems, serving as a food source for insects like bees, producing pigments and wax. Mitochondrial genomes have been widely utilized in phylogenetic studies. However, the mitogenomes of scale insects currently available in GenBank fail to adequately represent the majority of families. In this paper, the first complete mitogenome of Puto sinensis Zheng & Wu, 2025 is described, revealing previously unreported gene rearrangements in scale insects. It has a length of 18,830 bp and a high A+T content of 90.7%. Moreover, the phylogenetic analysis based on mitogenomic sequences shows that archaeococcoids are a paraphyletic group, with the family Putoidae being sister to all neococcoids – a finding consistent with results from nuclear gene and morphological data. This underscores the utility of mitochondrial genome data in reconstructing phylogenetic relationships within the infraorder Coccomorpha.
Gene rearrangements, mitogenome, molecular phylogeny, Putoidae, scale insects, taxonomy
Scale insects (Coccomorpha) belong to the suborder Sternorrhyncha within the Hemiptera, along with aphids (Aphidomorpha), whiteflies (Aleyrodomorpha), and psyllids (Psyllomorpha) (
Within Coccomorpha, Putoidae has faced contentious taxonomic placement since its establishment. Adult females of Puto exhibit morphological characteristics similar to those of adult females of Phenacoccus in Pseudococcidae. Therefore, Putoidae was often classified within Pseudococcidae and belongs to neococcoids (
The mitochondria are important functional organelles of eukaryotes (
However, whether the phylogenetic position of Putoidae inferred from mitochondrial genes is consistent with that based on nuclear genes or morphological characteristics remains to be further investigated. The present study sequenced and annotated the first complete mitogenome of the family Putoidae. The phylogenetic tree was then reconstructed based on the mitogenomic sequences of 18 other coccoid species. Furthermore, the study explored mitochondrial genome rearrangements within Coccomorpha, providing new insights into their phylogenetic relationships.
The sample of Puto sinensis for DNA extraction was collected on the trunk of Lindera communis (D. Don) Merr. (Lauraceae) from Guizhou Province, China, and was preserved in 95% ethanol under -20 °C at the
Beijing Forestry University, Beijing, P. R. China (
The phylogenetic analysis included 30 species, comprising 18 species from Coccomorpha, nine species from three other infraorders of Sternorrhyncha, and three species of the order Thysanoptera. Except for the data on Puto sinensis, all sequence data were obtained from the National Center for Biotechnology Information at https://www.ncbi.nlm.nih.gov (refer to Table
| Order | Infraorder | Family | Species |
|---|---|---|---|
| Thysanoptera | Aeolothripidae | Aeolothrips indicus Bhatti | |
| Franklinothrips vespiformis (Crawford) | |||
| Thripidae | Thrips hawaiiensis (Morgan) | ||
| Hemiptera | Aleyrodomorpha | Aleyrodidae | Aleyrodes shizuokensis Kuwana |
| Bemisia tabaci (Gennadius) | |||
| Psyllomorpha | Carsidaridae | Allocarsidara bakeri Hollis | |
| Paracarsidara gigantea (Crawford) | |||
| Psyllidae | Cyamophila willieti (Wu) | ||
| Aphalaridae | Rhinocola aceris (Linnaeus) | ||
| Aphidomorpha | Aphididae | Aphis glycines Matsumura | |
| Myzus persicae (Sulzer) | |||
| Acyrthosiphon pisum Harris | |||
| Coccomorpha | Matsucoccidae | Matsucoccus matsumurae (Kuwana) | |
| Monophlebidae | Icerya purchasi Maskell | ||
| Coronaproctus castanopsis Li, Xu & Wu | |||
| Putoidae | Puto sinensis Zheng & Wu | ||
| Pseudococcidae | Paracoccus marginatus Williams & Granara de Willink | ||
| Phenacoccus aceris (Signoret) | |||
| Phenacoccus manihoti Matile-Ferrero | |||
| Eriococcidae | Acanthococcus coriaceus (Maskell) | ||
| Apiomorpha munita (Schrader) | |||
| Kerriidae | Albotachardina sinensis Zhang | ||
| Cerococcidae | Antecerococcus theydoni (Hall) | ||
| Aclerdidae | Nipponaclerda biwakoensis (Kuwana) | ||
| Aclerda takahashii Kuwana | |||
| Coccidae | Didesmococcus koreanus Borchsenius | ||
| Saissetia coffeae (Walker) | |||
| Parasaissetia nigra (Nietner) | |||
| Ceroplastes japonicus Green | |||
| Ceroplastes floridensis Comstock |
We sequenced the genome of Puto sinensis using a next-generation sequencing method with Illumina Hiseq 2500 at Berry genomics (Beijing, China) with 6× sequencing depth and with 150 bp paired-end sequencing reads. An Illumina TruSeq library was constructed from total genomic DNA of a single species with an average insert size of 150 bp. After removing adapters and low-quality reads using fastp v. 0.20.0 (
There is no mitogenome available in GenBank for either Phenacoccus aceris or Coronaproctus castanopsis. So, SRA data of P. aceris and C. castanopsis were downloaded from NCBI to obtain their mitogenomes. The IDBA-UD (
The protein-coding genes (PCGs) and tRNA genes were annotated using the MITOS2 (
The entire workflow of the phylogenetic analysis was conducted using PhyloSuite v. 1.2.2 (
The mitochondrial genome of Puto sinensis has 18,830 bp (Fig.
| Feature | %T(U) | %C | %A | %G | %A+T | AT Skew | GC Skew |
|---|---|---|---|---|---|---|---|
| Whole genome | 44.6 | 6.8 | 46.2 | 2.4 | 90.7 | 0.017 | -0.475 |
| Protein-coding genes | 47.8 | 6.1 | 41.2 | 5.0 | 88.9 | -0.074 | -0.096 |
| First codon position | 37.9 | 5.6 | 50.2 | 6.3 | 88.1 | 0.141 | 0.057 |
| Second codon position | 56.5 | 7.4 | 29.9 | 6.2 | 86.4 | -0.308 | -0.083 |
| Third codon position | 48.9 | 5.2 | 43.4 | 2.5 | 92.3 | -0.060 | -0.353 |
| Protein-coding genes J-strand | 45.3 | 8.3 | 42.3 | 4.1 | 87.6 | -0.035 | -0.346 |
| First codon position | 33.0 | 7.7 | 54.3 | 5.0 | 87.3 | 0.244 | -0.211 |
| Second codon position | 54.8 | 10.0 | 28.9 | 6.3 | 83.7 | -0.309 | -0.229 |
| Third codon position | 48.2 | 7.3 | 43.6 | 0.9 | 91.8 | -0.051 | -0.781 |
| Protein-coding genes N-strand | 51.6 | 2.4 | 39.4 | 6.5 | 91.1 | -0.134 | 0.464 |
| First codon position | 45.6 | 2.3 | 43.7 | 8.4 | 89.4 | -0.021 | 0.572 |
| Second codon position | 59.2 | 3.1 | 31.5 | 6.2 | 90.8 | -0.306 | 0.333 |
| Third codon position | 50.1 | 1.8 | 43.0 | 5.1 | 93.0 | -0.075 | 0.474 |
| tRNA genes | 44.6 | 3.1 | 47.9 | 4.4 | 92.5 | 0.036 | 0.172 |
| tRNA genes J-strand | 44.1 | 4.1 | 48.3 | 3.6 | 92.4 | 0.045 | -0.065 |
| tRNA genes N-strand | 45.5 | 1.4 | 47.3 | 5.8 | 92.8 | 0.020 | 0.600 |
| rRNA genes | 46.3 | 2.4 | 44.6 | 6.8 | 90.8 | -0.018 | 0.479 |
| Genes | Strand | Location | Length | Anticodon and its location | Start codon | Stop codon | Intergenic nucleotides |
|---|---|---|---|---|---|---|---|
| COI | J | 1–1551 | 1551 | - | ATT | TAA | 0 |
| trnL2 | J | 1560–1623 | 64 | TAA 1592–1594 | - | - | 8 |
| trnD | J | 1626–1685 | 60 | GTC 1663–1665 | - | - | 2 |
| ATP8 | J | 1686–1826 | 141 | - | ATT | TAA | 0 |
| ATP6 | J | 1820–2431 | 612 | - | ATG | TAA | -7 |
| COIII | J | 2463–3215 | 753 | - | ATG | TAA | 31 |
| trnA | J | 3200–3275 | 76 | TGC 3237–3239 | - | - | -15 |
| trnR | J | 3268–3312 | 45 | TCG 3294–3296 | - | - | -4 |
| trnN | J | 3322–3378 | 57 | GTT 3343–3345 | - | - | 9 |
| trnE | J | 3376–3431 | 56 | TTC 3406–3408 | - | - | -3 |
| trnF | N | 3432–3496 | 66 | GAA 3449–3451 | - | - | 0 |
| trnH | N | 3518–3587 | 69 | GTG 3552–3554 | - | - | 21 |
| ND6 | J | 3586–4065 | 480 | - | ATT | TAA | -2 |
| trnS1 | J | 4068–4124 | 57 | TGA 4085–4087 | - | - | 2 |
| trnQ | N | 4152–4205 | 54 | TTG 4186–4188 | - | - | 27 |
| ND1 | N | 4218–5120 | 903 | - | ATT | TAA | 12 |
| trnL1 | N | 5120–5194 | 75 | TAG 5155–5157 | - | - | -1 |
| lrRNA | N | 5195–6388 | 1194 | - | - | - | 0 |
| trnV | N | 6389–6452 | 64 | TAC 6416–6418 | - | - | 0 |
| trnI | J | 6504–6573 | 70 | GAT 6539–6541 | - | - | 51 |
| trnM | J | 6574–6639 | 65 | CAT 6607–6609 | - | - | 0 |
| ND2 | J | 6640–7581 | 942 | - | ATT | TAA | 0 |
| trnW | J | 7580–7635 | 56 | TCA 7612–7614 | - | - | -2 |
| trnY | N | 7634–7704 | 71 | GTA 7668–7670 | - | - | -2 |
| COII | J | 7777–8448 | 672 | - | ATA | TAA | 72 |
| trnK | J | 8450–8519 | 70 | TTT 8510–8512 | - | - | 1 |
| trnG | J | 8557–8612 | 56 | TCC 8587–8589 | - | - | 37 |
| ND3 | J | 8610–8963 | 354 | - | ATA | TAA | -3 |
| trnS2 | J | 8959–9012 | 58 | TCT 8986–8988 | - | - | -5 |
| ND5 | N | 9034–10662 | 1629 | - | ATT | TAA | 17 |
| ND4 | N | 10650–11918 | 1269 | - | ATT | TAA | -13 |
| ND4L | N | 11914–12207 | 294 | - | ATT | TAA | -5 |
| trnT | J | 12190–12248 | 59 | TGT 12222–12224 | - | - | -18 |
| trnP | N | 12247–12303 | 57 | AGG 12269–12271 | - | - | -2 |
| trnC | N | 12282–12321 | 40 | GCA 12302–12304 | - | - | -22 |
| cytb | J | 12336–13418 | 1083 | - | ATG | TAA | 14 |
| srRNA | N | 15769–16377 | 609 | - | - | - | 2350 |
The mitochondrial genome of Puto sinensis comprises 13 protein-coding genes, with a length of 10,700 bp, and features a high A+T content of 88.9%. The relative synonymous codon usage (RSCU) is depicted in Fig.
| Amino acid | Codon | Count | RSCU | Amino acid | Codon | Count | RSCU |
|---|---|---|---|---|---|---|---|
| Phe | UUU(F) | 325 | 1.75 | Ser | UCU(S) | 48 | 1.81 |
| UUC(F) | 47 | 0.25 | UCC(S) | 7 | 0.26 | ||
| Leu | UUA(L) | 406 | 4.93 | UCA(S) | 85 | 3.21 | |
| UUG(L) | 24 | 0.29 | UCG(S) | 2 | 0.08 | ||
| Leu(c) | CUU(L) | 25 | 0.3 | Ser(s) | AGU(S) | 19 | 0.72 |
| CUC(L) | 4 | 0.05 | AGC(S) | 1 | 0.04 | ||
| CUA(L) | 34 | 0.41 | AGA(S) | 43 | 1.62 | ||
| CUG(L) | 1 | 0.01 | AGG(S) | 7 | 0.26 | ||
| Ile | AUU(I) | 522 | 1.85 | Thr | ACU(T) | 24 | 1.88 |
| AUC(I) | 41 | 0.15 | ACC(T) | 3 | 0.24 | ||
| Met | AUA(M) | 504 | 1.92 | ACA(T) | 24 | 1.88 | |
| AUG(M) | 21 | 0.08 | ACG(T) | 0 | 0 | ||
| Val | GUU(V) | 34 | 2.34 | Ala | GCU(A) | 10 | 2.5 |
| GUC(V) | 3 | 0.21 | GCC(A) | 2 | 0.5 | ||
| GUA(V) | 19 | 1.31 | GCA(A) | 4 | 1 | ||
| GUG(V) | 2 | 0.14 | GCG(A) | 0 | 0 | ||
| Tyr | UAU(Y) | 238 | 1.86 | Cys | UGU(C) | 10 | 1.67 |
| UAC(Y) | 18 | 0.14 | UGC(C) | 2 | 0.33 | ||
| UAA(*) | 69 | 1.79 | Trp | UGA(W) | 57 | 1.93 | |
| UAG(*) | 8 | 0.21 | UGG(W) | 2 | 0.07 | ||
| His | CAU(H) | 28 | 1.75 | Arg | CGU(R) | 7 | 1.47 |
| CAC(H) | 4 | 0.25 | CGC(R) | 0 | 0 | ||
| Gln | CAA(Q) | 30 | 1.88 | CGA(R) | 11 | 2.32 | |
| CAG(Q) | 2 | 0.13 | CGG(R) | 1 | 0.21 | ||
| Asn | AAU(N) | 375 | 1.82 | Pro | CCU(P) | 22 | 1.66 |
| AAC(N) | 38 | 0.18 | CCC(P) | 10 | 0.75 | ||
| Lys | AAA(K) | 157 | 1.89 | CCA(P) | 21 | 1.58 | |
| AAG(K) | 9 | 0.11 | CCG(P) | 0 | 0 | ||
| Asp | GAU(D) | 37 | 1.85 | Gly | GGU(G) | 17 | 1.1 |
| GAC(D) | 3 | 0.15 | GGC(G) | 3 | 0.19 | ||
| Glu | GAA(E) | 44 | 1.83 | GGA(G) | 36 | 2.32 | |
| GAG(E) | 4 | 0.17 | GGG(G) | 6 | 0.39 |
We investigated codon usage bias through three metrics: CBI (codon bias index), ENC (effective number of codons) and GC content (including third codon positions, GC3) (Fig.
Evaluation of codon bias across 18 coccoid mitogenomes. a. Relevance of GC content of codons to ENC; b. Relevance of GC content of 3rd codons positions to ENC; c. Relevance of GC content of codons to CBI; d. Relevance of GC content of 3rd codon positions to CBI; e. Relevance of ENC to CBI.
By calculating the rates of nonsynonymous substitutions (Ka), synonymous substitutions (Ks), and the Ka/Ks ratio for the protein-coding genes of Puto sinensis and 17 other species of scale insects (Fig.
Twenty-two tRNA genes were identified in Puto sinensis, and the secondary structures are shown in Fig.
Typical cloverleaf secondary structures can be formed by only a few tRNA genes, while the majority of tRNA genes are truncated.
The rrnL was located between trnL1 and trnV, and the rrnS was located between two control regions. The length of rrnL with 1, 195 bp, and the length of rrnS with 609 bp. The rrnL had a high AT content, with 91%, and the rrnS also had a high AT content, with 90.5%.
The mitogenomes of all previously sequenced coccoid species have been rearranged (Fig.
The phylogenetic analyses were conducted using sequences of 13 protein-coding genes from mitochondrial genomes of 27 species of Hemiptera and three species of Thysanoptera. Both Bayesian inference (BI) and maximum likelihood (ML) methods produced identical topologies (Fig.
The conserved gene arrangement is recognized as a characteristic feature of mitochondrial genomes of insects (
Additionally, Putoidae was recovered as a sister group to neococcoids based on 13 protein-coding genes from the mitochondrial genome, further indicating that this family is more closely related to neococcoids than the two non-neococcoid families Matsucoccidae and Monophlebidae. The results from mitogenomic gene rearrangements and molecular phylogeny both support Putoidae as sister to neococcoids, consistent with evidence from molecular fragments, morphology of adult males, and multi-evidence analyses (
We thank Dr Erin Powell (Florida Department of Agriculture and Consumer Services), Dr Jinyeong Choi (Okinawa Institute of Science and Technology Graduate University) and two anonymous reviewers for their helpful suggestions and critical review of the manuscript. This project was supported by the National Natural Science Foundation of China (No. 32270476).
The authors have declared that no competing interests exist.
No ethical statement was reported.
No use of AI was reported.
This project was supported by the Beijing Natural Science Foundation (Grants 5224042), GDAS Special Project of Science and Technology Development (2020GDASYL-20200102021), and National Natural Science Foundation of China (No. 32270476).
Yu-ang Li and Han Xu designed the research and analyzed the data; Yu-ang Li and Xinyi Zheng wrote the initial draft of the paper; San-an Wu and Han Xu reviewed and edited the manuscript. All authors have read and agreed to the final version of the manuscript for publication.
Yu-ang Li https://orcid.org/0009-0004-8851-6796
Xinyi Zheng https://orcid.org/0000-0002-7396-7488
Han Xu https://orcid.org/0000-0001-7226-1742
San-an Wu https://orcid.org/0000-0002-9671-9401
All of the data that support the findings of this study are available in the main text or Supplementary Information.
Summary of partitioning scheme for Maximum Likelihood and Bayesian inference
Data type: docx