Research Article |
Corresponding author: Guo-Hao Zu ( zuguohao@tjau.edu.cn ) Academic editor: Zachary Lahey
© 2024 Cheng-Hui Zhang, Hai-Yang Wang, Yan Wang, Zhi-Hao Chi, Yue-Shuo Liu, Guo-Hao Zu.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Zhang C-H, Wang H-Y, Wang Y, Chi Z-H, Liu Y-S, Zu G-H (2024) The first two complete mitochondrial genomes for the genus Anagyrus (Hymenoptera, Encyrtidae) and their phylogenetic implications. ZooKeys 1206: 81-98. https://doi.org/10.3897/zookeys.1206.121923
|
Anagyrus, a genus of Encyrtidae (Hymenoptera, Chalcidoidea), represents a successful group of parasitoid insects that attack various mealybug pests of agricultural and forestry plants. Until now, only 20 complete mitochondrial genomes have been sequenced, including those in this study. To enrich the diversity of mitochondrial genomes in Encyrtidae and to gain insights into their phylogenetic relationships, the mitochondrial genomes of two species of Anagyrus were sequenced, and the mitochondrial genomes of these species were compared and analyzed. Encyrtid mitochondrial genomes exhibit similarities in nucleotide composition, gene organization, and control region patterns. Comparative analysis of protein-coding genes revealed varying molecular evolutionary rates among different genes, with six genes (ATP8, ND2, ND4L, ND6, ND4 and ND5) showing higher rates than others. A phylogenetic analysis based on mitochondrial genome sequences supports the monophyly of Encyrtidae; however, the two subfamilies, Encyrtinae and Tetracneminae, are non-monophyletic. This study provides valuable insights into the phylogenetic relationships within the Encyrtidae and underscores the utility of mitochondrial genomes in the systematics of this family.
Encyrtid, genome structure, mitogenome, protein-coding genes, phylogenetic analyses, Tetracneminae
Encyrtidae is a large hymenopteran family in the superfamily Chalcidoidea, comprising 518 known genera, of which 495 are recognized as valid (totaling more than 4830 species), along with 23 fossil genera (26 species) worldwide (
Insect mitochondrial genomes are usually small, circular molecules containing 37 genes: 13 protein-coding genes (PCGs), two ribosomal RNA genes (rRNAs), and 22 transfer RNA genes (tRNAs), as well as a large non-coding element known as the A+T-rich or control region (CR), which regulates transcription and replication (
The exploration of hymenopteran mitochondrial genomes commenced with the sequencing of CYTB and ATP8 genes of Apis mellifera, and it was not until 1993 that the first complete mitochondrial genome was deciphered (
At present, there are only morphology-based classification systems for Encyrtidae (
In this study, we conducted the sequencing and annotation of the mitogenomes of Anagyrus galinae (accession number: OR652687) and Anagyrus jenniferae (accession number: OR790122), analyzing their respective characteristics. In addition, we reconstructed the molecular phylogenetic relationships of these two new mitochondrial genomes and other species of Encyrtidae. The molecular data presented in this study will contribute to a better understanding of the characteristics of the Encyrtidae mitogenome. Further, a phylogenetic analysis was performed, including 19 uploaded mitogenomes together with our newly acquired data, which represented Encyrtidae. The goal of our study was to place two new species of Anagyrus within the context of the known mitogenome diversity of Encyrtidae by performing mitogenomic and phylogenetic analyses.
The specimens, A. galinae and A. jenniferae, were collected from Tianjin Agricultural University (39°5′21″N, 117°5′38″E), Xiqing District, Tianjin City, China, in September 2022. Freshly collected specimens were promptly immersed in 100% ethanol for initial preservation and subsequently stored at −40 °C in the Insect Herbarium of Tianjin Agricultural University. Following morphological identification, total DNA from each specimen was extracted from the body, excluding the abdomen, using the DNeasy Blood & Tissue Kit (Qiagen, Hilden, Germany) according to the manufacturer’s instructions. The purity and concentration of the extracted total DNA were assessed through 1% agarose gel electrophoresis and optical density value detection. The total DNA of two encyrtids underwent sequencing using the Illumina NovaSeq 6000 platform with a 350 bp insert size and a paired-end 150 bp sequencing strategy. Sequencing was conducted by Novogene Co., Ltd. (Beijing, China).
After initial data acquisition, with adapter sequences removed, additional filtering was carried out using fastp 0.23.4 (
A total of 21 mitogenomes from two families of Chalcidoidea, including 20 Encyrtidae species and a Aphelinidae species as outgroup, were used for the phylogenetic analysis (Table
GenBank accession numbers of species used in phylogenetic reconstruction and their original publications.
Superfamily | Family | Species | Accession Number | References |
---|---|---|---|---|
Chalcidoidea | Aphelinidae | Encarsia formosa | MG813797 |
|
Encyrtidae | Aenasius arizonensis | NC_045852 |
|
|
Anagyrus galinae | OR652687 | This study | ||
Anagyrus jenniferae | OR790122 | This study | ||
Blastothrix speciosa | NC_082111 | Unpublished | ||
Cheiloneurus chinensis | NC_084192 | Unpublished | ||
Cheiloneurus elegans | NC_071192 | Unpublished | ||
Diaphorencyrtus aligarhensis | NC_046058 |
|
||
Encyrtus aurantii | OR120384 | Unpublished | ||
Encyrtus eulecaniumiae | NC_051459 |
|
||
Encyrtus infelix | NC_041176 |
|
||
Encyrtus rhodococcusiae | NC_051460 |
|
||
Encyrtus sasakii | NC_051458 |
|
||
Exoristobia philippinensis | NC_084171 | Unpublished | ||
Lamennaisia ambigua | NC_082113 | Unpublished | ||
Lamennaisia nobilis | NC_061411 | Unpublished | ||
Leptomastidea bifasciata | OR790123 | Unpublished | ||
Metaphycus eriococci | NC_056349 |
|
||
Ooencyrtus plautus | NC_068223 |
|
||
Psyllaephagus sp. | OP787025 | Unpublished | ||
Tassonia gloriae | NC_082112 | Unpublished |
The assembled mitochondrial genome of A. galinae was a 15,364 bp, and the A. jenniferae mitochondrial genome was 15,396 bp, which both had the same gene organization, including 13 PCGs, 22 tRNAs, two rRNAs and a control region located between trnM and trnI (Fig.
Gene organization of the mitochondrial genomes of Anagyrus galinae and Anagyrus jenniferae.
Gene | Direction | Anticodon | Anagyrus galinae | Anagyrus jenniferae | |||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Position | Length | Start codon | Stop codon | Intergenic Nucleotides | Position | Length | Start codon | Stop codon | Intergenic Nucleotides | ||||
trnI | − | GAU | 1–70 | 70 | 1–67 | 67 | |||||||
ND2 | + | 98–1087 | 990 | ATT | TAA | 27 | 74–1081 | 1008 | ATT | TAA | 6 | ||
trnW | + | UCA | 1087–1149 | 63 | −1 | 1080–1146 | 67 | -2 | |||||
trnY | − | GUA | 1155–1221 | 67 | 5 | 1148–1212 | 65 | 1 | |||||
trnS1 | − | UCU | 1222–1280 | 59 | 0 | 1216–1275 | 60 | 3 | |||||
trnC | − | GCA | 1283–1348 | 66 | 2 | 1293–1361 | 69 | 17 | |||||
trnN | + | GUU | 1369–1434 | 66 | 20 | 1368–1431 | 64 | 6 | |||||
trnR | − | UCG | 1433–1497 | 65 | −2 | 1439–1504 | 66 | 7 | |||||
ND3 | − | 1498–1842 | 345 | ATT | TAA | 0 | 1505–1858 | 354 | ATA | TAA | 0 | ||
trnG | − | UCC | 1843–1906 | 64 | 0 | 1856–1919 | 64 | -3 | |||||
CO3 | − | 1911–2714 | 804 | ATG | TAA | 4 | 1925–2710 | 786 | ATG | TAA | 5 | ||
ATP6 | − | 2715–3387 | 673 | ATG | T | 0 | 2710–3383 | 674 | ATG | TA | -1 | ||
ATP8 | − | 3381–3542 | 162 | ATT | TAA | −7 | 3377–3538 | 162 | ATC | TAA | -7 | ||
trnD | − | GUC | 3543–3608 | 66 | 0 | 3539–3602 | 64 | 0 | |||||
trnK | + | UUU | 3612–3683 | 72 | 3 | 3606–3676 | 71 | 3 | |||||
CO2 | − | 3688–4365 | 678 | ATT | TAG | 4 | 3678–4355 | 678 | ATT | TAA | 1 | ||
trnL2 | − | UAA | 4369–4434 | 66 | 3 | 4365–4428 | 64 | 9 | |||||
CO1 | − | 4440–5987 | 1548 | ATT | TAA | 5 | 4431–5969 | 1539 | ATG | TAA | 2 | ||
trnE | + | UUC | 5972–6036 | 65 | −16 | 5972–6034 | 63 | 2 | |||||
trnF | − | GAA | 6036–6102 | 67 | −1 | 6034–6099 | 66 | -1 | |||||
ND5 | − | 6102–7769 | 1668 | ATA | TAA | −1 | 6099–7763 | 1665 | ATT | TAG | -1 | ||
trnH | − | GUG | 7767–7833 | 67 | −3 | 7764–7829 | 66 | 0 | |||||
ND4 | − | 7844–9169 | 1326 | ATG | TAG | 10 | 7829–9156 | 1328 | ATG | TA | -1 | ||
ND4L | − | 9163–9450 | 288 | ATT | TAA | −7 | 9150–9437 | 288 | ATT | TAA | -7 | ||
trnT | + | UGU | 9453–9518 | 66 | 2 | 9440–9505 | 66 | 2 | |||||
trnP | − | UGG | 9520–9582 | 63 | 1 | 9506–9574 | 69 | 0 | |||||
ND6 | + | 9584–10151 | 568 | ATG | T | 1 | 9575–10143 | 569 | ATG | TA | 0 | ||
CYTB | + | 10152–11300 | 1149 | ATG | TAA | 0 | 10143–11285 | 1143 | ATG | TAA | -1 | ||
trnS2 | + | UGA | 11300–11365 | 66 | −1 | 11290–11354 | 65 | 4 | |||||
ND1 | − | 11356–12291 | 936 | ATT | TAG | −10 | 11345–12283 | 939 | ATA | TAG | -10 | ||
trnL1 | − | UAG | 12292–12358 | 67 | 0 | 12284–12348 | 65 | 0 | |||||
lrRNA | − | 12364–13674 | 1311 | 5 | 12353–13654 | 1302 | 4 | ||||||
trnA | − | UGC | 13682–13744 | 63 | 7 | 13651–13719 | 69 | -4 | |||||
trnQ | + | UUG | 13761–13831 | 71 | 16 | 13797–13864 | 68 | 77 | |||||
srRNA | − | 13831–14602 | 772 | −1 | 13891–14646 | 756 | 26 | ||||||
trnV | − | UAC | 14602–14669 | 68 | −1 | 14646–14710 | 65 | -1 | |||||
trnM | − | CAU | 14668–14735 | 68 | −2 | 14709–14770 | 62 | -2 | |||||
CR | 14736–15364 | 629 | 0 | 14771–15396 | 626 | 0 |
The nucleotide composition of the mitogenome from A. galinae was biased toward A and T, with 83.12% of A+T content (A = 45.12%, T = 38.00%, C = 10.82%, G = 6.05%), A+T content was 82.94%, 87.20% in PCGs and rRNAs, respectively. The nucleotide composition of the mitogenome from A. jenniferae was biased toward A and T, with 82.64% of A+T content (A = 46.41%, T = 36.23%, C = 11.33%, G = 6.02%), A+T content was 82.32%, 85.20% in PCGs and rRNAs, respectively. The values of AT-skew and GC-skew were often used to indicate the nucleotide composition of the mitochondrial genome. In this study, the nucleotide features of two new mitogenomes were investigated by calculating the percentages of AT-skew and GC-skew (Table
Nucleotide features of the mitochondrial genome across Anagyrus galinae and Anagyrus jenniferae.
Feature | Length (bp) | T% | C% | A% | G% | A+T% | AT-Skew | GC-Skew |
---|---|---|---|---|---|---|---|---|
Whole genome | 15364/15396 | 38.00/36.23 | 10.82/11.33 | 45.12/46.41 | 6.05/6.02 | 83.12/82.64 | 0.086/0.123 | −0.283/−0.306 |
ATP6 | 673/674 | 46.66/47.63 | 7.43/8.01 | 34.92/34.27 | 11.00/10.09 | 81.58/81.90 | −0.144/−0.163 | 0.194/0.115 |
ATP8 | 162/162 | 48.77/48.77 | 4.32/4.94 | 43.83/36.42 | 3.09/9.88 | 92.59/85.19 | −0.053/−0.145 | −0.167/0.333 |
CO1 | 1524/1539 | 45.41/46.39 | 10.37/10.98 | 29.86/27.23 | 14.37/15.40 | 75.26/73.62 | −0.207/−0.260 | 0.162/0.167 |
CO2 | 678/678 | 45.58/45.72 | 8.55/8.41 | 33.04/33.19 | 12.83/12.68 | 78.61/78.91 | −0.159/−0.159 | 0.200/0.203 |
CO3 | 804/786 | 46.64/49.75 | 7.84/8.52 | 32.21/29.90 | 13.31/11.83 | 78.86/79.64 | −0.183/−0.249 | 0.259/0.163 |
CYTB | 1149/1143 | 43.69/41.91 | 14.36/14.7 | 32.64/34.82 | 9.31/8.57 | 76.33/76.73 | −0.145/−0.092 | −0.213/−0.263 |
ND1 | 936/939 | 46.47/48.35 | 7.05/6.71 | 32.37/31.31 | 14.1/13.63 | 78.85/79.66 | −0.179/−0.214 | 0.333/0.340 |
ND2 | 990/1008 | 50.10/47.52 | 9.19/9.62 | 37.58/39.19 | 14.10/13.63 | 87.68/86.71 | −0.143/−0.096 | −0.492/−0.448 |
ND3 | 345/351 | 51.01/52.99 | 5.22/5.41 | 33.91/31.34 | 9.86/10.26 | 84.93/84.33 | −0.201/−0.257 | 0.308/0.309 |
ND4 | 1326/1328 | 50.08/52.41 | 4.98/5.20 | 34.01/30.20 | 10.94/12.20 | 84.09/82.61 | −0.191/−0.269 | 0.374/0.403 |
ND4L | 288/288 | 53.13/53.82 | 2.78/2.08 | 34.03/36.46 | 10.07/7.64 | 87.15/90.28 | −0.219/−0.192 | 0.568/0.571 |
ND5 | 1665/1665 | 50.81/51.11 | 5.77/5.77 | 33.09/32.61 | 10.33/10.51 | 83.90/83.72 | −0.211/−0.221 | 0.284/0.292 |
ND6 | 568/569 | 46.13/45.34 | 8.45/10.54 | 42.25/41.48 | 3.17/2.64 | 88.38/86.82 | −0.044/−0.045 | −0.455/−0.600 |
srRNA | 772/756 | 44.30/44.84 | 4.15/4.10 | 43.52/40.08 | 8.03/10.98 | 87.82/84.92 | −0.009/−0.056 | 0.319/0.456 |
lrRNA | 1311/1302 | 44.55/46.08 | 4.27/4.15 | 42.03/39.40 | 9.15/10.37 | 86.58/85.48 | −0.029/−0.078 | 0.364/0.429 |
CR | 629/626 | 42.61/40.57 | 7.15/8.47 | 46.26/48.72 | 3.98/2.24 | 88.87/89.29 | 0.041/0.091 | −0.285/−0.582 |
By comparing the known mitochondrial genome structure of Encyrtidae, we found that the sequence of 13 PCGs was consistent, except for ND3 rearranged in Diaphorencyrtus aligarhensis and Leptomastidea bifasciata. The sequence of PCGs in these mitochondrial genomes were the same (Fig.
The total lengths of 13 PCGs are 11,108 bp in A. galinae, 11,130 bp in A. jenniferae. In these mitochondrial genomes, the length of each PCG ranges from 162 bp (ATP8) to 1665 bp (ND5). Two mitogenomes of Anagyrus exhibited similar start and stop codons. All the initiation codons of PCGs were ATN (ATA, ATG and ATT). Three kinds of stop codons existed on the new mitogenomic sequences: TAA, TAG and truncated termination codons (TA existed on ATP6, ND4, ND6 in A. jenniferae, T existed on ATP6, ND6 in A. galinae), TAA were the most frequently used. Truncated termination codons are commonly used in metazoan mitogenomes, which could be completed by post-transcriptional poly-adenylation (
The codon UUA (Leu2) was the most commonly used in both mitogenomes. Mitochondrial protein coding genes have obvious bias towards A and T, and for mitochondrial protein-coding gene of A. galinae the three most frequently used codons were UUA (Leu2) 469 times, AUU (Ile) 440 times and UUU (Phe) 432 times. For A. jenniferae, the three most used codons were UUA (Leu2) 463 times, UUU (Phe) 431 times and AUU (Ile) 393 times. Mitochondrial protein-coding genes in Encyrtidae prefer A and U in the third codon, which is like some hymenopteran insects (
In this study, based on 20 mitochondrial genomes of Encyrtidae, DnaSP was used to calculate the non-synonymous substitution rate, synonymous substitution, and Ka/Ks ratio of 13 PCGs in the mitochondrial genome and then to compare the evolution rate between genes (Fig.
Ka/Ks values of 12 PCGs (all PCGs except ATP8) were far lower than 1.0, indicating that they were subject to purifying selection, a phenomenon first discovered in Chalcidoidea. In addition, the Ka/Ks value of ATP8 is higher than 1.0, higher value of ATP8 was also found in other species (
The mitochondrial genomes of the two species both included 22 tRNA genes, and the total lengths of the tRNAs of A. galinae and A. jenniferae are 1455 bp and 1445 bp, respectively. The length of tRNA genes in two Anagyrus species ranged from 59 to 72 bp. The secondary structures of the 22 tRNAs of the two species are shown in Suppl. materials
Hymenopteran mitochondria have a high rearrangement rate, which mainly occurs in A+T-rich regions, ND2, ND2-CO2, CO2-ATP8, and ND3-ND5 regions (
As for the rRNAs of two Anagyrus species, both lrRNA and srRNA genes are encoded on the N-strand and have a heavy AT nucleotide bias. The lengths of lrRNA and srRNA in A. galinae are 1311 bp and 772 bp, with the different A+T contents of 86.58% and 87.82%, and in A. jenniferae are 1302 bp and 756 bp, with the different A+T contents of 85.48% and 84.92%.
In the mitogenome, the largest non-coding region is normally the A+T-rich region, also known as the control region, which regulates the replication and transcription of mitochondrial DNA (
The phylogenetic analysis of the concatenated dataset was conducted using BI and ML, which were shown in Fig.
The result of maximum-likelihood and Bayesian analysis both indicate that the taxonomic relationship of each genus of Encyrtidae is (Metaphycus + Aenasius) + (((Anagyrus + Leptomastidea) + Encyrtus) + ((Blastothrix + Psyllaephagus) + (((Cheiloneurus + Tassonia) + Diaphorencyrtus) + (Ooencyrtus + (Exoristobia + Lamennaisia))))).
Overall, the phylogenetic trees reconstructed by both methods indicate that species belonging to the same tribe are clustered into one or adjacent clades, while species belonging to the same genus are clustered into the same clade, consistent with the morphological classification system. At the subfamily level, according to the morphological classification system, Encyrtidae is divided into two subfamilies: Tetracneminae and Encyrtinae. Aenasius, Anagyrus, and Leptomastidea all belong to Tetracneminae, while the remaining genera belong to Encyrtinae. However, in the phylogenetic trees reconstructed in this study, the results of both methods show that, except for Encyrtus and Metaphycus, Encyrtidae is divided into two main parts, which essentially conforms to the morphological classification system. Metaphycus and Aenasius form a monophyletic clade as sister groups, which is consistent with the previous phylogenetic results (
While the Anagyrus species were not clustered on one branch with Aenasius arizonensis but clustered with Encyrtus, this may be due to different dietary habits. The five genera Metaphycus, Aenasius, Anagyrus, Leptomastidea, and Encyrtus exclusively parasitize scale insects within Hemiptera. In contrast, other species of Encyrtinae have a broader host range, including species from Lepidoptera, Diptera, Coleoptera, Hymenoptera, and more families within Hemiptera (
In this study, we determined two newly sequenced mitogenomes, which are from A. galinae and A. jenniferae, then found them consistent with previously reported mitogenomes of Encyrtidae. Two new mitogenomes exhibited quite similar features in the genome size, base content, AT nucleotide bias, AT-skew, GC-skew, codon usage of protein genes, secondary structure of tRNAs and gene rearrangement. The BI and ML phylogenetic analysis among the major lineages based on the concatenated datasets yielded well-resolved topologies with moderate to high support for most branches. These results provide a relatively holistic framework and valuable data toward the future resolution of phylogenetic relationships in Encyrtidae. This study provided insights into the phylogenetic relationships of certain taxa within Encyrtidae, the limited sample size and scarcity of molecular evidence remain challenges. Therefore, future studies should aim to augment the number of sampled species and expand the dataset of mitochondrial genomes, utilizing a broader range of data for robust phylogenetic analysis and a comprehensive assessment of the taxonomic status within Encyrtidae.
We extend our heartfelt gratitude to Tao Wang from the College of Life Sciences, Nankai University, China, for providing valuable assistance in software analysis. Special thanks also go to Miss Zi-Yan Wang from the University of Sheffield, UK, as well as Mr. Shuai Zhang and Mr. Mark Sharples from the University of Manchester, UK, for their dedicated efforts in reviewing and revising the article. Their contributions have significantly enriched the quality and clarity of our work.
The authors have declared that no competing interests exist.
No ethical statement was reported.
No funding was received for conducting this study.
Conceptualization: GHZ, CHZ. Data curation: YW, CHZ. Formal analysis: CHZ. Investigation: CHZ, HYW. Methodology: CHZ, YW. Project administration: YW, CHZ, HYW. Resources: GHZ. Software: CHZ, YSL, ZHC. Supervision: GHZ, CHZ. Validation: CHZ, HYW. Visualization: CHZ. Writing – original draft: YW, CHZ, HYW. Writing – review and editing: CHZ, HYW.
Cheng-Hui Zhang https://orcid.org/0000-0001-8234-0903
Hai-Yang Wang https://orcid.org/0009-0007-5665-2111
Yan Wang https://orcid.org/0000-0002-5001-975X
Zhi-Hao Chi https://orcid.org/0009-0006-0447-6948
Guo-Hao Zu https://orcid.org/0000-0002-9892-2171
Secondary structures of 22 tRNA genes of Anagyrus galinae
Data type: jpg
Explanation note: Blue gene names indicate that in the major strand, and red names indicate that in the minor strand.
Secondary structures of 22 tRNA genes of Anagyrus jenniferae
Data type: jpg
Explanation note: Blue gene names indicate that in the major strand, and red names indicate that in the minor strand.