Research Article |
Corresponding author: Hao-Yuan Hu ( haoyuanhu@126.com ) Corresponding author: Yi-Xin Huang ( huangyx@ahnu.edu.cn ) Academic editor: Francisco Javier Peris Felipo
© 2022 Zhi-Ping Xing, Xin Liang, Xu Wang, Hao-Yuan Hu, Yi-Xin Huang.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Xing Z-P, Liang X, Wang X, Hu H-Y, Huang Y-X (2022) Novel gene rearrangement pattern in mitochondrial genome of Ooencyrtus plautus Huang & Noyes, 1994: new gene order in Encyrtidae (Hymenoptera, Chalcidoidea). ZooKeys 1124: 1-21. https://doi.org/10.3897/zookeys.1124.83811
|
Studies of mitochondrial genomes have a wide range of applications in phylogeny, population genetics, and evolutionary biology. In this study, we sequenced and analyzed the mitochondrial genome of Ooencyrtus plautus Huang & Noyes, 1994 (Hymenoptera, Encyrtidae). The nearly complete mitogenome of O. plautus was 15,730 bp in size, including 13 PCGs (protein-coding genes), 22 tRNAs, 2 rRNAs, and a nearly complete control region. The nucleotide composition was significantly biased toward adenine and thymine, with an A + T content of 84.6%. We used the reference sequence of Chouioia cunea and calculated the Ka/Ks ratio for each set of PCGs. The highest value of the Ka/Ks ratio within 13 PCGs was found in nad2 with 1.1, suggesting that they were subjected to positive selection. This phenomenon was first discovered in Encyrtidae. Compared with other encyrtid mitogenomes, a translocation of trnW was found in O. plautus, which was the first of its kind to be reported in Encyrtidae. Comparing with ancestral arrangement pattern, wasps reflect extensive gene rearrangements. Although these insects have a high frequency of gene rearrangement, species from the same family and genus tend to have similar gene sequences. As the number of sequenced mitochondrial genomes in Chalcidoidea increases, we summarize some of the rules of gene rearrangement in Chalcidoidea, that is four gene clusters with frequent gene rearrangements. Ten mitogenomes were included to reconstruct the phylogenetic trees of Encyrtidae based on both 13 PCGs (nucleotides of protein coding genes) and AA matrix (amino acids of protein coding genes) using the maximum likelihood and Bayesian inference methods. The phylogenetic tree reconstructed by Bayesian inference based on AA data set showed that Aenasius arizonensis and Metaphycus eriococci formed a clade representing Tetracneminae. The remaining six species formed a monophyletic clade representing Encyrtinae. In Encyrtinae, Encyrtus forms a monophyletic clade as a sister group to the clade formed by O. plautus and Diaphorencyrtus aligarhensis. Encyrtus sasakii and Encyrtus rhodooccisiae were most closely related species in this monophyletic clade. In addition, gene rearrangements can provide a valuable information for molecular phylogenetic reconstruction. These results enhance our understanding of phylogenetic relationships among Encyrtidae.
Encyrtinae, mitogenome, pairwise breakpoint distance, phylogenetic tree, Tetracneminae
The mitochondrial genome is a standard circular molecule, mostly range between 15 kb and 18 kb in size, with 37 genes, including 13 protein-coding genes, three of which are oxidative phosphorylation complexes, 2 rRNAs, and 22 tRNAs and a major non-coding region which mainly regulates replication and transcription (Fig.
Extensive mitochondrial genome data indicates that, compared to other orders in the hexapoda, Hymenoptera (
Ooencyrtus plautus is a kind of parasitoid in the family Encyrtidae, which has the characteristics of high parasitism rate and strong reproduction ability. In the Encyrtidae, seven species of complete mitochondrial genes and one species of partial mitochondrial genes have been reported (Table
Family | Subfamily | Taxa | GenBank Accession No. | Location/Refence | |
---|---|---|---|---|---|
1 | Aphelinidae | Encarsia formosa | MG813797.1 | ( |
|
2 | Encarsia obtusiclava | MG813798.1 | ( |
||
3 | Encyrtidae | Tetracneminae | Aenasius arizonensis | MK630013 | ( |
4 | Tetracneminae | Metaphycus eriococci | MW255970 | Direct Submission | |
5 | Encyrtinae | Diaphorencyrtus aligarhensis | MN274569 | ( |
|
6 | Encyrtinae | Encyrtus eulecaniumiae | NC_051459 | Direct Submission | |
7 | Encyrtinae | Encyrtus infelix | MH729198 | ( |
|
8 | Encyrtinae | Encytus rhodooccisiae | NC_051460 | Direct Submission | |
9 | Encyrtinae | Encyrtus sasakii | NC_051458 | Direct Submission | |
10 | Encyrtinae | Ooencyrtus plautus | OP442361 | This study |
Specimens of Ooencyrtus plautus were collected from Fuzhou city, Fujian province, in September 2020. They were reared in the laboratory, then processed for DNA extraction. Total genomic DNA was extracted using the cetyltrimethyl ammonium bromide (CTAB) method (
Sequencing was performed using a whole genome shotgun (WGS) strategy on the Illumina Novaseq platform. The quality of data was checked using FastQC (
Gene annotation of mitochondrial genome was performed using MitoZ and 13 protein-coding genes as well as 22 tRNA genes are annotated (
Geneious v. 11.0.2 was used to examine all genes in the mitochondrial genome (
In this study, a total of 10 species of mitogenomes were analyzed, of which Encarsia formosa and Encarsia obtusiclava of family Aphelinidae were selected as outgroups (Table
Both ML (maximum likelihood) and BI (Bayesian inference) analyses were performed on the concatenated data set used for phylogenetic reconstruction. In W-IQtree (
The nearly complete mitogenome of O. plautus was 15,730 bp in size, including 13 PCGs, 22 tRNAs, 2 rRNAs, and a nearly complete control region. In Ooencyrtus plautus, 27 genes (15 tRNAs, 2 rRNAs, and 10 PCGs) were encoded by the majority-strand (J-strand), and 10 genes (3 PCGs and 7 tRNAs) were encoded by the minority-strand (N-strand). The values of AT skew and GC skew are often used to reveal the nucleotide composition of the mitochondrial genome (
The nucleotide composition O. plautus was significantly biased toward adenine and thymine, with an A + T content of 84.6% (Table
Nucleotide composition and skewing of the Ooencyrtus plautus mitogenome.
size | T | C | A | G | AT-skew | GC-skew | |
---|---|---|---|---|---|---|---|
whole genome | 15730 | 0.401 | 0.085 | 0.445 | 0.069 | 0.052 | -0.104 |
PCGs | 11072 | 0.381 | 0.097 | 0.439 | 0.083 | 0.071 | -0.078 |
PCGs(J) | 8402 | 0.345 | 0.086 | 0.467 | 0.102 | 0.150 | 0.085 |
PCGs(N) | 2670 | 0.494 | 0.081 | 0.352 | 0.073 | -0.168 | -0.052 |
tRNA | 1467 | 0.455 | 0.066 | 0.437 | 0.042 | -0.020 | -0.222 |
tRNA(J) | 994 | 0.454 | 0.036 | 0.437 | 0.073 | -0.019 | 0.339 |
tRNA(N) | 473 | 0.457 | 0.053 | 0.438 | 0.053 | -0.021 | 0.000 |
rRNA | 2103 | 0.437 | 0.068 | 0.453 | 0.042 | 0.018 | -0.236 |
Intergenic spacers and overlapping genes are very common in arthropod mitochondrial genomes (
In the mitochondrial genome of O. plautus, the total length of protein-coding genes was 11,072 bp, accounting for 70.39% of the entire genome. Most of them were encoded on the J strand, and only nad2, nad6, and cob were encoded on the N strand. The average A + T content of the 13 protein-coding genes was 82%, ranging from 73.7% (cox1) to 93.80% (atp8) for individual genes.
To investigate further this high A and T content, and the frequency of synonymous codon usage, we calculated relative synonymous codon usage (RSCU) values. The relative synonymous codon usages (RSCU) of the O. plautus are shown in Fig.
The predicted initiation codons are ATN as in most other insect mitochondrial genomes (
Name | Start | Stop | Strand | Length | Codons |
---|---|---|---|---|---|
trnI | 1 | 68 | + | 68 | |
nad2 | 1064 | 75 | - | 870 | ATT/TAA |
trnR | 1146 | 1082 | - | 65 | |
trnQ | 1227 | 1160 | - | 68 | |
trnN | 1229 | 1296 | + | 68 | |
trnW | 1299 | 1367 | + | 69 | |
trnC | 1481 | 1415 | - | 67 | |
trnS1 | 1482 | 1540 | + | 59 | |
trnY | 1548 | 1613 | + | 66 | |
nad3 | 1613 | 1966 | + | 333 | ATT/TAA |
trnG | 1975 | 2039 | + | 65 | |
cox3 | 2043 | 2828 | + | 747 | ATG/TAA |
atp6 | 2828 | 3502 | + | 666 | ATG/TAA |
atp8 | 3496 | 3657 | + | 159 | ATT/TAA |
trnD | 3658 | 3721 | + | 64 | |
trnK | 3796 | 3727 | - | 70 | |
cox2 | 3800 | 4474 | + | 639 | ATT/TAA |
trnL2 | 4486 | 4549 | + | 64 | |
cox1 | 4561 | 6090 | + | 1509 | ATG/TAA |
trnE | 6161 | 6099 | - | 63 | |
trnF | 6162 | 6229 | + | 68 | |
nad5 | 6230 | 7898 | + | 1419 | ATT/T |
trnH | 7899 | 7966 | + | 68 | |
nad4 | 7970 | 9307 | + | 1281 | ATG/TAA |
nad4l | 9301 | 9588 | + | 255 | ATT/TAA |
trnT | 9589 | 9659 | - | 71 | |
trnP | 9661 | 9725 | + | 65 | |
nad6 | 10289 | 9744 | - | 501 | ATG/TAA |
cob | 11424 | 10291 | - | 1098 | ATG/TAA |
trnS2 | 11491 | 11423 | - | 69 | |
nad1 | 11493 | 12417 | + | 906 | ATT/T |
trnL1 | 12418 | 12483 | + | 66 | |
rrnL | 12484 | 13791 | + | 1308 | |
trnA | 13792 | 13860 | + | 69 | |
rrnS | 13861 | 14655 | + | 795 | |
trnV | 14656 | 14725 | + | 70 | |
trnM | 14724 | 14788 | + | 65 |
Previous work (
In total, 22 transfer RNA genes were found, ranging in size from 59 bp (trnS1) to 71 bp (trnT). Most of the tRNAs were encoded on the J strand and only 7 tRNAs (trnT, trnE, trnK, trnC, trnQ, trnR, and trnS2) were encoded on the N strand. The average nucleotide composition of these tRNAs was A: 43.7%, T: 45.5%, C: 6.6%, and G: 4.2%, with a total average A + T content of 89.2%. All tRNA sequences can be folded into the canonical cloverleaf secondary structure, except for trnS1 which lacked the dihydrouridine (DHU) arm. A lack of the DHU arm in trnS1 was found in the mitochondrial genomes of most insects (
The two rRNA genes, the larger ribosomal gene (rrnL) and the smaller ribosomal gene (rrnS), were located between trnA and trnL1, and trnV and trnA, respectively. The average of the total size of two rRNAs was 2,103 bp and the average A + T content was 89%.
Gene rearrangement in the mitochondrial genome is a relatively rare event in the evolutionary history of insects (
In Encyrtidae, the gene order of all species of Encytus are identical. We used their gene order as a template to analyze the gene rearrangement of Encyrtidae. It can be summarized into four obvious rearrangements. They are the rearrangement of gene clusters trnI-nad2-trnW, trnY-trnS1-trnN-trnC-trnR, and trnA-trnQ-rrns-trnV-trnM. In addition, there are inversions of trnE-trnF and trnP-trnT.
Firstly, the trnW-trnI-nad2 gene cluster in these four species (Aenasius arizonensis, Diaphorencyrtus aligarhensis, Metaphycus eriococci, and Platencytus parkeri) was rearranged as trnI-nad2-trnW, most species in Chalcidoidea of which were in this order. However, in O. plautus, trnW was translocated, causing the gene cluster trnI-nad2-trnW to be divided into trnI-nad2 and trnW, which is the first of its kind to be reported in Chalcidoidea. Besides, the trnI-nad2 gene cluster was translocated to upstream. Secondly, the trnN-trnS1-trnY-trnC-trnR gene clusters in four species (A. arizonensis, D. aligarhensis, M. eriococci, and O. plautus) have undergone disorderly rearrangements. Thirdly, trnE-trnF inversion occurred in both A. arizonensis and D. aligarhensis. In addition, trnP-trnT inversion also occurred in D. aligarhensis. Finally, in D. aligarhensis and O. plautus, trnQ in the gene cluster trnA-trnQ-rrnS-trnV-trnM has a long-distance transposition. In M. eriococci, trnM and trnQ were transposed in the gene cluster trnA-trnQ-rrnS-trnV-trnM.
Phylogenetic relationships were analyzed using the concatenated nucleotides and amino acids sequences of 13 PCGs from eight encyrtid species and two outgroups. Four topologies were constructed using both ML and BI methods and two different data sets.
The topological structures of these two phylogenetic trees reconstructed by ML analysis were identical with the phylogenetic tree reconstructed by BI based on PCG (Fig.
Phylogenetic tree produced by maximum likelihood and Bayesian inference analyses based on the 13 PCG and the AA data set A phylogenetic tree produced by Bayesian inference analyses based on 13 PCG data set B phylogenetic tree produced by maximum likelihood analyses based on the AA data set C phylogenetic tree produced by maximum likelihood analyses based on the 13 PCG data set.
In the BI topology based on the AA data set, A. arizonensis and M. eriococci formed a clade representing Tetracneminae. The remaining six species formed a monophyletic clade representing Encyrtinae. In Encyrtinae, Encyrtus formed a monophyletic clade as a sister group to the clade formed by O. plautus and D. aligarhensis. Encyrtus sasakii and E. rhodooccisiae are most closely related in this monophyletic clade. In the BI topology based on the PCG data set, E. eulecaniumiae and E. sasakii form a sister group then to E. rhodooccisiae. Encyrtus infelix was the first diverged in Encyrtus.
Pairwise breakpoint distances (PBD) between the mitochondrial genomes of each species in Encyrtidae were calculated using the web server CREx, and heatmaps were constructed (Fig.
In Encyrtidae, the gene rearrangement processes can be summarized into four rearrangement mechanisms, which can also be applied to Chalcidoidea (Fig.
These four gene rearrangements can be applied to most of Chalcidoidea. When these tRNA gene rearrangement patterns were mapped on the estimated phylogenetic tree, the gene order of mitochondrial genome may resolve some contentious evolutionary questions (
As the sampling diversity of Encyrtidae is limited, it cannot completely solve the main classification question. Considering the limited research on taxonomic relationships inference based on molecular data, a comprehensive comparison of species morphology and genetic characteristics is needed to better understand the phylogenetic relationship of Encyrtidae. If more mitochondrial genomes are sequenced, the accuracy of phylogenetic relationships could be improved. We hope that future studies will combine morphology with more data sets from the mitochondrial genome to provide sufficient evidence for the phylogenetic relationship of the Encyrtidae.
In this study, we sequenced the nearly complete mitogenome of O. plautus that contains 37 genes and one control region. The nucleotide composition, codon usage, RNA structures, and protein-coding genes evolution were analyzed. The mitogenome genome of O. plautus reveals the phylogenetic relationship of Encyrtidae for the first time. In addition, the regularity of gene rearrangement within Encyrtidae is discussed. The results of this study will contribute to further studies on evolutionary relationships within Encyrtidae.
This work was supported by the National Natural Science Foundation of China (grant no. 32100352, 32100355), the Natural Science Fund of Anhui Province (grant no. 1908085QC93), the Natural Science Foundation of Universities of Anhui Province (grant no. KJ2020A0094), the Major Science and Technology Projects in Anhui Province (grant no. 202003a06020009), the National Science and Technology Fundamental Resources Investigation Program of China (grant no. 2019FY101800) and the Natural Science Foundation of Anhui Normal University (grant no. 2020XJ19).