Research Article |
Corresponding author: Qing Zhao ( zhaoqing86623@163.com ) Academic editor: Wenjun Bu
© 2023 Wang Jia, Jiufeng Wei, Minmin Niu, Hufang Zhang, Qing Zhao.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Jia W, Wei J, Niu M, Zhang H, Zhao Q (2023) The complete mitochondrial genome of Aeschrocoris tuberculatus and A. ceylonicus (Hemiptera, Pentatomidae) and its phylogenetic implications. ZooKeys 1160: 145-167. https://doi.org/10.3897/zookeys.1160.100818
|
Aeschrocoris tuberculatus and A. ceylonicus (Hemiptera, Pentatomidae, Pentatominae) are mainly distributed in southern China, India, Myanmar, and Sri Lanka. Both species are also common agricultural pests. However, only the morphology of the genus Aeschrocoris has previously been studied, and molecular data have been lacking. In this study, the whole mitochondrial genomes of A. tuberculatus and A. ceylonicus are and annotated. The lengths of the complete mitochondrial genomes of the two species are 16,134 bp and 16,142 bp, respectively, and both contain 37 typical genes, including 13 protein-coding genes (PCGs), two ribosomal RNA genes (rRNAs), 22 transfer RNA genes (tRNAs), and a control region. The mitochondrial genome structure, gene order, nucleotide composition, and codon usage of A. tuberculatus and A. ceylonicus are consistent with those of typical Pentatomidae. Most PCGs of both species use ATN as the start codon, except atp8, nad1, and cox1, which use TTG as the start codon. cox1, cox2, and atp6 use a single T, and nad1 use TAG as the stop codon; the remaining PCGs have TAA as the stop codon. The A+T contents of the two species are 73.86% and 74.08%, respectively. All tRNAs have a typical cloverleaf structure, with the exception of trnS1, which lacks a dihydrouridine arm. The phylogenetic tree is reconstructed using the maximum-likelihood method based on the newly obtained mitochondrial genome sequences and 87 existing mitochondrial genomes of Pentatomoidea from the NCBI database and two species of Lygaeoidea as outgroups. The phylogenetic trees strongly support the following relationships: (Urostylididae + ((Acanthosomatidae + ((Cydnidae + (Dinidoridae + Tessaratomidae)) + (Scutelleridae + Plataspidae))) + Pentatomidae). This study enriches the mitochondrial genome database of Pentatomoidea and provides a reference for further phylogenetic studies.
Mitogenome, Pentatomoidea, phylogenetic analysis
The insect mitochondrial genome is a circular double-stranded DNA molecule with a length of about 16–18 kb, which code 37 genes: 13 protein-coding genes (PCGs), two ribosomal RNA (rRNA) genes, and 22 transfer RNA (tRNA) genes (
Pentatomoidea (Hemiptera, Heteroptera, Pentatomomorpha) consists of more than 8,000 species in 18 families, of which Pentatomidae is the largest family containing 940 genera and about 5,000 species (
The tribe Aeschrocorini was first proposed by
In this study, we analyze the mitochondrial genomes of A. tuberculatus and A. ceylonicus in detail, including genome structure, nucleotide composition, and codon usage. Meanwhile, we also construct the genome structure of RNA. In addition, we analyze the phylogenetic relationship of eight families of Pentatomoidea and explore the phylogenetic location of these two species. The results of this study will provide a reference for phylogenetic analyses and identification of the Pentatomoidea.
Adult specimens of Aeschrocoris tuberculatus and A. ceylonicus were collected from Baihua Ling (Baoshan City, Yunnan Province, China; 25°16'43"N, 98°48'12"E) on 13 August 2015 and from Guanlan Ting (Taohua Island, Zhoushan City, Zhejiang Province, China; 29°50'31"N, 122°14'13"E) on 4 August 2016. All samples were immediately placed in anhydrous ethanol and stored in a refrigerator at –25 °C until DNA was extracted. The species were identified by Qing Zhao.
Whole-genome DNA was extracted from the thoracic muscle of the samples using the Genomic DNA Extraction Kit (BGI, Wuhan, Hubei, China). Concentrations of samples were detected using Qubit Fluorometer and microplate reader (
When the assembly was complete, the complete mitogenomes were manually annotated using Geneious v. 11.0 software (
The base composition, codon usage (RSCU), and amino acid composition of the mitogenome were analyzed using MEGA v. 11.0. The skew of the nucleotide composition was calculated as follows: AT-skew = (A – T) / (A + T) and GC-skew = (G – C) / (G + C) (
In this study, we used the two newly sequenced species, 87 species from other eight families of Pentatomoidea, and two species (Geocoris pallidipennis and Kleidocerys resedae as the outgroup) from Lygaeoidea to analyze the phylogenetic position of A. tuberculatus and A. ceylonicus and the phylogenetic relationships within Pentatomoidea (Table
Classification | Family | Species | Accession number | Reference |
---|---|---|---|---|
Outgroup | ||||
Lygaeoidea | Lygaeidae | Geocoris pallidipennis | EU427336 |
|
Kleidocerys resedae | KJ584365 |
|
||
Ingroup | ||||
Pentatomoidea | Acanthosomatidae | Acanthosoma labiduroides | JQ743670 |
|
Anaxandra taurina | NC042801 |
|
||
Sastragala edessoides | JQ743676 |
|
||
Sastragala esakii | MW847247 |
|
||
Cydnidae | Adrisa magna | NC042429 |
|
|
Aethus nigritus | MW847231 |
|
||
Macroscytus gibbulus | EU427338 |
|
||
Macroscytus subaeneus | MW847241 |
|
||
Scoparipes salvazai | NC042800 |
|
||
Dinidoridae | Coridius brunneus | MW899158 | Unpublished | |
Cyclopelta parva | NC037739 |
|
||
Megymenum gracilicorne | NC042810 |
|
||
Pentatomidae | Aeschrocoris ceylonicus | OP526368 | This study | |
Aeschrocoris tuberculatus | OP526367 | This study | ||
Arma custos | NC051562 |
|
||
Anaxilaus musgravei | NC061538 | Unpublished | ||
Brachymna tenuis | NC042802 |
|
||
Carbula sinica | NC037741 |
|
||
Catacanthus incarnatus | NC042804 |
|
||
Caystrus obscurus | NC042805 |
|
||
Cazira horvathi | NC042817 |
|
||
Dalpada cinctipes | NC058967 |
|
||
Dalsira scabrata | NC037374 |
|
||
Deroploa parva | NC063299 | Unpublished | ||
Dinorhynchus dybowskyi | NC037724 |
|
||
Dolycoris baccarum | NC020373 |
|
||
Eocanthecona furcellata | MZ440302 | Unpublished | ||
Eocanthecona thomsoni | NC042816 |
|
||
Eurydema dominulus | NC044762 |
|
||
Pentatomoidea | Pentatomidae | Eurydema gebleri | NC027489 |
|
Eurydema liturifera | NC044763 |
|
||
Eurydema maracandica | NC037042 |
|
||
Eurydema oleracea | NC044764 |
|
||
Eurydema qinlingensis | NC044765 | Unpublished | ||
Eurydema ventralis | MG584837 | Unpublished | ||
Erthesina fullo | NC042202 |
|
||
Eysarcoris aeneus | MK841489 |
|
||
Eysarcoris annamita | MW852483 |
|
||
Eysarcoris gibbosus | MW846868 |
|
||
Eysarcoris guttigerus | NC047222 |
|
||
Eysarcoris montivagus | MW846867 |
|
||
Eysarcoris rosaceus | MT165687 |
|
||
Glaucias dorsalis | NC058968 |
|
||
Gonopsis affinis | NC036745 |
|
||
Graphosoma rubrolineatum | NC033875 |
|
||
Halyomorpha halys | NC013272 |
|
||
Hippotiscus dorsalis | NC058969 |
|
||
Hoplistodera incisa | NC042799 |
|
||
Menida violacea | NC042818 |
|
||
Nezara viridula | NC011755 |
|
||
Neojurtina typica | NC058971 |
|
||
Palomena viridissima | NC050166 | Unpublished | ||
Pentatoma metallifera | NC058972 |
|
||
Pentatoma rufipes | MT861131 |
|
||
Pentatoma semiannulata | NC053653 | Unpublished | ||
Picromerus griseus | NC036418 |
|
||
Picromerus lewisi | NC058610 |
|
||
Placosternum urus | NC042812 |
|
||
Plautia crossota | NC057080 |
|
||
Plautia fimbriata | NC042813 |
|
||
Plautia lushanica | NC058973 |
|
||
Priassus spiniger | OK546352 | Unpublished | ||
Scotinophara lurida | NC042815 |
|
||
Tholosanus proximus | NC063300 | Unpublished | ||
Zicrona caerulea | NC058303 |
|
||
Plataspidae | Brachyplatys subaeneus | MW847232 |
|
|
Calacta lugubris | MW847233 |
|
||
Coptosoma bifaria | EU427334 |
|
||
Coptosoma variegatum | OP123035 |
|
||
Megacopta bituminata | OP123020 |
|
||
Megacopta caliginosa | OP123022 |
|
||
Megacopta centronubila | OP123024 |
|
||
Megacopta cribraria | JF288758 | Unpublished | ||
Megacopta cribriella | OP123025 |
|
||
Megacopta distanti | OP123028 |
|
||
Megacopta horvathi | OP123029 |
|
||
Megacopta lobata | OP123031 |
|
||
Scutelleridae | Cantao ocellatus | MF497713 |
|
|
Chrysocoris stollii | NC051942 | Unpublished | ||
Eurygaster testudinaria | NC042808 |
|
||
Poecilocoris druraei | MW847246 |
|
||
Tessaratomidae | Dalcantha dilatata | JQ910981 |
|
|
Eusthenes cupreus | NC022449 |
|
||
Mattiphus splendidus | NC053743 |
|
||
Pycanum ochraceum | MW899159 |
|
||
Tessaratoma papillosa | NC037742 |
|
||
Urostylididae | Urostylis flavoannulata | NC037747 |
|
|
Urolabida histrionica | MW847249 |
|
||
Urochela quadrinotata | NC020144 |
|
To determine whether the sequences contained phylogenetic information, we tested nucleotide substitution saturation and plotted transition and transversion rates against the TN93 distances for two datasets: all codon positions of the 13 PCGs (PCG123) and first and second codon positions of PCGs (PCG12) using DAMBE to further validate the feasibility of constructing a phylogenetic tree (
The complete mitogenomes of Aeschrocoris tuberculatus (16,134 bp, GenBank accession no. OP56367) and A. ceylonicus (14,142 bp, GenBank accession no. OP56368) were obtained (Fig.
Organization of the mitochondrial genomes of Aeschrocoris tuberculatus and A. ceylonicus.
A. tuberculatus | A. ceylonicus | |||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
Gene | Strand | Anticodon | Position | Size (bp) | Initiation codon | Stop codon | Intergenic nucleotide | Position | Size (bp) | Initiation codon | Stop codon | Intergenic nucleotide |
trnI | J | GAT | 1–71 | 71 | 0 | 1–71 | 71 | 0 | ||||
trnQ | N | TTG | 80–148 | 69 | 8 | 79–147 | 69 | 7 | ||||
trnM | J | CAT | 154–224 | 71 | 5 | 152–222 | 71 | 4 | ||||
nad2 | J | 225–1208 | 984 | ATA | TAA | 0 | 226–1206 | 981 | ATA | TAA | 3 | |
trnW | J | TCA | 1226–1292 | 67 | 17 | 1223–1290 | 68 | 16 | ||||
trnC | N | GCA | 1285–1352 | 68 | –8 | 1283–1350 | 68 | –8 | ||||
trnY | N | GTA | 1366–1434 | 69 | 13 | 1364–1429 | 66 | 13 | ||||
cox1 | J | 1450–2986 | 1537 | TTG | T | 15 | 1445–2981 | 1537 | TTG | T | 15 | |
trnL2 | J | TAA | 2987–3053 | 67 | 0 | 2982–3048 | 67 | 0 | ||||
cox2 | J | 3054–3732 | 679 | ATA | T | 0 | 3049–3727 | 679 | ATA | T | 0 | |
trnK | J | CTT | 3733–3804 | 72 | 0 | 3728–3799 | 72 | 0 | ||||
trnD | J | GTC | 3808–3873 | 66 | 3 | 3803–3869 | 67 | 3 | ||||
atp8 | J | 3874–4035 | 162 | TTG | TAA | 0 | 3870–4031 | 162 | TTG | TAA | 0 | |
atp6 | J | 4029–4701 | 673 | ATG | T | –7 | 4025–4697 | 673 | ATG | T | –7 | |
cox3 | J | 4702–5490 | 789 | ATG | TAA | 0 | 4698–5486 | 789 | ATG | TAA | 0 | |
trnG | J | TCC | 5490–5555 | 66 | –1 | 5486–5550 | 65 | –1 | ||||
nad3 | J | 5556–5909 | 354 | ATT | TAA | 0 | 5551–5904 | 354 | ATT | TAA | 0 | |
trnA | J | TGC | 5914–5982 | 69 | 4 | 5909–5977 | 69 | 4 | ||||
trnR | J | TCG | 5991–6058 | 68 | 8 | 5988–6056 | 69 | 10 | ||||
trnN | J | GTT | 6064–6131 | 68 | 5 | 6061–6128 | 68 | 4 | ||||
trnS1 | J | GCT | 6133–6202 | 70 | 1 | 6130–6199 | 70 | 1 | ||||
trnE | J | TTC | 6207–6275 | 69 | 4 | 6200–6269 | 70 | 0 | ||||
trnF | N | GAA | 6274–6341 | 68 | –2 | 6268–6335 | 68 | –2 | ||||
nad5 | N | 6346–8052 | 1707 | ATG | TAA | 4 | 6340–8046 | 1707 | ATG | TAA | 4 | |
trnH | N | GTG | 8055–8123 | 69 | 2 | 8049–8117 | 69 | 2 | ||||
nad4 | N | 8127–9455 | 1329 | ATG | TAA | 3 | 8121–9449 | 1329 | ATG | TAA | 3 | |
nad4l | N | 9449–9736 | 288 | ATT | TAA | –7 | 9443–9730 | 288 | ATT | TAA | –7 | |
trnT | J | TGT | 9739–9807 | 69 | 2 | 9733–9801 | 69 | 2 | ||||
trnP | N | TGG | 9808–9871 | 64 | 0 | 9802–9865 | 64 | 0 | ||||
nad6 | J | 9880–10353 | 474 | ATA | TAA | 8 | 9868–10347 | 480 | TTG | TAA | 2 | |
cytb | J | 10346–11482 | 1137 | ATG | TAA | –8 | 10340–11476 | 1137 | ATG | TAA | –8 | |
trnS2 | J | TGA | 11482–11550 | 69 | –1 | 11476–11534 | 68 | –1 | ||||
nad1 | N | 11576–12499 | 924 | TTG | TAG | 25 | 11568–12491 | 924 | TTG | TAG | 33 | |
trnL1 | N | TAG | 12500–12565 | 66 | 0 | 12492–12557 | 66 | 0 | ||||
rrnL | N | 12566–13874 | 1309 | 0 | 12558–13859 | 1302 | 0 | |||||
trnV | N | TAC | 13875–13942 | 68 | 0 | 13860–13927 | 68 | 0 | ||||
rrnS | N | 13943–14751 | 809 | 0 | 13928–14740 | 813 | 0 | |||||
OH | J | 14752–16134 | 1383 | 0 | 14741–16142 | 1402 | 0 |
The nucleotide composition of two species shows the predominance of A+T in the complete mitochondrial genome (Table
Nucleotide composition of the mitogenomes of Aeschrocoris tuberculatus and A. ceylonicus.
A. tuberculatus | ||||||||
Feature | Length (bp) | A% | C% | G% | T% | A+T% | AT-skew | GC-skew |
Whole genome | 16134 | 42.32 | 15.20 | 10.94 | 31.55 | 73.86 | 0.15 | –0.16 |
PCGs | 11036 | 32.63 | 13.54 | 13.37 | 40.46 | 73.09 | –0.11 | 0.01 |
tRNA | 1503 | 38.39 | 10.18 | 13.71 | 37.72 | 76.11 | 0.01 | 0.15 |
rRNA | 2118 | 32.39 | 8.40 | 15.63 | 43.58 | 75.97 | –0.15 | 0.30 |
Control region | 1383 | 38.41 | 14.49 | 11.52 | 35.58 | 77.99 | 0.04 | –0.11 |
A. ceylonicus | ||||||||
Feature | Length (bp) | A% | C% | G% | T% | A+T% | AT-skew | GC-skew |
Whole genome | 16142 | 42.36 | 14.94 | 10.98 | 31.72 | 74.08 | 0.14 | –0.15 |
PCGs | 11040 | 32.40 | 13.65 | 13.56 | 40.39 | 72.79 | –0.11 | 0.00 |
tRNA | 1502 | 38.08 | 9.85 | 13.52 | 38.55 | 76.63 | –0.01 | 0.16 |
rRNA | 2115 | 32.77 | 8.32 | 15.08 | 43.83 | 76.60 | –0.14 | 0.29 |
Control region | 1402 | 39.40 | 12.58 | 10.06 | 37.96 | 77.35 | 0.02 | –0.11 |
The composition of nucleotides is also reflected in the use of codons. The RSCUs of the two species show some differences and are compared to each other in Fig.
The length of PCGs in A. tuberculatus and A. ceylonicus is 11,036 bp and 11,040 bp, respectively. For the 13 PCGs, nine (cox1, cox2, cox3, atp6, atp8, nad2, nad3, nad6, and cytb) are encoded on the major strand (J-strand), whereas the other four are encoded on the minor strand (N-strand). The typical ATN (five with ATG, three with ATA, and two with ATT) are used as the start codon in most PCGs of these species, except for the atp8, nad1, and cox1 genes, which use TTG as the start codon. cox1, cox2, and atp6 sequences terminate with a single T, the terminal codon of nad1 sequences is TAG, and the stop codon for the remaining genes was TAA.
In addition, we calculated non-synonymous substitutions (Ka), synonymous substitutions (Ks), and Ka/Ks ratios for the 13 PCGs of the Pentatomoidea (Fig.
The total lengths of the tRNAs of A. tuberculatus and A. ceylonicus are 1,503 bp and 1,502 bp, respectively. And the length of tRNA genes are from 64 bp to 72 bp. Fourteen genes (trnA, trnE, trnD, trnG, trnK, trnI, trnL2, trnM, trnN, trnR, trnS1, trnS2, trnT, and trnW) are located on the J-strand, and other eight genes on the N-strand. Only trnS1 lacks a dihydrouridine (DHU) arm; the other tRNA genes all have the classic cloverleaf secondary structure. In addition to the typical base pairs (A-U and G-C), some wobble G-U pairs appear in these secondary structures, which can form stable chemical bonds between G and U (Fig.
The rrnL and rrnS genes have the same situation in the two species. The rrnL gene is located between trnL1 (CUN) and trnV, and the rrnS gene is located between trnV and the control region; they are encoded on the N-strand. The lengths of the two genes in A. tuberculatus are 1,309 bp (rrnL) and 809 bp (rrnS); the complete secondary structures are shown in Figs
The control is the main regulatory region for replication and transcription of the mitochondrial genome (
Before constructing the phylogenetic tree, we evaluated the substitution saturation of the PCG123 and PCG12 datasets. The results show that the Xia saturation index (Iss) is below the critical values for a symmetric (Iss.cSym) and asymmetric (Iss.cAsym) topology (Fig.
Our analysis of the heterogeneity of the base composition in the two datasets show that the heterogeneity of PCG123 is higher than in PCG12, thus indicating a higher heterogeneity of the third site of the codon. The degree of heterogeneity between the two datasets is certainly consistent with the construction of a phylogenetic tree, which can be used for phylogenetic analysis (Fig.
AliGROOVE analysis of 89 Pentatomoidea species a based on PCG123 b based on PCG12. The mean similarity score between sequences is represented by colored squares, based on AliGROOVE scores ranging from –1, which indicates a great difference in rates from the remainder of the data set (= heterogeneity, red color) to +1, which indicates rates that matched all other comparisons (blue color, as in this case).
We constructed phylogenetic trees of Pentatomoidea based on the two datasets using the ML method (Figs
In this study, we sequenced and annotated the complete mitogenomes of Aeschrocoris tuberculatus and A. ceylonicus using NGS technology and Geneious v. 11.0. Our analysis comparing of the mitochondrial genomes of the two species show that the gene arrangement is highly conserved, which is consistent with other published mitochondrial genomes of Hemiptera (
In most Pentatomidae mitochondrial genomes, only cox1 has TTG as its start codon, and the remaining 12 PCGs use ATN as their start codon (
The composition of the four bases in A. tuberculatus and A. ceylonicus is A>T>C>G. There is a clear AT preference in nucleotide composition. Most tRNAs have the typical cloverleaf secondary structure as observed in Hemiptera. However, the lack of a DHU arm in the trnS1 is common in hemipteran mitogenomes (
Through the topological structure of the trees, the clade including Urostylididae is found to be the earliest clade lineage. It forms a sister group to the other families. The relationship of (Cydnidae + (Dinidoridae + Tessaratomidae)) was recovered in our phylogenetic results with high support; these results are consistent with previous studies (
We studied the genus Aeschrocoris at a molecular level for the first time and preliminarily identified its taxonomic position and evolution in phylogenetic relationships. This study not only discusses the relationships among families, but it also adds new molecular data for Pentatomidae. These results demonstrate that mitochondrial genomes can effectively reveal the phylogenetic relationships among differing taxonomic hierarchies. We should sequence more mitochondrial genes to provide greater evidence for exploring the phylogenetic relationships among taxa.
This research was funded by the National Science Foundation Project of China (no. 31872272 and 32100370); the Research Project Supported by Shanxi Scholarship Council of China (no. 2020-064 and 2020-065), Natural Science Research General Project of Shanxi Province (No.202103021224331 and 202103021224132).