Research Article |
Corresponding author: Qing Zhao ( zhaoqing86623@163.com ) Academic editor: Laurence Livermore
© 2021 Ling Zhao, Jiufeng Wei, Wanqing Zhao, Chao Chen, Xiaoyun Gao, Qing Zhao.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Zhao L, Wei J, Zhao W, Chen C, Gao X, Zhao Q (2021) The complete mitochondrial genome of Pentatoma rufipes (Hemiptera, Pentatomidae) and its phylogenetic implications. ZooKeys 1042: 51-72. https://doi.org/10.3897/zookeys.1042.62302
|
Pentatoma rufipes (Linnaeus, 1758) is an important agroforestry pest widely distributed in the Palaearctic region. In this study, we sequence and annotate the complete mitochondrial genome of P. rufipes and reconstruct the phylogenetic trees for Pentatomoidea using existing data for eight families published in the National Center for Biotechnology Information database. The mitogenome of P. rufipes is 15,887-bp-long, comprising 13 protein-coding genes, 22 transfer RNA genes, two ribosomal RNA genes, and a control region, with an A+T content of 77.7%. The genome structure, gene order, nucleotide composition, and codon usage of the mitogenome of P. rufipes were consistent with those of typical Hemiptera insects. Among the protein-coding genes of Pentatomoidea, the evolutionary rate of ATP8 was the fastest, and COX1 was found to be the most conservative gene in the superfamily. Substitution saturation assessment indicated that neither transition nor transversion substitutions were saturated in the analyzed datasets. Phylogenetic analysis using the Bayesian inference method showed that P. rufipes belonged to Pentatomidae. The node support values based on the dataset concatenated from protein-coding and RNA genes were the highest. Our results enrich the mitochondrial genome database of Pentatomoidea and provide a reference for further studies of phylogenetic systematics.
Mitogenome, Pentatomoidea, phylogenetic analysis
The mitochondrion is a semi-autonomous organelle with its own genetic material, known as the mitochondrial genome (mitogenome) (
Pentatomoidea, one of the most commonly encountered groups in Hemiptera, includes 1,410 genera and 8,042 species which are widely distributed worldwide (
Pentatoma rufipes (Linnaeus, 1758) (Hemiptera, Heteroptera, Pentatomidae) is a medium-sized to large, dark brown insect with reddish-orange spots and bright orange legs (
In this study, we sequenced and annotated the mitogenome of P. rufipes and analyzed its mitogenome in detail, including the genome structure, nucleotide composition, and codon usage, and constructed RNA secondary structures. In addition, we combined the complete mitogenome of P. rufipes with the existing data for the eight families of Pentatomoidea to explore the phylogenetic position of P. rufipes.
Adult Pentatoma rufipes specimens were collected in Baiji Hill (Tonghua City, Jilin Province, China; 41°58.14'N, 126°06.58'E) on 24 July 2015. All samples were immediately placed in absolute ethanol and stored in a freezer at –20 °C until DNA extraction. Specimen identification was performed by Qing Zhao. The voucher specimen is maintained at the Institute of Entomology of Shanxi Agricultural University (voucher number: SXAU 007; Taigu, China). The complete mitogenome of P. rufipes has been submitted to GenBank (accession number: MT861131).
Whole-genome DNA was extracted from the thoracic muscle of adult samples using the Genomic DNA Extraction Kit (Sangon Biotech, Shanghai, China). The mitogenomes were sequenced using the whole-genome shotgun method on the Illumina Miseq platform (Personalbio, Shanghai, China), with 400-bp inserts and paired-end model. A5-miseq v. 20150522 (
After assembly, the complete mitogenome was manually annotated using Geneious v. 8.1.4 software (
In this study, we selected the mitogenomes of P. rufipes, representative species from eight other Pentatomoidea families, and two Coreoidea species (outgroup) to analyze the phylogenetic position of P. rufipes and the phylogenetic relationships within Pentatomoidea. All species included in this analysis are listed in Table
Classificationstatus | Family | Species | Accession number |
Outgroup | |||
Coreoidea | Coreidae | Hydaropsis longirostris | EU427337 |
Anoplocnemis curvipes | NC_035509 | ||
Ingroup | |||
Pentatomoidea | Acanthosomatidae | Acanthosoma labiduroides | JQ743670 |
Sastragala edessoides | JQ743676 | ||
Anaxandra taurina | NC_042801 | ||
Cydnidae | Macroscytus gibbulus | NC_012457 | |
Adrisa magna | NC_042429 | ||
Scoparipes salvazai | NC_042800 | ||
Dinidoridae | Cyclopelta parva | KY069962 | |
Megymenum gracilicorne | NC_042810 | ||
Pentatomidae | Halyomorpha halys | NC_013272 | |
Eurydema gebleri | NC_027489 | ||
Graphosoma rubrolineatum | NC_033875 | ||
Gonopsis affinis | NC_036745 | ||
Dinorhynchus dybowskyi | NC_037724 | ||
Plautia fimbriata | NC_042813 | ||
Pentatoma rufipes | MT861131 | ||
Plataspidae | Coptosoma bifaria | EU427334 | |
Megacopta cribraria | NC_015342 | ||
Scutelleridae | Cantao ocellatus | NC_042803 | |
Eurygaster testudinaria | NC_042808 | ||
Tessaratomidae | Dalcantha dilatata | JQ910981 | |
Eusthenes cupreus | NC_022449 | ||
Tessaratoma papillosa | NC_037742 | ||
Urostylididae | Urostylis flavoannulata | NC_037747 |
To determine if the sequences contained phylogenetic information, we tested nucleotide substitution saturation, and plotted transition and transversion substitutions against the TN93 distance for all datasets before reconstructing the phylogenetic trees using DAMBE v. 4.5.32 (
The mitochondrial genome of Pentatoma rufipes is 15,887-bp-long and contains a control region and 37 genes comprising 13 PCGs, 22 tRNA genes and two rRNA genes (Fig.
Gene | Strand | Position | Anticodon | Size(bp) | Start codon | Stop codon | Intergenetic nucleotides* |
---|---|---|---|---|---|---|---|
trnI | J | 1–67 | GAT | 67 | |||
trnQ | N | 65–134 | TTG | 70 | –3 | ||
trnM | J | 137–205 | CAT | 69 | 2 | ||
ND2 | J | 206–1189 | 984 | ATT | TAA | 0 | |
trnW | J | 1198–1265 | TCA | 68 | 8 | ||
trnC | N | 1258–1321 | GCA | 64 | –8 | ||
trnY | N | 1331–1397 | GTA | 67 | 9 | ||
COX1 | J | 1407–2943 | 1537 | TTG | T | 9 | |
trnL2UUR | J | 2944–3008 | TAA | 65 | 0 | ||
COX2 | J | 3009–3687 | 679 | ATA | T | 0 | |
trnK | J | 3688–3761 | CTT | 74 | 0 | ||
trnD | J | 3761–3822 | GTC | 62 | –1 | ||
ATP8 | J | 3823–3981 | 159 | TTG | TAA | 0 | |
ATP6 | J | 3975–4649 | 675 | ATG | TAA | –7 | |
COX3 | J | 4652–5440 | 789 | ATG | TAA | 2 | |
trnG | J | 5446–5510 | TCC | 65 | 5 | ||
ND3 | J | 5511–5864 | 354 | ATC | TAA | 0 | |
trnA | J | 5873–5943 | TGC | 71 | 8 | ||
trnR | J | 5960–6024 | TCG | 65 | 16 | ||
trnN | J | 6033–6101 | GTT | 69 | 8 | ||
trnS1AGN | J | 6101–6170 | ACT | 70 | –1 | ||
trnE | J | 6171–6238 | TTC | 68 | 0 | ||
trnF | N | 6237–6301 | GAA | 65 | –2 | ||
ND5 | N | 6301–8007 | 1707 | ATT | TAA | –1 | |
trnH | N | 8009–8076 | GTG | 68 | 1 | ||
ND4 | N | 8079–9410 | 1332 | ATG | TAA | 2 | |
ND4L | N | 9404–9691 | 288 | ATT | TAA | –7 | |
trnT | J | 9694–9758 | TGT | 65 | 2 | ||
trnP | N | 9759–9820 | TGG | 62 | 0 | ||
ND6 | J | 9823–10299 | 477 | ATG | TAA | 2 | |
CYTB | J | 10304–11440 | 1137 | ATG | TAA | 4 | |
trnS2UCN | J | 11456–11524 | TGA | 69 | 15 | ||
ND1 | N | 11548–12477 | 930 | ATA | TAA | 23 | |
trnL1CUN | N | 12472–12539 | TAG | 68 | –6 | ||
16S rRNA | N | 12540–13816 | 1277 | 0 | |||
trnV | N | 13817–13886 | TAC | 70 | 0 | ||
12S rRNA | N | 13887–14707 | 821 | 0 | |||
CR | 14708–15887 | 1180 | 0 |
The base content and skewness of the genes in the P. rufipes mitogenome is shown in Table
Nucleotide composition and skewness of the Pentatoma rufipes mitochondrial genome.
Feature | Length(bp) | A% | C% | G% | T% | A+T% | AT-skew | GC-skew |
---|---|---|---|---|---|---|---|---|
Whole genome | 15737 | 42.0 | 12.4 | 9.9 | 35.7 | 77.7 | 0.08 | –0.11 |
PCGs | 11046 | 34.2 | 11.1 | 11.8 | 42.9 | 77.1 | –0.11 | 0.03 |
PCG-J | 6800 | 37.2 | 12.6 | 11.7 | 38.5 | 75.7 | –0.02 | –0.04 |
PCG-N | 4246 | 29.4 | 8.8 | 11.9 | 49.9 | 79.3 | –0.26 | 0.15 |
tRNA genes | 1460 | 39.7 | 10.0 | 12.3 | 38.0 | 77.7 | 0.02 | 0.10 |
tRNA genes-J | 936 | 40.6 | 11.0 | 11.1 | 37.3 | 77.9 | 0.04 | 0.01 |
tRNA genes-N | 524 | 38.0 | 8.2 | 14.4 | 39.3 | 77.3 | –0.02 | 0.27 |
rRNA genes | 2053 | 35.6 | 7.6 | 12.6 | 44.2 | 79.8 | –0.11 | 0.25 |
Control region | 1142 | 38.3 | 13.6 | 7.6 | 40.4 | 78.7 | –0.03 | –0.28 |
The preference for nucleotide composition is also reflected in codon use. The relative synonymous codon usage values for the P. rufipes mitogenome are summarized in Figure
Amino acid | Codon | N | RSCU | N+ | RSCU+ | N– | RSCU– |
---|---|---|---|---|---|---|---|
Phe | UUU | 260 | 1.7 | 118 | 1.49 | 142 | 1.92 |
UUC | 46 | 0.3 | 40 | 0.51 | 6 | 0.08 | |
Leu2 | UUA | 440 | 4.92 | 238 | 4.89 | 202 | 4.95 |
UUG | 16 | 0.18 | 6 | 0.12 | 10 | 0.24 | |
Leu1 | CUU | 47 | 0.53 | 19 | 0.39 | 28 | 0.69 |
CUC | 1 | 0.01 | 1 | 0.02 | 0 | 0 | |
CUA | 30 | 0.34 | 26 | 0.53 | 4 | 0.1 | |
CUG | 3 | 0.03 | 2 | 0.04 | 1 | 0.02 | |
Ile | AUU | 382 | 1.83 | 255 | 1.8 | 127 | 1.91 |
AUC | 35 | 0.17 | 29 | 0.2 | 6 | 0.09 | |
Met | AUA | 274 | 1.83 | 179 | 1.86 | 95 | 1.78 |
AUG | 25 | 0.17 | 13 | 0.14 | 12 | 0.22 | |
Val | GUU | 80 | 1.99 | 33 | 1.43 | 47 | 2.72 |
GUC | 5 | 0.12 | 1 | 0.04 | 4 | 0.23 | |
GUA | 68 | 1.69 | 51 | 2.22 | 17 | 0.99 | |
GUG | 8 | 0.2 | 7 | 0.3 | 1 | 0.06 | |
Ser2 | UCU | 95 | 2.11 | 31 | 1.24 | 64 | 3.18 |
UCC | 9 | 0.2 | 6 | 0.24 | 3 | 0.15 | |
UCA | 111 | 2.46 | 76 | 3.04 | 35 | 1.74 | |
UCG | 1 | 0.02 | 0 | 0 | 1 | 0.05 | |
Pro | CCU | 74 | 2.31 | 48 | 2.04 | 26 | 3.06 |
CCC | 13 | 0.41 | 9 | 0.38 | 4 | 0.47 | |
CCA | 41 | 1.28 | 37 | 1.57 | 4 | 0.47 | |
CCG | 0 | 0 | 0 | 0 | 0 | 0 | |
Thr | ACU | 60 | 1.47 | 42 | 1.33 | 18 | 1.95 |
ACC | 11 | 0.27 | 5 | 0.16 | 6 | 0.65 | |
ACA | 91 | 2.23 | 78 | 2.48 | 13 | 1.41 | |
ACG | 1 | 0.02 | 1 | 0.03 | 0 | 0 | |
Ala | GCU | 61 | 1.88 | 42 | 1.77 | 19 | 2.17 |
GCC | 11 | 0.34 | 9 | 0.38 | 2 | 0.23 | |
GCA | 54 | 1.66 | 44 | 1.85 | 10 | 1.14 | |
GCG | 4 | 0.12 | 0 | 0 | 4 | 0.46 | |
Tyr | UAU | 170 | 1.85 | 67 | 1.7 | 103 | 1.96 |
UAC | 14 | 0.15 | 12 | 0.3 | 2 | 0.04 | |
His | CAU | 59 | 1.66 | 45 | 1.58 | 14 | 2 |
CAC | 12 | 0.34 | 12 | 0.42 | 0 | 0 | |
Gln | CAA | 47 | 1.84 | 35 | 2 | 12 | 1.5 |
CAG | 4 | 0.16 | 0 | 0 | 4 | 0.5 | |
Asn | AAU | 179 | 1.8 | 114 | 1.74 | 65 | 1.91 |
AAC | 20 | 0.2 | 17 | 0.26 | 3 | 0.09 | |
Lys | AAA | 102 | 1.79 | 73 | 1.9 | 29 | 1.57 |
AAG | 12 | 0.21 | 4 | 0.19 | 8 | 0.43 | |
Asp | GAU | 63 | 1.88 | 38 | 1.81 | 25 | 2 |
GAC | 4 | 0.12 | 4 | 0.19 | 0 | 0 | |
Glu | GAA | 73 | 1.7 | 56 | 1.9 | 17 | 1.26 |
GAG | 13 | 0.3 | 3 | 0.1 | 10 | 0.74 | |
Cys | UGU | 42 | 1.71 | 12 | 1.6 | 30 | 1.76 |
UGC | 7 | 0.29 | 3 | 0.4 | 4 | 0.24 | |
Trp | UGA | 91 | 1.88 | 68 | 1.97 | 23 | 1.64 |
UGG | 6 | 0.12 | 1 | 0.03 | 5 | 0.36 | |
Arg | CGU | 13 | 0.96 | 2 | 0.23 | 11 | 2.32 |
CGC | 2 | 0.15 | 1 | 0.11 | 1 | 0.21 | |
CGA | 35 | 2.59 | 30 | 3.43 | 5 | 1.05 | |
CGG | 4 | 0.3 | 2 | 0.23 | 2 | 0.42 | |
Ser1 | AGU | 40 | 0.89 | 14 | 0.56 | 26 | 1.29 |
AGC | 5 | 0.11 | 3 | 0.12 | 2 | 0.1 | |
AGA | 96 | 2.13 | 69 | 2.76 | 27 | 1.34 | |
AGG | 4 | 0.09 | 1 | 0.04 | 3 | 0.15 | |
Gly | GGU | 64 | 1.32 | 28 | 0.9 | 36 | 2.06 |
GGC | 6 | 0.12 | 2 | 0.06 | 4 | 0.23 | |
GGA | 102 | 2.1 | 82 | 2.65 | 20 | 1.14 | |
GGG | 22 | 0.45 | 12 | 0.39 | 10 | 0.57 |
Most P. rufipes PCGs share the ATN start codon (five with ATG, three with ATT, two with ATA, and one with ATC), except for COX1 and ATP8, which start with TTG. COX1 and COX2 sequences terminate with a single T, and the stop codon for the remaining genes is TAA. The AT content (77.1%) of the 13 PCGs exceeded the GC content (22.9%), and the AT bias is moderately negative (absolute value: 0.1–0.2).
In addition, we calculated the synonymous substitutions (Ks), non-synonymous substitutions (Ka), and the Ka/Ks ratios of the 13 PCGs from Pentatomoid insects. We also compared the evolutionary rates of the 13 PCGs (Fig.
We detected 22 tRNA genes, which can transport all 20 amino acids, in the mitogenome of P. rufipes. There are two tRNAs each for leucine and serine: trnL1 (CUN) and trnL2 (UUR), and trnS1 (AGN) and trnS2 (UCN), respectively. The anticodons of trnL are TAA and TAG, and the anticodons of trnS are ACT and TGA. The 22 tRNA genes span 1,481 bp, between 62 and 74 bp in length. Although trnS1 lacks a dihydrouridine arm, the other tRNA genes all have the classic clover leaf secondary structure. In addition to the typical base pairs (A-U and G-C), some wobble G-U pairs appear in these secondary structures, which can form stable chemical bonds between G and U; In addition, atypical pairing of U-U and U-C is also found (Fig.
The two P. rufipes rRNA genes (12S rRNA and 16S rRNA) are encoded on the N-strand. The 16S rRNA gene is located between trnL1 (CUN) and trnV, which is 1,277 bp in length, and there is no gene overlap between 16S rRNA and the two tRNA genes. The 12S rRNA gene (821 bp) is located between trnV and the control region, similar to the published pentatomid mitogenomes. The base content of the rRNA genes is in the order of T (44.2%) > A (35.6%) > G (12.6%) > C (7.6%). The AT-skews are negative, and the GC-skews are positive. The complete secondary structures of the 12S rRNA and 16S rRNA genes are shown in Figures
The control region of the mitogenome of P. rufipes is located between the 12S rRNA gene and trnI. The control region is 1,180 bp long, making it the longest noncoding region in the mitogenome, and has an A + T content of 78.7%. The AT-skew and GC-skew in the control area are –0.03 and –0.28, respectively, indicating that the content of T is higher than that of A and the content of C is higher than that of G.
To eliminate the negative effect of the substitution saturation in the phylogenetic analysis, saturation tests on the three data sets were conducted. Nucleotide sequence substitution saturation is usually determined by analyzing the relationship between the transition and transversion values against the corresponding corrected genetic distance. In all tests, the Xia saturation index (Iss) was below the critical values for a symmetric (Iss.cSym) and asymmetric (Iss.cAsym) topology (Fig.
We reconstructed the phylogenetic trees of eight families in Pentatomoidea from three datasets (PCGRNA, PCG, and PCG12) using Bayesian inference method. The topological structures of the trees were similar, especially PCG and PCG12 showed similar family-level relationships (Figs
In this study, we sequenced the complete mitogenome of P. rufipes using NGS technology, revealing a mitogenome that is 15,887-bp-long containing 37 genes. The order of the 37 genes is consistent with other published mitogenome of Hemiptera (
The composition of the four bases in the P. rufipes mitogenome suggested highly unbalanced (A>T>C>G). The nucleotide composition shows an obvious AT preference, and the entire genome shows AT-skew and CG-skew. The above characteristics of mitogenome base composition of P. rufipes are ubiquitous to all sequenced species of Pentatomidae. The preference of bases composition is generally considered to be caused by asymmetric mutation and selection pressure of the four bases (
The secondary structures of tRNAs for P. rufipes is conserved and trnS1 lacks DHU arm, these features meet the character of metazoan mitochondrial genomes (Wolstenholme 1992). In addition to the typical Watson-Crick pairing (G-C and A-U), there are also some typical pairings such as U-G, U-C and U-U. Some scholars have proposed that those tRNAs with non-Watson-Crick matches can be transformed into fully functional proteins through post-transcriptional mechanisms (
The phylogenetic result suggested that there are some different topology compared to other studies, but we infer that the possible reasons are as follows: first, the number and taxon of samples selected are different. In this study, the phylogenetic relationship between Pentatoma and Plautia was relatively close, and they were far from Graphosoma. However, when the phylogenetic tree was constructed with Pentatoma semiannulata, the relationship between Pentatoma and Graphosoma was closer (
In summary, the mitogenome of P. rufipes has typical sequence structures, and the gene content, nucleotide composition, codon usage, RNA structures, and rates of PCGs evolution are similar to those of other published pentatomid genomes. The mitochondrial genome of P. rufipes reveals the phylogenetic location of Pentatoma, indicating that the mitogenome can be used to reveal phylogenetic relationships among different taxonomic levels of insects. However, more insect mitogenomes should be sequenced, which would provide more insight into the phylogenetic relationships of species from different taxa.
This research was supported by National Science Foundation of China [no. 31872272 and 31501876]; Shanxi Scholarship Council of China [no. 2020-064 and 2020-065], and Shanxi Graduate Innovation Project of Shanxi Province [no. 2020SY215]. The authors have declared that no competing interests exist.