Research Article |
Corresponding author: Cuiqing Gao ( cqgao@njfu.edu.cn ) Academic editor: Jader Oliveira
© 2023 Cuiqing Gao, Wen Dong.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Gao C, Dong W (2023) Characterization of two new Pylorgus mitogenomes (Hemiptera, Lygaeidae, Ischnorhynchinae) and a mitochondrial phylogeny of Lygaeoidea. ZooKeys 1166: 141-154. https://doi.org/10.3897/zookeys.1166.104103
|
Lygaeidae is a large family of Hemiptera (Heteroptera) currently separated into three subfamilies, Ischnorhynchinae, Lygaeinae, and Orsillinae. In this research, the complete mitogenomes of the iscnorhynchines Pylorgus porrectus Zheng, Zou & Hsiao, 1979 and Pylorgus sordidus Zheng, Zou & Hsiao, 1979 were sequenced, and the phylogeny of Pylorgus and the Lygaeidae with known complete mitogenomes were examined. The mitogenomes are 15,174 bp and 15,399 bp in size, respectively, and comprised of 13 protein-coding genes (PCGs), 22 transfer RNA genes (tRNAs), two ribosomal RNA genes (rRNAs), and a control region (D-loop). Nucleotide composition is biased toward A and T, and the gene order is identical to that of the putative ancestral arrangement of insects. Eleven PCGs begin with a typical ATN, and the remaining two PCGs begin with TTG (cox1 and nad4l). All tRNAs had a typical cloverleaf secondary structure, but some of them had individual base mismatches. The phylogenetic analyses based on the concatenated nucleotide sequences of the 13 PCGs, using Bayesian inference and maximum likelihood, support the monophyly of Lygaeidae. The results show that P. porrectus and P. sordidus clustered with nine other Lygaeidae. This study includes the first complete sequencing of the mitochondrial genomes of two Pylorgus species, which will provide important data for studying the phylogenetic position of Lygaeidae in Lygaeoidea and reconstructing the phylogenetic relationships within Pentatomomorpha.
Heteroptera, mitochondrial DNA, next-generation sequencing, phylogenetic analysis, Pylorgus porrectus, Pylorgus sordidus
The Lygaeoidea represents the second largest superfamily within the infraorder Pentatomomorpha and includes over 4660 described species in 16 families (
Currently, three subfamilies of Lygaeidae (sensu stricto) are recognized: Ischnorhynchinae, Lygaeinae, and Orsillinae (
To date, the phylogeny of Lygaeidae is unresolved (
The complete mitochondrial genome data of nine species in Lygaeidae are included on NCBI, and only two species of Ischnorhynchinae. However, for the largest genus in this subfamily, Pylorgus, the mitochondrial genome data is totally unknown. Therefore, in the present study, we obtained the complete mitochondrial genomes of two Pylorgus species, Pylorgus porrectus Zheng, Zou & Hsiao, 1979 and Pylorgus sordidus Zheng, Zou & Hsiao, 1979, by using the next-generation sequencing technology. Furthermore, we constructed the phylogenetic trees based on the mitogenomes of 21 species of the superfamily Lygaeoidea and four outgroup species, which will provide important data for further studies on the phylogenetic position of Lygaeidae in Lygaeoidea and be also useful to reconstruct the phylogenetic relationships within Pentatomomorpha.
Adults of Pylorgus porrectus (Fig.
Family | Subfamily | Species | Length (bp) | GenBank No. |
---|---|---|---|---|
Lygaeidae | Ischnorhynchinae | Kleidocerys resedae (Panzer, 1797) | 14,688 | KJ584365.1 |
Ischnorhynchinae | Pylorgus porrectus Zheng, Zou & Hsiao, 1979 | 15,174 | OP793792 | |
Ischnorhynchinae | Pylorgus sordidus Zheng, Zou & Hsiao,1979 | 15,399 | OQ064783 | |
Ischnorhynchinae | Crompus oculatus Stål, 1874 | 15,332 | MW619652.1 | |
Lygaeinae | Arocatus melanocephalus (Fabricius, 1798) | 15,409 | NC_063142.1 | |
Lygaeinae | Tropidothorax cruciger (Motschulsky, 1859) | 15,781 | NC_056293.1 | |
Lygaeinae | Tropidothorax sinensis (Reuter, 1888) | 15,422 | MW547017.1 | |
Orsillinae | Nysius plebeius Distant, 1883 | 17,637 | MN599979.1 | |
Orsillinae | Nysius cymoides (Spinola, 1837) | 16,301 | MW291653.1 | |
Orsillinae | Nysius fuscovittatus Barber, 1958 | 14,575 | NC_050167.1 | |
Orsillinae | Nithecus jacobaeae (Schilling, 1829) | 14,206 | MW619651.1 | |
Berytidae | Metacanthinae | Yemmalysus parallelus Stusak, 1972 | 15,747 | NC_012464.1 |
Metacanthinae | Metatropis longirostris Hsiao, 1974 | 15,744 | NC_037373.1 | |
Blissidae | Bochrus foveatus Distant, 1879 | 14,738 | NC_065814.1 | |
Capodemus sinuatus (Slater, Ashlock & Wilcox, 1969) | 15,199 | NC_065815.1 | ||
Geocoridae | Geocorinae | Geocoris pallidipennis (Costa, 1843) | 14,592 | NC_012424.1 |
Henestarinae | Henestaris halophilus (Burmeister, 1835) | 14,868 | MW619656.1 | |
Malcidae | Chauliopinae | Chauliops fallax Scott, 1874 | 15,739 | NC_020772.1 |
Malcinae | Malcus inconspicuous Štys, 1967 | 15,575 | NC_012458.1 | |
Rhyparochromidae | Rhyparochrominae | Neolethaeus assamensis (Distant, 1901) | 15,067 | NC_037375.1 |
Rhyparochrominae | Bryanellocoris orientalis Hidaka, 1962 | 15,606 | NC_063139.1 | |
Pyrrhocoridae | Dysdercus evanescens Distant, 1902 | 15,635 | MW619727.1 | |
Coreidae | Hydarinae | Hydaropsis longirostris (Hsiao, 1963) | 16,521 | EU427337.1 |
Rhopalidae | Aeschyntelus notatus Hsiao, 1963 | 14,532 | EU427333.1 | |
Alydidae | Riptortus pedestris (Fabricius, 1775) | 17,191 | EU427344.1 |
The complete genomic DNA was extracted from an adult sample using a Rapid Animal Genomic DNA Isolation Kit (Sangon Biotech, Shanghai, China). Libraries were prepared on an Illumina MiSeq PE300 platform (Sangon Biotech, Shanghai, China). Low-quality and short reads were removed using Fastp v. 0.36 (
SPAdes v. 3.15 (
For mitochondrial gene annotation, we used tBLASTn and GeneWise to back-align with near-source reference databases to obtain the coding sequence (CDS) gene boundaries, and MiTFi to obtain the transfer RNA genes (tRNAs) sequence annotation. The non-coding ribosomal RNA genes (rRNAs) were identified by cmsearchrfam alignment and finally summarized into a complete annotation result.
The nucleotide composition and RSCU (relative synonymous codon usage) were calculated using PhyloSuite v. 1.2.2 (
The mitochondrial genome data of 25 species in Pentatomomorpha were used to reconstruct the phylogenetic relationship of Lygaeoidea, in which 21 species of Lygaeoidea were regarded as ingroup and four species was regarded as outgroup (Table
BI phylogenies were inferred using MrBayes v. 3.2.6 (
The assembled complete mitogenomes of Pylorgus porrectus and P. sordidus are circular DNA molecules of 15,174 bp and 15,399 bp in length, respectively, which is within the range of the sequenced mitogenomes of Lygaeidae in GenBank (Table
Gene | Size (bp) | A | T | G | C | A+T% | AT-skew | GC-skew |
---|---|---|---|---|---|---|---|---|
P. porrectus | 15,174 | 42.7 | 31.8 | 9.6 | 15.8 | 74.5 | 0.15 | −0.24 |
P. sordidus | 15,399 | 42.8 | 33.1 | 9.6 | 14.5 | 75.9 | 0.13 | −0.2 |
The basic composition of P. porrectus was A = 42.7%, T = 31.8%, G = 9.6%, and C = 15.8%, and of P. sordidus, A = 42.8%, T = 33.1%, G = 9.6%, C = 14.5%. Furthermore, both mitochondrial genome sequences were biased toward A and T. The AT content of P. porrectus was 63.74% and that of P. sordidus was 64.12%. The AT-skew value was greater than 0, whereas the GC skew value was less than 0, indicating that the base composition of P. porrectus and P. sordidus showed a strong A-bias and T-bias (Table
The complete length of the 13 PCGs of P. porrectus and P. sordidus were 10,991 bp and 10,993 bp, respectively. Of these, nine PCGs are located at the N-strand, and the other four PCGs were encoded on the J-strand (Fig.
The RSCU of the two species was calculated (Fig.
RSCU values of Pylorgus species a P. porrectus b P. sordidus. The abscissa represents the type of amino acid translated by the codon, and the ordinate represents the codon bias score calculated for the amino acid. The higher the score, the more the types of codons, and the more active the evolutionary variation of genes in the genome.
Gene | Position (bp) | Size (bp) | Strand | Direction | Intergenic nucleotides | Anti- or start/stop codons |
---|---|---|---|---|---|---|
trnI | 1–64 | 64 | N | Forward | 0 | |
trnQ | 62–130 | 69 | J | Reverse | −3 | |
trnM | 131–200 | 70 | N | Forward | 0 | |
nad2 | 201–1187 | 987 | N | Forward | 0 | ATT/TAA |
trnW | 1178–1241 | 64 | N | Forward | −10 | |
trnC | 1234–1296 | 63 | J | Reverse | −8 | |
trnY | 1303–1370 | 68 | J | Reverse | 6 | |
cox1 | 1374–2907 | 1534 | N | Forward | 3 | TTG/T– |
trnL2 | 2908–2972 | 65 | N | Forward | 0 | |
cox2 | 2973–3648 | 676 | N | Forward | 0 | ATA/T– |
trnK | 3649–3713 | 65 | N | Forward | 0 | |
trnD | 3714–3774 | 61 | N | Forward | 0 | |
atp8 | 3775–3933 | 159 | N | Forward | 0 | ATA/TAA |
atp6 | 3927–4592 | 666 | N | Forward | −7 | ATG/TAA |
cox3 | 4601–5378 | 778 | N | Forward | 8 | ATT/T– |
trnG | 5379–5443 | 65 | N | Forward | 0 | |
nad3 | 5444–5797 | 354 | N | Forward | 0 | ATA/TAG |
trnA | 5796–5860 | 65 | N | Forward | −2 | |
trnR | 5861–5920 | 60 | N | Forward | 0 | |
trnN | 5923–5990 | 68 | N | Forward | 2 | |
trnS1 | 5990–6058 | 69 | N | Forward | −1 | |
trnE | 6058–6121 | 64 | N | Forward | −1 | |
trnF | 6122–6184 | 63 | J | Reverse | 0 | |
nad5 | 6185–7882 | 1698 | J | Reverse | 0 | ATT/TAA |
trnH | 7886–7949 | 64 | J | Reverse | 3 | |
nad4 | 7987–9303 | 1317 | J | Reverse | 37 | ATG/TAA |
nad4l | 9297–9605 | 309 | J | Reverse | −7 | TTG/TAA |
trnT | 9581–9643 | 63 | N | Forward | −25 | |
trnP | 9644–9706 | 63 | J | Reverse | 0 | |
nad6 | 9709–10170 | 462 | N | Forward | 2 | ATC/TAA |
cob | 10170–11306 | 1137 | N | Forward | −1 | ATG/TAG |
trnS2 | 11305–11374 | 70 | N | Forward | −2 | |
nad1 | 11396–12319 | 924 | J | Reverse | 21 | ATC/TAA |
trnL1 | 12320–12384 | 65 | J | Reverse | 0 | |
rrnL | 12392–13612 | 1221 | J | Reverse | 7 | |
trnV | 13635–13700 | 66 | J | Reverse | 22 | |
rrnS | 13726–14315 | 590 | J | Reverse | 25 |
The nucleotide diversity (Pi) of the two species based on 13 PCGs was computed (Fig.
Gene | Position (bp) | Size (bp) | Strand | Direction | Intergenic nucleotides | Anti- or start/stop codons |
---|---|---|---|---|---|---|
trnI | 1–64 | 64 | N | Forward | 0 | |
trnQ | 62–130 | 69 | J | Reverse | −3 | |
trnM | 131–201 | 71 | N | Forward | 0 | |
nad2 | 202–1188 | 987 | N | Forward | 0 | ATT/TAA |
trnW | 1179–1242 | 64 | N | Forward | −10 | |
trnC | 1235–1297 | 63 | J | Reverse | −8 | |
trnY | 1305–1373 | 69 | J | Reverse | 10 | |
cox1 | 1377–2910 | 1534 | N | Forward | 3 | TTG/T– |
trnL2 | 2911–2975 | 65 | N | Forward | 0 | |
cox2 | 2976–3651 | 676 | N | Forward | 0 | ATA/T– |
trnK | 3652–3716 | 65 | N | Forward | 0 | |
trnD | 3717–3777 | 61 | N | Forward | 0 | |
atp8 | 3778–3936 | 159 | N | Forward | 0 | ATA/TAA |
atp6 | 3930–4595 | 666 | N | Forward | −7 | ATG/TAA |
cox3 | 4604–5381 | 778 | N | Forward | 8 | ATT/T– |
trnG | 5382–5444 | 63 | N | Forward | 0 | |
nad3 | 5445–5798 | 354 | N | Forward | 0 | ATA/TAG |
trnA | 5797–5860 | 64 | N | Forward | −2 | |
trnR | 5861–5920 | 60 | N | Forward | 0 | |
trnN | 5923–5990 | 68 | N | Forward | 2 | |
trnS1 | 5990–6058 | 69 | N | Forward | −1 | |
trnE | 6058–6121 | 64 | N | Forward | −1 | |
trnF | 6122–6184 | 63 | J | Reverse | 0 | |
nad5 | 6185–7882 | 1698 | J | Reverse | 0 | ATT/TAA |
trnH | 7886–7949 | 64 | J | Reverse | 3 | |
nad4 | 7988–9304 | 1317 | J | Reverse | 38 | ATG/TAA |
nad4l | 9298–9606 | 309 | J | Reverse | −7 | TTG/TAA |
trnT | 9582–9644 | 63 | N | Forward | −25 | |
trnP | 9645–9707 | 63 | J | Reverse | 0 | |
nad6 | 9710–10171 | 462 | N | Forward | 2 | ATC/TAA |
cob | 10171–11307 | 1137 | N | Forward | −1 | ATG/TAG |
trnS2 | 11306–11375 | 70 | N | Forward | −2 | |
nad1 | 11397–12320 | 924 | J | Reverse | 21 | ATC/TAA |
trnL1 | 12321–12385 | 65 | J | Reverse | 0 | |
rrnL | 12397–13613 | 1217 | J | Reverse | 11 | |
trnV | 13636–13701 | 66 | J | Reverse | 22 | |
rrnS | 13727–14316 | 590 | J | Reverse | 25 |
The ratios of Ka/Ks for each gene of the 13 PCGs were also computed (Fig.
Eleven gene overlaps were observed in the two mitogenomes, ranging from 1 bp to 25 bp (Tables
Intergenic spacers were identified in the two mitogenomes, and their lengths ranged from 1 bp to 38 bp (Tables
The two mitogenomes both contain the complete set of 22 tRNA genes typical of Lygaeidae mitogenomes, ranging from 60 to 71 bp, which is consistent with previously sequenced mitogenomes of Lygaeidae (
All tRNA have the typical cloverleaf secondary structure, including the TΨC arm, the amino acid acceptor arm, the anticodon arm, and the dihydrouridine arm. Some of tRNA genes (trnY, trnA, trnS1, trnF, trnH, trnP, and trnV) showed individual base mismatches, which is a common phenomenon in insect mitogenomes (
The rrnL genes of the two mitogenomes are located at the intergenic region between trnL and trnV, with lengths that range from 1217 bp to 1221 bp. The rrnS genes are located between trnV and the D-loop, which are both 590 bp in length. Both rRNAs are located on the N-strand.
Phylogenetic relationships within Lygaeoidea were reconstructed based on mitochondrial 13 PCGs using BI and ML methods (Figs
The clades making up the Lygaeidae had high support values in the BI results and confirmed the monophyly of Lygaeidae (Figs
In this study, we sequenced and analyzed the mitogenomes of Pylorgus porrectus and P. sordidus, which had common and similar structures. The mitochondrial genome structure of the two Pylorgus species is a double-stranded closed loop, containing a non-coding control region sequence and encoding 37 genes. The two species showed a substantial nucleotide bias toward a higher A and T content, as do other Pentatomomorpha (
The phylogenetic results using 13 PCGs confirm the monophyly of Lygaeidae, which support the opinions of
We are grateful to Shengchang Lai (Nanjing Forestry University) for helping with data analysis. We also thank Suyan Cao (Nanjing Forestry University) for collecting specimens. We thank Thomas J. Henry (National Museum of Natural History, Washington DC.), the two anonymous reviewers, and the subject editor, Jader Oliveira, for their helpful and constructive comments. We also thank Guangyu Yu (Jiangxi Agricultural University) for revising the manuscript.
No conflict of interest was declared.
No ethical statement was reported.
This research was funded by Major Project of Agricultural Biological Breeding (grant no. 2022ZD0401501), the National Natural Science Foundation of China (grant no. 31402010), and the Highly Educated Talents Foundation in Nanjing Forestry University (grant no. G2014002).
Conceptualization, C.G. and W.D.; methodology, C.G. and W.D.; investigation, C.G. and W.D.; funding acquisition, C.G.; writing—original draft preparation, W.D.; writing—review and editing, C.G. Both authors have read and agreed to the published version of the manuscript.
Cuiqing Gao https://orcid.org/0000-0002-0177-5161
All of the data that support the findings of this study are available in the main text or Supplementary Information.