Research Article |
|
Corresponding author: Aihui Yin ( keyanlaodong@163.com ) Academic editor: Axel Hausmann
© 2025 Yanpeng Cai, Aihui Yin.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Cai Y, Yin A (2025) The first mitogenomes of the subfamily Epipleminae (Lepidoptera, Uraniidae) and phylogenetic analysis of Macroheterocera. ZooKeys 1255: 343-363. https://doi.org/10.3897/zookeys.1255.164711
|
The subfamily Epipleminae, the largest group within Uraniidae (Lepidoptera, Macroheterocera, Geometroidea), comprises small, nocturnal moths primarily distributed in tropical regions. Taxonomic and molecular phylogenetic studies of Epipleminae have long been challenging, and this group remains genetically understudied, with no mitochondrial genomes (mitogenomes) reported to date, despite their wide utility in phylogenetic research. Here, we sequenced, assembled, and annotated the first complete mitogenomes of four epiplemine species: Dysaethria flavistriga (15,404 bp), Monobolodes prunaria (15,258 bp), Phazaca alikangensis (15,482 bp), and Warreniplema fumicosta (15,467 bp), using high-throughput sequencing technology. These mitogenomes exhibit typical gene arrangement of ditrysian Lepidoptera, along with distinctive features such as rare (TA)n microsatellite repeats in the 16S rRNA. Most protein-coding genes (PCGs) initiate with standard ATN start codons and terminate with TAA or a single T residue. Codon usage analysis revealed UUA (Leu2), AUU (Ile), UUU (Phe), AUA (Met), and AAU (Asn) as the five most frequently used codons. All tRNAs display canonical cloverleaf secondary structures, except for trnS1, which lacks the DHU arm. Comprehensive phylogenetic analyses that incorporated existing macroheteroceran mitogenomic data provided robust support for the placement of Epipleminae under Uraniidae, offered the first mitogenome-based evidence supporting the monophyly of Geometroidea on family level, and strongly supported a sister relationship between Geometroidea and Noctuoidea.
16S rRNA, Epipleminae, Macroheterocera, mitogenome, phylogeny, Uraniidae
The insect mitochondrial genome (mitogenome) is a double-stranded closed loop DNA molecule, measuring 15–18 kb in size. It encodes a pack of 37 genes, including 13 protein-coding genes, 22 transfer RNA genes, and two ribosomal RNA genes, which exhibit strong conservation across bilaterian metazoans (
The mega-diverse Macroheterocera represents the high-level phylogenetic lineage of the “large moths” in Lepidoptera. It currently includes six superfamilies (Bombycoidea, Drepanoidea, Geometroidea, Lasiocampoidea, Mimallonoidea, and Noctuoidea), and consists of more than 72,400 described species (
Uraniidae is a family within the superfamily Geometroidea of Macroheterocera, and comprises ~700 species in ~90 genera (
Epipleminae, the most speciose subfamily of the four subfamilies (the other three: Auzeinae, Microniinae, and Uraniinae) in Uraniidae (
Accordingly, in this study, we present four complete mitogenome sequences for Epipleminae for the first time. Our focal species include Dysaethria flavistriga, Monobolodes prunaria, Phazaca alikangensis, and Warreniplema fumicosta, of which the specimens were all collected from Guizhou, China. We provide a detailed description and comparative analysis of these mitogenomes. Additionally, we explore the phylogenetic positions of Epipleminae and Uraniidae, and further reconstruct the phylogeny of Macroheterocera using available mitogenome data.
Adult specimens of the four epiplemine species were captured by light trap from Baiyan Village (25°21'47"N, 107°56'5"E, 748 m elevation), Maolan National Nature Reserve, Libo County, Guizhou, China on 8 July 2024. The specimens were pinned and dried, and their genitalia were dissected and slide-mounted for examination. Identification was based on external morphology and genitalia characters, following taxonomic references (
Total DNA was extracted from the thoracic muscles of the dry specimens using the MagicMag Genomic DNA Micro Kit (Sangon Biotech Co., Shanghai, China) following the manufacturer’s protocol. For each sample, 0.2 μg DNA was fragmented by sonication to an average size of 350 bp. The fragmented DNA was then subjected to high-throughput pair-ended sequencing (PE150) on Illumina NovaSeq 6000 platform at Novogene Bioinformatics Technology Co., Ltd (Tianjin, China).
The raw data were processed using Fastp v. 0.19.7 (
The mitogenomes were then uploaded to GenBank. The standard 658 bp COX1 barcodes were extracted from the mitogenome sequences and uploaded to the BOLD system. The corresponding GenBank accession numbers and BOLD sample IDs are listed in Table
The GenBank accession numbers and BOLD sample IDs of the four epiplemine species.
The base composition of the mitogenomes was analyzed using MEGA v. 11.0.13 (
The secondary structures of the tRNAs were predicted via MITOS2 online service, while rRNA secondary structures were predicted with R2DT v. 1.3 (
To determine the phylogenetic positions of Epipleminae and Uraniidae, and to reconstruct the phylogeny of Macroheterocera, we aimed to maximize taxonomic representation at the family and subfamily level. We downloaded mitogenomic sequences for 34 species from GenBank, representing 18 families within Macroheterocera and four outgroup families (Table
Basic information of the mitogenomes used for the phylogenetic analyses in this study. Accession numbers of newly sequenced species are highlighted in bold.
| Superfamily | Family | Subfamily | Species | Accession/SRA number |
|---|---|---|---|---|
| Ingroup | ||||
| Bombycoidea | Bombycidae | Bombycinae | Bombyx mandarina | AB070263 |
| Brahmaeidae | – | Brahmaea hearseyi | KU884326 | |
| Endromidae | – | Andraca theae | KX365419 | |
| Eupterotidae | – | Ganisa cyanogrisea | MF100140 | |
| Saturniidae | Saturniinae | Antheraea pernyi | MK920216 | |
| Sphingidae | Macroglossinae | Theretra clotho | MZ562564 | |
| Drepanoidea | Cimeliidae | – | Axia margarita | SRR1006157 |
| Doidae | – | Doa sp. | KJ508058 | |
| Drepanidae | Cyclidiinae | Cyclidia substigmaria | SRR1021608 | |
| Drepaninae | Pseudalbara parvula | MZ823341 | ||
| Oretinae | Oreta sp. | MZ823343 | ||
| Thyatirinae | Tethea albicostata | OK149234 | ||
| Geometroidea | Epicopeiidae | – | Epicopeia hainesii | MK033610 |
| Geometridae | Alsophilinae | Alsophila aescularia | OX276387 | |
| Ennominae | Ectropis grisescens | MN792921 | ||
| Geometrinae | Iotaphora admirabilis | MK903032 | ||
| Larentiinae | Hydrelia parvulata | MN962739 | ||
| Sterrhinae | Idaea salutaria | MK122626 | ||
| Pseudobistonidae | Pseudobistoninae | Pseudobiston pinratanai | SRR11994912, SRR13187626 | |
| Sematuridae | – | Homidiana leachi | SRR27474561 | |
| – | Mania lunus | SRR27474554, SRR27474556 | ||
| Uraniidae | Epipleminae | Dysaethria flavistriga | PV151521 | |
| Monobolodes prunaria | PV151522 | |||
| Phazaca alikangensis | PV151523 | |||
| Warreniplema fumicosta | PV151524 | |||
| Microniinae | Acropteris iphiata | MN093120 | ||
| Uraniinae | Lyssa zampa | MZ713634 | ||
| Lasiocampoidea | Lasiocampidae | Chaetomalachiinae | Euthrix laeta | KU870700 |
| Pinarinae | Trabala vishnou | MZ927091 | ||
| Mimallonoidea | Mimallonidae | – | Lacosoma valva | KJ508050 |
| Noctuoidea | Erebidae | Arctiinae | Brunia dorsalis | MN635735 |
| Erebinae | Erebus caprimulgus | MZ959074 | ||
| Lymantriinae | Dasychira tristis | MZ520324 | ||
| Euteliidae | Euteliinae | Eutelia adulatricoides | KJ185131 | |
| Noctuidae | Amphipyrinae | Spodoptera frugiperda | KU877172 | |
| Dilobinae | Diloba caeruleocephala | OX381635 | ||
| Eustrotiinae | Maliattha signifera | OQ111926 | ||
| Nolidae | Eariadinae | Earias clorana | OK235312 | |
| Notodontidae | Dudusinae | Dudusa sphingiformis | MW788876 | |
| Outgroup | ||||
| Calliduloidea | Callidulidae | – | Pterodecta felderi | MT370823 |
| Hyblaeoidea | Hyblaeidae | – | Hyblaea puera | MW885970 |
| Papilionoidea | Papilionidae | Parnassiinae | Luehdorfia chinensis | KM453725 |
| Pyraloidea | Pyralidae | Phycitinae | Plodia interpunctella | KT428892 |
In our phylogenetic analyses, multi-gene and multi-method approaches were employed to mitigate single-locus bias, resolve topological conflicts, and strengthen nodal support, thereby ensuring robust phylogenetic hypotheses. Consequently, a total of ten phylogenetic trees were reconstructed using Maximum Likelihood (ML) and Bayesian Inference (BI) methods based on five data matrices: (1) “PCG123R” matrix: All three codon positions of 13 PCGs plus two rRNA genes (13,214 sites); (2) “PCG123” matrix: All three codon positions of 13 PCGs (11,163 sites); (3) “PCG12R” matrix: The 1st and 2nd codon positions of PCGs plus two rRNA genes (9,493 sites); (4) “PCG12” matrix: The 1st and 2nd codon positions of PCGs (7,442 sites); (5) “AA” matrix: Amino acid sequences translated from 13 PCGs (3,721 sites).
The DNA sequences of each gene were aligned separately using MAFFT v. 7.52 (
For the AA matrix, the site-heterogeneous substitution model (CAT + GTR) was used in the BI analysis. This model was chosen because it demonstrated better performance against systematic errors and long-branch attraction (LBA) artifacts compared to site-homogeneous models, despite its higher computational demand. For all other tree-building analyses, site-homogeneous models were applied, and the optimal partitioning schemes and corresponding best-fit substitution models were determined using ModelFinder v. 2.2.2 (
The ML trees were constructed in IQ-TREE2 software with 10,000 ultrafast bootstraps (BS) (
The complete mitogenomes of D. flavistriga (length: 15,404 bp), M. prunaria (length: 15,258 bp), P. alikangensis (length: 15,482 bp), and W. fumicosta (length: 15,467 bp) were sequenced and annotated. As expected, these epiplemine species shared the typical composition of 37 genes (13 PCGs, 22 tRNA genes, and two rRNA genes) and one A+T-rich control region with other metazoan animals (
Circular structures of the mitochondrial genomes of four newly sequenced epiplemine species. Genes are represented with different color blocks. Colored blocks outside of each ring indicate that the genes are on the majority strand, while colored blocks within the rings indicate that the genes are located on the minority strand.
The mitogenomes of the four epiplemine species exhibited a distinct overall A + T nucleotide bias (80.8% to 81.2%). This high A+T content varied slightly among different types of genes, being most pronounced in the control region (93.7% to 95.6%), followed by rRNAs (84.9% to 85.7%), tRNAs (81.5% to 82.1%), and lowest in PCGs (79.0% to 79.8%) (Table
Base composition in different regions in the mitochondrial genomes of four epiplemine species: D. flavistriga/M. prunaria/P. alikangensis/W. fumicosta.
| Feature | Size (bp) | A+T% | AT-skew | GC-skew |
|---|---|---|---|---|
| Whole mitogenome | 15404/15258/ 15482/15467 | 81.2%/81.2%/ 80.8%/81.2% | 0.007/-0.0004/ 0.020/0.021 | -0.187/-0.172/ -0.200/-0.195 |
| PCGs | 11200/11200/ 11202/11199 | 79.7%/79.8%/ 79.0%/79.4% | 0.009/0.003/ 0.018/0.019 | -0.170/-0.152/ -0.181/-0.176 |
| rRNA genes | 2177/2179/ 2195/2237 | 85.2%/85.4%/ 84.9%/85.7% | 0.009/-0.009/ 0.015/0.027 | -0.344/-0.342/ -0.372/-0.352 |
| tRNA genes | 1467/1459/ 1461/1458 | 82.0%/82.0%/ 82.1%/81.5% | 0.024/0.038/ 0.044/0.034 | -0.091/-0.125/ -0.115/-0.122 |
| Control region | 431/226/371/411 | 93.7%/93.8%/ 95.4%/95.6% | -0.059/-0.085/ -0.028/0.038 | -0.630/-0.143/ -0.412/-0.667 |
Gene overlaps and intergenic spacers were identified in the mitogenomes of the four epiplemine species: D. flavistriga (6 overlaps, 15 spacers), M. prunaria (5 overlaps, 18 spacers), P. alikangensis (8 overlaps, 15 spacers), and W. fumicosta (8 overlaps, 14 spacers) (Suppl. material
The total lengths of the 13 PCGs in the mitogenomes of D. flavistriga, M. prunaria, P. alikangensis and W. fumicosta were 11,200 bp, 11,200 bp, 11,202 and 11,199 bp respectively. Most PCGs in the four mitogenomes used the typical ATN start codon for initiation (Suppl. material
The relative synonymous codon usages (RSCU) for the four mitogenomes are shown in Fig.
The synonymous substitution rates (Ks), non-synonymous substitution rates (Ka), and the Ka/Ks ratios of the 13 PCGs in Epipleminae were calculated (Fig.
The total lengths of the 22 tRNA genes in the mitogenomes of D. flavistriga, M. prunaria, P. alikangensis and W. fumicosta were 1,467 bp, 1,459 bp, 1,461 and 1,458 bp, respectively. The lengths of individual tRNA genes ranged from 62 bp (the shortest, trnS1, in P. alikangensis and W. fumicosta) to73 bp (the longest, trnL1, in D. flavistriga) (Suppl. material
The total lengths of the two rRNA genes in the mitogenomes of D. flavistriga, M. prunaria, P. alikangensis and W. fumicosta were 2,177 bp, 2,179 bp, 2,195 and 2,237 bp respectively. The 16S rRNA genes were located between trnL1 and trnV, and the 12S rRNA genes were positioned between trnV and the control region, with no intergenic spacers or overlaps observed (Suppl. material
16S rRNA secondary structure of Warreniplema fumicosta. The additional helix formed between helices H2259 and H2347 by the microsatellite sequence (TA)4A(TA)7, along with its short flanking sequences, is highlighted with a pale green background. The additional helix formed between helices H579 and H1648 by the microsatellite sequence (TA)13 is highlighted with a pale red background.
The control regions of epiplemine mitogenomes were located between the 12S rRNA and trnM genes (Suppl. material
Alignment of the control regions from four newly sequenced epiplemine mitogenomes. The conserved motif ACATAGA (highlighted in red) and its subsequent poly-T stretch, as well as the conserved motif CCATAGTTAATAA(A/T)TTTT (highlighted in green), are emphasized. Dots represent omitted sequences, with the number of dots not proportional to nucleotide number of corresponding region.
The AliGROOVE tests revealed generally low sequence composition heterogeneity among the 43 representative species across all five datasets, with the exception of three species: Axia margarita (Cimeliidae), Cyclidia substigmaria (Cyclidiinae) and Pseudobiston pinratanai (Pseudobistonidae) (Fig.
Base compositional heterogeneity of mitogenome sequences for our five nucleotide and amino acid datasets used in phylogenetic analyses. The degree of sequence similarity is visualized using colored blocks, based on AliGROOVE scores, ranging from -1 (indicating non-random similarity, shown in red) to +1 (representing full random similarity, shown in blue).
Different combinations of datasets, partitioning models, and tree-building methods resulted in ten phylogenetic trees within Macroheterocera (Fig.
Drepanoidea (comprising three families: Cimeliidae, Doidae, and Drepanidae sensu
The close relationship between Bombycoidea and Lasiocampoidea was strongly supported by all our analyses (BS: 99–100, PP: 0.99–1.00). However, their sibling relationship, as suggested by multiple previous studies (e.g.
The relationship between Geometroidea and Noctuoidea has historically been unresolved, despite a number of molecular phylogenetic studies addressing this issue (e.g.
The superfamily Geometroidea was consistently recovered as monophyletic in all our analyses, with very strong nodal support (BS: 98–100, PP: 1.00). All the five families (Epicopeiidae, Geometridae, Pseudobistonidae, Sematuridae, and Uraniidae) included in Geometroidea sensu
In this study, we presented the first four mitogenomes of the subfamily Epipleminae, providing detailed comparative analyses and exploring their phylogenetic implications within Macroheterocera. While the organization and gene content of these mitogenomes did not exhibit significant divergence from other ditrysian Lepidoptera, a rare phenomenon within Lepidoptera–previously reported only in tortricid mitogenomes–was observed in the 16S rRNA, where additional helices were formed by (TA)n microsatellite sequences in all four species. Specifically, one helix, located between helices H2259 and H2347, was conserved across all four species, although its sequence composition varied among them. Additionally, a unique (TA)13 helix was identified exclusively in W. fumicosta, positioned between helices H579 and H1648. Noteworthily, these additional helices were not observed in the three previously published mitogenome sequences (GenBank Nos MN093120, MW435592, and MZ713634) from the other two subfamilies of Uraniidae.
In our phylogenetic analyses, the generated trees were largely congruent with previous molecular studies regarding the major lineages within Macroheterocera on the superfamily level. Our study was the first mitogenomic-based analysis to include representatives from all five families within the superfamily Geometroidea. The monophyly of Geometroidea and its sister relationship with Noctuoidea were strongly supported. All four species sequenced in this study formed a highly supported monophyletic clade, representing the subfamily Epipleminae. Furthermore, representatives from Epipleminae, Microniinae, and Uraniinae clustered into the monophyletic family Uraniidae within Geometroidea. However, due to the absence of mitogenomic data from the subfamily Auzeinae, and the overall scarcity of representative mitogenomes from Uraniidae, the monophyly of Uraniidae could not be fully tested in our study, and the internal phylogenetic relationships within Uraniidae remained unresolved. These findings highlighted the critical need for additional mitogenomic data from underrepresented taxa to further refine the phylogenetic framework of Uraniidae and to elucidate the evolutionary relationships within this diverse family.
The authors have declared that no competing interests exist.
No ethical statement was reported.
No use of AI was reported.
This research was funded by the National Natural Science Foundation of China (31760630), and Guizhou Provincial Basic Research Program (Natural Science) (Qian Kehe Foundation-ZK [2024] General 355).
Yanpeng Cai: data curation, formal analysis, investigation, writing – original draft preparation, visualization. Aihui Yin: Conceptualization, methodology, writing – review and editing, supervision, funding acquisition.
Yanpeng Cai https://orcid.org/0000-0001-6554-3784
Aihui Yin https://orcid.org/0000-0001-7614-9969
Supplementary figures
Data type: pdf
Explanation note: figs S1–S4. tRNA secondary structures of Dysaethria flavistriga, Monobolodes prunaria, Phazaca alikangensis, and Warreniplema fumicosta. figs S5–S7. 16S rRNA secondary structures of Dysaethria flavistriga, Monobolodes prunaria, and Phazaca alikangensis. figs S8–S11. 12S rRNA secondary structures of Dysaethria flavistriga, Monobolodes prunaria, Phazaca alikangensis, and Warreniplema fumicosta. figs S12–S20. Phylogenetic trees of Macroheterocera.
Supplementary tables
Data type: pdf
Explanation note: tables S1–S9. The corresponding partitioning schemes and substitution models determined by ModelFinder for the phylogenetic analyses using site-homogeneous models. table S10. Annotation of the mitochondrial genomes of four epiplemine species.