Research Article |
Corresponding author: Zhihang Zhuo ( zhuozhihang@foxmail.com ) Academic editor: Francesco Vitali
© 2023 Junhao Wu, Danping Xu, Xinju Wei, Wenkai Liao, Xiushan Li, Zhihang Zhuo.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Wu J, Xu D, Wei X, Liao W, Li X, Zhuo Z (2023) Characterization of the complete mitochondrial genome of the longhorn beetle, Batocera horsfieldi (Coleoptera, Cerambycidae) and its phylogenetic analysis with suitable longhorn beetles. ZooKeys 1168: 387-402. https://doi.org/10.3897/zookeys.1168.105328
|
Mitochondrial genome analysis is an important tool for studying insect phylogenetics. The longhorn beetle, Batocera horsfieldi, is a significant pest in timber, economic and protection forests. This study determined the mitochondrial genome of B. horsfieldi and compared it with the mitochondrial genomes of other Cerambycidae with the aim of exploring the phylogenetic status of the pest and the evolutionary relationships among some Cerambycidae subgroups. The complete mitochondrial genome of B. horsfieldi was sequenced by the Illumina HiSeq platform. The mitochondrial genome was aligned and compared with the existing mitochondrial genomes of Batocera lineolata and B. rubus in GenBank (MF521888, MW629558, OM161963, respectively). The secondary structure of transfer RNA (tRNA) was predicted using tRNAScan-SE server v.1.21 and MITOS WebSever. Thirteen protein-coding genes (PCGs) and two ribosomal RNA gene sequences of 21 longhorn beetles, including B. horsfieldi, plus two outgroups, Dryops ernesti (Dryopidae) and Heterocerus parallelus (Heteroceridae), were analyzed. The phylogenetic tree was constructed using maximum likelihood and Bayesian inference methods. In this study, we successfully obtained the complete mitochondrial genome of B. horsfieldi for the first time, which is 15 425 bp in length. It contains 37 genes and an A + T-rich region, arranged in the same order as the recognized ancestor of longhorn beetles. The genome of B. horsfieldi is composed of 33.12% A bases, 41.64% T bases, 12.08% C bases, and 13.16% G bases. The structure, nucleotide composition, and codon usage of the new mitochondrial genome are not significantly different from other longhorn mitochondrial genomes. Phylogenetic analyses revealed that Cerambycidae formed a highly supported single clade, and Vesperidae was either clustered with Cerambycidae or formed a separate clade. Interestingly, B. horsfieldi, B. rubus and B. lineolata were clustered with Monochamus and Anoplophora species in both analyses, with high node support. Additionally, the Vesperidae Spiniphilus spinicornis and Vesperus sanzi and the 19 Cerambycidae species formed a sister clade in the Bayesian analysis. Our results have produced new complete mitogenomic data, which will provide information for future phylogenetic and taxonomic research, and provide a foundation for future relevant research.
Evolutionary relationships, mitogenome, pest species, protein-coding genes, ribosomal RNA genes, secondary structure, tRNA
The longhorn beetle family Cerambycidae is one of the larger families in Coleoptera, with about 40 000 recorded species worldwide. As of 2005, the recorded number of longhorn beetle species in China had surpassed 3100 (
The mitochondrial genome is widely used in population genetics, phylogenetics and molecular evolution studies due to its maternal inheritance, stable genome composition, relatively conservative gene arrangement, and rare occurrence of recombination (
The specimens of B. horsfieldi, three male adults and one female adult, were collected from the host plant Populus tomentosa in Xishan Forest Park, Nanchong City, Sichuan Province, China, on June 17, 2022 (30.803038°N, 106.063072°E, alt. 480 m), and were deposited in the Laboratory of Forest Conservation, College of Life Science, China West Normal University (Voucher No. SCNC-BH-20220617.1-3 for male adult, SCNC-BH-20220617.4 for female adult). These specimens were preserved in 95% ethanol at -24 °C for long-term storage in the specimen collection room at China West Normal University. The total genomic DNA was extracted from muscle tissue of individual specimens using the Ezup Column Animal Genomic DNA Purification Kit (Shanghai, China), following the manufacturer’s instructions. For sequencing, the extracted DNA was stored at -24 °C.
Next-generation sequencing and assembly were performed by Beijing Aoweisen Gene Technology Co. Ltd (Beijing, China) to obtain the complete mitogenome of B. horsfieldi for the first time. After quantification of the total genome DNA, a whole-genome shotgun strategy was employed for sequencing on the Illumina HiSeq platform. A total of 14 348 102 paired-end reads with a read length of 150 bp were obtained from the sequencing process. Among these, 22 966 sequences were used for mitochondrial genome assembly. Based on the invertebrate genetic code, such as Batocera lineolata Chevrolat, 1852 (MF521888), B. lineolata (MW629558) and B. rubus Linnaeus, 1758 (OM161963), the assembly of B. horsfieldi was carried out by referring to
A total of 21 complete mitochondrial genomes of longhorn beetles, including one newly sequenced species (B. horsfieldi), as well as the complete mitochondrial genomes of Dryops ernesti DesGozis, 1886 and Heterocerus parallelus Gebler, 1830, were used in this study (Table
GenBank accession numbers of species used in this study (accession number of the newly sequenced species in bold).
Family | Genus | Species | GenBank Accession number |
---|---|---|---|
Cerambycidae | Monochamus | Monochamus sparsutus | NC053906.1 |
Monochamus alternatus | NC050066.1 | ||
Monochamus sartor urussovii | OP856519.1 | ||
Monochamus saltuarius | OP169419.1 | ||
Agapanthia | Agapanthia amurensis | MW617354.1 | |
Agapanthia daurica | MN473114.1 | ||
Anoplophora | Anoplophora glabripennis | NC008221.1 | |
Anoplophora horsfieldi | MW364565.1 | ||
Anoplophora chinensis | NC029230.1 | ||
Batocera | Batocera lineolata | MW629558.1 | |
Batocera rubus | OM161963.1 | ||
Batocera horsfieldi | OQ785650 | ||
Xylotrechus | Xylotrechus grayii | NC030782.1 | |
Turanoclytus | Turanoclytus namaganensis | NC060874.1 | |
Demonax | Demonax pseudonotabilis | OP096419.1 | |
Allotraeus | Allotraeus orientalis | NC061181.1 | |
Arhopalus | Arhopalus unicolor | NC053904.1 | |
Aromia | Aromia bungii | NC053714.1 | |
Cephalallus | Cephalallus oberthueri | NC062854.1 | |
Vesperidae | Vesperus | Vesperus sanzi | MN473093.1 |
Spiniphilus | Spiniphilus spinicornis | NC029515.1 | |
Out groups | Dryops | Dryops ernesti | KX035147.1 |
Heterocerus | Heterocerus parallelus | KX087297.1 |
The complete mitochondrial genome of B. horsfieldi is 15 425 bp in length and is a closed circular molecule (Fig.
Base composition in different regions of the mitochondrial genome of B. horsfieldi.
Region | Length (bp) | A% | T% | C% | G% | A+T% | AT-skew | GC-skew |
---|---|---|---|---|---|---|---|---|
ND1 | 951 | 27.44 | 44.58 | 8.31 | 19.66 | 72.02 | -0.24 | 0.41 |
ND2 | 1011 | 33.14 | 40.36 | 18.30 | 8.21 | 73.50 | -0.10 | -0.38 |
ND3 | 354 | 31.36 | 43.22 | 16.38 | 9.04 | 74.58 | -0.16 | -0.29 |
ND4 | 1333 | 29.78 | 48.16 | 6.98 | 15.08 | 77.84 | -0.24 | 0.37 |
ND4L | 288 | 29.17 | 49.13 | 7.64 | 13.89 | 78.48 | -0.26 | 0.29 |
ND5 | 1717 | 31.16 | 46.88 | 7.75 | 14.21 | 78.04 | -0.20 | 0.29 |
ND6 | 504 | 37.7 | 42.86 | 13.29 | 6.15 | 80.56 | -0.06 | -0.37 |
ATP6 | 675 | 32.74 | 41.33 | 16.74 | 9.19 | 74.07 | -0.12 | -0.29 |
ATP8 | 156 | 42.95 | 44.23 | 8.97 | 3.85 | 87.18 | -0.01 | -0.40 |
COI | 1543 | 30.46 | 35.84 | 18.79 | 14.91 | 66.30 | -0.08 | -0.12 |
COII | 688 | 32.27 | 38.52 | 18.17 | 11.05 | 70.79 | -0.09 | -0.24 |
COIII | 787 | 30.50 | 38.25 | 17.03 | 14.23 | 68.75 | -0.11 | -0.09 |
CYTB | 1143 | 32.37 | 38.58 | 17.76 | 11.29 | 70.95 | -0.09 | -0.22 |
Rrnl | 1283 | 34.53 | 39.91 | 7.17 | 18.39 | 74.44 | -0.07 | 0.44 |
Rrns | 751 | 33.95 | 43.14 | 7.46 | 15.45 | 77.09 | -0.12 | 0.35 |
rRANs | 2034 | 34.32 | 41.10 | 7.28 | 17.31 | 75.42 | -0.09 | 0.41 |
tRNAs | 1450 | 38.55 | 37.79 | 10.07 | 13.59 | 76.34 | 0.01 | 0.15 |
13PCGs | 11150 | 31.42 | 42.13 | 13.60 | 12.85 | 73.55 | -0.15 | -0.03 |
A+T-rich region | 791 | 43.99 | 43.11 | 6.83 | 6.07 | 87.10 | 0.01 | -0.06 |
Whole Genome | 15425 | 33.12 | 41.64 | 12.08 | 13.16 | 74.76 | -0.11 | 0.04 |
Mitogenome map of B. horsfieldi. ND1, ND2, ND3, ND4, ND4L, ND5 and ND6 are in yellow. COI, COII and COIII are in magenta. ATP6 and ATP8 are in green. CYTB is in pink. rrnl and rns are in red. all tRNAs are in dark blue and the control region is in white. Figure
The total length of the 13 PCGs in the mitochondrial genome of B. horsfieldi is 11 150 bp, accounting for 72.29% of the entire genome. Among the 13 PCGs, four genes (ND4, ND4L, ND5, and ND1) are encoded on the N strand, while the other nine genes (COI, COII, COIII, ATP8, ATP6, NAD2, NAD3, NAD6, and CYTB) are encoded on the J strand. Among the 13 PCGs, the longest gene is COX1, but its A+T content is the lowest at 66.30%, while the shortest gene is ATP8, but its A+T content is the highest at 87.18%. The start codons for COI, COII, ATP8, ND5, and ND6 are ATT, while the start codon for ND1 is TTG. The start codons for COIII, ATP6, ND3, ND4, ND4L, and CYTB are ATN (N represents G or C) (Table
Characteristics of B. horsfieldi mitochondrial genome. J and N represent the positive and negative chains of mitochondrial genome, respectively.
Gene | Strand | Region | Length (bp) | Start codon | Stop codon | Anticodon |
---|---|---|---|---|---|---|
tRNA-Ile | J | 1–68 | 68 | GAU | ||
tRNA-Gln | N | 70–138 | 69 | UUG | ||
tRNA-Met | J | 138–206 | 69 | CAU | ||
ND2 | J | 207–1217 | 1011 | ATC | TAA | |
tRNA-Trp | J | 1216–1281 | 66 | UCA | ||
tRNA-Cys | N | 1274–1336 | 63 | GCA | ||
tRNA-Tyr | N | 1337–1402 | 66 | GUA | ||
COI | J | 1395–2937 | 1543 | ATT | T | |
tRNA-Leu2 | J | 2938–3002 | 65 | UAA | ||
COII | J | 3003–3690 | 688 | ATT | T | |
tRNA-Lys | J | 3691–3759 | 69 | UUU | ||
tRNA-Asp | J | 3760–3827 | 68 | GUC | ||
ATP8 | J | 3828–3983 | 156 | ATT | TAG | |
ATP6 | J | 3977–4651 | 675 | ATG | TAA | |
COIII | J | 4651–5437 | 787 | ATG | T | |
tRNA-Gly | J | 5438–5501 | 64 | UCC | ||
ND3 | J | 5502–5855 | 354 | ATC | TAG | |
tRNA-Ala | J | 5854–5917 | 64 | UGC | ||
tRNA-Arg | J | 5918–5979 | 62 | UCG | ||
tRNA-Asn | J | 5979–6045 | 67 | GUU | ||
tRNA-Ser1 | J | 6046–6112 | 67 | UCU | ||
tRNA-Glu | J | 6113–6176 | 64 | UUC | ||
tRNA-Phe | N | 6176–6239 | 64 | GAA | ||
ND5 | N | 6240–7956 | 1717 | ATT | T | |
tRNA-His | N | 7957–8019 | 63 | GUG | ||
ND4 | N | 8020–9352 | 1333 | ATG | T | |
ND4L | N | 9346–9633 | 288 | ATG | TAA | |
tRNA-Thr | J | 9636–9700 | 65 | UGU | ||
tRNA-Pro | N | 9701–9765 | 65 | UGG | ||
ND6 | J | 9768–10271 | 504 | ATT | TAA | |
CYTB | J | 10271–11413 | 1143 | ATG | CTC | |
tRNA-Ser2 | J | 11416–11483 | 68 | UGA | ||
ND1 | N | 11504–12454 | 951 | TTG | TAG | |
tRNA-Leu1 | N | 12456–12520 | 65 | UAG | ||
rRNl | N | 12521–13803 | 1283 | |||
tRNA-Val | N | 13804–13872 | 69 | UAC | ||
rRNs | N | 13873–14623 | 751 | |||
Control region | J | 14624–15414 | 791 |
The total length of the 22 tRNAs in B. horsfieldi is 1450 bp, ranging from 61 to 70 bp (Table
The outcomes revealed that all tRNA genes conform to the standard cloverleaf structure, with the exception of tRNA-ser1, which is lacking a characteristic feature - the dihydrouridine arm (Fig.
Thirteen protein-coding genes (PCGs) and two ribosomal RNA (rRNA) sequences were utilized to construct the phylogenetic trees of B. horsfieldi via maximum likelihood (ML) and Bayesian inference (BI) methods (Figs
The complete mitochondrial genome sequence of B. horsfieldi obtained in this work enriches the species data of Batocera, Cerambycidae. Comparative analysis and phylogenetic analysis of the sequence features of Cerambycidae mitochondrial genomes were carried out by combining 13 protein-coding genes (PCGs) and two ribosomal RNA (rRNA) data from 21 species of longhorn beetles publicly available on NCBI. The results of this study will lay the foundation for population genetics research on B. horsfieldi and phylogenetic reconstruction of the Cerambycidae.
The secondary structures of tRNAs in Cerambycidae mitochondrial genomes have a typical cloverleaf structure, except for the absence of the D-arm in tRNA-ser1 (AGN) (
Currently, traditional taxonomic studies of longhorn beetles are mainly based on morphological features; however, there are still many controversies due to the complexity and instability of the morphological features. In this study, a phylogenetic tree was constructed using BI and ML methods based on 13 PCGs and 2 rRNAs, exhibiting differences with previous related studies. For instance, our results suggest that A. glabripennis and A. chinensis belong to the same branch, whereas in the study by
In this study, we successfully obtained the complete mitochondrial genome of Batocera horsfieldi for the first time, which is 15 425 bp in length. The mitochondrial genome of B. horsfieldi exhibits a molecular pattern similar to that of other Cerambycidae species, comprising 37 gene segments, including 22 transfer RNAs, two ribosomal RNAs, 13 protein-coding genes (PCGs), and an A + T-rich region typical of other Cerambycidae mitogenomes. Four of the 13 PCGs are encoded on the N strand, while the remaining nine genes are encoded on the J strand. The start codons of COI, COII, ATP8, ND5, and ND6 are ATT, while that of ND1 is TTG, and the start codons of COIII, ATP6, ND3, ND4, ND4L, and CYTB are ATN (N represents G or C). Seven of the 13 PCGs share typical stop codons (TAA and TAG), while CYTB uses CTC as a stop codon and the remaining four PCGs use a single T residue as a stop codon. All tRNAs exhibit typical cloverleaf structures except for tRNASer1. Phylogenetic analyses revealed that Cerambycidae formed a highly supported single clade, and Vesperidae was either clustered with Cerambycidae or formed a separate clade. Interestingly, B. horsfieldi, B. rubus and B. lineolata were clustered with Monochamus and Anoplophora species in both analyses, with high node support values. Additionally, Spiniphilus spinicornis and the 20 longhorn beetles formed a sister clade in the Bayesian analysis.
We thank associate professor Bo Luo for sample collection (College of Life Science, China West Normal University). We also wish to express our deep appreciation to the anonymous reviewers for their comments that helped to improve the quality of this manuscript.
The authors have declared that no competing interests exist.
No ethical statement was reported.
This work was funded by the Ministry of Science and Technology of the People’s Republic of China Support Program (2022YFE0115200), Sichuan Province Science and Technology Support Program (2022NSFSC0986) and China West Normal University Support Program (20A007, 20E051, 21E040, and 22kA011).
Conceptualization, Zhihang Zhuo; methodology, Junhao wu and Danping Xu; software, xinju wei; formal analysis, Junhao wu and Zhihang Zhuo; investigation, Wenkai Liao; data curation, Junhao wu; writing-original draft preparation, Junhao wu; writing-review and editing, Danping Xu and Zhihang Zhuo; supervision, Zhihang Zhuo.
Zhihang Zhuo https://orcid.org/0000-0002-2566-3172
The mitogenomes were deposited at Figshare. https://doi.org/10.6084/m9.figshare.22666192.v1 and https://doi.org/10.6084/m9.figshare.22673647.v1.
Batocera horsfieldi and 22 species
Data type: Phylogenetic (docx. file)