Research Article |
Corresponding author: Gou Xingchun ( gouxingchun@189.cn ) Academic editor: Tony Robillard
© 2020 Qiu Zhongying, Chang Huihui, Yuan Hao, Huang Yuan, Lu Huimeng, Li Xia, Gou Xingchun.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Zhongying Q, Huihui C, Hao Y, Yuan H, Huimeng L, Xia L, Xingchun G (2020) Comparative mitochondrial genomes of four species of Sinopodisma and phylogenetic implications (Orthoptera, Melanoplinae). ZooKeys 969: 23-42. https://doi.org/10.3897/zookeys.969.49278
|
In this study, the whole mitochondrial genomes (mitogenomes) from four species were sequenced. The complete mitochondrial genomes of Sinopodisma pieli, S. houshana, S. qinlingensis, and S. wulingshanensis are 15,857 bp, 15,818 bp, 15,843 bp, and 15,872 bp in size, respectively. The 13 protein-coding genes (PCGs) begin with typical ATN codons, except for COXI in S. qinlingensis, which begins with ACC. The highest A+T content in all the sequenced orthopteran mitogenomes is 76.8% (S. qinlingensis), followed by 76.5% (S. wulingshanensis), 76.4% (S. pieli) and 76.4% (S. houshana) (measured on the major strand). The long polythymine stretches (T-stretch) in the A+T-rich region of the four species are not adjacent to the trnI locus but are inside the stem-loop sequences on the major strand. Moreover, several repeated elements are found in the A+T-rich region of the four species. Phylogenetic analysis based on 53 mitochondrial genomes using Bayesian Inference (BI) and Maximum Likelihood (ML) revealed that Melanoplinae (Podismini) was a monophyletic group; however, the monophyly of Sinopodisma was not supported. These data will provide important information for a better understanding of the phylogenetic relationship of Melanoplinae.
mitogenome, phylogeny, Sinopodisma
The insect mitochondrial genome (mitogenome) is a circular double-stranded covalently closed DNA molecule, with maternal genetic characteristics of relatively small molecular mass, simple structure, high copy number, relatively conservative gene arrangement, and rapid rate of gene evolution. The mitogenome contains 13 protein-coding genes (PCGs), two ribosomal RNA genes (rRNAs), 22 transfer RNA genes (tRNAs), and one A+T-rich region. The mitochondrial genes have been widely used in identifying species, estimating evolutionary relationships and recognising both the population structure and phylogeography (
The genus Sinopodisma Chang, 1940 belongs to Melanoplinae, Acrididae, and Caelifera (
In order to better understand the phylogenetic relationship of Melanoplinae, we obtained complete mitogenome sequences of S. pieli, S. houshana, S. qinlingensis, and S. wulingshanensis and compared them in detail. The new mitogenomes data not only helped us understand the characteristics of mitogenome of this group and the differences among different species, but also provided the basis for better exploring their evolutionary relationships. Combined with the new data and the existing data, we reconstructed the phylogeny of 53 Acrididae species based on a dataset of 37 complete mitochondrial genes, which may provide new angle for discussing the relationships within the Melanoplinae.
Information on the samples analysed in the present study is summarised in Suppl. material
High-Throughput Sequencing Technique was used to sequence S. wulingshanensis. We first fragmented DNA using an ultrasonic mechanical method. Then, we built DNA library and used Illumina HiSeq 2500 to sequence the whole genome, including the mitogenomes. The average read length was approximately 125 bp. The DNA library and sequencing were supported by the Biomarker Company (
The sub-PCR was performed in a total volume of 40 μL including 14 μL of ddH2O, 2 μL of forward primer (10 μM), 2 μL of reverse primer (10 μM), 2 μL of template DNA (50 ng/μL), and 20 μL of 2×Taq PCRStar Mix. The sub-PCR was under the following conditions: initial denaturation at 96 °C for 2 min → (96 °C for 10 sec, 51.5 °C for 35 sec, 60 °C for 4 min) × 35 cycles → 72 °C for 7 min → decrease to 4 °C. Most sub-PCR products were directly sequenced by means of primer walking, and other fragments were cloned into the pGEM-T Easy vector (Promega, USA) prior to sequencing.
The Standen Package (
Fifty-three available insect mitogenomes were used the phylogenetic analyses of Acrididae. The mitogenomes of Asiotmethis jubatus (NC_025904), Filchnerella beicki (NC_024923) and Humphaplotropis culaishanensis (NC_023535) were downloaded and used as outgroup (Taxonomy of all species is based on Orthoptera Species File (Version 5.0/5.0) (
The complete mitogenomes of S. pieli, S. houshana, S. wulingshanensis and S. qinlingensis are 15,857 bp, 15,818 bp, 15,872 bp and 15,843 bp in size, respectively (Figure
Generally, A % > T % and C % > G % are common characteristics in all insects (
Nucleotide compositions of S. pieli, S. houshana, S. wulingshanensis, and S. qinlingensis.
Feature | AT% | AT-skew | GC-skew | |||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
S. q. | S. w. | S. p. | S. h. | S. q. | S. w. | S. p. | S. h. | S. q. | S. w. | S. p. | S. h. | |
Whole genome (J-strand) | 76.8 | 76.5 | 76.4 | 76.4 | 0.121 | 0.126 | 0.121 | 0.126 | -0.122 | -0.126 | -0.126 | -0.129 |
Protein-coding genes* | 76.1 | 75.7 | 75.8 | 75.6 | -0.14 | -0.141 | -0.143 | -0.14 | 0.021 | -0.001 | 0.004 | 0.019 |
First codon position | 68.8 | 68.4 | 66.3 | 68 | -0.039 | -0.042 | -0.296 | -0.046 | 0.24 | 0.214 | -0.06 | 0.241 |
Second codon position | 65.9 | 65.6 | 86.8 | 65.5 | -0.403 | -0.407 | -0.142 | -0.404 | -0.171 | -0.169 | -0.202 | -0.169 |
Third codon position | 93.8 | 93.2 | 74.4 | 93.3 | -0.029 | -0.026 | -0.006 | -0.023 | -0.026 | -0.154 | 0.194 | -0.073 |
22 tRNAs | 74 | 74.5 | 73.9 | 74.2 | 0.011 | 0.017 | 0.016 | 0.005 | 0.112 | 0.103 | 0.1 | 0.1 |
2 rRNA | 76.7 | 76.4 | 75.7 | 76.3 | -0.141 | -0.157 | -0.134 | -0.154 | 0.21 | 0.215 | 0.208 | 0.23 |
rrnL | 77.5 | 77.6 | 77.7 | 77.5 | -0.171 | -0.183 | -0.149 | -0.172 | 0.22 | 0.218 | 0.213 | 0.218 |
rrnS | 75.4 | 74.3 | 72.5 | 74.2 | -0.092 | -0.111 | -0.108 | -0.124 | 0.194 | 0.21 | 0.202 | 0.243 |
A+T-rich region | 86.3 | 86.6 | 86.5 | 86.5 | 0.146 | 0.16 | 0.152 | 0.154 | -0.165 | -0.167 | -0.147 | -0.141 |
A total of nine PCGs (ND2, COXI, COXII, ATP8, ATP6, COXIII, ND3, ND6 and CYTB) are located on the J-strand, while the others (ND5, ND4, ND4L and ND1) are located on the N-strand. The 13 PCGs start with typical ATN codons in all four species, except COXI in S. qinlingensis, which begins with ACC. Many nonstandard initiation codons have previously been reported (
The A+T content of the 13 PCGs, excluding stop codons, is observed to be 76.1%, 75.7%, 75.8% and 75.6% in S. qinlingensis, S. wulingshanensis, S. pieli and S. houshana, respectively (Table
Codon usage bias (codon bias) is a phenomenon in which specific codons are used more frequently than other synonymous codons by certain organisms during the translation of genes to proteins. With rapid progress in whole-genome sequencing, analysis of codon usage bias at the genome level, rather than for a single gene or a set of genes, has gained attention. Genome-wide investigations on the variations in codon use and codon context bias are important for understanding the functional evolution of genomes within and between species (Lu et al. 2002;
A total of 14 tRNAs (trnI, trnM, trnW, trnLUUR, trnD, trnK, trnG, trnA, trnR, trnN, trnSAGN, trnE, trnT, trnSUCN) are located on the J-strand, while the remaining tRNAs (trnQ, trnC, trnY, trnF, trnH, trnP, trnLCUN, trnV) are located on the N-strand. Moreover, 21 of the 22 tRNAs are well folded into a clover-leaf-like secondary structure, except trnSAGN, which lacks the DHU stem in all four species (Suppl. material
The lengths of the 22 tRNAs in the four species range from 64 to 71 bp in S. pieli, S. qinlingensis and S. wulingshanensis, and from 58 to 71 bp in S. houshana. According to the secondary structures and sequence alignments, the most conserved tRNAs in the four mitogenomes is trnF (Suppl. material
In the remaining tRNAs, nucleotide substitutions are mainly restricted to loops, with obvious insertion-deletion polymorphisms. In S. pieli, there are 27 non-canonical base pairs, consisting of 17 G-U pairs and 1 A-A, 2 A-G, 1 A-C, 1 U-C and 5 U-U mismatches. In S. qinlingensis, there are 21 non-canonical base pairs, consisting of 17 G-U pairs and 1 A-A, 1 A-G and 2 U-U mismatches. In S. wulingshanensis, there are 22 non-canonical base pairs, consisting of 18 G-U pairs and 1 A-A, 1 A-G and 2 U-U mismatches. In S. houshana, there are 27 non-canonical base pairs, consisting of 19 G-U pairs and 1 A-A, 2 A-G, 2 A-C, 1 U-C and 2 U-U mismatches. The possession of aberrant mismatches, loops, or extremely short arms for tRNA is common in metazoan mitogenomes (
Similar to other insect mitogenomes, rrnL is located between trnLCUN and trnV, and rrnS is located between trnV and the A+T-rich region. The lengths of rrnL are 1,343 bp, 1,313 bp, 1,369 bp and 1,313 bp in the S. pieli, S. houshana, S. wulingshanensis and S. qinlingensis mitogenomes, respectively, and the lengths of rrnS are 792 bp, 797 bp, 797 bp, and 797 bp, respectively. In the other orthopteran mitogenomes in GenBank, the lengths of rrnL range from 1,236 bp (Gryllotalpa pluvialis, NC_011302) to 1,371 bp (Pseudoxya diminuta, NC_025765), and the lengths of rrnS range from 461 bp (Ceracris kiangsu, NC_019994) to 882 bp (Ruspolia dubia, NC_009876). Therefore, the lengths of rrnL and rrnS from these four species are within the normal range. The A+T content ranges from 72.5% to 77.7% in the rRNA genes, and both rRNA genes exhibit T-skew and G-skew (Table
The A+T-rich region is the major noncoding region in insect mitogenome, which is located in the conserved position between the rrnS and trnI genes and has an A+T content of 86.5% in S. pieli and S. houshana, 86.6% in S. wulingshanensis and 86.3% in S. qinlingensis. In addition, the A+T-rich regions of all four mitogenomes favour A-skew and C-skew (Suppl. material
The long polythymine stretch and conserved sequence blocks in the A+T rich regions from four species. Note: The long polythymine stretch. T-stretch sequence was labelled with box, located in the majority strand. Within each block, nucleotides identical in the two sequences are bottom-marked with asterisks.
Some tandem repetition and conserved structural elements have been observed in the insect A+T-rich region. Comparison of our four species with Schistocerca gregaria and Oxya chinensis revealed some conserved blocks. Indeed, these A+T-rich regions have eight conserved blocks (Figure
The presence of a variable number of tandem repeat units may be useful for inferring the genetic structures of populations among closely related taxa and individuals of the same species (
The dataset of all 37 mitochondrial genes was used to perform phylogenetic analyses based on 53 Acrididae mitogenome sequences, including the four newly generated sequences, 49 other Acrididae sequences from GenBank and three outgroup sequences (Suppl. material
In Melanoplinae, the topological relationships between the genera were the same in the two trees and all species belong to the tribe Podismini. The phylogenetic results in this study supported the monophyly of Podismini. The same inference could be found in other phylogenies based on mitogenome data (
The complete mitochondrial genomes of Sinopodisma pieli, S. houshana, S. qinlingensis, and S. wulingshanensis were obtained. The mitogenomes of four species have typical genome organisation and gene arrangement order, compared to other caeliferan mitogenomes. We focused on comparative analyses of four Sinopodisma mitogenomes to find the characteristics of base composition, overlapping and intergenic regions, and tRNA secondary structures. All 13 PCGs have typical starting ATN codons, except for COXI in S. qinlingensis, which start with ACC. A+T contents in four mitogenomes are high and we found several repeated elements in the A+T-rich region of the four species. Moreover, 53 mitogenome data were used to build the phylogenetic relationship. The phylogenetic tree supported the monophyly of Melanoplinae, but do not support the monophyly of Sinopodisma.
The authors have declared that no competing interest exists. This paper received assistance from the Project of Shaanxi Key Laboratory of Brain Disorders (18NBZD10), the National Natural Science Foundation of China (81800211) and Xi’an medical university doctoral research fund (2017DOC20). We would like to thank Jianhua Huang for kindly providing us with samples.
Tables S1–S7
Data type: table excel
Explanation note: Table S1. Information on the samples analysed in the present study. Table S2. List of L-PCR primers used in this study. Table S3. Taxonomic information and GenBank accession numbers for the 53 taxa used for the phylogenetic analysis in this study. Table S4. Annotation and gene organisation of four mitochondrial genomes. Table S5. Base composition and length features of the four mitochondrial genomes used in this study. Table S6. The repeat elements in A+T-rich regions of four species. Table S7. Subset partition and its optimal model of datasets.
Figures S1–S4
Data type: images
Explanation note: Figure S1. The A+T content of mitogenomes in orthopteran species. Figure S2. The amino acid composition from S. pieli, S. houshana, S. wulingshanensis, and S. qinlingensis. Figure S3. Inferred secondary structure of tRNA families in S. pieli, S. houshana, S. wulingshanensis and S. qinlingensis mitochondrial genomes. Note: The nucleotide substitution pattern for each tRNA family is modeled using as the reference the structure determined for S. pieli. Because of the special secondary structure of trnP in S. houshana, the trnP in Figure S3 is presented separately. In nucleotide substitutions and insertions, we used different colours to represent different species (one or several). The colour of some substitutions and insertions only represent some species, which means base deletions in the location of other species. Figure S4. Putative stem-loop structures found in A+T-rich region from four species.