Research Article |
Corresponding author: Xuejuan Li ( lixuejuan456@163.com ) Corresponding author: Haiyan Ye ( tiantian@snnu.edu.cn ) Academic editor: Grace P. Servat
© 2022 Chao Yang, Xiaojuan Du, Yuxin Liu, Hao Yuan, Qingxiong Wang, Xiang Hou, Huisheng Gong, Yan Wang, Yuan Huang, Xuejuan Li, Haiyan Ye.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Yang C, Du X, Liu Y, Yuan H, Wang Q, Hou X, Gong H, Wang Y, Huang Y, Li X, Ye H (2022) Comparative mitogenomics of the genus Motacilla (Aves, Passeriformes) and its phylogenetic implications. ZooKeys 1109: 49-65. https://doi.org/10.3897/zookeys.1109.81125
|
The genus Motacilla belongs to Motacillidae (Passeriformes), where mitochondrial features are poorly understood and phylogeny is controversial. Whole mitochondrial genome (mitogenome) data and large taxon sampling are considered to be ideal strategies to obtain this information. We generated four complete mitogenomes of M. flava, M. cinerea, M. alba and Dendronanthus indicus, and made comparative analyses of Motacilla species combined with mitogenome data from GenBank, and then reconstructed phylogenetic trees based on 37 mitochondrial genes. The mitogenomes of four mitogenome sequences exhibited the same gene order observed in most Passeriformes species. Comparative analyses were performed among all six sampled Motacilla mitogenomes. The complete mitogenomes showed A-skew and C-skew. Most protein-coding genes (PCGs) start with an ATG codon and terminate with a TAA codon. The secondary structures of RNAs were similar among Motacilla and Dendronanthus. All tRNAs except for trnS(agy) could be folded into classic clover-leaf structures. Three domains and several conserved boxes were detected. Phylogenetic analysis of 90 mitogenomes of Passeriformes using maximum likelihood (ML) and Bayesian inference (BI) revealed that Motacilla was a monophyletic group. Among Motacilla species, M. flava and M. tschutschensis showed closer relationships, and M. cinerea was located in a basal position within Motacilla. These data provide important information for better understanding the mitogenomic characteristics and phylogeny of Motacilla.
Comparative analysis, mitogenome, phylogeny
In most animals, the mitochondrial genome (mitogenome) contains 13 protein-coding genes (PCGs), two rRNA genes (rRNAs), 22 tRNA genes (tRNAs), and one noncoding region (the control region, CR) (
Passeriformes comprises 6533 currently described species (
An increasing number of avian mitogenome sequences are being generated with high-throughput sequencing technology (
Muscle samples were collected from the following species: M. flava (from China, Shaanxi Province, Hongjiannao in 2013); M. cinerea (from China, Shaanxi Province, Feng County in 2017); and M. alba and D. indicus (from China, Shaanxi Province, Lantian in 2018). All specimens of muscle samples were preserved in 100% ethanol and stored at -20 °C at the Shaanxi Institute of Zoology, Shaanxi Province, China.
The mitogenome of M. flava was sequenced by Genesky Biotechnologies Inc., Shanghai, China, using the Illumina HiSeq2000 platform, while those of M. cinerea, M. alba and D. indicus were sequenced at Biomarker Technologies Inc., Beijing, China, using the Illumina Xten platform and a 150 bp paired-end strategy. Genomic DNA was extracted using a DNeasy kit and fragmented using ultrasonic methods to prepare a small-inserted-fragment library. The library data were obtained via Bridge PCR and Illumina paired-end sequencing.
There were 15,149,744 paired-end raw reads of M. flava, of which 47,390 reads were used for mitogenome assembly, with average coverage of 417.1X. There were 20,702,440 paired-end raw reads of M. cinerea, with clean data 6.92 G. A total of 261,229 reads were used for mitogenome assembly, with average coverage of 2256.2X. There were 7,868,047 raw reads in M. alba, with 7,860,296 reads with clean data, and 8,430,436 raw reads of D. indicus, with 8,420,710 reads with clean data.
The raw data from M. flava, M. cinerea and M. alba were quality trimmed with CLC Genomics Workbench 9.5.2 (CLC bio, Aarhus, Denmark) using the default parameters. Mitogenome assembly was performed in MITOBIM 1.8 (
The six mitogenomes (M. flava, M. cinerea and M. alba mitogenomes from collected specimens combined with M. tschutschensis, M. alba and M. cinerea genomes from GenBank) were used for comparative analysis. A mitogenome of M. lugens (KU246035/NC_029703) was excluded because this has been shown to represent a chimera (
A total of 90 mitogenomes of Passeriformes were used to reconstruct phylogenetic relationships; the included mitogenomes came from 12 taxonomic families with Aethopyga gouldiae (Nectariniidae) used as an outgroup (Suppl. material
The obtained complete mitogenomes of M. flava, M. cinerea, M. alba and D. indicus ranged from 16,831 bp to 16,870 bp in length and each contained 37 genes and a noncoding region (CR) (Fig.
The gene orders and nucleotide compositions of the six sampled Motacilla mitogenomes were generally similar. For instance, the A+T content ranges from 53.5% to 53.9%, which was slightly higher than the G+C contents. All mitogenomes showed a tendency toward A-skew and obvious C-skew (Suppl. material
The A+T contents of the 13 PCGs excluding stop codons ranged from 52.4% to 52.9% in sampled Motacilla mitogenomes (Suppl. material
The first and last codons of the PCGs of Motacilla were compared (Suppl. material
RSCU analysis indicated that among all PCGs, codons including A or C at the third position were frequently overused relative to other synonymous codons (Fig.
Similar to other avian mitogenomes, rrnS was found to be located between trnF and trnV, and rrnL was located between trnV and trnL(uur). The length of rrnS was 975 bp in M. alba (MN356232) and 973 bp in the other Motacilla mitogenomes, while the length of rrnL was 1595 bp in all Motacilla species. The A+T content was slightly greater than the G+C content in the rRNA genes, ranging from 52.2% to 52.3% in rrnS and 55.2% to 55.4% in rrnL, and both rRNA genes exhibited A-skew and C-skew (Suppl. material
The rrnS included three domains and 47 helices in M. flava (Suppl. material
A total of eight tRNAs (trnQ, trnA, trnN, trnC, trnY, trnS(ucn), trnP and trnE) were located on the N-strand, while the remaining 14 tRNAs were located on the J-strand (Fig.
Twenty-one of the 22 tRNAs of M. flava were folded into a clover-leaf-like secondary structure, with the exception of trnS(agy), lacking a dihydrouridine (DHU) stem (Suppl. material
The CR was located between the trnE and trnF genes and were 1243–1250 bp in length. The average A+T content was 56.2% among all sampled Motacilla mitogenomes, which was slightly higher than that of G+C. The CRs showed a tendency toward T-skew and C-skew (Suppl. material
The CR regulates the replication of the H strand and the transcription of all mitochondrial genes (
The proportions of variable sites among the three domains were 3.6%, 2.4% and 9.0%, respectively. Thus, most variation was found in domain III, similar to the findings of previous studies (
The ML and BI phylogenetic trees were reconstructed using the mtDNA dataset, showing consistent topological results among Motacillidae (Fig.
Within Motacilla, the following phylogenetic relationships were recovered: (((M. flava+M. tschutschensis)+M. alba)+M. cinerea), similar to previous studies (Suppl. material
The complete mitogenomes of Motacilla flava, M. cinerea, M. alba and Dendronanthus indicus were sequenced and were shown to present the typical genome organization and gene order found in other Passeriformes mitogenomes. We focused on comparative analyses of the six mitogenomes to identify the mitogenomic characteristics of the genus Motacilla, such as the base composition, codon usage and RNA secondary structures. The complete mitogenomes showed a tendency toward A-skew and C-skew. Most PCGs start with typical ATG codons and terminated with TAA codons. All tRNAs could be folded into classic clover-leaf structures except for trnS(agy), which lacked a DHU arm. In addition, 90 mitogenomes of Passeriformes were used to build the tree of phylogenetic relationships. The phylogenetic tree supported the monophyly of Motacillidae. Within Motacilla, the phylogenetic topology of (((M. flava+M. tschutschensis)+M. alba)+M. cinerea) was recovered.
We would like to thank Liliang Lin for kindly providing us with results of data analyses. This work was supported by the National Natural Science Foundation of China (Grant No. 31601846 and 31801993), Natural Science Foundation of Shaanxi Province, China (Grant No. 2017JQ3014 and 2020JM-270), China Postdoctoral Science Foundation (Grant No. 2016M602760), Fundamental Research Funds for the Central Universities, China (Grant No. GK202003052 and GK202103068), Projects of the Department of Science and Technology of Shaanxi Province, China (Grant No. 2019NY-089 and 2018ZDXM-NY-071), and the First Wildlife Resource Survey Project of Xi’an (Grant No. XCZX2015-0297-3). The complete mitogenomes of Motacilla flava, M. cinerea, M. alba and Dendronanthus indicus are openly available on GenBank at https://www.ncbi.nlm.nih.gov/nuccore, with accession No. MW929088-MW929091. CY, XD and XL analyzed the data and wrote the manuscript. CY, YH, XL and YH participated in the design of the experiments. XD, XD, YL, HY, QW, XH, HG and YW helped to improve the manuscript. All authors read and approved the final manuscript.
Figure S1
Data type: Image.
Explanation note: Phylogenetic hypotheses of previous studies. a:
Figure S2
Data type: Image.
Explanation note: Secondary structures of rrnS of M. flava. Note: differences within the six mitogenomes from the genus Motacilla are indicated by filled grey circles.
Figure S3
Data type: Image.
Explanation note: Secondary structures of rrnL of M. flava. Note: differences within the six mitogenomes from the genus Motacilla are indicated by filled grey circles.
Figure S4
Data type: Image.
Explanation note: Secondary structures of tRNAs of M. flava. Note: differences within the six mitogenomes from the genus Motacilla are indicated by filled grey circles.
Figure S5
Data type: Image.
Explanation note: Control region structures of M. flava. Note: differences within the six mitogenomes from the genus Motacilla are indicated with filled circles.
Figure S6
Data type: Image.
Explanation note: Maximum likelihood and Bayesian inference phylogenetic results based on the mtDNA dataset, corresponding to Fig.
Table S1
Data type: Doc file.
Explanation note: Mitogenome sequences employed for reconstructing phylogenetic trees.
Table S2
Data type: Doc file.
Explanation note: Nucleotide composition and bias of six mitogenomes of the genus Motacilla. Notes: Stop codons of protein-coding genes were excluded; AT-skew=[A-T]/[A+T], GC-skew=[G-C]/[G+C]. The sequenced mitogenome species in this study are shown in the bold format.
Table S3
Data type: Doc file.
Explanation note: Initial and terminal codons of protein-coding genes in the six mitogenomes from the genus Motacilla. Notes: The sequenced mitogenome species in this study are shown in the bold format.