Research Article |
Corresponding author: Jairo Arroyave ( jarroyave@ib.unam.mx ) Academic editor: Caleb McMahan
© 2022 Adán Fernando Mar-Silva, Jairo Arroyave, Píndaro Díaz-Jaimes.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Mar-Silva AF, Arroyave J, Díaz-Jaimes P (2022) The complete mitochondrial genome of the Mexican-endemic cavefish Ophisternon infernale (Synbranchiformes, Synbranchidae): insights on patterns of selection and implications for synbranchiform phylogenetics. ZooKeys 1089: 1-23. https://doi.org/10.3897/zookeys.1089.78182
|
Ophisternon infernale is one of the 200+ troglobitic fish species worldwide, and one of the two cave-dwelling fishes endemic to the karstic aquifer of the Yucatán Peninsula, Mexico. Because of its elusive nature and the relative inaccessibility of its habitat, there is virtually no genetic information on this enigmatic fish. Herein we report the complete mitochondrial genome of O. infernale, which overall exhibits a configuration comparable to that of other synbranchiforms as well as of more distantly related teleosts. The KA/KS ratio indicates that most mtDNA PCGs in synbranchiforms have evolved under strong purifying selection, preventing major structural and functional protein changes. The few instances of PCGs under positive selection might be related to adaptation to decreased oxygen availability. Phylogenetic analysis of mtDNA comparative data from synbranchiforms and closely related taxa (including the indostomid Indostomus paradoxus) corroborate the notion that indostomids are more closely related to synbranchiforms than to gasterosteoids, but without rendering the former paraphyletic. Our phylogenetic results also suggest that New World species of Ophisternon might be more closely related to Synbranchus than to the remaining Ophisternon species. This novel phylogenetic hypothesis, however, should be further tested in the context of a comprehensive systematic study of the group.
Blind swamp eel, karst aquifer, mitogenome, systematics, troglobitic, Yucatan Peninsula
Ophisternon infernale (Synbranchiformes, Synbranchidae), commonly known as the blind swamp eel, is a rare and elusive freshwater teleost fish endemic to the cenotes and submerged caves of the Yucatan Peninsula (YP) in southeastern Mexico. Like most troglobites, O. infernale exhibits typical regressive troglomorphic traits associated with life in absolute darkness, such as the absence of both pigmentation and eyes. Besides its endemism and troglomorphism, O. infernale is exceptional in that it is one of two fish species that permanently inhabit the dark and oligotrophic subterranean waters of the YP karst aquifer; the other being the Mexican blind brotula (Typhlias pearsei) (
All methods were carried out in accordance with relevant guidelines and regulations, and the study was carried out in compliance with the ARRIVE guidelines. Sampling of the O. infernale individual used to generate the mitogenome presented here was accomplished with the assistance of a professional cave diver who captured the specimen using a custom-made hand net specifically designed for efficient capture and secure storage while cave diving. The sample was collected under collecting permit SGPA/DGVS/05375/19 issued by the Mexican Ministry of Environment and Natural Resources (Secretaría de Medio Ambiente y Recursos Naturales; SEMARNAT) to JA. The sampling locality is the cenote Kan-Chin (Huhí, Yucatán), located at 20°40'11"N, 89°10'6"W. The voucher specimen was euthanized with MS-222 prior to preservation in accordance with recommended guidelines for the use of fishes in research (
The quality of the raw data was assessed with FastQC (Andrews, 2010). Good-quality sequences that did not contain ambiguous nucleotides and reads with average quality of 30Q were demultiplexed, trimmed and merged using Geneious Prime 2020.0.4 (https://www.geneious.com). Mitogenome assembly was conducted with MITObim v.1.9 (
Nucleotide and amino acid composition, codon usage profiles of protein-coding genes (PCGs), Relative Synonymous Codon Usage (RSCU), and characterization the non-coding mtDNA control region (CR) were computed with mega X (
We investigated patterns of selection on PCGs on a mitogenomic scale and phylogenetic relationships among major synbranchiform lineages based on all mitogenomic comparative data for the group available on GenBank. To measure of the strength and mode of natural selection acting on PCGs, we estimated the ratio of non-synonymous (KA) to synonymous (KS) substitutions (KA/KS, also known as ω or dN/dS) using the HyPhy 2.5 package (
The complete mitochondrial genome of O. infernale presented herein (GenBank accession number OM388306) is 16,804 bp in total length (Fig.
Mitochondrial genes and associated features of O. infernale. Intergenic space (IGS) described as intergenic (+) or overlapping nucleotides (–). AA = amino acid.
Locus | Type | One-letter code | Start | End | Length (bp) | Strand | # of AA | Anticodon | Start codon | Stop codon | IGS |
---|---|---|---|---|---|---|---|---|---|---|---|
tRNAPhe | tRNA | F | 1 | 69 | 69 | H | GAA | 0 | |||
12s rRNA | rRNA | 70 | 1017 | 948 | H | 0 | |||||
tRNAVal | tRNA | V | 1018 | 1091 | 74 | H | TAC | 0 | |||
16s rRNA | rRNA | 1092 | 2766 | 1092 | H | 0 | |||||
tRNA-Leu | tRNA | L | 2767 | 2840 | 74 | H | TAA | 63 | |||
NAD1 | Protein-coding | 2904 | 3872 | 951 | H | 316 | ATG | TAA | 7 | ||
tRNAIle | tRNA | I | 3880 | 3949 | 70 | H | GAT | 8 | |||
tRNAGln | tRNA | Q | 3958 | 4028 | 71 | L | TTG | –1 | |||
tRNAMet | tRNA | M | 4028 | 4097 | 70 | H | CAT | 0 | |||
NAD2 | Protein-coding | 4098 | 5144 | 1047 | H | 337 | ATG | AGA | –3 | ||
tRNATrp | tRNA | W | 5142 | 5211 | 70 | H | TCA | 1 | |||
tRNAAla | tRNA | A | 5213 | 5281 | 69 | L | TGC | 1 | |||
tRNAAsn | tRNA | N | 5283 | 5355 | 73 | L | GTT | 53 | |||
tRNACys | tRNA | C | 5409 | 5475 | 67 | L | GCA | 0 | |||
tRNATyr | tRNA | Y | 5476 | 5542 | 67 | L | GTA | 1 | |||
COX1 | Protein-coding | 5544 | 7082 | 1539 | H | 489 | GTG | AGA | –4 | ||
tRNASer | tRNA | S | 7127 | 7197 | 71 | L | TGA | 2 | |||
tRNAAsp | tRNA | D | 7200 | 7270 | 71 | H | GTC | 2 | |||
COX2 | Protein-coding | 7273 | 7963 | 691 | H | 225 | ATG | T | 0 | ||
tRNALys | tRNA | K | 7964 | 8036 | 73 | H | TTT | 1 | |||
ATP8 | Protein-coding | 8038 | 8205 | 168 | H | 51 | ATG | TAA | –8 | ||
ATP6 | Protein-coding | 8196 | 8878 | 683 | H | 223 | ATG | TA | 0 | ||
COX3 | Protein-coding | 8879 | 9662 | 784 | H | 249 | ATG | T | 0 | ||
tRNAGly | tRNA | G | 9663 | 9731 | 69 | H | TCC | 0 | |||
NAD3 | Protein-coding | 9732 | 10079 | 348 | H | 112 | ATG | GAC | 0 | ||
tRNAArg | tRNA | R | 10080 | 10148 | 69 | H | TCG | 0 | |||
NAD4L | Protein-coding | 10149 | 10445 | 297 | H | 97 | ATA | TAA | –5 | ||
NAD4 | Protein-coding | 10439 | 11819 | 1380 | H | 445 | ATG | T | 0 | ||
tRNAHis | tRNA | H | 11820 | 11888 | 69 | H | GTG | 0 | |||
tRNASer | tRNA | S | 11889 | 11952 | 64 | H | GCT | –1 | |||
tRNALeu | tRNA | L | 11952 | 12024 | 73 | H | TAG | 1 | |||
NAD5 | Protein-coding | 12026 | 13855 | 1830 | H | 598 | ATG | TA | –2 | ||
NAD6 | Protein-coding | 13852 | 14373 | 522 | L | 172 | ATG | T | 1 | ||
tRNAGlu | tRNA | E | 14375 | 14443 | 69 | L | TTC | 2 | |||
CYTB | Protein-coding | 14446 | 15586 | 1141 | H | 369 | ATG | T | 0 | ||
tRNAThr | tRNA | T | 15587 | 15662 | 76 | H | TGT | –1 | |||
tRNAPro | tRNA | P | 15662 | 15730 | 69 | L | TGG | 0 | |||
D-loop | Non-coding | 15731 | 16804 | 1074 | H | 0 |
Size and nucleotide composition of the complete synbranchiform mitochondrial genomes (and their concatenated PCGs) analyzed in this study. *NAD1 gene missing from published mitogenome.
Species | GenBank Accession # | Entire genome | Protein-coding genes | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Length (bp) | A(%) | T(%) | C(%) | G(%) | AT(%) | AT skew | GC skew | Length (bp) | AT(%) | AT skew | GC skew | ||
Ophisternon infernale | OM388306 | 16804 | 31.6 | 28.7 | 26.5 | 13.2 | 60.4 | 0.046 | -0.277 | 11449 | 60.1 | -0.038 | -0.348 |
Ophisternon candidum | MT436449 | 16526 | 31.5 | 27.5 | 27.9 | 13.1 | 59 | 0.067 | -0.36 | 11377 | 59.1 | -0.015 | -0.374 |
Synbranchus marmoratus* | AP004439 | 15561 | 30.7 | 26.8 | 28.5 | 14 | 57.5 | 0.067 | -0.341 | 10529 | 57.1 | -0.027 | -0.355 |
Monopterus albus | NC003192 | 16622 | 28.9 | 27.2 | 29.4 | 14.5 | 56.1 | 0.03 | -0.34 | 11430 | 54.9 | -0.052 | -0.356 |
Mastacembelus armatus | NC023977 | 16487 | 29.1 | 25.3 | 30.9 | 14.7 | 54.4 | 0.069 | -0.355 | 11404 | 53.1 | -0.013 | -0.381 |
Mastacembelus erythrotaenia | NC035141 | 16493 | 29 | 24.5 | 31.6 | 14.9 | 53.4 | 0.086 | -0.357 | 11417 | 52.2 | -0.003 | -0.382 |
Macrognathus aculeatus | KT443991 | 16543 | 30 | 26.5 | 28.7 | 14.8 | 56.4 | 0.063 | -0.322 | 11420 | 55.9 | -0.014 | -0.345 |
Macrognathus pancalus | NC032080 | 16549 | 29.7 | 26 | 29.6 | 14.7 | 55.7 | 0.664 | -0.337 | 11420 | 54.9 | -0.02 | -0.363 |
Annotated map of the mitochondrial circular genome of O. infernale. The outer ring corresponds to the H- (outermost) and L-strands, and depicts the location of PCGs (in black, except for ND6 which is encoded in the L-strand and is portrayed in red), the non-coding control region (in dark brown), tRNAs (in red), and rRNAs (in light brown). The inner ring (black sliding window) denotes GC content along the genome. Live specimen photograph taken in the Cenote Kancabchen (Homún, Yucatán), courtesy of cave diver Erick Sosa.
The 13 PCGs, altogether totaling 11,449 bp, correspond to 68.1% of the O. infernale mitogenome. These genes consist of seven regions that code for the subunits of the NADH dehydrogenase (ubiquinone) protein complex (NAD1-6, NADL4), three that code for the subunits of the enzyme cytochrome c oxidase (COX1-3), one that codes for the enzyme cytochrome b (CYTB), and two that code for the subunits 6 and 8 of the enzyme ATP synthase FO (ATP6, ATP8). Except for COX1 and ND4L, PCGs exhibit an ATG (Met) start codon, which is the standard in eukaryotic systems (
Results from the Relative Synonymous Codon Usage (RSCU) analysis for the PCGs of the mitochondrial genome of O. infernale.
Amino acid | Codon | Number | Freq. (%) | RSCU | Amino acid | Codon | Number | Freq. (%) | RSCU |
---|---|---|---|---|---|---|---|---|---|
Phe | TTT | 110 | 2.9 | 1.02 | Ala | GCA | 48 | 1.3 | 1.12 |
TTC | 105 | 2.8 | 0.98 | GCG | 10 | 0.3 | 0.23 | ||
Leu | TTA | 139 | 3.6 | 1.49 | Tyr | TAT | 119 | 3.1 | 1.13 |
TTG | 45 | 1.2 | 0.48 | TAC | 92 | 2.4 | 0.87 | ||
CTT | 144 | 3.8 | 1.54 | His | CAU | 59 | 1.5 | 1.08 | |
CTC | 85 | 2.2 | 0.91 | CAC | 50 | 1.3 | 0.92 | ||
CTA | 110 | 2.9 | 1.18 | Gln | CAA | 78 | 2 | 1.42 | |
CTG | 37 | 1 | 0.4 | CAG | 32 | 0.8 | 0.58 | ||
Ile | ATT | 134 | 3.5 | 1.17 | Asn | AAT | 101 | 2.6 | 1 |
ATC | 95 | 2.5 | 0.83 | AAC | 102 | 2.7 | 1 | ||
Met | ATA | 119 | 3.1 | 1.49 | Lys | AAA | 78 | 2 | 1.56 |
ATG | 41 | 1.1 | 0.51 | AAG | 22 | 0.6 | 0.44 | ||
Val | GTT | 31 | 0.8 | 1.18 | Asp | GAT | 34 | 0.9 | 1.1 |
GTC | 23 | 0.6 | 0.88 | GAC | 28 | 0.7 | 0.9 | ||
GTA | 32 | 0.8 | 1.22 | Glu | GAA | 50 | 1.3 | 1.49 | |
GTG | 19 | 0.5 | 0.72 | GAG | 17 | 0.4 | 0.51 | ||
Ser | TCT | 73 | 1.9 | 1.22 | Cys | TGT | 30 | 0.8 | 0.98 |
TCC | 80 | 2.1 | 1.34 | TGC | 31 | 0.8 | 1.02 | ||
TCA | 88 | 2.3 | 1.47 | Trp | TGA | 63 | 1.7 | 1.26 | |
TCG | 23 | 0.6 | 0.39 | TGG | 37 | 1 | 0.74 | ||
AGT | 42 | 1.1 | 0.7 | Arg | CGT | 14 | 0.4 | 0.67 | |
AGC | 52 | 1.4 | 0.87 | CGC | 22 | 0.6 | 1.06 | ||
Pro | CCT | 104 | 2.7 | 1.42 | CGA | 28 | 0.7 | 1.35 | |
CCC | 91 | 2.4 | 1.24 | CGG | 19 | 0.5 | 0.92 | ||
CCA | 86 | 2.3 | 1.17 | Gly | GGT | 44 | 1.2 | 1.35 | |
CCG | 12 | 0.3 | 0.16 | GGC | 36 | 0.9 | 1.11 | ||
Thr | ACT | 66 | 1.7 | 0.87 | GGA | 30 | 0.8 | 0.92 | |
ACC | 120 | 3.1 | 1.59 | GGG | 20 | 0.5 | 0.62 | ||
ACA | 101 | 2.6 | 1.34 | Stop | TAA | 83 | 2.2 | 1.78 | |
ACG | 15 | 0.4 | 0.2 | TAG | 39 | 1 | 0.84 | ||
Ala | GCT | 45 | 1.2 | 1.05 | AGA | 37 | 1 | 0.8 | |
GCC | 69 | 1.8 | 1.6 | AGG | 27 | 0.7 | 0.58 |
Results from analysis of Relative Synonymous Codon Usage (RSCU) of the mitochondrial genome of O. infernale. Codon families are plotted on the x-axis. The label for the 2, 4, or 6 codons that compose each family is shown in the boxes below the x-axis, and the colors correspond to those in the stacked columns. RSCU values are shown on the y-axis.
The mitogenome of O. infernale contains the typical 22 tRNAs usually documented for mitogenomes of other teleosts and vertebrates (
The mtDNA control region of O. infernale is 1074 bp long (15731–16804), encoded in the H-strand, and flanked by tRNAPro and tRNAPhe at the 5' and 3' ends, respectively (Fig.
Comparison (multiple sequence alignment) of the mtDNA control region of O. infernale with those of fellow teleosts Siniperca chuatsi and Cyprinion semiplotum. The alignment displays the three canonical domains distinguished by Termination Associated Sequences (TAS) of the upstream hypervariable region (in red), central conserved domain blocks (CSB-F, CSB-E, CSB-D) (in blue), and conserved sequence blocks of the downstream hypervariable region (CSB-1, CSB-2 and CSB-3) (in green).
Results from KA/KS analyses (Fig.
Our understanding of phylogenetic relationships in synbranchiform fishes is incipient compared to that of other teleost groups. Despite the fact that for the past two decades molecular systematics has been routinely employed to refine and update the classification of fishes and our knowledge of their evolutionary history (
Phylogenetic relationships of major synbranchiform lineages. Molecular phylogeny based on comparative mitochondrial PCGs from relevant available mitogenomes and the newly generated herein for O. infernale. Troglobitic cave-dwelling species are marked with an asterisk to distinguish them from surface-dwelling ones. Outgroup taxa not shown. Colored circles on nodes indicate degree of clade support as determined by bootstrap values.
The first complete annotated mitochondrial genome of O. infernale, herein reported, exhibits an organization and arrangement similar to that of other synbranchiform fishes as well as of more distantly related teleosts. Based on our comparative mitogenomic dataset, most mitochondrial PCGs in synbranchiforms appear to have evolved under strong purifying selection, which has prevented major structural and functional protein changes. The few instances of mtDNA PCGs under positive selection might be related to adaptation to decreased oxygen availability and the evolution of more metabolically efficient variants in hypogean synbranchiform lineages. Phylogenetic analysis of mtDNA comparative data from synbranchiforms and closely related taxa (including the indostomid Indostomus paradoxus) corroborate the notion that indostomids are more closely related to synbranchiforms than to gasterosteoids, but without rendering the former paraphyletic. Our phylogenetic results also suggest that New World species of Ophisternon might be more closely related to Synbranchus than to the remaining Ophisternon species. This novel phylogenetic hypothesis, however, should be further tested in the context of a comprehensive systematic study of the group.
The authors would like to thank the Laboratorio Nacional de Cómputo de Alto Desempeño (LANCAD) and CONACyT for granting computing time on the computer clusters Yotla, Miztli, and Xiuhcoatl, from the Laboratorio de Supercómputo y Visualización en Paralelo (LSVP) of the Universidad Autónoma Metropolitana Unidad Iztapalapa, the Dirección General de Cómputo y Tecnologías de Información y Comunicación of the Universidad Nacional Autónoma de México (DGTIC-