Research Article |
Corresponding author: Russell L. Minton ( minton001@gannon.edu ) Academic editor: Thierry Backeljau
© 2016 Russell L. Minton, Marco A. Martinez Cruz, Mark L. Farman, Kathryn E. Perez.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Minton RL, Martinez Cruz MA, Farman ML, Perez KE (2016) Two complete mitochondrial genomes from Praticolella mexicana Perez, 2011 (Polygyridae) and gene order evolution in Helicoidea (Mollusca, Gastropoda). ZooKeys 626: 137-154. https://doi.org/10.3897/zookeys.626.9633
|
Helicoidea is a diverse group of land snails with a global distribution. While much is known regarding the relationships of helicoid taxa, comparatively little is known about the evolution of the mitochondrial genome in the superfamily. We sequenced two complete mitochondrial genomes from Praticolella mexicana Perez, 2011 representing the first such data from the helicoid family Polygyridae, and used them in an evolutionary analysis of mitogenomic gene order. We found the mitochondrial genome of P. mexicana to be 14,008 bp in size, possessing the typical 37 metazoan genes. Multiple alternate stop codons are used, as are incomplete stop codons. Mitogenome size and nucleotide content is consistent with other helicoid species. Our analysis of gene order suggested that Helicoidea has undergone four mitochondrial rearrangements in the past. Two rearrangements were limited to tRNA genes only, and two involved protein coding genes.
Gene rearrangement, mitochondria, tRNA, homoplasy, convergence, phylogeny
Helicoidea (Mollusca, Gastropoda) is a globally distributed and diverse superfamily of terrestrial mollusks (reviewed in
The mitogenome serves as a powerful evolutionary tool given its small size and fast mutation rate relative to the nuclear genome (
Within Helicoidea, only three of the constituent 19 families (
Mitochondrial genome of Praticolella mexicana UTRGV and McAllen illustrated with an image of the species holotype (ANSP 426031). Gene order and sizes are shown relative to one another, not including non-coding regions. Genes are color coded by H (black) or L (red) strand. IUPAC single letter codes are used to identify tRNA genes.
We collected one adult P. mexicana each from the UTRGV campus in Edinburg, Texas (26.30726; -98.1714), and from a residential neighborhood in McAllen, Texas (26.2085; -98.2254). We immersed foot tissue from each snail in reconstituted BupH™ phosphate buffered saline (Thermo Scientific) and homogenized it with a Dounce homogenizer. We used EDTA-free Protease Inhibitor Cocktail (Thermo Scientific) and the Mitochondrial Isolation Kit for Tissue (Thermo Scientific) to isolate intact mitochondria from the homogenates. Mitochondrial DNA was extracted using the Mitochondrial DNA Isolation Kit (BioVision) followed by application of Plasmid-Safe™ ATP-dependent DNase (Epicenter) to remove any remaining nuclear DNA.
We purified enriched mitochondrial DNA using the Zymoclean Genomic DNA Clean and Concentrator kit (Zymo Research) and quantified it using a BioAnalyzer (Agilent Technologies). Approximately 50 ng of total DNA was used for barcoded library construction using the Nextera DNA library prep kit (Illumina), precisely following the manufacturer’s instructions. We pooled the P. mexicana samples on one flowcell and sequenced them on the MiSeq (Illumina) platform using the 2x250 bp run mode. Barcodes and deconvolution of the pooled reads was performed automatically in the BaseSpace (Illumina) server and used their native format. The CLCBio 8.0.2 de novo genome assembly tool was used to assemble the reads using default parameter settings. Genomic contigs representing mitochondrial DNA segments were subsequently identified using the CLCBio assembly Fasta files to query a BLAST database comprising the Achatina fulica mitochondrial genome (GenBank: KJ744205).
We loaded the assembly for each individual into Geneious 8 (http://www.geneious.com,
To determine the position of Polygyridae within Helicoidea, we extracted the amino acid sequences for all 13 protein-coding genes from the two new genomes. These were combined with mitochondrial genome sequences from eight other helicoid taxa, and eight stylommatophoran and one non-stylommatophoran outgroup (Table
Taxonomy | GenBank | Reference |
---|---|---|
Clade Systellommatophora | ||
Superfamily Onchidioidea | ||
Family Onchidiidae | ||
Onchidella celtica | AY345048 |
|
Clade Stylommatophora | ||
Superfamily Achatinoidea | ||
Family Achatinidae | ||
Achatina fulica | KJ744205 |
|
Superfamily Clausilioidea | ||
Family Clausiliidae | ||
Albinaria caerulea | X83390 |
|
Superfamily Helicoidea | ||
Family Bradybaenidae | ||
Aegista aubryana | KT192071 |
|
Aegista diversifamilia | KR002567 |
|
Dolicheulota formosensis | KR338956 |
|
Mastigeulota kiangsinensis | KM083123 |
|
Family Camaenidae | ||
Camaena cicatricosa | KM365408 |
|
Family Helicidae | ||
Cepaea nemoralis | U23045 |
|
Cylindrus obtusus | JN107636 |
|
Cornu aspersum | JQ417194 |
|
Family Polygyridae | ||
Praticolella mexicana McAllen | KX259343 | this study |
Praticolella mexicana UTRGV | KX278421 | this study |
Superfamily Orthalicoidea | ||
Family Cerionidae | ||
Cerion incanum | KM365085 | unpublished |
Family Orthalicidae | ||
Naesiotus nux | KT821554 |
|
Superfamily Pupilloidea | ||
Family Pupillidae | ||
Gastrocopta cristata | KC185403 |
|
Pupilla muscorum | KC185404 |
|
Family Vertiginidae | ||
Vertigo pusilla | KC185405 |
|
Superfamily Succineoidea | ||
Family Succineidae | ||
Succinea putris | JN627206 |
|
For all included mitogenomes, we determined the gene order and strand assignment. Using Cornu aspersum (JQ417194) as reference (
Approximately 12 million sequences reads derived from the UTRGV P. mexicana sample assembled into over 450,000 contigs, the largest of which was 14,275 bp in length. A BLAST search against the A. fulica mitogenome revealed that the largest contig was comprised entirely of mitochondrial sequence. The McAllen P. mexicana sample comprised 26 million reads that assembled into more than 300,000 contigs. The largest contig spanned 14,259 bp and was also composed entirely of mitochondrial sequence. After final sequence editing, both P. mexicana mitogenomes were found to be 14,008 bp in length.
The complete P. mexicana mitochondrial genomes (KX278421 UTRGV, KX259343 McAllen; Figure
Gene | Start | Stop | Length | Strand | Start codon | Stop codon | Anticodon |
---|---|---|---|---|---|---|---|
cox1 | 1 | 1525 | 1525 | H | TTG | T | |
tRNA-Val | 1526 | 1587 | 62 | H | TAC | ||
16S | 1589 | 2579 | 991 | H | |||
tRNA-Leu1 | 2580 | 2642 | 63 | H | TAG | ||
tRNA-Pro | 2640 | 2706 | 67 | H | TGG | ||
tRNA-Ala | 2706 | 2768 | 63 | H | TGC | ||
ND6 | 2769 | 3239 | 471 | H | GTG | TAA | |
ND5 | 3223 | 4893 | 1671 | H | TTG | TAG | |
ND1 | 4887 | 5768 | 882 | H | ATC | TAG | |
ND4L | 5768 | 6052 | 285 | H | GTG | TAA | |
cytB | 6054 | 7148 | 1095 | H | ATT | TAG | |
tRNA-Asp | 7149 | 7212 | 64 | H | GTC | ||
tRNA-Cys | 7209 | 7265 | 57 | H | GCA | ||
tRNA-Phe | 7270 | 7331 | 62 | H | GAA | ||
cox2 | 7332 | 8003 | 672 | H | ATG | TAG | |
tRNA-Gly | 8007 | 8068 | 61 | H | TCC | ||
tRNA-His | 8063 | 8123 | 61 | H | GTG | ||
tRNA-Tyr | 8131 | 8192 | 62 | H | GTA | ||
tRNA-Trp | 8186 | 8248 | 63 | H | TCA | ||
tRNA-Gln | 8338 | 8396 | 59 | L | TTG | ||
tRNA-Leu2 | 8397 | 8454 | 58 | L | TAA | ||
atp8 | 8458 | 8608 | 151 | L | ATG | T | |
tRNA-Asn | 8612 | 8670 | 59 | L | GTT | ||
atp6 | 8671 | 9322 | 652 | L | ATG | T | |
tRNA-Arg | 9323 | 9382 | 60 | L | TCG | ||
tRNA-Glu | 9383 | 9442 | 60 | L | TTC | ||
12S | 9443 | 10186 | 744 | L | |||
tRNA-Met | 10187 | 10248 | 62 | L | CAT | ||
ND3 | 10250 | 10597 | 348 | L | TTG | TAA | |
tRNA-Ser2 | 10598 | 10665 | 68 | L | TGA | ||
tRNA-Ser1 | 10723 | 10776 | 54 | H | GCT | ||
ND4 | 10777 | 12100 | 1324 | H | ATG | T | |
tRNA-Thr | 12010 | 12162 | 62 | L | TGT | ||
cox3 | 12163 | 12958 | 796 | L | ATT | T | |
tRNA-Ile | 12985 | 13044 | 60 | H | GAT | ||
ND2 | 13048 | 13954 | 907 | H | ATG | T | |
tRNA-Lys | 13955 | 14008 | 61 | H | TTT |
Genome size for P. mexicana is comparable with the other helicoid and stylommatophoran taxa available (Table
Whole genome composition | ||||||||
---|---|---|---|---|---|---|---|---|
Species | Size (bp) | A% | C% | G% | T% | A+T% | AT skew | GC skew |
Achatina fulica | 15057 | 0,280 | 0,171 | 0,195 | 0,355 | 0,634 | -0,118 | 0,064 |
Aegista aubryana | 14238 | 0,313 | 0,145 | 0,164 | 0,379 | 0,692 | -0,095 | 0,062 |
Aegista diversifamilia | 14039 | 0,325 | 0,133 | 0,157 | 0,386 | 0,711 | -0,086 | 0,083 |
Albinaria caerulea | 14130 | 0,328 | 0,138 | 0,155 | 0,379 | 0,707 | -0,073 | 0,059 |
Camaena cicatricosa | 13843 | 0,319 | 0,135 | 0,167 | 0,379 | 0,698 | -0,086 | 0,108 |
Cepaea nemoralis | 14100 | 0,262 | 0,189 | 0,213 | 0,336 | 0,598 | -0,125 | 0,058 |
Cerion incanum | 15177 | 0,298 | 0,158 | 0,185 | 0,360 | 0,657 | -0,095 | 0,077 |
Cylindrus obtusus | 14610 | 0,258 | 0,166 | 0,219 | 0,358 | 0,615 | -0,162 | 0,137 |
Dolicheulota formosensis | 14237 | 0,284 | 0,131 | 0,167 | 0,418 | 0,702 | -0,191 | 0,120 |
Gastrocopta cristata | 14060 | 0,308 | 0,136 | 0,172 | 0,384 | 0,692 | -0,110 | 0,116 |
Helix aspersa | 14050 | 0,307 | 0,136 | 0,165 | 0,392 | 0,699 | -0,121 | 0,097 |
Mastigeulota kiangsinensis | 14029 | 0,295 | 0,144 | 0,182 | 0,379 | 0,674 | -0,125 | 0,118 |
Naesiotus nux | 15197 | 0,336 | 0,120 | 0,147 | 0,397 | 0,733 | -0,083 | 0,100 |
Praticolella mexicana McAllen | 14008 | 0,289 | 0,126 | 0,188 | 0,398 | 0,686 | -0,159 | 0,198 |
Praticolella mexicana UTRGV | 14008 | 0,288 | 0,126 | 0,188 | 0,398 | 0,686 | -0,160 | 0,198 |
Pupilla muscorum | 14149 | 0,325 | 0,129 | 0,153 | 0,393 | 0,718 | -0,094 | 0,083 |
Succinea putris | 14092 | 0,339 | 0,109 | 0,122 | 0,430 | 0,769 | -0,113 | 0,055 |
Vertigo pusilla | 14078 | 0,326 | 0,123 | 0,155 | 0,397 | 0,722 | -0,098 | 0,116 |
Maximum-likelihood analysis of our protein sequence dataset of 19 taxa and 4,011 aligned amino acid positions yielded a single tree (Figure
Maximum likelihood phylogeny of Stylommatophora protein coding genes. Analysis in IQTREE yielded a single tree (log likelihood = -89104.188) under the mtZOA+F+I+G4 model. Branch support >50% is shown based on 10,000 ultra-fast bootstrap replicates. Helicoidea, Bradybaenidae, and Helicidae were recovered as monophyletic. Nodes A-E refer to rearrangements shown in Figure
Given the protein topology represented the more likely relationships among our included taxa, we reconstructed ancestral gene orders predicted by MLGO with that topology. The results suggested a pattern of four rearrangements in the helicoid mitochondrial genome (Figure
Ancestral gene order reconstructions for Helicoidea. Columns (A–E) correspond to labeled nodes in Figure
Gastropod mitogenomes tend to be compact (
The two P. mexicana mitogenomes differ by 71 bp, which was fewer differences than seen in Cornu aspersum from Chile (107-149 bp), the only other stylommatophoran with more than one mitogenome available (
Mitochondrial gene rearrangements are common across Metazoa (
Our protein sequence data support previous works showing the monophyly of Helicoidea, Helicidae, and Bradybaenidae. Previous work has suggested close relationships between Bradybaenidae, Camaenidae, and Polygyridae, but the monophyly of the former two families remains in question (
RLM is supported in part by faculty research and development funds and a Barrios fellowship from UHCL. MAMC and KEP would like to acknowledge UTRGV Science Education Grant #52007568 funded by the Howard Hughes Medical Institute, ADVANCE Institutional Transformation Grant (NSF# 1209210), UTRGV Faculty Research Council, and College of Sciences for financial support. This work was supported in part by the National Science Foundation (under grant HRD-1463991). Any opinions, findings, and conclusions or recommendations are those of the authors and do not necessarily reflect the views of NSF. Jolanta Jaromczyk at UK Healthcare Genomics assisted with genome assembly.