Research Article |
Corresponding author: Pei Wang ( wangpei601@126.com ) Academic editor: Frank Köhler
© 2014 Pei Wang, Haifang Yang, Weichuan Zhou, Chungchi Hwang, Weihong Zhang, Zhouxing Qian.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Wang P, Yang H, Zhou W, Hwang C, Zhang W, Qian Z (2014) The mitochondrial genome of the land snail Camaena cicatricosa (Müller, 1774) (Stylommatophora, Camaenidae): the first complete sequence in the family Camaenidae. ZooKeys 451: 33-48. https://doi.org/10.3897/zookeys.451.8537
|
The complete mitochondrial (mt) genome of the snail Camaena cicatricosa (Müller, 1774) has been sequenced and annotated in this study. The entire circular genome is 13,843 bp in size and represents the first camaenid mt genome, with content of 31.9%A, 37.9%T, 13.5%C and 16.7%G. Gene content, codon usage and base organization show similarity to a great extent to the sequenced mt genome from Stylommatophora, whereas, gene order is different from them, especially the positions of tRNACys, tRNAPhe, COII, tRNAAsp, tRNAGly, tRNAHis and tRNATrp. All protein coding genes use standard initiation codons ATN except for COII with GTG as start signal. Conventional stop codons TAA and TAG have been assigned to all protein coding genes. All tRNA genes possess the typical clover leaf structure, but the TψC arm of tRNAAsp and dihydrouridine arm of tRNASer(AGN) only form a simple loop. Shorter intergenic spacers have been found in this mt genome. Phylogenetic study based on protein coding genes shows close relationship of Camaenidae and Bradybaenidae. The presented phylogeny is consistent with the monophyly of Stylommatophora.
Camaena cicatricosa , Camaenidae , Stylommatophora , mitochondrial genome, secondary structure
The mitochondrial (mt) genome of metazoa usually comprise 37 genes, including 13 protein coding genes (PCGs) (COI−COIII, Cytb, ND1−ND6, ND4L, ATP6 and ATP8), two ribosomal RNA (rRNA) genes, and 22 transfer RNA (tRNA) genes (
Camaenidae, one of the most diverse families, was erected by Pilsbry in 1893 (
The mt genome of land snail is similar to other invertebrates in containing 37 genes. Since the first mt genome of Albinaria caerulea was obtained in 1995 (
Adults of C. cicatricosa were collected from Xishan Park in Guiping (23°23'58"N, 110°3'46"E), Guangxi, China in November 2, 2013. Specimens were initially preserved in 100% ethanol in the field, and then stored at -20 °C at Fujian Entry-Exit Inspection & Quarantine Bureau (FJCIQ). Total genomic DNA was extracted from the pedal muscle tissue of single individual using the DNeasy Blood and Tissue kit (Qiagen) according to the manufacturer’s instructions. Voucher specimen (FJCIQ 18483) is deposited at the Key Laboratory of Molluscan Quarantine and Identification of AQSIQ, Fujian Entry-Exit Inspection & Quarantine Bureau, Fuzhou, Fujian.
The entire genome was successfully amplified by polymerase chain reaction (PCR) in overlapping fragments with four pairs of mitochondrial universal primers from previous works (
No. of fragment | Primer name | Nucleotide sequence (5’–3’) and location | Size (bp) | Reference |
---|---|---|---|---|
1 | LCO-1490 | GGTCAACAAATCATAAAGATATTGG |
|
|
HCO-2198 | TAAACTTCAGGGTGACCAAAAAATCA |
|
||
2 | Fcoi | TGAACTGTTTATCCTCCAC (364–382) | 1908 | Present study |
RL | TAGGGTCTTCTCGTCTTT (2254–2271) | Present study | ||
3 | 16Sar-L | CGCCTGTTTATCAAAAACAT |
|
|
16Sbr-H | CCGGTCTGAACTCAGATCACGT |
|
||
4 | FL2 | CGATGTTGGATTAGGAAGTTGA (2415–2436) | 4267 | Present study |
Rcb2 | TAAAGGATTTGTTGACCCACG (6661–6681) | Present study | ||
5 | 144F | TGAGSNCARATGTCNTWYTG |
|
|
272R | GCRAANAGRAARTACCAYTC |
|
||
6 | Fcb | GTGGGTCAACAAATCCTT (6662–6679) | 816 | Present study |
Rcoii | ATGAACACCTCGGGTAGT (7460–7477) | Present study | ||
7 | FCOII | AAATAATGCTATTTCATGAYCAYG |
|
|
RCOII | GCTCCGCAAATCTCTGARCAYTG |
|
||
8 | SF1F | AAATTCCATTAGAGGGGCTTATACGCCGCC (6984–7013) | 6957 | Present study |
SF1R | CAAGAGATAGTCCCGTACCAACTATGCCGC (68–79) | Present study |
Short fragments were sequenced from both directions after purification using the BigDye Terminator Sequencing Kit (Applied Biosystems, San Francisco, CA, USA) and the ABI PRIMERTm3730XL DNA Analyzer (PE Applied Biosystems) with internal primers for primer walking. For the long fragments, the shotgun libraries of C. cicatricosa were constructed, and then the positive clones were sequenced using above kit and sequenator with vector-specific primers BcaBest primer M13-47 and BcaBest Primer RV-M.
Raw sequences were proof-read and aligned into contigs with BioEdit v.7.0.5.3 (
PCGs were aligned with Clustal X (
Phylogenetic analyses were performed based on 11 representative gastropod mt genomes from GenBank (Table
Subclass /order | Family | Species | Accession number | Reference |
---|---|---|---|---|
Stylommatophora | ||||
Camaenidae | Camaena cicatricosa | KM365408 | Present study | |
Bradybaenidae | Euhadra herklotsi | Z71693–Z71701 |
|
|
Mastigeulota kiangsinensis | KM083123 |
|
||
Helicidae | Cornu aspersum | JQ417195 |
|
|
Cepaea nemoralis | CMU23045 |
|
||
Cylindrus obtusus | JN107636 |
|
||
Succineidae | Succinea putris | JN627206 |
|
|
Clausiliidae | Albinaria caerulea | X83390 |
|
|
Achatinidae | Achatina fulica | NC024601 |
|
|
Basommatophora | Lymnaeidae | Galba pervia | JN564796 |
|
Opisthobranchia | Aplysiidae | Aplysia californica | AY569552 |
|
The complete mt genome of C. cicatricosa was a double-stranded circular molecule of 13,843 bp in length (GenBank: KM365408). It contained 13 PCGs, 22 tRNA genes, two rRNA genes, similar to other mt genomes of land snails from the order Stylommatophora. All genes were divided into two groups, encompassing 24 genes on the majority coding strand (J strand) and others on the minority coding strand (N strand) (Fig.
The mt genome of Camaena cicatricosa. The tRNA genes are labeled based on the IUPACIUB single letter amino acid codes. Genes with underline illuminate the direction of transcription from 3’ to 5’, and without underline illuminating from 5’ to 3’. Numbers and overlapping lines within the circle indicate PCR fragments amplified for sequencing (see Table
Gene | Direction | Location | Size (bp) | Anticodon | Start codon | Stop codon | Intergenic nucleotides |
---|---|---|---|---|---|---|---|
COI | F | 1–1527 | 1527 | ATG | TAG | ||
tRNAVal | F | 1527–1585 | 59 | 1557–1559 TAC | –1 | ||
lrRNA | F | 1586–2582 | 997 | 0 | |||
tRNALeu(CUN) | F | 2583–2642 | 60 | 2611–2613 TAG | 0 | ||
tRNAPro | F | 2640–2702 | 63 | 2669–2671 TGG | –3 | ||
tRNAAla | F | 2705–2764 | 60 | 2735–2737 TGC | 2 | ||
ND6 | F | 2784–3251 | 468 | ATA | TAA | 19 | |
ND5 | F | 3202–4893 | 1692 | ATT | TAA | –50 | |
ND1 | F | 4914–5786 | 873 | ATA | TAA | 20 | |
ND4L | F | 5797–6072 | 276 | ATA | TAA | 10 | |
CytB | F | 6076–7188 | 1113 | ATG | TAA | 3 | |
tRNACys | F | 7185–7246 | 62 | 7215–7217 GCA | –4 | ||
tRNAPhe | F | 7249–7309 | 61 | 7279–7281 GAA | 2 | ||
COII | F | 7310–7984 | 675 | GTG | TAG | 0 | |
tRNAAsp | F | 7989–8048 | 60 | 8019–8021 GTC | 4 | ||
tRNATyr | F | 8075–8136 | 62 | 8105–8107 GTA | 26 | ||
tRNAGly | F | 8132–8191 | 60 | 8162–8164 TCC | –5 | ||
tRNAHis | F | 8188–8246 | 59 | 8218–8220 GTG | –4 | ||
tRNATrp | F | 8255–8314 | 60 | 8282–8284 TCA | 8 | ||
tRNAGln | R | 8311–8369 | 59 | 8338–8340 TTG | –4 | ||
tRNALeu(UUR) | R | 8366–8429 | 64 | 8398–8400 TAA | –4 | ||
ATP8 | R | 8431–8595 | 165 | ATG | TAA | 1 | |
tRNAAsn | R | 8597–8652 | 56 | 8620–8622 GTT | 1 | ||
ATP6 | R | 8652–9332 | 681 | ATT | TAA | –1 | |
tRNAArg | R | 9309–9366 | 58 | 9339–9341 TCG | –24 | ||
tRNAGlu | R | 9366–9430 | 65 | 9393–9395 TTC | –1 | ||
SrRNA | R | 9431–10112 | 682 | 0 | |||
tRNAMet | R | 10113–10174 | 62 | 10140–10142 CAT | 0 | ||
ND3 | R | 10165–10524 | 360 | ATA | TAA | –10 | |
tRNASer(UCN) | R | 10517–10569 | 53 | 10548–10550 TGA | –8 | ||
tRNASer(AGN) | F | 10570–10629 | 61 | 10594–10596 GCT | 0 | ||
ND4 | F | 10648–11988 | 1341 | ATA | TAA | 18 | |
tRNAThr | R | 11940–11999 | 60 | 11967–11969 TGT | –49 | ||
COIII | R | 11965–12792 | 828 | ATT | TAA | –35 | |
tRNAIle | F | 12822–12885 | 64 | 12852–12854 GAT | 29 | ||
ND2 | F | 12887–13828 | 942 | ATG | TAA | 1 | |
tRNALys | F | 13790–13843 | 54 | 13819–13821 TTT | –39 |
The length of PCGs was 10,941bp, accounting for 79.04% of the whole mt genome (Table
Proportion of nucleotides | ||||||||
---|---|---|---|---|---|---|---|---|
Feature | %A | %T | %G | %C | %A+T | AT Skew | GC Skew | No. of nucleotides |
Whole genome | 31.90 | 37.90 | 16.70 | 13.50 | 69.80 | –0.09 | 0.11 | 13843 |
Protein coding genes | 31.18 | 38.14 | 17.05 | 13.64 | 69.32 | –0.10 | 0.11 | 10941 |
Protein coding genes (J) | 28.83 | 40.41 | 17.54 | 13.23 | 69.24 | –0.17 | 0.14 | 8907 |
Protein coding genes (N) | 28.22 | 41.45 | 15.44 | 14.90 | 69.67 | –0.19 | 0.02 | 2034 |
tRNA genes | 34.95 | 36.46 | 15.81 | 12.78 | 71.41 | –0.02 | 0.11 | 1322 |
tRNA genes (J) | 33.96 | 36.80 | 17.51 | 11.72 | 70.77 | –0.04 | 0.20 | 845 |
tRNA genes (N) | 35.85 | 36.69 | 14.68 | 12.79 | 72.54 | –0.01 | 0.07 | 477 |
rRNA genes | 35.14 | 37.28 | 14.83 | 12.75 | 72.42 | –0.03 | 0.08 | 1679 |
The 22 tRNA genes typically found in metazoan mt genomes were also discovered in C. cicatricosa, and 18 of them were determined by tRNAscan-SE (
The length of tRNA genes ranged from 53 to 65 bp (Table
The rRNA genes comprising large rRNA subunit (lrRNA) and small rRNA subunit (srRNA) are presumed to block in the spaces of flanking genes (
Like other snail mt genomes, the nucleotide composition of the C. cicatricosa mt genome was obviously biased toward adenine and thymine (A = 31.90%, T = 37.90%, C = 13.50%, G = 16.70%). The entire mt genome had a high A+T content of 69.80%, by the composition of 69.32% in PCGs, 71.41% in tRNA genes, 72.42% in rRNA genes. Nucleotide bias can also be reflected by codon usage. Evidently, we can see that NNA and NNU were applied frequently in most PCGs. Furthermore, codons TTT (phenylalanine), TTA (leucine), ATT (isoleucine) and ATA (methionine) which were used widely were all composed of A and T. Especially, more and more codons were biased in favor of those codons with A or T in the third position (Fig.
The nucleotide composition of metazoan mt genomes usually demonstrate an obvious strand bias (
The noncoding regions of C. cicatricosa mt genome contained some short intergenic spacers. These short sequences possibly acted as splicing recognition sites during the process of transcription (
A large noncoding region called control region or AT-rich region is commonly seen in metazoan mt genomes (
ML tree was estimated according to the GTR+G+I substitution model selected by AIC. The ML and MP trees (Fig.
We sincerely thank Professor De-Niu Chen of Institute of Zoology, Chinese Academy of Science for helpful comments on the manuscript. This research is supported by National Natural Science Foundation of China (31372162; 31040072), Natural Science Foundation in Fujian Province (2012J01063), Science and Technology Program in Fujian Province (2012I0001), National Science Council, Taiwan (NSC 98-2621-B-390-001) and Public Science and Technology Research Funds Projects of General Administration of Quality Supervision, Inspection and Quarantine of the People’s Republic of China (No. 201410076).