Research Article |
Corresponding author: Toni I. Gossmann ( toni.gossmann@tu-dortmund.de ) Corresponding author: Mansour Aliabadian ( aliabadi@um.ac.ir ) Academic editor: George Sangster
© 2025 Sahar Javaheri Tehrani, Elham Rezazadeh, Niloofar Alaei Kakhki, Leila Nourani, Vali Ebadi, Sahar Karimi, Mojtaba Karami, Fatemeh Ashouri, Asaad Sarshar, Toni I. Gossmann, Mansour Aliabadian.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Javaheri Tehrani S, Rezazadeh E, Alaei Kakhki N, Nourani L, Ebadi V, Karimi S, Karami M, Ashouri F, Sarshar A, Gossmann TI, Aliabadian M (2025) DNA barcoding of passerine birds in Iran. ZooKeys 1236: 19-39. https://doi.org/10.3897/zookeys.1236.143336
|
Exploring genetic diversity is essential for precise species delimitation, especially within taxonomically complex groups like passerine birds. Traditional morphological methods often fail to resolve species boundaries; however, DNA barcoding, particularly through the mitochondrial cytochrome c oxidase subunit I (COI) gene, provides a powerful complementary method for accurate species identification. This study establishes a comprehensive DNA barcode library for Iranian passerine birds, analyzing 546 COI sequences from 94 species across 23 families and 53 genera. There is a pronounced barcode gap, with average intraspecific divergence at 0.41% and interspecific divergence at 18.6%. Notable intraspecific variation emerged in the Persian nuthatch (Sitta tephronota) and the Lesser whitethroat (Curruca curruca), while the European goldfinch (Carduelis carduelis) and the grey-crowned goldfinch (Carduelis caniceps) showed limited genetic differentiation despite marked morphological distinctions. Phylogenetic analysis revealed significant east-west genetic splits in C. curruca and S. tephronota, reflecting Iran’s geographic and zoogeographic boundaries. These findings demonstrate the effectiveness of DNA barcoding in elucidating biogeographic patterns, emphasizing Iran’s key role as an ornithological crossroads for avian biodiversity. Moreover, our results suggest that much of the genetic variation in the COI gene arises from synonymous mutations, highlighting the role of purifying selection in shaping mtDNA diversity across species.
COI gene, Genetic diversity, Selection, Species delimitation
Genetic diversity is a fundamental aspect of biodiversity, representing the variety of genetic information within and among species (
Birds represent one of the most extensively studied animal groups in DNA barcoding projects (
Iran is recognized as a globally significant biodiversity hotspot, characterized by its remarkable species richness and high levels of endemism (
Our main objectives are (i) to evaluate genetic variation in COI among passerine birds in Iran—a region characterized by numerous contact zones between passerine species (
The study area covers the northeastern and western regions of Iran (Suppl. material
DNA was extracted from blood and feather samples using a standard salt extraction method (
Primer pairs that have been successfully used to obtain bird barcodes. This table includes forward and reverse primer names, primer sequences, annealing temperature, and citation.
Primer Name | Primer Sequences (5‘-3‘) | Annealing Temperature | Citation |
---|---|---|---|
BirdF1 | TTCTCCAACCACAAAGACATTGGCAC | 50 °C | ( |
BirdR1 | ACGTGGGAGATAATTCCAAATCCTG | 50 °C | ( |
BirdR2 | ACTACATGTGAGATGATTCCGAATCCAG | 50 °C | ( |
PasserF1 | CCAACCACAAAGACATCGGAACC | 58 °C | ( |
PasserR1 | GTAAACTTCTGGGTGACCAAAGAATC | 58 °C | ( |
AWCF1 | CGCYTWAACAYTCYGCCATCTTACC | 57.5 °C | ( |
AWCR6 | ATTCCTATGTAGCCGAATGGTTCTTT | 57.5 °C | ( |
Sequences were aligned and edited in BIOEDIT v. 7.0.1 (
The phylogenetic tree was rooted with one representative of Galliformes (Gallus gallus). Two criteria used for identifying and confirming species based on their DNA barcode if: a) it was monophyletic (i.e., the species formed a single cluster) and b) it did not share a barcode with any other species. Consequently, high intraspecific genetic distances in the COI gene are frequently utilized to predict cryptic or potentially new species. In our dataset, this pattern is observed in two species showing elevated genetic distances: the Persian Nuthatch Sitta tephronota Sharpe, 1872 and the Lesser Whitethroat Curruca curruca Linnaeus, 1758 (Suppl. material
Based on the COI sequence fragments and the subsequent global alignment we obtained genetic diversity at 0-fold and 4-fold sites for all species. For this, we used the vertebrate mitochondrial genetic code with MEGA. We then used the Tajimas_d package from the bfx suite (https://py-bfx.readthedocs.io/en/latest/) to calculate nucleotide diversity for each species for 0-fold and 4-fold sites, respectively. We excluded species with zero diversity for either 4-fold or 0-fold sites. One can quantify effective population by dividing genetic diversity with the mutation rate per generation (π = 2Ne * µ, where Ne equals the effective population size, µ, the mutations per generation and π is the observed pairwise differences in a population genetic sample). Because we have limited knowledge of mitochondrial gene specific mutation rates for all passerine birds and only rough estimates for generation time, we cannot directly estimate effective population sizes. However, here we use genetic diversity at silent sites as a proxy for effective population size, which is not unreasonable because we restrict our analysis to passerine birds, a taxonomic group with supposedly little variation in mutation rate and generation time (
A total of 546 sequences from 94 passerine bird species were generated and uploaded to the NCBI database (publicly available, Suppl. material
Comparisons of K2P pairwise distances based on the COI gene of 96 passerine bird species. Intraspecific distances are indicated with blue bars and interspecific distances with orange bars. Left Y-axis: numbers of intraspecific comparisons; right Y-axis: numbers of interspecific comparisons.
Conversely, our analyses revealed low interspecific genetic distance between Carduelis carduelis and Carduelis caniceps (0.71%), which contrasts with the significant morphological differentiation observed (Suppl. material
Sitta tephronota includes three subspecies in Iran, i.e., S. t. dresseri (Zagros Mts. in SE Turkey to N Iraq and W Iran), S. t. obscura (NE Turkey to the Caucasus and Iran) and S. t. iranica (NE Iran and S Turkmenistan). For this species, we analyzed six samples from the western population (S. t. dresseri) and seven samples from the eastern population (S. t. iranica) (Fig.
Pairwise FST values between the studied subspecies and species of S. tephronota and S. neumayer estimated from the mitochondrial data.
Subspecies | S. t. iranica | S. t. dresseri | S. neumayer |
---|---|---|---|
S. t. dresseri | 0.91 | ||
S. neumayer | 0.93 | 0.95 | |
S. neumayer (potential admixed) | 0.50 | 0.96 | 0.97 |
Phylogenetic and haplotype network analysis of COI data for S. tephronota and the origin of study material A neighbor-joining tree, values on the branches shows bootstrap values and, an asterisk indicates Iranian COI sequences from GenBank B distribution range and collection sites for the samples included in the study. Distribution map of S. tephronota, with green indicating areas where the species is native resident according to bird species distribution maps of the world (https://datazone.birdlife.org); sampling sites are indicated by red triangles C haplotype network, where colors indicate the origin of the haplotypes (orange: western population; green: eastern population; blue: S. neumayer) and the number of bars at each branch indicates the number of mutations.
Curruca curruca is thought to have three breeding subspecies in Iran, including minula, althaea, and curruca and one non-breeding subspecies halimodendri Sarudny, 1911. For this species, we primarily analyzed ten samples from the eastern population and six samples from the western population (Fig.
Phylogenetic and haplotype network analysis of COI data for C. curruca and the origin of study material A neighbor-joining tree, values on the branches shows bootstrap values and, an asterisk indicates Iranian COI sequences from GenBank B distribution range and collection sites for the samples included in the study. Approximate presumed breeding ranges of C. curruca taxa, modified from the map by
Another eastern subclade includes individuals from Sarakhs, a lower-elevation area near the Turkmenistan border, which are strongly separated from another eastern birds (BB 100%) (Fig.
Pairwise FST values between the studied subspecies and species of Curruca curruca estimated from the mitochondrial data.
Subspecies | C. curruca ssp? (east) | C. c. curruca (west) | C. c. althaea |
---|---|---|---|
C. c. curruca (west) | 1.00 | ||
C. c. althaea (east) | 0.96 | 0.99 | |
C. c. halimodendri | 0.70 | 0.95 | 0.65 |
Carduelis carduelis and C. caniceps are distributed in west and east of Iran respectively (
Phylogenetic and haplotype network analysis of COI data for C. carduelis, C. caniceps and the origin of study material A neighbor-joining tree, values on the branches shows bootstrap values and, an asterisk indicates Iranian COI sequences from GenBank B distribution range and collection sites for the samples included in the study. Distribution map of C. carduelis and C. caniceps, with green indicating areas where the species is native resident according to bird species distribution maps of the world (https://datazone.birdlife.org); sampling sites are indicated by red triangles C haplotype network, where colors indicate the origin of the haplotypes (orange: western species; green: eastern species) and the number of bars at each branch indicates the number of mutations.
We quantified site-specific coding diversity (i.e., at 0-fold and 4-fold degenerate sites) of each species where we had multiple samples (Suppl. material
Here, we provide a DNA barcode reference library for a substantial dataset of passerine birds in Iran, encompassing the identification of 94 distinct species. This study provides an important foundation for understanding the genetic diversity of Iranian passerine birds through DNA barcoding, with extensive sampling from both eastern and western regions. Our results again demonstrate that DNA barcoding is an effective tool for preliminary biodiversity assessments. No species shared sequences or had overlapping clades with any other species, and every passerine species had distinct COI sequences. The development of this DNA barcode library provides a valuable resource for the biodiversity of passerine birds in Iran and will facilitate future studies on the geographic variation and genetic diversity of passerine birds in this area. Our results generally do not resolve phylogenetic relationships above the generic level for Fringillidae, Emberizidae, and Muscicapidae, at the generic level for Luscinia and Emberiza and, at the species level for L. collurio, all of which exhibited paraphyletic patterns in the phylogenetic analysis (Suppl. material
Several DNA barcoding studies of birds have revealed genetically distinct yet morphologically cryptic species (
Moreover, our NJ tree and haplotype network showed that two individuals identified in NCBI as S. neumayer were located in the eastern clade of S. tephronota. S. tephronota has an ecologically and morphologically similar congener, S. neumayer, which have overlapping distribution ranges in eastern Turkey and Iran (
The C. curruca complex is an intricate model for studying cryptic speciation, presenting challenges in taxonomy due to conflicting morphological and genetic data (
The results revealed a split between two phenotypically different species C. carduelis and C. caniceps. Carduelis carduelis ranges into the Zagros mountains in the west and north of Iran, whereas on the eastern side of its distributional range in Iran, it is replaced by the morphologically divergent species, C. caniceps, which ranges further north into south-central Siberia and northwestern Mongolia (
According to population genetic theory, the effectiveness of selection is more pronounced in species with larger effective population sizes (
The current distribution and genetic makeup of species in Iran reflect its unique biogeographic characteristics (
In this study, the patterns of intraspecific divergence observed in S. tephronota, C. curruca, C. carduelis, and C. caniceps align with Iran’s key zoogeographic boundaries and geographical barriers. These patterns reflect an east-west geographical split within these species that corresponds to the distinct biogeographical realms they inhabit. This highlights the influence of Iran’s transitional geographic position, as well as the role of mountains and refugia, in shaping species differentiation and current genetic patterns. In addition, this east-west genetic distinctiveness in our target taxa is paralleled in several other widespread sister Eurasian passerine taxa that have allopatric populations at their southern range margin in Iran, such as the Eurasian nuthatch, S. europaea (
The present study fills a significant biodiversity knowledge gap in the barcoding data of passerine birds in Iran and demonstrates the utility of standardized DNA-based species delimitation methods in enhancing biodiversity inventories. The observed patterns of intraspecific divergence in S. tephronota, C. curruca, C. carduelis, and C. caniceps align with key zoogeographic boundaries in Iran, reflecting an east-west geographical split within these species. This finding underscores the important role of DNA barcodes in revealing phylogeographical patterns, consistent with previous studies that highlight the effectiveness of DNA barcoding in resolving such patterns (
We gratefully acknowledge Reza Rezaei and Sajad Noori for their assistance in creating the location map for our study and extend our thanks to Justin Wilcox for his support in enhancing the fluency of the English language in this manuscript. We also thank Iran’s Environmental Organization for their help during the field sampling and providing the necessary research permits for this study. Additionally, we sincerely thank Shima Noori for her assistance with bird sampling. We are particularly grateful to R. Rezaei for providing the digital drawings of birds. Anonymous reviewers are acknowledged, whose comments contributed to improvement of the manuscript.
The authors have declared that no competing interests exist.
No ethical statement was reported.
This research was supported by Ferdowsi University of Mashhad research grant INSF 4002006 to Mansour Aliabadian and by grant 3971 from Ferdowsi University of Mashhad awarded to Sahar Javaheri Tehrani. Moreover, this project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme grant agreement No. 947636.
Investigation: SJT, NAK, LN, VE, SK, MK, FA, MA; Resources: SJT, NAK, LN, VE, SK, MK, FA, AS, MA; Formal analysis: SJT, TIG, ER; Writing – original draft: SJT; Funding acquisition: MA, SJT, TIG; Project administration: MA; Supervision: MA, TIG; All authors read and approved the final version of the manuscript.
Sahar Javaheri Tehrani https://orcid.org/0000-0002-4824-1837
Elham Rezazadeh https://orcid.org/0000-0003-4935-0522
Leila Nourani https://orcid.org/0000-0001-7932-2480
Asaad Sarshar https://orcid.org/0000-0002-4325-6706
Toni I. Gossmann https://orcid.org/0000-0001-6609-4116
Mansour Aliabadian https://orcid.org/0000-0002-3200-4853
Sequence data and their accession numbers have been deposited in the NCBI database (https://www.ncbi.nlm.nih.gov).
Additional information
Data type: docx
Explanation note: fig. S1. Sampling locations. Colors and the size of circles represents the number of individuals. fig. S2. Bayesian tree of COI sequences from 96 species of Iranian passerine birds. Values at nodes show posterior probabilities; full support is indicated with an asterisk. Species that form reciprocally monophyletic clades have been collapsed. Two species showed deep intraspecific divergence: (a) Curruca curruca and (b) Sitta tephronota; these are marked in blue. Families labeled on the right of the figure. fig. S3. Phylogenetic and haplotype network analysis of COI data for C. curruca. a) Neighbor-joining tree, values on the branches shows bootstrap values and, our COI sequences are indicated in red. b) Haplotype network, where colors indicate the origin of the haplotypes (Orange: western population; Green: eastern populations) and the number of bars at each branch indicates the number of mutations. table S1. List of all Iranian passerine birds that have been sequenced in this study, with voucher numbers and collection localities. Coordinates are given in decimal degrees. table S2. Comparisons of K2P-pairwise distances within species. Distances are calculated for Iranian passerine birds for which two or more sequences were available; species including one individual are not calculated (n/c); three challenging taxa indicated bold and highlighted grey; distances are expressed in percentages. table S3. K2P distances (%) for the populations of S. tephronota and S. neumayer (COI seq), below the diagonal between group average. table S4. K2P distances (%) for the populations of Curruca curruca (COI seq), below the diagonal between group average. table S5. K2P distances (%) for the populations of Carduelis carduelis (COI seq), below the diagonal between group average. table S6. Genetic diversity at coding sites. Non-synonymous nucleotide diversity πn, number of non-synonymous nucleotide diversity N, synonymous nucleotide diversity πs, number of synonymous nucleotide diversity S.