(C) 2010 Lyubomir Penev. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
For reference, use of the paginated PDF or printed version of this article is recommended.
Scholarly publishing and citation practices have developed largely in the absence of versioned documents. The digital age requires new practices to combine the old and the new. We describe how the original published source and a versioned wiki page based on it can be reconciled and combined into a single citation reference. We illustrate the citation mechanism by way of practical examples focusing on journal and wiki publishing of taxon treatments. Specifically, we discuss mechanisms for permanent cross-linking between the static original publication and the dynamic, versioned wiki, as well as for automated export of journal content to the wiki, to reduce the workload on authors, for combining the journal and the wiki citation and for integrating it with the attribution of wiki contributors.
The static character of academic publications inherited
from the era of paper publishing is obviously at odds with the dynamic
and interminable process of taxonomic research (
The idea of a combination between journal-published taxon descriptions and Internet-based updates was first proposed by
In an attempt to reconcile the static character of taxon
descriptions and the need to continuously update them, ZooKeys
published recently the pilot article of Hendriks and Balke (2011),
where the journal description of the new beetle species Neobidessodes darwiniensis was exported manually to a wiki taxon page on species-id.net on the day of publication (http://species-id.net/wiki/Neobidessodes_darwiniensis).
The original journal description contains the link to the wiki page of
the species, while the wiki page points to the journal publication as
original source of information. Both sides will profit from such a
workflow: the wiki version of the description can be further edited and
expanded with new information, providing an opportunity for a
potentially eternal process of improvement and data enrichment; at the
same time, the journal publication validates taxa (re-)descriptions and
provides a permanent publication record. The presence of the originally
published source on the wiki page derived from it and the explicit
requirement to cite both, will increase citation rates of taxonomic
publications (see, e.g.,
The current paper describes an improved method to (1) combine the citation of the original taxon treatment published in a journal with the versioned wiki page of it into a single bibliographic reference; (2) list the names of the contributors to the particular versioned wiki page in the same reference alongside with the date and version number; (3) automatically export taxon treatments and create wiki pages of them simultaneously with the journal publication; (4) provide a permanent cross-linking between the original publication and the respective versioned wiki pages.
The method is demonstrated by way of sample papers (
Currently, wiki pages like other Internet resources are usually cited as URLs, often adding a “date of accession”. Clearly, the latter has little practical meaning if the cited Internet resource does not provide a public version history. On most wiki platforms, however, each separate edit of a wiki page is versioned and time-stamped in a publicly accessible manner. This feature, along with the ease with which edits can be made, is a major factor contributing to the phenomenal success of Wikipedia and the high popularity of wiki environments in general.
Any versioned online source, however, has the problem that two or more possible citations (and corresponding URLs) might exist, that is, the one for the most recent version and the one or more for previous versions in time. For many use cases, it is desirable to link to the most recent, presumably improved or error-corrected URL, and thus this URL is normally used when referring to an article. At arXiv and Nature Precedings, for instance, the generic identifier always points to the latest version (cf. http://arxiv.org/abs/1103.3216 and https://doi.org/10.1038/npre.2009.3267), even though individual versions can technically be cited as easily (cf. http://arxiv.org/abs/1103.3216v2 and https://doi.org/10.1038/npre.2009.3267.4). Both sites explain the versioning in places where few citing authors will look, which makes it difficult to establish a consistent citation pattern for different versions of the same article. Similarly, the “permanent link” available for any version of any page in a standard wiki environment (cf. Fig. 1) is not known to all users who cite wiki articles. Furthermore, attribution to the contributors to a versioned wiki page provides another barrier to a proper citation, because it requires significant wiki experience and substantial work to extract the names of the contributors from a highly edited page.
Citation template for the simultaneous journal and wiki publication of Sinocallipus catba Stoev & Enghoff, 2011 (generic link: http://species-id.net/wiki/Sinocallipus_catba, permanent link of the version depicted in the figure: http://species-id.net/w/index.php?title=Sinocallipus_catba&oldid=4534). The generic link always points to the most recent version of the page, while a permanent link is specific to one particular revision.
Lack of appropriate mechanisms for recognition of wiki
authorship is one of the major reasons for many academics to stay away
from the wiki world (cf.
Thus far, wiki elements in scholarly publishing are rare (for an overview, see
To the best of our knowledge, however, there is no established citation format that combines both an original non-wiki source and the respective wiki page within one and the same citation reference. In our understanding, such a mechanism should credit both the authors of the original publication and the contributors to the respective version of the wiki page. In addition, the reader will always be referred to a defined, numbered and time-stamped version of the wiki page that also links directly to the original published source of the wiki content.
We propose to cite wiki pages based on original scholarly publications according to the following scheme:
<Author 1, Author 2 ..... Author n> (<year>) <Title> <Journal> <IssueNo> <pages> <DOI>. Versioned wiki page: <YEAR-MM-DD>, version <sequential number of revision>, http://species-id.net/wiki/index.php?title=Genus_species&oldid=<sequential number of revision>, contributors (alphabetical order): Contributor 1, Contributor 2 ..... Contributor N.
The citation style is automatically added to the top of each page on Species-ID by means of a template, as illustrated in Fig. 1.
The features of the proposed citation mechanism are outlined below:
1. The citation of both the original publication and the wiki page is embedded in the automated export to the wiki.
2. The names of contributors are automatically extracted from the page history and displayed on the versioned wiki page that a user is viewing. This list includes only users who contributed to the currently displayed version of the wiki page, as well as those who have contributed to all earlier versions.
3. The version number is unique within the respective wiki (it is a consecutive revision number for all revisions of all pages on the same wiki).
4. The date field lists the date of creation of the respective version (the date when the page has been accessed is available separately).
5. The “page history” link allows a quick overview of all revisions made on the paper, usually listing also the respective time stamp and author, along with a short summary of the revision.
6. Furthermore, as a service to the reader, ready-to-copy citations in BibTeX, RIS (Endnote) and wiki (Wikipedia, Citizendium) formats are provided, and supplementary formats can easily be added in.
Furthermore, this way of citation could be applied not only to recent publications but also to historical literature from where original taxa descriptions could be extracted, marked up and exported to the wiki for further updates, as shown in the examples from Plazi (Figs 3, 4).
Automated export to a wikiPensoft routinely publishes XML versions of the journal
papers based on the TaxPub extension of the NLM DTD (National Library of
Medicine’s Document Type Definitions format) (
To automate the export to a wiki environment, in this case Species-ID, we developed a tool, named Pensoft Wiki Convertor (PWC), which converts the XML versions of the papers into MediaWiki-based wiki pages. The PWC also converts the internal structure of taxon treatments (e.g., Type Location, Description, Distribution, Etymology and others) into sections of the wiki page. In addition, it converts reference lists and identification keys, exports taxon images and places them on the wiki page. At the same time, the PWC uses several wiki templates to facilitate the display of recommended citations, taxonomic classifications and other content elements in a way that is consistent across the site and easily editable as well. Several of the hyperlinks present on the ZooKeys article – e.g., ZooBank LSIDs, georeferenced coordinates and others – are also transferred to and displayed on the wiki page.
Conversely, the hyperlinked URL of the generic wiki page for each taxon treatment is published in the original journal publication, right next to the taxon treatment name (Fig. 2).
The original description of Sinocallipus catba Stoev & Enghoff, 2011 displaying the generic URL of the wiki page (http://species-id.net/wiki/Sinocallipus_catba) right below the ZooBank LSID (see arrow).
Additional wiki templates embedded by the PWC provide various links of the taxon name to external biodiversity resources (e.g., GBIF, EOL, NCBI, PubMed, Biodiversity Heritage Library (BHL), ZooBank, the International Plant Name Index (IPNI), Index Fungorum, Tropicos, PLANTS, Wikispecies, Wikipedia and others) harvested “on the fly” through the Pensoft Taxon Profile (PTP) tool.
Once the XML file of a paper is converted into MediaWiki markup, the PWC uses wiki bots to automatically create separate wiki pages for each taxon treatment and to upload the respective content there.
The Species-ID wikiSpecies-ID (http://species-id.net)
is dedicated to collecting and integrating open taxon descriptions and
identification tools for different taxa. The audience addressed are
scientists and naturalists, both amateurs (
A successful wiki platform can exist and develop only if there is an active community contributing to it. Several layers of contributions are welcome to Species-ID:
● Descriptions and identification tools (species treatments, dichotomous, polytomous, multi-access keys, etc.).
● Checking, editing and updating of existing wiki pages.
● Enhancing the access and usability through restructuring, categorizing, semantic wiki information or tools, wiki templates or adding new software extensions.
Species-ID publishes materials under an open content
policy that is compatible with other open content projects such as
Wikipedia or Open educational resources (OER). It does not resort to the
“non-commercial” clause (which is highly problematic and not an open
source license, see
The ability to publish independent research implies the possibility of conflict. Contributions on Species-ID may therefore either be shared, normal pages which are dedicated to neutrality, or authored pages (having the authors’ names in the title), which may support the views of the respective authors in polite discourse.
Export of taxon treatments from legacy literatureMark up and digitization of historical literature is a widely discussed problem in taxonomy (
There are two working groups that already provide methods
and tools for mark up of taxonomy literature: Plazi (based on the
taxonX XML schema, see http://sourceforge.net/projects/taxonx and
Using an exporting tool similar to the PWC, Plazi provided several sample treatments on species-id.net (Appendix 1). In this way, treatments from the historical literature that are available in the Plazi repository (Fig. 3) could be opened up for updating and editing, bearing at the same time the original citation details on the wiki page (Fig. 4).
Treatment of Anochetus boltoni Fisher extracted through XML markup from the original paper of
Wiki page of Anochetus boltoni Fisher (http://species-id.net/wiki/Anochetus_boltoni) exported from the Plazi Treatment Repository to Species-ID.
The present paper describes a workflow that will positively affect the exposition and dissemination of taxonomic information through:
1. Reconciliation of “static” and “dynamic” versions of a published treatment on a dedicated wiki page hosted on species-id.net;
2. Increasing exposition, discovery and linking of published information in an additional and important Internet environment, MediaWiki;
3. Combination of citation for both the original source and derivative wiki pages into a single reference record;
4. Provision of attribution to both the author(s) of the original publication and the contributor(s) to the versioned wiki page;
5. Possibility to update and edit treatments on wiki, which should provide a continuous accumulation of new knowledge;
6. Newly published and legacy treatments will be brought together to a common format for further editing, updates and possibly mashups;
7. Automating the upload to wiki will facilitate a large scale accumulation of treatments on species-id.net.
8. Wiki treatments can easily be transferred to other wikis, e.g., to Wikipedia.
We are convinced that the proposed approach will also positively influence the process of opening up data and knowledge in biodiversity science.
The current implementation is funded in part by the ViBRANT (Virtual Biodiversity Research and Access Network for Taxonomy, www.vbrant.eu) FP7 project. We also thank Konrad Förstner for initial discussions and test code for automated export to MediaWiki.
List of the Species-ID pages generated automatically from the original source for the purposes of the pilot project described here.
Source:
http://species-id.net/wiki/Sinocallipus
http://species-id.net/wiki/Sinocallipus_catba
http://species-id.net/wiki/Sinocallipus_deharvengi
http://species-id.net/wiki/Sinocallipus_jaegeri
http://species-id.net/wiki/Sinocallipus_simplipodicus
http://species-id.net/wiki/Sinocallipus_steineri
http://species-id.net/wiki/Sinocallipus_thai
Source:
http://species-id.net/wiki/Amynthas_phatubensis
http://species-id.net/wiki/Amynthas_tontong
http://species-id.net/wiki/Amynthas_borealis
http://species-id.net/wiki/Amynthas_srinan
Source:
http://species-id.net/wiki/Larsenianthus
http://species-id.net/wiki/Larsenianthus_wardianus
http://species-id.net/wiki/Larsenianthus_careyanus
http://species-id.net/wiki/Larsenianthus_assamensis
http://species-id.net/wiki/Larsenianthus_arunachalensis
Source:
http://species-id.net/wiki/Solanum_kulliwaita
http://species-id.net/wiki/Solanum_dillonii
http://species-id.net/wiki/Solanum_oxapampense
http://species-id.net/wiki/Solanum_verecundum
Source: Plazi Treatment Repository
http://species-id.net/wiki/Nixonia_masneri
http://species-id.net/wiki/Anochetus_boltoni
http://species-id.net/wiki/Formica_herculeana
http://species-id.net/wiki/Camponotus_imitator