Research Article |
Corresponding author: Dirk Steinke ( dsteinke@uoguelph.ca ) Academic editor: Pavel Stoev
© 2024 Dirk Steinke, Jaclyn T. A. McKeown, Allison Zyba, Joschka McLeod, Corey Feng, Paul D. N. Hebert.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Steinke D, McKeown JTA, Zyba A, McLeod J, Feng C, Hebert PDN (2024) Low-cost, high-volume imaging for entomological digitization. ZooKeys 1206: 315-326. https://doi.org/10.3897/zookeys.1206.123670
|
Large-scale digitization of natural history collections requires automation of image acquisition and processing. Reflecting this fact, various approaches, some highly sophisticated, have been developed to support imaging of museum specimens. However, most of these systems are complex and expensive, restricting their deployment. Here we describe a simple, inexpensive technique for imaging arthropods larger than 5 mm. By mounting a digital SLR camera on a CNC (computer numerical control) motor-drive rig, we created a system that captures high-resolution z-axis stacked images (6960 × 4640 pixels) of 95 specimens in 30 minutes. This system can be assembled inexpensively ($1000 USD without a camera) and it is easy to set-up and maintain. By coupling low cost with high production capacity, it represents a solution for digitizing any natural history collection.
AI, Arthropoda, collections, databases, insects, machine learning, photography
Advances in computational and imaging technologies have stimulated the digitization of specimens in natural history collections (
Projects that seek to digitize entire insect collections require automated image acquisition and processing. Because of the effort in handling individual specimens and the risk of damaging them, some digitization programs have imaged drawers of specimens (
Recently, several approaches have been developed to digitize individual specimens in museum collections (
Optimal high-throughput specimen digitization requires combining technologies in novel workflows and is largely driven by purpose (collection digitization versus one component of a multifaceted workflow). This study introduces an imaging system developed to support the specimen-centric workflow employed by the Centre for Biodiversity Genomics (CBG), Guelph, Canada to gather DNA barcode records. Because images are essential to validate DNA barcodes, the CBG photographs every specimen. Small specimens (<5 mm) are each placed into a well in a 96-well plate and are imaged with a high-resolution automated microscope system (Steinke et al. in prep.) before entering molecular analysis. Larger individuals are pinned, arrayed in Schmidt boxes, and then imaged using the digital SLR camera rig described here. This system is easy to install and was designed to provide high production capacity at low cost for operations ranging from small entomology laboratories to large natural history collections.
The SLR rig (Fig.
The SLR rig is controlled by a program that employs both a python script (Suppl. material
Pinned arthropods are loaded into the SLR rig in batches of 95 after being transferred to a 75 cm by 47 cm foam platform (Fig.
Gphoto2 is used to transfer images to the computer for further processing. Z-stacked images are cropped to standardised dimensions with a 4×3 aspect ratio using a machine-learning-based cropping tool (
Fig.
The CNC motor activates at a pre-set time, which depends upon the distance between each slot and the dwell time (3.75 s) at each stop, allowing the camera to take nine images and transfer them to the computer. Operating in this mode, the SLR rig images 95 specimens in 30 min and the stacking software requires another 11.5 min to process these images, but this usually occurs while the next batch is being photographed. The transfer of pinned specimens to and off the foam platform takes about 15 min each and is done while the SLR rig is running another batch. The time required to crop and edit each batch varies (15–30 min) with the type of specimens. Operated by one staff member, the SLR rig can image 4000 specimens in a week. The CBG’s system has now imaged more than 250,000 specimens and the sole maintenance involved the replacement of a wire leading to one motor.
The SLR rig cost of $4500 USD reflects three main components: 1) CNC machine kit ($1000), 2) Apple computer ($1000), and 3) Canon 90D including lens ($2500). Costs can be reduced by replacing the computer with a raspberry pi ($100), but under heavy usage (40 h a week), it will need replacement every six months. Less expensive cameras can be used if they can be controlled with gphoto2. They do have a lower resolution (12–20 megapixels) than the Canon 90D (32.5 megapixels), but this resolution is adequate for many applications. However, it is important to select a camera with a depth stacking function such as focus bracketing (e.g., Canon PowerShot G7 X Mark II, $900; Olympus OM-D E-M1 Mark II & III, $920 for body). By careful selection of components, the overall cost can be reduced to about $2000 (using the CNC kit, a raspberry pi, and a low budget point and shoot camera with depth stacking). The light setup can also vary in cost. The Neewer 24×24 Softbox pair used in our study costs about $150 but it can be replaced by LED strips (~$20) attached to the inner part of the CNC frame. Plastic components for the OpenBuilds Acro 1010 are freely available as 3D models so users can modify and 3D-print custom components if such a system is available. One modification made to our SLR rig was the addition of bumpers and a triangular structure to improve wire management during operation (Suppl. materials
The SLR rig can image a wide variety of specimens by adjusting settings as described in this section. The distance between the camera and the specimen dictates the size of the image (focal distance from base = 16 cm). The frame size varies by 0.5 cm in both directions depending upon the depth of the focus point determined by the auto-focus program. This limits the size range of specimens which can be imaged (5–45 mm). As each slot on the platform is designed to fit the camera frame, no specimen should overlap an adjacent space because the cropping tool is likely to malfunction. However, larger specimens can be imaged if the distance between the camera and the specimen is increased as this enlarges the size of the frame. Conversely, reducing the camera-specimen distance decreases the size of the frame, allowing smaller specimens (down to 2 mm using the 60 mm macro lens in our setup) to be imaged. Any change in the camera’s operating height is difficult with the described setup as it requires remounting the camera at a higher or lower position on its mount or the exchange of the legs mounted to the rig frame. Future optimization could incorporate legs capable of height adjustment.
Background colour and light settings can also be modified to improve image quality. Dark backgrounds improve the contrast for dark specimens, helping to highlight otherwise subtle features and also help to contrast pale specimens that blend into a white background. To make this adjustment, a second platform can be made of dark foam, or dark strips can be temporarily added to the existing platform. As lighting and whitening settings on the camera must be adjusted to accommodate the change in background colour, all 95 slots must have the same background.
The number of images taken of each specimen can be adjusted with the depth stack function on the camera. Increasing the image count expands the depth range in focus, but increases the time required to capture photographs and to process them in Helicon Focus. The dwell time of the CNC motor system would need to be extended to allow more images to be taken before the camera moves. Conversely, imaging and processing times can be reduced by reducing the number of images taken per specimen. Experimentation with sets of specimens in the target group is the best way to optimize the number of images taken.
Although this CBG’s SLR system is primarily used with pinned insects, it is effective in imaging other specimens (e.g., soft-bodied invertebrates in liquid preservatives). In the latter case, the foam platform is simply replaced by a grid structure that holds each specimen vial (
Generating an image with enough resolution to allow species identification can be difficult with any automated system given the manifold differences in shape and size of specimens (
At the CBG, specimens are usually imaged before they are labelled. When labelled specimens are imaged, a small white piece of paper with a slit in the middle is used to cover labels, allowing images of small specimens to remain sharp when cropped. Alternatively, the labels can be removed and reattached to the specimen after photography.
The present SLR rig was designed to photograph terrestrial arthropods that were being analyzed to construct DNA barcode reference libraries. About 90% of these specimens are small enough to be imaged within 96-well plates, but the remainder must be pinned. As the CBG currently barcodes three million specimens annually, it was essential to develop a system capable of imaging the larger specimens in a cost-effective way. This led to the present solution, which can be acquired for $2000–4500 USD depending on the choice of camera and controller and generates almost 200 high-resolution specimen images per hour.
As the CBG’s SLR rig has performed reliably for 2.5 years of heavy use (12h/day), this system is ideal for deployment in settings remote from technical support. Because of its capacity to rapidly generate large numbers of high-quality digital images for online databases, it is also an asset for any large specimen collection.
We thank Suzanne Bateson for helping with graphics.
The authors have declared that no competing interests exist.
No ethical statement was reported.
This study was enabled by awards to PDNH from the Walder Foundation, the Ontario Ministry of Economic Development, Job Creation and Trade, the Canada Foundation for Innovation, and by a grant from the Canada First Research Excellence Fund to the University of Guelph’s “Food From Thought” research program.
Conceptualization: AZ, DS, JTAM. Data curation: CF, JTAM. Formal analysis: DS. Funding acquisition: PDNDH. Investigation: CF, JM. Methodology: AZ, JM, JTAM. Project administration: DS. Supervision: PDNDH. Validation: CF, AZ, JM. Visualization: DS. Writing – original draft: DS. Writing – review and editing: JTAM, PDNDH, CF, JM.
Dirk Steinke https://orcid.org/0000-0002-8992-575X
Jaclyn T. A. McKeown https://orcid.org/0009-0005-7193-2643
Joschka McLeod https://orcid.org/0000-0002-7503-1835
Corey Feng https://orcid.org/0009-0007-7630-7884
Paul D. N. Hebert https://orcid.org/0000-0002-3081-6700
All of the data that support the findings of this study are available in the main text or Supplementary Information.
SLR Rig controlling script
Data type: py
G-code control script
Data type: gcode
Image packaging and upload to BOLD
Data type: py
File for 3D printing of bumpers used in system
Data type: x3d
File for 3D printing of triagular wire management piece
Data type: x3d