Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Deep learning and computer vision will transform entomology

View ORCID ProfileToke T. Høye, Johanna Ärje, Kim Bjerge, Oskar L. P. Hansen, Alexandros Iosifidis, Florian Leese, Hjalte M. R. Mann, Kristian Meissner, Claus Melvad, Jenni Raitoharju
doi: https://doi.org/10.1101/2020.07.03.187252
Toke T. Høye
1Department of Bioscience and Arctic Research Centre, Aarhus University, Grenåvej 14, DK-8410 Rønde, Denmark
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Toke T. Høye
  • For correspondence: tth@bios.au.dk
Johanna Ärje
1Department of Bioscience and Arctic Research Centre, Aarhus University, Grenåvej 14, DK-8410 Rønde, Denmark
2Unit of Computing Sciences, Tampere University, Finland
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Kim Bjerge
3School of Engineering, Aarhus University, Finlandsgade 22, 8200 Aarhus N, Denmark
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Oskar L. P. Hansen
1Department of Bioscience and Arctic Research Centre, Aarhus University, Grenåvej 14, DK-8410 Rønde, Denmark
4Natural History Museum Aarhus, Wilhelm Meyers Allé 10, DK-8000 Aarhus C
5Department of Biology – Center for Biodiversity Dynamics in a Changing World (BIOCHANGE), Aarhus University, Ny Munkegade 116, DK-8000 Aarhus C
6Department of Biology - Ecoinformatics and Biodiversity, Aarhus University, Ny Munkegade 116, DK-8000 Aarhus C
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Alexandros Iosifidis
7Department of Engineering, Aarhus University, Denmark
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Florian Leese
8Aquatic Ecosystem Research, University of Duisburg-Essen, 45141 Essen, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Hjalte M. R. Mann
1Department of Bioscience and Arctic Research Centre, Aarhus University, Grenåvej 14, DK-8410 Rønde, Denmark
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Kristian Meissner
9Programme for Environmental Information, Finnish Environment Institute, Jyväskylä, Finland
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Claus Melvad
10School of Engineering, Aarhus University, Inge Lehmannsgade 10, 8000 Aarhus C, Denmark
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jenni Raitoharju
9Programme for Environmental Information, Finnish Environment Institute, Jyväskylä, Finland
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

ABSTRACT

Most animal species on Earth are insects, and recent reports suggest that their abundance is in drastic decline. Although these reports come from a wide range of insect taxa and regions, the evidence to assess the extent of the phenomenon is still sparse. Insect populations are challenging to study and most monitoring methods are labour intensive and inefficient. Advances in computer vision and deep learning provide potential new solutions to this global challenge. Cameras and other sensors that can effectively, continuously, and non-invasively perform entomological observations throughout diurnal and seasonal cycles. The physical appearance of specimens can also be captured by automated imaging in the lab. When trained on these data, deep learning models can provide estimates of insect abundance, biomass, and diversity. Further, deep learning models can quantify variation in phenotypic traits, behaviour, and interactions. Here, we connect recent developments in deep learning and computer vision to the urgent demand for more cost-efficient monitoring of insects and other invertebrates. We present examples of sensor-based monitoring of insects. We show how deep learning tools can be applied to the big data outputs to derive ecological information and discuss the challenges that lie ahead for the implementation of such solutions in entomology. We identify four focal areas, which will facilitate this transformation: 1) Validation of image-based taxonomic identification, 2) generation of sufficient training data, 3) development of public, curated reference databases, and 4) solutions to integrate deep learning and molecular tools.

Significance statement Insect populations are challenging to study, but computer vision and deep learning provide opportunities for continuous and non-invasive monitoring of biodiversity around the clock and over entire seasons. These tools can also facilitate the processing of samples in a laboratory setting. Automated imaging in particular can provide an effective way of identifying and counting specimens to measure abundance. We present examples of sensors and devices of relevance to entomology and show how deep learning tools can convert the big data streams into ecological information. We discuss the challenges that lie ahead and identify four focal areas to make deep learning and computer vision game changers for entomology.

Competing Interest Statement

The authors have declared no competing interest.

  • Glossary

    Bin picking
    an industrial term for robots that pick up one of many objects randomly placed in a container.
    Convolutional Neural Network (CNN)
    a deep learning algorithm in the family of neural networks with serval different layers commonly applied for image recognition and classification. A CNN can be trained to recognize various objects and patterns in an image. There are four main different operations in a CNN: convolution, activation functions, sub sampling, and fully connected layer. During training the learnable parameters of each convolutional and fully connected layer are adjusted so the CNN is able to recognize different patterns of the training data and used for final image classification.
    Data augmentation
    a technique that can be used to artificially expand the size of a training dataset by creating modified images with objects of interest for classification.
    Machine learning
    a subset of artificial intelligence associated with creating algorithms that can change themselves without human intervention to get the desired result – by feeding themselves through structured data.
    Deep learning
    a subset of machine learning where algorithms are created and function similarly to machine learning, but where there are many levels of these algorithms, each providing a different interpretation of the data it conveys.
    DNA barcoding
    Identification of a species using a short, standardised gene fragment.
    Initialization
    description of an object to be tracked.
    Training data
    classified images (e.g. images of known species identified by experts) that are recorded to train a deep learning model.
    Precision
    the number of true positives divided by the sum of true positives and false positives
    Recall
    also called the true positive rate, is the number of true positives divided by the sum of true positives and false negatives.
    Classification accuracy
    the sum of true positives and true negatives divided by the total number of specimens.
  • Copyright 
    The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.
    Back to top
    PreviousNext
    Posted July 04, 2020.
    Download PDF

    Supplementary Material

    Email

    Thank you for your interest in spreading the word about bioRxiv.

    NOTE: Your email address is requested solely to identify you as the sender of this article.

    Enter multiple addresses on separate lines or separate them with commas.
    Deep learning and computer vision will transform entomology
    (Your Name) has forwarded a page to you from bioRxiv
    (Your Name) thought you would like to see this page from the bioRxiv website.
    CAPTCHA
    This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
    Share
    Deep learning and computer vision will transform entomology
    Toke T. Høye, Johanna Ärje, Kim Bjerge, Oskar L. P. Hansen, Alexandros Iosifidis, Florian Leese, Hjalte M. R. Mann, Kristian Meissner, Claus Melvad, Jenni Raitoharju
    bioRxiv 2020.07.03.187252; doi: https://doi.org/10.1101/2020.07.03.187252
    Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
    Citation Tools
    Deep learning and computer vision will transform entomology
    Toke T. Høye, Johanna Ärje, Kim Bjerge, Oskar L. P. Hansen, Alexandros Iosifidis, Florian Leese, Hjalte M. R. Mann, Kristian Meissner, Claus Melvad, Jenni Raitoharju
    bioRxiv 2020.07.03.187252; doi: https://doi.org/10.1101/2020.07.03.187252

    Citation Manager Formats

    • BibTeX
    • Bookends
    • EasyBib
    • EndNote (tagged)
    • EndNote 8 (xml)
    • Medlars
    • Mendeley
    • Papers
    • RefWorks Tagged
    • Ref Manager
    • RIS
    • Zotero
    • Tweet Widget
    • Facebook Like
    • Google Plus One

    Subject Area

    • Ecology
    Subject Areas
    All Articles
    • Animal Behavior and Cognition (4230)
    • Biochemistry (9123)
    • Bioengineering (6766)
    • Bioinformatics (23968)
    • Biophysics (12109)
    • Cancer Biology (9510)
    • Cell Biology (13753)
    • Clinical Trials (138)
    • Developmental Biology (7623)
    • Ecology (11674)
    • Epidemiology (2066)
    • Evolutionary Biology (15492)
    • Genetics (10631)
    • Genomics (14310)
    • Immunology (9473)
    • Microbiology (22822)
    • Molecular Biology (9086)
    • Neuroscience (48919)
    • Paleontology (355)
    • Pathology (1480)
    • Pharmacology and Toxicology (2566)
    • Physiology (3840)
    • Plant Biology (8322)
    • Scientific Communication and Education (1468)
    • Synthetic Biology (2295)
    • Systems Biology (6180)
    • Zoology (1299)