Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Maximizing citizen scientists’ contribution to automated species recognition

View ORCID ProfileWouter Koch, View ORCID ProfileLaurens Hogeweg, View ORCID ProfileErlend B. Nilsen, View ORCID ProfileAnders G. Finstad
doi: https://doi.org/10.1101/2022.02.17.480847
Wouter Koch
1Department of Natural History, Norwegian University of Science and Technology, Trondheim, Norway
2Norwegian Biodiversity Information Centre, Trondheim, Norway
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Wouter Koch
  • For correspondence: wouter.koch@artsdatabanken.no
Laurens Hogeweg
3Intel Benelux, High Tech Campus 83, 5656 AE Eindhoven, The Netherlands
4Naturalis Biodiversity Center, PO Box 9517, 2300 RA, Leiden, The Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Laurens Hogeweg
Erlend B. Nilsen
5Faculty of Biosciences and Aquaculture, Nord University, Steinkjer, Norway
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Erlend B. Nilsen
Anders G. Finstad
1Department of Natural History, Norwegian University of Science and Technology, Trondheim, Norway
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Anders G. Finstad
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

Abstract

Technological advances and data availability have enabled artificial intelligence-driven tools that can increasingly successfully assist in identifying species from images. Especially within citizen science, an emerging source of information filling the knowledge gaps needed to solve the biodiversity crisis, such tools can allow participants to recognize and report more poorly known species. This can be an important tool in addressing the substantial taxonomic bias in biodiversity data, where broadly recognized, charismatic species are highly overrepresented. Meanwhile, the recognition models are trained using the same biased data, so it is important to consider what additional images are needed to improve recognition models. In this study, we investigated how the amount of training data influenced the performance of species recognition models for various taxa. We utilized a large Citizen Science dataset collected in Norway, where images are added independently from identification. We demonstrate that while adding images of currently under-represented taxa will generally improve recognition models more, there are important deviations from this general pattern. Thus, a more focused prioritization of data collection beyond the basic paradigm that “more is better” is likely to significantly improve species recognition models and advance the representativeness of biodiversity data.

Competing Interest Statement

The authors have declared no competing interest.

Footnotes

  • https://github.com/WouterKoch/citizen_science_VoI

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Back to top
PreviousNext
Posted February 19, 2022.
Download PDF
Data/Code
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Maximizing citizen scientists’ contribution to automated species recognition
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Maximizing citizen scientists’ contribution to automated species recognition
Wouter Koch, Laurens Hogeweg, Erlend B. Nilsen, Anders G. Finstad
bioRxiv 2022.02.17.480847; doi: https://doi.org/10.1101/2022.02.17.480847
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Maximizing citizen scientists’ contribution to automated species recognition
Wouter Koch, Laurens Hogeweg, Erlend B. Nilsen, Anders G. Finstad
bioRxiv 2022.02.17.480847; doi: https://doi.org/10.1101/2022.02.17.480847

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Ecology
Subject Areas
All Articles
  • Animal Behavior and Cognition (4237)
  • Biochemistry (9155)
  • Bioengineering (6797)
  • Bioinformatics (24052)
  • Biophysics (12149)
  • Cancer Biology (9562)
  • Cell Biology (13814)
  • Clinical Trials (138)
  • Developmental Biology (7652)
  • Ecology (11729)
  • Epidemiology (2066)
  • Evolutionary Biology (15534)
  • Genetics (10663)
  • Genomics (14346)
  • Immunology (9502)
  • Microbiology (22876)
  • Molecular Biology (9113)
  • Neuroscience (49080)
  • Paleontology (357)
  • Pathology (1487)
  • Pharmacology and Toxicology (2576)
  • Physiology (3851)
  • Plant Biology (8347)
  • Scientific Communication and Education (1473)
  • Synthetic Biology (2299)
  • Systems Biology (6202)
  • Zoology (1302)