Abstract
As biodiversity plummets due to anthropogenic disturbances, the conservation of oceanic species is made harder by limited knowledge of their distributions and migrations. Indeed, tracking species distributions in the open ocean is particularly challenging due to the scarcity of observations and the complex and variable nature of the ocean system. In this study, we propose a new method that leverages deep learning, specifically convolutional neural networks (CNNs), to capture spatial features of environmental variables. This novelty eliminates the need to predefine these features before modelling and creates opportunities to discover unexpected correlations. Our aim is to present the results of the first trial of this method in the open ocean, discuss limitations and provide feedback for future improvements or adjustments.
In this case study, we considered 38 taxa comprising pelagic fishes, elasmobranchs, marine mammals, marine turtles and birds. We trained a model to predict probabilities from the environmental conditions at any specific point in space and time, using species occurrence data from the Global Biodiversity Information Facility (GBIF) and environmental data from various sources. These variables included sea surface temperature, chlorophyll concentration, salinity and fifteen others.
During the testing phase, the model was applied to environmental data at locations where species occurrences were recorded. The classifier accurately predicted the observed taxon as the most likely taxon in 69% of cases and included the observed taxon among the top three most likely predictions in 89% of cases. These findings show the adequacy of deep learning for species distribution modelling in the open ocean.
Additionally, this purely correlative model was then analysed with explicability tools to understand which variables had an influence on the model’s predictions. While variable importance was species-dependent, we identified finite-size Lyapunov exponents (FSLEs), sea surface temperature, pH and salinity as the most influential variables, in that order. These insights can prove valuable for future species-specific ecology studies.
Competing Interest Statement
The authors have declared no competing interest.
Footnotes
This version of the manuscript has been revised to take into account the follow-up comments made by the three PCI Ecology reviewers. In particular, we expanded on the discussion. A detailed change log will be provided on the PCI Ecology website.