Abstract
Long distance migrations are well known to occur in many baleen whale species. Yet, a global synthesis of this information is lacking. Here, we study baleen whales as a group and at a global scale, first analyzing the grey and peer-reviewed literature for information on the location of baleen whale calving and feeding grounds around the world. This information was then combined with modeled-data produced from an Earth System Model to estimate the global distribution of baleen whale calving and feeding habitats. A simple network theoretic heuristic was then used to identify the shortest over-water path connecting habitats. These shortest paths map well to known major migration routes for a number of species, suggesting that migration has evolved primarily to minimize travel distances. Identifying distance minimizing routes globally, that have demonstrable consistency to known migration routes for certain baleen whale species, offers a useful baseline perspective on large-scale migration patterns, from which many perturbations can by judged. As an example, we used our modeled migration routes to identify regions of the ocean that are likely hotspots of whale ship-strikes. Such information is useful for developing global conservation and management priorities for baleen whales.
Introduction
That baleen whales migrate is well known [1, 2]. One can observe them doing so from the numerous whale watching tours that are available around the world, and scientists have photographed and tagged individual whales, and tracked their travels [3–8]. From such data, and other sources of information, such as historical whaling charts [9], persistent long distance migration routes have been identified around the world, from the Gulf of California, the Hawaiian islands and the China Sea, to the Bering and Chuckchi Seas; from South Africa, Australia, New Zealand, Chile and Argentina, to the Southern Ocean and the Antarctic peninsula; from the Caribbean and the west coast of Africa, to the north Atlantic and Arctic. The list goes on. However, this information is rarely synthesized, with most studies focusing on migration patterns in specific regions and on specific species. Here, we make a global analysis of migration for baleen whales as a group, identifying the factors that likely determine migration patterns throughout the world’s oceans.
Annual migrations are commonly seen in baleen whales of different species, for example in hump-back [10] and gray whales [1], where individuals travel between warm low-latitude breeding grounds, and productive high-latitude feeding grounds [2]. Other baleen species are known to have more diffuse migration patterns, such as blue [5], sei [11] and fin [2] whales, but of those that make latitudinal migrations, they are thought to do so to secure better chances of survival for their calves in warmer low-latitude waters [2, 12], and feed on the abundance of copepods, euphausiids (krill) and small fishes at high latitudes in summer months [13, 14].
Low-to-high-to-low latitudinal migrations have been observed directly for many areas of the world’s oceans (see our Supplementary Information for a bibliography). Yet comparisons between regions are lacking, and a global synthesis of migration routes would develop our understanding of why certain baleen whale species, behave in such a (bioenergetically) expensive manner. Here, we develop a macroecological study, using network theoretic algorithms and output from an Earth System Model (ESM) to infer the location of migration pathways in the global ocean. ESMs have been developed primarily to explain the major biogeochemical cycles of the earth, ultimately giving us a predictive capacity for future climate states [15]. To make these calculations, earth-system scientists include descriptions of phytoplankton and (in some cases) zooplankton dynamics [16]. Here, we use a recent iteration of an ESM [17] that includes “large zooplankton”, which approximates the dynamics of copepods and euphausiids, as well as providing information useful for estimating the abundance of small fishes. This ESM data is interpolated to calving and feeding locations found from a literature review, and then a machine learning algorithm is used to define potential calving and feeding habitats globally. Major migration pathways between modeled calving and feeding habitats are then identified using a modified Dijkstra’s algorithm, which finds the shortest ocean-path between a given calving and feeding location.
Estimating the distribution of baleen whale calving and feeding habitats, and migration routes is not without uncertainty, for it is well known that baleen whale species differ in their use of the ocean. However, the global scale of our analysis offers a course grained view of species-specific information, and allows us to look at how baleen whales as a group, might occupy broad, basin-scale, regions of the global ocean. Furthermore, this large-scale perspective allows us to estimate the impact of humans on baleen whales, and we do so in the context of whale ship-strikes, thought to be a large source of mortality for migrating whales [18, 19]: using our modeled migration routes with cargo ship-track data, we infer hot-spots of potential ship-strikes globally.
Methods
The Earth System Model
The Carbon, Ocean Biogeochemistry and Lower Trophics (COBALT) marine ecosystem model provides the biogeochemical and planktonic food web information that we used to estimate potential whale migration routes [17]. COBALT is run as part of the Modular Ocean Model (MOM) version 4.1 [20], with 60 year simulations (1948-2008) forced by the Common Ocean-Ice Reference Experiment (CORE-II) data set [21]. The horizontal resolution of the simulation is 1° Latitude/Longitude, except along the equator where the resolution is refined to 1/3°, and the model uses 50 vertical layers, with a resolution of 10m over the top 200m.
The representation of planktonic food web dynamics within COBALT is based on body-size, and provides information relevant to our study of baleen whale migration. In particular, medium-sized zoo-plankton are parameterized to represent small and medium copepods (0.2 - 2mm in equivalent spherical diameter or ESD), and large-sized zooplankton are parameterized to represent large copepods and small euphasiids (2 - 20mm ESD). Furthermore, COBALT employs a density dependent mortality term for medium and large zooplankton. This represents the mortality of these zooplankton to higher-predators, namely small fishes. Although COBALT does not model fish explicitly [17], this implicit representation of small-fish feeding rates, has been used to model upper trophic levels explicitly [22] and for this application, provides information on the spatial distribution of a major source of food for certain baleen whale species.
Baleen whale habitats
Data from the COBALT retrospective ocean-ice ecosystem simulation was used in a classification algorithm to identify likely baleen whale calving and feeding habitats globally. These data were chosen for their relevance to baleen whale habitats, and include: monthly COBALT temperature data averaged over the top 200m (°C), medium and large zooplankton (copepods and small euphausiids) biomass integrated over the top 200m (g carbon m−2; our analysis is based on relative difference in zooplankton abundances, so the exact units are unimportant), medium and large zooplankton mortality rates to small fishes integrated over the top 200m (g carbon m−2 day−1), depth (km). All data were taken for the period 1988-2008. Baleen whales generally calve during the winter, and feed during the summer, and hence these data were filtered for these times, producing time average austral winters and boreal summers (june, july, august), and boreal winters and austral summers (november, december, january). For illustrative purposes we show COBALT’s annual average sea-surface temperature (Fig. 1a) and average total abundance of medium and large zooplankton (Fig. 1b).
These time-averaged COBALT data fields were then linearly interpolated in space (the modeled data are on a 1° grid) to feeding and calving locations identified from a literature search: 33 and 36 feeding and calving sites respectively (Fig. 1). See the Supplementary Information for the bibliography of compiled literature, and the precise locations. Using this point-data, a machine learning algorithm - a bagged classification tree - was used to estimate the distribution of feeding and calving habitat at all other locations globally [23]. The algorithm was used on feeding and calving habitats separately. Taking feeding habitat as an example: the bagged classification tree required as input, a categorical response variable - a vector of 1s and 0s relating to those ocean locations where we identified the presence and absence of feeding respectively, from the literature. The other input is a matrix of predictor variables: the interpolated COBALT data at the feeding presence and absence locations. The output of the bagged classification tree, is another vector of 1s and 0s, relating to the classification of all ocean locations as either feeding habitat or not.
Absence locations were created by inspecting the distribution of zooplankton abundances, temperature, depth and irradiance across feeding and calving locations, and then using logical arguments to select ocean locations beyond the tails of these joint distributions. This essentially identifies areas of the ocean where calving and/or feeding does not happen. These are more accurately pseudo-absence locations because they are chosen, rather than identified [24], and for each run of any classification tree, 100 pseudo-absence points were selected randomly in this manner. With this information, the bagged classification tree then predicted the most likely habitat type (calving or feeding, or pseudo-absence) at every COBALT grid-cell, producing global maps of feeding and calving habitat.
Because the pseudo-absence points were chosen randomly, we repeated the classification process two hundred times, building a distribution of likely feeding, calving and pseudo-absence locations. Finally, because our concern is with the major migration routes, we isolated the most likely calving and feeding locations by choosing those areas with a 50% or greater probability of feeding or calving classification. Implicit to the bagged classification process is a k-fold cross-validation test of precision [25]. Here, part of the training data is left out and validated against, with the classification tree made using the remaining data. This process is repeated k times, producing an estimate of the expected classification error.
To assess the accuracy of the feeding and calving habitat classification trees, we compared its results with another set of information gained from the literature - we collected as much information as we could from peer and non-peer reviewed articles and numerous websites (representing the grey literature; see supplementary information) on the global distribution of calving and feeding habitats for a number of baleen whale species. The majority of this information was not geo-referenced in any detail, and as a result we had to visually overlay their maps, and estimate by-hand the distribution of baleen whale habitats and migration routes. This information was then compare qualitatively with the habitat distribution maps produced from the bagged-classficiation tree. This information was also used to divide the global feeding and calving habitat distributions, produced from the classification trees, into geographically consistent regions.
Baleen whale migration pathways
Once global distributions of baleen whale calving and feeding habitats were made using the bagged-classificiation tree, we identified potential migration pathways using a modified Dijkstra’s algorithm [26]. This algorithm calculates the shortest-path between two nodes in a network. Here, we considered each COBALT grid-cell centroid a node in an ocean network. Then the great-circle distance (km) between each grid cell centroid and the centroid of every neighboring cardinal grid cell was calculated. These defined the edges in the ocean network. The result was a highly sparse matrix (or network): each row representing a grid-cell, with eight real-valued distances associated with its cardinal neighbors. Connections to land grid-cells were assigned an infinite distance. Dijkstra’s algorithm was then applied to this ocean network, identifying the shortest over-water or “ocean distance” between two given locations in the sea. However, because the network is constructed over a grid, unrealistic city-block paths were typical solutions. To solve this problem we added small amounts of white noise to the network edges, and applied Dijkstra’s algorithm a number of times (typically 1000), creating a set of candidate shortest paths. From the set of candidate shortest paths we calculated the most likely (or expected) shortest path between a given calving-feeding grid cell pair. Using this information, we identified the shortest path between calving and feeding regions. This approach is still a heuristic, and as a result the shortest paths are not exact solutions. An example shortest path is shown in Fig. 2. Here, numerous candidate paths are identified by the black dots, and the expected shortest path in red. The starting location is off Costa Rica, and the destination location is off Vancouver Island. The shortest path hugs the coast, and resembles the path taken by baleen whale species that migrate in this region (for example, blue whales [5]).
Results
When compared qualitatively to the information gained from the literature (Fig. 3), the habitat classification algorithm identified realistic calving and feeding regions (Fig. 4; green and orange regions respectively; habitat regions are the same across all panels), and average cross-validation error for both calving and feeding trees was satisfyingly low (0.07 and 0.05 respectively after 50 trees; see Supplementary Information Fig. S1). Feeding habitats were identified mostly at high latitude: the Sea of Okhotsk, the Bering sea, the northwest Atlantic, the northeast Atlantic, Greenland, the North Sea, the Barents sea, the Crozet Islands, the south coast of Australia, southern New Zealand, the Patagonia Shelf and the whole of Antarctica. Calving habitats were found predominantly at in shallow, warm, low latitudes regions: Baja California, the off Costa Rica and Ecuador, off northwest Africa below the Cape Verde islands, the Gulf of Guinea, off the west-coast of Madagasca in the Mozambique channel, in the Arabian Sea, the Bay of Bengal, areas off the northwest and northeast coasts of Australia, and Fiji. These feeding and calving distributions map well to areas identified qualitatively from the peer-reviewed and grey literature (Fig. 3).
The partitioning of global habitat into geographically consistent regions, based on qualitative information gained from the literature, is shown in Fig. 4 (regions in different shades of green and orange). For example, the Hawaiian Islands, the Gulf of California, and the eastern tropical Pacific were all divided into separate calving regions. Similarly for the distribution of feeding habitat, areas such as Antarctica, the Bering Sea, and the North Sea are separated into distinct feeding regions. While agreement on a regional scale is good, there are a few noticeable calving regions not seen in the literature (Fig. 3). For example northern New Zealand and the Gulf of Mexico are classified as calving habitat, but there is little evidence for this [27]. Similarly for feeding grounds, the classification tree identified the southern half of New Zealand, but there was little evidence in the literature that this is a major feeding area for baleen whales. However, that is to say there is little evidence that contemporary whale distributions cover these areas [27]. Both are interesting as they constitute potential feeding and calving regions for right whales, species which have been essentially exterminated and may show significant range contraction. There is evidence that Southern New Zealand was a feeding ground and Northern New Zealand was a calving area for Southern Right Whales before they were almost wiped out [28]. Similarly, the Gulf of Mexico may have been an extension of the Florida East Coast calving area for Northern Right Whales, with a few individuals spotted in the GOM in the past several decades. Considering there are only about 400 Northern Right Whales currently, it seem highly plausible that the GOM/Caribbean was an important calving area for them when they numbered in the 10’s of thousands [29].
Globally, the first, second and third shortest paths that minimize over-water distance are shown in Fig. 4a, b & c respectively. The maximum, minimum and mean path lengths (km), for these three sets are: first - 7291, 696, 3037; second - 9469, 2091, 5014; third - 10828, 2414, 6948. Beyond these simple statistics, what is evident are the realistic migration routes. These shortest paths between every calving region and their three nearest feeding regions describe all migration routes identified in the grey literature (Fig. 3). For example, from Hawaii, the three nearest feeding regions are the Sea of Okhotsk, the Bering Sea, and the waters off the northwest US and Canada. From the southeast Caribbean, shortest migration routes connect to the waters of the northeast US and Canada, to Greenland, and to the North Sea. In the southern hemisphere, many calving locations connect to the Southern Ocean and Antarctica, traversing many degrees of latitude, for little change in longitude. There are however shortest paths that traverse several degrees on longitude. These are most commonly the third shortest paths, for example connecting Madagascar and northwest Australia, with the Crozet Islands (Fig. 4c).
Discussion
Baleen whale feeding and calving regions were identified, using peer-reviewed and grey literature information on calving and feeding locations, within a bagged-classification tree. Then, using a modified Dijkstra’s algorithm, we identified paths that minimized the ocean distance between feeding and calving regions. Both the global distribution of feeding and calving habitats and the shortest paths connecting them, map well to the known distributions of these habitats and migration routes for certain species. Our results suggest that baleen whale migration has evolved to minimize ocean-distances between calving and feeding habitats. Other factors, such as predator avoidance (e.g. from killer whales) are likely to play a role too in shaping where baleen whales go, but at smaller spatial scales [12].
This analysis is global in scale, and baleen whales were analyzed as a group. This latter step places constraints on our analysis, as it is known that migration patterns, and behavior in general, can differ greatly between whale-species. This is reflected in the specificity of most whale-movement data, which is typically collected for particular individuals and groups of certain whale species, in specific regions, where focused observational and tagging efforts can give fine-scale knowledge of movements [5, 8]. The global-scale analysis presented herein lacks the detail of regional, species specific studies, however, it provides a coarser global perspective on large-scale movements of baleen whales and the mechanisms underlying them. Indeed, the large scale of our analysis does not preclude species-specific use: different baleen whale species show common reliance on broad ocean areas (at scales: > 1000s km) citeGregr2001, and hence our modeled baleen whale habitat maps and migration routes may capture patterns that are general across species. Furthermore, simply having a global atlas of possible migration routes allows us to gauge how humans may have impacted whale migrations around the world.
As an example, we use our global atlas of baleen whale migration to identify hot-spots of potential ship strikes. Ship strikes are thought to be a large source of mortality for some whale groups [18, 19]. But like assessments of migration, most analyses of ship strikes are at regional scales. In contrast, our work here can help identify global patterns of ship strike potential. Using information from the PASTA MARE project [30], we interpolated normalized shipping density (a score between [0,1]; see Fig. S2 for a detailed description) to locations along the major whale migration pathways, resulting in the identification of potential ship-strike hot-spots (Fig. 5). Japanese waters emerge as having the highest strike risk for a given whale migrating in this area. Secondary hot-spots occur in Indonesian waters, off South Africa, off Puerto Rico and off New Foundland, Canada. Although this brief analysis looks only at ship strike potential along migration routes, ignoring strikes in calving and feeding regions, this information can be useful for developing global conservation and management priorities. Two important caveats to this analysis is that first, these strike risk estimates ignore any regional-scale measures to reduce risk, for example as has been successfully done for the North Atlantic Right Whale [31]. Second, we assume a pristine ocean, for we do not account for the effect of historic whaling on contemporary whale densities. Thus, inferring ship-strike potential in this way is likely inaccurate due to the loss of migration routes/populations from whaling [9]. Indeed, a worthwhile effort is the comparison of contemporary migration routes with our estimates of migration in a pristine ocean. Through this, one can identify (or at least hypothesize for) the loss of particular migration routes.
Looking to the past is important, but equally so is to understand how humans may continue to impact whale migration in the future. Our feeding and calving habitat maps, and migration routes were trained on Earth System Model data, averaged over the period 1988-2008. As a result our modeled distributions of feeding and calving habitats are specific to this period. It is possible too, to use our techniques to project how habitat distributions and migration routes might change in the future. The Earth System Model data used in this analysis, as well as looking back in time in hindcast simulations, also look forward in time and project the distribution of oceanographic variables such as temperature and the abundance of zooplankton [15, 17]. Thus, with this information it is possible, in future work, to estimate how baleen whale migration feeding and calving habitats might move under different future climate scenarios.
To improve the accuracy of these future projections, especially the estimates of migration routes, there are several modifications that could be made to our basic approach. We say basic because our migration route-finding algorithm does not include any biological information. Hence, we cannot answer the question of what condition whale individuals might be in, for example, if calving and feeding habitats move further apart in the future, or if their quality (e.g. the abundance of large zooplankton or krill) might diminish. It is critical to do so because individual condition can have a highly non-linear relationship with travel-distance/time [32], and slight increases in the shortest migration path distance, could be detrimental to whale health, rendering certain migration routes impossible. The experience along a migration route is important too. For example, it is expected that areas of the surface ocean are likely to warm in the coming decades. This may incur a significant bioenergetic cost on migrating whales, as biomass specific respiration rates are positively related to temperature [33]. In other words, baleen whales are likely to expend more energy on their annual migrations in the future, and the consequences of this change are unknown.
In addition, there are several algorithmic modifications to our approach that can be made. First, we have only identified paths that minimize ocean distance. It is possible too, to find paths between two locations is the sea that minimize travel-time. We have done so in our context (Fig. S3), assuming that baleen whales swim with a constant speed of 80 km day−1 [5], and instead of distances as network edges (see methods) we assign travel times, accounting for ocean currents. Applying Dijkstra’s algorithm to this network creates solutions to what is known as the Zermelo navigation problem, that is, finding the path that minimizes travel time between two locations in the sea [34]. The resulting migration routes are shown in Fig. S3, and we find little difference between these and the distance minimizing routes. This is due to the fast swimming speeds of baleen whales. However, solutions to Zermelo’s problem for other species that swim more slowly, for example for marine turtles [34], have identified multiple possible migration routes, depending on the movement rule employed. This brief analysis reveals the numerous ways in which potential migration routes can be identified, and how this information can be used to develop hypotheses about the optimal movement strategies that marine species have evolved.
In summary, we estimated the distribution of baleen whale feeding and calving habitats globally, and the migration routes connecting them. We have shown evidence that baleen whales, as a group, have migration routes that minimize distance (and/or travel time), and we have also discussed that the main utility of our macroecological analyses is in the generation of hypotheses and questions. For example, do baleen whales minimize travel distance or time? Answers to these questions may come from evolutionary agent-based simulations that have been employed in idealized settings to answer similar questions [35]. Indeed, combining these tools with Earth System Model data, will better position us to understand where baleen whales migrate to and from, how they do it, and why.
Whale Migration Websites
http://nationalgeographic.pny.com/Mapshome.aspx?Category=Maps
http://www.apex-environmental.com/
http://www.ecoflores.org/en/marine+conservation/supporting+foreign+ngo/apex/
Acknowledgments
We would like to thank Andre Boustany, Jorge Sarmiento, Simon Levin, Malin Pinsky, Rebecca Asch and Jodi Elliott for their help in preparing this work. JW was part funded by the Nippon Foundation (NF) Nereus Program. JW was also funded by the NSF Coupled Natural-Human Systems grant GEO-1211972. BF was funded by Princeton University’s Grand Challenges program.