Abstract
Most vertebrates use head and eye movements to quickly change gaze orientation and sample different portions of the environment with periods of stable fixation. Visual information must be integrated across several fixations to construct a more complete perspective of the visual environment. In concert with this sampling strategy, neurons adapt to unchanging input to conserve energy and ensure that only novel information from each fixation is processed. We demonstrate how adaptation recovery times and saccade properties interact, and thus shape spatiotemporal tradeoffs observed in the motor and visual systems of different species. These tradeoffs predict that in order to achieve similar visual coverage over time, animals with smaller receptive field sizes require faster saccade rates. Indeed, we find comparable sampling of the visual environment by neuronal populations across mammals when integrating measurements of saccadic behavior with receptive field sizes and V1 neuronal density. We propose that these mammals share a common statistically driven strategy of maintaining coverage of their visual environment over time calibrated to their respective visual system characteristics.
Introduction
A large body of work clearly shows that we move our eyes to align the central portion of our visual field on goal-relevant objects1–5, but even when we are not engaged in a task or even when we are fixating on an object, we incessantly continue to make saccadic eye movements several times a second. Outside of highly controlled experimental setups4, the target of many of our gaze changes are not predictable6. Similar to humans, many other vertebrates also view the world over a sequence of discrete stable fixations generated from coordinated head and eye movements7–9. The distributions of size and frequency of the gaze changes between fixations vary substantially for different mammals, including different non-human primates10. Even during passive viewing and irrespective of the presence of a fovea, gaze changes are informative by overcoming inhomogeneity in the sensory representation of different animals and providing receptive fields a unique and updated perspective at each fixation (Fig. 1a, b). The retina in several animals contains a differential density of photoreceptors11–14 and occlusion by retinal vasculature15. This inhomogeneity persists along visual pathways and is related to cortical magnification factor16,17, acuity18–23, color sensitivity24, and irregular ocular dominance domains25. Additional inhomogeneities emerge within cortex, such as the irregular distribution of orientation representation in mice23 and disparity selectivity in mice and non-human primates26–28. By making gaze changes to cover different regions of the scene, an internal representation may be constructed by integrating novel receptive field information over successive fixations29–34.
a, b, Sequential fixations change receptive field inputs and differences in adaptation recovery time constants allow primates (a) to make more frequent saccades compared to mice (b). c, Top row: example macaque V1 responses to two repeated presentations of the same stimulus38. Black represents neural activity to the first presentation and different shades of blue depict the neural activity after the second presentation in progressively more delayed conditions. Bottom row: example mouse V1 responses to a similar experiment41. See model examples in Supplemental Fig. 1. d, Magnitude of initial model responses for different saccade rates for macaques and mice. For large saccades, the stimulus changed randomly after a saccade, while for small saccades, the stimulus stayed the same after a saccade. e, Saccade rates for a range of recovery time constants that maintain 75% of initial model responses. f, Saccade rate versus size distributions for five species. Each solid point represents the medians of a single subject. Open circles of marmosets43, macaques43, and cats100,101 represent previously published data from other laboratories (see Methods). The large dashed outline represents the central half of the distribution (between the 25th and 75th percentiles) and the small cross in the center represents the standard error of the median for the distribution of all subjects. Due to the large number of samples, the standard error bars are smaller than even the size of the data points. See full distributions in Supplemental Fig. 2. g, Preferred temporal frequency versus receptive field size distributions from previously published data (see Methods)18–21,49–59. The open circle represents the median and the large dashed outline represents the central half of the distribution (between the 25th and 75th percentiles). h, Saccade size increases10 and saccade rate decreases in mice with monocular deprivation during the critical period (N = 4 mice: n = 1448 saccades) compared to control mice (N = 3 mice: n = 1586 saccades).
Our hypothesis is that neural adaptation and gaze changes work together to provide novel receptive field inputs and conserve energy. Neurons quickly adapt their responsiveness to unchanging visual input35, which should lead to the conservation of metabolic energy by reducing overall neural activity36,37. Eye and head movements, which change where a receptive field samples, could be continuous and uniform across the visual scene, but that would be inefficient and prevent sufficient processing that requires a stable view of images9. To conserve energy and allow the nervous system to process visual inputs, gaze changes are discrete and quick only moving on average the minimum distance necessary to provide novel inputs to the receptive fields given natural scene statistics10. This argument successfully accounts for the differences in the distribution of saccade amplitudes across mammals10, but it is not clear if it also explains differences in saccade rates. Since sampling the visual environment depends on both space and time, we expanded our conceptual model to include adaptation dynamics, examined the differences in visual response properties and spatiotemporal saccade statistics across different mammals in new and previously published data, and measured visual coverage across time in these animals.
Results
The adaptation properties of cortical neurons are remarkably distinct across mammals. For both mice and macaques, the responses to brief presentations (<0.5 s) within visual receptive fields decay within a couple hundred milliseconds, but macaque neurons recover their responsiveness very quickly (within a few hundred milliseconds, Fig. 1a, c, blue)35,38,39 while mouse neurons recover much slower (over several seconds, Fig. 1b, c, red)40,41. Recovery time constants have not been explicitly measured in cat neurons, but the available data suggests that they recover with a time constant somewhere in between these two values42. Recovery to adaptation for these animals in these studies were measured for repeated presentations of the same stimuli while the animal was fixating. Saccadic eye movements that decorrelate input across fixations may decrease the effects of neural adaptation10, but the adaptation that persists nonetheless requires time for recovery39. Macaque neurons will still recover much faster than mouse neurons even with large enough saccades.
To quantify how different saccade rates influence adaptation, we constructed a model in which the dynamics of adaptation could be incorporated into the theoretical framework we previously used to explain the relationship between neural adaptation and saccade size10. Responses (rm) of a population of M = 500 neurons to N = 500 stimuli (sn) following sequential saccades were multiplied by a divisive change in gain dependent on the response to the previous stimulus (sn-1):
Parameters of this function (k and average responses) were chosen to approximately match observed adaptive changes in gain in macaques, mice, and cats, where the average gain never reaches zero for even the fastest repeat presentations (see Supplemental Fig. 1 and Methods for details)38–42. We varied the time between stimuli to represent different saccade rates and model responses decayed over time (t) as an exponential decay function with a fixed time constant τdecay. The divisive gain based on the initial response after each saccade recovered over time based on a second exponential decay function:
We varied the time constant (τrecovery) of this function to cover a range that includes experimentally observed adaptation recovery time constants to brief presentations of stimuli in macaques and mice38–41. To mimic the effects of saccades, we repeated this analysis but changed the stimulus. For some conditions, the stimulus changed randomly, thus providing a decorrelated input mimicking large saccades. For other conditions, the stimulus did not change, mimicking small saccades relative to receptive field sizes. As saccade rate increases, neurons have less time to recover from adaptation and the overall responses decline (Fig. 1d). For both animals, large saccades reduced adaptation compared to small saccades yielding comparatively higher responses for faster saccade rates, reflecting a linkage between saccade size and rate (Fig. 1d, solid vs. dashed lines). To illustrate how the recovery time would relate to saccade rate, we plotted the saccade rate that would still maintain 75% of responsiveness against the recovery time constant (Fig. 1e). This simple model predicts a dramatic difference in saccade rate that depends on the adaptive dynamics in visual neurons, and captures the large differences in saccade rates observed between macaques (Fig. 1e, blue point; 3.8 saccades/s)43 and mice (Fig. 1e, red point; 0.6 saccades/s)10.
Our model suggests that there should be a functional link between the spatiotemporal dynamics of the visual and motor systems. To characterize the spatiotemporal tradeoffs made by these two systems, we first recorded and compared the saccade rates and sizes when presenting natural scenes to three different species known to have different visual system properties: humans, marmosets, and mice (Fig. 1f). As demonstrated previously10, there are clear differences in saccade sizes between these species, with all animals exhibiting a skewed distribution in saccade amplitude (Supplemental Fig. 2b). Marmosets have the smallest saccades and mice have the largest ones. For all animals, saccade sizes were slightly larger for larger images (see also ref. 44) (Supplemental Fig. 2b; bootstrapped, p < 0.001 for all comparisons). There are also clear differences in saccade rates between species (Fig. 1f). Marmosets have the highest saccade rates and mice have the lowest saccade rates. For all animals, saccade rates were slightly higher for larger images (see also ref. 44) (Supplemental Fig. 2c; bootstrapped, p < 0.001 for all comparisons). Saccade behavior is similar between animals with only a shift in rates and sizes. Including previously reported data from other animals (open circles for marmosets, macaques, and cats) reveals an inverse relationship between saccade rate and saccade size across distinct mammalian species (Fig. 1f). The saccade size and rate statistics reported in several publications are remarkably similar to our data when the authors employed similar “free viewing” approaches and image size and duration are considered44–48. Humans and all animals were passively viewing natural images of the same or similar size as humans and were prevented from making head movements. One distinction was that mice were allowed to freely run, but their data were similar whether they were running or stationary (except a lower rate when stationary; see ref. 10).
The decorrelation hypothesis predicts that saccade size differences between species are attributed to the properties of their respective visual systems10. Human and macaque receptive fields are small relative to mouse receptive fields, whereas marmoset and cat receptive fields lie between these sizes. Consequently, to achieve decorrelation and sample new information, mice need much larger saccades in order to move their larger receptive fields as compared to humans and the other species. Our model also suggests that saccade rates will be matched to differences in adaptation recovery time constants (Fig 1e). Since adaptation time constants in individual animals can vary depending on experimental conditions and data based on similar conditions is fairly limited across species, we rather considered preferred temporal frequency distributions across several species to provide a more comprehensive view of the relationship between spatial and temporal properties of their visual system neurons. Indeed, temporal frequency tuning has been measured in several animals under very similar experimental conditions (e.g., spike rate responses to sinusoidal luminance gratings; see Methods) in different laboratories49–59. Similar to the saccade statistics, there is an inverse relationship between preferred temporal frequency and receptive field size across these animals (Fig. 1g). Both the oculomotor and visual systems share spatiotemporal tradeoffs across species where those with smaller receptive fields can make smaller saccades, but require more frequent sampling and faster visual processing. Indeed, we find that low luminance contrast of natural images, which it is known to shift temporal frequency tuning lower54,60,61, is associated with a corresponding reduction in saccade rate (Supplemental Fig. 3).
The relationship we uncover between saccade size and rate may result from differences between species other than receptive field sizes. To test whether changes in receptive field size are linked to saccade rate, we reduced the spatial acuity of mice using monocular deprivation during the critical period (age 24-32 days). This manipulation shifted their saccades to larger sizes (Fig 1h, top; see ref. 10) and significantly reduced their saccade rates compared to control mice (Fig 1h, bottom; bootstrapped, P < 0.001). This causal manipulation demonstrates that the inverse relationship found between saccade size and rate is related to visual functional properties such as acuity, and presumably receptive field sizes. A similar change in saccade size and rate is observed in humans on a coarse scale with substantial reduced acuity due to macular degeneration or foveal occlusion62,63 and on a finer scale in subjects with myopia and amblyopia64,65.
Our modeling and empirical data in different species show that visual and motor systems share similar spatiotemporal tradeoffs to sample visual information. We reasoned that such a tradeoff is constrained by a common computational goal: achieving equivalent novel (decorrelated) spatiotemporal coverage of their visual environment. The mouse with both large V1 receptive fields and eye movements would need to make eye movements less frequently compared to the human with both small receptive fields and eye movements in order to achieve similar updated coverage over time.
To assess coverage over time quantitatively, we applied a circular mask equal to the median V1 receptive field size for each animal to every fixation point (Fig. 2a, green circles). We then added up all of those masks (each equal to one for that area) across space and time for all images to generate cumulative spatial maps. Lastly, we divided those maps by the total of the intersaccadic interval times to get a percent coverage per second (Fig. 2a, right). Because coverage is based on area, the mouse achieves much more coverage over time compared to marmosets and humans even with their slower saccade rates (Fig. 2b).
a, Computing average receptive field coverage over time from saccades. b, Horizontal cross sections of receptive field coverage over time for each mammal. Lighter colors are data obtained when subjects were viewing small images. Vertical dashed lines represent image boundaries. c, Relative brain sizes between each mammal. d, Horizontal coverage data from b multiplied by the average two-dimensional V1 cell density for each mammal. Human data are replotted with marmoset and mouse data to facilitate comparison.
One notable exception to the general inverse relationship between saccade rate and size across species is observed for humans, who appear to have lower saccade rates than macaques although they have slightly smaller saccade sizes (Fig. 1f). In addition, if we assume that marmosets make larger saccades with their heads43 with no decrease in saccade rate (see Supplemental videos 1-2 and Supplemental Figs 4-5), they would have higher saccade rates than macaques with larger saccade sizes. This suggests that saccade rates might also depend on species-specific factors other than saccade sizes and receptive field sizes.
We postulated that the main bottleneck constraining coverage over time is the receptive field size and density of the first cortical stage, the primary visual cortex. The animals we are examining have very different densities of V1 cortical neurons per degree of visual angle (dva), and therefore differences in densities of feature preferences66 (Fig. 2c). Humans have many more neurons for each receptive field location representing many more different visual features than mice and mathematically, larger networks can provide greater decorrelation67. We can take our original coverage maps over time (Fig. 2b) and multiply by the two-dimensional (2D) V1 neuronal density for each animal (see Methods for details) to compare receptive field cortical coverage over time between species (Fig. 2d). Because marmosets often use head movements for gaze changes and their eye positions, unlike humans, appear to be restricted to mostly ±10 dva (Fig. 2b, center; see also Supplemental videos 1-2 and Supplemental Figs 4-5 and ref 43), we observe comparable coverage for smaller 20 dva images between humans and marmosets (Fig. 2d, center, light gray versus light cyan). Mice also use head movements for changes in gaze68–70, but their eye positions extend out to at least cover ±20 dva (Fig. 2b, right) so we observe comparable coverage for 40-dva images between humans and mice (Fig. 2d, right, dark gray versus red). These comparisons suggest that saccades for these mammals are decorrelating similar numbers of neurons over time despite having very different saccade rates, saccade sizes, and receptive field sizes. With this sampling strategy, differences in saccade rates and sizes between animals are explained by corresponding differences in receptive field sizes and cortical densities with respect to visual space. If saccade rates and sizes are calibrated to achieve a certain level of visual coverage, this would also explain why saccade rates and sizes change with image size and decrease over viewing time (Supplemental Figs. 2 and 6; see also refs 10,44). As our model predicts, saccade rates are always higher in the condition where it is easier to achieve decorrelation. The oculomotor system indeed seems to favor larger versus smaller saccades, since it is easier and faster to make larger saccades71–73.
Discussion
We propose that gaze changes efficiently sample visual scenes based on natural statistics, neural receptive fields, and adaptation, but this hypothesis makes some important assumptions. First, for the purposes of predicting saccade sizes, we assumed that the receptive field is static and only examined image differences across sequential fixations10. In fact, saccades have complex dynamics over 10’s of milliseconds and the eyes drift and oscillate continuously over 100’s of milliseconds or seconds of fixation. These fast and slow oculomotor dynamics effectively whiten the visual inputs74–76. This whitening is typically outside of the frequency spectrum of the V1 response preferences that we used to predict saccade sizes (Supplemental Fig. 7). Indeed, V1 responses following a saccade are very similar to those evoked by sequentially flashed stimuli77. Nonetheless, saccades can produce an intra-saccadic flow of information that is sensed78,79, can boost post-saccadic visual motion processing80, and have functional consequences for spatial vision81,82. Second, we assumed that the primary visual cortex was the main processing bottleneck that constrains saccade sizes and rates. This choice was partly made because of the large number of comparable data available across multiple species. This assumption does not rule out that differences in receptive fields found earlier, later, or subcortically in the visual system between species contribute to their oculomotor differences. Future work shall include additional processing that occurs during both saccades and fixation and how they influence both cortical and subcortical responses. Another possibility that we did not address in this work is that the oculomotor spatial and temporal species-specific properties have actually contributed to shape the properties of the visual system rather than the oculomotor behavior being the consequence of the visual system characteristics83.
Yarbus illustrated that humans move their eyes deliberately towards objects and features of interest and the overall pattern of those movements can heavily depend on the task1. Decades of research support that the primary purpose of saccadic eye movements is to move an area of interest into the high-spatial acuity portion of the visual field1–5. Even animals with no fovea, such as mice, use gaze changes to move objects of interest into the central portion of their upper visual field69,84 where binocularity and spatial features are represented better22,23,27. Our hypothesis is complementary to foveation and does not attempt to predict saccade targets instantaneously. We use decorrelation as a measurement to quantify a statistical sampling strategy for natural scenes over a long time scale of several fixations to update an internal visual representation. It is important to point out that this sampling strategy can depend on and adapt to environmental and task demands as well. We see this when varying image contrast and size (Supplemental Figs. 3 and 6; see also ref. 44), but previous studies have also shown predictable changes in coverage (size/rate) statistics using images with biased spatial frequency content85, tasks with high acuity demands86, or tasks with high cognitive demands45,48. Previous work examining transsaccadic integration also shows that information from past fixations influence perception during the current fixation. If that past information about a particular visual feature was more reliable than the current information, it has a stronger influence on perception32,33,87,88. Finally, a higher rate of saccades improves performance in detecting changes within a scene89 highlighting the importance of maintaining sufficient coverage. Overall, our analyses illustrate that the oculomotor and visual systems of multiple mammals coordinate to sample the environment efficiently under a diverse range of processing constraints.
Materials and Methods
Adaptation model
For the model, we chose parameters for equation 2 that best matched the average response dynamics (decaying to 30% of the initial response) to brief sequential presentations reported for adaptation experiments in macaques and mice38–42. The decay time constant τdecay for all simulations was 60 ms and the recovery time constant τrecovery varied between 100 and 900 ms. The initial non-adapted firing rate was 50 spikes/s and the constant k for the divisive gain function was 30.
Human participants
Five participants of the Institute des Neurosciences de la Timone (22-29 years old, 2 females) took part in the free viewing tasks. Experiments were approved by the Ethics Committee of Aix-Marseille University and conducted in accordance with the Declaration of Helsinki. All participants gave written informed consent.
Mice
Experiments and procedures were performed on eight C57bl6 adult female and male mice (RRID: JAX: 00664), one female and two male PV-Cre mice90, two female PV-Cre;Ai14 mice90, and two female and four male PV-Cre;ChR2 mice91 (all from the C57bl6 line of mice). Mice were group housed and maintained on a 12-hour light/dark cycle under standard housing conditions. All procedures and care were performed in accordance with the guidelines of the Institutional Animal Care and Use Committees at the University of Texas at Austin.
Marmosets
Seven marmosets (1.5-4 years, 2 females) were obtained from TxBiomed and bred in-house and were group-housed. Food and water were provided ad libitum. All procedures and care were performed in accordance with the guidelines of the Institutional Animal Care and Use Committees at the University of Texas at Austin.
Human saccades
For free viewing experiments, participants sat in a quiet and dimly illuminated room, with their head positioned on a chin and forehead rest. The experiments were controlled by a HP Z420 (Hewlett-Packard, Palo Alto, CA, USA) computer equipped with an Nvidia Quadro 600 video card (Nvidia, Santa Clara, CA, USA). Binocular eye position was recorded using an Eyelink 1000 tower mount (SR Research, Osgoode, ON, Canada) at a sampling rate of 2 kHz (1 kHz per eye). The experimental software controlling the display as well as the eye tracking was implemented in Matlab (The MathWorks, Natick, MA, USA), using the Psychophysics92,93 and Eyelink toolboxes94. Stimuli were presented at a viewing distance of 60 cm on a 32-inch Display++ LCD monitor (Cambridge Research System, Rochester, UK) with a spatial resolution of 1920 by 1080 pixels and a vertical refresh rate of 120 Hz. Participants’ gaze position was calibrated with a 13-point custom calibration sequence with sequences validated at the beginning of an experimental session as well as when necessary.
Participants took part in free viewing tasks composed each of 40 runs of eight trials each. These runs were completed in four experimental sessions (on different days) of about 60 min each (including breaks). Participants were instructed to freely explore a set of 80 images displayed at the center of a gray background screen. These pictures were presented on each trial for a duration of 30 seconds and separated each by two seconds of blank gray screen. Pictures were selected from the natural images from the McGill Calibrated Color Image Database95. For the first 20 runs, participants randomly explored the screen with either the 80 full-sized selected images covering 40 by 32 dva of the visual field or the same 80 images center-cropped covering 20 by 16 dva of the visual field. For the next 20 runs, each participant explored 40 images selected randomly from the same set of 80 images, but with their luminance contrast scaled down uniformly at either 5 or 10% of the original contrast. These images covered 40 by 32 dva of the visual field during the first 10 runs and 20 by 16 dva of the visual field in the last 10 runs. Data from images of both sizes were combined for contrast-related results.
For all experiments, saccades were detected as previously described for marmosets with a velocity threshold10.
Marmoset saccades
Marmoset eye position was tracked as previously described using an EyeLink 1000 (SR Research, Osgoode, ON, Canada) camera to capture binocular eye position at sampling rate of 1 kHz (500 Hz for each eye)10,27 and saccades were detected with a velocity threshold10. Twenty-four small (20 by 16 dva) and 24 large (40 by 32 dva) images from the McGill Calibrated Color Image Database95 were presented per session on a CRT monitor (85 Hz refresh rate) 50 cm away randomly interleaved them with unrelated 1-s fixation trials using Maestro software suite (https://sites.google.com/a/srscicomp.com/maestro/). In separate sessions, the images were presented with their luminance contrast scaled down uniformly at either 5 or 10% of the original contrast. Data from images of both sizes were combined for contrast-related results.
Marmosets had higher saccade rates than expected with respect to their temporal frequency relationship to the other animals (Fig. 2f-g). Previously, we found that marmosets have smaller saccade sizes than we predicted based on their receptive field sizes10. We attributed this mismatch to marmosets normally using saccadic head movements to make rapid changes in gaze43. To probe this idea further, head and eye position were measured in marmosets that were not head restrained using DeepLabCut96 to track pupils and the center of the upper forehead in 400 by 400 pixel images collected at 60 frames per second (see Supplemental video 1).
Calibration for these results was done on head restrained marmosets using the fixation methods described previously43,97 (see Supplemental video 2). Saccade statistics based on DeepLabCut of posted marmosets were not distinguishable from those measured with the EyeLink system (Supplemental Fig 2). For analysis of unposted marmosets, we only looked at continuous segments when the gaze was towards the monitor (within ±50 dva from the monitor center).
Marmosets clearly used combined head and eye movements for nearly all changes in gaze, which always produced larger gaze changes than with the eyes alone (Supplemental Fig 3a-c). We did not observe obvious differences in the gaze change rates (Supplemental Fig 3d).
Interestingly, marmosets also cock their heads frequently and rapidly, especially when presented with novel objects98. This might increase their ability to decorrelate orientation tuned receptive field responses and allow them to sample visual information at a faster rate than with saccades alone (shift the curve in Fig. 1d to the right).
For comparison to our data, the two open circles (median size and rate for each subject) in Fig. 2f were reformatted from figures from a previous publication and the two marmosets were viewing 44 by 34 dva images displayed for 20 s of natural images including humans, macaques, and marmosets43.
Mouse saccades
For image size experiments, saccade statistics are from four C57bl6 and two PV-Cre;Ai14 female mice from a previously reported study10. These mice were trained for a binocular discrimination task27 and 8-12 small (40 by 32 dva) and 8-12 large (80 by 64 dva) images/per session were presented before or after 2-3 training sessions. Images from the McGill Calibrated Color Image Database95 were presented onto a screen 22 cm away with a DLP LED projector. Binocular eye movements were tracked using custom software to analyze 250 by 250-pixel images collected at 20-30 frames per second with cameras aligned with the orbital axis of each eye. Since mice have a much higher contrast threshold and saturation point than primates, for image contrast experiments, we had them view natural images scaled down to 25 and 50% of the original contrast. This increase for mice compared to primates roughly corresponds to the ratio of increase in contrast thresholds for mice compared to humans99. For three male and one female C57bl6 mice, images were scaled to modulate at a ratio of 0.25 or 0.5 of their original contrast modulation. Experiments were run in the same manner as the previous study except four images each of 25, 50, and 100% contrast were randomly interleaved for two sessions. Mice were first trained to discriminate black from white or crossed versus uncrossed disparities from dynamic random dot stimuli by licking left or right. Natural images were shown before or after a training session when mice reached consistent performance above 75% correct for any task. Eyes were tracked in the same manner as previously described except that DeepLabCut96 was used to track pupil position. Saccades were detected as previously described with a velocity threshold and comparing detection for both eyes10.
For monocular deprivation (MD) experiments100, one eye in each mouse (two female and two male PV-Cre;ChR2) was sutured closed at P24 under anesthesia and sutures were removed at P33. Three age-matched PV-Cre mice (one female and two males) were used as controls. Acuity was assessed and saccades for all mice viewing natural images were measured at P4010.
Macaque saccades
Macaque data in Fig. 3A were reformatted from figures from a previous publication (median size and rate for each subject) and the two macaques were viewing 44 by 34 dva images displayed for 20 s of natural images including humans, macaques, and marmosets43. The median and 25th and 75th percentile was also extracted from the aggregate distribution of both subjects.
Cat saccades
Cat saccade size statistics (median, 25th percentile quartile, and 75th percentile quartile of reported sizes) were reformatted from figures from previous publications where the cats were either free viewing in the dark101 or viewing a 21-min natural video of unknown size102. Cat saccade rate statistics (median, 25th percentile quartile, and 75th percentile quartile of intersaccadic intervals) were extracted from horizontal and vertical traces of unpublished data of a head-fixed cat watching a 30-s video of animals at the zoo (unknown size, but smaller than the 60 by 40 dva screen) provided by Theodore Weyand at Louisana State University School of Medicine. Saccades were detected as previously described for mice and marmosets with a velocity threshold. Dr. Weyand observed that cats made saccades more often for the movie compared to dots or a blank screen (unpublished observations).
Receptive field size and temporal frequency data
All receptive field size distributions were extrapolated from data from previously published studies that measured the areas or widths of the minimum response field from electrophysiological recordings for cats, marmosets, and macaques, and GCaMP fluorescence thresholded at half the maximum response for mice. Receptive field (RF) size distributions for cats18–20, marmosets17, and macaques17,21 were generated by combining receptive field size versus eccentricity data fits with cortical magnification data fits, and for mice, we used the reported distribution103 (see reference 10 for more details).
All preferred temporal frequency distributions were aggregated from previously published electrophysiology studies that were based on peak responses in tuning curves generated from average spike rates measured from single neurons in anesthetized animals. For all studies, responses for each temporal frequency were measured using drifting sinusoid gratings centered on the receptive field of single neurons using gratings with the preferred orientation and spatial frequency of the neuron. All spikes were measured extracellularly for single units. All studies used single insulated tungsten electrodes, except a silicon microprobe with 16 recording sites was used to record spikes in mice for one study55. Only summary statistics from the microprobe study were used to validate our aggregate statistics from other studies. For macaques, peak temporal frequencies for 130 single neurons from ref. 50 were combined with fitted peak temporal frequencies for 75 single neurons from ref. 51 to generate an aggregate distribution of peak temporal frequencies. This distribution was consistent with summary statistics reported in additional macaque studies using similar methods53,54,57. For cats, peak temporal frequencies for 36 single neurons from ref. 49 were combined with fitted peak temporal frequencies for 72 single neurons from ref. 52 to generate an aggregate distribution of peak temporal frequencies. For mice, fitted peak temporal frequencies for 192 single neurons from ref. 56 were combined with fitted peak temporal frequencies for 69 single neurons from ref. 57 to generate an aggregate distribution of peak temporal frequencies. This distribution was consistent with summary statistics reported in an additional mouse study using similar methods55. From the complete aggregate distributions for each animal, we measured the median and 25th and 75th quartiles.
Cortical density
Cortical density per dva was estimated by taking the total number of V1 neurons and computing what the total 2D representation of neurons would be based on the surface area and depth of V1, and then dividing that by the total visual area in degrees. We estimated total numbers of neurons using the volume of V1 in mm3 times a density of 150,000 neurons/mm3 because this density has been a relatively consistent value reported in V1 for chimpanzees104, macaques105, marmosets106, and mice107. For humans, we used 1470 mm2 for surface area108 and 2.5 mm for depth109 to get 211 neurons/dva2. For marmosets, we used 200 mm2 for surface area110 and 1.1 mm for depth111 to get 28.7 neurons/dva2. For mice, we used 3 mm2 for surface area112 and 0.8 mm for depth109 to get 0.43 neurons/dva2.
Statistics and reproducibility
All statistical tests were nonparametric based on the median and error bars were based on bootstrap analysis of the median by resampled data 1000 times, allowing repeats, to produce surrogate datasets of the same size. The 160th and 840th samples were used for the standard error of the median for all results. For data we collected on humans, marmosets, and mice, we present the entire distribution of all subjects grouped or single subject medians as single solid data points, an outline connecting the standard error of the median of grouped data across rate and size distributions, and an outline connecting the 25th and 75th quartiles of grouped data across rate and size distributions. For receptive field, temporal frequency and saccade data published previously by other authors, we present the median of grouped data or medians of single subjects as open circles and a dashed outline connecting the 25th and 75th quartiles of grouped data across y- and x-axis distributions. For statistical tests, bootstrapped data for one set of observations were used to find the percentile equal to the median of the compared set of observations. If the compared median was completely outside of the bootstrapped dataset, we reported that as P < 0.001.
Data and code availability
The data presented in the figures and the model code used to generate the figures are available at Figshare: https://figshare.com/projects/Mammals_achieve_common_neural_coverage_of_visual_scenes_using_distinct_sampling_behaviors/136912
Author Contributions
J.M.S., M.S., A.M., G.S.M., and N.J.P. designed research; J.M.S., M.S., C.B. and A.M. performed research; J.M.S., M.S., C.B. and A.M. analyzed data; and J.M.S. wrote the original draft; J.M.S., M.S., A.M., G.S.M., and N.J.P. edited the paper.
Competing Interest Statement
No competing interests.
Classification
Biological Sciences: Neuroscience
Acknowledgments
We thank Allison Laudano for animal care and Jagruti Pattadkal for helping set up all marmoset experiments. We thank Theodore Weyand for sharing cat data and helpful discussions. This research was funded by ANR-20-NEUC-0002 (G.M.) and NIH-R01-NS120562 (N.P.).
Footnotes
The text and figures were revised to improve clarity and better support the interpretations.
References
- 1.↵
- 2.↵
- 3.
- 4.↵
- 5.↵
- 6.↵
- 7.↵
- 8.
- 9.↵
- 10.↵
- 11.↵
- 12.
- 13.
- 14.↵
- 15.↵
- 16.↵
- 17.↵
- 18.↵
- 19.
- 20.↵
- 21.↵
- 22.↵
- 23.↵
- 24.↵
- 25.↵
- 26.↵
- 27.↵
- 28.↵
- 29.↵
- 30.
- 31.
- 32.↵
- 33.↵
- 34.↵
- 35.↵
- 36.↵
- 37.↵
- 38.↵
- 39.↵
- 40.↵
- 41.↵
- 42.↵
- 43.↵
- 44.↵
- 45.↵
- 46.
- 47.
- 48.↵
- 49.↵
- 50.
- 51.
- 52.
- 53.↵
- 54.↵
- 55.↵
- 56.
- 57.↵
- 58.
- 59.↵
- 60.↵
- 61.↵
- 62.↵
- 63.↵
- 64.↵
- 65.↵
- 66.↵
- 67.↵
- 68.↵
- 69.↵
- 70.↵
- 71.↵
- 72.
- 73.↵
- 74.↵
- 75.
- 76.↵
- 77.↵
- 78.↵
- 79.↵
- 80.↵
- 81.↵
- 82.↵
- 83.↵
- 84.↵
- 85.↵
- 86.↵
- 87.↵
- 88.↵
- 89.↵
- 90.↵
- 91.↵
- 92.↵
- 93.↵
- 94.↵
- 95.↵
- 96.↵
- 97.↵
- 98.↵
- 99.↵
- 100.↵
- 101.↵
- 102.↵
- 103.↵
- 104.↵
- 105.↵
- 106.↵
- 107.↵
- 108.↵
- 109.↵
- 110.
- 111.
- 112.