Mice use robust and common strategies to discriminate natural scenes

Yiyi Yu; Riichiro Hira; Jeffrey N. Stirman; Waylin Yu; Ikuko T. Smith; Spencer L. Smith

doi:10.1101/156653

Abstract

Mice use vision to navigate and avoid predators in natural environments. However, the spatial resolution of mouse vision is poor compared to primates, and mice lack a fovea. Thus, it is unclear how well mice can discriminate ethologically relevant scenes. Here, we examined natural scene discrimination in mice using an automated touch-screen system. We estimated the discrimination difficulty using the computational metric structural similarity (SSIM), and constructed psychometric curves. However, the performance of each mouse was better predicted by the population mean than SSIM. This high inter-mouse agreement indicates that mice use common and robust strategies to discriminate natural scenes. We tested several other image metrics to find an alternative to SSIM for predicting discrimination performance. We found that a simple, primary visual cortex (V1)-inspired model predicted mouse performance with fidelity approaching the inter-mouse agreement. The model involved convolving the images with Gabor filters, and its performance varied with the orientation of the Gabor filter. This orientation dependence was driven by the stimuli, rather than an innate biological feature. Together, these results indicate that mice are adept at discriminating natural scenes, and their performance is well predicted by simple models of V1 processing.

Introduction

Visual processing of natural scenes is essential for animal survival. Mammalian visual systems including primates and rodents evolved to efficiently process natural stimuli^1-5. Mice use vision to hunt prey⁶, avoid danger^7-9, and navigate^10,11.

A number of studies have characterized the ability of mice to discriminate visual stimuli including gratings^12-14, simple shapes^13-16, and random dot kinematograms¹⁷. However, these results cannot be extrapolated to natural scene discrimination, because visual coding is substantially different between natural images and artificial ones^4,5,18. Moreover, the spatial resolution of mouse vision is orders of magnitude lower than that of primates and carnivorans^12,19-23. Even when natural scenes might be discriminated, individual mice could focus on different regions of the images to discriminate them, and this would lead to high mouse-to-mouse variability. Thus, investigating natural scene discrimination in mice can provide essential information for understanding evolved encoding strategies of mammalian visual systems.

The perception of visual information depends on processing by primary (V1) and higher visual cortical areas^24-28. One prominent feature of V1 neurons is their orientation tuning^29,30. This selectivity can facilitate the sparse coding of natural images by V1 neurons^1-3. Orientation specific features are further transformed and integrated in higher visual areas to extract higher order statistical structures of the image and detect objects^31-33. Thus, the orientation selectivity is a foundation of visual perception. However, it is unclear how orientation features in naturalistic images can contribute to animal behavior.

Here, we developed a natural image discrimination task for freely moving mice using an automated touchscreen-based system. We found that mice successfully and quickly learned to discriminate images of natural scenes, the mouse-to-mouse consistency was high, and their performance could be well predicted by a simple model of V1 encoding.

Results

Mice learned to discriminate natural scenes

We used the automated touchscreen-based system^16,34 that we previously adapted for visual discriminations¹⁷ (Fig. 1a). In the task, mice were presented with two images simultaneously, each in one of two presentation windows on the screen. The mice learned to touch a target image, avoiding a distractor image, to get a reward. Thus, it is a type of two-alternative, forced-choice (2AFC) task. All mice trained in the main experiment (6 of 6) successfully passed the pre-training phases (see Methods), meeting criteria to advance to natural image discrimination (NID) training in 14.5 ± 2.9 days (mean ± S.D.) (Fig. 1b). These mice also readily acquired the NID training task (6 of 6), ultimately discriminating correctly between a natural target image and 10 distractor images on 85% or more of trials (Fig. 1c,d). Two out of six mice were trained for an hour per day, and the other four mice were trained for two hours per day. Once mice performed the NID training task with 85% accuracy for two consecutive days, they moved to the NID testing phase. Mice required fewer training sessions to reach criterion for NID compared to the mice trained in the random dot kinematogram (RDK) task we previously reported¹⁷ (3, 3, 3, 5, 8, 9 days for NID vs. 5, 10, 11, 14, 15, 18 days for RDK; p = 0.0088, t-test).

Fig. 1. Touchscreen-based training system and learning curves

(a)Touchscreens displayed stimuli and registered responses in the behavior training system. Mice learned to touch the target image and avoid a distractor to get a reward. (b)Mice progressed through a series of pretraining phases within about a month (FR, Free Reward; MT, Must Touch; IM, Image; NID, Natural Image Discrimination; phases are explained in Methods). (c) Mice learned to perform the NID task to criterion (correct on 85% of trials) in 10 sessions or less, which corresponded to (d) 12 hours or less of training (data in panels c,d are the same, but are plotted against different units on the X-axis).

The NID testing phase consists of testing blocks and interleaved training blocks (Fig. 2a; see Methods), a strategy we used in our prior work¹⁷. In testing blocks, one of the 12 distractor images were selected for each trial. In the interleaved training blocks, the distractor image was always the same, and was easy to discriminate from the target. The target image was always the same in both block types. The mice had to touch the target image to receive a reward in the interleaved training blocks, otherwise they received a time-out. By contrast, mice received a reward on every trial, when touching either of the images in the testing block. Mice are never cued as to which trial type they are in. We find this approach effective¹⁷, perhaps because the always-rewarded testing blocks prevent the mice from getting too frustrated by the difficult discrimination trials, while the interleaved training blocks keep the mice honest. Five out of six mice performed correctly on > 85% of the interleaved training trials, which indicated that they were performing the task correctly, and they were included for further analysis. By contrast, one mouse (Mouse 6) had a lower correct rate for interleaved training blocks (Supplementary fig.1). This mouse seemed to recognize that it does not have to touch the target image to get rewards during the testing blocks, and it tended to select the left panel. Accordingly, we excluded this mouse, and analyzed the data in the testing blocks from the remaining five mice. We computed the correct trial rate with all five animals for the testing blocks (range: 1920 - 2712 trials, over 4 – 8 sessions per mouse). Repeated trials with the same sets of a target and distractor (range: 60-226 trails per image pair) enabled us to precisely estimate the correct trial rate.

Supplementary Figure S1. History dependency and choice bias

(a) Mouse number 6 tended to select the same side of the touchscreen on subsequent trials, while other mice did not. (b) Mouse number 6 tended to select the left panel, while the other mice were more even in their selections.

Fig. 2. Testing procedure and SSIM index for quantifying expected difficulty

(a) NID testing sessions were composed of two types of blocks, testing blocks and interleaved training blocks. In testing block trials, one of 12 distractor images was presented with a target image. In interleaved training trials, the same distractor image was always presented with a target image. The incorrect choices were only punished (with a time-out and no reward) in the interleaved training trials. (b) Distractor and target images displayed in a trial were compared using structural similarity (SSIM). An SSIM map was obtained for each pair, and averaged yield a scalar SSIM index for the image pair. (c) Twelve distractor images were used, which varied in SSIM index with the target image (see also Supplementary Table 1.). One distractor (ID: 12) was the same as the target image, and served as an internal control.

View this table:

Supplementary Table S1. Images used during NID testing phase

The target image and all distractor images for NID testing block and interleaved training block are listed. The SSIM index and information for the database for each image are also listed.

Behavior performance was predicted by structural similarity between images

To create psychometric curves to analyze mouse performance on this task, we need a metric that corresponds to the difficulty of each discrimination. For example, for discriminating gratings with different orientations, the orientation difference would be the appropriate metric to use. With natural images, the choice of metric is not straightforward, and multiple metrics could suffice. We chose to estimate the image similarities between two simultaneously presented images using the structural similarity (SSIM) index metric. The SSIM indices for all pairs of presented natural stimuli were calculated as reported by Wang et al. 2004 (Ref. ³⁵) (see Methods). SSIM indices have been used to estimate the discriminability of artificial image pairs in prior mouse behavior studies^36,37. In the testing blocks, the SSIM indices for image pairs ranged from 0.074 for the most dissimilar images, to 1 for trials when the same image was displayed on both sides of the screen (Fig. 2b,c; Supplementary table 1,2.). Psychometric curves were plotted using the correct trial rate and SSIM indices for 12 distractor images (Fig. 3a). The performances of the mice were remarkably similar (the thresholds of the psychometric curves were 0.29, 0.30, 0.27, 0.34 and 0.32; Fig. 3b). In total, the SSIM index approximated the correct trial rate, and the threshold was 0.30 ± 0.03 (mean ± S.D.) (Fig. 3c). To quantify the inter-mouse similarity, we computed the coefficient of variation (CV) for the psychometric threshold. The CV for the psychometric threshold in the natural scene discrimination task was 0.089, which is much smaller than the CV in the global motion discrimination task (0.24) that was carried out using the same apparatus¹⁷. Thus, the performance in the natural scene discrimination task is highly reproducible mouse-to-mouse.

View this table:

Supplementary Table S2. Images used during NID training phase

A target image and all distractor images for training phase are listed. The SSIM index and information for the database for each image are also listed.

Fig. 3. Psychometric curves for natural image discrimination

(a) During the testing blocks, mice were presented stimulus pairs of SSIM indices between 0.074 and 1 in random order, allowing us to sample the full range of SSIM indices for each animal. The same pairs were shown repeatedly to generate accurate estimates of the correct rate for each data point. Psychometric curves were fit to the data. The SSIM threshold (vertical dotted line; percent coherence at 70% accuracy) was computed from the fitted curves. (b) The psychometric curves from the five mice were similar. (c) The SSIM thresholds for the five mice were also very similar. (d) Each data point shows mean ± S.E.M. for each distractor image across all five mice. The psychometric curve is for the averaged data (threshold = 0.30). Note that for some distractors (e.g., #5, #9, and #11), the correct rate was not accurately approximated by the psychometric curve.

High inter-mouse agreement

The SSIM index-based psychometric curves fit the data well, but behavioral data for some image pairs deviated from the psychometric curves (Fig. 3d, 4a). Notably, this deviation was not due to outlying data points from a few mice, but instead, data from all mice similarly deviated. In fact, the correct rates for each mouse were highly predictable by the mean correct rate of the other four animals (Fig. 4b). We analyzed the residuals of the fits by computing the root mean squared error (RMSE) between a predictor and the actual data. The RMSE for the simple case where the correct rate for each mouse was predicted by the mean correct rate of the other mice (mean ± S.E.M.: 0.047 ± 0.0046) were significantly smaller than the RMSE values for the SSIM-based psychometric curve fits (mean ± S.E.M.: 0.079 ± 0.0026) (Fig. 4c; p = 0.00053, paired t-test). These results indicate that the mouse visual system uses strategies that are not fully captured by SSIM indices.

Fig. 4. High inter-mouse agreement

(a) To examine the difference between the actual correct rate of each mouse and the predicted correct rate by the SSIM-based psychometric curve for each distractor image, we plotted the residuals. For some images, the mice collectively deviated from the predicted correct rate (ANOVA, F(48, 11) = 17.8, p= 2.3 x 10^-13; t-test, ^*p<0.05^{, **}p<0.01, ^***p<0.001). (b) The correct rates of each mouse were plotted as a function of mean correct rates of the other mice, for each distractor image (for reference, the unity line is shown in black). The correct rate of each mouse can be accurately predicted by that of other mice. (c) The RMSE of the inter-mouse agreement was smaller (i.e., more accurate) than that from SSIM-based psychometric curves (^***p<0.001; paired t-test).

The mouse visual system does not have sufficient acuity with which to distinguish each pixel of the touch screen in this apparatus. One of the highest reported behaviorally-measured acuities in mice is 0.49 cycles per degree (Ref. ¹²), and this corresponds to 2.2 pixels if mice view the screen from just 10 mm. Thus, in this apparatus, mice are discriminating lower resolution representations of the two images. To investigate whether high spatial frequency information was biasing the SSIM-based estimator, we filtered the image with Gaussian filter with standard deviation from 1 to 112 pixels (Fig. 5a, b), and recomputed SSIM indices after Gaussian filtering (fltSSIM). The fltSSIM reached a minimal RMSE when the standard deviation was 4.6 pixels, but the improvement was small. This indicates that high spatial frequency information in the natural images is not responsible for more than a small amount of the RMSE when estimating behavior with SSIM.

Fig. 5. Blurring the images and other attempts to reduce the fit error (RMSE)

Several approaches were investigated to determine whether they could reduce the error of the psychometric curve fit. (a) To determine whether high spatial frequency features of the images (which could be imperceivable to the mice, but still influence SSIM calculations) caused the relatively high root mean squared error (RMSE) values, the images were filtered (blurred) with Gaussian filters of a range of standard deviations (s). The inset shows the filtered target image. (b) Overall, blurring the images only marginally decreased the RMSE, and aggressive blurring led to even higher RMSE. (c) Registration of images prior to SSIM calculations did not improve the psychometric curve fit. (d) An alternative metric, pixel-wise RMSE between distractors and a target image, also did not improve the RMSE of a fit curve. (e) Another alternative metric, pixel-wise cross-correlation between distractors and the target image, also failed to yield low-RMSE curve fits. (f) Finally, calculating the SSIM between distractors and a distractor for the training phase (the “anti-target”), also failed to yield a low RMSE. RMSEs for (c-f) was 0.089, 0.095, 0.11 and 0.10, respectively.

SSIM is commonly used for estimating image similarity, but it is vulnerable to image translation. To robustly estimate the similarity between two images, we translated images so that the mean squared difference between two images was minimized. The SSIM values were then recomputed (SSIM after registration, regSSIM). However, the predictability of regSSIM was comparable to that of the original SSIM calculations, and RMSE still exceeded that of a simple inter-mouse predictor (Fig. 5c). Overall, SSIM analysis does not predict performance on this task as accurately as inter-mouse agreement.

SSIM is a sophisticated measurement with multiple components, and this sophistication could lead to biases that degrade performance as a predictor for mouse behavior on this task. Thus, we turned to two simple metrics to measure the similarity of two images: pixel-wise cross-correlation and RMSE between the two images. These parameters also failed to predict the correct rate as accurately as the mean performance of the other mice could (Fig. 5d,e). Finally, we tested the hypothesis that mice tended to avoid images similar to the distractor of the interleaved training phase (“anti-target”), because mice were only punished when they mistakenly selected interleaved anti-target images during the NID testing. However, recomputed SSIM indices using the anti-target did not yield an improved predictor (Fig. 5f). This is evidence against the hypothesis and indicates that the mice did not have tendency to avoid the “anti-target”. Overall, these results indicate that the mouse visual system uses strategies to judge the similarity between two images that are not captured well by SSIM or other metrics we examined. Therefore, we turned to a neurobiological model-based approach.

V1-inspired model accurately predicts discrimination performance

So far, we have explored image comparison metrics that measure the similarity between two images. These metrics predicted mice behavior, but less accurately than a simple mean of other mice. Moreover, these metrics are not an intuitive model for the mouse visual system. Accordingly, we attempted to predict the discrimination performance with a model based on basic features of neuronal selectivity in mouse V1. The receptive fields of simple neurons in mouse V1 can be modeled as Gabor filters^38,39. Each Gabor filter is convolved with a small patch of the image, and the result represents the degree of which that patch of the image and the Gabor filter are matched in orientation and wavelength (Fig. 6a). We convolved the target and distractor images with individual Gabor filters of various orientations and wavelengths, and calculated the similarity between the target and distractor images after filtering. We refer to this similarity as the orientation specific similarity (OSS) (Fig. 6b), since it is a function of the orientation of the Gabor filter (as well as wavelength). We plotted psychometric curves as the correct fraction versus OSS (Fig. 6c). We obtained RMSE values for these curves, for each set of wavelength and orientation parameters of the Gabor filters (Fig. 6e,f). The RMSE of the OSS model averaged over all orientations reached a minimal value when the wavelength of Gabor filter is approximately 7.07 pixels (Fig. 6d).

Fig. 6. A simple Gabor filter model predicts mouse performance

(a) For a simple model the responses of V1 neurons, Gabor filters with various orientations and wavelengths were used to convolve the stimulus images. (b) The orientation specific similarity (OSS) between the target image and a distractor image was calculated by convolving a single Gabor filter with each image and then comparing the two results pixel-wise. The correct rate from the behavior data was then compared with these OSS values for each distractor image (compared to the target). (c) Example fits for the average correct rate across all mice against OSS values show that a filter with θ = 30°, λ = 10 (left) provided a good fit (RMSE = 0.055); and a filter with θ = 90°, λ = 10 (right) provided a poor fit (RMSE = 0.13). (d) Gabor filter patches with relatively short wavelengths provided lower RMSE fits to the behavior data (mean and S.E.M. are plotted; based on an average OSS over all orientations), and this suggests that relatively high spatial frequency information in the images is used by the mice for this discrimination. (e) The orientation of the Gabor patch used for calculating OSS influenced the resulting RMSE (p = 9 ×10^-11, ANOVA; patches had wavelength = 10;mean and S.E.M. are plotted; blue curve is a sinusoidal fit). (f) An examination of the interaction between wavelength and orientation revealed that RMSE of the OSS-based fits depended more on orientation than wavelength of the Gabor filters.

The prediction RMSE of OSS varied by orientation (Fig. 6e, ANOVA p = 9.0 × 10^-11). When OSS analysis used horizontally oriented Gabor filter, they performed poorly at predicting the correct trial rate, regardless of the spatial scale of the Gabor filters. By contrast, OSS analysis using a near vertically oriented Gabor filter predicted the animal behavior more accurately than the SSIM model. When OSS-based predictions were averaged over all orientations (using the optimal wavelength) the RMSE is similar to that of the SSIM model (using the optimal Gaussian filter, or blur) (RMSE of OSS, 0.082 ± 0.003 (λ = 7.07); RMSE of SSIM, 0.074 ± 0.003; t-test, p=0.08). The orientation bias in OSS-based prediction accuracy was consistent over a wide range of Gabor filter wavelengths (Fig. 6f). These results suggest that mice could use orientation specific features in the naturalistic image discrimination task.

The orientation specific prediction accuracy could result from an intrinsic bias of orientation selectivity in the mouse visual system^40,41, or be acquired through the learning of specific images. To investigate these possibilities, we trained a new cohort of mice using the same set of images but rotated by 90° (n = 4 mice; Fig. 7a). If the orientation bias was intrinsic, then the orientation dependence of the OSS-based prediction should be identical to that obtained in the main experiment (Fig. 6e). If instead, the bias was learned based on the stimuli, then the orientation dependence of the OSS-based prediction should be rotated by 90°. All mice trained in the additional experiments (four of four) successfully passed the pre-training phases, just as the mice in the main experiments had. In the NID testing phase, we found that the orientation bias was shifted by ~90° (Fig. 7b). These results indicate that the orientation-dependence of the OSS-based prediction is not a result of a static innate orientation bias in the mouse visual system. Instead, the results suggest that the mouse visual system can learn to extract specific orientation information based on the natural images used in this behavior assay.

Fig. 7. Experiments with rotated images showed that the orientation bias is not innate

(a) To examine whether the orientation-dependence of the OSS-based predictions was stimulus-based or innate, a separate cohort of mice was trained and tested on rotated versions of the same images. (b) The prediction error (RMSE) from the OSS-based model in the original experiment (black data points and blue sinusoidal curve fit; plotted against the bottom axis) was similar to the prediction error (RMSE) from the rotated images experiment (grey data points and grey curve; plotted against the top axis) after shifting the rotated image data by exactly 90 degrees (all OSS data used patches with wavelength = 10). Note that the bottom axis (for the main experiment) and the top axis (for the rotated images experiment) are offset by 90 degrees. This shows that the orientation specific prediction accuracy also shifted by 90 degrees.

Discussion

Here we investigated the ability of mice to discriminate natural scenes, using an automated touchscreen-based system. We found that mice learned to discriminate natural scenes quickly, and that psychometric functions based on SSIM indices fit the data well. Further investigation revealed that the behavioral performance was highly consistent mouse-to-mouse, and deviated in significant and reproducible ways from the predictions from the SSIM-based psychometric curves. Thus, mice discriminate natural scenes using robust and common strategies, even when the images are displayed artificially, using an LCD monitor. To improve on the SSIM-based psychometric curves, we searched for a parameter or model that more accurately predicted the performance of the mice on this task. We found that a simple, V1-inspired model provides a prediction whose accuracy can approach that of the inter-mouse agreement. Thus, V1 processing may partly explain the way in which the mouse vision discriminates natural images.

Natural scenes were compared using the SSIM index, which is commonly used to estimate the similarity of two images³⁵. SSIM indices correlated with the correct rate of the mice in our task. However, the psychometric curves plotted against SSIM provided only marginal fits to the data. In particular, there were several images that had significantly higher or lower correct rates across all mice than predicted by SSIM (Fig. 4a). In addition, the correct rates of mice were highly correlated, which indicated that the deviations from the SSIM-based psychometric curves were not due to mouse-to-mouse variance, but rather that the mice perceived the similarity of the images in ways not captured by SSIM.

Reducing the spatial resolution (i.e., blurring) of the images to better match mouse vision did not substantially improve the fits of SSIM-based psychometric functions, nor did several other approaches that we explored. Instead, we found that a Gabor filter-based approach provided the best fit. Notably, the improvement was specific to the orientation angle of the Gabor patch used. Horizontal orientation filtering did not provide a good fit, but a peri-vertical one did. This orientation bias of the Gabor filter-based approach was not due to innate properties of the mouse visual system. We know this because we tested a separate cohort of mice on the same images rotated by 90 degrees, and the orientation bias of the Gabor filter-based approach was rotated by the same amount. These results indicate that mice could extract orientation specific information depending on the task context.

In mammals, the orientation information itself could be modulated depending on the current demands. For instance, when humans attend to a specific orientation, there is increased activity in the portions of visual cortex that preferred the attended orientation⁴². In monkey V1, orientation-selective neurons fire at higher rates when attention is focused in the neurons’ receptive fields⁴³. Also, the direction and orientation specificity of mouse V1 neurons increased during learning of a visual discrimination tasks when the preferred direction of the neuron was task relevant^44,45. Thus, the orientation specific enhancement of visual processing appears to be a common feature of mammalian visual systems, and enhanced visual processing for specific orientations might explain the results from the present study. These changes in neural responses based on orientation could arise through learning.

Visual processing tasks can be categorized into scene/object recognition and motion/location recognition, which are represented along different “streams” or subnetworks of cortical visual areas^28,46,47. In that context, this NID task complements the random dot kinematogram task we recently developed¹⁷. These experimental paradigms can be used to investigate stream-specific higher visual processing of mice^25,26,28,48.

Methods

Subjects

Ten adult C57BL/6 mice (two males and four females in main experiments and two males and two females in additional experiments) were used in the experiments reported here. Animals were between 80 and 160 days old at the start of training, which lasted for approximately one month. Mice were housed in rooms on a reversed light-dark cycle (dark during the day, room light on at night), and training and testing were performed during the dark cycle of the mice. All training and procedures were reviewed and approved by the Institutional Animal Care and Use Committee of the University of North Carolina, and performed on campus, which is accredited by the Association for Assessment and Accreditation of Laboratory Animal Care (#329).

Apparatus

The operant chamber and controlling devices were the same as previously described¹⁷, and are based on work by Saksida and Bussey^15,16,34. Briefly, a touchscreen panel was on the long size of a trapezoidal chamber, opposite of liquid reward port (a strawberry-flavored yogurt-based drink; Kefir). Correct responses were indicated by a brief auditory tone, and the reward port was illuminated. During time-outs, a brief burst of white auditory noise was played, and the chamber light was illuminated for the duration of the time-out.

Stimuli

For the NID task, we selected pictures of natural scenes (430 by 430 pixels static images, JPEG format). The original set of images was taken from three naturalistic image databases:UPenn(http://tofu.psych.upenn.edu/~upennidb/)⁴⁹,McGill(http://tabby.vision.mcgill.ca/)⁵⁰, and MIT (http://cvcl.mit.edu/database.htm)⁵¹. Average luminance of images was adjusted to be similar. One image was used as the target image during NID training sessions and NID testing sessions. Ten images that have low SSIM with the target image, and eleven images with varied SSIM (plus one image that is the same as the target image) were selected as distractor images on the NID training and NID testing sessions, respectively (Supplementary table 1,2). All images were masked by a circular aperture. During NID testing sessions, 5 interleaved training trials were provided after every 12 testing trials. One image with low SSIM (of the eleven distractors) was used as the distractor for the interleaved training.

Behavioral Training

Food restriction and training stages 1-3 (FR, free reward; MT, must touch; IM image discrimination) for the 2AFC task were conducted as previously described¹⁷. Briefly, during the FR phase (training stage 1), mice learned to lick the reward spout to receive a liquid reward. As a result, mice associated the tone with the delivery of a reward, and learned the location of the reward. During the MT phase (training stage 2), mice had to touch any location on the screen at the front of the box to receive a reward. The goal of this phase was to associate touching the screen with delivery of a reward. During the IM phase (training stage 3), a simple black-and-white dot and fan image pair stimulus¹⁶ was presented, and mice were required to touch a specific target stimulus on the screen to earn a reward. On the NID training phase (training stage 4), mice had to touch the target image, avoiding one of 10 distractors. SSIM indices of all pairs were less than 0.2 (low SSIM implies that the two images are not similar). This stage of training incorporated correction trials as described previously¹⁷.

Behavioral Testing

The NID testing condition was similar to the testing condition for kinematogram discriminations we previously described¹⁷. Images for the testing phase consisted of one target image and 12 distractor images whose SSIM indices ranged from 0.074 to 1. Testing sessions consisted of interleaved blocks of testing and training. Mice were not cued as to whether they were in a testing or training block. During testing blocks, all 12 distractors (12 trials per block) were presented in a random order, and all answers were rewarded. During training blocks (5 trials per block), stimuli with SSIM = 0.14 (i.e., very dissimilar to the target) were presented, and normal performance feedback (including time-out periods and correction trials) were provided. Performance during these interleaved training blocks served as an internal control to ensure the mice were still working to distinguish the stimuli in the testing blocks. Testing data was analyzed only if the animal performed on average at criteria (≥ 85% correct) during the interleaved training blocks. This criterion excluded one mouse out of the six mice in the main experiment. Testing sessions were conducted for 120 - 150 minutes per day. In additional experiments, the target and distractor images were rotated only in the NID training and testing sessions. All four mice generated testing data in the additional experiments.

SSIM (Structural similarity)

SSIM indices for all pairs of presented natural stimuli were calculated as reported by Wang et al 2004 (Ref. ³⁵). First, the pair of two images were smoothed with Gaussian filter (σ= 1.5 pixels). SSIM index was obtained for each window of a pair of images. First, the SSIM for each pixel was calculated for square windows, centered at the same pixel (w,h) of two images. The length of a side of the square window was eight pixels. The SSIM (x,y) was then obtained for two windows in a target image (x) and a distractor image (y) as follows, where μ_x, μ_y are pixel intensity averages of window x and y, σ_x, σ_y, σ_xy are standard deviation of window x and y, and covariance of two windows, c₁ and c₂ are (0.01 x L)2 and (0.03 x L)², respectively, where L is 255 (corresponding to the dynamic range of 8-bit monochrome images).

SSIM(x,y) was obtained for all (w,h) and was averaged over the circular area where the natural images are located. The average of SSIM(x,y) was called simply the “SSIM index” for each pair of target and distractor images in this paper (Fig.2b). One distractor was the same as a target image, so that its SSIM index = 1. The other image pairs had SSIM indices < 1.

Psychometric curve

Psychometric curves were obtained with a regression of the following function (Weibull function) to the data, where α and β are parameters determined by the regression to each data set. The threshold was taken as the SSIM value (from this function) that corresponded to 70%accuracy.

Image processing

Pixel-wise correlation and root mean squared error (RMSE) between a target image and each distractor image were obtained as candidate parameters that may capture the similarity of two images. The registration of two images were done with a MSE based registration algorithm (Turboreg)⁵².

Gabor filter

We used Gabor filtering to analyze the orientation specific features of the images:

The Gabor filter bank was generated using the built-in Matlab function “gabor”. The wavelength (λ) of the Gabor filters ranged from 5 – 28 pixels per cycle. The orientation (θ) of Gabor filters was either 0°, 30°, 60°, 90°, 120°, or 150°. We set the aspect ratio (γ) of all Gabor filters to 0.5. The standard deviation of the Gaussian envelops, σ = determined by the wavelength. We computed the Gabor feature magnitude G_A,0(x, y) by convolving the Gabor filter (with a specific orientation and wavelength) with each image. Then we computed the orientation specific similarity (OSS) between two images by the following function:

Statistics

Student’s t-test was used for statistical comparisons. Pairwise comparisons were two-tailed. Error bars in graphs represent the mean ±S.E.M. unless otherwise noted. Analysis of variance (ANOVA) was used for multiple comparisons, followed by t-test (presented p-values for t-test were Bonferroni corrected). No statistical tests were performed to predetermine sample size. No blinding or randomization were performed.

Acknowledgements

This work was supported by grants to S.L.S. from the Human Frontier Science Program, NIH (R01NS091335 and R01EY024294), the Simons Foundation (SCGB 325407SS), and the Whitehall Foundation. Y.Y. was supported by a Helen Lyng White fellowship, R.H. was supported by fellowships from Naitoh and the Japan Society for the Promotion of Science, and J.N.S. was support by a Career Award from Burroughs-Wellcome.

Footnotes

↵† Co-first authors

References

↵
Olshausen, B. A. & Field, D. J. Emergence of simple-cell receptive field properties by learning a sparse code for natural images. Nature 381, 607 (1996).
OpenUrl CrossRef PubMed Web of Science
↵
Bell, A. J. & Sejnowski, T. J. The “independent components” of natural scenes are edge filters. Vision Res. 37, 3327–3338 (1997).
OpenUrl CrossRef PubMed Web of Science
↵
Itti, L., Koch, C. & Niebur, E. A model of saliency-based visual attention for rapid scene analysis. IEEE Trans. Pattern Anal. Mach. Intell. 20, 1254–1259 (1998).
OpenUrl CrossRef
↵
Vinje, W. E. & Gallant, J. L. Sparse coding and decorrelation in primary visual cortex during natural vision. Science 287, 1273–1276 (2000).
OpenUrl Abstract/FREE Full Text
↵
Froudarakis, E. et al. Population code in mouse V1 facilitates readout of natural scenes through increased sparseness. Nat. Neurosci. 17, 851–857 (2014).
OpenUrl CrossRef PubMed
↵
Hoy, J. L., Yavorska, I., Wehr, M. & Niell, C. M. Vision drives accurate approach behavior during prey capture in laboratory mice. Curr. Biol. 26, 3046–3052 (2016).
OpenUrl CrossRef PubMed
↵
De Franceschi, G., Vivattanasarn, T., Saleem, A. B. & Solomon, S. G. Vision guides selection of freeze or flight defense strategies in mice. Curr. Biol. 26, 2150–2154 (2016).
OpenUrl CrossRef PubMed
Vale, R., Evans, D. A. & Branco, T. Rapid spatial learning controls instinctive defensive behavior in mice. Curr. Biol. 27, 1342–1349 (2017).
OpenUrl
↵
Yilmaz, M. & Meister, M. Rapid innate defensive responses of mice to looming visual stimuli. Curr. Biol. 23, 2011–2015 (2013).
OpenUrl CrossRef PubMed
↵
Harvey, C. D., Collman, F., Dombeck, D. A. & Tank, D. W. Intracellular dynamics of hippocampal place cells during virtual navigation. Nature 461, 941–946 (2009).
OpenUrl CrossRef PubMed Web of Science
↵
Youngstrom, I. A. & Strowbridge, B. W. Visual landmarks facilitate rodent spatial navigation in virtual reality environments. Learn. Mem. 19, 84–90 (2012).
OpenUrl Abstract/FREE Full Text
↵
Prusky, G. T., West, P. W. & Douglas, R. M. Behavioral assessment of visual acuity in mice and rats. Vision Res. 40, 2201–2209 (2000).
OpenUrl CrossRef PubMed Web of Science
↵
Glickfeld, L. L., Histed, M. H. & Maunsell, J. H. Mouse primary visual cortex is used to detect both orientation and contrast changes. J. Neurosci. 33, 19416–19422 (2013).
OpenUrl Abstract/FREE Full Text
↵
Long, M., Jiang, W., Liu, D. & Yao, H. Contrast-dependent orientation discrimination in the mouse. Sci. Rep. 5, doi:1038/srep15830. (2015).
OpenUrl CrossRef
↵
Nithianantharajah, J. et al. Bridging the translational divide: identical cognitive touchscreen testing in mice and humans carrying mutations in a disease-relevant homologous gene. Sci. Rep. 5, doi:10.1038/srep14613. (2015).
OpenUrl CrossRef
↵
Bussey, T. J., Saksida, L. M. & Rothblat, L. A. Discrimination of computer-graphic stimuli by mice: a method for the behavioral characterization of transgenic and gene-knockout models. Behav. Neurosci. 115, 957 (2001).
OpenUrl CrossRef PubMed Web of Science
↵
Stirman, J. N., Townsend, L. B. & Smith, S. L. A touchscreen based global motion perception task for mice. Vision Res. 127, 74–83 (2016).
OpenUrl CrossRef PubMed
↵
Rikhye, R. V. & Sur, M. Spatial correlations in natural scenes modulate response reliability in mouse visual cortex. J. Neurosci. 35, 14661–14680 (2015).
OpenUrl Abstract/FREE Full Text
↵
Dräger, U. C. Receptive fields of single cells and topography in mouse visual cortex. J. Comp. Neurol. 160, 269–289 (1975).
OpenUrl CrossRef PubMed Web of Science
Mangini, N. J. & Pearlman, A. L. Laminar distribution of receptive field properties in the primary visual cortex of the mouse. J. Comp. Neurol. 193, 203–222 (1980).
OpenUrl CrossRef PubMed Web of Science
De Valois, R. L., Morgan, H. & Snodderly, D. M. Psychophysical studies of monkey vision-III. Spatial luminance contrast sensitivity tests of macaque and human observers. Vision Res. 14, 75–81 (1974).
OpenUrl CrossRef PubMed Web of Science
Umino, Y., Solessio, E. & Barlow, R. B. Speed, spatial, and temporal tuning of rod and cone vision in mouse. J. Neurosci. 28, 189–198 (2008).
OpenUrl Abstract/FREE Full Text
↵
Jacobson, S. G., Franklin, K. B. J. & McDonald, W. I. Visual acuity of the cat. Vision Res. 16, 1141–1143 (1976).
OpenUrl CrossRef PubMed Web of Science
↵
Felleman, D. J. & Van Essen, D. C. Distributed hierarchical processing in the primate cerebral cortex. Cereb. Cortex. 1, 1–47 (1991).
OpenUrl CrossRef PubMed Web of Science
↵
Andermann, M. L., Kerlin, A. M., Roumis, D. K., Glickfeld, L. L. & Reid, R. C. Functional specialization of mouse higher visual cortical areas. Neuron 72, 1025–1039 (2011).
OpenUrl CrossRef PubMed Web of Science
↵
Marshel, J. H., Garrett, M. E., Nauhaus, I. & Callaway, E. M. Functional specialization of seven mouse visual cortical areas. Neuron 72, 1040–1054 (2011).
OpenUrl CrossRef PubMed Web of Science
Wang, Q. & Burkhalter, A. Area map of mouse visual cortex. J. Comp. Neurol. 502, 339– 357 (2007).
OpenUrl CrossRef PubMed Web of Science
↵
Smith, I. T., Townsend, L. B., Huh, R., Zhu, H. & Smith, S. L. Stream-dependent development of higher visual cortical areas. Nat. Neurosci. 20, 200–208 (2017).
OpenUrl CrossRef PubMed
↵
Hubel, D. H. & Wiesel, T. N. Receptive fields, binocular interaction and functional architecture in the cat's visual cortex. J. Physiol. 160, 106–154 (1962).
OpenUrl CrossRef PubMed Web of Science
↵
Ohki, K., Chung, S., Ch'ng, Y. H., Kara, P. & Reid, R. C. Functional imaging with cellular resolution reveals precise micro-architecture in visual cortex. Nature 433, 597–603 (2005).
OpenUrl CrossRef PubMed Web of Science
↵
Young, M. P. & Yamane, S. Sparse population coding of faces in the inferotemporal cortex. Science 256, 1327–1331 (1992).
OpenUrl Abstract/FREE Full Text
Quiroga, R. Q., Reddy, L., Kreiman, G., Koch, C. & Fried, I. Invariant visual representation by single neurons in the human brain. Nature 435, 1102–1107 (2005).
OpenUrl CrossRef PubMed Web of Science
↵
Serre, T., Wolf, L., Bileschi, S., Riesenhuber, M. & Poggio, T. Robust object recognition with cortex-like mechanisms. IEEE Trans. Pattern Anal. Mach. Intell. 29, 411–426 (2007).
OpenUrl CrossRef PubMed Web of Science
↵
Bussey, T. J. et al. The touchscreen cognitive testing method for rodents: how to get the best out of your rat. Learn. Mem. 15, 516–523 (2008).
OpenUrl Abstract/FREE Full Text
↵
Wang, Z., Bovik, A. C., Sheikh, H. R. & Simoncelli, E. P. Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process. 13, 600–612 (2004).
OpenUrl CrossRef PubMed Web of Science
↵
Treviño, M. Stimulus similarity determines the prevalence of behavioral laterality in a visual discrimination task for mice. Sci. Rep. 4, doi:10.1038/srep07569. (2014).
OpenUrl CrossRef
↵
Treviño, M. et al. Controlled variations in stimulus similarity during learning determine visual discrimination capacity in freely moving mice. Sci. Rep. 3, doi:10.1038/srep01048 (2013).
OpenUrl CrossRef
↵
Bonin, V., Histed, M. H., Yurgenson, S. & Reid, R. C. Local diversity and fine-scale organization of receptive fields in mouse visual cortex. J. Neurosci. 31, 18506–18521 (2011).
OpenUrl Abstract/FREE Full Text
↵
Jones, J. P. & Palmer, L. A. An evaluation of the two-dimensional Gabor filter model of simple receptive fields in cat striate cortex. J. Neurophysiol. 58, 1233–1258 (1987).
OpenUrl CrossRef PubMed Web of Science
↵
Hagihara, K. M., Murakami, T., Yoshida, T., Tagawa, Y. & Ohki, K. Neuronal activity is not required for the initial formation and maturation of visual selectivity. Nat. Neurosci. 18, 1780–1788 (2015).
OpenUrl CrossRef PubMed
↵
Rochefort, N. L. et al. Development of direction selectivity in mouse cortical neurons. Neuron 71, 425–432 (2011).
OpenUrl CrossRef PubMed Web of Science
↵
Warren, S. G., Yacoub, E. & Ghose, G. M. Featural and temporal attention selectively enhance task-appropriate representations in human primary visual cortex. Nat. Commun. 5, doi:10.1038/ncomms6643. (2014).
OpenUrl CrossRef
↵
McAdams, C. J. & Maunsell, J. H. Effects of attention on orientation-tuning functions of single neurons in macaque cortical area V4. J. Neurosci. 19, 431–441 (1999).
OpenUrl Abstract/FREE Full Text
↵
Goltstein, P. M., Coffey, E. B., Roelfsema, P. R. & Pennartz, C. M. In vivo two-photon Ca2+ imaging reveals selective reward effects on stimulus-specific assemblies in mouse visual cortex. J. Neurosci. 33, 11540–11555 (2013).
OpenUrl Abstract/FREE Full Text
↵
Poort, J. et al. Learning enhances sensory and multiple non-sensory representations in primary visual cortex. Neuron 86, 1478–1490 (2015).
OpenUrl CrossRef PubMed
Goodale, M. A. & Milner, A. D. Separate visual pathways for perception and action. Trends Neurosci. 15, 20–25 (1992).
OpenUrl CrossRef PubMed Web of Science
Wang, Q., Gao, E. & Burkhalter, A. Gateways of ventral and dorsal streams in mouse visual cortex. J. Neurosci. 31, 1905–1918 (2011).
OpenUrl Abstract/FREE Full Text
↵
Stirman, J. N., Smith, I. T., Kudenov, M. W. & Smith, S. L. Wide field-of-view, multi-region, two-photon imaging of neuronal activity in the mammalian brain. Nat. Biotech. (2016).
↵
Tkačik, G. et al. Natural images from the birthplace of the human eye. PLoS One 6, e20409, doi:10.1371/journal.pone.0020409. (2011).
OpenUrl CrossRef PubMed
↵
Olmos, A. & Kingdom, F. A. A. A biologically inspired algorithm for the recovery of shading and reflectance images. Perception 33, 1463–1473 (2004).
OpenUrl CrossRef PubMed Web of Science
↵
Oliva, A. & Torralba, A. Modeling the shape of the scene: A holistic representation of the spatial envelope. Int. J. Comput. Vis. 42, 145–175 (2001).
OpenUrl CrossRef Web of Science
↵
Thevenaz, P., Ruttimann, U. E. & Unser, M. A pyramid approach to subpixel registration based on intensity. IEEE Trans. Image Process. 7, 27–41 (1998).
OpenUrl CrossRef PubMed Web of Science

View the discussion thread.

Posted July 13, 2017.

Download PDF

Citation Tools

Subject Area

Neuroscience

Subject Areas

All Articles

Animal Behavior and Cognition (5213)
Biochemistry (11744)
Bioengineering (8751)
Bioinformatics (29193)
Biophysics (14968)
Cancer Biology (12094)
Cell Biology (17411)
Clinical Trials (138)
Developmental Biology (9421)
Ecology (14178)
Epidemiology (2067)
Evolutionary Biology (18303)
Genetics (12244)
Genomics (16801)
Immunology (11866)
Microbiology (28082)
Molecular Biology (11592)
Neuroscience (60959)
Paleontology (451)
Pathology (1870)
Pharmacology and Toxicology (3238)
Physiology (4957)
Plant Biology (10427)
Scientific Communication and Education (1683)
Synthetic Biology (2885)
Systems Biology (7339)
Zoology (1651)

[1] ↵
Olshausen, B. A. & Field, D. J. Emergence of simple-cell receptive field properties by learning a sparse code for natural images. Nature 381, 607 (1996).
OpenUrl CrossRef PubMed Web of Science

[2] ↵
Bell, A. J. & Sejnowski, T. J. The “independent components” of natural scenes are edge filters. Vision Res. 37, 3327–3338 (1997).
OpenUrl CrossRef PubMed Web of Science

[3] ↵
Itti, L., Koch, C. & Niebur, E. A model of saliency-based visual attention for rapid scene analysis. IEEE Trans. Pattern Anal. Mach. Intell. 20, 1254–1259 (1998).
OpenUrl CrossRef

[4] ↵
Vinje, W. E. & Gallant, J. L. Sparse coding and decorrelation in primary visual cortex during natural vision. Science 287, 1273–1276 (2000).
OpenUrl Abstract/FREE Full Text

[5] ↵
Froudarakis, E. et al. Population code in mouse V1 facilitates readout of natural scenes through increased sparseness. Nat. Neurosci. 17, 851–857 (2014).
OpenUrl CrossRef PubMed

[6] ↵
Hoy, J. L., Yavorska, I., Wehr, M. & Niell, C. M. Vision drives accurate approach behavior during prey capture in laboratory mice. Curr. Biol. 26, 3046–3052 (2016).
OpenUrl CrossRef PubMed

[7] ↵
De Franceschi, G., Vivattanasarn, T., Saleem, A. B. & Solomon, S. G. Vision guides selection of freeze or flight defense strategies in mice. Curr. Biol. 26, 2150–2154 (2016).
OpenUrl CrossRef PubMed

[8] Vale, R., Evans, D. A. & Branco, T. Rapid spatial learning controls instinctive defensive behavior in mice. Curr. Biol. 27, 1342–1349 (2017).
OpenUrl

[9] ↵
Yilmaz, M. & Meister, M. Rapid innate defensive responses of mice to looming visual stimuli. Curr. Biol. 23, 2011–2015 (2013).
OpenUrl CrossRef PubMed

[10] ↵
Harvey, C. D., Collman, F., Dombeck, D. A. & Tank, D. W. Intracellular dynamics of hippocampal place cells during virtual navigation. Nature 461, 941–946 (2009).
OpenUrl CrossRef PubMed Web of Science

[11] ↵
Youngstrom, I. A. & Strowbridge, B. W. Visual landmarks facilitate rodent spatial navigation in virtual reality environments. Learn. Mem. 19, 84–90 (2012).
OpenUrl Abstract/FREE Full Text

[12] ↵
Prusky, G. T., West, P. W. & Douglas, R. M. Behavioral assessment of visual acuity in mice and rats. Vision Res. 40, 2201–2209 (2000).
OpenUrl CrossRef PubMed Web of Science

[13] ↵
Glickfeld, L. L., Histed, M. H. & Maunsell, J. H. Mouse primary visual cortex is used to detect both orientation and contrast changes. J. Neurosci. 33, 19416–19422 (2013).
OpenUrl Abstract/FREE Full Text

[14] ↵
Long, M., Jiang, W., Liu, D. & Yao, H. Contrast-dependent orientation discrimination in the mouse. Sci. Rep. 5, doi:1038/srep15830. (2015).
OpenUrl CrossRef

[15] ↵
Nithianantharajah, J. et al. Bridging the translational divide: identical cognitive touchscreen testing in mice and humans carrying mutations in a disease-relevant homologous gene. Sci. Rep. 5, doi:10.1038/srep14613. (2015).
OpenUrl CrossRef

[16] ↵
Bussey, T. J., Saksida, L. M. & Rothblat, L. A. Discrimination of computer-graphic stimuli by mice: a method for the behavioral characterization of transgenic and gene-knockout models. Behav. Neurosci. 115, 957 (2001).
OpenUrl CrossRef PubMed Web of Science

[17] ↵
Stirman, J. N., Townsend, L. B. & Smith, S. L. A touchscreen based global motion perception task for mice. Vision Res. 127, 74–83 (2016).
OpenUrl CrossRef PubMed

[18] ↵
Rikhye, R. V. & Sur, M. Spatial correlations in natural scenes modulate response reliability in mouse visual cortex. J. Neurosci. 35, 14661–14680 (2015).
OpenUrl Abstract/FREE Full Text

[19] ↵
Dräger, U. C. Receptive fields of single cells and topography in mouse visual cortex. J. Comp. Neurol. 160, 269–289 (1975).
OpenUrl CrossRef PubMed Web of Science

[20] Mangini, N. J. & Pearlman, A. L. Laminar distribution of receptive field properties in the primary visual cortex of the mouse. J. Comp. Neurol. 193, 203–222 (1980).
OpenUrl CrossRef PubMed Web of Science

[21] De Valois, R. L., Morgan, H. & Snodderly, D. M. Psychophysical studies of monkey vision-III. Spatial luminance contrast sensitivity tests of macaque and human observers. Vision Res. 14, 75–81 (1974).
OpenUrl CrossRef PubMed Web of Science

[22] Umino, Y., Solessio, E. & Barlow, R. B. Speed, spatial, and temporal tuning of rod and cone vision in mouse. J. Neurosci. 28, 189–198 (2008).
OpenUrl Abstract/FREE Full Text

[23] ↵
Jacobson, S. G., Franklin, K. B. J. & McDonald, W. I. Visual acuity of the cat. Vision Res. 16, 1141–1143 (1976).
OpenUrl CrossRef PubMed Web of Science

[24] ↵
Felleman, D. J. & Van Essen, D. C. Distributed hierarchical processing in the primate cerebral cortex. Cereb. Cortex. 1, 1–47 (1991).
OpenUrl CrossRef PubMed Web of Science

[25] ↵
Andermann, M. L., Kerlin, A. M., Roumis, D. K., Glickfeld, L. L. & Reid, R. C. Functional specialization of mouse higher visual cortical areas. Neuron 72, 1025–1039 (2011).
OpenUrl CrossRef PubMed Web of Science

[26] ↵
Marshel, J. H., Garrett, M. E., Nauhaus, I. & Callaway, E. M. Functional specialization of seven mouse visual cortical areas. Neuron 72, 1040–1054 (2011).
OpenUrl CrossRef PubMed Web of Science

[27] Wang, Q. & Burkhalter, A. Area map of mouse visual cortex. J. Comp. Neurol. 502, 339– 357 (2007).
OpenUrl CrossRef PubMed Web of Science

[28] ↵
Smith, I. T., Townsend, L. B., Huh, R., Zhu, H. & Smith, S. L. Stream-dependent development of higher visual cortical areas. Nat. Neurosci. 20, 200–208 (2017).
OpenUrl CrossRef PubMed

[29] ↵
Hubel, D. H. & Wiesel, T. N. Receptive fields, binocular interaction and functional architecture in the cat's visual cortex. J. Physiol. 160, 106–154 (1962).
OpenUrl CrossRef PubMed Web of Science

[30] ↵
Ohki, K., Chung, S., Ch'ng, Y. H., Kara, P. & Reid, R. C. Functional imaging with cellular resolution reveals precise micro-architecture in visual cortex. Nature 433, 597–603 (2005).
OpenUrl CrossRef PubMed Web of Science

[31] ↵
Young, M. P. & Yamane, S. Sparse population coding of faces in the inferotemporal cortex. Science 256, 1327–1331 (1992).
OpenUrl Abstract/FREE Full Text

[32] Quiroga, R. Q., Reddy, L., Kreiman, G., Koch, C. & Fried, I. Invariant visual representation by single neurons in the human brain. Nature 435, 1102–1107 (2005).
OpenUrl CrossRef PubMed Web of Science

[33] ↵
Serre, T., Wolf, L., Bileschi, S., Riesenhuber, M. & Poggio, T. Robust object recognition with cortex-like mechanisms. IEEE Trans. Pattern Anal. Mach. Intell. 29, 411–426 (2007).
OpenUrl CrossRef PubMed Web of Science

[34] ↵
Bussey, T. J. et al. The touchscreen cognitive testing method for rodents: how to get the best out of your rat. Learn. Mem. 15, 516–523 (2008).
OpenUrl Abstract/FREE Full Text

[35] ↵
Wang, Z., Bovik, A. C., Sheikh, H. R. & Simoncelli, E. P. Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process. 13, 600–612 (2004).
OpenUrl CrossRef PubMed Web of Science

[36] ↵
Treviño, M. Stimulus similarity determines the prevalence of behavioral laterality in a visual discrimination task for mice. Sci. Rep. 4, doi:10.1038/srep07569. (2014).
OpenUrl CrossRef

[37] ↵
Treviño, M. et al. Controlled variations in stimulus similarity during learning determine visual discrimination capacity in freely moving mice. Sci. Rep. 3, doi:10.1038/srep01048 (2013).
OpenUrl CrossRef

[38] ↵
Bonin, V., Histed, M. H., Yurgenson, S. & Reid, R. C. Local diversity and fine-scale organization of receptive fields in mouse visual cortex. J. Neurosci. 31, 18506–18521 (2011).
OpenUrl Abstract/FREE Full Text

[39] ↵
Jones, J. P. & Palmer, L. A. An evaluation of the two-dimensional Gabor filter model of simple receptive fields in cat striate cortex. J. Neurophysiol. 58, 1233–1258 (1987).
OpenUrl CrossRef PubMed Web of Science

[40] ↵
Hagihara, K. M., Murakami, T., Yoshida, T., Tagawa, Y. & Ohki, K. Neuronal activity is not required for the initial formation and maturation of visual selectivity. Nat. Neurosci. 18, 1780–1788 (2015).
OpenUrl CrossRef PubMed

[41] ↵
Rochefort, N. L. et al. Development of direction selectivity in mouse cortical neurons. Neuron 71, 425–432 (2011).
OpenUrl CrossRef PubMed Web of Science

[42] ↵
Warren, S. G., Yacoub, E. & Ghose, G. M. Featural and temporal attention selectively enhance task-appropriate representations in human primary visual cortex. Nat. Commun. 5, doi:10.1038/ncomms6643. (2014).
OpenUrl CrossRef

[43] ↵
McAdams, C. J. & Maunsell, J. H. Effects of attention on orientation-tuning functions of single neurons in macaque cortical area V4. J. Neurosci. 19, 431–441 (1999).
OpenUrl Abstract/FREE Full Text

[44] ↵
Goltstein, P. M., Coffey, E. B., Roelfsema, P. R. & Pennartz, C. M. In vivo two-photon Ca2+ imaging reveals selective reward effects on stimulus-specific assemblies in mouse visual cortex. J. Neurosci. 33, 11540–11555 (2013).
OpenUrl Abstract/FREE Full Text

[45] ↵
Poort, J. et al. Learning enhances sensory and multiple non-sensory representations in primary visual cortex. Neuron 86, 1478–1490 (2015).
OpenUrl CrossRef PubMed

[46] Goodale, M. A. & Milner, A. D. Separate visual pathways for perception and action. Trends Neurosci. 15, 20–25 (1992).
OpenUrl CrossRef PubMed Web of Science

[47] Wang, Q., Gao, E. & Burkhalter, A. Gateways of ventral and dorsal streams in mouse visual cortex. J. Neurosci. 31, 1905–1918 (2011).
OpenUrl Abstract/FREE Full Text

[48] ↵
Stirman, J. N., Smith, I. T., Kudenov, M. W. & Smith, S. L. Wide field-of-view, multi-region, two-photon imaging of neuronal activity in the mammalian brain. Nat. Biotech. (2016).

[49] ↵
Tkačik, G. et al. Natural images from the birthplace of the human eye. PLoS One 6, e20409, doi:10.1371/journal.pone.0020409. (2011).
OpenUrl CrossRef PubMed

[50] ↵
Olmos, A. & Kingdom, F. A. A. A biologically inspired algorithm for the recovery of shading and reflectance images. Perception 33, 1463–1473 (2004).
OpenUrl CrossRef PubMed Web of Science

[51] ↵
Oliva, A. & Torralba, A. Modeling the shape of the scene: A holistic representation of the spatial envelope. Int. J. Comput. Vis. 42, 145–175 (2001).
OpenUrl CrossRef Web of Science

[52] ↵
Thevenaz, P., Ruttimann, U. E. & Unser, M. A pyramid approach to subpixel registration based on intensity. IEEE Trans. Image Process. 7, 27–41 (1998).
OpenUrl CrossRef PubMed Web of Science