SUMMARY
Early retinal circuits divide incoming visual information into functionally opposite elementary signals: On and Off, transient and sustained, chromatic and achromatic. Together these signals can yield an efficient representation of the scene for transmission to the brain via the optic nerve. For example, primate On- and Off-parasol circuits are transient, while On- and Off-midget circuits are sustained. But this long-standing interpretation of retinal function is based on mammals, and it is unclear whether this functional arrangement is common to all vertebrates. Here we show that poultry chicks use a fundamentally different strategy to communicate information from the eye to the brain. Rather than using functionally opposite pairs of retinal output channels, chicks encode the polarity, timing, and spectral composition of visual stimuli in a highly correlated manner: fast achromatic information is encoded by Off-circuits, and slow chromatic information overwhelmingly by On-circuits. Moreover, most retinal output channels combine On- and Off-circuits to simultaneously encode, or multiplex, both achromatic and chromatic information.
Our results from birds conform to evidence from fish, amphibians, and reptiles which retain the full ancestral complement of four spectral types of cone photoreceptors. By contrast, mammals lost two of these cones early in their evolution, and we posit that this loss drove a radical simplification and reorganisation of retinal circuits, while birds and many other extant non-mammalian lineages retain the ancestral strategy for retinal image processing.
HIGHLIGHTS
First large-scale survey of visual functions in an avian retina
Off-circuits are fast and achromatic, On-circuits are slow and chromatic
Most avian RGCs are OnOff and encode both types of information
Colour and greyscale information can be decoded based on the kinetics
INTRODUCTION
Vertebrate retinal circuits process a stream of spatial, temporal, and spectral information into parallel channels for transmission to the brain1. The number of channels, what each channel encodes, and why, is a subject of active debate2, but some general principles have emerged3. For example, that to save energy, and to expand dynamic range, retinal circuits divide the visual signal into an approximately equal number of On-and Off-channels4, and into transient and sustained channels5–7. Information about polarity and kinetics is thereby represented by independent ‘elementary building blocks’ with decorrelated signals5. Similarly, wavelength information is efficiently funnelled into a subset of chromatic channels in parallel to the achromatic channels that dominate the retinal output8,9.
These long-standing principles are overwhelmingly based on research on mammals, such as mice, primates, cats, and rabbits, and evidence from non-mammalian species is beginning to question their generality. For example, zebrafish lack the approximately balanced representations of On- and Off-, and of transient and sustained channels of mammalian retinas10–13. Similarly, the profusion of colour-opponent responses in turtles14, or the heavy dominance of OnOff channels in salamanders15,16, do not easily map onto the current framework. Yet, a systematic overview of retinal functions is lacking for most non-mammalian lineages, and especially for birds17,18. This is in part due to the difficulty of recording from the ex-vivo bird retina19– 21, where minor mechanical insult causes pathological waves of depression22. Consequently, to date insights into the operation of avian retinas are largely19,23 restricted to non-invasive techniques such as electroretinograms24,25.
Working on the retina of poultry chicks17 (Figure 1a-c) we have overcome these limitations by suppressing pathological signal spread during dissection (Supplemental Figure 1). We can record light-driven spiking activity from neurons in the retina’s ganglion cell layer with a high-density multielectrode array (MEA) for several hours (Figure 1d,e), and now present what to our knowledge is the first large-scale study of light-driven retinal function in a bird.
We find that unlike mammals2,26, chicks represent information about the polarity, kinetics, and wavelength in a highly correlated manner, with fast and achromatic Off-circuits, and slow and chromatic On-circuits. Moreover, most retinal outputs combine both On- and Off-circuits to simultaneously inform about both sets of information.
RESULTS
Most recorded cells correspond to retinal ganglion cells
We recorded from n = 17 pieces of dorsal retina (n = 14 animals), yielding n = 3,987 spike sorted cells that passed a minimum response criterion (Methods). To estimate what fraction of these cells stemmed from retinal ganglion cells (RGCs), which are the retina’s output neurons, rather than from displaced amacrine cells (dACs)3, we computed the ‘electrical image’20 (EI) for a subset of spike-sorted cells (Figure 1f-l, Methods, see also Supplemental Video S1). This image permits inferring (i) each cell’s spike initiation zone and soma, (ii) the trajectory of its axon, if present and well-attached (e.g. Figure 1f), (iii) its conduction velocity (Figure 1g), and (iv) whether it displayed saltatory (Figure 1h) or non-saltatory spike propagation (Figures 1f,I, cf. Figure 1j). We reasoned that axon bearing cells projecting towards the optic disc were likely to be RGCs. In contrast, axon-less cells likely included dACs and any RGCs whose axons were not detected (e.g. because they were on the array-edge, or because their axon was far from the retinal surface). No EIs had multiple axons radiating from the centre, as for polyaxonal dACs in mammals27. Accordingly, we took the fraction of EIs with a clear axon (Methods) as the lower bound of RGCs. The reliability of axon-detection scaled with the number of available spikes before reaching an asymptote at ∼87% (Figure 1k), suggesting that at least 87% of recorded cells were RGCs. For these, we then quantified conduction velocities (Methods), which in line with recent work20 varied by a factor of more than four, from <0.5 m/s for the slowest non-myelinated axons, to >2 m/s for the fastest axons (Figure 1l).
A large-scale functional survey of the avian retina
To begin understanding how the chick retina functions, we focussed on temporal and colour vision. Like most birds, the chick eye has seven28 distinct types of photoreceptors (Figure 1a-c). These include rods (RH1), the full complement of four ancestral29,30 single cones (SWS1, SWS2, RH2, LWS, Figure 1a), and the principal and accessory members of the double cone (expressing LWS opsin).
We presented four complementary sets of widefield light-stimuli, which were chosen to characterise achromatic and spectral responses within a limited recording time, and allow direct comparison with other species10,31–33 (Methods):
A series of ‘white steps’ (WS) from dark to light at different contrasts (2 s On, 2 s Off at 100-10% contrast; Figure 1m showing 100% step).
A corresponding series of full contrast ‘coloured steps’ (CS) from dark at different wavelengths. (630, 560, 505, 480, 420, 360 nm, Figure 1n showing the first five, cf. Figure 1b).
A 100% contrast ‘white’ chirp stimulus, exponentially accelerating from 1-30 Hz over 30 s (Figure 1o).
A 20 minute 20 Hz pseudorandom sequence of ‘coloured’ noise (420, 480, 505, 630 nm) to compute spectral kernels, which correspond to the linear component of the impulse response34 (Figure 1p).
Because our multielectrode array is opaque, stimuli were presented from the photoreceptor side, by-passing the spectral filtering by oil droplets that occurs in the intact eye35.
Figures 1m-p show responses of four individual cells to the stimuli, chosen to capture some of the diversity found. Cell 1 responded transiently to light offset (Figure 1m), preferred long-over short-wavelengths (Figure 1n), and had bandpass temporal tuning centred around 10 Hz (Figure 1o). The cell’s spectral kernels were correspondingly narrow, indicating high-frequency temporal tuning, and confirmed the cell’s non-opponent long wavelength preference (Figure 1p). Cells 2 and 3 were both OnOff cells but differed in their kinetics and spectral tuning. Cell 2 was On-sustained but Off-transient, with broad, low-pass tuning to temporal flicker, and its spectral kernels were colour opponent. Cell 3 had transient responses to light onset and offset with a preference for ‘white’ over ‘coloured’ light (compare Figures 1m,n) and low-pass tuning to temporal flicker that did not respond above 5 Hz. Cell 4 responded transiently to light onset with little response to temporal flicker or to spectral noise (Figures 1o,p). Instead, it had selective spectral sensitivity to ‘red’- and ‘yellow’ wavelengths, alongside a relatively weak response to ‘white’ light (Figures 1m,n).
An interactive online data explorer of avian retinal ganglion cell functions
To identify major functional structures across the dataset we clustered responses to the four stimuli using a Mixture of Gaussian model, followed by manual curation (Methods). This yielded n = 36 clusters, of which n = 22 had a minimum of 20 members and were retained for further analysis (n = 3,914 cells, 98.1%). Figure 2a plots a representative selection of these clusters, while all 22 clusters are shown in Extended Data 1. The full dataset, including further analysis, is interactively plotted and available for download at http://chicken-data.retinal-functomics.net/.
To what extent the 22 clusters correspond to anatomically and molecularly distinct types of RGCs and/or dACs is unknown. However, the 41 transcriptomically described RGC types in the adult chicken36 and a similar anatomical RGC diversity17 imply that some of our clusters comprise more than one type. Further probing the response space may well reveal additional functional diversity.
We now describe the basic complement of neural responses in the chick retina. For simplicity, and unless otherwise specified, analysis was performed on the cluster means. Cell-wise summaries of key indices are available in the interactive online data explorer.
Response polarity, kinetics, and colour opponency are linked
We began by exploring the clusters’ light response properties based on the polarity (On, Off, OnOff) and kinetics (‘transient’ or ‘sustained’) of their ‘white’ step responses (Figure 2b, Methods). This revealed surprising trends. First, rather than forming segregated On- and Off-channels, most clusters responded both to light onset and offset. Second, polarity and kinetics were linked: Off-dominated clusters were transient, while On-dominated clusters were sustained. Third, analysis of the spectral kernels (Methods) revealed that colour processing was also linked to polarity: Most OnOff clusters had large-amplitude colour opponent kernels, whereas most Off- or On-dominated clusters were non-opponent or had low-amplitude kernels (Figure 2c). As subset of these latter clusters responded more strongly to ‘coloured’ steps than to the white steps that were used to define their polarity and transience (discussed below) we also computed these measures from their peak responses to the ‘coloured’ stimuli (Supplemental Figure S2a). This allowed characterisation of On-dominated clusters as distinct transient and sustained groups.
Accordingly, our 22 clusters fell into four groups:
Off, transient, non-opponent (C1, n = 180 cells, 4.5%)
OnOff, kinetically mixed, colour-opponent (C2-13, n = 2,688 cells, 67.4%)
On, transient, non-opponent (C14-19, n = 784 cells, 19.7%)
On, sustained, non-opponent (C19-22, n = 262 cells, 6.6%)
The cluster-means shown in Figure 2a illustrate these groups, and their key differences (Supplemental Figure 2b summarises how clusters and response groups are distributed across experimental sessions). The groups and their associated analysis are colour-coded in black, brown, orange, and grey, respectively (e.g. in Figure 2b,c).
OnOff dominance as a hallmark of non-mammalian retinas?
To compare chick RGC responses to those of other vertebrates, we sourced widefield ‘white-step’ response datasets of RGCs from larval zebrafish10, mice37, and humans38 and computed their corresponding distributions of polarity and transience (Figure 2d-f, cf. Supplemental Figure 2c-f). This revealed similarities between the chick and zebrafish datasets, which systematically differed from the mammalian systems.
Chicks (Figure 2a), and to a lesser extent zebrafish (Figure 2d), feature a sizable complement of OnOff channels (see also Supplemental Figure 2c,e,f). OnOff channels are also dominant in salamanders15 and turtles39. In contrast, segregated On- and Off-channels predominate in mice and humans (Figure 2e,f). Even the few well-described mammalian OnOff channels, such as the OnOff direction selective RGCs of mice or the small bistratified RGCs of primates, are dominated by either their On- or their Off-components when probed with ‘white’ steps of light. Second, the link between polarity and kinetics observed for chicks (Figure 2a) was also a feature of the zebrafish dataset (Figure 2d), but no such trend was detectable for mice or humans, whose RGCs occupy the full coding space encompassed by polarity and kinetics (Figure 2e,f).
What might be the benefit for chicks, and perhaps also of other non-mammalian vertebrates, to have combined OnOff-channels, rather than segregated channels for encoding On- and Off-events? And why are polarity, kinetics, and colour opponency linked?
To address these questions, we now analyse each system in turn: Off-, On- and then OnOff. We find that Off-channels encode fast achromatic contrast, On-channels primarily deal with spectral information, while their combination into OnOff-channels simultaneously carries information about both achromatic and chromatic aspects of the visual stimulus.
Off-circuits encode achromatic temporal contrast
No cluster yielded exclusively Off-responses, but the most Off-dominated cluster C1 was distinctive as the only cluster having large amplitude kernels without appreciable colour opponency (Figure 3a, cf. Figure 2b, Extended Data 1), and the fastest response, with centroids (Methods) peaking above 5 Hz for red-, green- and cyan-components (Figures 3b,c). Correspondingly, the cluster’s chirp response (Figure 3d, top) exhibited bandpass tuning that peaked around 10 Hz (Figure 3e-f), and was phase locked to temporal flicker up to the maximum tested frequency of 30 Hz (Figure 3g,h). Most other clusters had low pass tuning to the chirp stimulus and did not follow flicker above 5 Hz. The exception was C2, the second-most Off-biased transient cluster (Figure 2b), which was allocated to the OnOff group due to its strong colour opponency (Figure 2c). C2 frequency tuning matched C1, as described by their spectral kernels (Figure 3b,c), and for temporal flicker (Figure 3d-g), but had a weaker preference for high- (4 - 14 Hz) over low- (<2 Hz) frequencies (Supplemental Figure 3a, cf. Figure 3e) and weaker phase locking (Figure 3g). Both C1,2 had smaller On- than Off-responses to intensity steps.
Although C1,2 were the two most Off-dominated clusters and had the fastest temporal tuning, their flicker responses were On-at low and Off-at high-frequencies (Figure 3h). Importantly the same reversal of response polarity applied to all clusters with any phase locking (Supplemental Figure 3b).
Thus, it appears that chicks use On- and Off-circuits to encode slow and fast temporal contrast respectively. Axonal conduction velocities of the four groups supported this conclusion: Off-RGCs were faster than OnOff-RGCs, which in turn were faster than On-RGCs (Figure 3i).
Beyond speed, Off-circuits had three properties suited to encoding achromatic intensity, exemplified by cluster C1. First, Off-responses had linear contrast-response functions to white steps (Figure 3j). Second, their spectral tuning matched a log-sensitivity function of the LWS opsin (Figure 3k), which is expressed in red single cones and the double cone (Figure 1b) – the latter having long been implicated in fast, achromatic vision40. Third, key features of Off-responses were common to all clusters, namely their kinetics to white-steps (Supplemental Figure 2d), their contrast-response functions (Figure 3l), and their spectral tunings (Figure 3m). By comparison On-responses were diverse (cf. Figure 4, discussed below).
Cluster C1 and C2, which contain the chick’s rapid achromatic Off-circuits, comprised 4.6% and 2.8% of recorded cells respectively. By comparison, the functionally similar primate parasol cells comprise 10-16% of RGCs41, while the types of alpha cells in mice make up about 5%31,42–44. Unlike in chicks, these mammalian fast achromatic contrast systems have equal proportions of On- and Off-cells.
On-circuits encode wavelength information
In contrast to the homogeneous, fast, and achromatic Off-circuits, On-circuits tended to be heterogeneous, slow, and spectrally nuanced. For example, Ontr-cluster C18 had a highly selective On-response to blue-light (Figure 4a-c), while cluster C15 was selective for red-light (Figure 4d-f). Both clusters were selective for coloured over white light with the peak response to the spectral stimuli exceeding the corresponding ‘white’ response. Both C15 and C18 lacked any overt sign of colour-opponency but their’ spectral tuning was narrower than those of the spectrally closest opsins (Figure 4b,e), unlike the Off-cluster C1 (Figure 3k). The remaining three Ontr-clusters had similar properties to C15,18 (Figure 4c,f, C14,16,17). By contrast, the four Onsus-clusters (C19-22, (Figure 4g-i) has larger responses to white compared to coloured light (e.g. C20 in Figure 4g, cf. Extended Data 1 for C19,21,22), and their spectral tuning functions were broader than a single opsin in isolation.
To explore how On- and Off-responses encode wavelength information we computed two indices (Methods): The spectral dominance index compares a cluster’s largest colour step response to its 100% white-step response, such that −1 and 1 indicate exclusive response to white and colour steps, respectively, while 0 indicates equal responses. Conversely, the spectral tuning index compares the width of a cluster’s spectral tuning function to that of its spectrally nearest log-opsin template, such that −1 and 1 respectively indicate infinitely narrow or broad tuning functions, while 0 indicates that the cluster’s tuning matches that of a single opsin. Together these two indices revealed systematic differences in how On- and Off-responses encode spectral information. For On-responses, spectral dominance and spectral tuning were strongly correlated, and spanned much of the possible coding space (Figures 4j, cf. Supplemental Figure S4a). In the lower left quadrant, the four Onsus clusters exhibited spectrally broad and white-dominant responses, while in the upper right quadrant, several of the Ontr clusters had some of the most spectral-dominant and/or narrow responses. The remaining Off and OnOff clusters lie between these extremes. By contrast, Off-responses generally fell around the origin of both indices, indicating no preference for white or coloured light, and spectral tuning consistent with drive from a single opsin (Figure 4k, cf. Supplemental Figure S4b). Accordingly, On-circuits had spectrally distinct responses, while Off-circuits exhibited low spectral diversity. This difference was also illustrated by spectral tuning functions for On-responses (Figure 4l) compared to the tunings of Off-responses (Figure 3m). Moreover, most clusters had markedly sub-linear On-contrast-response functions to white light (Figure 4m), rather than the more linear contrast-response functions of Off-circuits (cf. Figure 3l). The Onsus group was an exception to this rule, with supra-linear contrast-response functions.
Taken together, it therefore appears that the chick retina disproportionately leverages Off-circuits to encode rapid achromatic contrast (Figure 3), and On-circuits to encode wavelength (Figure 4).
OnOff channels multiplex spectral and temporal information
Building on the foregoing observations, we asked what happens when the chick’s functionally distinct On- and Off-circuits are combined to form OnOff-channels. The combination might allow emergence of new functions but degrade one or both incoming messages. Alternatively, if On- and Off-signals remain partly decodable – for example in time – the new channel could encode aspects of both sets of incoming information. Such multiplexed signal transmission might allow efficient use of the optic nerve’s limited bandwidth, as in communications technology45.
Accordingly, we explored the possibility that OnOff-channels in the chick retina might be simultaneously informative about both spectral and temporal contrast (i.e. “colour” and “greyscale”).
Evidence for multiplexing comes from the spectral kernels of OnOff cells (Figure 5a). The full kernels were colour opponent in all OnOff clusters (C2-13), but their opponency was usually time-dependent with the spectral kernels converging to a non-opponent Off-signal in the final ∼100 ms preceding the spike. This effect is illustrated by cluster C7 (Figure 5a, fifth entry): at ∼500 ms preceding the spike, the cluster is ‘red/green’-On ‘blue’-Off opponent (shaded in brown), but as the blue-kernel was monophasic (i.e. ‘slow’) and the red/green-kernels were biphasic (i.e. ‘fast’), all three converged to yield an achromatic late Off-period immediately preceding the spike (shaded in grey). Thus, cluster C7 combines “colour” and “greyscale”-information, segregated in time.
To systematically quantify this property, we defined colour-opponent (+1) and non-opponent (−1) phases in each OnOff cluster’s spectral kernels (Figure 5a, brown and grey shadings, respectively). We also normalised each cluster’s kernels in time (with “-1” and “0” indicating the timepoints where the kernels’ first exceed a minimum threshold amplitude, and the time of the spike, respectively). For all twelve OnOff clusters we then computed the mean±SD ‘opponency fraction’ over normalised time (Figure 5b; 1 and − 1 on the y-axis denoting that all clusters were opponent or non-opponent, respectively, Methods). This confirmed the systematic nature of the spectro-temporal responses: On average, spectral kernels were colour opponent over long time scales, but non-opponent over short time scales.
Even though OnOff clusters differed by nearly an order of magnitude in their overall kernel kinetics (Figure 5a, cf. Extended Data 1, Figure 4b,c), within each cluster, kinetic order was stereotyped: Red-kernels tended to be the fastest, followed by green-, then cyan-, and finally blue. Accordingly, most OnOff clusters encoded a similar spectral hierarchy over different time-scales. Clusters C2-4 were fast at all four wavelengths, C5,6 intermediate and C7-12 were remarkably slow. The achromatic and strongly Off-dominated cluster C1 fitted the fast extreme of this pattern, as the fast non-opponent fraction of the kernel was retained but the slow opponent fraction lost (Extended Data 1). Overall, the analysis of spectral kernels suggests that OnOff channels simultaneously encode slow spectral and fast achromatic information, with the two sets of features being segregated, and thus decodable, by their relative timings.
“Time-dependent opponency” in OnOff channels emerges from differential spectral integration across On- and Off-circuits
We next asked how these time-wavelength features might relate to the properties of the On- and Off-circuits in isolation. To this end, we analysed the OnOff clusters’ On- and Off-responses to ‘white’ and ‘coloured’ steps of light (Figure 5c-f). In line with corresponding observations at the level of the kernels, this revealed a systematic wavelength-dependence of clusters’ step responses: For both On- and Off-transitions, the largest amplitudes tended to occur for red, followed in turn by yellow, green, and cyan. For of the Off-channel, this trend extended to blue, while for On, blue-responses tended to be larger than cyan responses. Together, these spectral responses accounted for the previously observed biphasic and monophasic spectral tuning functions of the On- and Off-channel, respectively (On: Figure 4l, Off: Figure 3m).
Step responses differed in their latencies as well as their amplitudes (Figure 5c,d). For example, red responses tended to be large, with short latencies, while cyan responses were smaller and delayed. An inverse link between response amplitude and latency is expected, but the variance across these two parameters was sometimes substantial. For example, the latencies of red- and green-On responses of cluster C6 differed by more than 60 milliseconds (red: 104 ms, green: 167 ms) despite their almost identical slopes (Figure 5c, fourth entry). By comparison, the On-response to the 100% white step was intermediate at 110 ms. In fact, within the On-channel, the latencies of red-responses tended to be slightly below those of the corresponding 100% contrast white step responses. Conversely, in the Off channel the white response was generally dominant (Figure 5d).
To explore if and how these types of amplitude and kinetic differences might encode stimulus wavelength and/or intensity, we quantified the OnOff clusters’ ‘colour’ and ‘white’ step-response amplitudes and latencies and normalised each relative to their respective red-response. (Figure 5e,f). This revealed that for the On-, but not for the Off-channel, the combination of these two simple metrics alone sufficed to substantially disentangle wavelength from intensity information. Red-responses were systematically faster than white-responses, allowing their detection by latency alone (Figure 5e), whereas, green- and blue-, but not cyan- or yellow-, could be distinguished from the ‘white’ contrast series by their long latencies, despite their large amplitudes. While white-responses increased in latency while dropping in amplitude with decreasing contrast, blue- and green-On responses had almost equal amplitudes to red-responses but a nearly two-fold greater latency. In contrast, the Off-responses to ‘coloured’ and ‘white’-steps had similar amplitudes and latencies, precluding their differentiation by these metrics (Figure 5f).
Beyond amplitude and latency, step responses of the OnOff cells differed in their overall temporal envelopes. Accordingly, to capture amplitude and kinetic differences across ‘coloured’ and ‘white’-step responses we used Principal Component Analysis (PCA). We first averaged step responses to all red-, yellow-, green-, cyan- and blue-step as well as to the 100, 80, 60, 40 and 20% contrast steps (Figure 5g,h, top), which as expected reproduced the amplitude and kinetic features discussed previously. From these averages, we performed PCA separately on On- and Off-responses, both of which yielded a relatively slow first component followed by a faster second component that together captured >99% of the total variance (Figure 5g,h, middle). We then projected each response’s loading onto these two components (Figure 5g,h, bottom). As predicted from our analysis of amplitudes and latencies alone (Figure 5e,f), this highlighted systematic differences in the encoding of ‘coloured’ versus ‘white’ stimuli for the On-channel, but a common encoding scheme for the Off-channel. For On-responses, the white contrast series was almost completely captured by the first principal component, while colour steps exhibited an approximately fixed loading onto PC1 but wavelength-dependent differences onto PC2. As expected, for Off-responses colour and white steps followed essentially the same trajectory in PC-space.
Finally, to establish the generality of this encoding strategy, we performed PCA separately for a subset of individual OnOff-clusters (C3-8) (Figure 5i,j). In each case, ensuring a common polarity of the first two components (Figure 5i,j, top), we normalised the loadings onto the two PCs between −1 and 1 to enable side-by-side comparison (Figure 5i,j, bottom). By and large, this strategy recapitulated the same distribution of loadings in PC space across coloured and white steps. White-On responses mostly followed PC1 with generally only weak loadings onto PC2, while conversely, red-, yellow, green- and blue-On-responses required a relatively fixed loading onto PC1 but systematically distinct loadings onto PC2. In contrast, as before, coloured, and white Off-responses followed a common, intermixed trajectory.
Taken together, we conclude that in the chick retina, and perhaps also in some other non-mammalian species, On- and Off-channels predominately encode slow-spectral and fast-achromatic information, respectively, while a majority of OnOff-channels simultaneously capture both types of information.
DISCUSSION
We have shown that the spiking output from the chick retina comprises multiple kinetically and spectrally diverse OnOff-channels, alongside sparser populations of fast achromatic Off-, and slow chromatic On-channels. These correlations between polarity, kinetics, and wavelength selectivity (Figure 2a-c) imply that there is a general organising principle in this avian retina, where Off- and On-channels represent fundamentally different aspects of the visual scene namely: time and colour.
Pathway splitting in non-mammalian retinas
The division of the visual signal into On- and Off-pathways at the retina’s first synapse is a fundamental and ancient46 feature of vertebrate vision4. The division reduces energy requirements and increases the dynamic range47 so it is not surprising that sensory systems should balance On- and Off-channels, as exemplified in mammal RGCs2 (cf. Figure 2d-f). Well-segregated and coordinated On- and Off-channels also emerge de novo in retina-inspired computational models aiming to capture as much information as possible from natural scenes48,49. Why, then, do chicks not comply with this arrangement?
One partial explanation might be that beyond efficient use of neural bandwidth and energy, other aspects of visual processing benefit from combining On- and Off-signals. Specifically, encoding wavelength independent of intensity, which is an essential feature of colour vison, requires opponency between spectrally distinct On- and Off-inputs28. One example of an OnOff chromatic opponent channel is the primate small bistratified RGC50,51, which has two notable parallels to the multiple colour opponent OnOff cells of chicks. First, as in chicks, the Off-circuit is driven by LWS cones, and second, the blue On- and yellow Off-circuits have different time-courses: On is slow, but Off is fast50. In primates, this feature is probably inherited from corresponding kinetic differences between SWS1 and LWS cones52. It is unknown if also in chicks the different spectral photoreceptors have systematically different kinetics – but if they do, it might partly account for the correspondingly systematic kinetic differences in spectral kernels observed at the level of RGCs (Figure 5a).
Nevertheless, with ∼67% of all recorded cells, the numerical abundance of chick colour-opponent OnOff RGCs is superficially at odds with efficient coding theory53–55. Spectral variance makes up only a small fraction of information in natural scenes, which are dominated by achromatic contrasts5. Correspondingly53–55, visual systems including our own are thought to prioritise the encoding of intensity rather than wavelength8,56. This implies that beyond wavelength, OnOff RGCs in chicks might encode other aspects of the visual scene, such as time: while the spectral and kinetic order of OnOff clusters’ kernels was remarkably consistent (from ‘red and fast’, to ‘blue and slow’), the overall response kinetics of these clusters varied by more than a log unit in speed. The different OnOff clusters therefore systematically encoded similar sets of spectral and kinetic contrasts, but for a rage of temporal regimes.
To what extent these considerations will stand the scrutiny of systematic computational exploration will be important to explore in the future. It will then also be key to establish whether and how such a system might be extendable to the encoding of space. Based on long-standing work on the retinal output of mammals57, a central prediction would be that the slow and fast cells should correspondingly encode small and large-scale spatial features, respectively.
Why are chick and mammal retinas so different?
The systematic functional differences in the retinal organisation of chicks, and perhaps of other vertebrate lineages, from that of mammals (Figure 2) implies that different clades have evolved divergent strategies for communicating visual information from the eye to the brain (Figure 6). Exploring these differences, their computational consequences, and how they may have come about, will be important in the future, but for now it may be useful to posit one possible avenue of exploration; namely that birds, reptiles, amphibians, and fish differ from mammals in their complements of photoreceptors (Figure 6a). While some lineages in each of the former clades retain the full complement of four ancestral single cones58 – often elaborating them in various ways – early mammals lost their SWS2 and RH2 cones28,29, so that typical mammalian retinas are driven by two cone inputs, but many non-mammalian retinas are driven by four or more.
This systematic difference in inputs may lead to different retinal processing challenges. Perhaps the complex interplay of time and wavelength coding in the chick is a consequence of their relatively complex input system that is carried over from ancestral vertebrates, while the loss of two cones more than 200 million years ago, alongside – presumably - freeing up a substantial diversity of now disused inner retinal circuits, left mammals with an opportunity to evolve new, powerful processing strategies that were previously precluded.
In support, mammalian retinas also differ from those of birds, reptiles, amphibians, and fish in other ways (Figure 6b). For example, their neuronal density is substantially lower2,26 (with the exception of salamanders16, but not frogs59). The only other major vertebrate lineage with consistently low cellular density retinas are elasmobranchs60 (sharks, rays, and skates), which, like mammals, have lost two or more of the four ancestral cones28. Mammals43,61 and elasmobranchs62 are also unusual among vertebrates in that most of their bipolar cells and RGCs are mono- or at most bistratified, rather than routinely multistratified as is the case in birds17,63, reptiles64, amphibians65, and fish10,66,67. In view of the well-documented links between inner retinal depth, response polarity, and kinetics3,5,6,12,68,69, these anatomical differences may directly feed into the observed functional differences. Future functional exploration of other vertebrate retinas, especially those that like mammals have lost a subset of their original photoreceptor complement, may be instructive.
The avian double cone as the input to the fast, achromatic Off-circuit?
Bird and reptile eyes are unique having ‘double cones’ that are distinct from the full complement of ancestral single cones28,40,63. Double cones are made up of two tightly associated cells: a principal and an accessory member, which are independently wired into outer retinal circuits63. Both members express the same ‘red’ LWS opsin that is also found in the ancestral LWS single cones. However, unlike LWS single cones of other species, which are generally associated with achromatic processing29,30,33, direct recordings from either member of avian double cones have not been achieved, leaving insights as to their functions speculative70. In general, their numerical abundance in the periphery71, but absence from the fovea72, hints at a key role in finely resolved temporal rather than spatial processing10,73,74.
In chick, the only robust responses to fast achromatic flicker occurred in the Off channel, whose spectral tuning was consistent with near-exclusive drive from an LWS-expressing photoreceptor system (Figure 3). The simplest explanation for these observations is that the chick’s rapid and achromatic Off-circuits are driven by either or both members of the double cone, and/or the red-single cone. In tentative agreement, at least two anatomical types of chick bipolar cells receive exclusive direct input from the principal member of double cones alongside inputs from rods63. Both these cells stratify in the upper to middle fraction of the inner retina which is generally associated with Off-dominated processing5. Our data therefore lends further credence to the idea that the avian double-cone system might support fast, achromatic vision40, and add the perhaps surprising notion that this signal appears to be exclusively carried by Off-circuits.
On for ‘colour’, Off as a common reference
To distinguish wavelength from intensity, circuits for colour vision use colour opponency as their fundamental ‘currency’28. However, beyond a basic requirement of combining spectrally distinct On-and Off-signals, there are many options to build an opponent circuit. A short-wavelength On-pathway could be combined with a long-wavelength Off-pathway, but it is equally plausible to do the reverse, i.e. to combine short-Off with long-On. And yet, overwhelmingly, vertebrate circuits for colour vision, including in mammals, appear to favour the former9,11,28,29,32,75. Second, when more than one colour opponent axis is established, the second axis could either oppose two entirely new wavelength ranges, or it could reuse one of the two signals from the first axis. Here, the spectral heterogeneity of the On-channels (Figure 4), but homogeneity of the Off-channels (Figure 3), suggests that chicks do the latter: They systematically oppose their spectrally diverse On-signals to a common, LWS-driven Off reference. Similarly, the zebrafish brain appears to be dominated by spectrally narrow and diverse On-signals, but spectrally homogenous, broad and LWS-shaped Off-signals13.
Cortex-like hue-coding in the avian retina?
Despite lacking overt signs of colour opponency, many cells were ‘spectrally selective’ in that they exhibited sharper-than-opsin spectral tuning and a preference for spectrally narrow over ‘white’ light (Figure 4). Sharper-than-opsin tuning can, in principle, be achieved by rectification of a spectrally broader, non-opponent drive. However, in that case the cell would nevertheless be expected to respond strongly to ‘white’ stimulation, which was not the case for many of the spectrally narrow clusters (Figure 4j). Alternatively, narrow tuning could be built by rectifying an already opponent input. For example, a hypothetical RGC rectifying an incoming drive from a colour opponent BC12,69,76 such that only the On-lobe of the opponency persists could readily account for the profusion of ‘colour-selective’ On-RGCs in our dataset. Neurons with similar properties have long been discussed as part of the colour vision machinery of the primate cortex, where they are usually referred to as hue-selective77. Similarly, narrower-than-opsin On-responses also exist in the zebrafish tectum13,78 and in the mushroom bodies of butterflies79. To our knowledge, they have not been described in a vertebrate retina.
METHODS
Lead Contact
Further information and requests for resources and reagents should be directed to and will be fulfilled by the Lead Contact, Tom Baden (t.baden{at}sussex.ac.uk).
Data and Code Availability
All pre-processed spike data (cellwise trial-averaged means and kernels) are available for interactive display and download from http://chicken-data.retinal-functomics.net/. The underlying individual spike timings, corresponding stimuli, and an explanation of the file structures can be downloaded from the same site.
Materials Availability
No unique materials were generated as part of this study.
EXPERIMENTAL MODEL AND SUBJECT DETAILS
Animals
All procedures were performed in accordance with the UK Animals (Scientific Procedures) act 1986 and approved by the animal welfare committee of the University of Sussex. Male chicks, breed Shaver Brown, aged between 1 and 14-days post hatching were obtained from Joice and Hill (part of Hendrix Genetics, Peterborough, UK) and kept in a specifically designed cage in the university’s animal facility. Food (chick crumb) was provided ad libitum and elements for facilitating play behaviour were provided. Chicks were kept on a 12h:12h light:dark cycle, and never kept in isolation for prolonged periods of time.
Ringer solution
Ringer solution for the experiment was based on Ref80, calibrated to 0.331 osmol. In moles per l: 0.1 NaCl, 0.006 KCl, 0.002 MgSO4, 0.001 CaCl2, 0.03 NHCO3, 0.001 NaH2PO4, 0.05 sucrose. Two litres of ringer solution were freshly prepared for each experiment. The solution was bubbled with carboxygen (95% O2 / 5% CO2) for at least one hour before experiments and heated to 37°C. A second batch of the same ringer solution, however with added 0.4 mM MgSO4, was prepared for dissection.
Enucleation and dissection
Chicks were dark adapted for at least 12 hours overnight. In total darkness using infrared goggles, chicks were sacrificed by cervical dislocation followed cutting of the aorta. Eyes were enucleated by first cutting the eyelid around the cornea, followed by a single anterior leading cut between eyes and beak. Using curved forceps (FST 11652-10, FST Heidelberg, Germany), the eyes were then lifted, and the optic nerve was cut using scissors with a partially blunt tip (FST 14083-08). In the following the remaining muscle tissue around the eyes were removed and lifted from the skull. The eyes were cut two times proximal to the edge of the cornea using pointed scissors (FST 15017-10) and transferred into two bottles containing preheated, high-magnesium oxygenated ringer solution at 37°C. The bottles were light sealed. Eyes were transferred to the experimental site for retinal dissection. Total time for the above was ∼15 min. Dissection followed procedures detailed in Ref80. However, all steps were performed under infrared light using night vision googles (PSV-14, ACT in Black, Luxembourg), and in a high-magnesium dissection ringer solution. The following steps were performed in a petri dish unless otherwise stated:
Removal of the cornea from the eyeball with as few cuts around the horizon as possible (FST 15017-10).
Cutting the eyeball along the dorsal – ventral axis using two cuts from opposite sides.
Removing the vitreous from the eyeball using forceps.
Cutting a ∼4×4 mm piece out of the central area of the dorsal hemisphere.
Transfer of this piece onto a filter paper, with the RGCs facing the filter paper and the remaining sclera facing up.
The filter paper and tissue were then transferred onto a kitchen roll paper to draw solution, which flattened the tissue and aided attachment to the filter paper.
The remaining sclera, choroid and retinal pigment epithelium were removed from the retina by using forceps to peel those layers off the retina, which remained attached to the filter paper.
The filter paper and retina were transferred back to the petri dish.
The retina was removed from the filter paper using forceps.
A smaller, about 2.5 mm2 piece was cut. The retina would commonly get folded at a few places during step 6 due to flatting of what normally is a curved tissue. In this step, an area without folds was chosen. Folded parts of the retina were avoided even if this resulted in smaller preparation. In addition, corners in the tissue were rounded off since these sites tended to trigger the degenerative waves.
The retina was transferred to the MEA chamber using a spoon. The tissue was guided onto the spoon using forceps and was continuously guided while being on the spoon to avoid strong movement of the tissue during transfer.
In the MEA chamber the corners of the tissue were cut. This was done, to remove parts of the tissue that had been damaged by the forceps during tissue guiding steps.
The tissue was placed onto the electrode array with a fine paintbrush.
The MEA chamber was dried using kitchen roll to suck out the ringer solution. As soon as the tissue was exposed to the air new ringer solution was added directly on top of the tissue. This would normally attach the tissue to the MEA.
The MEA chamber was connected to the head stage.
The tissue was left to further attach to the MEA for ∼ 30 minutes.
The tissue was perfused throughout the experiment using the low magnesium-ringer solution. The MEA chamber was heated to 37°C and the perfusion solution was preheated inline to the same temperature.
METHOD DETAILS
Light Stimulation
Since our multielectrode array was opaque, all light stimuli were delivered from the photoreceptor side by a combination of two custom-modified 3-channel light projection engines (“Lightcrafters”) that together were driven with six spectral LEDs (UV: 360 nm, LED365-06Z, 20 nW; Blue, B: 420 nm, ROHS 247-1757, 65 nW; Cyan, C: 480 nm, ROHS 810-0492, 60 nW; Green, G: 505 nm, RoHS 769-3551, 65 nW; Yellow, Y: 560 nm, RoHS 247-1735, 59 nW; Red, R: 630 nm, RoHS 904-7367, 100 nW) as described previously81. All stimuli used in this study were applied as widefield, exceeding the active area of the multielectrode array (2.67 mm2). The framerates of the stimulators were 60 Hz. LEDs 420 nm – 630 nm were bandpass filtered using Edmund Optics’ 65-137, 65-145, 34-506, 88-011, 65-166 and combined using Chroma Optics beam splitters: NC474265, AT455DC, T550lpxr and T600lpxr. Four types of full-field stimuli were used: (i) flashes of light with all 6 LEDs driven together, at ten different contrast levels (‘white steps’ WS, 100-10%, 2 s On, 2 s Off), (ii) flashes of light at each of the six wavelengths individually (‘colour steps’, CS, 2 s On, 2 s Off), (ii) a 100% contrast white’ chirp stimulus68 (exponentially accelerating from 1-30 Hz over 30 s), and (iv) a binary dense and spectrally flat white noise, in which four of the six LEDs (R, G, C, B) were flickered independently in a known binary M-sequence at 20 Hz for 20 minutes. This allowed us to estimate spike-triggered averages (STA) per LED and cell, hereafter referred to as spectral kernels (SK, below).
Multielectrode array recordings
MEA recordings were performed on a BIOCAM X platform produced by 3Brain AG, Wädenswil, Switzerland. The chip was Arena HD-MEA. It consists of 4,096 electrodes in a 2.67 × 2.67 mm square area (64*64 21 µm electrodes with 42 µm interelectrode spacing. The chamber above the array is 7 mm deep and 25 mm in diameter. The software for data acquisition was Brainwave 4. It was run on a bespoke PC provided as part of the MEA by 3Brain. The integration time was set to medium, resulting in a sampling frequency of ∼18 kHz.
Data analysis
Data analysis was performed using custom written Python (v. 3.9.5, Python Software Foundation, Wilmington, Delaware, USA) scripts executed in JuypterLab, IGOR Pro 6.3 (Wavemetrics), Fiji (NIH) and Matlab R2019b / R2020b (Mathworks).
Spike sorting
Herding Spikes 2 (HS2) algorithm82 was used for spike sorting. HS2 uses a combination of spike location and spike shapes information and performs a mean shift clustering approach to separate spikes coming from different cells. The following parameters were used: clustering_bandwidth: 8; clustering_alpha: 5.5; clustering_n_jobs: −1; clustering_bin_seeding: True; clustering_min_bin_freq: 16; clustering_subset: None; left_cutout_time: 0.3; right_cutout_time: 0.3; detect_threshold: 5; probe_masked_channels: []; probe_inner_radius: 70; probe_neigbor_radius: 90; probe_event_length: 0.26; probe_peak_jitter: 0.2; num_com_centers: 1; maa: 12; ahpthr: 11; out_file_name: HS2_sorted.hdf5; decay_filtering: False; save_all: False; amp_evaluation_time: 0.4; spk_evaluation_time: 1.0; pca_components: 2; pca_whiten: True; freq_min: 300.0; freq_max: 6000.0; filter: True; pre_scale: True; pre_scale_value: 20.0; filter_duplicates: True. After spike sorting, cells that spiked >10 spikes per second on average over a period of one hour were excluded from any further analysis.
Electrical footprints
Spike triggered averages (STA) were computed by reverse correlating spikes from each cell with the voltage signal of all MEA channels within a window of 10 frames before and 40 frames after a spike resulting in 4,096 STAs (one for each channel), consisting of 50 frames. For this, all spikes that occurred within the first half hour of a recording were considered. Electrical footprints were computed for a subset of 5 recordings.
Processing electrical images (EIs)
To identify which channels had recorded an electrical signal from a cell, channels were filtered by calculating a signal strength matrix: The resulting Esignal matrix was median-normalised and all channels with a score below 0 were filtered out. The matrix was reshaped according to the MEA array (64 × 64) and channels with a zscore above 0 but without neighbouring channels were filtered out. If the remaining number of channels was above 400 the EI was not further considered. In most cases only a handful of channels would remain (called filtered channels in the following), of which most would show a typical spike waveform in their STA. We next detected and traced the axon, where present, as follows: The channel with the highest Esignal value was defined as the spatial origin of the spike Smin0, an the frame with the lowest number of counts in the corresponding STA was defined as the temporal origin of the spike signal Fmin0. The soma of the cell was identified by identifying all filtered channels within a radius of 6 channels around S0. Those channels were not considered for the axon tracing. To trace the axon, an array was created storing x and y positions of all remaining filtered channels and their respective Fmin values. The array was sorted from low to high Fmin. Based on the x and y positions a random_geometric_graph was created using the networkx Python package (https://networkx.org/documentation/stable/index.html). Using the function shortest_path, the shortest path between the origin of the signal at S0/Fmin0 and Smax/Fminmax, was calculated. The distance of the part Dshort was calculated as the total length of the path multiplied by the width of the array in mm.
The speed of the signal in m / s was calculated as: with Freq corresponding to the reciprocal of the MEA’s sampling frequency.
Clustering and initial computation of STAs
Clustering was performed on a dataset containing the functional responses of ‘cells’ recovered from the spike sorting ‘coloured’ (CS) and ‘white’ steps (WS) of light, chirp (Chirp) stimuli, and spectral kernels (SK) derived from the spectral noise (SN) stimulus. In all experiments, the complete CS and WS stimuli were repeated 5 times, the chirp stimulus was repeated 3 times and the SN stimulus ran for 18,000 stimulus frames (20 mins) without repeat (using the R, G, C and B LEDs).
In what follows we describe the procedures followed to cluster the data. We clustered using all four of the above stimuli: CS, WS, Chirp and SK.
To determine the spiking rate over time for each cell in response to each of the CS, WS and Chirp stimuli we mapped all spike times onto the time interval spanned by the first repeat of that stimulus and applied kernel density estimation (KDE) using the Matlab routine ksdensity. We used the default probability density function for the KDE, such that the area under the resulting curve is equal to one, thus normalising the spiking rate across cells and stimuli. The KDE was computed at 1,000 equally spaced points and a smoothing bandwidth of 0.05 employed for all three stimuli.
Spectral kernels were computed by collecting the stimulus segments preceding each spike to form a spike-triggered ensemble (STE) and taking the mean of this ensemble. We then subtracted the mean raw stimulus, calculated as the mean of all possible stimulus segments. Stimulus segments were calculated over the one second interval preceding each spike (−1 sec to 0 sec inclusive) at 51 evenly spaced time points (0.02 second intervals). Thus, the segments were calculated at a finer temporal resolution than the stimulus (20 Hz = 0.05 second intervals) resulting in smoother and more detailed SKs (since a spike may occur at any time within a given stimulus window, a higher temporal resolution results in different stimulus segments depending upon the spike’s location within that window). For the mean raw stimulus, the mean was taken over all stimulus segments during the interval over which the stimulus was played, segment initiation times ranging from the earliest to the latest possible time that will allow a full stimulus segment, in intervals of 0.02 seconds. The spectral kernels were further normalised, by subtracting the mean and dividing by the standard deviation of the SK in the [-2,-1] second interval preceding each spike, where this SK is calculated in the same way as the original spectral kernels. This allows us to determine spectral kernel quality (see below) in those cases where the spike count is low.
Cells with low-quality responses to all four stimuli were identified and removed from the data set, cells with a high-quality response to at least one stimulus being retained in all cases. The quality of response to the CS, WS and Chirp stimuli was determined using the signal-to-noise ratio quality index: QI = Var[⟨C⟩r]t/⟨Var[C]t⟩r, where C is the T by R response matrix (time samples by stimulus repetitions), and ⟨·⟩x and Var[·]x denote the mean and variance respectively across the indicated dimension, x ∈ {r, t} (see Ref31). This quality index was applied to the KDE-derived spiking rates, where KDE was applied to each repeat separately rather than mapping all spikes onto the first repeat as for the computation of the spiking rates used for clustering (described above). A quality threshold of 0.4 was chosen, below which CS, WS and Chirp responses were judged to be of poor quality. We calculated the standard deviation in the light intensity over time for each stimulus colour in the kernel (R, G, C and B). The kernel quality of each cell was defined as the maximum standard deviation across the four LEDs. A kernel quality threshold of 2.5 was chosen, below which kernels were judged to be of poor quality. The raw data set was of size n = 60,713, spread across 17 separate experiments. Following quality control, the data set was of size: n = 4,230 (7.00% (3 s.f.) of the original).
We used principal component analysis (PCA) to reduce the dimensions of the problem prior to clustering. PCA was performed using the Matlab routine pca (default settings). We applied PCA separately to each of the 10 segments of the WS stimulus (100%, 90%, …10% contrast segments), to each of the 10 WSs, the Chirp stimulus, each of the 6 CSs, and four SKs. We retained the minimum number of principal components necessary to explain ≥50% of the variance. The resulting 23 ‘scores’ matrices were then concatenated into a single matrix ready for clustering. The following numbers of principal components were used – CS: 24 components in total (3 R components, 3 Y components, 4 G components, 5 C components, 2 B components and 7 U components); WS: 61 components in total ([4,4,5,6,6,7,7,7,7,8] [100,90,80,70,60,50,40,30,20,10]% contrast components); Chirp: 47 components; SK: 8 components in total (2 R components, 2 G components, 2 C components and 2 B components), giving a grand total of 140 PCA components.
We clustered the combined ‘scores’ matrix using Gaussian Mixture Model (GMM) clustering, performed using the Matlab routine fitgmdist. We initially sorted the data into 100 clusters using i) shared-diagonal, ii) unshared-diagonal, iii) shared-full and iv) unshared-full covariance matrices, such that four different clustering options were explored in total. For each clustering option 20 replicates were calculated (each with a different set of initial values) and the replicate with the largest loglikelihood chosen. A regularisation value of 10−5 was chosen to ensure that the estimated covariance matrices were positive definite, while the maximum number of iterations was set at 104. All other fitgmdist settings were set to their default values.
The optimum clustering was judged to be that which minimised the Bayesian information criterion (BIC), which balances the explanatory power of the model (loglikelihood) with model complexity (number of parameters). Using the above procedure, we found that unshared diagonal covariance matrices provided the optimal solution. From here we combined 100 initial clusters into 36 merged clusters by hand based on similarity in cluster mean responses. Finally, of the 36 remaining “compound-clusters”, 14 that comprised fewer than 20 members were discarded, yielding a final total of 22. The full dataset, including allocations to the original 100 clusters and their subsequent combinations are available online at http://chicken-data.retinal-functomics.net/.
Quality indices
We computed three separate response quality indices pertaining to the WS and CS stimuli as well as the spectral kernels. The quality indices for WS and CS stimuli were calculated as: where SYNC denotes the SPIKE-synchronization, ISI the inter spike interval and PSTH being the peri stimulus histogram calculated over all 5 repeats of stimulus repetition. SYNC, ISI and PSTH were calculated using the Pyspike Python package83. Kernel quality was calculated by adding the sum of the 100 highest counts and the absolute sum of the 100 lowest counts of z-normalised STAs across all chromatic channels. Based on these, we then computed a ‘compound QI’ for each cell by first normalising the distributions of each of the three individual QIs (CS, WS, kernels) to a mean of 1, followed by averaging across the three normalised QIs. As such, the three stimuli were given equal weighting. This compound QI was used as the basis of sorting cell by quality within each cluster. Note that a different set of QIs was used to preselect cells for inclusion in the clustering in the first place (see above).
Step-responses (WS, CS)
Response amplitudes. We computed three basic response amplitude measures for each step-transition. (i) The total On and Off response amplitudes were taken as the mean number of spikes elicited during the 2 seconds of stimulus presentation. These are used as the basis of all plots showing spectral tunings and contrast response functions (e.g. Figure 3j,k), to compute the polarity index (e.g. Figure 2b) and to compute the spectral tuning index (e.g. Figure 4j). We also computed a baseline value per cell as follows: A 1,000-bin amplitude histogram was computed based on the concatenated, cluster-averaged responses to WS and CS, and the peak of this histogram was taken as the baseline. Accordingly, the baseline was defined as the most common amplitude value throughout the full response trace to these two stimuli, which yielded reasonable estimates as judged by manual inspection. This cluster-wise baseline estimate was also individually associated with each cell within a cluster because this procedure was judged to yield more robust cell-wise estimates compared to computing the same metric based on each cell. The baseline value was used in two ways: As a display item in the spectral tuning and contrast-response plots (e.g. Figure 3j,k), and as a means to normalise response amplitudes across clusters, where tuning functions were normalised between 0 (baseline) and 1 (peak response, e.g. Figure 3l,m).
For each step transition (i.e. On and Off), we also computed ‘transient’ and ‘sustained’ response measured based on the peak spike rate within time windows of 80-160 and 240 – 2,000 ms following the step transition, respectively. To ameliorate the effects of noise, we used box-smoothed (window size of 40 ms) response traces to estimate these latter two metrics. The transient and sustained response metrics were used as the basis of the transience indices (e.g. Figure 2b). The transient-response amplitude was further used as the basis of spectral dominance index (Figure 4j,k, Supplemental Figure 4), and to compute normalised amplitudes used for relating latencies and amplitudes in Figure 5e,f.
Polarity index (PI)
For clusters, the polarity index (PI) was computed based on On- and Off-response amplitudes (see above) of the 100% contrast WS as: Where AOn and AOff are the On- and Off-response amplitudes, respectively. Accordingly, PI ranged from −1 to 1 to denote entirely Off- and On-dominated responses, respectively. A PI of 0 denotes a cell with equal amplitude On- and Off-responses. The same measure was also used to compute polarity of the zebrafish, mouse and human datasets (Figure 2d-f, Supplemental Figure 2c), with the exception that in this case, AOn and AOff were taken as the total response for 1 second (rather than 2 seconds) following a step-transition. This adjustment was necessary because the step-duration in the mouse dataset was 1 second.
Transience index (TI)
As for polarity (above), the transience index (TI) was computed as the contrast between transient and sustained responses per 100% contrast WS as follows: Where ATr and Asus are the transient and sustained response amplitudes as described above. Accordingly, TI ranged from −1 to 1 to refer to cells entirely dominated by their sustained and transient response component, respectively.
For each dataset, the TI was computed three times: As On- and Off versions, based on the respective On- and Off-measures of transient and sustained responses, and as a ‘compound TI’ where the associated polarity index would dictate which of the On- or Off-versions would be used for transience. This ‘compound TI’ was used for Figure 2b-e, while the individual On- and Off-TIs were used for Supplemental Figure 2d.
Spectral dominance (SD)
Spectral dominance (SD) was computed as follows: where Acolour and Awhite are the transient response measures of the largest CS responses, and that of the 100% contrast white response. SD ranged from −1 to 1, indicating responses entirely dominated by WS and CS stimuli, respectively.
Spectral tuning (ST)
An index of “spectral tuning” (ST) was devised to indicate how closely the spectral tuning of a given response matches that of any of the four opsins expressed across the chick’s cones (LWS, RH2, SWS2, SWS117). To compute ST, we first log-transformed Govadovski-templates84 of each of the four opsins and stretched it over one log-unit, such that in each case a linear response of 10% and 100% maximum was mapped to zero and 1, respectively. Such a ∼10-fold log-transform is expected based on the phototransduction cascade of cones85. The resultant transforms are also used as the basis of all opsin-templates presented across the figures. For each cell or cluster’s normalised spectral tuning function (see above) we next determined which of the four templates provided the closest match based on their correlation coefficient. We then subtracted the normalised tuning functions from their respective template, and computed ST as the mean of their difference, multiplied by −1. As such, ST was zero when the opsin template and response were perfectly matched, but negative and positive, respectively, if the response was spectrally broader or narrower than the opsin. Throughout, we applied a minimum response threshold of 3 spikes as the peak response – tuning functions based on fewer spikes were not considered.
Latency
Response latency was computed separately from On- and Off-transitions of all time-smoothed WS and CS responses (40 ms window size) as the time to half peak.
Principal Component Analysis (PCA)
To compare the temporal response-envelopes elicited by CS and WS stimuli, we used principal component analysis. Computing separately for On- and Off-transitions, for a given cluster (Figure 5i,j), or the mean responses of all OnOff clusters (Figure 5g,h), we combined the first five CS (i.e. excluding UV) and first five WS (100, 90, 80, 70, 60% contrast) into a 50×10 input matrix (50 time bins of 20 ms each, 10 responses). To exclude residual activity from the preceding stimulus, we zeroed the first 4 bins (=80 ms). The first two principal components emerging from PCA across this matrix consistently explained >94% of the total variance. Accordingly, higher PCs were discarded. To relate the results from the PCA across clusters (Figure 5i,j), sets of loadings were individually peak normalised to 1.
Spectral Kernels (SK)
Kernel amplitudes
Kernels were individually (R, G, C, B) z-normalised based on timepoints between 1,000 and 500 ms preceding the spike, and amplitudes were subsequently computed (in z-scores) as the difference between their maximum and minimum values. Cluster-mean kernels with an amplitude <2.5 were discarded from further analysis.
Spectral Centroids
To quantify the kinetics of each kernel, we estimated their central frequency in the Fourier domain (‘spectral centroid’) as follows. We first computed each (R, G, C, B) kernels’ probability mass function as its area-normalised magnitude-squared Fourier transform. We then multiplied each entry by the reciprocal of its associated frequency in Hz. The kernels’ central tendency in Hz was then taken as the sum of this product.
Colour Opponency
Colour opponency was determined based on the relative trajectories of each cell or cluster’s group of four spectral kernels (R, G, C, B). Due to the complex interplay between time and polarity across the group of kernels, we started by categorising each kernel-group’s 1 ms time-bin as either non-responsive, responsive and non-opponent, or responsive and opponent. Time-bins where none of the four kernels exceeded a minimum absolute amplitude of 3 (or 1.5 for cluster means) were defined as non-responsive and set to NaN. All remaining responsive bins were categorised as either opponent (1) or non-opponent (0) depending on the relative signs of all four kernels amplitudes. Next, for all kernel-groups where at least 20 (of 1,000) time bins were categorised as opponent (i.e. 20 out of 1,000 ms maximal kernel duration), a colour opponency index (COI) was defined as the mean of this vector. Accordingly, colour opponency ranged from 0 (entirely non-opponent) to 1 (entirely opponent). Kernel-groups with a Kernel-QI (see above) < 1,000 were excluded from this analysis and categorised as non-opponent. The time-dependent opponency vectors were further used as the basis of aligning the opponent and non-opponent time-regions of each OnOff cluster (Figure 5a,b). To this end, we time-compressed each associated vector between its beginning (time −1), defined as the first timepoint where the group of kernels exceeded a minimum amplitude of 1, and the time of the spike (time 0). In this case, non-opponent kernel bins were set to −1 instead of 0 (see Figure 5a,b)
Chirps
Fourier analysis, best frequency, and high-frequency index (HFI)
For each cluster mean, we computed the magnitude-squared Fourier transform for the full chirp-response (accelerating from 1 – 30 Hz over 30 s – shown as area normalised in Figure 3e,f), and used this as the basis of a high-frequency index (HFI): where Phigh and Plow were the mean power between 0.9 and 2 Hz, and between 4 and 15 Hz, respectively. As such, HFI varied between −1 and 1 to indicate clusters entirely dominated by a low- and high-frequency response, respectively. Clusters that exhibited fewer than 20% additional spikes during the chirp-stimulus compared to the 5 seconds preceding it were considered unresponsive and excluded from this analysis.
Best frequency was taken as the peak of each cluster’s magnitude-squared Fourier transform.
Phaselocking analysis
For each spike elicited during chirp stimulation, we computed the phase relative to the stimulus as follows. First, to compensate for the phototransduction delay and inner retinal processing, all spikes were negatively time-shifted by 100 ms relative to the stimulus. The value of 100 ms was chosen based on the mean of On- and Off-latencies (time at half maximum) across all cells’ 100% contrast WS-responses (= 99.4 ms). We next divided the 30 s chirp segment (1-30 Hz) into ten equal time bins of 3 seconds each, with their corresponding mean frequencies centred at 1.2, 1.7, 2.4, 3.3, 4.7, 6.5, 9.8, 12.5, 22.5 and 25.0 Hz. We then computed the phase angle θ for each spike (ranging from 0 to Τ, where 0 and Τ/2 denote the onset of the On and Off-phase, respectively, as indicated e.g. in Figure 3h). For each cell and time bin we then determined resultant median vector strength r (ranging from 0 to 1 to indicate a random relationship between each spike and the stimulus, and perfect phase locking, respectively) as described elsewhere86. For statistical testing, we also computed the same metrics based on 1,000 Poisson statistics but time-randomised artificial neurons that comprised the same number of average spikes as each cluster. We then used the upper 95% confidence intervals from the distribution of vector strengths emerging from this artificial set of neurons as the basis of determining statistical significance (i.e. p<0.05): Any cluster-entries that failed to exceed this confidence boundary of their corresponding bootstrapped distribution were deemed non-significant. For clarity, these data points are removed from the presented data (e.g. Figure 3h).
Choice and pre-processing of functional RGC datasets other vertebrate species
To relate our data from chick retina to those of other species, we sourced previously published RGC-response datasets from human, mice, and zebrafish. To ensure good compatibility, we specifically looked for step-responses to spectrally broad stimulation, and for datasets that comprised representative fraction of most, if not all RGC types in that species. However, since the state-of-the-art with regards to different species is not fully equivalent, some compromises had to be made. The mouse dataset was sourced from Ref37 (rgc-types.org) as probably the most comprehensive available to date. The dataset presents mean spike-responses of all 42 RGC types currently described in mice, to a 1-second step of scotopic light aimed to stimulate the rod-system. The relative abundance of each type was then complemented by assessing their corresponding abundance on Eyewire43. Next, as an example for primates, we chose a multielectrode array dataset collected from post-mortem human donors, published as part of Ref38. While our understanding of the functional RGC types in the human retina is perhaps still lagging that of some other primate species, we nevertheless chose this dataset because it included systematically recorded ‘white’ step responses – in this case for 3-second 100% contrast low-photopic steps of light. The dataset consisted of n = 411 individual neurons. From here, we clustered the data using Kmeans into 20 clusters that capture some of the major structure. We chose the value of 20 as a compromise between a very conservative estimate of the full RGC diversity in primate retina, and in view of the limited dataset which is likely to lack some of the rarer types. The exact number of clusters had no notable effect on the overall distribution of clusters with regards to polarity and transience (see also the original publication38 which shows a very similar distribution based on 5 clusters). Finally, we also included our previously published functional RGC dataset from larval zebrafish10, which included 3-second step responses to 100% contrast low-photopic white light. Because this data was collected by 2-photon mGCaMP6f imaging of RGC-responses in the live eye, rather than by recording ex-vivo RGC spikes, we approximated spiking activity by deconvolving responses with a Tau = 0.4 s decaying time kernel. The exact value of this deconvolution did not notably affect results. A correspondingly comprehensive zebrafish dataset based on direct measurements of spike responses has not been published. Because compared to mice and primates, in larval zebrafish the link between functional10, anatomical67 and genetic87 RGC types remains notably less complete, we opted to use the n = 16 functional RGC clusters and their relative abundances that were described in the original study as the basis of our analysis. It is likely that this relatively low number of clusters underestimates the full RGC diversity in larval zebrafish. Since the duration of step responses used across the 4 datasets were not identical, we cropped the first 1 second following each On- and Off-transition and resampled each into fifty time-bins of 20 ms. From here, analysis for all datasets was identical, and followed the steps for computing polarity and transience indices described above. When separately computing histograms of On- and Off-transience, applying a minimum response threshold of 20% for each type/cluster’s ‘weaker’ polarity for inclusion, where applicable (such that e.g. a cluster that exhibited a more than 5-fold stronger response to On versus Off was included for the On-histogram, but not the Off).
QUANTIFICATION AND STATISTICAL ANALYSIS
Statistics
No statistical methods were used to predetermine sample size. Owing to the exploratory nature of our study, we did not use randomization or blinding.
Linear correlation tests were used to establish the correlation coefficient and significance of relationships between polarity and transience indices (Figures 2a,d-f).
Wilcoxon Rank Sum tests were used to probe for differences in conduction velocity between different cluster groups (Figure 3i). The custom bootstrapping approach associated with evaluating vector strengths (Figures 3g) is detailed in the section on phase-locking.
Supplemental Video S1. Video-montage of the three electrical images (EIs) shown in Figure 1f,h,I, shown at 1:500th real time.
Acknowledgements
We thank Thomas Euler for critical feedback. Funding was provided by the Wellcome Trust (Investigator Award in Science 220277/Z20/Z to TB), the European Research Council (ERC-StG “NeuroVisEco” 677687 to TB), UKRI (BBSRC, BB/R014817/1 and BB/W013509/1 to TB), the Leverhulme Trust (PLP-2017-005 and RPG-2021-026 to TB; GK was supported by the Leverhulme DTP DS-2017-011) and the Lister Institute for Preventive Medicine (to TB). This research was funded in part by the Wellcome Trust [220277/Z20/Z]. For the purpose of Open Access, the authors have applied a CC BY public copyright licence to any Author Accepted Manuscript version arising from this submission.
Footnotes
Funding Acquisition, DO, TB.
The authors declare no conflict of interest, nor interest in conflict.
REFERENCES
- 1.↵
- 2.↵
- 3.↵
- 4.↵
- 5.↵
- 6.↵
- 7.↵
- 8.↵
- 9.↵
- 10.↵
- 11.↵
- 12.↵
- 13.↵
- 14.↵
- 15.↵
- 16.↵
- 17.↵
- 18.↵
- 19.↵
- 20.↵
- 21.↵
- 22.↵
- 23.↵
- 24.↵
- 25.↵
- 26.↵
- 27.↵
- 28.↵
- 29.↵
- 30.↵
- 31.↵
- 32.↵
- 33.↵
- 34.↵
- 35.↵
- 36.↵
- 37.↵
- 38.↵
- 39.↵
- 40.↵
- 41.↵
- 42.↵
- 43.↵
- 44.↵
- 45.↵
- 46.↵
- 47.↵
- 48.↵
- 49.↵
- 50.↵
- 51.↵
- 52.↵
- 53.↵
- 54.
- 55.↵
- 56.↵
- 57.↵
- 58.↵
- 59.↵
- 60.↵
- 61.↵
- 62.↵
- 63.↵
- 64.↵
- 65.↵
- 66.↵
- 67.↵
- 68.↵
- 69.↵
- 70.↵
- 71.↵
- 72.↵
- 73.↵
- 74.↵
- 75.↵
- 76.↵
- 77.↵
- 78.↵
- 79.↵
- 80.↵
- 81.↵
- 82.↵
- 83.↵
- 84.↵
- 85.↵
- 86.↵
- 87.↵
- 88.↵
- 89.↵