Eﬃcient coding is consistent with the irregular shapes of retinal ganglion cell receptive ﬁelds E Doi 1 , J Gauthier 3 , GD Field 3 , A Sher 4 , J Shlens 3 , M Greschner 3 , K Mathieson 5 , D Gunning 5 , AM Litke 4 , L Paninski 6 , EJ Chichilnisky 3 , EP Simoncelli 1,2 1 Center for Neural Science & 2 Howard Hughes Medical Institute, New York University, 3 The Salk Institute, 4 Santa Cruz Institute for Particle Physics, 5 University of Glasgow, 6 Center for Theoretical Neuroscience, Columbia University Physiology Theory Diﬀerence ON-Parasol OFF-Parasol ON-midget OFF-midget Simultaneous recording Error A single matrix #33 14.5% #46 7.4% #56 16.3% 7.1% #60 12.7% #27 12.5% #49 8.2% #50 12.7% #67 #10 27.2% #12 8.4% #13 8.1% #14 11.1% #5 #8 17.2% 25.4% #11 #12 16.0% 19.5% 0 + − Background Eﬃcient coding is a fundamental principle underlying sensory coding. [Barlow 61] Earlier studies: [Linsker 89, Atick & Redlich 90, van Hateren 92, Ruderman 94, Doi & Lewicki 07] • Assume Gaussian and linear model (Fig. a) • Incorporate natural image statistics (b) • Seek the optimal receptive ﬁeld (RF) shape (= a convolution kernel) • Results: - Concentric center-surround RF organization (c) - Transition from low-pass to band-pass ﬁltering with the mean light level (d) Direct comparison with physiology has been hampered by simplifying assumptions. • Cone : retinal ganglion cell (RGC) ratio is not 1 : 1 (Fig. e) • Cone sampling lattice is not regular (f ) • Individual RF shapes are not identical nor circularly symmetric (g) • Roughly 20 RGC types with varying RF size x W u δ r � noisy signal signal linear transform optical blur representation noisy representation channel noise � sensory noise ν s H Amplitude Spatial frequency Data Average a b c d e a b c d e Model Comparison HΣ s H T = EΛE T W = PΩQ T We seek an M×N matrix W that maximizes mutual information between signal and representation: Subject to a constraint on the total power of neural responses: P tr[ W(HΣ s H T + σ 2 ν I N )W T ] ≤P 0 Using SVD, , and we proved that the solution is given by The optimal W eff does the following: : the eigenvectors (PCA basis) of the blurred signal covariance : the diagonal matrix whose entries can be found using the KKT condition : any orthogonal matrix is of blurred signal with diﬀerent photoreceptor SNR with diﬀerent neural SNR is unconstrained with diﬀerent population size 20 dB 10 dB 0 dB −10 dB Modulation Eigenvector index 1 2 4 0 10 100 1 10 100 1 10 100 20 dB 10 dB 0 dB 10× 1× 0.1× Evaluate how much the measured RF population is consistent with the theory I (s; r )= 1 2 log |WHΣ s H T W T + σ 2 ν WW T + σ 2 δ I M | |σ 2 ν WW T + σ 2 δ I M | How to compare with physiology? • Select an orthogonal matrix P fit that minimizes the distance between W eff and W mea • Solution manifold W eff ( P ) Random manifold W rnd( P ) Measured RFs W mea E Pixelated image Cone mosaic RGCs N=69 N=5 N=6 N=26 N=32 Theory Human psychophysics Eﬃcient coding is in a good agreement with the population consisting of ON/OFF midget/parasol cells. Appendix C Fitting errors for diﬀerent subsets of a RGC population Appendix A Finding subject to where L is the number of RGCs in the region-of-interest. • Note that compensates for scale ambiguity between W eff and W mea. • If L = M, we can ﬁnd the solution in closed form. • Otherwise, we can employ numerical optimization. Appendix B How much of the data variance explained by the model is due to eﬃcient coding? This suggests that ~50% of the variance explained is due to the eﬃcient coding solution, not the ﬁtting with P . • 665 cones and 145 RGCs = 4.6 cones/RGC • 27 degrees of retinal eccentricity • Cone-weights W are visualized with the optical blur of the human eye • We show the contour of RFs at 0.3 × peak height • 62 natural images [Doi et al. 03] • 6.2 × 10 6 data points to calculate the data covariance • Human’s optical blur at 30 deg. of eccentricity [Navarro et al. 93] • 10 dB photoreceptor SNR • 10 dB neural SNR [Borst & Theunissen 99] Fovea 40 deg 0 1 2 RGC : Cone ratio We generalize the model to resolve these limitations and compare it with physiology. References K. M. Ahmad, K. Klug, S. Herr, P. Sterling, and S. Schein. Cell density ratios in a foveal patch in macaque retina. Visual Neuroscience, 20:189–209, 2003. J. J. Atick and A. N. Redlich. Towards a theory of early visual processing. Neural Computation, 2:308–320, 1990. J. J. Atick and A. N. Redlich. What does the retina know about natural scenes? Neural Computation, 4:196–210, 1992. H. B. Barlow. Possible principles underlying the transformation of sensory messages. In W. A. Rosenblith, editor, Sensory communication, pages 217–234. MIT Press, MA, 1961. A. Borst and F. E. Theunissen. Information theory and neural coding. Nature Neurosci., 2:947–957, 1999. C. A. Curcio and K. A. Allen. Topography of ganglion cells in human retina. Journal of Comparative Neurology, 300:5–25, 1990. C. A. Curcio, K. A. Allen, K. R. Sloan, C. L. Lerea, J. B. Hurley, I. B. Klock, and M. A. H. Distribution and morphology of human cone photoreceptors stained with anti-blue opsin. J. Comp. Neurol., 312:610–624, 1991. E. Doi, T. Inui, T.-W. Lee, T. Wachtler, and T. J. Sejnowski. Spatiochromatic receptive ﬁeld properties derived from information-theoretic analyses of cone mosaic responses to natural scenes. Neural Computation, 15:397–417, 2003. E. Doi and M. S. Lewicki. A theory of retinal population coding. In Advances in NIPS 19 (NIPS*2006), pp. 353–360. MIT Press, 2007. J. Gauthier, G. D. Field, A. Sher, M. Greschner, J. Shlens, A. M. Litke, & E. J. Chichilnisky. Fine-scale functional coordination enhances population encoding in large neural circuits. submitted. J. Gauthier, G. D. Field, A. Sher, M. Greschner, J. Shlens, D. Gunning, K. Mathieson, A. M. Litke, & E. J. Chichilnisky. Functional identiﬁcation of individual cones in the receptive ﬁelds of primate retinal ganglion cells. SfN2008 Poster 365.5/HH20, 2008. J. H. van Hateren. A theory of maximizing sensory information. Biol. Cybern. 68:23–29, 1992. R. Linsker. An application of the principle of maximum information preservation to linear systems. In Advances in NIPS 1 (NIPS*1988), pp. 186–194. MIT Press, 1989. R. Navarro, P. Artal, and D. R. Williams. Modulation transfer of the human eye as a function of retinal eccentricity. J. Opt. Soc. Am. A, 10:201–212, 1993. A. Roorda and D. R. Williams. The arrangement of the three cone classes in the living human eye. Nature, 397:520–522, 1999. D. L. Ruderman. Designing receptive ﬁelds for highest ﬁdelity. Network, 5:147–155, 1994. = 34.32% (13.44% in the blurred plot) Fig. (a) (b) (c) (d) (e) (f) (g) Future directions: • Filling-in missing RGCs • Trichromatic RFs and cone-type speciﬁcity • More RGC types Limiting factor of the current method: • Boundary of the cone mosaic • Noise model and measurement • Adaptation • Temporal RFs • RGC nonlinearity • Non-Gaussian image statistics • Rod system PP T = I M α • How much does the solution constrain the result? • Compare with a random , which deﬁnes , and ﬁnd . • Error = 83.44 ± 0.11% with diﬀerent (compare to 34.32% with ) ΩQ T Individual cell-type ON-parasol OFF-parasol parasol midget ON OFF ON-midget OFF-midget All parasol or midget All ON or OFF 44.37% 48.57% 42.56% 38.02% 52.44% 47.57% 50.04% 45.43% Eﬃcient coding solution is closest to the RFs of a whole population. This suggest that the simplistic view of ON and OFF types as the mirror of each other should be too simplistic (see “All ON or OFF”), and that an alternative view in which those four types of RGCs jointly constitute a single information channel is a better description. Q eﬀ = E Ω eﬀ P Ω eﬀ Ω eﬀ Ω eﬀ {α t , P ﬁt } = arg min α,P �W mea − αI L×M P eﬀ Q T eﬀ � 2 F P ﬁt Ω eﬀ Q T eﬀ P ﬁt �W m a − W ﬀ (P ﬁt )� 2 F �W mea � 2 F Acknowledgements: This work is supported by NIH #EY018003 (EJC, LP, EPS) and HHMI (EPS) W eﬀ = � representation with M -D orthogonal basis � ◦ � modulating x in the eigenspace � ◦ � representing x in the eigenspace � W (P ﬁt ) All cell-types (total error 34.32%) ON-parasol OFF-parasol ON-midget OFF-midget 45.64% 28.78% 33.05% 35.73% [Atick & Redlich 90] [Atick & Redlich 92] [Curcio & Allen 90, Ahmad et al. 03] See also Gauthier et al. SfN08 [Curcio et al. 91] [Roorda & Williams 99] [Gauthier et al. submitted] W eff ( P fit) W rnd (P)= PΩ rnd Q T rnd Ω rnd Q T rnd Ω eﬀ Q T eﬀ Q eﬀ P