PT - JOURNAL ARTICLE AU - Ilker Yildirim AU - Mario Belledonne AU - Winrich Freiwald AU - Joshua Tenenbaum TI - Efficient inverse graphics in biological face processing AID - 10.1101/282798 DP - 2019 Jan 01 TA - bioRxiv PG - 282798 4099 - http://biorxiv.org/content/early/2019/02/13/282798.short 4100 - http://biorxiv.org/content/early/2019/02/13/282798.full AB - Vision must not only recognize and localize objects, but perform richer inferences about the underlying causes in the world that give rise to sensory data. How the brain performs these inferences remains unknown: Theoretical proposals based on inverting generative models (or “analysis-by-synthesis”) have a long history but their mechanistic implementations have typically been too slow to support online perception, and their mapping to neural circuits is unclear. Here we present a neurally plausible model for efficiently inverting generative models of images and test it as an account of one high-level visual capacity, the perception of faces. The model is based on a deep neural network that learns to invert a three-dimensional (3D) face graphics program in a single fast feedforward pass. It explains both human behavioral data and multiple levels of neural processing in non-human primates, as well as a classic illusion, the “hollow face” effect. The model fits qualitatively better than state-of-the-art computer vision models, and suggests an interpretable reverse-engineering account of how images are transformed into percepts in the ventral stream.