User profiles for Joel Z. Leibo

Joel Z Leibo

Research scientist
Verified email at google.com
Cited by 12891

Multi-agent reinforcement learning in sequential social dilemmas

JZ Leibo, V Zambaldi, M Lanctot, J Marecki… - arXiv preprint arXiv …, 2017 - arxiv.org
Matrix games like Prisoner's Dilemma have guided research on social dilemmas for decades.
However, they necessarily treat the choice to cooperate or defect as an atomic action. In …

Reinforcement learning with unsupervised auxiliary tasks

…, V Mnih, WM Czarnecki, T Schaul, JZ Leibo… - arXiv preprint arXiv …, 2016 - arxiv.org
Deep reinforcement learning agents have achieved state-of-the-art results by directly
maximising cumulative reward. However, environments contain a much wider variety of possible …

Human-level performance in 3D multiplayer games with population-based reinforcement learning

…, N Sonnerat, T Green, L Deason, JZ Leibo… - Science, 2019 - science.org
Reinforcement learning (RL) has shown great success in increasingly complex single-agent
environments and two-player turn-based games. However, the real world contains multiple …

Prefrontal cortex as a meta-reinforcement learning system

…, D Kumaran, D Tirumala, H Soyer, JZ Leibo… - Nature …, 2018 - nature.com
Over the past 20 years, neuroscience research on reward-based learning has converged
on a canonical model, under which the neurotransmitter dopamine ‘stamps in’associations …

Value-decomposition networks for cooperative multi-agent learning

…, M Jaderberg, M Lanctot, N Sonnerat, JZ Leibo… - arXiv preprint arXiv …, 2017 - arxiv.org
We study the problem of cooperative multi-agent reinforcement learning with a single joint
reward signal. This class of learning problems is difficult because of the often large combined …

Learning to reinforcement learn

…, Z Kurth-Nelson, D Tirumala, H Soyer, JZ Leibo… - arXiv preprint arXiv …, 2016 - arxiv.org
In recent years deep reinforcement learning (RL) systems have attained superhuman
performance in a number of challenging task domains. However, a major limitation of such …

Deepmind lab

C Beattie, JZ Leibo, D Teplyashin, T Ward… - arXiv preprint arXiv …, 2016 - arxiv.org
DeepMind Lab is a first-person 3D game platform designed for research and development of
general artificial intelligence and machine learning systems. DeepMind Lab can be used to …

Social influence as intrinsic motivation for multi-agent deep reinforcement learning

…, P Ortega, DJ Strouse, JZ Leibo… - International …, 2019 - proceedings.mlr.press
We propose a unified mechanism for achieving coordination and communication in Multi-Agent
Reinforcement Learning (MARL), through rewarding agents for having causal influence …

Open problems in cooperative ai

…, Y Bachrach, T Collins, KR McKee, JZ Leibo… - arXiv preprint arXiv …, 2020 - arxiv.org
Problems of cooperation--in which agents seek ways to jointly improve their welfare--are
ubiquitous and important. They can be found at scales ranging from our daily routines--such as …

The dynamics of invariant object recognition in the human visual system

L Isik, EM Meyers, JZ Leibo… - Journal of …, 2014 - journals.physiology.org
The human visual system can rapidly recognize objects despite transformations that alter
their appearance. The precise timing of when the brain computes neural representations that …