User profiles for Joel Z. Leibo
Joel Z LeiboResearch scientist Verified email at google.com Cited by 12891 |
Multi-agent reinforcement learning in sequential social dilemmas
Matrix games like Prisoner's Dilemma have guided research on social dilemmas for decades.
However, they necessarily treat the choice to cooperate or defect as an atomic action. In …
However, they necessarily treat the choice to cooperate or defect as an atomic action. In …
Reinforcement learning with unsupervised auxiliary tasks
Deep reinforcement learning agents have achieved state-of-the-art results by directly
maximising cumulative reward. However, environments contain a much wider variety of possible …
maximising cumulative reward. However, environments contain a much wider variety of possible …
Human-level performance in 3D multiplayer games with population-based reinforcement learning
Reinforcement learning (RL) has shown great success in increasingly complex single-agent
environments and two-player turn-based games. However, the real world contains multiple …
environments and two-player turn-based games. However, the real world contains multiple …
Prefrontal cortex as a meta-reinforcement learning system
Over the past 20 years, neuroscience research on reward-based learning has converged
on a canonical model, under which the neurotransmitter dopamine ‘stamps in’associations …
on a canonical model, under which the neurotransmitter dopamine ‘stamps in’associations …
Value-decomposition networks for cooperative multi-agent learning
We study the problem of cooperative multi-agent reinforcement learning with a single joint
reward signal. This class of learning problems is difficult because of the often large combined …
reward signal. This class of learning problems is difficult because of the often large combined …
Learning to reinforcement learn
In recent years deep reinforcement learning (RL) systems have attained superhuman
performance in a number of challenging task domains. However, a major limitation of such …
performance in a number of challenging task domains. However, a major limitation of such …
Deepmind lab
DeepMind Lab is a first-person 3D game platform designed for research and development of
general artificial intelligence and machine learning systems. DeepMind Lab can be used to …
general artificial intelligence and machine learning systems. DeepMind Lab can be used to …
Social influence as intrinsic motivation for multi-agent deep reinforcement learning
We propose a unified mechanism for achieving coordination and communication in Multi-Agent
Reinforcement Learning (MARL), through rewarding agents for having causal influence …
Reinforcement Learning (MARL), through rewarding agents for having causal influence …
Open problems in cooperative ai
Problems of cooperation--in which agents seek ways to jointly improve their welfare--are
ubiquitous and important. They can be found at scales ranging from our daily routines--such as …
ubiquitous and important. They can be found at scales ranging from our daily routines--such as …
The dynamics of invariant object recognition in the human visual system
The human visual system can rapidly recognize objects despite transformations that alter
their appearance. The precise timing of when the brain computes neural representations that …
their appearance. The precise timing of when the brain computes neural representations that …