Google Scholar

User profiles for Joel Z. Leibo

Joel Z Leibo

Research scientist

Verified email at google.com

Cited by 12891

[PDF] arxiv.org

Multi-agent reinforcement learning in sequential social dilemmas

JZ Leibo, V Zambaldi, M Lanctot, J Marecki… - arXiv preprint arXiv …, 2017 - arxiv.org

Matrix games like Prisoner's Dilemma have guided research on social dilemmas for decades.
However, they necessarily treat the choice to cooperate or defect as an atomic action. In …

Save Cite Cited by 894 Related articles All 14 versions View as HTML

[PDF] arxiv.org

Reinforcement learning with unsupervised auxiliary tasks

…, V Mnih, WM Czarnecki, T Schaul, JZ Leibo… - arXiv preprint arXiv …, 2016 - arxiv.org

Deep reinforcement learning agents have achieved state-of-the-art results by directly
maximising cumulative reward. However, environments contain a much wider variety of possible …

Save Cite Cited by 1349 Related articles All 7 versions View as HTML

[PDF] arxiv.org

Human-level performance in 3D multiplayer games with population-based reinforcement learning

…, N Sonnerat, T Green, L Deason, JZ Leibo… - Science, 2019 - science.org

Reinforcement learning (RL) has shown great success in increasingly complex single-agent
environments and two-player turn-based games. However, the real world contains multiple …

Save Cite Cited by 907 Related articles All 13 versions

[PDF] biorxiv.org

Prefrontal cortex as a meta-reinforcement learning system

…, D Kumaran, D Tirumala, H Soyer, JZ Leibo… - Nature …, 2018 - nature.com

Over the past 20 years, neuroscience research on reward-based learning has converged
on a canonical model, under which the neurotransmitter dopamine ‘stamps in’associations …

Save Cite Cited by 605 Related articles All 12 versions

[PDF] arxiv.org

Value-decomposition networks for cooperative multi-agent learning

…, M Jaderberg, M Lanctot, N Sonnerat, JZ Leibo… - arXiv preprint arXiv …, 2017 - arxiv.org

We study the problem of cooperative multi-agent reinforcement learning with a single joint
reward signal. This class of learning problems is difficult because of the often large combined …

Save Cite Cited by 1573 Related articles All 10 versions View as HTML

[PDF] arxiv.org

Learning to reinforcement learn

…, Z Kurth-Nelson, D Tirumala, H Soyer, JZ Leibo… - arXiv preprint arXiv …, 2016 - arxiv.org

In recent years deep reinforcement learning (RL) systems have attained superhuman
performance in a number of challenging task domains. However, a major limitation of such …

Save Cite Cited by 996 Related articles All 8 versions View as HTML

[PDF] arxiv.org

Deepmind lab

C Beattie, JZ Leibo, D Teplyashin, T Ward… - arXiv preprint arXiv …, 2016 - arxiv.org

DeepMind Lab is a first-person 3D game platform designed for research and development of
general artificial intelligence and machine learning systems. DeepMind Lab can be used to …

Save Cite Cited by 585 Related articles All 4 versions View as HTML

[PDF] mlr.press

Social influence as intrinsic motivation for multi-agent deep reinforcement learning

…, P Ortega, DJ Strouse, JZ Leibo… - International …, 2019 - proceedings.mlr.press

We propose a unified mechanism for achieving coordination and communication in Multi-Agent
Reinforcement Learning (MARL), through rewarding agents for having causal influence …

Save Cite Cited by 454 Related articles All 6 versions View as HTML

[PDF] arxiv.org

Open problems in cooperative ai

…, Y Bachrach, T Collins, KR McKee, JZ Leibo… - arXiv preprint arXiv …, 2020 - arxiv.org

Problems of cooperation--in which agents seek ways to jointly improve their welfare--are
ubiquitous and important. They can be found at scales ranging from our daily routines--such as …

Save Cite Cited by 184 Related articles All 2 versions View as HTML

[PDF] physiology.org Full View

The dynamics of invariant object recognition in the human visual system

L Isik, EM Meyers, JZ Leibo… - Journal of …, 2014 - journals.physiology.org

The human visual system can rapidly recognize objects despite transformations that alter
their appearance. The precise timing of when the brain computes neural representations that …

Save Cite Cited by 267 Related articles All 12 versions

Create alert

Cite

Advanced search

Saved to My library

User profiles for Joel Z. Leibo

Joel Z Leibo

Multi-agent reinforcement learning in sequential social dilemmas

Reinforcement learning with unsupervised auxiliary tasks

Human-level performance in 3D multiplayer games with population-based reinforcement learning

Prefrontal cortex as a meta-reinforcement learning system

Value-decomposition networks for cooperative multi-agent learning

Learning to reinforcement learn

Deepmind lab

Social influence as intrinsic motivation for multi-agent deep reinforcement learning

Open problems in cooperative ai

The dynamics of invariant object recognition in the human visual system