psc's website

In Defense of Atari - the ALE is not 'solved'!

This post is based on a talk I gave at the AutoRL workshop at ICML 2024, which unfortunately was not recorded.

Introduction

Reinforcement Learning (RL) has been used successfully in a number of challenging tasks, such as beating world champions at Go, controlling tokamak plasmas for nuclear fusion, optimizing chip placement, and controlling stratospheric balloons. All these successes have leveraged years of research and expertise and, importantly, rely on the combination of RL algorithms with deep neural networks (as proposed in the seminal DQN paper).

December 2, 2024

From "Bigger, Better, Faster" to "Smaller, Sparser, Stranger"

This is a post based on a talk I gave a few times in 2023. I had been meaning to put it in blog post form for over a year but kept putting it off… I guess better late than never. I think some of the ideas still hold, so I hope some of you find it useful!

Bigger, better, faster

In the seminal DQN paper, Mnih et al. demonstrated that reinforcement learning, when combined with neural networks as function approximators, could learn to play Atari 2600 games at superhuman levels. The DQN agent learned to do this over 200 million environment frames, which is roughly equivalent to 1000 hours of human gameplay…
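As a quick sanity check on the "roughly 1000 hours" figure, here is a minimal sketch of the arithmetic. It assumes the emulated Atari 2600 runs at 60 frames per second, which is my assumption and is not stated above; the constant names are just for illustration.

```python
# Back-of-the-envelope conversion from environment frames to hours of play.
# ASSUMPTION: 60 frames per second (not stated in the post).
FRAMES = 200_000_000          # environment frames used to train DQN
FRAMES_PER_SECOND = 60        # assumed frame rate
SECONDS_PER_HOUR = 3600

hours = FRAMES / FRAMES_PER_SECOND / SECONDS_PER_HOUR
print(f"{hours:.0f} hours of gameplay")
```

Under that assumption the result lands a little under 1000 hours, which is consistent with the rough figure quoted above.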

November 27, 2024

The Dormant Neuron Phenomenon in Deep Reinforcement Learning

We identify the dormant neuron phenomenon in deep reinforcement learning, where an agent’s network suffers from an increasing number of inactive neurons, thereby affecting network expressivity.

Ghada Sokar, Rishabh Agarwal, Pablo Samuel Castro*, Utku Evci*

This blogpost is a summary of our ICML 2023 paper. The code is available here. Many more results and analyses are available in the paper, so I encourage you to check it out if interested!
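To make "inactive neurons" a bit more concrete, here is a minimal sketch of one way to count them. It assumes a neuron counts as dormant when its mean absolute activation, divided by the average over its layer, is at or below a threshold `tau`. The function name `dormant_fraction`, its arguments, and the toy data are hypothetical; the paper has the exact definition and the real code.

```python
def dormant_fraction(activations, tau=0.0):
    """Fraction of neurons in one layer that look dormant.

    activations: list of rows, one row per input, one value per neuron.
    tau: threshold on the normalized score (an illustrative default).
    """
    n_inputs = len(activations)
    n_neurons = len(activations[0])

    # Mean absolute activation of each neuron over the batch of inputs.
    mean_abs = [
        sum(abs(row[j]) for row in activations) / n_inputs
        for j in range(n_neurons)
    ]

    # Normalize by the layer-wide average so scores are comparable across layers.
    layer_mean = sum(mean_abs) / n_neurons
    if layer_mean == 0:
        return 1.0  # every neuron is silent on these inputs

    scores = [m / layer_mean for m in mean_abs]
    return sum(s <= tau for s in scores) / n_neurons


# Toy layer with 4 neurons evaluated on 2 inputs; neurons 0 and 3 never fire.
toy = [[0.0, 1.0, 2.0, 0.0],
       [0.0, 3.0, 0.0, 0.0]]
print(dormant_fraction(toy, tau=0.0))
```

Tracking this fraction over the course of training is one way to see whether the number of inactive neurons grows.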

The following figure gives a nice summary of the overall findings of our work (we are reporting the Interquartile Mean (IQM) as introduced in our Statistical Precipice NeurIPS'21 paper):
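For readers unfamiliar with the IQM: it is the mean of the middle 50% of scores, obtained by discarding the lowest and highest 25%. Below is a minimal sketch on a flat list of scores. The function name `iqm` and the example numbers are made up for illustration, and the real evaluation aggregates normalized scores across runs and games rather than a single list.

```python
def iqm(scores):
    """Interquartile mean: average of the middle 50% of the sorted scores."""
    ordered = sorted(scores)
    n = len(ordered)
    cut = int(0.25 * n)               # number of values dropped from each end
    middle = ordered[cut:n - cut]
    return sum(middle) / len(middle)


# Made-up scores; the second list has one extreme outlier.
print(iqm([1, 2, 3, 4, 5, 6, 7, 8]))
print(iqm([1, 2, 3, 4, 5, 6, 7, 1000]))
```

Both calls give the same value, which illustrates why the IQM is less sensitive to outlier runs than the plain mean.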

June 19, 2023

Hola, I'm psc

Pablo Samuel Castro

Señor Swesearcher at Google

Selected Publications

CALE - Continuous Arcade Learning Environment

On the consistency of hyper-parameter selection in value-based deep reinforcement learning

Mixture of Experts in a Mixture of RL settings

In value-based deep reinforcement learning, a pruned network is a good network

Mixtures of experts unlock parameter scaling for deep RL

Stop Regressing - Training Value Functions via Classification for Scalable Deep RL

Small batch deep reinforcement learning

Minigrid & miniworld - Modular & customizable reinforcement learning environments for goal-oriented tasks

A Kernel Perspective on Behavioural Metrics for Markov Decision Processes

Bigger, Better, Faster, Human-level Atari with human-level efficiency

The dormant neuron phenomenon in deep reinforcement learning

Proto-Value Networks, Scaling Representation Learning with Auxiliary Tasks

Reincarnating Reinforcement Learning, Reusing Prior Computation to Accelerate Progress

The State of Sparse Training in Deep Reinforcement Learning

A general class of surrogate functions for stable and efficient reinforcement learning

MICo, Learning improved representations via sampling-based state similarity for Markov decision processes

The Difficulty of Passive Learning in Deep Reinforcement Learning

Deep Reinforcement Learning at the Edge of the Statistical Precipice

Losses, Dissonances, and Distortions

Revisiting Rainbow, Promoting more insightful and inclusive deep reinforcement learning research

Metrics and continuity in reinforcement learning

Contrastive Behavioral Similarity Embeddings for Generalization in Reinforcement Learning

Autonomous navigation of stratospheric balloons using reinforcement learning

Estimating Policy Functions in Payment Systems using Reinforcement Learning

Dopamine, A Research Framework for Deep Reinforcement Learning

GANterpretations

Rigging the lottery, Making all tickets winners

Scalable methods for computing state similarity in deterministic MDPs

A Geometric Perspective on Optimal Representations for Reinforcement Learning

A comparative analysis of expected and distributional reinforcement learning

ML-Jam, Performing Structured Improvisations with Pre-trained Models

Shaping the Narrative Arc, Information-Theoretic Collaborative Dialogue

An Atari Model Zoo for Analyzing, Visualizing, and Comparing Deep Reinforcement Learning Agents

Distributional reinforcement learning with linear function approximation

Inverse reinforcement learning with multiple ranked experts