facebookresearch / mtenv
MultiTask Environments for Reinforcement Learning.
☆74Updated 2 years ago
Related projects: ⓘ
- Multi Task RL Baselines☆221Updated 2 years ago
- Automatic Data-Regularized Actor-Critic (Auto-DrAC)☆101Updated last year
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆83Updated 3 years ago
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)☆68Updated last year
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"☆157Updated 2 years ago
- Invariant Causal Prediction for Block MDPs☆43Updated 4 years ago
- impact-driven-exploration☆125Updated 11 months ago
- A collection of RL algorithms written in JAX.☆92Updated 2 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- Datasets for data-driven deep reinforcement learning with PyBullet environments☆143Updated 3 years ago
- Revisiting Rainbow☆73Updated 3 years ago
- ☆107Updated last year
- pytorch-implementation of Dreamer (Model-based Image RL Algorithm)☆158Updated 2 years ago
- Pytorch implementations of RL algorithms, focusing on model-based, lifelong, reset-free, and offline algorithms. Official codebase for Re…☆96Updated 2 years ago
- AGAC: Adversarially Guided Actor-Critic☆47Updated 3 years ago
- Vectorization techniques for fast population-based training.☆52Updated 2 years ago
- Convert DeepMind Control Suite to OpenAI gym environments.☆83Updated 4 years ago
- ☆85Updated last month
- Code for "Learning to Reach Goals via Iterated Supervised Learning"☆76Updated 2 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆49Updated 2 years ago
- Decoupled Reward-free ExplorAtion and Execution for Meta-reinforcement learning☆89Updated last year
- Tensorflow 2 source code for the PI-SAC agent from "Predictive Information Accelerates Learning in RL" (NeurIPS 2020)☆40Updated last year
- OpenAI Gym wrapper for the DeepMind Control Suite☆200Updated 4 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆102Updated 3 weeks ago
- Reinforcement Learning with Latent Flow☆42Updated 3 years ago
- Continual Reinforcement Learning in 3D Non-stationary Environments☆35Updated 5 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆96Updated 2 years ago
- SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning☆119Updated 3 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆100Updated 2 years ago
- Code for 'Dynamics-Aware Unsupervised Discovery of Skills' (DADS). Enables skill discovery without supervision, which can be combined wit…☆186Updated 3 years ago