facebookresearch / e3bLinks
Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses".
☆87Updated last year
Alternatives and similar repositories for e3b
Users that are interested in e3b are comparing it to the libraries listed below
Sorting:
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆120Updated last year
- [NeurIPS 2022] Open source code for reusing prior computational work in RL.☆99Updated 2 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆82Updated 2 years ago
- impact-driven-exploration☆132Updated 2 years ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery☆83Updated 3 years ago
- Discovering and Achieving Goals via World Models, NeurIPS 2021☆86Updated last year
- ☆101Updated last year
- Reproduction of Dreamerv1 and v2 in pytorch for deepmind control suite☆47Updated 3 years ago
- ☆54Updated 2 years ago
- Deep Hierarchical Planning from Pixels☆112Updated 3 years ago
- ☆46Updated last year
- ☆28Updated 3 years ago
- Challenging Memory-based Deep Reinforcement Learning Agents☆109Updated last year
- Fast reinforcement learning research☆61Updated last year
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.☆76Updated 2 years ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆74Updated last year
- Evaluating long-term memory of reinforcement learning algorithms☆161Updated 2 years ago
- Proto-RL: Reinforcement Learning with Prototypical Representations☆85Updated 3 years ago
- General Modules for JAX☆72Updated 4 months ago
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.☆138Updated last year
- Baselines for gymnax 🤖☆74Updated 2 years ago
- The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning…☆70Updated 2 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆59Updated 3 years ago
- Docker containers of baseline agents for the Crafter environment☆30Updated 4 years ago
- PAIRED in PyTorch 🔥☆64Updated 2 years ago
- Simplistic Pytorch Implementation of the Dreamer-RL☆20Updated 8 months ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆57Updated last year
- ☆19Updated 3 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆123Updated 3 years ago
- Object Centric Atari games☆96Updated last month