facebookresearch / e3b
Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses".
β82Updated last year
Alternatives and similar repositories for e3b:
Users that are interested in e3b are comparing it to the libraries listed below
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRLβ110Updated 7 months ago
- Baselines for gymnax π€β66Updated 2 years ago
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.β71Updated last year
- PAIRED in PyTorch π₯β58Updated 2 years ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm.β85Updated last week
- Deep Hierarchical Planning from Pixelsβ95Updated 2 years ago
- Evaluating long-term memory of reinforcement learning algorithmsβ141Updated last year
- impact-driven-explorationβ130Updated last year
- Proto-RL: Reinforcement Learning with Prototypical Representationsβ82Updated 2 years ago
- Simple single-file baselines for Q-Learning in pure-GPU settingβ151Updated 3 weeks ago
- β44Updated last year
- [NeurIPS 2022] Open source code for reusing prior computational work in RL.β96Updated last year
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discoveryβ80Updated 2 years ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"β53Updated last year
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according β¦β35Updated 10 months ago
- β27Updated last year
- Open source code for paper "Denoised MDPs: Learning World Models Better Than the World Itself"β136Updated last year
- General Modules for JAXβ64Updated last week
- Discovering and Achieving Goals via World Models, NeurIPS 2021β85Updated last year
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.β130Updated 7 months ago
- β41Updated 9 months ago
- Reproduction of Dreamerv1 and v2 in pytorch for deepmind control suiteβ38Updated 2 years ago
- Fast reinforcement learning researchβ59Updated 4 months ago
- Challenging Memory-based Deep Reinforcement Learning Agentsβ97Updated 5 months ago
- The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learningβ¦β61Updated last year
- Foundation Policies with Hilbert Representations (ICML 2024)β82Updated last year
- β43Updated 6 months ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weightsβ¦β54Updated 2 years ago
- β76Updated 3 weeks ago
- Vectorization techniques for fast population-based training.β55Updated 2 years ago