facebookresearch / e3b
Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses".
★79 · Updated 7 months ago
Related projects
Alternatives and complementary repositories for e3b
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ★105 · Updated 2 months ago
- Baselines for gymnax 🚀 ★58 · Updated last year
- ★61 · Updated 2 months ago
- Simple single-file baselines for Q-Learning in pure-GPU setting ★93 · Updated 3 months ago
- Discovering and Achieving Goals via World Models, NeurIPS 2021 ★83 · Updated 9 months ago
- Deep Hierarchical Planning from Pixels ★90 · Updated last year
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations" ★52 · Updated 7 months ago
- [NeurIPS 2022] Open source code for reusing prior computational work in RL. ★91 · Updated last year
- Evaluating long-term memory of reinforcement learning algorithms ★132 · Updated last year
- General Modules for JAX ★58 · Updated 3 months ago
- Challenging Memory-based Deep Reinforcement Learning Agents ★84 · Updated last week
- DEIR: Efficient and Robust Exploration through Discriminative-Model-Based Episodic Intrinsic Rewards ★18 · Updated 6 months ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective ★79 · Updated last year
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL. ★66 · Updated 11 months ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery ★78 · Updated 2 years ago
- ★28 · Updated 2 years ago
- Object Centric Atari games ★48 · Updated this week
- Contains JAX implementation of algorithms for inverse reinforcement learning ★62 · Updated 2 months ago
- The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning… ★59 · Updated last year
- JAX reimplementation of the DeepMind paper "Genie: Generative Interactive Environments" ★32 · Updated last week
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ★49 · Updated 2 years ago
- ★66 · Updated 10 months ago
- Accelerated replay buffers in JAX ★39 · Updated 2 years ago
- Efficient baselines for autocurricula in JAX. ★172 · Updated 2 months ago
- Vectorization techniques for fast population-based training. ★54 · Updated 2 years ago
- Codebase for the solution that won first place and was awarded the most human-like agent in the 2021 NeurIPS Competition MineRL BASALT Ch… ★51 · Updated 2 years ago
- Synchronized Curriculum Learning for RL Agents ★21 · Updated last week
- Proto-RL: Reinforcement Learning with Prototypical Representations ★82 · Updated 2 years ago
- impact-driven-exploration ★126 · Updated last year
- ★64 · Updated this week