swan-utokyo / deir
DEIR: Efficient and Robust Exploration through Discriminative-Model-Based Episodic Intrinsic Rewards
☆19 Updated 6 months ago
Related projects
Alternatives and complementary repositories for deir
- Reproduction of DreamerV1 and V2 in PyTorch for the DeepMind Control Suite ☆30 Updated last year
- Evaluating long-term memory of reinforcement learning algorithms ☆133 Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆105 Updated 2 months ago
- Deep Hierarchical Planning from Pixels ☆90 Updated last year
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery ☆78 Updated 2 years ago
- PyTorch implementation of DreamerV2: Mastering Atari with Discrete World Models, based on the original implementation ☆17 Updated 2 years ago
- ☆41 Updated 3 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning ☆105 Updated 2 years ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024) ☆58 Updated last year
- ☆36 Updated last year
- The Controllable Agent project trains RL agents able to optimize any reward function specified in real time, without any further learning… ☆59 Updated last year
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective ☆79 Updated last year
- ☆20 Updated last year
- Baselines for gymnax 🤖 ☆60 Updated last year
- Discovering and Achieving Goals via World Models, NeurIPS 2021 ☆83 Updated 9 months ago
- ☆17 Updated 4 months ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations" ☆52 Updated 7 months ago
- JAX implementations of algorithms for inverse reinforcement learning ☆63 Updated 3 months ago
- General Modules for JAX ☆58 Updated 3 months ago
- ☆65 Updated 2 weeks ago
- PyTorch version of Dreamer, which follows the original TF v2 code. ☆113 Updated 2 years ago
- Open-source code for the paper "Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning" (ICML 2023) ☆44 Updated 6 months ago
- Object Centric Atari games ☆48 Updated this week
- Implementation of Tactical Optimistic and Pessimistic value estimation ☆25 Updated last year
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆99 Updated 2 years ago
- A collection of RL algorithms written in JAX. ☆95 Updated 2 years ago
- JAX implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ☆49 Updated 2 years ago
- My Body Is A Cage ☆38 Updated 3 years ago
- [ICML 2023] Pre-train world model-based agents with different unsupervised strategies, fine-tune the agent's components selectively, and … ☆33 Updated 8 months ago