evgenii-nikishin / rl_with_resets
JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"
☆99Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for rl_with_resets
- ExORL: Exploratory Data for Offline Reinforcement Learning☆105Updated 2 years ago
- ☆54Updated 8 months ago
- Simple maze environments using mujoco-py☆52Updated 10 months ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆83Updated 3 years ago
- Author's PyTorch implementation of TD7 for online and offline RL☆117Updated last year
- Evaluating long-term memory of reinforcement learning algorithms☆133Updated last year
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆55Updated 10 months ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆49Updated 2 years ago
- Deep Hierarchical Planning from Pixels☆90Updated last year
- Benchmarked implementations of Offline RL Algorithms.☆65Updated 6 months ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆79Updated last year
- Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.☆31Updated 4 years ago
- Implementation of Truncated Quantile Critics method for continuous reinforcement learning. https://bayesgroup.github.io/tqc/☆90Updated 3 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆44Updated 4 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆105Updated 3 months ago
- Code for MOPO: Model-based Offline Policy Optimization☆171Updated 2 years ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆80Updated last year
- Implementation of Trajectory Transformer with attention caching and batched beam search☆107Updated last year
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration☆67Updated 3 years ago
- Learning Invariant Representations for Reinforcement Learning without Reconstruction☆144Updated 3 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated last year
- Synthetic Experience Replay☆74Updated 5 months ago
- Skeleton for scalable and flexible Jax RL implementations☆63Updated last year
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆95Updated 5 months ago
- Conservative Q learning in Jax☆51Updated last year
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆76Updated last year
- Learning Laplacian Representations in Reinforcement Learning☆17Updated 3 years ago
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆26Updated 2 years ago
- A collection of RL algorithms written in JAX.☆95Updated 2 years ago
- ☆110Updated last year