abaisero / asym-rlpo
Asymmetric methods for partially observable reinforcement learning
☆8Updated 4 months ago
Related projects: ⓘ
- Implementation of Tactical Optimistic and Pessimistic value estimation☆24Updated last year
- Conservative Q learning in Jax☆49Updated last year
- A version of the DeepMind Control Suite with randomly generated graphics, for measuring visual generalization in continuous control.☆16Updated 3 years ago
- Learning Laplacian Representations in Reinforcement Learning☆17Updated 3 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆100Updated 2 years ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆16Updated 8 months ago
- ☆12Updated 3 years ago
- ☆53Updated 6 months ago
- ☆41Updated last year
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆44Updated 3 years ago
- ☆18Updated 7 months ago
- Simple maze environments using mujoco-py☆52Updated 8 months ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆76Updated last year
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆96Updated 2 years ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆53Updated 3 months ago
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆19Updated 5 months ago
- ☆29Updated 3 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆28Updated 4 years ago
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization☆14Updated 5 months ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆83Updated 3 years ago
- ☆39Updated 3 years ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆87Updated 3 months ago
- ☆46Updated last year
- Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020]☆47Updated 2 years ago
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning☆28Updated 2 years ago
- Code for "Hierarchical Skills for Efficient Exploration" HSD-3 Algorithm and Baselines☆49Updated 2 years ago
- Official Codebase for Offline Reinforcement Learning from Images with Latent Space Models☆28Updated 3 years ago
- ☆51Updated last year
- ☆17Updated 2 years ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆71Updated 9 months ago