RyanNavillus / reward-surfacesLinks
☆17Updated last year
Alternatives and similar repositories for reward-surfaces
Users that are interested in reward-surfaces are comparing it to the libraries listed below
Sorting:
- ☆17Updated last year
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆23Updated last year
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆24Updated 2 years ago
- EARL: Environment for Autonomous Reinforcement Learning☆37Updated 2 years ago
- ☆23Updated 2 years ago
- ☆43Updated 2 years ago
- ☆47Updated 6 months ago
- ☆47Updated 2 years ago
- ☆24Updated last year
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆34Updated 2 years ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆84Updated 6 months ago
- ☆15Updated 2 years ago
- ☆28Updated last year
- Skeleton for scalable and flexible Jax RL implementations☆82Updated last year
- Conservative Q learning in Jax☆54Updated 2 years ago
- Lipschitz-constrained Unsupervised Skill Discovery (ICLR 2022)☆37Updated 2 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆26Updated last year
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆101Updated last year
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"☆12Updated 3 years ago
- Official release of CompoSuite, a compositional RL benchmark☆47Updated last year
- ☆17Updated 3 years ago
- Official Codebase for Offline Reinforcement Learning from Images with Latent Space Models☆30Updated 4 years ago
- ☆24Updated 11 months ago
- A Dual-RL method DVL: Dual-V Learning for offline and online reinforcement learning☆11Updated last year
- ☆15Updated last year
- Code for "Hierarchical Skills for Efficient Exploration" HSD-3 Algorithm and Baselines☆50Updated 3 years ago
- My Body Is A Cage☆41Updated 4 years ago
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)☆25Updated 3 years ago
- A version of the DeepMind Control Suite with randomly generated graphics, for measuring visual generalization in continuous control.☆18Updated 4 years ago
- ☆15Updated 2 years ago