RyanNavillus / reward-surfaces
☆15Updated 10 months ago
Alternatives and similar repositories for reward-surfaces:
Users that are interested in reward-surfaces are comparing it to the libraries listed below
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆22Updated last year
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆23Updated 10 months ago
- ☆23Updated 2 years ago
- ☆15Updated last year
- Official release of CompoSuite, a compositional RL benchmark☆47Updated last year
- ☆39Updated 3 months ago
- EARL: Environment for Autonomous Reinforcement Learning☆36Updated 2 years ago
- OpenAI gym environments for goal-conditioned and language-conditioned reinforcement learning☆13Updated 2 years ago
- ☆23Updated last year
- Source code for the paper "Policy Architectures for Compositional Generalization in Control"☆30Updated 2 years ago
- ☆17Updated 11 months ago
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023)☆23Updated last year
- A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation (ICLR2023)☆13Updated 2 years ago
- ☆48Updated 2 years ago
- ☆23Updated 8 months ago
- ☆35Updated 2 years ago
- Lipschitz-constrained Unsupervised Skill Discovery (ICLR 2022)☆34Updated last year
- Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)☆14Updated 9 months ago
- This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Mult…☆29Updated 2 years ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆19Updated last year
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)☆25Updated 3 years ago
- ☆41Updated last year
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆28Updated 6 months ago
- A Dual-RL method DVL: Dual-V Learning for offline and online reinforcement learning☆10Updated last year
- Conservative Q learning in Jax☆52Updated 2 years ago
- Skeleton for scalable and flexible Jax RL implementations☆73Updated last year
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆34Updated 2 years ago
- Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas.☆12Updated 7 months ago
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…☆44Updated last year
- ☆10Updated 8 months ago