kvfrans / rlbase_stable
☆41 Updated 10 months ago
Alternatives and similar repositories for rlbase_stable
Users who are interested in rlbase_stable are comparing it to the libraries listed below.
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations" ☆57 Updated last year
- Skeleton for scalable and flexible Jax RL implementations ☆82 Updated last year
- ☆47 Updated 2 years ago
- Contains JAX implementation of algorithms for inverse reinforcement learning ☆73 Updated 9 months ago
- ☆79 Updated 2 months ago
- Unified Implementations of Offline Reinforcement Learning Algorithms ☆80 Updated last month
- General Modules for JAX ☆66 Updated last month
- Open source code for paper "Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning" ICML 2023 ☆46 Updated 2 weeks ago
- Foundation Policies with Hilbert Representations (ICML 2024) ☆88 Updated last year
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024) ☆69 Updated last year
- Synchronized Curriculum Learning for RL Agents ☆45 Updated 2 months ago
- Goal-Conditioned Reinforcement Learning with JAX ☆162 Updated 3 weeks ago
- Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning" ☆46 Updated last month
- Fast reinforcement learning research ☆61 Updated 5 months ago
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations ☆32 Updated 7 months ago
- Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks. ☆78 Updated last year
- Public code for "Reinforcement Learning from Passive Data via Latent Intentions" ☆89 Updated last year
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023) ☆84 Updated 6 months ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective ☆80 Updated 2 years ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity" ☆74 Updated last year
- ☆45 Updated 2 years ago
- ☆32 Updated last week
- Corax: Core RL in JAX ☆38 Updated last year
- ☆23 Updated 2 years ago
- PWM: Policy Learning with Large World Models ☆49 Updated 3 months ago
- Jax/Flax Implementation of TD-MPC2 ☆61 Updated last week
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult" ☆22 Updated last month
- ☆92 Updated 3 months ago
- ☆24 Updated 11 months ago
- Conservative Q learning in Jax ☆54 Updated 2 years ago