dojeon-ai / SimbaV2
Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"
☆44 Updated this week
Alternatives and similar repositories for SimbaV2:
Users who are interested in SimbaV2 are comparing it to the libraries listed below:
- ☆18 Updated 3 months ago
- ☆82 Updated 2 months ago
- Unified Implementations of Offline Reinforcement Learning Algorithms ☆64 Updated last week
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult" ☆22 Updated last week
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024) ☆68 Updated last year
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024 ☆28 Updated 8 months ago
- Foundation Policies with Hilbert Representations (ICML 2024) ☆84 Updated last year
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity" ☆70 Updated 11 months ago
- Skeleton for scalable and flexible Jax RL implementations ☆80 Updated last year
- ☆47 Updated 2 years ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023) ☆80 Updated 5 months ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations" ☆55 Updated last year
- Learning diverse options through the Laplacian representation. ☆23 Updated last year
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View. ☆23 Updated last year
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023 ☆19 Updated last year
- [ICML 2023] Pre-train world model-based agents with different unsupervised strategies, fine-tune the agent's components selectively, and … ☆37 Updated last year
- MR.Q is a general-purpose model-free reinforcement learning algorithm. ☆91 Updated 3 weeks ago
- Conservative Q learning in Jax ☆53 Updated 2 years ago
- ☆41 Updated 9 months ago
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin… ☆50 Updated 2 weeks ago
- Corax: Core RL in JAX ☆37 Updated last year
- ☆76 Updated last month
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations ☆100 Updated 11 months ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier" ☆23 Updated last year
- Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" … ☆16 Updated 10 months ago
- Meta-RL Model-Based Algorithm ☆33 Updated this week
- Goal-Conditioned Reinforcement Learning with JAX ☆150 Updated this week
- ☆44 Updated last year
- ☆75 Updated 6 months ago
- Synthetic Experience Replay ☆93 Updated 11 months ago