naumix / BiggerRegularizedOptimisticLinks
Official implementation of the BRO algorithm
☆46Updated 5 months ago
Alternatives and similar repositories for BiggerRegularizedOptimistic
Users that are interested in BiggerRegularizedOptimistic are comparing it to the libraries listed below
Sorting:
- ☆99Updated 4 months ago
- Online Goal-Conditioned Reinforcement Learning in JAX. ICLR 2025 Spotlight.☆173Updated 2 months ago
- A benchmark for offline goal-conditioned RL and offline RL☆196Updated 2 weeks ago
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning☆102Updated 11 months ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆103Updated last year
- a simple and scalable agent for training adaptive policies with sequence-based RL☆131Updated last week
- ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (pytorch)☆28Updated 8 months ago
- [ICML 2025 oral] Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning☆19Updated last month
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆86Updated 7 months ago
- Unified Implementations of Offline Reinforcement Learning Algorithms☆85Updated 2 months ago
- DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements …☆75Updated last year
- ☆28Updated last year
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆105Updated 3 weeks ago
- Transformer-based World Models☆83Updated 2 years ago
- Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"☆56Updated last month
- Skeleton for scalable and flexible Jax RL implementations☆83Updated 2 years ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆19Updated last year
- Synthetic Experience Replay☆94Updated last year
- Official code for ICML 2024 paper Reinformer: Max-Return Sequence Modeling for offline RL☆41Updated 9 months ago
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆23Updated last year
- ☆19Updated 2 months ago
- Foundation Policies with Hilbert Representations (ICML 2024)☆89Updated last year
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆76Updated last year
- The official implementation of "Horizon Reduction Makes RL Scalable"☆116Updated last month
- ☆49Updated 7 months ago
- Open source code for paper "Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning" ICML 2023☆46Updated last month
- Clean single-file implementation of offline RL algorithms in JAX☆150Updated 6 months ago
- Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)☆116Updated 11 months ago
- Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024☆70Updated last year
- Public code for "Reinforcement Learning from Passive Data via Latent Intentions"☆89Updated last year