Dragon-Zhuang / Reinformer
Official code for ICML 2024 paper Reinformer: Max-Return Sequence Modeling for offline RL
☆37Updated 3 months ago
Alternatives and similar repositories for Reinformer:
Users that are interested in Reinformer are comparing it to the libraries listed below
- ☆12Updated last month
- Official codebase for "Privileged Sensing Scaffolds Reinforcement Learning", contains the Scaffolder algorithm and Sensory Scaffolding Su…☆23Updated 9 months ago
- ☆20Updated 9 months ago
- ☆24Updated last year
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆76Updated last month
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization☆17Updated 9 months ago
- [NeurIPS 2023] Efficient Diffusion Policy☆91Updated last year
- Synthetic Experience Replay☆84Updated 8 months ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆51Updated last year
- ☆34Updated 2 months ago
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…☆45Updated last year
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆98Updated 8 months ago
- Author's PyTorch implementation of TD7 for online and offline RL☆124Updated last year
- DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements …☆70Updated 8 months ago
- DAC: Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning.☆12Updated 7 months ago
- A PyTorch implementation of Implicit Q-Learning☆71Updated 3 years ago
- ☆57Updated 2 months ago
- ☆29Updated last year
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆23Updated 9 months ago
- ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning☆21Updated last month
- Official implementation for: Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning ICLR'24☆24Updated 5 months ago
- Skeleton for scalable and flexible Jax RL implementations☆69Updated last year
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆81Updated last year
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆57Updated last year
- [ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model"☆83Updated this week
- Implementation of Jump-Start Reinforcement Learning (JSRL) with Stable Baselines3☆29Updated last year
- ☆68Updated 3 months ago
- Transformer-based World Models☆75Updated last year
- ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (pytorch)☆25Updated 3 months ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆82Updated last year