Dragon-Zhuang / Reinformer
Official code for the ICML 2024 paper "Reinformer: Max-Return Sequence Modeling for Offline RL"
☆33 Updated last month
Related projects
Alternatives and complementary repositories for Reinformer
- ☆21 Updated last year
- Synthetic Experience Replay ☆74 Updated 5 months ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023) ☆76 Updated last year
- Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL, published in ICLR 2023 ☆29 Updated last year
- Official implementation for: Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning (ICLR'24) ☆21 Updated 2 months ago
- ☆53 Updated last week
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations ☆95 Updated 5 months ago
- Code accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024). ☆41 Updated 9 months ago
- OGBench: Benchmarking Offline Goal-Conditioned RL ☆83 Updated 3 weeks ago
- Official implementation of the BRO algorithm ☆10 Updated 3 weeks ago
- DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements … ☆62 Updated 5 months ago
- ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (PyTorch) ☆25 Updated last month
- ☆22 Updated this week
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization ☆15 Updated 7 months ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning) ☆52 Updated last month
- ☆55 Updated last month
- ☆17 Updated 7 months ago
- [NeurIPS 2023] Implementation of Elastic Decision Transformer ☆30 Updated last year
- Transformer-based World Models ☆71 Updated last year
- [NeurIPS 2023] Efficient Diffusion Policy ☆82 Updated last year
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024) ☆59 Updated last year
- Official code repository for Prompt-DT. ☆98 Updated 2 years ago
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View. ☆21 Updated 7 months ago
- ☆22 Updated 10 months ago
- ☆47 Updated last year
- Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO). ☆73 Updated 11 months ago
- ☆63 Updated 5 months ago
- ☆26 Updated last year
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity" ☆57 Updated 5 months ago