charleshsc / QTLinks
ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning
☆34Updated 10 months ago
Alternatives and similar repositories for QT
Users that are interested in QT are comparing it to the libraries listed below
Sorting:
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆74Updated 3 years ago
- [NeurIPS 2024] Official Implementation of Meta-DT☆50Updated last year
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆61Updated 2 years ago
- Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)☆60Updated last year
- [NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularizat…☆39Updated last year
- [NeurIPS 2023] Implementation of Elastic Decision Transformer☆36Updated 2 years ago
- [ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model"☆114Updated 9 months ago
- Official code repository for Prompt-DT.☆117Updated 3 years ago
- [ICML 2024] The algorithm of Reinforcement Learning with an Assistant Reward Agent (ReLara)☆14Updated last year
- [NeurIPS 2023] Efficient Diffusion Policy☆112Updated 2 years ago
- Official code for ICML 2024 paper Reinformer: Max-Return Sequence Modeling for offline RL☆44Updated last year
- ☆62Updated last year
- ☆37Updated 2 years ago
- A list of Offline to Online RL papers (continually updated)☆56Updated last week
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆53Updated last year
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆92Updated last year
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization☆22Updated last year
- Official codebase for CuGRO: Continual Offline Reinforcement Learning via Diffusion-based Dual Generative Replay☆32Updated last year
- Synthetic Experience Replay☆106Updated last year
- ☆115Updated 2 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆57Updated 2 years ago
- ☆51Updated 3 years ago
- ☆42Updated 2 years ago
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…☆46Updated 2 years ago
- ☆115Updated 2 years ago
- [ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations.☆94Updated 2 years ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆164Updated 2 years ago
- Code release for "HarmonyDream: Task Harmonization Inside World Models" (ICML 2024), https://arxiv.org/abs/2310.00344☆49Updated last year
- Implementation of Multi-Game Decision Transformers in PyTorch☆47Updated 2 years ago
- Benchmarked implementations of Offline RL Algorithms.☆76Updated 8 months ago