Manchery / iql-pytorch
Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL
☆22 Updated 3 months ago
Alternatives and similar repositories for iql-pytorch:
Users who are interested in iql-pytorch are comparing it to the libraries listed below.
- Source files to replicate experiments in my ICLR 2022 paper. ☆67 Updated 7 months ago
- ☆23 Updated last year
- ☆22 Updated 2 years ago
- Official Codebase for Offline Reinforcement Learning from Images with Latent Space Models ☆31 Updated 3 years ago
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL) ☆25 Updated 3 years ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations" ☆34 Updated 2 years ago
- Official repository for Paper "Offline Goal-Conditioned Reinforcement Learning via f-Advantage Regression" (NeurIPS 2022) ☆35 Updated last year
- ☆47 Updated last year
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg… ☆44 Updated last year
- Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL. ☆26 Updated 3 years ago
- ☆39 Updated 2 months ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023) ☆76 Updated 2 months ago
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization ☆26 Updated last year
- Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022 ☆26 Updated last year
- Code for the ICML 2023 paper "What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL?". ☆9 Updated last year
- EARL: Environment for Autonomous Reinforcement Learning ☆36 Updated 2 years ago
- ☆17 Updated 11 months ago
- ☆55 Updated 2 years ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination ☆26 Updated 2 years ago
- ☆17 Updated 3 years ago
- Implementation of SAC and TD3 based on various RNN and Transformer. ☆19 Updated 4 months ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral) ☆60 Updated last year
- V-MPO torch version with DMLab30 and GTrXL ☆12 Updated 3 years ago
- Pytorch code for "Learning Belief Representations for Imitation Learning in POMDPs" (UAI 2019) ☆18 Updated 2 years ago
- CORRO code ☆35 Updated 2 years ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity" ☆64 Updated 8 months ago
- Advantage weighted Actor Critic for Offline RL ☆50 Updated 2 years ago
- ☆23 Updated 9 months ago
- ☆22 Updated 3 years ago
- RL Algorithms for Visual Continuous Control ☆33 Updated last year