OffDynamicsRL / off-dynamics-rl
☆21 · Updated this week
Related projects
Alternatives and complementary repositories for off-dynamics-rl
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg… ☆44 · Updated last year
- ☆14 · Updated last year
- ☆21 · Updated last year
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning ☆28 · Updated 2 years ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations ☆95 · Updated 5 months ago
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View. ☆21 · Updated 7 months ago
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization ☆15 · Updated 7 months ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations" ☆34 · Updated last year
- ☆52 · Updated last year
- Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D… ☆15 · Updated 2 weeks ago
- Benchmarked implementations of Offline RL Algorithms. ☆65 · Updated 6 months ago
- ☆47 · Updated last year
- Official Codebase for Offline Reinforcement Learning from Images with Latent Space Models ☆29 · Updated 3 years ago
- Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL. ☆26 · Updated 2 years ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023) ☆76 · Updated last year
- ☆22 · Updated 9 months ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022) ☆66 · Updated 2 years ago
- The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023) ☆17 · Updated last year
- Code for "Masked Autoencoding for Scalable and Generalizable Decision Making". NeurIPS 2022 ☆44 · Updated 8 months ago
- ☆17 · Updated 7 months ago
- Official implementation for: Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning ICLR'24 ☆21 · Updated 2 months ago
- Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML… ☆25 · Updated last year
- CORRO code ☆34 · Updated 2 years ago
- OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation ☆13 · Updated last year
- ☆17 · Updated 2 years ago
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization ☆26 · Updated last year
- ☆18 · Updated last year
- ☆26 · Updated last year
- Synthetic Experience Replay ☆74 · Updated 5 months ago