OffDynamicsRL / off-dynamics-rl
☆16Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for off-dynamics-rl
- ☆20Updated last year
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization☆14Updated 6 months ago
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…☆43Updated last year
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning☆28Updated 2 years ago
- ☆14Updated last year
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆95Updated 5 months ago
- ☆13Updated last year
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆34Updated last year
- Benchmarked implementations of Offline RL Algorithms.☆64Updated 6 months ago
- The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023)☆17Updated last year
- ☆17Updated 6 months ago
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆21Updated 6 months ago
- Official implementation for: Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning ICLR'24☆22Updated 2 months ago
- xTED: Cross-Domain Adaptation via Diffusion-Based Trajectory Editing☆13Updated 3 weeks ago
- ☆19Updated 9 months ago
- Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…☆14Updated this week
- ☆18Updated last year
- ☆51Updated last year
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆76Updated 11 months ago
- Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL, published in ICLR 2023☆28Updated last year
- OGBench: Benchmarking Offline Goal-Conditioned RL☆75Updated 2 weeks ago
- Synthetic Experience Replay☆71Updated 5 months ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆21Updated last year
- Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).☆36Updated 9 months ago
- Domain-Robust Visual Imitation Learning with Mutual Information Constraints code☆15Updated 3 years ago
- ☆18Updated last year
- ☆13Updated 7 months ago
- ☆26Updated last year
- ☆47Updated last year