zzmtsvv / ORL
☆50Updated 10 months ago
Alternatives and similar repositories for ORL:
Users that are interested in ORL are comparing it to the libraries listed below
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆69Updated last year
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…☆45Updated last year
- Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023☆52Updated last year
- ☆53Updated last year
- Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL☆22Updated 2 months ago
- Synthetic Experience Replay☆84Updated 8 months ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆51Updated last year
- ☆29Updated last year
- DAC: Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning.☆12Updated 7 months ago
- Implementation of ``Actor-Critic Alignment for Offline-to-Online Reinforcement Learning''☆13Updated last year
- Conservative Q Learning on top of SAC☆122Updated 2 years ago
- ☆34Updated 2 months ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆76Updated last month
- Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL, published in ICLR 2023☆30Updated last month
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆98Updated 8 months ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆60Updated last year
- Official repository for Paper "Offline Goal-Conditioned Reinforcement Learning via f-Advantage Regression" (NeurIPS 2022)☆35Updated last year
- Author's PyTorch implementation of TD7 for online and offline RL☆124Updated last year
- Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022☆26Updated last year
- ☆24Updated last year
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆26Updated last year
- A PyTorch implementation of Implicit Q-Learning☆71Updated 3 years ago
- Official codebase for GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning.☆17Updated 2 months ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆34Updated 2 years ago
- ☆20Updated 9 months ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆66Updated 2 years ago
- Fast and flexible multi-agent gridworld reinforcement learning environments.☆37Updated 3 weeks ago
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆47Updated last year
- Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)☆52Updated 9 months ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆57Updated last year