keirp / return_transforms
☆19Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for return_transforms
- Model-Based Offline Reinforcement Learning☆47Updated 3 years ago
- Code for FOCAL Paper Published at ICLR 2021☆49Updated 11 months ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆66Updated 2 years ago
- [ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration☆26Updated 2 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆47Updated last year
- Official Implementation for Quality-Similar Diversity via Population Based Reinforcement Learning☆17Updated last year
- Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)☆52Updated 6 months ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆54Updated last year
- Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆17Updated 2 years ago
- ☆40Updated 3 years ago
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…☆43Updated last year
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"☆19Updated 2 years ago
- ☆22Updated 2 years ago
- Paper Collection for Batch RL with brief introductions.☆85Updated 2 years ago
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)☆71Updated 2 years ago
- ☆24Updated last year
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆70Updated 2 years ago
- ☆47Updated last year
- ☆53Updated 8 months ago
- Code for NeurIPS 2021 paper "Curriculum Offline Imitation Learning"☆17Updated 2 years ago
- Meta RL codebase for Unstable Baselines☆20Updated last year
- CORRO code☆34Updated 2 years ago
- ☆51Updated last year
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning☆19Updated last year
- ExORL: Exploratory Data for Offline Reinforcement Learning☆104Updated 2 years ago
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization☆14Updated 6 months ago
- code for the paper Offline Prioritized Experience Replay☆13Updated last year
- Official code repository for Prompt-DT.☆96Updated 2 years ago
- ☆106Updated last year
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆52Updated 9 months ago