keirp / return_transforms
☆19 · Updated 2 years ago
Alternatives and similar repositories for return_transforms:
Users who are interested in return_transforms are comparing it to the libraries listed below.
- Official Implementation for Quality-Similar Diversity via Population Based Reinforcement Learning ☆17 · Updated last year
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning" ☆57 · Updated last year
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg… ☆45 · Updated last year
- Code for FOCAL Paper Published at ICLR 2021 ☆51 · Updated last year
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition" ☆19 · Updated 2 years ago
- Code for NeurIPS 2021 paper "Curriculum Offline Imitation Learning" ☆17 · Updated 2 years ago
- Model-Based Offline Reinforcement Learning ☆48 · Updated 4 years ago
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21) ☆73 · Updated 2 years ago
- Code for NeurIPS 2022 paper "Robust Offline Reinforcement Learning via Conservative Smoothing" ☆19 · Updated last year
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR 2022) ☆66 · Updated 2 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR 2023) ☆50 · Updated last year
- Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022) ☆52 · Updated 8 months ago
- Official code repository for Prompt-DT. ☆102 · Updated 2 years ago
- Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021] ☆46 · Updated 2 years ago
- [ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration ☆26 · Updated 2 years ago
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7 ☆20 · Updated 5 years ago
- Implementation of Multi-Game Decision Transformers in PyTorch ☆45 · Updated last year
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization ☆17 · Updated 9 months ago
- Re-implementations of SOTA RL algorithms. ☆129 · Updated last year
- Benchmarked implementations of Offline RL Algorithms. ☆68 · Updated this week
- Official repository of the paper "FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning" ☆18 · Updated 5 months ago
- Code for the paper "Offline Prioritized Experience Replay" ☆13 · Updated last year
- Official codebase for the TMLR 2023 paper "Benchmarks and Algorithms for Offline Preference-Based Reward Learning" ☆20 · Updated 2 years ago