keirp / return_transforms
☆19Updated 2 years ago
Alternatives and similar repositories for return_transforms:
Users that are interested in return_transforms are comparing it to the libraries listed below
- ☆42Updated 3 years ago
- Code for FOCAL Paper Published at ICLR 2021☆52Updated last year
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆58Updated 2 years ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆67Updated 2 years ago
- Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"☆19Updated 2 years ago
- Code for NeurIPS 2021 paper "Offline Reinforcement Learning with Reverse Model-based Imagination"☆19Updated 3 years ago
- Github repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options☆45Updated 3 years ago
- ☆17Updated 2 years ago
- ☆53Updated last year
- Official Implementation for Quality-Similar Diversity via Population Based Reinforcement Learning☆17Updated 2 years ago
- Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)☆55Updated 11 months ago
- Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]☆47Updated 2 years ago
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆74Updated 2 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆53Updated 2 years ago
- Implementation of Multi-Game Decision Transformers in PyTorch☆46Updated 2 years ago
- Model-Based Offline Reinforcement Learning☆50Updated 4 years ago
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"☆19Updated 3 years ago
- Official code repository for Prompt-DT.☆109Updated 2 years ago
- Paper Collection for Batch RL with brief introductions.☆85Updated 3 years ago
- ☆12Updated last year
- ☆29Updated 3 years ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆55Updated 9 months ago
- [ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration☆26Updated 3 years ago
- Official pytorch implementation of the paper <Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts>.☆19Updated 3 years ago
- CORRO code☆35Updated 2 years ago
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)☆20Updated 2 years ago
- Benchmarked implementations of Offline RL Algorithms.☆73Updated last month
- ☆14Updated 3 years ago
- ☆55Updated 2 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆82Updated last year