dematsunaga / alberdice
Official PyTorch implementation of AlberDICE
☆23 Updated last year
Alternatives and similar repositories for alberdice
Users that are interested in alberdice are comparing it to the libraries listed below
- Representation Learning for RL ☆128 Updated 2 years ago
- A PyTorch implementation of Implicit Q-Learning ☆91 Updated 4 years ago
- Author's PyTorch implementation of TD7 for online and offline RL ☆153 Updated 2 years ago
- Synthetic Experience Replay ☆106 Updated last year
- An open source benchmark for Multi Agent Reinforcement Learning ☆30 Updated 2 years ago
- Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022) ☆60 Updated last year
- Official codebase for "B-Pref: Benchmarking Preference-Based Reinforcement Learning"; contains scripts to reproduce experiments. ☆131 Updated 4 years ago
- Conservative Q Learning on top of SAC ☆132 Updated 3 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co… ☆141 Updated last year
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning. ☆40 Updated 9 months ago
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization ☆22 Updated last year
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch. ☆80 Updated 2 years ago
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO ☆181 Updated 3 years ago
- Implementation of Trajectory Transformer with attention caching and batched beam search ☆116 Updated 2 years ago
- Extreme Q-Learning: Max Entropy RL without Entropy ☆87 Updated 2 years ago
- StarCraft II Multi Agent Challenge: QMIX, COMA, LIIR, QTRAN, Central V, ROMA, RODE, DOP, Graph MIX ☆73 Updated 4 years ago
- Datasets with baselines for Offline MARL. ☆188 Updated 2 weeks ago
- Implementation of Multi-Game Decision Transformers in PyTorch ☆47 Updated 2 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm. ☆174 Updated last year
- Official code repository for Prompt-DT. ☆117 Updated 3 years ago
- Official codebase for GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning. ☆27 Updated last year
- 🔥 Datasets and env wrappers for offline safe reinforcement learning ☆107 Updated last week
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23) ☆61 Updated 2 years ago
- ☆15 Updated last year
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral) ☆68 Updated last year
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21) ☆79 Updated 3 years ago
- ☆58 Updated 2 years ago
- Code for Tackling Long-Horizon Tasks with Model-based Offline Reinforcement Learning ☆13 Updated 9 months ago
- [NeurIPS 2024] Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow ☆39 Updated last year
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations ☆112 Updated last year