yihaosun1124 / pytorch-mopoLinks
re-implementation of the offline model-based RL algorithm MOPO in pytorch
☆25Updated 3 years ago
Alternatives and similar repositories for pytorch-mopo
Users that are interested in pytorch-mopo are comparing it to the libraries listed below
Sorting:
- Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.☆126Updated 3 years ago
- [ICML 2022] Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning☆35Updated 2 years ago
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)☆75Updated 2 years ago
- ☆17Updated last year
- Official code repository for Prompt-DT.☆113Updated 2 years ago
- Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆69Updated last year
- Author's PyTorch implementation of TD7 for online and offline RL☆144Updated last year
- ☆12Updated last year
- ☆111Updated 2 years ago
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning☆27Updated 3 years ago
- ☆41Updated 2 years ago
- ☆10Updated 4 years ago
- [ICML 2025 oral] Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning☆19Updated last month
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆34Updated 2 years ago
- A PyTorch implementation of Implicit Q-Learning☆83Updated 3 years ago
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning☆20Updated 2 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆170Updated 3 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆57Updated 2 years ago
- Codes accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆76Updated 7 months ago
- Conservative Q Learning on top of SAC☆132Updated 2 years ago
- Representation Learning for RL☆126Updated 2 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆86Updated last year
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆28Updated 2 years ago
- This is a repository for Hidden-utility Self-Play.☆26Updated last year
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆57Updated 2 years ago
- Official pytorch implementation of the paper <Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts>.☆20Updated 3 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆179Updated 3 years ago
- ☆19Updated 2 years ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆85Updated 2 years ago
- ☆56Updated 2 years ago