Dragon-Zhuang / BPPO
Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).
☆73Updated 11 months ago
Related projects ⓘ
Alternatives and complementary repositories for BPPO
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆48Updated last year
- A collection of offline reinforcement learning algorithms.☆159Updated 5 months ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆51Updated 5 months ago
- Author's PyTorch implementation of TD7 for online and offline RL☆117Updated last year
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)☆65Updated last year
- 🔥 Datasets and env wrappers for offline safe reinforcement learning☆76Updated 2 months ago
- ☆61Updated last year
- 🚀 A fast safe reinforcement learning library in PyTorch☆165Updated last month
- Model-based Offline Policy Optimization re-implement all by pytorch☆28Updated last year
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆71Updated 2 years ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.☆62Updated last year
- [ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model"☆71Updated 2 months ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆54Updated last year
- Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021☆62Updated 3 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆124Updated 6 months ago
- The implementation of ICLR-2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".☆38Updated 3 weeks ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆157Updated 2 years ago
- DSAC; Distributional Soft Actor-Critic☆113Updated 9 months ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆38Updated 4 years ago
- Implementations of safe reinforcement learning algorithms☆21Updated 8 months ago
- Transformer in RL for decision-making☆74Updated last year
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆177Updated 2 months ago
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆51Updated last year
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆96Updated 3 years ago
- ☆106Updated last year
- Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models"☆42Updated 3 weeks ago
- ☆52Updated last year
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆84Updated last year
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment☆40Updated 2 months ago
- Conservative Q Learning on top of SAC☆120Updated 2 years ago