VictorYXL / ReplenishmentEnv
☆35Updated 7 months ago
Related projects ⓘ
Alternatives and complementary repositories for ReplenishmentEnv
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆54Updated 10 months ago
- This is the official implementation of ERL-Re2.☆59Updated 5 months ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)☆54Updated last year
- A novel preference-driven multi-objective reinforcement learning algorithm using a single policy network that covers the entire preferenc…☆25Updated last year
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆71Updated 2 years ago
- This repository is an implementation of "MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer"…☆21Updated last year
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆49Updated last year
- ☆28Updated last year
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."☆52Updated 4 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆73Updated 11 months ago
- The code for the SRDQN algorithm to train an agent for the beer game problem☆76Updated last year
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆146Updated 7 months ago
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)☆26Updated last year
- ☆28Updated last year
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆84Updated last year
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆27Updated 3 years ago
- ☆50Updated 4 months ago
- ☆16Updated last year
- Deep Implicit Coordination Graphs☆41Updated 5 months ago
- ☆158Updated last year
- Order Fulfillment by Multi-Agent Reinforcement Learning☆18Updated 11 months ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆30Updated this week
- Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotli…☆130Updated 3 years ago
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆129Updated 10 months ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆151Updated last year
- ☆50Updated 2 months ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆82Updated 4 years ago
- Code for "ALMA: Hierarchical Learning for Composite Multi-Agent Tasks" NeurIPS 2022☆24Updated 2 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆52Updated last year
- ☆36Updated 2 years ago