WentseChen / Soft-QMIX
Soft-QMIX: Integrating Maximum Entropy For Monotonic Value Function Factorization
☆15Updated last year
Alternatives and similar repositories for Soft-QMIX
Users who are interested in Soft-QMIX are comparing it to the libraries listed below
- curriculum☆26Updated 2 years ago
- [ICLR 2024] Official Implementation of ACORM☆62Updated last year
- [ICML' 24] The PyTorch implementation of our paper: "Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforc…☆22Updated last year
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC☆38Updated 4 months ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆56Updated last year
- Open-source codebase for MAZero, from "Efficient Multi-agent Reinforcement Learning by Planning" at ICLR 2024.☆37Updated last year
- Official codebase for CuGRO: Continual Offline Reinforcement Learning via Diffusion-based Dual Generative Replay☆32Updated last year
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)☆31Updated last week
- (Official) PyTorch implementation for Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning (EMU) (ICLR…☆52Updated last year
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆41Updated last year
- The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".☆46Updated last year
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆41Updated last year
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) (football game agent)☆62Updated 2 years ago
- MATE: the Multi-Agent Tracking Environment.☆48Updated 2 years ago
- ☆25Updated last year
- [NeurIPS 2024] Official Implementation of Meta-DT☆50Updated last year
- Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models"☆93Updated 5 months ago
- The code for paper 'STAS: Spatial-Temporal Return Decomposition for Multi-agent Reinforcement Learning'☆15Updated last year
- ☆55Updated 5 months ago
- LLM-Empowered State Representation for Reinforcement Learning (ICML2024 Accepted paper)☆36Updated last year
- A collection of recent MARL papers☆99Updated last year
- ICML'2024: HarmoDT: Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning☆23Updated last year
- An open source benchmark for Multi Agent Reinforcement Learning☆30Updated 2 years ago
- MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampl…☆19Updated last year
- ☆15Updated 3 years ago
- Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).☆48Updated last year
- Google Research Football MARL Benchmark and Research Toolkit☆52Updated last year
- [NeurIPS 2022] Official codebase for "Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learn…☆25Updated 9 months ago
- ICLR'2024: Learning Multi-Agent Communication from Graph Modeling Perspective☆49Updated last year
- (ICML 2023) The official code for RACE: Improve Multi-Agent Reinforcement Learning with Representation Asymmetry and Collaborative Evolut…☆40Updated 2 years ago