WentseChen / Soft-QMIX
Soft-QMIX: Integrating Maximum Entropy For Monotonic Value Function Factorization
☆12 Updated 6 months ago
Alternatives and similar repositories for Soft-QMIX:
Users who are interested in Soft-QMIX are comparing it to the libraries listed below.
- curriculum ☆20 Updated last year
- ☆27 Updated 9 months ago
- [ICML' 24] The PyTorch implementation of our paper: "Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforc… ☆15 Updated 7 months ago
- Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models" ☆51 Updated 2 weeks ago
- Google Research Football MARL Benchmark and Research Toolkit ☆37 Updated 8 months ago
- ☆20 Updated last year
- Open-source codebase for MAZero, from "Efficient Multi-agent Reinforcement Learning by Planning" at ICLR 2024. ☆23 Updated 8 months ago
- MATE: the Multi-Agent Tracking Environment. ☆44 Updated last year
- Overcooked human-AI experiment platform ☆32 Updated last year
- ☆29 Updated 2 years ago
- This repository provides a survey on the applications of deep generative models for offline reinforcement learning and imitation learning… ☆38 Updated 5 months ago
- [ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model" ☆83 Updated last week
- MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampl… ☆15 Updated 11 months ago
- ICLR'2024: Learning Multi-Agent Communication from Graph Modeling Perspective ☆23 Updated 10 months ago
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1) ☆26 Updated last month
- The implementation of ICLR-2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data". ☆38 Updated 2 months ago
- Official pytorch implementation of the paper <Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts>. ☆19 Updated 2 years ago
- ELIGN: Expectation Alignment as a Multi-agent Intrinsic Reward ☆18 Updated 2 years ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity. ☆85 Updated last year
- [NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularizat… ☆31 Updated 10 months ago
- [ICML'2023] "AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners" ☆54 Updated last year
- [TNNLS] PGDQN: A generalized and efficient preference-guided epsilon-greedy policy equipped DQN for Atari and Autonomous Driving ☆10 Updated last year
- ☆57 Updated 2 months ago
- ☆20 Updated 8 months ago
- ☆38 Updated 2 years ago
- ☆42 Updated 2 years ago
- rlplot is an easy to use and highly encapsulated RL plot library (including basic error bar lineplot and a wrapper to "rliable"). ☆28 Updated last year
- Codes for the paper "Multi-task Hierarchical Adversarial Inverse Reinforcement Learning" ☆16 Updated last year
- This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collect widely used benchmar… ☆117 Updated last year
- D2C(Data-driven Control Library) is a library for data-driven control based on reinforcement learning. ☆23 Updated last year