WentseChen / Soft-QMIXLinks
Soft-QMIX: Integrating Maximum Entropy For Monotonic Value Function Factorization
☆15Updated last year
Alternatives and similar repositories for Soft-QMIX
Users that are interested in Soft-QMIX are comparing it to the libraries listed below
Sorting:
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC☆37Updated 3 months ago
- curriculum☆26Updated 2 years ago
- Open-source codebase for MAZero, from "Efficient Multi-agent Reinforcement Learning by Planning" at ICLR 2024.☆37Updated last year
- [ICML' 24] The PyTorch implementation of our paper: "Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforc…☆21Updated last year
- [NeurIPS 2024] Official Implementation of Meta-DT☆48Updated last year
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆56Updated last year
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆41Updated last year
- LLM multi-agent discussion framework for multi-agent/robot situations.☆39Updated last year
- [ICLR 2024] Official Implementation of ACORM☆61Updated last year
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆41Updated 11 months ago
- M^3PC: Test-Time Model Predictive Control for Pretrained Masked Trajectory Model, ICLR 2025☆17Updated 7 months ago
- ☆54Updated 5 months ago
- Implementation of SAC and TD3 based on various RNN and Transformer.☆26Updated last year
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆53Updated last year
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆61Updated 2 years ago
- (Official) PyTorch implementation for Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning (EMU) (ICLR…☆52Updated last year
- [NeurIPS 2022] Official codebase for "Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learn…☆24Updated 8 months ago
- Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models"☆86Updated 4 months ago
- This is the official PyTorch implementation of the paper "Boosting Continuous Control with Consistency Policy".☆43Updated last year
- MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampl…☆19Updated last year
- MATE: the Multi-Agent Tracking Environment.☆48Updated 2 years ago
- ICLR 2024: SafeDreamer: Safe Reinforcement Learning with World Models☆79Updated last year
- ☆20Updated last year
- official implementation of QVPO☆52Updated last year
- ☆48Updated last year
- [NeurIPS' 24] The PyTorch implementation of our paper: "Kaleidoscope: Learnable Masks for Heterogeneous Multi-agent Reinforcement Learnin…☆20Updated last year
- [ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model"☆112Updated 8 months ago
- Official codebase for CuGRO: Continual Offline Reinforcement Learning via Diffusion-based Dual Generative Replay☆32Updated last year
- LLM-Empowered State Representation for Reinforcement Learning (ICML2024 Accepted paper)☆35Updated last year
- ☆61Updated 11 months ago