IQL, QMIX, VDN, COMA, QTRAN (QTRAN-Base and QTRAN-Alt), MAVEN, CommNet, DyMA-CL, G2ANet, and MADDPG
☆19Dec 6, 2021Updated 4 years ago
Alternatives and similar repositories for MARL_Agent_and_ENV
Users that are interested in MARL_Agent_and_ENV are comparing it to the libraries listed below.
- Multi-agent Reinforcement Learning Algorithms (COMA, VDN, QMIX)☆16May 24, 2020Updated 6 years ago
- ☆22Sep 28, 2018Updated 7 years ago
- an implementation of ATOC☆14Dec 6, 2021Updated 4 years ago
- ☆14Sep 27, 2019Updated 6 years ago
- multi-agent pathfinding via dqn☆16May 19, 2021Updated 5 years ago
- meta-MADDPG (Python implementation)☆19Sep 16, 2018Updated 7 years ago
- This project is a collaborative effort to apply reinforcement learning to traffic signal control; the code will be updated in sync.☆12Mar 11, 2019Updated 7 years ago
- Visualization of a multi-agent reinforcement learning (MARL)-based strategy with efficient exploration.☆20Oct 28, 2022Updated 3 years ago
- PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.☆542Jul 21, 2023Updated 2 years ago
- Code for the paper "Age-of-Information-based Scheduling in Multiuser Uplinks with Stochastic Arrivals: A POMDP Approach"☆13Nov 3, 2023Updated 2 years ago
- This program aims to solve an MAPP problem raised in our paper published at the Chinese Automation Conference (CAC 2021), and the pro…☆22May 28, 2023Updated 3 years ago
- Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario…☆1,745Sep 8, 2022Updated 3 years ago
- Simulation code of paper "H. Zhu, Z. Wang, F. Yang, Y. Zhou and X. Luo, "Intelligent Traffic Network Control in the Era of Internet of Ve…☆11Mar 30, 2022Updated 4 years ago
- Project explores collaboration capabilities of VDN and IQL agents on a custom MARL Food Collector environment☆11Apr 6, 2022Updated 4 years ago
- We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting☆12Mar 9, 2018Updated 8 years ago
- Deep reinforcement learning with a particle dynamics environment applied to emergency evacuation of a room with obstacles☆10Mar 6, 2026Updated 3 months ago
- MATE: the Multi-Agent Tracking Environment.☆46Mar 31, 2023Updated 3 years ago
- Code for ICML25 paper "HYGMA: Hypergraph Coordination Networks with Dynamic Grouping for Multi-Agent Reinforcement Learning"☆25Nov 11, 2025Updated 7 months ago
- Implementation of the G2RL approach in the POGEMA environment☆15Jun 5, 2024Updated 2 years ago
- Learning to Ground Multi-Agent Communication with Autoencoders [NeurIPS 2021]☆49Oct 29, 2021Updated 4 years ago
- ☆21Apr 5, 2024Updated 2 years ago
- Training and testing pipeline for ransomware classification based on screenshots of the splash screens or ransom notes (https://arxiv.org…☆11Jul 19, 2020Updated 5 years ago
- ☆17Sep 23, 2022Updated 3 years ago
- Code for the paper "Independent Reinforcement Learning for Weakly Cooperative Multiagent Traffic Control Problem"☆17Dec 14, 2020Updated 5 years ago
- Continual Multi-agent Reinforcement Learning in Dynamic Environments☆11Jul 1, 2021Updated 4 years ago
- ☆14Mar 28, 2020Updated 6 years ago
- MiniMax Multi-Agent Deep Deterministic Policy Gradient (M3DDPG) pytorch implementation☆15Feb 19, 2021Updated 5 years ago
- ICML 2019 RL for Real Life Workshop: Recurrent MADDPG for Partially Observable and Limited Communication Settings☆50Dec 17, 2019Updated 6 years ago
- RL projects including implementation of DQN/DDPG/MADDPG/BicNet on StarCraft II multi-agent learning environment SMAC☆46Feb 7, 2020Updated 6 years ago
- ☆114Oct 25, 2021Updated 4 years ago
- Adaptation of DQN, DDQN and COMA for multi-agent Gym environments☆10Oct 3, 2023Updated 2 years ago
- A Novel Network-Flow Model for Building Evacuation: Route Choices of Evacuees are Modeled with Herding Effect☆11Sep 6, 2024Updated last year
- ☆15May 1, 2022Updated 4 years ago
- Here is our algorithm for Pursuit Problem based on the Distributed Reinforcement Learning for Cooperative Multi-robot Pursuit☆10Apr 17, 2019Updated 7 years ago
- Prioritized Sequence Experience Replay☆10Aug 16, 2021Updated 4 years ago
- [ECCV'2024] HERGen: Elevating Radiology Report Generation with Longitudinal Data☆31Jan 25, 2026Updated 4 months ago
- A repository of load frequency control models implemented in MATLAB☆12Feb 22, 2025Updated last year
- This is a MADDPG algorithm to be used on particle environment styles. I use it to test my own scenarios for underwater target localizatio…☆18Jun 23, 2021Updated 4 years ago
- Version 3.0.0 Pytorch implementations of DQN, DDQN, DDPG, SAC, Discrete SAC. With more features :)☆12Feb 16, 2023Updated 3 years ago