Adaptation of DQN, DDQN and COMA for multi-agent Gym environments
☆11Oct 3, 2023Updated 2 years ago
Alternatives and similar repositories for multiagent_gym
Users that are interested in multiagent_gym are comparing it to the libraries listed below.
- A multi-agent version of the Double DQN algorithm, with Foraging Task and Pursuit Game test scenarios☆13Apr 24, 2017Updated 8 years ago
- Multi-agent Reinforcement Learning Algorithms (COMA, VDN, QMIX)☆16May 24, 2020Updated 5 years ago
- Multi-Agent Reinforcement Learning for Path Planning☆15Jan 8, 2022Updated 4 years ago
- Project on multi agent reinforcement learning applied on patrolling agents☆40Dec 10, 2019Updated 6 years ago
- Here is our algorithm for Pursuit Problem based on the Distributed Reinforcement Learning for Cooperative Multi-robot Pursuit☆10Apr 17, 2019Updated 6 years ago
- Implementation of Machine Learning algorithms from scratch in Python☆34Jul 9, 2019Updated 6 years ago
- Deep Q Network for Multi-agent RL☆15Oct 18, 2020Updated 5 years ago
- ☆13Apr 11, 2022Updated 3 years ago
- solve pursuit-evasion problem with multi-agent deep reinforcement learning☆13Sep 9, 2020Updated 5 years ago
- Minimal fastai code needed for working with pytorch☆15Aug 25, 2021Updated 4 years ago
- ☆25Jul 10, 2025Updated 8 months ago
- Multi-Exit Evacuation simulation; Rainbow DQN application☆18Sep 4, 2020Updated 5 years ago
- Belief-Rule Based systems algorithms' implementation.☆16Dec 27, 2022Updated 3 years ago
- Pytorch implementations of the multi-agent reinforcement learning algorithms, including QMIX, VDN, COMA, MADDPG, MATD3, FACMAC and MASoft…☆55Mar 10, 2025Updated last year
- Project explores collaboration capabilities of VDN and IQL agents on a custom MARL Food Collector environment☆11Apr 6, 2022Updated 3 years ago
- We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting☆12Mar 9, 2018Updated 8 years ago
- Decision Transformer JAX - Reproduction of 'Decision Transformer: Reinforcement Learning via Sequence Modeling' in JAX and Haiku☆13Aug 14, 2024Updated last year
- Combination of Rapidly-Exploring Random Trees (RRT) and Safe Interval Path Planning (SIPP) for high-DOF planning in dynamic environments,…☆17Mar 13, 2026Updated 2 weeks ago
- Deep reinforcement learning with a particle dynamics environment applied to emergency evacuation of a room with obstacles☆10Mar 6, 2026Updated 3 weeks ago
- Parallel Quantum Annealing☆10Jan 7, 2023Updated 3 years ago
- 🎾 Multi-Agent Proximal Policy Optimization approach to a competitive reinforcement learning problem☆22Sep 25, 2022Updated 3 years ago
- Master Thesis Project in Computer Engineering at Aarhus University 2024 on "Simulating Multi-agent Path Planning in Complex environments …☆16Oct 12, 2025Updated 5 months ago
- ☆13Dec 21, 2018Updated 7 years ago
- Official Repository for Can Language Models be Instructed to Protect Personal Information?☆13Oct 8, 2023Updated 2 years ago
- ☆25Jan 2, 2019Updated 7 years ago
- RL projects including implementation of DQN/DDPG/MADDPG/BicNet on StarCraft II multi-agent learning environment SMAC☆47Feb 7, 2020Updated 6 years ago
- Implementation of the G2RL approach in the POGEMA environment☆14Jun 5, 2024Updated last year
- 基于优化算法的人员应急疏散优化方案 | Optimization Plan for Emergency Evacuation of Personnel Based on Optimization Algorithm☆13Sep 4, 2024Updated last year
- Multi-Agent PathFinding (MAPF) for 2D Robots moving inventory on a grid - Practice building environment + robots + planning + inventory m…☆16Nov 20, 2023Updated 2 years ago
- Supporting code for "Learning to Solve Combinatorial Graph Partitioning Problems via Efficient Exploration".☆13Jun 18, 2022Updated 3 years ago
- Model Context Protocol (MCP) server for mapping clinical terminology to Observational Medical Outcomes Partnership (OMOP) concepts using …☆34Feb 19, 2026Updated last month
- [ICLR 2025] UniCO: On Unified Combinatorial Optimization via Problem Reduction to Matrix-Encoded General TSP☆15Jun 20, 2025Updated 9 months ago
- Continual Multi-agent Reinforcement Learning in Dynamic Environments☆11Jul 1, 2021Updated 4 years ago
- ☆10Feb 22, 2023Updated 3 years ago
- A library for research in unnatural language semantics☆14Mar 5, 2026Updated 3 weeks ago
- A Novel Network-Flow Model for Building Evacuation: Route Choices of Evacuees are Modeled with Herding Effect☆11Sep 6, 2024Updated last year
- In this study, a multi agent chase-escape problem using Deep Q learning. Actors of the problem are smart evader and smart pursuers with o…☆28Jun 29, 2023Updated 2 years ago
- ☆12Dec 26, 2024Updated last year
- ☆12Nov 28, 2023Updated 2 years ago