A new paper list for multi-agent reinforcement learning (actively maintained)
☆24Mar 27, 2020Updated 5 years ago
Alternatives and similar repositories for Paper-List-of-MARL
Users that are interested in Paper-List-of-MARL are comparing it to the libraries listed below
Sorting:
- ☆13Mar 16, 2023Updated 2 years ago
- Personal Repo to keep track of RL papers☆31May 3, 2021Updated 4 years ago
- Meta-Reinforcement Learning with Policy Residual Representation☆11Aug 15, 2019Updated 6 years ago
- Benchmark result of different RL algorithms on MetaDrive environments, including Multi-agent RL (IPPO, centralized critics, CoPO).☆16Oct 25, 2022Updated 3 years ago
- Codes accompanying the paper "Influence-Based Multi-Agent Exploration" (ICLR 2020 spotlight)☆33Mar 16, 2020Updated 5 years ago
- Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)☆19Aug 20, 2023Updated 2 years ago
- A list of multi-task machine learning papers.☆37Nov 4, 2018Updated 7 years ago
- 清华大学研究生社会实践系统爬虫☆17Jun 4, 2024Updated last year
- an implementation of 'Recurrent Convolutional Neural Network for Object Recognition'☆27Sep 26, 2019Updated 6 years ago
- ☆108Feb 10, 2021Updated 5 years ago
- Simulink Reference example for modeling smart trucks with the intelligence to form a platoon based on certain criteria.☆26Nov 26, 2019Updated 6 years ago
- CARMA Streets is a component of CARMA ecosystem, which enables such a coordination among different transportation users. This component p…☆11Aug 21, 2025Updated 6 months ago
- ☆30Dec 2, 2021Updated 4 years ago
- Learning Individual Intrinsic Reward in MARL☆65Dec 8, 2022Updated 3 years ago
- Code implementing the CORE-RL algorithm with DDPG, PPO, and TRPO. See the paper "Control Regularization for Reduced Variance Reinforcemen…☆32Jan 7, 2021Updated 5 years ago
- We implement MADDPG in a congestion env, and compare with several control groups to highlight the performance of MADDPG☆11Jul 14, 2021Updated 4 years ago
- Compare Laguerre-based MPC and Traditional MPC for platoon of vehicles.☆13Feb 14, 2023Updated 3 years ago
- Paper Collection of Reinforcement Learning Exploration covers Exploration of Muti-Arm-Bandit, Reinforcement Learning and Multi-agent Rein…☆36Nov 8, 2019Updated 6 years ago
- Gridworld for MARL experiments☆144Jan 29, 2021Updated 5 years ago
- MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols☆17Nov 19, 2025Updated 3 months ago
- ☆10Dec 16, 2023Updated 2 years ago
- Distributed multi-agent average consensus☆11Mar 17, 2020Updated 5 years ago
- This is the class in matlab for convex optimization algorithms☆10Nov 19, 2023Updated 2 years ago
- Build a very simple intersection assistant driving system simulation based on SUMO and TraCI4Matlab☆10Feb 24, 2019Updated 7 years ago
- The Ecoacoustic Dataset from Arctic North Slope Alaska☆11May 29, 2025Updated 9 months ago
- Simulation code of paper "H. Zhu, Z. Wang, F. Yang, Y. Zhou and X. Luo, "Intelligent Traffic Network Control in the Era of Internet of Ve…☆11Mar 30, 2022Updated 3 years ago
- This tool develops driving cycles for specific driving style properties (comfort, consumption, fastness, subjective safety) of AVs.It is …☆13May 16, 2022Updated 3 years ago
- Simple model for sentence compression (a.k.a Baseline in Klerke et al., NAACL 2016)☆10Dec 16, 2018Updated 7 years ago
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…☆23Oct 1, 2025Updated 5 months ago
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.☆370Mar 16, 2023Updated 2 years ago
- ☆13Oct 8, 2024Updated last year
- 输入时延下的固定是时延一致性☆15Mar 25, 2022Updated 3 years ago
- ☆10Nov 26, 2020Updated 5 years ago
- NeurIPS 2024☆13Oct 29, 2025Updated 4 months ago
- This repository is the official implementation of Low-Rank Modular Reinforcement Learning via Muscle Synergy.☆11Oct 27, 2022Updated 3 years ago
- Cooperation and Fairness in Multi-Agent Reinforcement Learning☆16Aug 6, 2025Updated 7 months ago
- 二阶系统比例一致性☆11Mar 25, 2022Updated 3 years ago
- A dataset containing features extracted from measurements collected from multi-hop WSN deployments☆10Nov 9, 2016Updated 9 years ago
- Code and data recipes for the paper: Optimal Condition Training for Target Source Separation by Efthymios Tzinis, Gordon Wichern, Paris S…☆14Feb 15, 2023Updated 3 years ago