A new paper list for multi-agent reinforcement learning (actively maintained)
☆24 · Mar 27, 2020 · Updated 6 years ago
Alternatives and similar repositories for Paper-List-of-MARL
Users that are interested in Paper-List-of-MARL are comparing it to the libraries listed below.
- ☆13 · Mar 16, 2023 · Updated 3 years ago
- Codes accompanying the paper "Influence-Based Multi-Agent Exploration" (ICLR 2020 spotlight) ☆34 · Mar 16, 2020 · Updated 6 years ago
- Meta-Reinforcement Learning with Policy Residual Representation ☆11 · Aug 15, 2019 · Updated 6 years ago
- Personal Repo to keep track of RL papers ☆31 · May 3, 2021 · Updated 5 years ago
- Official eval code for ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation ☆26 · Dec 12, 2025 · Updated 5 months ago
- Benchmark result of different RL algorithms on MetaDrive environments, including Multi-agent RL (IPPO, centralized critics, CoPO). ☆16 · Oct 25, 2022 · Updated 3 years ago
- ☆108 · Feb 10, 2021 · Updated 5 years ago
- rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym. ☆14 · Apr 3, 2017 · Updated 9 years ago
- Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios) ☆19 · Aug 20, 2023 · Updated 2 years ago
- SIR, SEIR, and beyond ☆10 · Jul 6, 2023 · Updated 2 years ago
- Code for ICLR 2019 paper: Learning when to Communicate at Scale in Multiagent Cooperative and Competitive Tasks ☆233 · Oct 3, 2023 · Updated 2 years ago
- Code for paper 'Learning transferable cooperative behaviors in multi-agent teams' (ICML 2019) ☆122 · Dec 8, 2022 · Updated 3 years ago
- Paper Collection of Reinforcement Learning Exploration covers Exploration of Multi-Arm-Bandit, Reinforcement Learning and Multi-agent Rein… ☆37 · Nov 8, 2019 · Updated 6 years ago
- MPE-pytorch, with some bugs fixed. ☆11 · Apr 26, 2020 · Updated 6 years ago
- Code for the paper ☆11 · May 24, 2024 · Updated 2 years ago
- Learning Individual Intrinsic Reward in MARL ☆65 · Dec 8, 2022 · Updated 3 years ago
- Reproduces DQN, REINFORCE, REINFORCE with baseline, one-step AC, QAC, QAC with shared network, PPO2, DDPG, TD3, SAC, SAC discrete, A2C, A3C ☆21 · Jul 27, 2020 · Updated 5 years ago
- OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradi… ☆16 · May 25, 2026 · Updated last week
- A simple program scheduler for your code on different devices. ☆12 · Mar 8, 2026 · Updated 3 months ago
- Submission for MAVEN: Multi-Agent Variational Exploration ☆60 · Apr 6, 2022 · Updated 4 years ago
- ☆10 · Nov 4, 2019 · Updated 6 years ago
- A pathway and collection of resources for learning Jax from beginner to advanced. ☆11 · Jan 2, 2021 · Updated 5 years ago
- Multi-agent learning library ☆22 · Dec 28, 2021 · Updated 4 years ago
- Code for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics ☆15 · Jan 7, 2020 · Updated 6 years ago
- Official PyTorch implementation of "ACE: Off-Policy Actor-Critic with Causality-Aware Entropy Regularization" ☆35 · May 13, 2024 · Updated 2 years ago
- ☆12 · Aug 15, 2020 · Updated 5 years ago
- PyTorch implementation of "Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control" ☆53 · Dec 8, 2022 · Updated 3 years ago
- Code for SyncTwin: Treatment Effect Estimation with Longitudinal Outcomes (NeurIPS 2021) ☆12 · Nov 30, 2021 · Updated 4 years ago
- Code for reproducing the results in "Forecasting Human Dynamics from Static Images" ☆13 · Jun 16, 2024 · Updated last year
- ☆12 · Sep 30, 2017 · Updated 8 years ago
- A public repo for ICML 2021 "Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks" ☆13 · Jul 19, 2021 · Updated 4 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020. ☆103 · Mar 6, 2025 · Updated last year
- Repo containing code for multi-agent deep reinforcement learning (MADRL). ☆747 · Apr 12, 2023 · Updated 3 years ago
- Gridworld for MARL experiments ☆146 · Jan 29, 2021 · Updated 5 years ago
- Official code for "Traffic Speed Imputation with Spatio-Temporal Attentions and Cycle-Perceptual Training" (CIKM'22). ☆13 · Mar 8, 2024 · Updated 2 years ago
- SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters (ICLR 2025) ☆17 · Aug 22, 2025 · Updated 9 months ago
- An adaptive training algorithm for residual network ☆17 · Aug 22, 2020 · Updated 5 years ago
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment ☆75 · Sep 15, 2024 · Updated last year
- Cooperation and Fairness in Multi-Agent Reinforcement Learning ☆16 · Aug 6, 2025 · Updated 10 months ago