hsvgbkhgbv / SQDDPG
This is a framework for research on multi-agent reinforcement learning, together with the implementation of the experiments in the paper "Shapley Q-value: A Local Reward Approach to Solve Global Reward Games".
☆120Nov 4, 2024Updated last year
Alternatives and similar repositories for SQDDPG
Users that are interested in SQDDPG are comparing it to the libraries listed below
- This repo is the implementation of the paper "SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning".☆51Dec 4, 2023Updated 2 years ago
- hsvgbkhgbv / Thermostat-assisted-continuously-tempered-Hamiltonian-Monte-Carlo-for-Bayesian-learning: Thermostat-assisted continuously-tempered Hamiltonian Monte Carlo for Bayesian learning☆10Dec 10, 2018Updated 7 years ago
- Code for ICLR 2019 paper: Learning when to Communicate at Scale in Multiagent Cooperative and Competitive Tasks☆226Oct 3, 2023Updated 2 years ago
- Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019☆785May 29, 2022Updated 3 years ago
- Submission for MAVEN: Multi-Agent Variational Exploration☆59Apr 6, 2022Updated 3 years ago
- ☆123Feb 16, 2023Updated 3 years ago
- Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch☆358Apr 1, 2019Updated 6 years ago
- Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)☆168Dec 8, 2022Updated 3 years ago
- Repo containing code for multi-agent deep reinforcement learning (MADRL).☆733Apr 12, 2023Updated 2 years ago
- ☆20Feb 15, 2023Updated 3 years ago
- PyTorch implementation of "Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control"☆53Dec 8, 2022Updated 3 years ago
- There will be updates later☆88May 13, 2019Updated 6 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆51Dec 8, 2022Updated 3 years ago
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆87Dec 8, 2022Updated 3 years ago
- A Multi-agent Learning Framework☆62May 10, 2021Updated 4 years ago
- MAGNet: Multi-agents control using Graph Neural Networks☆132Mar 28, 2019Updated 6 years ago
- Python Multi-Agent Reinforcement Learning framework☆2,157Dec 8, 2022Updated 3 years ago
- Some multi-agent environments from "Multi-agent Reinforcement Learning in Sequential Social Dilemmas" and "Value-Decomposition Networks For Co…☆130Jan 13, 2023Updated 3 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆132Aug 14, 2023Updated 2 years ago
- Repo for reproduction of sequential social dilemmas☆412Mar 6, 2025Updated 11 months ago
- Code for paper 'Learning transferable cooperative behaviors in multi-agent teams' (ICML 2019)☆116Dec 8, 2022Updated 3 years ago
- ☆20Sep 14, 2019Updated 6 years ago
- Neurosymbolic transformers for multi-agent communication.☆22Oct 22, 2020Updated 5 years ago
- Hello, I pushed some python environments for Multi Agent Reinforcement Learning.☆741May 23, 2022Updated 3 years ago
- Code repository for SARNet: Learning Multi-Agent Communication through Structured Attentive Reasoning (NeurIPS 2020)☆25Oct 30, 2021Updated 4 years ago
- PyTorch implementation of CommNet☆36Dec 2, 2017Updated 8 years ago
- This is the code for Q-value Path Decomposition for Deep Multiagent Reinforcement Learning (NeurIPS 2019).☆12May 20, 2019Updated 6 years ago
- Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario…☆1,722Sep 8, 2022Updated 3 years ago
- Multi-agent deep reinforcement learning for networked system control.☆440Sep 29, 2020Updated 5 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆151Apr 13, 2023Updated 2 years ago
- Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"☆2,724Apr 9, 2024Updated last year
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.☆269May 20, 2020Updated 5 years ago
- ☆78Jun 2, 2024Updated last year
- ☆18Oct 12, 2022Updated 3 years ago
- ☆12Aug 15, 2020Updated 5 years ago
- Code for "SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning" AAAI 2020☆26Dec 8, 2022Updated 3 years ago
- FLUIDS is a lightweight driving simulator for benchmarking Deep Reinforcement and Imitation learning algorithms.☆24May 3, 2019Updated 6 years ago
- DGN Code☆363Mar 25, 2023Updated 2 years ago
- Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling"☆72Aug 18, 2016Updated 9 years ago