wwxFromTju / maddpg-tf
use tensorflow to implement the MADDPG(simple_tag)
☆17Updated 6 years ago
Related projects: ⓘ
- Submission for MAVEN: Multi-Agent Variational Exploration☆57Updated 2 years ago
- pytorch implementation of "Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control"☆50Updated last year
- ☆21Updated 5 years ago
- ☆81Updated 2 years ago
- There will be updates later☆79Updated 5 years ago
- Code for "SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning" AAAI 2020☆26Updated last year
- CommNet and BiCnet implementation in tensorflow☆54Updated 6 years ago
- an implementation of CommNet☆29Updated 6 years ago
- Learning Individual Intrinsic Reward in MARL☆62Updated last year
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆91Updated 2 years ago
- ☆44Updated 3 years ago
- Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021☆61Updated 3 years ago
- ☆43Updated last year
- A Pytorch Implementation of Multi Agent Soft Actor Critic☆34Updated 5 years ago
- ☆87Updated 3 years ago
- ☆45Updated 5 years ago
- an implementation of ATOC☆13Updated 2 years ago
- Implementation of the paper Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation - https:/…☆79Updated 6 years ago
- ☆45Updated 4 years ago
- ☆39Updated 3 years ago
- Distributed Multi-Agent Cooperation Algorithm based on MADDPG with prioritized batch data.☆98Updated 3 years ago
- ☆40Updated 3 years ago
- ICML 2019 RL for Real Life Workshop: Recurrent MADDPG for Partially Observable and Limited Communication Settings☆39Updated 4 years ago
- ☆36Updated 2 years ago
- ☆24Updated 2 years ago
- Value-Decomposition Multi-Agent Actor-Critics☆39Updated last year
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆25Updated 4 years ago
- ☆70Updated 4 years ago
- Codes for Paper "Delay-Aware Multi-Agent Reinforcement Learning".☆49Updated 4 years ago
- Multi-Agent Determinantal Q-Learning☆41Updated last year