roop-pal / Meta-Learning-for-StarCraft-II-Minigames
We reproduced DeepMind's results and implement a meta-learning (MLSH) agent which can generalize across minigames.
☆26Updated 3 years ago
Related projects: ⓘ
- My internship project in 𝖢𝖠𝖲𝖨𝖠. 🤗☆1Updated 5 years ago
- Multi-Agent Determinantal Q-Learning☆41Updated last year
- ☆18Updated 3 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆25Updated 4 years ago
- pytorch implementation of "Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control"☆50Updated last year
- ☆18Updated 4 years ago
- ☆43Updated last year
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆43Updated 3 weeks ago
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)☆23Updated last year
- ☆38Updated this week
- N-Layered FeUdal Networks based on FeUdal Networks adapted to suit PySC2 observations☆15Updated 5 years ago
- Deep learning implementations (Asynchronous Deep Q-Learning) of multiple Game Theory algorithms for adversarial learning (WoLF-PHC, GIGA-…☆15Updated 7 years ago
- Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems☆31Updated 5 years ago
- FEN Code☆36Updated 4 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆44Updated 5 years ago
- Value-Decomposition Multi-Agent Actor-Critics☆39Updated last year
- ☆21Updated 5 years ago
- Code for "SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning" AAAI 2020☆26Updated last year
- There will be updates later☆79Updated 5 years ago
- Learning Individual Intrinsic Reward in MARL☆62Updated last year
- ☆12Updated 4 years ago
- Submission for MAVEN: Multi-Agent Variational Exploration☆57Updated 2 years ago
- ☆25Updated 6 years ago
- Code for "Coordinated Exploration via Intrinsic Rewards for Multi-Agent Reinforcement Learning"☆32Updated 3 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)☆41Updated 5 years ago
- ☆45Updated 5 years ago
- A Pytorch Implementation of Multi Agent Soft Actor Critic☆34Updated 5 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆51Updated last year
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆91Updated 2 years ago
- MADDPG in Ray/RLlib☆50Updated 4 years ago