DRL-CASIA / GameAI-FightingAI
☆32Updated 4 years ago
Alternatives and similar repositories for GameAI-FightingAI:
Users that are interested in GameAI-FightingAI are comparing it to the libraries listed below
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆62Updated 6 years ago
- ☆32Updated 2 years ago
- ☆52Updated 6 years ago
- Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty☆56Updated 6 years ago
- Unified Model-Free Hierarchical Reinforcement Learning Framework☆37Updated 6 years ago
- ☆33Updated 7 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆26Updated 5 years ago
- Deep Reinforcement Learning with pytorch & visdom☆14Updated 4 years ago
- Submission for MAVEN: Multi-Agent Variational Exploration☆57Updated 3 years ago
- ☆19Updated 3 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆103Updated last month
- Learning Individual Intrinsic Reward in MARL☆62Updated 2 years ago
- Reinforcement learning with unsupervised auxiliary tasks☆23Updated 6 years ago
- There will be updates later☆84Updated 5 years ago
- CommNet and BiCnet implementation in tensorflow☆55Updated 6 years ago
- Multi-Agent Determinantal Q-Learning☆42Updated 2 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019☆47Updated 5 years ago
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)☆66Updated 5 years ago
- Transplant a implementation of MADDPG to the environment provided by openAI (multiagent-particle-envs).☆21Updated 4 years ago
- Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards☆32Updated 2 years ago
- Code repository for SARNet: Learning Multi-Agent Communication through Structured Attentive Reasoning (NeurIPS 2020)☆25Updated 3 years ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆86Updated 4 years ago
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)☆23Updated 2 years ago
- ☆120Updated 2 years ago
- ☆44Updated 2 years ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)☆55Updated 2 years ago
- A curated list of awesome Model-based reinforcement learning resources☆93Updated 4 years ago
- soft q learning and soft actor critic☆15Updated 6 years ago
- an implementation of CommNet☆32Updated 7 years ago
- The code for paper, "Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration", NeurIPS 2021.☆40Updated 2 years ago