NeteaseFuxiRL / wuji
original source code of the ASE 2019 paper: Wuji: Automatic Online Combat Game Testing Using Evolutionary Deep Reinforcement Learning
☆27Updated 4 years ago
Alternatives and similar repositories for wuji
Users that are interested in wuji are comparing it to the libraries listed below
Sorting:
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…☆44Updated 4 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 5 years ago
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)☆23Updated 2 years ago
- Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆18Updated 3 years ago
- Environments with IC3Net paper☆13Updated 6 years ago
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆42Updated 2 years ago
- A Multi-agent Learning Framework☆62Updated 4 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆45Updated 6 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆83Updated 4 years ago
- Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems☆31Updated 6 years ago
- Unified Model-Free Hierarchical Reinforcement Learning Framework☆37Updated 6 years ago
- FEN Code☆37Updated 5 years ago
- Implementation of CoDAIL in the ICLR 2020 paper <Multi-Agent Interactions Modeling with Correlated Policies>☆18Updated 3 years ago
- ☆18Updated 6 years ago
- Hierarchical Deep RL Network☆31Updated 8 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆50Updated 8 months ago
- Reinforcement Learning and Transfer Learning based StarCraft Micromanagement☆46Updated 7 years ago
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.☆17Updated 5 years ago
- Safe Option-Critic: Learning Safety in the Option-Critic Architecture☆20Updated 6 years ago
- Meta-Inverse Reinforcement Learning with Probabilistic Context Variables☆73Updated 2 years ago
- Experiments to train transformer network to master reinforcement learning environments.☆32Updated 4 years ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)☆33Updated 5 years ago
- ☆22Updated 2 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019☆47Updated 5 years ago
- ☆97Updated 4 years ago
- Single Episode Policy Transfer in Reinforcement Learning☆17Updated 2 years ago
- ☆4Updated 5 months ago
- ☆32Updated 2 years ago
- ☆44Updated 2 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆131Updated last year