liuanji / WU-UCT
A novel parallel UCT algorithm with linear speedup and negligible performance loss.
☆118Updated 3 years ago
Alternatives and similar repositories for WU-UCT:
Users that are interested in WU-UCT are comparing it to the libraries listed below
- Keeping track of RL experiments☆162Updated 2 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆103Updated last month
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆131Updated last year
- ☆128Updated 8 months ago
- A Multi-agent Learning Framework☆62Updated 3 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆94Updated 6 years ago
- ☆143Updated 4 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆110Updated 7 months ago
- Soft Actor-Critic☆144Updated 7 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆167Updated 3 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 5 years ago
- A framework for easy prototyping of distributed reinforcement learning algorithms☆95Updated 4 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆83Updated 3 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆50Updated 7 months ago
- Pytorch implementation of distributed deep reinforcement learning☆76Updated 2 years ago
- ☆74Updated 10 months ago
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆46Updated last year
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆43Updated 2 years ago
- Explorer is a PyTorch reinforcement learning framework for exploring new ideas.☆92Updated 3 weeks ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆198Updated 2 years ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…☆44Updated 4 years ago
- Combining Evolutionary Algorithms and deep RL in various ways☆102Updated 4 years ago
- ☆194Updated 2 years ago
- ☆4Updated 4 months ago
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆73Updated 2 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019☆47Updated 5 years ago
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)☆70Updated last year
- Code for MOPO: Model-based Offline Policy Optimization☆177Updated 2 years ago
- ☆92Updated 4 years ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration☆68Updated 3 years ago