liuanji / WU-UCTLinks
A novel parallel UCT algorithm with linear speedup and negligible performance loss.
☆122Updated 4 years ago
Alternatives and similar repositories for WU-UCT
Users that are interested in WU-UCT are comparing it to the libraries listed below
Sorting:
- Keeping track of RL experiments☆163Updated 2 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 5 years ago
- ☆131Updated last year
- A framework for easy prototyping of distributed reinforcement learning algorithms☆96Updated 4 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆130Updated 2 years ago
- ☆146Updated 10 months ago
- ☆113Updated 5 years ago
- Random Network Distillation(RND) algo in Pytorch☆50Updated 6 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆103Updated 7 months ago
- Combining Evolutionary Algorithms and deep RL in various ways☆105Updated 4 years ago
- Code for "Proximal Distilled Evolutionary Reinforcement Learning", accepted at AAAI 2020☆55Updated last year
- SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning☆128Updated 4 years ago
- Soft Actor-Critic☆155Updated 7 years ago
- QuaRL is an open-source framework for systematically studying the effect of applying quantization to reinforcement learning algorithms.☆70Updated 2 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆46Updated 6 years ago
- ☆91Updated 4 years ago
- A Multi-agent Learning Framework☆62Updated 4 years ago
- Explorer is a PyTorch reinforcement learning framework for exploring new ideas.☆95Updated 3 months ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…☆44Updated 4 years ago
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning☆102Updated 3 years ago
- Code for the paper "Phasic Policy Gradient"☆265Updated 2 years ago
- Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆193Updated 2 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆91Updated 4 years ago
- PyTorch RL for Pommerman☆38Updated 7 years ago
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"☆161Updated 3 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆115Updated last year
- Pytorch implementation of Soft Actor-Critic☆20Updated 5 years ago
- Pytorch implementation of distributed deep reinforcement learning☆76Updated 3 years ago
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆42Updated 3 years ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆205Updated 3 years ago