liuanji / WU-UCTLinks
A novel parallel UCT algorithm with linear speedup and negligible performance loss.
☆120Updated 4 years ago
Alternatives and similar repositories for WU-UCT
Users that are interested in WU-UCT are comparing it to the libraries listed below
Sorting:
- Keeping track of RL experiments☆162Updated 2 years ago
- ☆111Updated 5 years ago
- ☆144Updated 8 months ago
- QuaRL is an open-source framework for systematically studying the effect of applying quantization to reinforcement learning algorithms.☆72Updated 2 years ago
- ☆132Updated last year
- A framework for easy prototyping of distributed reinforcement learning algorithms☆96Updated 4 years ago
- SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning☆127Updated 4 years ago
- Code for the paper "Phasic Policy Gradient"☆262Updated 2 years ago
- Random Network Distillation pytorch☆251Updated 6 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆130Updated last year
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning☆102Updated 3 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆103Updated 5 months ago
- Pytorch Implementation of MuZero☆353Updated 2 years ago
- Explorer is a PyTorch reinforcement learning framework for exploring new ideas.☆95Updated last month
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 5 years ago
- Random Network Distillation(RND) algo in Pytorch☆50Updated 6 years ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆205Updated 2 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆46Updated 6 years ago
- Pytorch implementation of Soft Actor-Critic☆20Updated 5 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆149Updated 2 years ago
- Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction☆161Updated 5 years ago
- A Multi-agent Learning Framework☆62Updated 4 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆88Updated 4 years ago
- Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆190Updated 2 years ago
- Multi Task RL Baselines☆246Updated 3 years ago
- PyTorch implementation of FQF, IQN and QR-DQN.☆182Updated last year
- ☆106Updated 4 years ago
- PyTorch implementation of Never Give Up: Learning Directed Exploration Strategies☆59Updated 4 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆114Updated 11 months ago
- PyTorch implementation of our paper Real-Time Reinforcement Learning (NeurIPS 2019)☆74Updated 5 years ago