brilee / python_uct
Demo of UCT (MCTS) in Python / Numpy
☆85Updated 2 years ago
Alternatives and similar repositories for python_uct:
Users that are interested in python_uct are comparing it to the libraries listed below
- Code for the paper "Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition"☆37Updated 5 years ago
- An implementation of Monte Carlo Tree Search in python☆162Updated 4 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆116Updated 3 years ago
- ☆67Updated 3 years ago
- A framework for easy prototyping of distributed reinforcement learning algorithms☆95Updated 4 years ago
- TensorFlow & Keras implementation of DQN with HER (Hindsight Experience Replay)☆40Updated 4 years ago
- Library for running a Monte Carlo tree search, either traditionally or with expert policies☆122Updated 11 months ago
- RainBow, Tensorflow☆49Updated 6 years ago
- A Multi-agent Learning Framework☆62Updated 3 years ago
- Atari - Deep Reinforcement Learning algorithms in TensorFlow☆135Updated 11 months ago
- Board game AI implementations using Monte Carlo Tree Search☆183Updated 4 years ago
- An implement of DQfD(Deep Q-learning from Demonstrations) raised by DeepMind:Learning from Demonstrations for Real World Reinforcement Le…☆133Updated 7 years ago
- High-quality implementations of deep reinforcement learning algorithms for experiments☆51Updated 6 months ago
- Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym☆176Updated 7 years ago
- Fictitious Self-play & Reinforcement Learning☆18Updated 7 years ago
- RUDDER: Return Decomposition for Delayed Rewards☆47Updated 4 years ago
- Efficient Exploration through Bayesian Deep Q-Networks☆37Updated 7 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆103Updated 2 weeks ago
- Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.☆150Updated last year
- A python implemenation of tabular MuZero for educational purposes☆21Updated 5 years ago
- PyTorch implementation of Proximal Policy Optimization☆51Updated 7 years ago
- Implementation of clipped action policy gradient (CAPG) with PPO and TRPO☆31Updated 6 years ago
- A collection of multi-agent reinforcement learning OpenAI gym environments☆45Updated 4 years ago
- ☆27Updated 5 years ago
- PyTorch RL for Pommerman☆38Updated 6 years ago
- some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Co…☆129Updated 2 years ago
- Combining Evolutionary Algorithms and deep RL in various ways☆102Updated 4 years ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆10Updated 6 years ago
- MAGNet: Multi-agents control using Graph Neural Networks☆131Updated 5 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆94Updated 6 years ago