krocki / mcts_mpi
GPU Monte Carlo Tree Search with MPI
☆26Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for mcts_mpi
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆110Updated 3 years ago
- ☆14Updated 8 years ago
- Reproducing results from DeepMind's paper on Population Based Training of Neural Networks.☆56Updated 6 years ago
- Reinforcement Learning Assembly☆92Updated 3 years ago
- A python implemenation of tabular MuZero for educational purposes☆21Updated 4 years ago
- ☆65Updated 3 years ago
- Paper: Challenges in High-dimensional Reinforcement Learning with Evolution Strategies☆26Updated 2 years ago
- C51-DDQN in Keras☆125Updated 7 years ago
- An implementation of the AlphaZero algorithm for chess☆34Updated last year
- A framework for easy prototyping of distributed reinforcement learning algorithms☆95Updated 3 years ago
- High-quality implementations of deep reinforcement learning algorithms for experiments☆51Updated 2 months ago
- A reinforcement learning framework☆154Updated 5 years ago
- A simple implementation of MuZero algorithm for connect4 game☆95Updated 4 years ago
- Demo of UCT (MCTS) in Python / Numpy☆83Updated last year
- Keeping track of RL experiments☆159Updated last year
- Solving the Rubik's cube with deep reinforcement learning and Monte Carlo tree search☆95Updated 5 years ago
- Policy gradient reinforcement learning algorithm with importance sampling☆31Updated 7 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆78Updated last year
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games☆112Updated 3 months ago
- C++ implementation of Proximal Policy Optimization☆73Updated 2 years ago
- PyTorch RL for Pommerman☆38Updated 6 years ago
- Scalable Implementation of Neural Fictitous Self-Play☆73Updated 5 years ago
- A Repository with C++ implementations of Reinforcement Learning Algorithms (Pytorch)☆91Updated 5 years ago
- Highly Modular and Scalable Reinforcement Learning☆114Updated 4 years ago
- Ape-X DQN & DDPG with pytorch & tensorboard☆101Updated 5 years ago
- Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.☆151Updated last year
- This repo replicates the results Horgan et al obtained in "Distributed Prioritized Experience Replay"☆190Updated 5 years ago
- Actor-critic with experience replay☆252Updated 2 years ago