hildensia / mcts
An implementation of Monte Carlo Tree Search in Python
☆163 Updated 5 years ago
Alternatives and similar repositories for mcts
Users who are interested in mcts are comparing it to the libraries listed below:
- Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc. ☆151 Updated 2 years ago
- Noisy Networks for Exploration ☆186 Updated 7 years ago
- NIPS 2017 Value Prediction Network ☆166 Updated 7 years ago
- A PyTorch implementation of Rainbow DQN agent ☆170 Updated 7 years ago
- Board game AI implementations using Monte Carlo Tree Search ☆184 Updated 5 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters" ☆153 Updated 8 years ago
- PyTorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286 ☆184 Updated 7 years ago
- C51-DDQN in Keras ☆126 Updated 7 years ago
- Implementations of deep RL papers and random experimentation ☆177 Updated 7 years ago
- I am implementing a lot of reinforcement learning and imitation learning algorithms since I'm sick of reading about them but not really u… ☆52 Updated 5 years ago
- Actor-critic with experience replay ☆255 Updated 3 years ago
- Implementation of prioritized experience replay ☆159 Updated 7 years ago
- Open source implementation of the PAAC algorithm presented in Efficient Parallel Methods for Deep Reinforcement Learning ☆201 Updated 8 years ago
- ☆159 Updated 8 years ago
- Value Iteration Networks ☆289 Updated 8 years ago
- A platform of grid world that supports up to 1 million reinforcement-learning agents. ☆69 Updated 8 years ago
- ☆120 Updated 5 years ago
- "Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow ☆192 Updated 7 years ago
- PyTorch implementation of Value Iteration Networks (NIPS 2016 best paper) ☆318 Updated 5 years ago
- ☆79 Updated 7 years ago
- Deep Attention Recurrent Q-Network ☆115 Updated 9 years ago
- This package allows PLE to be used as a gym environment. ☆72 Updated 5 years ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package ☆267 Updated 6 years ago
- Asynchronous advantage actor-critic (A3C) algorithms and Progressive Neural Networks implemented in TensorFlow. ☆120 Updated 9 years ago
- TensorFlow implementation of the DDPG algorithm from the paper Continuous Control with Deep Reinforcement Learning (ICLR 2016) ☆215 Updated 7 years ago
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning (C51-DQN). ☆57 Updated 8 years ago
- Yet another prioritized experience replay buffer implementation. ☆48 Updated 3 years ago
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow ☆182 Updated 6 years ago
- This repo is intended as an extension for OpenAI Gym for auxiliary tasks (multitask learning, transfer learning, inverse reinforcement le… ☆218 Updated 6 years ago
- This repo replicates the results Horgan et al. obtained in "Distributed Prioritized Experience Replay" ☆190 Updated 6 years ago