brilee / python_uct
Demo of UCT (MCTS) in Python / Numpy
☆85Updated 2 years ago
Alternatives and similar repositories for python_uct:
Users that are interested in python_uct are comparing it to the libraries listed below
- An implementation of Monte Carlo Tree Search in python☆162Updated 4 years ago
- High-quality implementations of deep reinforcement learning algorithms for experiments☆51Updated 5 months ago
- Board game AI implementations using Monte Carlo Tree Search☆183Updated 4 years ago
- Actor-critic with experience replay☆252Updated 2 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆115Updated 3 years ago
- C51-DDQN in Keras☆125Updated 7 years ago
- Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.☆150Updated last year
- Proximal policy optimization in PyTorch. Easy to read and understand.☆48Updated 4 years ago
- Code for the paper "Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition"☆37Updated 5 years ago
- RUDDER: Return Decomposition for Delayed Rewards☆47Updated 4 years ago
- Deep Recurrent Attention Reinforcement Learning in Atari☆83Updated 6 years ago
- hierarchical deep reinforcement learning algorithms☆41Updated 7 years ago
- Fictitious Self-play & Reinforcement Learning☆18Updated 7 years ago
- Bandits Environments for the OpenAI Gym☆90Updated 5 years ago
- Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym☆175Updated 6 years ago
- A python implemenation of tabular MuZero for educational purposes☆21Updated 5 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆135Updated 6 years ago
- research and implementations of Deep RL agents and their applications☆49Updated 2 weeks ago
- ☆66Updated 3 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆146Updated last year
- Efficient Exploration through Bayesian Deep Q-Networks☆37Updated 7 years ago
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow☆58Updated 6 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019☆47Updated 5 years ago
- Proximal Policy Optimization implementation with TensorFlow☆105Updated 6 years ago
- Highly Modular and Scalable Reinforcement Learning☆114Updated 5 years ago
- TensorFlow & Keras implementation of DQN with HER (Hindsight Experience Replay)☆40Updated 4 years ago
- My implementation of the Proximal Policy Optisation algorithm using Keras as a backend☆88Updated 5 years ago
- RainBow, Tensorflow☆49Updated 6 years ago
- implement of prioritized experience replay☆159Updated 6 years ago
- Open AI gym environment for the game 2048☆71Updated 2 years ago