jbradberry / mcts
Board game AI implementations using Monte Carlo Tree Search
☆181Updated 4 years ago
Related projects: ⓘ
- An implementation of Monte Carlo Tree Search in python☆159Updated 3 years ago
- Implementations of deep RL papers and random experimentation☆177Updated 6 years ago
- Demo of UCT (MCTS) in Python / Numpy☆81Updated last year
- An implementation of 9x9 Tic Tac Toe☆77Updated 4 years ago
- implement of prioritized experience replay☆156Updated 6 years ago
- A TensorFlow based implementation of the DeepMind Atari playing "Deep Q Learning" agent that works reasonably well☆91Updated 7 years ago
- Multiagent Cooperation and Competition with Deep Reinforcement Learning☆123Updated 8 years ago
- Simplest Version of playing Atari with Deep Q Learning in Tensorflow☆160Updated 6 years ago
- Connect4 reinforcement learning by AlphaGo Zero methods.☆114Updated 3 years ago
- Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286☆180Updated 6 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters"☆153Updated 6 years ago
- Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling"☆71Updated 8 years ago
- ☆117Updated this week
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package☆265Updated 4 years ago
- Tensorflow implementation of deep Q networks in paper 'Playing Atari with Deep Reinforcement Learning'☆162Updated 7 years ago
- Basic DQN implementation☆219Updated 6 years ago
- ☆117Updated 4 years ago
- Value Iteration Networks☆287Updated 7 years ago
- An implementation of FeUdal Networks for Hierarchical Reinforcement Learning as published : https://arxiv.org/abs/1703.01161☆179Updated 6 years ago
- Convert sc2 environment to gym-atari and play some mini-games☆21Updated 6 years ago
- Minimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras☆159Updated 4 years ago
- Advantage async actor-critic Algorithms (A3C) and Progressive Neural Network implemented by tensorflow.☆121Updated 7 years ago
- A reinforcement learning framework☆154Updated 5 years ago
- Monte Carlo Tree Search with UCT with a couple of example games.☆151Updated 3 years ago
- TensorFlow implementation of the DDPG algorithm from the paper Continuous Control with Deep Reinforcement Learning (ICLR 2016)☆209Updated 6 years ago
- Collection of Deep Reinforcement Learning algorithms☆297Updated 5 years ago
- Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.☆150Updated last year
- Implementation of DDPG (Modified from the work of Patrick Emami) - Tensorflow (no TFLearn dependency), Ornstein Uhlenbeck noise function,…☆65Updated 7 years ago
- NIPS 2017 Value Prediction Network☆165Updated 6 years ago
- ☆99Updated 8 years ago