awni / backgammon
Backgammon Game and RL Agents (based on TDGammon)
☆16Updated 7 years ago
Related projects
Alternatives and complementary repositories for backgammon
- Implementation of TD-Gammon in TensorFlow.☆110Updated 5 years ago
- Temporal Difference Learning based Backgammon game using Neural Network based model☆11Updated 6 years ago
- Board game AI implementations using Monte Carlo Tree Search☆183Updated 4 years ago
- The released model of the paper 'Automatic Bridge Bidding by Deep Reinforcement Learning' in ECAI 2016☆19Updated 7 years ago
- This is the code for the "How to Beat Pong Using Policy Gradients (LIVE)" by Siraj Raval on Youtube☆62Updated 7 years ago
- A video game description language (VGDL) built on top of pygame.☆152Updated 5 years ago
- Reinforcement learning algorithms to play Poker☆15Updated 2 years ago
- random search, hill climbing, policy gradient☆140Updated 6 years ago
- Combining deep learning and reinforcement learning.☆81Updated 3 years ago
- RlGlue code library used in the RL specialization on Coursera.☆25Updated 11 months ago
- A collection of Deep Reinforcement Learning algorithms implemented in tensorflow. Very extensible. High performing DQN implementation.☆30Updated 7 years ago
- SpielViz is an interactive viewer for OpenSpiel games.☆28Updated 6 months ago
- Hands-on Deep Reinforcement Learning, published by Packt☆67Updated last year
- This is the code for "Actor Critic Algorithms" by Siraj Raval on Youtube☆75Updated 6 years ago
- This package allows PLE to be used as a gym environment.☆72Updated 4 years ago
- Chainer implementation of Double Deep Q-Network (Double DQN)☆27Updated 8 years ago
- Demo of UCT (MCTS) in Python / Numpy☆83Updated last year
- Bandits Environments for the OpenAI Gym☆89Updated 4 years ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.☆32Updated 5 years ago
- Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain …☆84Updated 8 years ago
- A Python implementation of tabular MuZero for educational purposes☆21Updated 4 years ago
- A library to implement counterfactual regret minimization on various abstract strategy games☆16Updated 5 years ago
- Implementation of prioritized experience replay☆156Updated 6 years ago
- Source code of the MaastCTS2 agent for General Video Game playing. Champion of the 2016 GVG-AI Single-Player Track, and runner-up of the …☆14Updated 3 years ago
- ☆65Updated 3 years ago
- Our NIPS 2017: Learning to Run source code☆56Updated last year
- Non-stationary bandit for experiments with Reinforcement Learning☆34Updated 7 years ago
- A2C for GVG-AI☆21Updated 6 years ago