awni / backgammon
Backgammon Game and RL Agents (based on TDGammon)
☆16Updated 7 years ago
Alternatives and similar repositories for backgammon:
Users that are interested in backgammon are comparing it to the libraries listed below
- Implementation of TD-Gammon in TensorFlow.☆111Updated 5 years ago
- Reinforcement learning in python☆36Updated 5 years ago
- random search, hill climbing, policy gradient☆140Updated 6 years ago
- Open AI Gym version of Berkeley AI Pacman with images as states☆13Updated 6 years ago
- Awesome RL: Papers, Books, Codes, Benchmarks☆116Updated last year
- Our NIPS 2017: Learning to Run source code☆55Updated last year
- An RL-trained Backgammon agent☆17Updated 5 years ago
- Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.☆150Updated last year
- ☆133Updated 6 years ago
- Implementation for ACER in tensorflow and sonnet by deepmind☆11Updated 7 years ago
- This is the code for the "How to Beat Pong Using Policy Gradients (LIVE)" by Siraj Raval on Youtube☆66Updated 7 years ago
- Materials for the Practical Sessions of the Reinforcement Learning Summer School 2019: Bandits, RL & Deep RL (PyTorch).☆88Updated 5 years ago
- ☆102Updated 5 years ago
- C51-DDQN in Keras☆125Updated 7 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆146Updated last year
- A simple stochastic OpenAI environment for training RL agents☆89Updated last year
- Reinforcement learning on gridworld with Q-learning☆9Updated 8 years ago
- ☆66Updated 3 years ago
- Implementation of Double Deep Q Networks and Dueling Q Networks using Keras on Space Invaders using OpenAI Gym. Code can be easily genera…☆37Updated 6 years ago
- Collection of Deep Reinforcement Learning algorithms☆298Updated 5 years ago
- A TensorFlow based implementation of the DeepMind Atari playing "Deep Q Learning" agent that works reasonably well☆92Updated 7 years ago
- An environment of the board game Go using OpenAI's Gym API☆168Updated 2 years ago
- This is the code for "Actor Critic Algorithms" by Siraj Raval on Youtube☆75Updated 7 years ago
- Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior"☆83Updated 5 years ago
- RL experiments☆70Updated 2 years ago
- This package allows to use PLE as a gym environment.☆72Updated 4 years ago
- Reproducing results from DeepMind's paper on Population Based Training of Neural Networks.☆56Updated 6 years ago
- Contains Jupyter notebooks associated with the "Deep Reinforcement Learning Tutorial" tutorial given at the O'Reilly 2017 NYC AI Conferen…☆273Updated 5 years ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.☆31Updated 5 years ago
- Reinforcement Learning with TensorFlow, published by Packt☆41Updated 2 years ago