shakedzy / tic_tac_toe
Teaching the computer to play Tic Tac Toe using Deep Q Networks
☆26Updated 4 years ago
Related projects: ⓘ
- Reinforcement Learning in Keras on VizDoom☆146Updated 6 years ago
- A reinforcement learning framework☆154Updated 5 years ago
- A collection of python Machine Learning articles and examples. You will find code related to Reinforcement Learning, Q Learning, MDP, Bel…☆185Updated last year
- Source code for OpenAI Retro Contest for Sonic the Hedgehog☆30Updated 6 years ago
- A simple stochastic OpenAI environment for training RL agents☆89Updated last year
- Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow☆101Updated 4 years ago
- Evolving deep neural network agents using Genetic Algorithms☆66Updated 5 years ago
- Gym - Doom environments based on VizDoom.☆102Updated 7 years ago
- C51-DDQN in Keras☆125Updated 6 years ago
- A packaged and slightly-modified version of https://github.com/bbitmaster/ale_python_interface☆40Updated 5 years ago
- DQN, DDDQN, A3C, PPO, Curiosity applied to the game DOOM☆82Updated 3 years ago
- PySC2 OpenAI Gym Environments☆48Updated 5 years ago
- NEAT implementation for Flappy Bird game☆23Updated 4 years ago
- Web-based Reinforcement Learning Control Center☆64Updated 8 years ago
- safemutations☆143Updated 6 years ago
- RLtime is a reinforcement learning library focused on state-of-the-art q-learning algorithms and features☆138Updated 4 years ago
- Value & Policy Iteration for the frozenlake environment of OpenAI☆15Updated 5 years ago
- random search, hill climbing, policy gradient☆138Updated 6 years ago
- Deep reinforcement learning using an asynchronous advantage actor-critic (A3C) model.☆66Updated 6 years ago
- tensorflow implementation of Andrej Karpathy's blog about reinforcement learning. http://karpathy.github.io/2016/05/31/rl/☆31Updated 3 years ago
- Augmented environments with RL☆102Updated 5 years ago
- Highly Modular and Scalable Reinforcement Learning☆113Updated 4 years ago
- World Models applied to the Open AI Sonic Retro Contest☆77Updated 6 years ago
- My implementation of the Proximal Policy Optisation algorithm using Keras as a backend☆88Updated 4 years ago
- Codes of our team for the OpenAI Retro Contest of reinforcement learning☆100Updated 6 years ago
- Bandits Environments for the OpenAI Gym☆88Updated 4 years ago
- TensorFlow implementation of asynchronous advantage actor-critic (A3C)☆39Updated 2 years ago
- Snake-like openAI gym environment, both single and multiagent☆33Updated 3 years ago
- This is the code for "Actor Critic Algorithms" by Siraj Raval on Youtube☆73Updated 6 years ago
- A binary release of trained deep reinforcement learning models trained in the Atari machine learning benchmark, and a software release th…☆201Updated 4 years ago