shakedzy / tic_tac_toe
Teaching the computer to play Tic Tac Toe using Deep Q Networks
☆26 Updated 5 years ago
Related projects
Alternatives and complementary repositories for tic_tac_toe
- Implementation of Deep/Double Deep/Dueling Deep Q networks for playing Atari games using Keras and OpenAI gym ☆40 Updated 6 years ago
- Reinforcement Learning in Keras on VizDoom ☆146 Updated 7 years ago
- C51-DDQN in Keras ☆125 Updated 7 years ago
- A reinforcement learning framework ☆154 Updated 5 years ago
- Gym - Doom environments based on VizDoom. ☆102 Updated 7 years ago
- Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow ☆101 Updated 4 years ago
- Source code for OpenAI Retro Contest for Sonic the Hedgehog ☆30 Updated 6 years ago
- A simple stochastic OpenAI environment for training RL agents ☆89 Updated last year
- Highly Modular and Scalable Reinforcement Learning ☆114 Updated 4 years ago
- Evolving deep neural network agents using Genetic Algorithms ☆66 Updated 5 years ago
- RL experiments ☆69 Updated 2 years ago
- A binary release of trained deep reinforcement learning models trained in the Atari machine learning benchmark, and a software release th… ☆201 Updated 4 years ago
- Code for our paper "Visualizing and Understanding Atari Agents" (https://goo.gl/AMAoSc) ☆119 Updated 3 years ago
- Neuroevolution as a direct policy search deep reinforcement learning method, implemented using Keras and DEAP. ☆70 Updated 3 years ago
- A collection of python Machine Learning articles and examples. You will find code related to Reinforcement Learning, Q Learning, MDP, Bel… ☆186 Updated 2 years ago
- PySC2 OpenAI Gym Environments ☆48 Updated 5 years ago
- Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc. ☆151 Updated last year
- A fast Evolution Strategy implementation in Python ☆270 Updated 4 years ago
- Keras implementation of Deep Learning papers ☆94 Updated 6 years ago
- Monte Carlo Tree Search Based AI Connect 4 Bot ☆39 Updated 5 years ago
- A Tensorflow implementation of GAN, WGAN and WGAN with gradient penalty. ☆115 Updated 6 years ago
- An implementation of (Double/Dueling) Deep-Q Learning to play Super Mario Bros. ☆70 Updated 3 years ago
- My implementation of the Proximal Policy Optimisation algorithm using Keras as a backend ☆88 Updated 5 years ago
- Proximal Policy Optimization implementation with TensorFlow ☆104 Updated 6 years ago
- Augmented environments with RL ☆102 Updated 5 years ago
- [ICML-18] Codes for the custom games we built to compare RL agents with humans ☆67 Updated 6 years ago
- A simple Gridworld environment for Open AI gym ☆24 Updated 6 years ago
- Bandits Environments for the OpenAI Gym ☆89 Updated 4 years ago
- Deep reinforcement learning using an asynchronous advantage actor-critic (A3C) model. ☆66 Updated 6 years ago