shakedzy / tic_tac_toe
Teaching the computer to play Tic Tac Toe using Deep Q Networks
☆27Updated 5 years ago
Alternatives and similar repositories for tic_tac_toe:
Users that are interested in tic_tac_toe are comparing it to the libraries listed below
- A simple stochastic OpenAI environment for training RL agents☆89Updated last year
- Gym - Doom environments based on VizDoom.☆102Updated 7 years ago
- Tensorflow implementation of Deepminds dqn with double dueling networks☆215Updated 4 years ago
- C51-DDQN in Keras☆125Updated 7 years ago
- Simple grid-world environment compatible with OpenAI-gym☆49Updated 4 years ago
- My implementation of the Proximal Policy Optisation algorithm using Keras as a backend☆88Updated 5 years ago
- Reinforcement Learning in Keras on VizDoom☆145Updated 7 years ago
- Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow☆102Updated 4 years ago
- Snake-like openAI gym environment, both single and multiagent☆33Updated 4 years ago
- An implementation of (Double/Dueling) Deep-Q Learning to play Super Mario Bros.☆70Updated 3 years ago
- A basic 2D maze environment where an agent start from the top left corner and try to find its way to the bottom left corner.☆357Updated last year
- A reinforcement learning framework☆154Updated 6 years ago
- Deep Q Learning via Pytorch☆86Updated 7 years ago
- PPO implementation for OpenAI gym environment based on Unity ML Agents☆148Updated 6 years ago
- ☆102Updated 5 years ago
- random search, hill climbing, policy gradient☆140Updated 6 years ago
- OpenAI's cartpole env solver.☆152Updated last year
- A binary release of trained deep reinforcement learning models trained in the Atari machine learning benchmark, and a software release th…☆201Updated 4 years ago
- This repo is intended as an extension for OpenAI Gym for auxiliary tasks (multitask learning, transfer learning, inverse reinforcement le…☆215Updated 5 years ago
- A PyTorch implementation of Rainbow DQN agent☆168Updated 6 years ago
- Generic reinforcement learning codebase in TensorFlow☆95Updated 3 years ago
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"☆197Updated 6 years ago
- Helper for NeurIPS 2018 Challenge: AI for Prosthetics☆39Updated 6 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL☆86Updated 6 years ago
- PPO Dash: Improving Generalization in Deep Reinforcement Learning☆16Updated 5 years ago
- ☆27Updated 3 years ago
- Highly Modular and Scalable Reinforcement Learning☆114Updated 5 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment☆102Updated 5 years ago
- Proximal Policy Optimization implementation with TensorFlow☆105Updated 6 years ago
- Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…☆132Updated 5 years ago