khpeek / Q-learning-Tic-Tac-Toe
Reinforcement learning of the game of Tic Tac Toe in Python
☆58Updated 7 years ago
Alternatives and similar repositories for Q-learning-Tic-Tac-Toe:
Users that are interested in Q-learning-Tic-Tac-Toe are comparing it to the libraries listed below
- Collection of Deep Reinforcement Learning algorithms☆298Updated 5 years ago
- Accompanying repository for Let's make a DQN / A3C series.☆395Updated 6 years ago
- DQN implementation in Keras + TensorFlow + OpenAI Gym☆158Updated 6 years ago
- Reinforcement Learning in Keras on VizDoom☆146Updated 7 years ago
- Connect4 reinforcement learning by AlphaGo Zero methods.☆113Updated 3 years ago
- Contains Jupyter notebooks associated with the "Deep Reinforcement Learning Tutorial" tutorial given at the O'Reilly 2017 NYC AI Conferen…☆274Updated 5 years ago
- Gym - 32 levels of original Super Mario Bros☆287Updated 6 years ago
- Lua/Torch implementation of DQN (Nature, 2015)☆591Updated 7 years ago
- A TensorFlow based implementation of the DeepMind Atari playing "Deep Q Learning" agent that works reasonably well☆92Updated 7 years ago
- Using Keras and Deep Deterministic Policy Gradient to play TORCS☆719Updated 7 years ago
- Simple deep Q-learning agent.☆698Updated 5 years ago
- An experimentation framework for Reinforcement Learning using OpenAI Gym, Tensorflow, and Keras.☆326Updated 6 years ago
- Half Field Offense in Robocup 2D Soccer☆229Updated 2 years ago
- Reaver: Modular Deep Reinforcement Learning Framework. Focused on StarCraft II. Supports Gym, Atari, and MuJoCo.☆555Updated 4 years ago
- Implementations of Reinforcement Learning Models in Tensorflow☆487Updated 7 years ago
- Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)☆401Updated 7 years ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package☆266Updated 5 years ago
- Minimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras☆160Updated 5 years ago
- Hacks for training RL systems from John Schulman's lecture at Deep RL Bootcamp (Aug 2017)☆1,105Updated 7 years ago
- Persistent advantage learning dueling double DQN for the Arcade Learning Environment☆265Updated 6 years ago
- Multiagent Cooperation and Competition with Deep Reinforcement Learning☆125Updated 9 years ago
- Basic DQN implementation☆222Updated 7 years ago
- A Deep Learning AI for 2048 (2048:94.15%, 4096:78.48%, 8192: 34.5% 16384: 0.177%)☆149Updated 6 years ago
- A reinforcement learning agent for tic-tac-toe. Implements the example from Chapter 1 of Sutton and Barto.☆51Updated 6 years ago
- Neural Networks For Playing Pong☆77Updated 8 years ago
- random search, hill climbing, policy gradient☆141Updated 6 years ago
- Asynchronous Methods for Deep Reinforcement Learning☆591Updated 6 years ago
- TensorFlow implementation of the DDPG algorithm from the paper Continuous Control with Deep Reinforcement Learning (ICLR 2016)☆212Updated 6 years ago
- Simplest Version of playing Atari with Deep Q Learning in Tensorflow☆159Updated 7 years ago
- Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.☆151Updated last year