shakedzy / tic_tac_toeLinks
Teaching the computer to play Tic Tac Toe using Deep Q Networks
☆28Updated 5 years ago
Alternatives and similar repositories for tic_tac_toe
Users that are interested in tic_tac_toe are comparing it to the libraries listed below
Sorting:
- Reinforcement Learning in Keras on VizDoom☆142Updated 8 years ago
- Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow☆103Updated 5 years ago
- Coding Demos from the School of AI's Move37 Course☆183Updated 6 years ago
- An implementation of (Double/Dueling) Deep-Q Learning to play Super Mario Bros.☆73Updated 4 years ago
- Monte Carlo Tree Search Based AI Connect 4 Bot☆40Updated 6 years ago
- Multi Agent Reinforcement Learning using MalmÖ☆262Updated 5 years ago
- Solutions to the Deep RL Bootcamp labs☆43Updated 8 years ago
- A collection of python Machine Learning articles and examples. You will find code related to Reinforcement Learning, Q Learning, MDP, Bel…☆190Updated 2 years ago
- Tensorflow implementation of Deepminds dqn with double dueling networks☆216Updated 5 years ago
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"☆329Updated 3 years ago
- Deep Q Learning via Pytorch☆86Updated 7 years ago
- TensorFlow implementation of asynchronous advantage actor-critic (A3C)☆39Updated 4 years ago
- Gym - 32 levels of original Super Mario Bros☆290Updated 6 years ago
- Deep reinforcement learning model implementation in Tensorflow + OpenAI gym☆304Updated 2 years ago
- PPO Dash: Improving Generalization in Deep Reinforcement Learning☆16Updated 6 years ago
- My implementation of the Proximal Policy Optisation algorithm using Keras as a backend☆88Updated 5 years ago
- C51-DDQN in Keras☆126Updated 7 years ago
- Reinforcement Learning using Policy Gradient to solve OpenAI Gym games☆112Updated 7 years ago
- An implementation of the ideas from this paper https://arxiv.org/pdf/1803.10122.pdf☆283Updated 2 years ago
- Deep learning and Reinforcement learning lecture and course work☆100Updated 7 years ago
- Snake-like openAI gym environment, both single and multiagent☆34Updated 5 years ago
- ☆106Updated 5 years ago
- DQN, DDDQN, A3C, PPO, Curiosity applied to the game DOOM☆87Updated 4 years ago
- A SpaceX Rocket Lander environment for OpenAI gym using Box2D☆303Updated 4 years ago
- OpenAI's cartpole env solver.☆156Updated 2 years ago
- A binary release of trained deep reinforcement learning models trained in the Atari machine learning benchmark, and a software release th…☆202Updated 5 years ago
- ☆305Updated 2 years ago
- Solving OpenAI Gym problems.☆187Updated 4 years ago
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"☆206Updated 6 years ago
- Code for the paper "Emergent Complexity via Multi-agent Competition"☆825Updated 2 years ago