rockingdingo / gym-gomoku
OpenAI Gym Env for game Gomoku(Five-In-a-Row, 五子棋, 五目並べ, omok, Gobang,...)
☆85Updated 2 weeks ago
Related projects: ⓘ
- Simplest Version of playing Atari with Deep Q Learning in Tensorflow☆160Updated 6 years ago
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku☆185Updated 4 years ago
- implement of prioritized experience replay☆156Updated 6 years ago
- A student implementation of Alpha Go Zero☆276Updated 6 years ago
- Congratulation to DeepMind! This is a reengineering implementation (on behalf of many other git repo in /support/) of DeepMind's Oct19th …☆340Updated last year
- DQN implementation in Keras + TensorFlow + OpenAI Gym☆158Updated 6 years ago
- This is a simple implementation of DeepMind's PySC2 RL agents.☆271Updated 6 years ago
- Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286☆180Updated 6 years ago
- This is the code for the "How to Beat Pong Using Policy Gradients (LIVE)" by Siraj Raval on Youtube☆63Updated 7 years ago
- Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advanta…☆170Updated 5 years ago
- Board game AI implementations using Monte Carlo Tree Search☆181Updated 4 years ago
- advantage actor-critic reinforcement learning for openai gym cartpole☆64Updated 7 years ago
- Connect4 reinforcement learning by AlphaGo Zero methods.☆114Updated 3 years ago
- Actor-critic with experience replay☆251Updated last year
- C51-DDQN in Keras☆125Updated 6 years ago
- Collection of Deep Reinforcement Learning algorithms☆122Updated 7 years ago
- A pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.☆160Updated 5 years ago
- ☆69Updated 5 years ago
- A continuous action space version of A3C LSTM in pytorch plus A3G design☆258Updated 5 months ago
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow☆182Updated 5 years ago
- An Python N-in-Row game based on Monte Carlo Tree Search and UCT RAVE☆50Updated 7 years ago
- some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Co…☆129Updated last year
- Repository for codes of 'Deep Reinforcement Learning'☆214Updated 4 years ago
- Accompanying repository for Let's make a DQN / A3C series.☆391Updated 6 years ago
- Play flappy bird with DQN, a demo for reinforcement learning, implemented using PyTorch☆68Updated 7 years ago
- Atari - Deep Reinforcement Learning algorithms in TensorFlow☆137Updated 5 months ago
- ICML 2018 Self-Imitation Learning☆274Updated 4 years ago
- Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)☆366Updated 5 years ago
- ☆135Updated 3 years ago
- RainBow, Tensorflow☆49Updated 6 years ago