cody2007 / alpha_go_zero_implementation
An implementation of the Alpha Go Zero algorithm, runnable on a single GPU
☆49 · Updated 4 years ago
Alternatives and similar repositories for alpha_go_zero_implementation:
Users who are interested in alpha_go_zero_implementation are comparing it to the libraries listed below:
- A student implementation of Alpha Go Zero ☆280 · Updated 6 years ago
- Connect4 reinforcement learning by AlphaGo Zero methods. ☆114 · Updated 4 years ago
- ☆67 · Updated 3 years ago
- Distributed implementation of popular evolutionary methods ☆64 · Updated 7 years ago
- ☆15 · Updated 8 years ago
- Reinforcement Learning for Super Mario Bros using A3C on GPU ☆37 · Updated 7 years ago
- AlphaGo Zero paper and code for studying purpose ☆28 · Updated 7 years ago
- Actor Critic using Kronecker-Factored Trust Region ☆19 · Updated 6 years ago
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization ☆44 · Updated 6 years ago
- Hands On Reinforcement Learning with Python [Video], Published by Packt ☆13 · Updated 4 years ago
- Reinforcement Learning Assembly ☆92 · Updated 3 years ago
- ☆11 · Updated 7 years ago
- Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w… ☆32 · Updated 6 years ago
- This is a tutorial written for Caffe2 which mocks google AlphaGo Fan and AlphaGo Zero. ☆8 · Updated 6 years ago
- Optimized Differentiable Neural Computer In Chainer ☆23 · Updated 6 years ago
- Unofficial attempt to rebuild AlphaGo Zero ☆58 · Updated 11 months ago
- PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop] ☆34 · Updated 4 years ago
- ☆8 · Updated 5 years ago
- ☆43 · Updated 5 years ago
- Weight Agnostic Neural Networks (in Python) ☆18 · Updated 5 years ago
- ☆30 · Updated 5 years ago
- A Policy Network in Tensorflow to classify chess moves ☆17 · Updated 8 years ago
- ☆50 · Updated 5 years ago
- An implementation of the AlphaZero algorithm for chess ☆33 · Updated 2 years ago
- Atari-DRQN (keras ver.) ☆33 · Updated 6 years ago
- A StarCraft 2 agent for harvesting resources ☆13 · Updated 6 years ago
- The exact codes used by the team "liveinparis" at the kaggle football competition ranked 6th/1141 ☆57 · Updated 4 years ago
- Library for running a Monte Carlo tree search, either traditionally or with expert policies ☆124 · Updated last year
- Colab notebooks for d2l-book ☆11 · Updated 5 years ago
- This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M… ☆79 · Updated 6 years ago