cody2007 / alpha_go_zero_implementation
An implementation of the Alpha Go Zero algorithm, runnable on a single GPU
☆49 Updated 5 years ago
Alternatives and similar repositories for alpha_go_zero_implementation
Users who are interested in alpha_go_zero_implementation are comparing it to the repositories listed below
- Solving the Rubik's cube with deep reinforcement learning and Monte Carlo tree search ☆104 Updated 6 years ago
- Reinforcement Learning Assembly ☆92 Updated 4 years ago
- A Policy Network in TensorFlow to classify chess moves ☆19 Updated 8 years ago
- Deep reinforcement learning baselines based on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei… ☆35 Updated 7 years ago
- ☆67 Updated 3 years ago
- Reinforcement Learning for Super Mario Bros using A3C on GPU ☆37 Updated 7 years ago
- Open-source library for reinforcement learning research. ☆54 Updated 2 years ago
- Library for running a Monte Carlo tree search, either traditionally or with expert policies ☆127 Updated last year
- ☆56 Updated 2 years ago
- This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M… ☆80 Updated 6 years ago
- Reinforcement Learning implementations and research prototyping in TensorFlow ☆82 Updated 6 years ago
- A Python implementation of tabular MuZero for educational purposes ☆21 Updated 5 years ago
- The exact code used by the team "liveinparis" in the Kaggle football competition, ranked 6th/1141 ☆57 Updated 4 years ago
- A student implementation of Alpha Go Zero ☆280 Updated 7 years ago
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization ☆44 Updated 6 years ago
- Distributed implementation of popular evolutionary methods ☆64 Updated 7 years ago
- Source code for OpenAI Retro Contest for Sonic the Hedgehog ☆31 Updated 7 years ago
- ☆69 Updated 6 years ago
- Generic reinforcement learning codebase in TensorFlow ☆95 Updated 3 years ago
- Atari-DRQN (Keras ver.) ☆33 Updated 6 years ago
- AlphaGo Zero Clone ☆17 Updated 5 years ago
- Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w… ☆32 Updated 7 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X, etc. in TF1.x ☆62 Updated 4 years ago
- The server portion of the Neural Chat project to deploy chatbots on the web. This code is accompanied by another repository that includes the… ☆36 Updated 4 years ago
- 2nd place solution of NIPS2017 LearningToRun Competition. ☆129 Updated 3 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation ☆31 Updated 6 years ago
- A clean implementation based on Expert Iterations for any game, inspired by alpha-zero-general ☆44 Updated 2 years ago
- Highly Modular and Scalable Reinforcement Learning ☆118 Updated 5 years ago
- Connect4 reinforcement learning by AlphaGo Zero methods. ☆113 Updated 4 years ago
- Minimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras ☆160 Updated 5 years ago