ARVILab / open_alpha_zeroLinks
☆26Updated 5 years ago
Alternatives and similar repositories for open_alpha_zero
Users that are interested in open_alpha_zero are comparing it to the libraries listed below
Sorting:
- Deep Reinforcement Learning library for humans☆299Updated 7 years ago
- ☆50Updated 7 years ago
- RL experiments☆69Updated 2 years ago
- Implementation of TD-Gammon in TensorFlow.☆112Updated 6 years ago
- Easy TensorFlow logging for quick prototypes☆110Updated 3 years ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package☆267Updated 5 years ago
- Chess position evaluation using neural networks☆26Updated 5 years ago
- A reinforcement learning framework☆155Updated 6 years ago
- Connect4 reinforcement learning by AlphaGo Zero methods.☆113Updated 4 years ago
- ☆182Updated 11 months ago
- ☆15Updated 2 years ago
- A fast Evolution Strategy implementation in Python☆274Updated 5 years ago
- Simple Reinforcement Learning Framework☆34Updated 7 years ago
- Yandex SDA classes on deep learning. Version of year 2017☆116Updated 8 years ago
- Codes of our team for the OpenAI Retro Contest of reinforcement learning☆99Updated 7 years ago
- Reinforcement Learning in Keras on VizDoom☆143Updated 7 years ago
- Deep Q-networks made easy☆15Updated 6 years ago
- ☆22Updated 6 years ago
- ☆77Updated 8 years ago
- An implementation of the ideas from this paper https://arxiv.org/pdf/1803.10122.pdf☆282Updated 2 years ago
- Reproducing results from DeepMind's paper on Population Based Training of Neural Networks.☆56Updated 6 years ago
- ☆13Updated 3 years ago
- Counterfactual regret minimization algorithm for Kuhn poker☆173Updated 6 years ago
- Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it☆128Updated 2 years ago
- ☆117Updated 5 years ago
- Open source implementation of the PAAC algorithm presented in Efficient Parallel Methods for Deep Reinforcement Learning☆202Updated 8 years ago
- The project is a platform of zero learning with a library of games.☆267Updated 3 years ago
- MCTS project for Tetris☆348Updated 9 months ago
- lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.☆375Updated 2 years ago
- Cellular automaton-based calculus for the masses☆68Updated 5 years ago