che-shr-cat / alphago
Code to recreate AlphaGo Zero models
☆19 · Updated last year
Related projects
Alternatives and complementary repositories for alphago
- Python wrappers for Pachi. Contains a modified version of the bleeding-edge Pachi source code. ☆41 · Updated last year
- TensorFlow implementation of Neural Arithmetic Logic Unit, Trask et al. ☆29 · Updated 6 years ago
- An attempt to reimplement experiments from the 2013 paper by Wissner-Gross & Freer ☆10 · Updated this week
- Differentiable Neural Computer implementation in TensorFlow ☆36 · Updated 7 years ago
- Python implementation of tabular asynchronous actor critic ☆11 · Updated 8 years ago
- Neural Arithmetic Logic Units (arXiv:1808.00508) ☆13 · Updated 6 years ago
- Differentiable Neural Computer in TensorFlow ☆27 · Updated 7 years ago
- ☆30 · Updated 6 years ago
- Implementation of the Monte-Carlo CTW AIXI approximation as described by Joel Veness et al. ☆12 · Updated 7 years ago
- ☆22 · Updated 6 years ago
- An implementation of the AlphaZero algorithm for chess ☆34 · Updated last year
- Backprop training of recurrent neural networks with Hebbian plastic connections ☆20 · Updated 3 years ago
- Combining deep learning and reinforcement learning. ☆81 · Updated 3 years ago
- Forward Modeling for Partial Observation Strategy Games - A StarCraft Defogger ☆31 · Updated 3 years ago
- MM-NEAT version 2.0 is no longer supported. Please get MM-NEAT 3+ from https://github.com/schrum2/MM-NEAT ☆11 · Updated 7 years ago
- Simple, small, fully-connected Python version of NeoRL ☆11 · Updated 8 years ago
- Code for my blog post "Design by Evolution" ☆21 · Updated 6 years ago
- Toolkit designed to ease development of your Deep Neural Network models for the game of Go (weiqi, baduk). ☆20 · Updated 7 years ago
- ☆48 · Updated 5 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning. ☆50 · Updated 5 years ago
- Web-based Reinforcement Learning Control Center ☆64 · Updated 8 years ago
- A platform of grid world that supports up to 1 million reinforcement-learning agents. ☆70 · Updated 7 years ago
- Unsupervised ML algorithm for predictive modeling and time-series analysis ☆38 · Updated 4 years ago
- Actor Critic using Kronecker-Factored Trust Region ☆19 · Updated 6 years ago
- A Python implementation of tabular MuZero for educational purposes ☆21 · Updated 4 years ago
- This is the code for "Neural Arithmetic Logic Units" by Siraj Raval on YouTube ☆92 · Updated 6 years ago
- Scripts to generate a dataset with static frames from the Arcade Learning Environment ☆18 · Updated 10 years ago
- Deep Reinforcement Learning with Fine Grained Action Repetition ☆23 · Updated 6 years ago