int8 / chess-position-evaluation
Chess position evaluation using neural networks
☆25 · Updated 4 years ago
Related projects:
- Our team's code for the OpenAI Retro Contest on reinforcement learning ☆100 · Updated 6 years ago
- ☆30 · Updated 6 years ago
- An implementation of the AlphaZero algorithm for chess ☆34 · Updated last year
- Connect4 reinforcement learning by AlphaGo Zero methods. ☆114 · Updated 3 years ago
- Trained models for keras-rl. ☆21 · Updated 7 years ago
- ☆46 · Updated 6 years ago
- Source code for OpenAI Retro Contest for Sonic the Hedgehog ☆30 · Updated 6 years ago
- Combining deep learning and reinforcement learning. ☆81 · Updated 2 years ago
- ☆39 · Updated 6 years ago
- Our NIPS 2017: Learning to Run source code ☆56 · Updated last year
- ☆57 · Updated last year
- Code for "Spinning Up a Pong AI With Deep RL" on FloydHub. ☆54 · Updated 5 years ago
- This is the code for "Synthetic Gradients Explained" by Siraj Raval on YouTube ☆61 · Updated 6 years ago
- This is the code for "How Does DeepMind's AlphaGo Zero Work?" by Siraj Raval on YouTube ☆122 · Updated 6 years ago
- A TensorFlow implementation of "DeepChess: End-to-End Deep Neural Network for Automatic Learning in Chess" ☆87 · Updated 5 years ago
- This is the code for "Neural Arithmetic Logic Units" by Siraj Raval on YouTube ☆92 · Updated 6 years ago
- TensorFlow implementation of the A3C algorithm ☆48 · Updated 7 years ago
- This is the code for "How to Beat Pong Using Policy Gradients (LIVE)" by Siraj Raval on YouTube ☆63 · Updated 7 years ago
- Style transfer for chess. ☆31 · Updated 7 years ago
- A reimplementation of the Google AlphaZero algorithm. ☆18 · Updated 4 years ago
- Reinforcement learning toolbox containing TRPO and A3C algorithms for continuous action spaces ☆43 · Updated 6 years ago
- World Models applied to the OpenAI Sonic Retro Contest ☆77 · Updated 6 years ago
- C51-DDQN in Keras ☆125 · Updated 6 years ago
- Random search, hill climbing, policy gradient ☆138 · Updated 6 years ago
- TensorFlow implementation of PathNet: Evolution Channels Gradient Descent in Super Neural Networks ☆102 · Updated 7 years ago
- Convolutional neural network... for draw video poker. Perhaps we learn something useful for other poker, too. ☆109 · Updated 8 years ago
- A Pygame+Pymunk Carrom Simulation Testbed for reinforcement learning. [CS747] [Foundations of Intelligent and Learning Agents] ☆15 · Updated 5 years ago
- Deep reinforcement learning using an asynchronous advantage actor-critic (A3C) model. ☆66 · Updated 6 years ago
- Playing Atari games with a TensorFlow implementation of Asynchronous Deep Q-Learning ☆43 · Updated 6 years ago
- ☆20 · Updated 6 years ago