tansey / rl-tictactoe
A reinforcement learning agent for tic-tac-toe. Implements the example from Chapter 1 of Sutton and Barto.
☆50 · Updated 6 years ago
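For orientation, the Chapter 1 example in Sutton and Barto learns a value for each board state and, after a move, nudges the earlier state's value toward the later one: V(s) ← V(s) + α[V(s') − V(s)]. The snippet below is a minimal sketch of that update only, not code from the rl-tictactoe repository. The function name `td_update`, the 9-character board strings, and the step size of 0.1 are assumptions chosen for illustration.

```python
from collections import defaultdict


def td_update(values, state, next_state, alpha=0.1):
    """Move the estimate for `state` a fraction `alpha` toward `next_state`'s estimate."""
    values[state] += alpha * (values[next_state] - values[state])
    return values[state]


# Hypothetical board encoding: a 9-character string read row by row.
# States not seen before start at 0.5, as in the book's example.
values = defaultdict(lambda: 0.5)

win = "XXXOO...."      # X has completed the top row
before = "XX.OO...."   # the position one X move earlier
values[win] = 1.0      # a won position is worth 1

# Back up the value of the winning position to the state that preceded it.
td_update(values, before, win, alpha=0.1)
```

After the call, `values[before]` has moved from 0.5 toward 1.0 by one tenth of the gap. A full agent would also need move selection and occasional exploratory moves, which this sketch leaves out.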
Alternatives and similar repositories for rl-tictactoe:
Users interested in rl-tictactoe also compare it with the repositories listed below:
- Generative Adversarial Network Demo for Fresh Machine Learning #2 ☆73 · Updated 6 years ago
- Repository for practical assignments for UvA Deep Learning Course 2016 ☆51 · Updated 7 years ago
- Montréal Deep Learning Summer School 2016 material ☆100 · Updated 8 years ago
- Reasonably-okay-performing implementation of a GAN and an adversarial autoencoder on MNIST. ☆29 · Updated 9 years ago
- ☆24 · Updated 9 years ago
- Tutorial for Visual Turing Test (visual question answering, image question answering). ☆116 · Updated 8 years ago
- ☆88 · Updated 8 years ago
- Universal library for deep reinforcement learning. ☆38 · Updated 8 years ago
- This is the code for "Synthetic Gradients Explained" by Siraj Raval on YouTube ☆62 · Updated 6 years ago
- DeepMind Recurrent Environment Simulators paper implementation in TensorFlow ☆74 · Updated 7 years ago
- X is a temporary name, but here lies RL ☆40 · Updated 7 years ago
- ☆107 · Updated 6 years ago
- things I help(ed) to build ☆53 · Updated 4 years ago
- my solutions for Oxford's "deep nlp 2017" practical assignments ☆65 · Updated 7 years ago
- Implementations of "LSTM: A Search Space Odyssey" variants and their training results on the PTB dataset. ☆95 · Updated 7 years ago
- Save Keras weight matrices as short animated videos during training ☆105 · Updated 7 years ago
- A list of deep neural network architectures for reinforcement learning tasks. ☆166 · Updated 8 years ago
- TensorFlow reinforcement learning agents for OpenAI gym environments ☆113 · Updated 7 years ago
- Generative Adversarial Networks with Keras ☆156 · Updated 4 years ago
- Deep Learning Dashboard ☆38 · Updated 8 years ago
- ☆69 · Updated 6 years ago
- Machine learning and data science blog. ☆66 · Updated last year
- code for PyData Madrid presentation ☆53 · Updated 8 years ago
- A curated list of awesome hyperparameters for deep learning ☆78 · Updated 7 years ago
- A Quick and Dirty Progressive Neural Network written in TensorFlow. ☆52 · Updated 6 years ago
- DrMAD ☆107 · Updated 7 years ago
- Playing Atari games with TensorFlow implementation of Asynchronous Deep Q-Learning ☆42 · Updated 6 years ago
- Testbed for deep reinforcement learning ☆160 · Updated 7 years ago
- PyTorch implementation of "Forward Thinking: Building and Training Neural Networks One Layer at a Time" ☆65 · Updated 7 years ago
- Quick and Dirty TensorFlow command framework to train and evaluate models and make inference ☆56 · Updated 5 years ago