sanghyunyi / alphago_zero
A reproduction of Alphago Zero in "Mastering the game of Go without human knowledge"
☆13Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for alphago_zero
- ☆12Updated 4 years ago
- an implementation of reinforcement learning problem, stock prices☆10Updated 7 years ago
- Keras implementation of MinimalRNN: Toward More Interpretable and Trainable Recurrent Neural Networks☆17Updated 6 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆17Updated 7 years ago
- Gathers machine learning and deep learning models for Reinforcement Learning☆9Updated 6 years ago
- Implementing the supervised learning policy networks of AlphaGo☆13Updated 6 years ago
- ☆8Updated 7 years ago
- Machine Learning examples adapted from Hastie, Tibshirani, and Friedman book☆9Updated 7 years ago
- Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning☆31Updated 6 years ago
- ☆26Updated 6 years ago
- Tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'☆13Updated 7 years ago
- tabular q learning for trading☆10Updated 5 years ago
- LSTM-based recurrent neural network which trains RNN on 30-day span of stock data, then accepts 30-day span to make prediction for the 31…☆10Updated 7 years ago
- An implementation of the AlphaZero algorithm for chess☆34Updated last year
- Actor Critic using Kronecker-Factored Trust Region☆19Updated 6 years ago
- ☆16Updated 7 years ago
- This is a tutorial written for Caffe2 which mocks google AlphaGo Fan and AlphaGo Zero.☆8Updated 5 years ago
- Monte Carlo Conterfactual Regret Minimization for imperfect information games☆13Updated 5 years ago
- ☆11Updated 8 years ago
- reinfore learning tool box, contains trpo, a3c algorithm for continous action space☆43Updated 6 years ago
- the solustion to https://openai.com/requests-for-research☆12Updated 7 years ago
- Neural machine translation with Recurrent Deterministic Policy Gradient☆10Updated 8 years ago
- reinforcement learning. policy gradient. PCL☆38Updated 7 years ago
- Playing Atari games with TensorFlow implementation of Asynchronous Deep Q-Learning☆43Updated 6 years ago
- SimEc code relying on the theano library - check out the simec repo instead for keras based code!☆10Updated 6 years ago
- ☆14Updated 8 years ago
- Framework for deep reinforcement learning.☆29Updated 6 years ago
- Random Forest Learner to predict stock prices☆11Updated 10 years ago