sanghyunyi / alphago_zero
A reproduction of Alphago Zero in "Mastering the game of Go without human knowledge"
☆13Updated 7 years ago
Alternatives and similar repositories for alphago_zero:
Users that are interested in alphago_zero are comparing it to the libraries listed below
- ☆27Updated 7 years ago
- ☆28Updated 5 years ago
- LSTM-based recurrent neural network which trains RNN on 30-day span of stock data, then accepts 30-day span to make prediction for the 31…☆9Updated 7 years ago
- trading by Deep Q-Network☆14Updated 8 years ago
- Framework for deep reinforcement learning.☆29Updated 6 years ago
- Collection of reinforcement learners implemented in python. Mainly including DQN and its variants☆54Updated 7 years ago
- tabular q learning for trading☆11Updated 6 years ago
- Deep Reinforcement Learning with Fined Grained Action Repetition☆23Updated 7 years ago
- Source repo for tensor-train recurrent neural network for long-term forecasting☆8Updated 6 years ago
- an implementation of reinforcement learning problem, stock prices☆10Updated 8 years ago
- RWA in pytorch☆14Updated 7 years ago
- reinforcement learning. policy gradient. PCL☆37Updated 7 years ago
- Reinforcement learning algorithms to play Poker☆15Updated 3 years ago
- Series Algorithms of Deep Reinforcement Learning, such as DQN, DDQN, one-step-DQN, DDPG, etc☆42Updated 8 years ago
- 数学基础☆13Updated 7 years ago
- ☆8Updated 8 years ago
- Differentiable neural computers☆27Updated 8 years ago
- Duel_DDQN (Dueling Network Architectures + Double DQN) using Keras☆31Updated 8 years ago
- Neuroevolution as a direct policy search deep reinforcement learning method, implemented using Keras and DEAP.☆70Updated 4 years ago
- Attentional Mechanism incorporated in Asynchronous Advantage Actor Critic a3c/a2c deep mind☆10Updated 7 years ago
- Implementing the supervised learning policy networks of AlphaGo☆12Updated 7 years ago
- Recurrent Reinforcement Learning Algorithm Matlab Implementation☆46Updated 4 years ago
- simple reinforcement learning example for the minecraft☆9Updated 6 years ago
- This project demonstrates the use of generic bi-directional LSTM models for predicting importance of words in a spoken dialgoue for under…☆10Updated 2 years ago
- ☆11Updated 8 years ago
- Deep recommendation system☆13Updated 8 years ago
- Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning☆29Updated 7 years ago
- SimEc code relying on the theano library - check out the simec repo instead for keras based code!☆10Updated 7 years ago
- Actor Critic using Kronecker-Factored Trust Region☆19Updated 6 years ago
- ☆30Updated 7 years ago