sanghyunyi / alphago_zero
A reproduction of Alphago Zero in "Mastering the game of Go without human knowledge"
☆13Updated 6 years ago
Alternatives and similar repositories for alphago_zero:
Users that are interested in alphago_zero are comparing it to the libraries listed below
- ☆12Updated 4 years ago
- an implementation of reinforcement learning problem, stock prices☆10Updated 8 years ago
- Series Algorithms of Deep Reinforcement Learning, such as DQN, DDQN, one-step-DQN, DDPG, etc☆41Updated 8 years ago
- ☆26Updated 7 years ago
- Attentional Mechanism incorporated in Asynchronous Advantage Actor Critic a3c/a2c deep mind☆9Updated 7 years ago
- Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning☆31Updated 7 years ago
- LSTM-based recurrent neural network which trains RNN on 30-day span of stock data, then accepts 30-day span to make prediction for the 31…☆10Updated 7 years ago
- Actor Critic using Kronecker-Factored Trust Region☆19Updated 6 years ago
- Random Forest Learner to predict stock prices☆12Updated 11 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Updated 7 years ago
- Model-Free Episodic Control☆15Updated 8 years ago
- RWA in pytorch☆14Updated 7 years ago
- Keras implementation of MinimalRNN: Toward More Interpretable and Trainable Recurrent Neural Networks☆17Updated 6 years ago
- Deep Reinforcement Learning with Fined Grained Action Repetition☆23Updated 7 years ago
- ☆8Updated 8 years ago
- Machine Learning examples adapted from Hastie, Tibshirani, and Friedman book☆9Updated 7 years ago
- ☆53Updated 8 years ago
- Contextual Bandits Action Elimination DQN☆19Updated 6 years ago
- Variational Recurrent Auto Encoder☆16Updated 8 years ago
- Exercises for the semi-supervised summer school https://semisupervised-learning.compute.dtu.dk.☆9Updated 8 years ago
- Using Asynchronous Deep Reinforcement Learning to play Flappy Bird from pixel input.☆30Updated 7 years ago
- tabular q learning for trading☆11Updated 6 years ago
- ☆28Updated 5 years ago
- ☆14Updated 9 years ago
- Duel_DDQN (Dueling Network Architectures + Double DQN) using Keras☆32Updated 8 years ago
- Collection of reinforcement learners implemented in python. Mainly including DQN and its variants☆54Updated 7 years ago
- An implementation of the AlphaZero algorithm for chess☆33Updated 2 years ago
- Any Stream to Reinforcement Learning Environment (Time Series Data, Stock Market )☆12Updated 6 years ago
- reinforcement learning. policy gradient. PCL☆38Updated 7 years ago
- Code base for solving Markov Decision Processes and Reinforcement Learning problems using Recurrent Convolutional Neural Networks.☆70Updated 7 years ago