obventio56 / approximating_deepstack
☆12Updated 3 years ago
Related projects:
- Reinforcement learning algorithms to play Poker☆15Updated 2 years ago
- ☆50Updated this week
- A reproduction of AlphaGo Zero in "Mastering the game of Go without human knowledge"☆13Updated 6 years ago
- Gridworlds and Markov Decision Processes in Python☆11Updated 11 years ago
- An RNN PokerBot implementing DeepStack strategies☆53Updated 7 years ago
- An implementation of a reinforcement learning problem on stock prices☆10Updated 7 years ago
- Gathers machine learning and deep learning models for Reinforcement Learning☆9Updated 6 years ago
- A tutorial written for Caffe2 that mimics Google's AlphaGo Fan and AlphaGo Zero.☆8Updated 5 years ago
- ☆11Updated 8 years ago
- Deep Reinforcement Learning with Fine Grained Action Repetition☆23Updated 6 years ago
- Framework for deep reinforcement learning.☆29Updated 6 years ago
- A Renju game that replicates the paper "Mastering the game of Go with deep neural networks and tree search"☆20Updated 8 years ago
- Code for paper "A simple algorithm for computing Nash-equilibria in incomplete information games"☆10Updated 7 years ago
- Monte Carlo Counterfactual Regret Minimization for imperfect information games☆13Updated 5 years ago
- Trading by Deep Q-Network☆15Updated 7 years ago
- ☆12Updated 6 years ago
- LSTM-based recurrent neural network which trains RNN on 30-day span of stock data, then accepts 30-day span to make prediction for the 31…☆10Updated 7 years ago
- ☆26Updated this week
- Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning☆31Updated 6 years ago
- Random Forest Learner to predict stock prices☆11Updated 10 years ago
- RWA in pytorch☆14Updated 7 years ago
- Uses Hierarchical Temporal Memory to predict the price of RLG based on historical data☆11Updated 9 years ago
- Implementing the supervised learning policy networks of AlphaGo☆13Updated 6 years ago
- Reinforcement learning toolbox containing TRPO and A3C algorithms for continuous action spaces☆43Updated 6 years ago
- Counterfactual Regret Minimization for poker games☆19Updated 5 years ago
- Investigations into simplified holdem poker☆11Updated 11 years ago
- ☆21Updated this week
- ☆22Updated 5 years ago
- Code to build MLP models for outdoor head orientation tracking☆17Updated 11 years ago
- ☆26Updated 6 years ago