henrycharlesworth / big2_PPOalgorithm
Application of the proximal policy optimization algorithm to the card game Big 2 using TensorFlow
☆79Updated last year
Alternatives and similar repositories for big2_PPOalgorithm:
Users interested in big2_PPOalgorithm are comparing it to the repositories listed below
- Scalable Implementation of Neural Fictitious Self-Play☆76Updated 6 years ago
- My implementation of the Proximal Policy Optimisation algorithm using Keras as a backend☆88Updated 5 years ago
- Counterfactual regret minimization algorithm for Kuhn poker☆170Updated 6 years ago
- C51-DDQN in Keras☆126Updated 7 years ago
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow☆180Updated 6 years ago
- StarCraft II / PySC2 Deep Reinforcement Learning Agents (A2C)☆135Updated 6 years ago
- PySC2 OpenAI Gym Environments☆48Updated 6 years ago
- Pytorch Implementation of MuZero☆350Updated last year
- DQN, DDDQN, A3C, PPO, Curiosity applied to the game DOOM☆84Updated 4 years ago
- This code is based on the implementation of http://www.cs.cmu.edu/afs/cs/Web/People/sandholm/potential-aware_imperfect-recall.aaai14.pdf,…☆34Updated 6 years ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games☆115Updated 8 months ago
- Proximal Policy Optimization implementation with TensorFlow☆106Updated 6 years ago
- Reinforcement Learning in Keras on VizDoom☆145Updated 7 years ago
- Atari - Deep Reinforcement Learning algorithms in TensorFlow☆135Updated last year
- A continuous action space version of A3C LSTM in pytorch plus A3G design☆259Updated 5 months ago
- OpenAI Gym No Limit Texas Hold 'em Environment for Reinforcement Learning☆162Updated 5 years ago
- Implementing reinforcement-learning algorithms for the pysc2 environment☆89Updated 7 years ago
- OpenAI Gym Env for game Gomoku(Five-In-a-Row, 五子棋, 五目並べ, omok, Gobang,...)☆88Updated 5 months ago
- Using self-play, MCTS, and a deep neural network to create a hearthstone ai player☆29Updated 6 years ago
- Random Network Distillation(RND) algo in Pytorch☆49Updated 6 years ago
- Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro☆174Updated 2 years ago
- Scalable Implementation of Deep CFR and Single Deep CFR☆291Updated 4 years ago
- Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.☆150Updated last year
- Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google DeepMind's Asynchronous Advanta…☆182Updated 6 months ago
- RainBow, Tensorflow☆49Updated 7 years ago
- Fictitious Self-play & Reinforcement Learning☆18Updated 7 years ago
- TensorFlow implementation of asynchronous advantage actor-critic (A3C)☆39Updated 3 years ago
- Random Network Distillation pytorch☆247Updated 6 years ago
- Pytorch implementation of distributed deep reinforcement learning☆75Updated 2 years ago
- Implementation of prioritized experience replay☆160Updated 6 years ago