henrycharlesworth / big2_PPOalgorithmLinks
Application of proximal policy optimization algorithm to the card game Big 2 using Tensorflow
☆81Updated last year
Alternatives and similar repositories for big2_PPOalgorithm
Users that are interested in big2_PPOalgorithm are comparing it to the libraries listed below
Sorting:
- Proximal Policy Optimization implementation with TensorFlow☆107Updated 6 years ago
- My implementation of the Proximal Policy Optisation algorithm using Keras as a backend☆88Updated 5 years ago
- StarCraft II / PySC2 Deep Reinforcement Learning Agents (A2C)☆137Updated 6 years ago
- C51-DDQN in Keras☆126Updated 7 years ago
- OpenAI Gym No Limit Texas Hold 'em Environment for Reinforcement Learning☆165Updated 5 years ago
- Source code for OpenAI Retro Contest for Sonic the Hedgehog☆31Updated 6 years ago
- Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆35Updated 6 years ago
- Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286☆183Updated 7 years ago
- RainBow, Tensorflow☆49Updated 7 years ago
- Counterfactual regret minimization algorithm for Kuhn poker☆172Updated 6 years ago
- A structured implementation of MuZero☆204Updated 3 years ago
- Connect4 reinforcement learning by AlphaGo Zero methods.☆113Updated 4 years ago
- Deep reinforcement learning using an asynchronous advantage actor-critic (A3C) model.☆66Updated 7 years ago
- Random Network Distillation pytorch☆250Updated 6 years ago
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow☆180Updated 6 years ago
- Pytorch Implementation of MuZero☆353Updated last year
- A Tensorflow implementation of the Option-Critic Architecture☆71Updated 8 years ago
- This package allows to use PLE as a gym environment.☆72Updated 4 years ago
- Qiita投稿用に作成したAgent57(強化学習)の実装コードです。☆45Updated 2 years ago
- Using Asynchronous Deep Reinforcement Learning to play Flappy Bird from pixel input.☆30Updated 8 years ago
- ☆69Updated 6 years ago
- DQN, DDDQN, A3C, PPO, Curiosity applied to the game DOOM☆87Updated 4 years ago
- Scalable Implementation of Neural Fictitous Self-Play☆80Updated 6 years ago
- A3C-LSTM algorithm tested on CartPole OpenAI Gym environment☆48Updated 6 years ago
- Hybrid Reward Architecture☆77Updated 7 years ago
- A simple stochastic OpenAI environment for training RL agents☆88Updated 2 years ago
- OpenAI Gym environments for Legends of Code and Magic, a collectible card game designed for AI research☆37Updated 7 months ago
- implement of prioritized experience replay☆159Updated 6 years ago
- Fictitious Self-play & Reinforcement Learning☆18Updated 7 years ago
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow☆58Updated 6 years ago