henrycharlesworth / big2_PPOalgorithm
Application of the proximal policy optimization algorithm to the card game Big 2 using TensorFlow
☆79 Updated last year
Alternatives and similar repositories for big2_PPOalgorithm:
Users interested in big2_PPOalgorithm compare it to the libraries listed below:
- My implementation of the Proximal Policy Optimization algorithm using Keras as a backend ☆88 Updated 5 years ago
- C51-DDQN in Keras ☆126 Updated 7 years ago
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow ☆180 Updated 6 years ago
- Counterfactual regret minimization algorithm for Kuhn poker ☆171 Updated 6 years ago
- Proximal Policy Optimization implementation with TensorFlow ☆106 Updated 6 years ago
- StarCraft II / PySC2 Deep Reinforcement Learning Agents (A2C) ☆135 Updated 6 years ago
- Reinforcement Learning for Super Mario Bros using A3C on GPU ☆37 Updated 7 years ago
- RainBow, TensorFlow ☆49 Updated 7 years ago
- ☆69 Updated 6 years ago
- PyTorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google DeepMind's Asynchronous Advanta… ☆182 Updated 7 months ago
- PyTorch implementation of MuZero ☆351 Updated last year
- A high-performance Atari A3C agent in 180 lines of PyTorch ☆171 Updated 3 years ago
- Implementation of selected reinforcement learning algorithms in TensorFlow. A3C, DDPG, REINFORCE, DQN, etc. ☆150 Updated last year
- Fictitious Self-play & Reinforcement Learning ☆18 Updated 7 years ago
- RLtime is a reinforcement learning library focused on state-of-the-art q-learning algorithms and features ☆139 Updated 5 years ago
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration" ☆197 Updated 6 years ago
- Reinforcement learning models in ViZDoom environment ☆133 Updated 3 years ago
- Scalable Implementation of Neural Fictitious Self-Play ☆77 Updated 6 years ago
- Using Asynchronous Deep Reinforcement Learning to play Flappy Bird from pixel input. ☆30 Updated 7 years ago
- An environment of the board game Go using OpenAI's Gym API ☆173 Updated 2 years ago
- A Tetris environment to train machine learning agents ☆68 Updated last year
- ☆303 Updated 2 years ago
- Hindsight Experience Replay - Bit flipping experiment in TensorFlow ☆58 Updated 6 years ago
- A student implementation of Alpha Go Zero ☆280 Updated 6 years ago
- Deep reinforcement learning baselines based on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei… ☆35 Updated 6 years ago
- A PyTorch implementation of Rainbow DQN agent ☆169 Updated 7 years ago
- Trust Region Policy Optimization with TensorFlow and OpenAI Gym ☆362 Updated 4 years ago
- Implementing reinforcement learning algorithms for the pysc2 environment ☆89 Updated 7 years ago
- Code for hierarchical imitation learning and reinforcement learning ☆290 Updated 7 years ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games ☆115 Updated 9 months ago