henrycharlesworth / big2_PPOalgorithm
Application of proximal policy optimization algorithm to the card game Big 2 using Tensorflow
☆75Updated last year
Related projects ⓘ
Alternatives and complementary repositories for big2_PPOalgorithm
- My implementation of the Proximal Policy Optisation algorithm using Keras as a backend☆88Updated 5 years ago
- Counterfactual regret minimization algorithm for Kuhn poker☆163Updated 5 years ago
- Source code for OpenAI Retro Contest for Sonic the Hedgehog☆30Updated 6 years ago
- C51-DDQN in Keras☆125Updated 7 years ago
- World Models applied to the Open AI Sonic Retro Contest☆77Updated 6 years ago
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow☆58Updated 6 years ago
- Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advanta…☆173Updated 2 months ago
- Proximal Policy Optimization implementation with TensorFlow☆104Updated 6 years ago
- StarCraft II / PySC2 Deep Reinforcement Learning Agents (A2C)☆134Updated 6 years ago
- Using self-play, MCTS, and a deep neural network to create a hearthstone ai player☆29Updated 6 years ago
- Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286☆180Updated 6 years ago
- Atari - Deep Reinforcement Learning algorithms in TensorFlow☆135Updated 7 months ago
- RainBow, Tensorflow☆49Updated 6 years ago
- Scalable Implementation of Neural Fictitous Self-Play☆73Updated 5 years ago
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow☆182Updated 5 years ago
- implement of prioritized experience replay☆156Updated 6 years ago
- Random Network Distillation(RND) algo in Pytorch☆48Updated 5 years ago
- RLtime is a reinforcement learning library focused on state-of-the-art q-learning algorithms and features☆139Updated 5 years ago
- Fictitious Self-play & Reinforcement Learning☆19Updated 6 years ago
- some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Co…☆131Updated last year
- ☆69Updated 5 years ago
- Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…☆131Updated 5 years ago
- Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆36Updated 6 years ago
- Deep reinforcement learning using an asynchronous advantage actor-critic (A3C) model.☆66Updated 6 years ago
- A simple stochastic OpenAI environment for training RL agents☆89Updated last year
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games☆112Updated 3 months ago
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning☆97Updated 2 years ago
- Simple grid-world environment compatible with OpenAI-gym☆49Updated 4 years ago
- ☆27Updated 3 years ago
- TensorFlow & Keras implementation of DQN with HER (Hindsight Experience Replay)☆40Updated 4 years ago