henrycharlesworth / big2_PPOalgorithmLinks
Application of proximal policy optimization algorithm to the card game Big 2 using Tensorflow
☆82Updated 2 years ago
Alternatives and similar repositories for big2_PPOalgorithm
Users that are interested in big2_PPOalgorithm are comparing it to the libraries listed below
Sorting:
- C51-DDQN in Keras☆126Updated 8 years ago
- Counterfactual regret minimization algorithm for Kuhn poker☆181Updated 6 years ago
- A PyTorch implementation of Rainbow DQN agent☆170Updated 7 years ago
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow☆182Updated 6 years ago
- ☆29Updated 4 years ago
- Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286☆183Updated 7 years ago
- RainBow, Tensorflow☆49Updated 7 years ago
- This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M…☆80Updated 6 years ago
- Source code for OpenAI Retro Contest for Sonic the Hedgehog☆31Updated 7 years ago
- Simple grid-world environment compatible with OpenAI-gym☆50Updated 5 years ago
- Reinforcement Learning for Super Mario Bros using A3C on GPU☆37Updated 7 years ago
- ICML 2018 Self-Imitation Learning☆278Updated 5 years ago
- Distributed implementation of popular evolutionary methods☆64Updated 8 years ago
- My implementation of the Proximal Policy Optisation algorithm using Keras as a backend☆88Updated 6 years ago
- Proximal Policy Optimization implementation with TensorFlow☆108Updated 7 years ago
- Scalable Implementation of Neural Fictitous Self-Play☆84Updated 6 years ago
- ☆69Updated 7 years ago
- Fictitious Self-play & Reinforcement Learning☆18Updated 7 years ago
- A Tetris environment to train machine learning agents☆73Updated 2 years ago
- Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advanta…☆193Updated last year
- implement of prioritized experience replay☆159Updated 7 years ago
- Highly Modular and Scalable Reinforcement Learning☆118Updated 5 years ago
- Actor-critic with experience replay☆256Updated 3 years ago
- World Models applied to the Open AI Sonic Retro Contest☆77Updated 7 years ago
- Random Network Distillation(RND) algo in Pytorch☆51Updated 6 years ago
- Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow☆103Updated 5 years ago
- PyTorch implementation of Advantage Actor-Critic (A2C)☆47Updated 8 years ago
- Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…☆133Updated 6 years ago
- Solving the Rubik's cube with deep reinforcement learning and Monte Carlo tree search☆107Updated 6 years ago
- Multi Agent Reinforcement Learning using MalmÖ☆264Updated 5 years ago