henrycharlesworth / big2_PPOalgorithm
Application of the proximal policy optimization (PPO) algorithm to the card game Big 2 using TensorFlow
☆81 Updated last year
Alternatives and similar repositories for big2_PPOalgorithm
Users who are interested in big2_PPOalgorithm are comparing it to the repositories listed below
- C51-DDQN in Keras ☆126 Updated 7 years ago
- ☆69 Updated 6 years ago
- Scalable Implementation of Neural Fictitious Self-Play ☆84 Updated 6 years ago
- Fictitious Self-play & Reinforcement Learning ☆18 Updated 7 years ago
- StarCraft II / PySC2 Deep Reinforcement Learning Agents (A2C) ☆138 Updated 6 years ago
- Source code for OpenAI Retro Contest for Sonic the Hedgehog ☆31 Updated 7 years ago
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow ☆182 Updated 6 years ago
- Counterfactual regret minimization algorithm for Kuhn poker ☆176 Updated 6 years ago
- PySC2 OpenAI Gym Environments ☆48 Updated 6 years ago
- This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M… ☆80 Updated 6 years ago
- Rainbow, TensorFlow ☆49 Updated 7 years ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games ☆119 Updated last year
- My implementation of the Proximal Policy Optimization algorithm using Keras as a backend ☆88 Updated 5 years ago
- PyTorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286 ☆184 Updated 7 years ago
- This is the code for "OpenAI Five vs DOTA 2 Explained" by Siraj Raval on YouTube ☆167 Updated 7 years ago
- Random Network Distillation (RND) algorithm in PyTorch ☆50 Updated 6 years ago
- Highly Modular and Scalable Reinforcement Learning ☆118 Updated 5 years ago
- PyTorch Implementation of MuZero ☆354 Updated 2 years ago
- OpenAI Gym No Limit Texas Hold 'em Environment for Reinforcement Learning ☆166 Updated 5 years ago
- Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow ☆103 Updated 5 years ago
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning ☆103 Updated 3 years ago
- This repo replicates the results Horgan et al. obtained in "Distributed Prioritized Experience Replay" ☆190 Updated 6 years ago
- A PyTorch implementation of Rainbow DQN agent ☆170 Updated 7 years ago
- DQN, DDDQN, A3C, PPO, Curiosity applied to the game DOOM ☆87 Updated 4 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis ☆29 Updated 6 years ago
- Proximal Policy Optimization implementation with TensorFlow ☆106 Updated 6 years ago
- PyTorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google DeepMind's Asynchronous Advanta… ☆189 Updated last year
- Actor-critic with experience replay ☆254 Updated 2 years ago
- This is a simple implementation of DeepMind's PySC2 RL agents. ☆274 Updated 7 years ago
- Some baselines for Pommerman competition ☆46 Updated 7 years ago