henrycharlesworth / big2_PPOalgorithm
Application of the proximal policy optimization algorithm to the card game Big 2 using TensorFlow
☆82 Updated 2 years ago
Alternatives and similar repositories for big2_PPOalgorithm
Users that are interested in big2_PPOalgorithm are comparing it to the libraries listed below
- C51-DDQN in Keras ☆127 Updated 8 years ago
- ☆69 Updated 7 years ago
- PyTorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google DeepMind's Asynchronous Advanta… ☆195 Updated last year
- This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M… ☆80 Updated 7 years ago
- ☆29 Updated 4 years ago
- DQN, DDDQN, A3C, PPO, Curiosity applied to the game DOOM ☆90 Updated 5 years ago
- Fictitious Self-play & Reinforcement Learning ☆18 Updated 8 years ago
- PyTorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286 ☆184 Updated 7 years ago
- Rainbow, TensorFlow ☆49 Updated 7 years ago
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow ☆182 Updated 7 years ago
- Implementation of prioritized experience replay ☆159 Updated 7 years ago
- My implementation of the Proximal Policy Optimisation algorithm using Keras as a backend ☆88 Updated 6 years ago
- Counterfactual regret minimization algorithm for Kuhn poker ☆181 Updated 6 years ago
- Reinforcement Learning in Keras on VizDoom ☆142 Updated 8 years ago
- Proximal Policy Optimization implementation with TensorFlow ☆108 Updated 7 years ago
- A PyTorch implementation of Rainbow DQN agent ☆170 Updated 7 years ago
- A high-performance Atari A3C agent in 180 lines of PyTorch ☆173 Updated 4 years ago
- Simple grid-world environment compatible with OpenAI-gym ☆50 Updated 5 years ago
- Code accompanying the blog post "Deep Reinforcement Learning with TensorFlow 2.1" ☆206 Updated 4 years ago
- Scalable Implementation of Neural Fictitious Self-Play ☆85 Updated 7 years ago
- Distributed implementation of popular evolutionary methods ☆64 Updated 8 years ago
- Deep reinforcement learning baselines based on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei… ☆35 Updated 7 years ago
- A3C-LSTM algorithm tested on CartPole OpenAI Gym environment ☆48 Updated 7 years ago
- Source code for OpenAI Retro Contest for Sonic the Hedgehog ☆31 Updated 7 years ago
- StarCraft II / PySC2 Deep Reinforcement Learning Agents (A2C) ☆138 Updated 7 years ago
- ICML 2018 Self-Imitation Learning ☆278 Updated 5 years ago
- A continuous action space version of A3C LSTM in PyTorch plus A3G design ☆260 Updated last year
- This repo replicates the results Horgan et al. obtained in "Distributed Prioritized Experience Replay" ☆190 Updated 6 years ago
- ☆92 Updated 5 years ago
- Proximal Policy Optimization with Beta distribution, using the multi-agent Unity ML Tennis environment ☆30 Updated 7 years ago