pathway / alphaxos
Deep Reinforcement Learning with Self-Play
☆12 Updated 6 years ago
Alternatives and similar repositories for alphaxos
Users who are interested in alphaxos are comparing it to the libraries listed below.
- An implementation of (Double/Dueling) Deep-Q Learning to play Super Mario Bros. ☆74 Updated 4 years ago
- DQN implementation in Keras + TensorFlow + OpenAI Gym ☆158 Updated 8 years ago
- Code accompanying the blog post "Deep Reinforcement Learning with TensorFlow 2.1" ☆206 Updated 4 years ago
- ☆304 Updated 2 years ago
- Rainbow, TensorFlow ☆49 Updated 7 years ago
- Interfacing RL agents with user-definable neural networks and OpenAI Gym environments. ☆12 Updated 6 years ago
- Deep reinforcement learning baselines based on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei… ☆35 Updated 7 years ago
- My implementation of the Proximal Policy Optimization algorithm using Keras as a backend ☆88 Updated 6 years ago
- Connect4 reinforcement learning by AlphaGo Zero methods. ☆113 Updated 4 years ago
- A collection of Python Machine Learning articles and examples. You will find code related to Reinforcement Learning, Q Learning, MDP, Bel… ☆190 Updated 3 years ago
- Simplest Version of playing Atari with Deep Q Learning in TensorFlow ☆158 Updated 8 years ago
- Collection of Deep Reinforcement Learning algorithms ☆300 Updated 6 years ago
- Gym - 32 levels of original Super Mario Bros ☆292 Updated 7 years ago
- Proximal Policy Optimization implementation with TensorFlow ☆108 Updated 7 years ago
- PyTorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google DeepMind's Asynchronous Advanta… ☆195 Updated last year
- A fast Evolution Strategy implementation in Python ☆272 Updated 5 years ago
- Implementation of prioritized experience replay ☆159 Updated 7 years ago
- Minimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras ☆160 Updated 6 years ago
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow ☆182 Updated 6 years ago
- This is the code for the "How to Beat Pong Using Policy Gradients (LIVE)" by Siraj Raval on YouTube ☆70 Updated 8 years ago
- API to access OpenAI Gym from other languages via HTTP ☆297 Updated 2 years ago
- OpenAI Gym Env for the game Gomoku (Five-In-a-Row, 五子棋, 五目並べ, omok, Gobang,...) ☆88 Updated last year
- A structured implementation of MuZero ☆206 Updated 3 years ago
- Half Field Offense in Robocup 2D Soccer ☆236 Updated 3 years ago
- This is the code for "Actor Critic Algorithms" by Siraj Raval on YouTube ☆75 Updated 8 years ago
- Use the asynchronous advantage actor-critic (A3C) algorithm to play Flappy Bird using Keras ☆39 Updated 8 years ago
- A3C-LSTM algorithm tested on CartPole OpenAI Gym environment ☆48 Updated 7 years ago
- AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Maste… ☆93 Updated 7 years ago
- Reinforcement Learning in Keras on VizDoom ☆142 Updated 8 years ago
- TensorFlow + OpenAI Gym implementation of Deep Q-Network (DQN), Double DQN (DDQN), Dueling Network and Deep Deterministic Policy Gradient… ☆79 Updated 8 years ago