nczempin / gym-tic-tac-toe
A simple two-player environment for openai/gym
☆20 Updated 6 years ago
Alternatives and similar repositories for gym-tic-tac-toe:
Users who are interested in gym-tic-tac-toe are comparing it to the libraries listed below.
- Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow ☆102 Updated 4 years ago
- Variation of "Asynchronous Methods for Deep Reinforcement Learning" with multiple processes generating experience for agent (Keras + Thea… ☆44 Updated 6 years ago
- Combining deep learning and reinforcement learning. ☆80 Updated 3 years ago
- RL experiments ☆70 Updated 2 years ago
- Simple grid-world environment compatible with OpenAI-gym ☆49 Updated 4 years ago
- This package allows PLE to be used as a gym environment. ☆72 Updated 4 years ago
- Implementation of prioritized experience replay ☆159 Updated 6 years ago
- Deep reinforcement learning using an asynchronous advantage actor-critic (A3C) model. ☆66 Updated 6 years ago
- ☆117 Updated 4 years ago
- Keras implementation of DQN on ViZDoom environment ☆53 Updated 8 years ago
- C51-DDQN in Keras ☆125 Updated 7 years ago
- DQN implementation in Keras + TensorFlow + OpenAI Gym ☆158 Updated 7 years ago
- ☆57 Updated 2 years ago
- A working implementation of the Categorical DQN (Distributional RL). ☆96 Updated 6 years ago
- Deep reinforcement learning baselines based on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei… ☆36 Updated 6 years ago
- Deep Reinforcement Learning ☆17 Updated 7 years ago
- DQN implementation in Keras + TensorFlow + OpenAI Gym ☆46 Updated 7 years ago
- TensorFlow implementation of Generative Adversarial Imitation Learning (GAIL) with discrete actions ☆113 Updated 6 years ago
- PyTorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098) ☆93 Updated 6 years ago
- A reinforcement learning framework ☆154 Updated 6 years ago
- Playing Atari games with TensorFlow implementation of Asynchronous Deep Q-Learning ☆42 Updated 6 years ago
- Noisy Networks for Exploration ☆185 Updated 7 years ago
- TensorFlow implementation of the A3C algorithm ☆47 Updated 7 years ago
- Code release for Learning with Opponent-Learning Awareness and variations. ☆146 Updated last year
- TensorFlow implementation of asynchronous advantage actor-critic (A3C) ☆39 Updated 3 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters" ☆151 Updated 7 years ago
- Chainer implementation of Double Deep Q-Network (Double DQN) ☆27 Updated 8 years ago
- PyTorch implementation of CommNet ☆36 Updated 7 years ago
- A TensorFlow based implementation of the DeepMind Atari playing "Deep Q Learning" agent that works reasonably well ☆92 Updated 7 years ago
- A simple Gridworld environment for OpenAI gym ☆25 Updated 6 years ago