apoddar573 / Tic-Tac-Toe-Gym_Environment
This is an implementation of the tic-tac-toe game as a gym environment. It can be used to make the computer learn playing the Tic-Tac-Toe game.
☆26Updated 5 years ago
Related projects: ⓘ
- Simple bit flipping with sparse rewards using HER, similarly to the original paper☆38Updated 5 years ago
- A simple stochastic OpenAI environment for training RL agents☆89Updated last year
- My implementation of the Proximal Policy Optisation algorithm using Keras as a backend☆88Updated 4 years ago
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow☆57Updated 5 years ago
- Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.☆150Updated last year
- Old and new Reinforcement Learning algorithms run on the GridUniverse ecosystem☆22Updated 5 years ago
- A simple Gridworld environment for Open AI gym☆24Updated 6 years ago
- Gridworld environments for OpenAI gym.☆78Updated 7 months ago
- Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.☆71Updated 3 years ago
- Simple grid-world environment compatible with OpenAI-gym☆49Updated 4 years ago
- Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro☆173Updated last year
- Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…☆125Updated 4 years ago
- Bandits Environments for the OpenAI Gym☆88Updated 4 years ago
- Atari - Deep Reinforcement Learning algorithms in TensorFlow☆137Updated 5 months ago
- Keras Implementation of PPO to solve OpenAI Gym Environments☆16Updated 6 years ago
- Highly Modular and Scalable Reinforcement Learning☆113Updated 4 years ago
- ☆72Updated last year
- TensorFlow implementation of asynchronous advantage actor-critic (A3C)☆39Updated 2 years ago
- C51-DDQN in Keras☆125Updated 6 years ago
- General implementation of Advantage Actor Critic using Pytorch☆26Updated 2 years ago
- Implementation of DDPG (Modified from the work of Patrick Emami) - Tensorflow (no TFLearn dependency), Ornstein Uhlenbeck noise function,…☆65Updated 7 years ago
- ☆91Updated 3 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆71Updated 7 years ago
- ☆15Updated 4 years ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.☆30Updated 5 years ago
- PyTorch implementation of Proximal Policy Optimization☆50Updated 6 years ago
- Yet another prioritized experience replay buffer implementation.☆47Updated last year
- A collection of multi-agent reinforcement learning OpenAI gym environments☆44Updated 4 years ago
- Using RLLib and PycoLab to explore intelligent cooperative behavior in sequential social dilemmas☆48Updated last year
- ☆76Updated 6 years ago