apoddar573 / Tic-Tac-Toe-Gym_Environment
This is an implementation of Tic-Tac-Toe as a gym environment. It can be used to train an agent to play the game.
☆26 Updated 6 years ago
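To make the idea of "Tic-Tac-Toe as a gym environment" concrete, here is a minimal sketch of an environment that follows the Gym-style `reset`/`step` convention. It is not the repository's code: the class name `TicTacToeEnv`, the board encoding (0 empty, 1 for X, -1 for O), the reward values, and the rule that an illegal move ends the episode are all assumptions made for illustration. It deliberately avoids importing `gym` so that it runs on the standard library alone.

```python
from typing import Dict, List, Optional, Tuple

# All eight winning lines on a 3x3 board, as index triples into a flat list.
WIN_LINES = [
    (0, 1, 2), (3, 4, 5), (6, 7, 8),  # rows
    (0, 3, 6), (1, 4, 7), (2, 5, 8),  # columns
    (0, 4, 8), (2, 4, 6),             # diagonals
]


class TicTacToeEnv:
    """Hypothetical Gym-style environment; not the repository's actual class."""

    def __init__(self) -> None:
        self.board: List[int] = [0] * 9  # assumed encoding: 0 empty, 1 X, -1 O
        self.player: int = 1             # X moves first

    def reset(self) -> Tuple[int, ...]:
        """Clear the board and return the initial observation."""
        self.board = [0] * 9
        self.player = 1
        return tuple(self.board)

    def winner(self) -> Optional[int]:
        """Return 1 or -1 if that player has three in a line, else None."""
        for a, b, c in WIN_LINES:
            if abs(self.board[a] + self.board[b] + self.board[c]) == 3:
                return self.board[a]
        return None

    def step(self, action: int) -> Tuple[Tuple[int, ...], float, bool, Dict[str, bool]]:
        """Place the current player's mark; return (obs, reward, done, info)."""
        if not 0 <= action < 9 or self.board[action] != 0:
            # Assumed convention: an illegal move ends the episode with a penalty.
            return tuple(self.board), -1.0, True, {"illegal": True}
        self.board[action] = self.player
        won = self.winner() is not None
        done = won or all(cell != 0 for cell in self.board)
        reward = 1.0 if won else 0.0  # reward is from the mover's perspective
        self.player = -self.player    # hand the turn to the other player
        return tuple(self.board), reward, done, {"illegal": False}


if __name__ == "__main__":
    import random

    env = TicTacToeEnv()
    obs, done = env.reset(), False
    while not done:
        # Random policy over the empty cells, standing in for a learning agent.
        action = random.choice([i for i, cell in enumerate(obs) if cell == 0])
        obs, reward, done, info = env.step(action)
    print(obs, reward)
```

A learning agent would replace the random choice in the loop with its own policy. Subclassing `gym.Env` and declaring action and observation spaces would be the usual next step, but that depends on the installed Gym version and is left out here.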
Alternatives and similar repositories for Tic-Tac-Toe-Gym_Environment:
Users who are interested in Tic-Tac-Toe-Gym_Environment are comparing it to the libraries listed below.
- Old and new Reinforcement Learning algorithms run on the GridUniverse ecosystem ☆22 Updated 6 years ago
- Simple bit flipping with sparse rewards using HER, similarly to the original paper ☆39 Updated 6 years ago
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow ☆58 Updated 6 years ago
- A simple stochastic OpenAI environment for training RL agents ☆89 Updated 2 years ago
- This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled … ☆117 Updated 4 months ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber. ☆77 Updated 4 years ago
- TensorFlow implementation of asynchronous advantage actor-critic (A3C) ☆39 Updated 3 years ago
- This repository contains the game bubble shooter as a gym environment. Based on: https://github.com/justinmeister/bubbleshooter ☆17 Updated 5 years ago
- Simple grid-world environment compatible with OpenAI-gym ☆49 Updated 5 years ago
- ☆16 Updated 5 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm ☆45 Updated 2 years ago
- Implementation of Soft Actor Critic ☆37 Updated 3 years ago
- A framework for easy prototyping of distributed reinforcement learning algorithms ☆95 Updated 4 years ago
- A simple Gridworld environment for Open AI gym ☆25 Updated 6 years ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit. ☆31 Updated 5 years ago
- Gridworld environments for OpenAI gym. ☆80 Updated last year
- Proximal policy optimization in PyTorch. Easy to read and understand. ☆49 Updated 4 years ago
- A Tensorflow implementation of the Option-Critic Architecture ☆72 Updated 7 years ago
- Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro ☆174 Updated 2 years ago
- Bandits Environments for the OpenAI Gym ☆91 Updated 5 years ago
- Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc. ☆150 Updated last year
- My implementation of the Proximal Policy Optimization algorithm using Keras as a backend ☆88 Updated 5 years ago
- C51-DDQN in Keras ☆126 Updated 7 years ago
- A repository for code of reinforcement learning algorithms with PyTorch ☆30 Updated 3 years ago
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC ☆98 Updated 5 years ago
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) ☆137 Updated 6 years ago
- Implementation of Deep/Double Deep/Dueling Deep Q networks for playing Atari games using Keras and OpenAI gym ☆40 Updated 6 years ago
- PyTorch implementation of Proximal Policy Optimization ☆51 Updated 7 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment ☆104 Updated 5 years ago
- Gridworld for MARL experiments ☆139 Updated 4 years ago