pigooosuke / gym_reversi
openAI gym env for reversi/othello game
☆20Updated last year
Related projects ⓘ
Alternatives and complementary repositories for gym_reversi
- Example implementation of Alpha Zero' s algotirhm on Jupyter notebook☆15Updated 5 years ago
- Fast Flexible Replay Buffer Library (Mirror repository of https://gitlab.com/ymd_h/cpprb)☆72Updated 5 months ago
- An out-of-the-box GUI tool for offline deep reinforcement learning☆95Updated 3 years ago
- DQN implementation in Keras + TensorFlow + OpenAI Gym☆46Updated 7 years ago
- Best Papers nominees from top conferences related to Artificial Intelligence☆20Updated 5 years ago
- ☆53Updated last year
- Datasets for data-driven deep reinforcement learning with PyBullet environments☆143Updated 3 years ago
- ☆18Updated last month
- PyTorch RL for Pommerman☆38Updated 6 years ago
- Deep reinforcement learning with tensorflow2☆91Updated 3 weeks ago
- Code for the paper "Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition"☆37Updated 5 years ago
- Simple tools for statistical analyses in RL experiments☆66Updated 6 years ago
- Some baselines for Pommerman competition☆46Updated 6 years ago
- Simple Distributed Reinforcement Learning Framework(シンプルな分散強化学習フレームワーク)☆41Updated this week
- NIPS 2017 Value Prediction Network☆166Updated 6 years ago
- This package allows to use PLE as a gym environment.☆72Updated 4 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆90Updated 6 years ago
- AI for google research football☆27Updated 3 years ago
- HandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own…☆282Updated 6 months ago
- some common TD Learning algorithms☆67Updated 4 years ago
- Implementation of clipped action policy gradient (CAPG) with PPO and TRPO☆30Updated 6 years ago
- Exploration Strategies for Deep Reinforcement Learning☆39Updated 6 years ago
- Code for 'The Grand Atari Challenge dataset' paper☆52Updated 7 years ago
- Reinforcement Learning Assembly☆92Updated 3 years ago
- Implementation of the Option-Critic Architecture on the Atari (ALE) environment☆170Updated 7 years ago
- Pytorch implementation of distributed deep reinforcement learning☆74Updated 2 years ago
- A3C style Option-Critic with deliberation cost☆39Updated 6 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters"☆153Updated 7 years ago
- ☆10Updated 7 years ago