tanliyon / gym-xiangqi
This repo sets up an environment for playing Xiangqi (Chinese chess) following the OpenAI Gym framework.
☆33 Updated last year
Related projects
Alternatives and complementary repositories for gym-xiangqi
- An environment for the board game Go using OpenAI's Gym API ☆168 Updated 2 years ago
- Solving the Rubik's cube with deep reinforcement learning and Monte Carlo tree search ☆95 Updated 5 years ago
- This project is an implementation of AlphaStar ☆187 Updated 10 months ago
- PyTorch implementation of MuZero ☆343 Updated last year
- Scalable implementation of Neural Fictitious Self-Play ☆73 Updated 5 years ago
- Connect4 reinforcement learning by AlphaGo Zero methods. ☆114 Updated 3 years ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,… ☆120 Updated 3 years ago
- Various ways to teach a computer to escape from a maze. From random walk to a simple neural network. ☆93 Updated 2 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss. ☆112 Updated 3 years ago
- An OpenAI Gym environment for the Flappy Bird game ☆113 Updated 2 years ago
- ☆13 Updated 2 years ago
- Very easy implementation of dueling DQN in PyTorch ☆69 Updated last year
- PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop] ☆34 Updated 3 years ago
- PyTorch implementation of FQF, IQN and QR-DQN. ☆161 Updated 3 months ago
- Simple, readable, yet full-featured implementation of PPO in PyTorch ☆44 Updated 2 years ago
- A fast, generalized, and modified implementation of DeepMind's distinguished AlphaZero in PyTorch. ☆66 Updated last year
- The Arcade Learning Environment (ALE) -- a platform for AI research. ☆22 Updated 2 months ago
- Random Network Distillation (RND) algorithm in PyTorch ☆48 Updated 5 years ago
- ☆65 Updated 3 years ago
- General Python implementation of Monte Carlo Tree Search for use with OpenAI Gym environments. ☆35 Updated 4 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020. ☆101 Updated 3 years ago
- Deep Q-Learning (DQN) implementation for Atari Pong. ☆72 Updated 2 years ago
- Code for the paper "Phasic Policy Gradient" ☆252 Updated last year
- An asynchronous/parallel implementation of the AlphaGo Zero algorithm for Gomoku ☆189 Updated 4 years ago
- DQN to play Atari Pong ☆111 Updated 5 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function" ☆128 Updated last year
- Python and R tutorial for RLCard in Jupyter Notebook ☆83 Updated 2 years ago
- Sokoban environment for OpenAI Gym ☆330 Updated last year
- ☆25 Updated 3 years ago
- A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2. ☆12 Updated 3 years ago