Ktakuya332C / deepcube
An implementation of the paper "Solving the Rubik's Cube without Human Knowledge"
☆12Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for deepcube
- Using Deep Reinforcement Learning, a computer program learns how to solve the Rubik's Cube, the world's most popular toy.☆18Updated 6 years ago
- Solving the Rubik's cube with deep reinforcement learning and Monte Carlo tree search☆95Updated 5 years ago
- Edax reversi version 4.4 and above☆103Updated 2 months ago
- ☆39Updated last year
- Rubik's Cube Solver coded in Python.☆22Updated 4 years ago
- Improving upon state of the art cooperative deep reinforcement learning in StarCraft II☆13Updated 5 years ago
- Learning from zero (mostly based off of AlphaZero) in General Game Playing.☆81Updated 2 years ago
- Efficient Reinforcement Learning with a Thought-Game for StarCraft☆46Updated last year
- An implementation of improved AlphaGo algorithm in the game of Gomoku.☆57Updated 5 years ago
- ☆13Updated 3 years ago
- Connect4 reinforcement learning by AlphaGo Zero methods.☆114Updated 3 years ago
- Stroke-based Character Reconstruction ---> https://arxiv.org/abs/1806.08990☆15Updated 2 years ago
- Single-Life Reinforcement Learning☆14Updated last year
- Code for "Joint Policy Search for Collaborative Multi-agent Incomplete Information Games"☆50Updated last year
- Chess reinforcement learning by AlphaZero methods.☆38Updated 6 years ago
- Open AI gym environment for the game 2048☆71Updated 2 years ago
- Demo of UCT (MCTS) in Python / Numpy☆83Updated last year
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆45Updated 5 years ago
- ☆8Updated 2 years ago
- Computer go engine using Monte-Carlo Tree Search written in Python3.☆56Updated 6 months ago
- A PyTorch AI that learns to solve Rubik's Cubes using Deep Q-Learning.☆22Updated 4 years ago
- Manually exported from https://github.com/okuhara/edax-reversi-AVX to local on 2018/09/10.☆32Updated last year
- Mining GOLD Samples for Conditional GANs (NeurIPS 2019)☆17Updated 5 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆49Updated last year
- Implicit Distributional Actor Critic☆10Updated 2 years ago
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆43Updated 2 years ago
- different AI algorithms to solve board games☆18Updated 6 years ago
- Parallel Monte Carlo Tree Search, see README.md for more detailed usage and information.☆40Updated 3 years ago
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku☆189Updated 4 years ago