Ktakuya332C / deepcubeLinks

An implementation of the paper "Solving the Rubik's Cube without Human Knowledge"

☆12

Alternatives and similar repositories for deepcube

Users that are interested in deepcube are comparing it to the libraries listed below

Sorting:

jasonrute / puzzle_cube
Solving the Rubik's cube with deep reinforcement learning and Monte Carlo tree search
☆103Updated 6 years ago
xuzijian629 / combopt-zero
A reinforcement learning based solver for combinatorial problems
☆44Updated 3 years ago
richemslie / galvanise_zero
Learning from zero (mostly based off of AlphaZero) in General Game Playing.
☆83Updated 2 years ago
kamildar / gym-match3
env for gym, match3 game
☆11Updated 6 years ago
vivek3141 / rubiks-cube-ai
Using Deep Reinforcement Learning, a computer program learns how to solve the Rubik's Cube, the world's most popular toy.
☆19Updated 6 years ago
hardmaru / RainbowSlimeVolley
Using Rainbow implementation in Chainer RL for Slime Volleyball Pixel Environment
☆23Updated 5 years ago
Zeta36 / connect4-alpha-zero
Connect4 reinforcement learning by AlphaGo Zero methods.
☆113Updated 4 years ago
liuanji / WU-UCT
A novel parallel UCT algorithm with linear speedup and negligible performance loss.
☆119Updated 4 years ago
ronaldosvieira / gym-locm
OpenAI Gym environments for Legends of Code and Magic, a collectible card game designed for AI research
☆37Updated 8 months ago
lake4790k / lockfree-mcts-cpp
General lockfree Monte Carlo Tree Search implementation in Cpp
☆9Updated 9 years ago
jcoreyes / evolvingrl
Supplementary Data for Evolving Reinforcement Learning Algorithms
☆46Updated 4 years ago
jidiai / Competition_AAMAS2023
source code for AAMAS 2023 Imperfect-information Card Game Competition
☆13Updated last year
rwightman / pytorch-pommerman-rl
PyTorch RL for Pommerman
☆38Updated 6 years ago
yenchenlin / evf-public
Experience-embedded Visual Foresight, CoRL 2019
☆14Updated 5 years ago
EricSteinberger / Neural-Fictitous-Self-Play
Scalable Implementation of Neural Fictitous Self-Play
☆83Updated 6 years ago
yuta0821 / agent57_pytorch
unofficial code reproducing Agent57
☆37Updated last year
huangeddie / GymGo
An environment of the board game Go using OpenAI's Gym API
☆175Updated 3 years ago
initial-h / AlphaZero_Gomoku_MPI
An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku
☆209Updated 4 months ago
facebookresearch / jps
Code for "Joint Policy Search for Collaborative Multi-agent Incomplete Information Games"
☆52Updated last year
ShaniGam / RL-GAN
Transfer Learning for Related Reinforcement Learning Tasks via Image-to-Image Translation
☆49Updated 5 years ago
jcwleo / mario_rl
☆69Updated 6 years ago
junsu-kim97 / PIG
PyTorch code accompanying the paper "Imitating Graph-Based Planning with Goal-Conditioned Policies" (ICLR 2023).
☆19Updated 2 years ago
petosa / multiplayer-alphazero
PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop]
☆34Updated 4 years ago
AranKomat / Alpha-Transformer
Alpha Zero equipped with Transformer with various novel techniques for speedup in tree search
☆27Updated 6 years ago
maxreciprocate / offline
Offline RL experiments
☆15Updated 2 years ago
shumaym / Rubiks_Cube_AI
A PyTorch AI that learns to solve Rubik's Cubes using Deep Q-Learning.
☆22Updated 5 years ago
icaros-usc / dqd
A python implementation of differentiable quality diversity.
☆49Updated 3 years ago
Deepest-Project / WorldModels-A3C
World Models with A3C on Carracing-v0 in gym
☆31Updated 5 years ago
machine-reasoning-ufrgs / GNN-GCP
Graph Neural Network architecture to solve the decision version of the graph coloring problem (GCP)
☆25Updated 5 years ago
DwangoMediaVillage / chainer_spiral
A modified implementation of Synthesizing Programs for Images using Reinforced Adversarial Learning (SPIRAL) using ChainerRL.
☆24Updated 4 years ago