Ktakuya332C / deepcubeLinks
An implementation of the paper "Solving the Rubik's Cube without Human Knowledge"
☆12Updated 6 years ago
Alternatives and similar repositories for deepcube
Users that are interested in deepcube are comparing it to the libraries listed below
Sorting:
- Solving the Rubik's cube with deep reinforcement learning and Monte Carlo tree search☆101Updated 6 years ago
- env for gym, match3 game☆11Updated 6 years ago
- OpenAI Gym environments for Legends of Code and Magic, a collectible card game designed for AI research☆37Updated 7 months ago
- Content Masked Loss: Human-Like Brush Stroke Planning in a Reinforcement Learning Painting Agent. Code for AAAI'21 Paper.☆19Updated 2 years ago
- ☆23Updated 3 years ago
- Improving upon state of the art cooperative deep reinforcement learning in StarCraft II☆13Updated 6 years ago
- Paper notes☆12Updated 7 years ago
- Code for "Joint Policy Search for Collaborative Multi-agent Incomplete Information Games"☆51Updated last year
- The code of building a web demo for Auto_painter☆27Updated 5 years ago
- ☆40Updated last year
- ☆17Updated last year
- openAI gym env for reversi/othello game☆20Updated last year
- Neural Crossbreed: Neural Based Image Metamorphosis☆21Updated 3 years ago
- A Tetris environment to train machine learning agents☆69Updated last year
- Reinforcement learning algorithms to play Poker☆14Updated 3 years ago
- SPADE-based Line Art Colorization☆15Updated 2 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆13Updated 2 years ago
- World Models with A3C on Carracing-v0 in gym☆32Updated 5 years ago
- Using Deep Reinforcement Learning, a computer program learns how to solve the Rubik's Cube, the world's most popular toy.☆19Updated 6 years ago
- Contextual Bandits Action Elimination DQN☆21Updated 6 years ago
- Using Rainbow implementation in Chainer RL for Slime Volleyball Pixel Environment☆23Updated 4 years ago
- Qiita投稿用に作成したAgent57(強化学習)の実装コードです。☆45Updated 2 years ago
- A modified implementation of Synthesizing Programs for Images using Reinforced Adversarial Learning (SPIRAL) using ChainerRL.☆24Updated 3 years ago
- ☆13Updated 3 years ago
- AI for the game Uno☆19Updated 5 years ago
- A modified Alphazero implementation with C++ where performance matters.☆17Updated last year
- General lockfree Monte Carlo Tree Search implementation in Cpp☆9Updated 8 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆119Updated 4 years ago
- ☆68Updated 3 years ago
- PyTorch implementation of Never Give Up: Learning Directed Exploration Strategies☆58Updated 4 years ago