Ktakuya332C / deepcube
An implementation of the paper "Solving the Rubik's Cube without Human Knowledge"
☆14 · Updated 6 years ago
Alternatives and similar repositories for deepcube
Users who are interested in deepcube are comparing it to the repositories listed below.
- AlphaZero in JAX ☆78 · Updated last year
- PyTorch code accompanying the paper "Imitating Graph-Based Planning with Goal-Conditioned Policies" (ICLR 2023). ☆19 · Updated 2 years ago
- Solving the Rubik's cube with deep reinforcement learning and Monte Carlo tree search ☆103 · Updated 6 years ago
- Gym environment for a match-3 game ☆10 · Updated 6 years ago
- A reinforcement learning based solver for combinatorial problems ☆44 · Updated 3 years ago
- ☆89 · Updated 7 months ago
- Code to reproduce results on toy tasks and companion blog for the paper. ☆21 · Updated 3 years ago
- PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop] ☆34 · Updated 4 years ago
- ☆17 · Updated last year
- Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021) ☆49 · Updated 2 months ago
- Research Into Learning to Generate Game Levels through Play ☆30 · Updated 5 years ago
- OpenAI Gym environments for Legends of Code and Magic, a collectible card game designed for AI research ☆38 · Updated 10 months ago
- ☆13 · Updated 3 years ago
- Code for the paper "A Boolean Task Algebra For Reinforcement Learning" ☆12 · Updated 2 years ago
- MiniZero: An AlphaZero and MuZero Training Framework ☆98 · Updated last month
- A novel parallel UCT algorithm with linear speedup and negligible performance loss. ☆120 · Updated 4 years ago
- ☆43 · Updated 4 years ago
- Code to reproduce the NeurIPS 2019 paper "Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottlen… ☆50 · Updated 5 years ago
- Code for the paper "Batch size invariance for policy optimization" ☆52 · Updated 2 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆114 · Updated last year
- ☆17 · Updated 4 years ago
- Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation) ☆66 · Updated 4 years ago
- The collection of the research works about Automatic Reinforcement Learning in Microsoft Research Asia. ☆58 · Updated last month
- An implementation of MuZero in JAX. ☆56 · Updated 2 years ago
- RAD: Reinforcement Learning with Augmented Data (code for procgen experiments) ☆18 · Updated 4 years ago
- A2C is a special case of PPO! ☆22 · Updated 3 years ago
- A PyTorch AI that learns to solve Rubik's Cubes using Deep Q-Learning. ☆23 · Updated 5 years ago
- Source code for AAMAS 2023 Imperfect-information Card Game Competition ☆13 · Updated last year
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration ☆69 · Updated 4 years ago
- Official PyTorch implementation of "Discovering Hierarchical Achievements in Reinforcement Learning via Contrastive Learning" (NeurIPS 20… ☆34 · Updated 6 months ago