Ktakuya332C / deepcube
An implementation of the paper "Solving the Rubik's Cube without Human Knowledge"
☆14 · Updated 6 years ago
Alternatives and similar repositories for deepcube
Users who are interested in deepcube are comparing it to the libraries listed below.
- Solving the Rubik's cube with deep reinforcement learning and Monte Carlo tree search ☆104 · Updated 6 years ago
- PyTorch code accompanying the paper "Imitating Graph-Based Planning with Goal-Conditioned Policies" (ICLR 2023). ☆19 · Updated 2 years ago
- Code for DeepCubeA, a Deep Reinforcement Learning algorithm that can learn to solve the Rubik's cube. ☆192 · Updated 9 months ago
- A PyTorch AI that learns to solve Rubik's Cubes using Deep Q-Learning. ☆23 · Updated 5 years ago
- ☆10 · Updated 4 years ago
- Code to reproduce the NeurIPS 2019 paper "Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottlen… ☆50 · Updated 5 years ago
- ☆89 · Updated 8 months ago
- Learning from zero (mostly based off of AlphaZero) in General Game Playing. ☆84 · Updated 2 years ago
- Code for our CVPR-2021 paper on Combining Semantic Guidance and Deep Reinforcement Learning For Generating Human Level Paintings. ☆27 · Updated 3 years ago
- M-CURL: Masked Contrastive Representation Learning for Reinforcement Learning ☆28 · Updated 4 years ago
- ☆11 · Updated 4 years ago
- env for gym, match3 game ☆10 · Updated 6 years ago
- Code for "Joint Policy Search for Collaborative Multi-agent Incomplete Information Games" ☆52 · Updated last year
- Using Deep Reinforcement Learning, a computer program learns how to solve the Rubik's Cube, the world's most popular toy. ☆19 · Updated 7 years ago
- Implementation of Adversarial autoencoders ☆11 · Updated 4 years ago
- A python implementation of differentiable quality diversity. ☆49 · Updated 3 years ago
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning ☆16 · Updated 2 years ago
- [IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework ☆103 · Updated 2 months ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms ☆46 · Updated 4 years ago
- AlphaZero in JAX ☆78 · Updated last year
- ☆13 · Updated 4 years ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration ☆69 · Updated 4 years ago
- Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation) ☆67 · Updated 4 years ago
- Offline RL experiments ☆15 · Updated 2 years ago
- ☆31 · Updated last year
- PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop] ☆34 · Updated 4 years ago
- Edax reversi version 4.6 ☆122 · Updated 6 months ago
- This is the pytorch implementation of the UAI2023 paper "A Trajectory is Worth Three Sentences: Multimodal Transformer for Offline Reinf… ☆11 · Updated last year
- This is code to accompany the paper "Accelerating Exploration with Unlabeled Prior Data". ☆25 · Updated last year
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code… ☆27 · Updated 2 years ago