Solving the Rubik's cube with deep reinforcement learning and Monte Carlo tree search
☆108Apr 15, 2019Updated 7 years ago
Alternatives and similar repositories for puzzle_cube
Users that are interested in puzzle_cube are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An implementation of the paper "Solving the Rubik's Cube without Human Knowledge"☆14Dec 9, 2018Updated 7 years ago
- RL experiments☆69Nov 21, 2022Updated 3 years ago
- A PyTorch AI that learns to solve Rubik's Cubes using Deep Q-Learning.☆25Mar 4, 2020Updated 6 years ago
- different AI algorithms to solve board games☆19Nov 4, 2018Updated 7 years ago
- Online demo of DRLViz, an interactive tool to understand decisions and memory in Deep Reinforcement Learning☆16Dec 8, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- JAX implementation of Graph Attention Networks☆13Jan 29, 2022Updated 4 years ago
- Project 1 of Udacity's Deep Reinforcement Learning nanodegree program☆13Dec 2, 2018Updated 7 years ago
- A Neural Network designed to solve any rubik's cube of size NxNxN☆20Aug 23, 2015Updated 10 years ago
- Upper Confidence Tree Planner for ATARI games☆19Mar 9, 2016Updated 10 years ago
- A3C style Option-Critic with deliberation cost☆40Jan 9, 2018Updated 8 years ago
- Code for the paper "Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition"☆37May 9, 2019Updated 6 years ago
- ☆13Mar 11, 2018Updated 8 years ago
- Stochastic Markov Games☆12Oct 5, 2017Updated 8 years ago
- ☆14Jun 21, 2016Updated 9 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representations☆12Sep 4, 2024Updated last year
- Othello program created by Gunnar Andersson - This is a copy of the original code -☆16Apr 29, 2014Updated 12 years ago
- Some baselines for Pommerman competition☆46Jul 18, 2018Updated 7 years ago
- Actor critic reinforcement learning + motion and task planning under LTL tasks + wireless sensor network routing☆15Mar 6, 2021Updated 5 years ago
- This project was created for Unity ML-Agents Challenge - https://connect.unity.com/challenges/ml-agents-1☆12Aug 15, 2020Updated 5 years ago
- AI for Mathematics Paper List☆17Jan 14, 2025Updated last year
- Implement Google Deep Minds DQN for multiple agents for a grid world environment where vehicles must pick up customers.☆29Mar 7, 2018Updated 8 years ago
- A standalone release of DeepMind Lab's maze generator with Python bindings.☆67Oct 3, 2023Updated 2 years ago
- My Simple Implementation of AlphaGo Zero on Connect4☆18Apr 25, 2018Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Autonomous visual navigation using the depth images☆11Aug 15, 2019Updated 6 years ago
- MCM/ICM 2017 B☆10Jan 29, 2017Updated 9 years ago
- ☆15Apr 1, 2026Updated last month
- Code for Continual Reinforcement Learning with Multi-Timescale Replay☆24Apr 16, 2020Updated 6 years ago
- Some hard problems for reinforcement learning.☆32Oct 5, 2018Updated 7 years ago
- 📝 Papers I read and notes/reviews I made. Also useful links to courses (RL/NLP/Bio/QC/DevOps)☆10May 4, 2021Updated 5 years ago
- RC-NFQ: Regularized Convolutional Neural Fitted Q Iteration. A batch algorithm for deep reinforcement learning. Incorporates dropout regu…☆12Mar 17, 2021Updated 5 years ago
- Published by Packt☆11Jan 18, 2021Updated 5 years ago
- the solustion to https://openai.com/requests-for-research☆12Mar 23, 2017Updated 9 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Various DQN method with cartpole☆11May 30, 2018Updated 7 years ago
- Official code for NeurIPS paper "Combinatorial Optimization for Panoptic Segmentation: A Fully Differentiable Approach".☆16Jun 30, 2022Updated 3 years ago
- ☆11May 15, 2020Updated 5 years ago
- Automatically exported from code.google.com/p/okuharaandroid-edax-reversi☆22Apr 21, 2026Updated 2 weeks ago
- ☆67Apr 27, 2026Updated last week
- ☆19Sep 20, 2018Updated 7 years ago
- (CoRL 2019 Spotlight) Asynchronous Methods for Model-Based Reinforcement Learning☆14Dec 27, 2022Updated 3 years ago