Solving the Rubik's cube with deep reinforcement learning and Monte Carlo tree search
☆107 · Apr 15, 2019 · Updated 6 years ago
Alternatives and similar repositories for puzzle_cube
Users that are interested in puzzle_cube are comparing it to the libraries listed below
- RL experiments ☆69 · Nov 21, 2022 · Updated 3 years ago
- Stochastic Markov Games ☆12 · Oct 5, 2017 · Updated 8 years ago
- This project was created for Unity ML-Agents Challenge - https://connect.unity.com/challenges/ml-agents-1 ☆12 · Aug 15, 2020 · Updated 5 years ago
- Supplementary Material to accompany the paper, DJ Warne, SA Sisson, C Drovandi (2019) Acceleration of expensive computations in Bayesian… ☆13 · Oct 23, 2020 · Updated 5 years ago
- ☆14 · Updated this week
- ☆13 · Mar 11, 2018 · Updated 7 years ago
- ☆14 · Jun 21, 2016 · Updated 9 years ago
- Implement Google Deep Minds DQN for multiple agents for a grid world environment where vehicles must pick up customers. ☆29 · Mar 7, 2018 · Updated 8 years ago
- Online demo of DRLViz, an interactive tool to understand decisions and memory in Deep Reinforcement Learning ☆16 · Dec 8, 2022 · Updated 3 years ago
- gui for board game hex (and Y) by broderick arneson ☆15 · Dec 13, 2023 · Updated 2 years ago
- A standalone release of DeepMind Lab's maze generator with Python bindings. ☆67 · Oct 3, 2023 · Updated 2 years ago
- Actor critic reinforcement learning + motion and task planning under LTL tasks + wireless sensor network routing ☆15 · Mar 6, 2021 · Updated 5 years ago
- Codes of our team for the OpenAI Retro Contest of reinforcement learning ☆99 · Jun 19, 2018 · Updated 7 years ago
- different AI algorithms to solve board games ☆19 · Nov 4, 2018 · Updated 7 years ago
- RC-NFQ: Regularized Convolutional Neural Fitted Q Iteration. A batch algorithm for deep reinforcement learning. Incorporates dropout regu… ☆12 · Mar 17, 2021 · Updated 4 years ago
- CUDA extension for the SPORCO project ☆18 · Jul 5, 2021 · Updated 4 years ago
- ☆13 · Apr 22, 2022 · Updated 3 years ago
- A3C style Option-Critic with deliberation cost ☆40 · Jan 9, 2018 · Updated 8 years ago
- Implementation of SPW and DPW for Monte Carlo Tree Search in Continuous action/state space ☆20 · Oct 3, 2023 · Updated 2 years ago
- Implementation of self-play based reinforcement learning for Checkers based on the AlphaGo Zero methods. ☆19 · May 8, 2018 · Updated 7 years ago
- Python 2x2 Rubik's Cube representation & solver ☆21 · Jul 29, 2022 · Updated 3 years ago
- Code for the paper "Decomposing the Enigma: Subgoal-based Demonstration Learning for Formal Theorem Proving" ☆19 · May 25, 2023 · Updated 2 years ago
- Federated Learning Infra Architecture on Kubernetes(EKS) ☆20 · Nov 18, 2019 · Updated 6 years ago
- ☆47 · Jun 19, 2018 · Updated 7 years ago
- Library for running a Monte Carlo tree search, either traditionally or with expert policies ☆128 · Apr 22, 2024 · Updated last year
- ☆23 · Oct 7, 2018 · Updated 7 years ago
- Efficient Reinforcement Learning with a Thought-Game for StarCraft ☆46 · Dec 8, 2022 · Updated 3 years ago
- Code for Continual Reinforcement Learning with Multi-Timescale Replay ☆24 · Apr 16, 2020 · Updated 5 years ago
- Some baselines for Pommerman competition ☆46 · Jul 18, 2018 · Updated 7 years ago
- Using Deep Reinforcement Learning, a computer program learns how to solve the Rubik's Cube, the world's most popular toy. ☆18 · Aug 4, 2018 · Updated 7 years ago
- dqn autoplay mario bros ☆21 · Jul 24, 2017 · Updated 8 years ago
- ☆26 · Nov 1, 2021 · Updated 4 years ago
- ☆19 · Sep 20, 2018 · Updated 7 years ago
- AMR-to-Text Generator with Side Information ☆24 · Jul 20, 2021 · Updated 4 years ago
- ☆27 · Feb 3, 2026 · Updated last month
- Code for the CoNLL 2019 paper "Compositional Generalization in Image Captioning" by Mitja Nikolaus, Mostafa Abdou, Matthew Lamm, Rahul Ar… ☆26 · Jun 14, 2020 · Updated 5 years ago
- rectorch is a pytorch-based framework for state-of-the-art top-N recommendation ☆150 · Mar 16, 2021 · Updated 4 years ago
- Easing non-convex optimization with neural networks. ☆23 · Aug 21, 2018 · Updated 7 years ago
- ☆28 · Nov 28, 2021 · Updated 4 years ago