jasonrute / puzzle_cube
Solving the Rubik's cube with deep reinforcement learning and Monte Carlo tree search
☆107 Updated 6 years ago
Alternatives and similar repositories for puzzle_cube
Users who are interested in puzzle_cube are comparing it to the libraries listed below.
- Highly Modular and Scalable Reinforcement Learning ☆118 Updated 6 years ago
- Library for running a Monte Carlo tree search, either traditionally or with expert policies ☆127 Updated last year
- Reinforcement Learning implementations and research prototyping in TensorFlow ☆81 Updated 6 years ago
- Clone of OpenAI's Spinning Up in PyTorch ☆156 Updated 3 years ago
- This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M… ☆80 Updated 7 years ago
- Random Network Distillation (RND) algorithm in PyTorch ☆51 Updated 6 years ago
- Code for the paper "Learning Human Objectives by Evaluating Hypothetical Behavior" ☆84 Updated 6 years ago
- Awesome RL: Papers, Books, Codes, Benchmarks ☆119 Updated 2 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X, etc. in TF1.x ☆62 Updated 4 years ago
- Reinforcement Learning Assembly ☆92 Updated 4 years ago
- Distributed implementation of popular evolutionary methods ☆64 Updated 8 years ago
- An implementation of Monte Carlo Tree Search in Python ☆163 Updated 5 years ago
- Benchmarking Canonical Evolution Strategies for Playing Atari ☆82 Updated 7 years ago
- PyTorch implementation of Advantage Actor-Critic (A2C) ☆47 Updated 8 years ago
- Source code of Neural Logic Reinforcement Learning (https://arxiv.org/abs/1904.10729) ☆77 Updated 6 years ago
- PyTorch implementation of Proximal Policy Optimization ☆53 Updated 8 years ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning ☆97 Updated 5 years ago
- Augmented environments with RL ☆103 Updated 6 years ago
- A standalone release of DeepMind Lab's maze generator with Python bindings. ☆67 Updated 2 years ago
- OpenAI Gym environment for the game 2048 ☆76 Updated 3 years ago
- Demo of UCT (MCTS) in Python / NumPy ☆88 Updated 3 years ago
- Loose taxonomy of reinforcement learning algorithms ☆190 Updated 5 years ago
- A framework for easy prototyping of distributed reinforcement learning algorithms ☆96 Updated 5 years ago
- ☆66 Updated 4 years ago
- A PyTorch Library for Reinforcement Learning Research ☆198 Updated 6 months ago
- PyTorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098) ☆96 Updated 7 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss. ☆123 Updated 4 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation ☆32 Updated 6 years ago
- ☆71 Updated 3 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020. ☆103 Updated 10 months ago