jasonrute / puzzle_cube
Solving the Rubik's cube with deep reinforcement learning and Monte Carlo tree search
☆98Updated 5 years ago
Alternatives and similar repositories for puzzle_cube:
Users that are interested in puzzle_cube are comparing it to the libraries listed below
- Reinforcement Learning implementations and research prototyping in TensorFlow☆81Updated 5 years ago
- Highly Modular and Scalable Reinforcement Learning☆114Updated 5 years ago
- Random Network Distillation(RND) algo in Pytorch☆48Updated 5 years ago
- ☆66Updated 3 years ago
- Reinforcement Learning Assembly☆92Updated 3 years ago
- ☆106Updated 5 years ago
- Keeping track of RL experiments☆159Updated 2 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x☆62Updated 3 years ago
- Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆36Updated 6 years ago
- Open AI gym environment for the game 2048☆71Updated 2 years ago
- Clone of OpenAI's Spinning Up in PyTorch☆146Updated 2 years ago
- Atari - Deep Reinforcement Learning algorithms in TensorFlow☆135Updated 10 months ago
- C51-DDQN in Keras☆125Updated 7 years ago
- Reinforcement learning framework to accelerate research☆204Updated 3 years ago
- Benchmarking Canonical Evolution Strategies for Playing Atari☆81Updated 6 years ago
- This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M…☆79Updated 6 years ago
- Code for the paper "Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition"☆37Updated 5 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters"☆151Updated 7 years ago
- This package allows to use PLE as a gym environment.☆72Updated 4 years ago
- My implementation of the Proximal Policy Optisation algorithm using Keras as a backend☆88Updated 5 years ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package☆266Updated 5 years ago
- Augmented environments with RL☆103Updated 5 years ago
- RL experiments☆70Updated 2 years ago
- Tensorflow/Keras code and trained models for Episodic Curiosity Through Reachability☆199Updated 4 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆115Updated 3 years ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning☆94Updated 4 years ago
- A PyTorch implementation of Rainbow DQN agent☆168Updated 6 years ago
- A binary release of trained deep reinforcement learning models trained in the Atari machine learning benchmark, and a software release th…☆201Updated 4 years ago
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow☆59Updated 6 years ago
- In Progress : State of the art Distributed Distributional Deep Deterministic Policy Gradient algorithm implementation in pytorch.☆18Updated 6 years ago