jasonrute / puzzle_cube
Solving the Rubik's cube with deep reinforcement learning and Monte Carlo tree search
☆98 Updated 5 years ago
Alternatives and similar repositories for puzzle_cube:
Users who are interested in puzzle_cube compare it to the libraries listed below:
- RL experiments ☆70 Updated 2 years ago
- Reinforcement Learning implementations and research prototyping in TensorFlow ☆81 Updated 5 years ago
- Clone of OpenAI's Spinning Up in PyTorch ☆146 Updated 2 years ago
- Highly Modular and Scalable Reinforcement Learning ☆114 Updated 5 years ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber. ☆77 Updated 4 years ago
- This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M… ☆79 Updated 6 years ago
- Easy TensorFlow logging for quick prototypes ☆110 Updated 3 years ago
- RLtime is a reinforcement learning library focused on state-of-the-art q-learning algorithms and features ☆140 Updated 5 years ago
- Reinforcement Learning Assembly ☆92 Updated 3 years ago
- Basic versions of agents from Spinning Up in Deep RL written in PyTorch ☆199 Updated 3 years ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package ☆266 Updated 5 years ago
- A framework for easy prototyping of distributed reinforcement learning algorithms ☆95 Updated 4 years ago
- C51-DDQN in Keras ☆125 Updated 7 years ago
- Atari - Deep Reinforcement Learning algorithms in TensorFlow ☆135 Updated 10 months ago
- ☆66 Updated 3 years ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning ☆94 Updated 4 years ago
- OpenAI Gym environment for the game 2048 ☆71 Updated 2 years ago
- A customizable framework to create maze and gridworld environments ☆260 Updated 5 years ago
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration" ☆197 Updated 6 years ago
- This repo replicates the results Horgan et al. obtained in "Distributed Prioritized Experience Replay" ☆189 Updated 5 years ago
- Keeping track of RL experiments ☆160 Updated 2 years ago
- PyTorch implementation of Proximal Policy Optimization ☆50 Updated 7 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss. ☆115 Updated 3 years ago
- Some baselines for Pommerman competition ☆46 Updated 6 years ago
- lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms. ☆374 Updated 2 years ago
- A high-performance Atari A3C agent in 180 lines of PyTorch ☆171 Updated 3 years ago
- ☆92 Updated 4 years ago
- A collection of baselines for the MineRL environment/datasets & the NeurIPS 2021 MineRL competitions ☆147 Updated 3 years ago
- PyTorch implementation of our paper Real-Time Reinforcement Learning (NeurIPS 2019) ☆73 Updated 4 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X, etc. in TF1.x ☆62 Updated 3 years ago