jasonrute / puzzle_cube
Solving the Rubik's cube with deep reinforcement learning and Monte Carlo tree search
☆95Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for puzzle_cube
- Reinforcement Learning implementations and research prototyping in TensorFlow☆80Updated 5 years ago
- Clone of OpenAI's Spinning Up in PyTorch☆146Updated 2 years ago
- Highly Modular and Scalable Reinforcement Learning☆114Updated 4 years ago
- Atari - Deep Reinforcement Learning algorithms in TensorFlow☆135Updated 7 months ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning☆94Updated 4 years ago
- RLtime is a reinforcement learning library focused on state-of-the-art q-learning algorithms and features☆139Updated 5 years ago
- This package allows to use PLE as a gym environment.☆72Updated 4 years ago
- ☆65Updated 3 years ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package☆266Updated 5 years ago
- High-quality implementations of deep reinforcement learning algorithms for experiments☆51Updated 2 months ago
- Reinforcement Learning Assembly☆92Updated 3 years ago
- Code for the paper "Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition"☆37Updated 5 years ago
- Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.☆151Updated last year
- Simple grid-world environment compatible with OpenAI-gym☆49Updated 4 years ago
- ☆91Updated 3 years ago
- Random Network Distillation(RND) algo in Pytorch☆48Updated 5 years ago
- Distributed implementation of popular evolutionary methods☆64Updated 6 years ago
- ☆106Updated 4 years ago
- This repo replicates the results Horgan et al obtained in "Distributed Prioritized Experience Replay"☆190Updated 5 years ago
- Open AI gym environment for the game 2048☆71Updated 2 years ago
- A PyTorch implementation of Rainbow DQN agent☆165Updated 6 years ago
- Some baselines for Pommerman competition☆46Updated 6 years ago
- My implementation of the Proximal Policy Optisation algorithm using Keras as a backend☆88Updated 4 years ago
- Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow☆101Updated 4 years ago
- PyTorch implementation of our paper Real-Time Reinforcement Learning (NeurIPS 2019)☆73Updated 4 years ago
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow☆182Updated 5 years ago
- Machine Learning Course Project Skoltech 2018☆108Updated 5 years ago
- PyTorch RL for Pommerman☆38Updated 6 years ago
- Basic versions of agents from Spinning Up in Deep RL written in PyTorch☆197Updated 3 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆90Updated 6 years ago