mgroling / GymRubiksCube
OpenAi gym environment for the Rubik's Cube (3x3x3).
☆9Updated 2 years ago
Alternatives and similar repositories for GymRubiksCube:
Users that are interested in GymRubiksCube are comparing it to the libraries listed below
- ☆38Updated 2 years ago
- The source code for the gym-microrts paper.☆42Updated 2 years ago
- ☆28Updated 2 years ago
- Let's solve the flatland challenge!☆73Updated last year
- A2C is a special case of PPO!☆20Updated 2 years ago
- Gym environment for playing Wordle with RL agents☆39Updated 3 years ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆77Updated 4 years ago
- (partial) replication of results from https://arxiv.org/abs/1912.07768☆26Updated 5 years ago
- flexible meta-learning in jax☆13Updated last year
- Simple implementations of multi-agent evolutionary strategies using pytorch.☆16Updated 3 years ago
- Reinforcement learning modular with pytorch☆11Updated 4 years ago
- Little article showing how to load pytorch's models with linear memory consumption☆34Updated 2 years ago
- General implementation of Advantage Actor Critic using Pytorch☆27Updated 3 years ago
- Gym wrapper for pysc2☆10Updated 2 years ago
- Web version of “Neuroevolution of Self-Interpretable Agents” (https://arxiv.org/abs/2003.08165)☆21Updated 3 years ago
- Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses".☆82Updated last year
- Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w…☆32Updated 6 years ago
- Train an agent to play VizDoom with multi sensory inputs. Trained using sample factory☆14Updated 3 years ago
- Customizable RecSys Simulator for OpenAI Gym☆27Updated 3 years ago
- ☆17Updated last year
- Landing a Spaceship using Upside-Down Reinforcement Learning (a.k.a ⅂ꓤ)☆11Updated last year
- Our solution of the Kaggle Abstraction and Reasoning Challenge☆22Updated 4 years ago
- Repo to reproduce the First-Explore paper results☆37Updated 4 months ago
- Flax (JAX) implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variation☆12Updated 3 years ago
- Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems"☆34Updated 2 years ago
- ☆19Updated 6 years ago
- Pytorch implementation of the Deep Deterministic Policy Gradients for Continuous Control☆26Updated 2 years ago
- An implementation of PPO in Pytorch☆72Updated 2 months ago
- JAX implementation of Learning to learn by gradient descent by gradient descent☆27Updated 6 months ago
- Understanding RL vision Distill article☆23Updated 2 years ago