mgroling / GymRubiksCube
OpenAi gym environment for the Rubik's Cube (3x3x3).
☆9Updated 2 years ago
Alternatives and similar repositories for GymRubiksCube
Users that are interested in GymRubiksCube are comparing it to the libraries listed below
Sorting:
- ☆18Updated last year
- Gym environment for playing Wordle with RL agents☆39Updated 3 years ago
- Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w…☆32Updated 6 years ago
- ☆56Updated 2 years ago
- ☆18Updated 2 years ago
- ☆28Updated 2 years ago
- A2C is a special case of PPO!☆21Updated 2 years ago
- Using Rainbow implementation in Chainer RL for Slime Volleyball Pixel Environment☆23Updated 4 years ago
- Generalised UDRL☆37Updated 3 years ago
- The source code for the gym-microrts paper.☆42Updated 2 years ago
- A framework for implementing equivariant DL☆10Updated 3 years ago
- Train an agent to play VizDoom with multi sensory inputs. Trained using sample factory☆14Updated 3 years ago
- Let's solve the flatland challenge!☆73Updated last year
- ☆19Updated 6 years ago
- (partial) replication of results from https://arxiv.org/abs/1912.07768☆26Updated 5 years ago
- Automatically generate simple meta-learning tasks from a very large space☆15Updated last year
- Semi-Markov Afterstate Actor-Critic (SMAAC) with Maze☆10Updated 3 years ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆77Updated 4 years ago
- An open source implementation of CLIP.☆32Updated 2 years ago
- Simple implementations of multi-agent evolutionary strategies using pytorch.☆16Updated 3 years ago
- Gym wrapper for pysc2☆10Updated 2 years ago
- Web version of “Neuroevolution of Self-Interpretable Agents” (https://arxiv.org/abs/2003.08165)☆21Updated 3 years ago
- Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems"☆34Updated 2 years ago
- ☆23Updated 3 years ago
- JAX implementation of Learning to learn by gradient descent by gradient descent☆27Updated 7 months ago
- RL agent to play μRTS with Stable-Baselines3 and PyTorch☆26Updated 3 years ago
- ☆38Updated 2 years ago
- A working AlphaZero implementation that's simple enough to be able to understand what's going on at a quick glance, without sacrificing t…☆13Updated 2 years ago
- Generate bird's-eye views of conference proceedings.☆24Updated 5 months ago
- Adversarial examples to the new ConvNeXt architecture☆20Updated 3 years ago