mgroling / GymRubiksCubeLinks
OpenAi gym environment for the Rubik's Cube (3x3x3).
☆10Updated 2 years ago
Alternatives and similar repositories for GymRubiksCube
Users that are interested in GymRubiksCube are comparing it to the libraries listed below
Sorting:
- ☆28Updated 2 years ago
- Web version of “Neuroevolution of Self-Interpretable Agents” (https://arxiv.org/abs/2003.08165)☆21Updated 3 years ago
- ☆18Updated last year
- Gym environment for playing Wordle with RL agents☆39Updated 3 years ago
- Additional code for Stable-baselines3 to load and upload models from the Hub.☆87Updated 11 months ago
- Simple implementations of multi-agent evolutionary strategies using pytorch.☆16Updated 3 years ago
- ☆19Updated 6 years ago
- A2C is a special case of PPO!☆22Updated 3 years ago
- Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Worksho…☆21Updated 2 years ago
- Implementation of CASCADE in Learning General World Models in a Handful of Reward-Free Deployments (NeurIPS 22).☆29Updated 2 years ago
- Semi-Markov Afterstate Actor-Critic (SMAAC) with Maze☆10Updated 3 years ago
- Implementation of Hierarchical Transformer Memory (HTM) for Pytorch☆75Updated 3 years ago
- Flax (JAX) implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variation☆12Updated 4 years ago
- Adversarial examples to the new ConvNeXt architecture☆20Updated 3 years ago
- Implements sharpness-aware minimization (https://arxiv.org/abs/2010.01412) in TensorFlow 2.☆60Updated 3 years ago
- A collection of Gymnasium compatible games for reinforcement learning.☆79Updated last week
- Pytorch implementation of the Deep Deterministic Policy Gradients for Continuous Control☆26Updated 2 years ago
- Implementation of Vision Transformers in Flax☆18Updated 4 years ago
- Generalised UDRL☆37Updated 3 years ago
- RL agent to play μRTS with Stable-Baselines3 and PyTorch☆26Updated 3 years ago
- ☆18Updated 2 years ago
- Toy environment set for multi-agent reinforcement learning and more☆38Updated 7 months ago
- The source code for the gym-microrts paper.☆42Updated 2 years ago
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆51Updated 2 years ago
- Understanding RL vision Distill article☆23Updated 2 years ago
- A web based platform for collecting human actions in reinforcement learning environments☆30Updated last year
- Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems"☆34Updated 2 years ago
- ☆56Updated 2 years ago
- ☆14Updated 3 years ago
- Let's solve the flatland challenge!☆73Updated last year