abaisero / gym-gridverse
Gridworld domains in the gym interface
☆26Updated last month
Related projects ⓘ
Alternatives and complementary repositories for gym-gridverse
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆18Updated 10 months ago
- Simple maze environments using mujoco-py☆52Updated 10 months ago
- A tool for aggregating and plotting MARL experiment data.☆61Updated 2 weeks ago
- Benchmarking RL generalization in an interpretable way.☆132Updated 9 months ago
- Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.☆31Updated 4 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆99Updated 2 years ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆95Updated 5 months ago
- Evaluating long-term memory of reinforcement learning algorithms☆133Updated last year
- Skeleton for scalable and flexible Jax RL implementations☆63Updated last year
- Asymmetric methods for partially observable reinforcement learning☆9Updated 6 months ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆57Updated 5 months ago
- ☆54Updated 8 months ago
- JAX implementation of RL algorithms and vectorized environments☆34Updated 10 months ago
- Deep Hierarchical Planning from Pixels☆90Updated last year
- ExORL: Exploratory Data for Offline Reinforcement Learning☆105Updated 2 years ago
- Conservative Q learning in Jax☆51Updated last year
- Fast and flexible multi-agent gridworld reinforcement learning environments.☆32Updated 3 weeks ago
- Goal-Conditioned Reinforcement Learning with JAX☆94Updated this week
- ☆38Updated last year
- Implementation of Truncated Quantile Critics method for continuous reinforcement learning. https://bayesgroup.github.io/tqc/☆90Updated 3 years ago
- Challenging Memory-based Deep Reinforcement Learning Agents☆86Updated 3 weeks ago
- ☆17Updated 4 months ago
- Partially Observable Process Gym☆167Updated 4 months ago
- The Starcraft Multi-Agent challenge lite☆38Updated 2 months ago
- A Simplified Pytorch Version of the Dreamer Algorithm☆111Updated last year
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆79Updated last year
- A collection of RL algorithms written in JAX.☆95Updated 2 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated last year
- Pytorch version of Dreamer, which follows the original TF v2 codes.☆113Updated 2 years ago
- Author's PyTorch implementation of TD7 for online and offline RL☆116Updated last year