jurgisp / memory-mazeLinks
Evaluating long-term memory of reinforcement learning algorithms
☆149Updated 2 years ago
Alternatives and similar repositories for memory-maze
Users that are interested in memory-maze are comparing it to the libraries listed below
Sorting:
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆81Updated 2 years ago
- ☆48Updated 2 years ago
- Deep Hierarchical Planning from Pixels☆109Updated 2 years ago
- Object Centric Atari games☆92Updated this week
- ExORL: Exploratory Data for Offline Reinforcement Learning☆116Updated 3 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆102Updated 3 years ago
- Benchmarking RL generalization in an interpretable way.☆166Updated this week
- Challenging Memory-based Deep Reinforcement Learning Agents☆104Updated 11 months ago
- Discovering and Achieving Goals via World Models, NeurIPS 2021☆85Updated last year
- Pytorch version of Dreamer, which follows the original TF v2 codes.☆131Updated 3 years ago
- Simple maze environments using mujoco-py☆56Updated last year
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆50Updated last year
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.☆135Updated last year
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"☆161Updated 3 years ago
- Conservative Q learning in Jax☆55Updated 2 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite☆222Updated last year
- Simple single-file baselines for Q-Learning in pure-GPU setting☆187Updated 7 months ago
- Datasets for data-driven deep reinforcement learning with Atari (wrapper for datasets released by Google)☆125Updated last year
- ☆48Updated 2 years ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆21Updated last year
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery☆81Updated 3 years ago
- Online Goal-Conditioned Reinforcement Learning in JAX. ICLR 2025 Spotlight.☆190Updated 2 weeks ago
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆23Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆116Updated last year
- pytorch-implementation of Dreamer (Model-based Image RL Algorithm)☆166Updated 9 months ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆58Updated 3 years ago
- ☆18Updated 5 months ago
- PyTorch implementation of DreamerV2 model-based RL algorithm☆229Updated 2 years ago
- Partially Observable Process Gym☆202Updated 4 months ago
- DMControl Generalization Benchmark☆177Updated last year