jurgisp / memory-maze
Evaluating long-term memory of reinforcement learning algorithms
☆146 Updated 2 years ago
Alternatives and similar repositories for memory-maze
Users who are interested in memory-maze compare it to the libraries listed below
- ☆48 Updated 2 years ago
- Deep Hierarchical Planning from Pixels ☆107 Updated 2 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective ☆80 Updated 2 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆100 Updated 3 years ago
- Object Centric Atari games ☆88 Updated last month
- ExORL: Exploratory Data for Offline Reinforcement Learning ☆115 Updated 3 years ago
- Benchmarking RL generalization in an interpretable way. ☆161 Updated 2 months ago
- Pytorch version of Dreamer, which follows the original TF v2 codes. ☆131 Updated 3 years ago
- Simple maze environments using mujoco-py ☆57 Updated last year
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations" ☆162 Updated 3 years ago
- Discovering and Achieving Goals via World Models, NeurIPS 2021 ☆85 Updated last year
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery ☆81 Updated 3 years ago
- pytorch-implementation of Dreamer (Model-based Image RL Algorithm) ☆166 Updated 7 months ago
- Challenging Memory-based Deep Reinforcement Learning Agents ☆103 Updated 9 months ago
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon" ☆48 Updated last year
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design. ☆133 Updated last year
- OpenAI Gym wrapper for the DeepMind Control Suite ☆219 Updated last year
- PyTorch implementation of DreamerV2 model-based RL algorithm ☆226 Updated 2 years ago
- Datasets for data-driven deep reinforcement learning with Atari (wrapper for datasets released by Google) ☆120 Updated 11 months ago
- Conservative Q learning in Jax ☆54 Updated 2 years ago
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View. ☆23 Updated last year
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023 ☆19 Updated last year
- DMControl Generalization Benchmark ☆175 Updated last year
- ☆43 Updated 4 years ago
- Skeleton for scalable and flexible Jax RL implementations ☆84 Updated 2 years ago
- Simple single-file baselines for Q-Learning in pure-GPU setting ☆181 Updated 5 months ago
- The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning… ☆66 Updated 2 years ago
- Transformer-based World Models ☆85 Updated 2 years ago
- ☆36 Updated 2 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ☆58 Updated 3 years ago