jurgisp / memory-mazeLinks
Evaluating long-term memory of reinforcement learning algorithms
☆160Updated 2 years ago
Alternatives and similar repositories for memory-maze
Users that are interested in memory-maze are comparing it to the libraries listed below
Sorting:
- ☆52Updated 2 years ago
- Deep Hierarchical Planning from Pixels☆112Updated 3 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆82Updated 2 years ago
- Object Centric Atari games☆96Updated last month
- Challenging Memory-based Deep Reinforcement Learning Agents☆108Updated last year
- Benchmarking RL generalization in an interpretable way.☆174Updated last month
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆103Updated 3 years ago
- PyTorch implementation of DreamerV2 model-based RL algorithm☆236Updated 2 years ago
- Pytorch version of Dreamer, which follows the original TF v2 codes.☆139Updated 3 years ago
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"☆162Updated 4 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆123Updated 3 years ago
- Discovering and Achieving Goals via World Models, NeurIPS 2021☆86Updated last year
- Simple maze environments using mujoco-py☆57Updated 2 years ago
- DMControl Generalization Benchmark☆187Updated 2 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite☆226Updated last year
- A Simplified Pytorch Version of the Dreamer Algorithm☆146Updated 2 years ago
- pytorch-implementation of Dreamer (Model-based Image RL Algorithm)☆167Updated 11 months ago
- Partially Observable Process Gym☆211Updated 6 months ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆22Updated 2 years ago
- Datasets for data-driven deep reinforcement learning with Atari (wrapper for datasets released by Google)☆127Updated last year
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆50Updated last year
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery☆83Updated 3 years ago
- ☆52Updated 2 years ago
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.☆138Updated last year
- ☆36Updated 3 years ago
- Representation Learning for RL☆129Updated 2 years ago
- Transformer-based World Models☆87Updated 2 years ago
- Conservative Q learning in Jax☆57Updated 2 years ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆231Updated last month
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation☆78Updated 3 years ago