MarcoMeter / endless-memory-gym
Challenging Memory-based Deep Reinforcement Learning Agents
☆97Updated 5 months ago
Alternatives and similar repositories for endless-memory-gym:
Users that are interested in endless-memory-gym are comparing it to the libraries listed below
- Synthetic Experience Replay☆91Updated 10 months ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆153Updated 3 weeks ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆19Updated last year
- Evaluating long-term memory of reinforcement learning algorithms☆141Updated last year
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆100Updated 2 years ago
- ☆44Updated last year
- Reproduction of Dreamerv1 and v2 in pytorch for deepmind control suite☆38Updated 2 years ago
- The Starcraft Multi-Agent challenge lite☆42Updated 7 months ago
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆52Updated last year
- A tool for aggregating and plotting MARL experiment data.☆77Updated 2 months ago
- Object Centric Atari games☆72Updated this week
- Deep Hierarchical Planning from Pixels☆95Updated 2 years ago
- ☆74Updated 5 months ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆85Updated last week
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers☆51Updated 2 years ago
- ☆76Updated 3 weeks ago
- Benchmarking RL generalization in an interpretable way.☆153Updated last month
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆79Updated 4 months ago
- Skeleton for scalable and flexible Jax RL implementations☆79Updated last year
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆172Updated 9 months ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆111Updated 3 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆110Updated 7 months ago
- Learning diverse options through the Laplacian representation.☆23Updated last year
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆100Updated 10 months ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆86Updated 2 years ago
- ☆18Updated 2 months ago
- Synchronized Curriculum Learning for RL Agents☆45Updated last month
- Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"☆42Updated this week
- Transformer-based World Models☆80Updated 2 years ago
- Implementation of Trajectory Transformer with attention caching and batched beam search☆111Updated last year