proroklab / ffm
Reinforcement Learning with Fast and Forgetful Memory
☆23 · Updated last year
Related projects
Alternatives and complementary repositories for ffm
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)] ☆89 · Updated 11 months ago
- ☆63 · Updated 3 months ago
- Contains JAX implementation of algorithms for inverse reinforcement learning ☆63 · Updated 3 months ago
- Goal-Conditioned Reinforcement Learning with JAX ☆94 · Updated this week
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon" ☆42 · Updated 4 months ago
- ☆149 · Updated this week
- ☆65 · Updated 2 weeks ago
- Evaluating long-term memory of reinforcement learning algorithms ☆133 · Updated last year
- Highly scalable 2D JAX physics engine. ☆35 · Updated last week
- Learning diverse options through the Laplacian representation. ☆22 · Updated 10 months ago
- On the model-based stochastic value gradient for continuous reinforcement learning ☆55 · Updated last year
- Baselines for gymnax 🤖 ☆60 · Updated last year
- ☆34 · Updated last year
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations" ☆52 · Updated 7 months ago
- Simple single-file baselines for Q-Learning in pure-GPU setting ☆103 · Updated 3 months ago
- Accelerated minigrid environments with JAX ☆119 · Updated 3 months ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity" ☆57 · Updated 5 months ago
- Clean single-file implementation of offline RL algorithms in JAX ☆95 · Updated 3 months ago
- Foundation Policies with Hilbert Representations (ICML 2024) ☆72 · Updated 7 months ago
- ☆42 · Updated last year
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024) ☆59 · Updated last year
- Conservative Q learning in Jax ☆51 · Updated last year
- General Modules for JAX ☆59 · Updated 3 months ago
- a simple and scalable agent for training adaptive policies with sequence-based RL ☆92 · Updated this week
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin… ☆40 · Updated last week
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e… ☆70 · Updated 11 months ago
- Skeleton for scalable and flexible Jax RL implementations ☆63 · Updated last year
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective ☆79 · Updated last year
- Discovering and Achieving Goals via World Models, NeurIPS 2021 ☆83 · Updated 9 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆105 · Updated 3 months ago