Structured state space sequence models
☆2,854Jul 17, 2024Updated last year
Alternatives and similar repositories for s4
Users that are interested in s4 are comparing it to the libraries listed below
Sorting:
- Mamba SSM architecture☆17,257Feb 18, 2026Updated 2 weeks ago
- Implementation of https://srush.github.io/annotated-s4☆512Jun 20, 2025Updated 8 months ago
- ☆316Jan 8, 2025Updated last year
- ☆198May 27, 2024Updated last year
- Sequence modeling with Mega.☆303Jan 28, 2023Updated 3 years ago
- Convolutions for Sequence Modeling☆913Jun 13, 2024Updated last year
- Sequence Modeling with Structured State Spaces☆67Aug 2, 2022Updated 3 years ago
- Simple, minimal implementation of the Mamba SSM in one file of PyTorch.☆2,920Mar 8, 2024Updated last year
- ☆164Jan 24, 2023Updated 3 years ago
- Long Range Arena for Benchmarking Efficient Transformers☆781Dec 16, 2023Updated 2 years ago
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)☆9,415Feb 20, 2026Updated last week
- Fast and memory-efficient exact attention☆22,460Updated this week
- PyTorch implementation of Structured State Space for Sequence Modeling (S4), based on Annotated S4.☆89Mar 1, 2024Updated 2 years ago
- Pytorch library for fast transformer implementations☆1,763Mar 23, 2023Updated 2 years ago
- RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)…☆14,393Feb 21, 2026Updated last week
- Repository for the paper: 'Diffusion-based Time Series Imputation and Forecasting with Structured State Space Models'☆333May 24, 2025Updated 9 months ago
- Language Modeling with the H3 State Space Model☆522Sep 29, 2023Updated 2 years ago
- Differentiable ODE solvers with full GPU support and O(1)-memory backpropagation.☆6,350Apr 4, 2025Updated 11 months ago
- Collection of papers on state-space models☆615Nov 4, 2025Updated 4 months ago
- 🚀 Efficient implementations of state-of-the-art linear attention models☆4,474Updated this week
- Liquid Structural State-Space Models☆384Feb 1, 2024Updated 2 years ago
- maximal update parametrization (µP)☆1,686Jul 17, 2024Updated last year
- Annotated version of the Mamba paper☆497Feb 27, 2024Updated 2 years ago
- Foundation Architecture for (M)LLMs☆3,135Apr 11, 2024Updated last year
- Flax is a neural network library for JAX that is designed for flexibility.☆7,094Updated this week
- A Python package for probabilistic state space modeling with JAX☆932Jan 6, 2026Updated last month
- A simple and efficient Mamba implementation in pure PyTorch and MLX.☆1,433Jan 26, 2026Updated last month
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆22,030Jan 23, 2026Updated last month
- Implementation of DiffWave and SaShiMi audio generation models☆128Apr 4, 2023Updated 2 years ago
- Reading list for research topics in state-space models☆348Jun 11, 2025Updated 8 months ago
- Hackable and optimized Transformers building blocks, supporting a composable construction.☆10,353Feb 20, 2026Updated last week
- functorch is JAX-like composable function transforms for PyTorch.☆1,436Aug 21, 2025Updated 6 months ago
- 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i…☆9,513Feb 26, 2026Updated last week
- Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.☆30,884Updated this week
- A PyTorch library entirely dedicated to neural differential equations, implicit models and related numerical methods