Structured state space sequence models
☆2,899Jul 17, 2024Updated last year
Alternatives and similar repositories for s4
Users that are interested in s4 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of https://srush.github.io/annotated-s4☆517Jun 20, 2025Updated 11 months ago
- ☆320Jan 8, 2025Updated last year
- Mamba SSM architecture☆18,326May 10, 2026Updated 2 weeks ago
- ☆201May 27, 2024Updated 2 years ago
- Sequence modeling with Mega.☆303Jan 28, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Sequence Modeling with Structured State Spaces☆67Aug 2, 2022Updated 3 years ago
- PyTorch implementation of Structured State Space for Sequence Modeling (S4), based on Annotated S4.☆91Mar 1, 2024Updated 2 years ago
- Simple, minimal implementation of the Mamba SSM in one file of PyTorch.☆2,948Mar 8, 2024Updated 2 years ago
- Convolutions for Sequence Modeling☆912Jun 13, 2024Updated last year
- ☆165Jan 24, 2023Updated 3 years ago
- Long Range Arena for Benchmarking Efficient Transformers☆788Dec 16, 2023Updated 2 years ago
- Repository for the paper: 'Diffusion-based Time Series Imputation and Forecasting with Structured State Space Models'☆334May 24, 2025Updated last year
- Fast and memory-efficient exact attention☆23,917Updated this week
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)☆9,494Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Collection of papers on state-space models☆621Nov 4, 2025Updated 6 months ago
- RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)…☆14,536Updated this week
- Pytorch library for fast transformer implementations☆1,771Mar 23, 2023Updated 3 years ago
- Language Modeling with the H3 State Space Model☆522Sep 29, 2023Updated 2 years ago
- Liquid Structural State-Space Models☆395Feb 1, 2024Updated 2 years ago
- 🚀 Efficient implementations for emerging model architectures☆5,139Updated this week
- A concise but complete full-attention transformer with a set of promising experimental features from various papers☆5,864May 19, 2026Updated last week
- Implementation of DiffWave and SaShiMi audio generation models☆128Apr 4, 2023Updated 3 years ago
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch☆101Feb 25, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Annotated version of the Mamba paper☆501Feb 27, 2024Updated 2 years ago
- Reading list for research topics in state-space models☆363May 18, 2026Updated last week
- Differentiable ODE solvers with full GPU support and O(1)-memory backpropagation.☆6,432Apr 4, 2025Updated last year
- Pytorch implementation of Simplified Structured State-Spaces for Sequence Modeling (S5)☆87Apr 26, 2024Updated 2 years ago
- A simple and efficient Mamba implementation in pure PyTorch and MLX.☆1,462May 3, 2026Updated 3 weeks ago
- A Python package for probabilistic state space modeling with JAX☆965Jan 6, 2026Updated 4 months ago
- Foundation Architecture for (M)LLMs☆3,131Apr 11, 2024Updated 2 years ago
- Code for SpaceTime 🌌⏱️. Proposed in Effectively Modeling Time Series with Simple Discrete State Spaces, ICLR 2023.☆181Mar 17, 2023Updated 3 years ago
- Flax is a neural network library for JAX that is designed for flexibility.☆7,204May 22, 2026Updated last week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- maximal update parametrization (µP)☆1,719Jul 17, 2024Updated last year
- Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena☆207Aug 26, 2023Updated 2 years ago
- Vector (and Scalar) Quantization, in Pytorch☆3,948May 16, 2026Updated last week
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆22,133Jan 23, 2026Updated 4 months ago
- Hackable and optimized Transformers building blocks, supporting a composable construction.☆10,475May 21, 2026Updated last week
- functorch is JAX-like composable function transforms for PyTorch.☆1,436Aug 21, 2025Updated 9 months ago
- [ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model☆3,869Feb 13, 2025Updated last year