Implementation of https://srush.github.io/annotated-s4
☆515Jun 20, 2025Updated 9 months ago
Alternatives and similar repositories for annotated-s4
Users that are interested in annotated-s4 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Structured state space sequence models☆2,875Jul 17, 2024Updated last year
- ☆317Jan 8, 2025Updated last year
- Annotated version of the Mamba paper☆500Feb 27, 2024Updated 2 years ago
- Paper: Lexicon Learning for Few-Shot Neural Sequence Modeling☆16Jan 8, 2022Updated 4 years ago
- ☆35Nov 22, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Accelerated First Order Parallel Associative Scan☆197Jan 7, 2026Updated 3 months ago
- Official Repository of Pretraining Without Attention (BiGS), BiGS is the first model to achieve BERT-level transfer learning on the GLUE …☆118Mar 16, 2024Updated 2 years ago
- PyTorch implementation of Structured State Space for Sequence Modeling (S4), based on Annotated S4.☆90Mar 1, 2024Updated 2 years ago
- Sequence Modeling with Structured State Spaces☆67Aug 2, 2022Updated 3 years ago
- Following research on S4 in jax☆16Jun 15, 2022Updated 3 years ago
- Recursive Bayesian Networks☆11May 11, 2025Updated 11 months ago
- Reading list for research topics in state-space models☆356Jun 11, 2025Updated 10 months ago
- What would you do with 1000 H100s...☆1,168Jan 10, 2024Updated 2 years ago
- ☆165Jan 24, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Sequence modeling with Mega.☆303Jan 28, 2023Updated 3 years ago
- ☆29Nov 30, 2021Updated 4 years ago
- Simple, minimal implementation of the Mamba SSM in one file of PyTorch.☆2,940Mar 8, 2024Updated 2 years ago
- maximal update parametrization (µP)☆1,695Jul 17, 2024Updated last year
- Convolutions for Sequence Modeling☆911Jun 13, 2024Updated last year
- FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores☆350Dec 28, 2024Updated last year
- Language Modeling with the H3 State Space Model☆522Sep 29, 2023Updated 2 years ago
- ☆51Jan 28, 2024Updated 2 years ago
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆11Mar 18, 2023Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Mamba SSM architecture☆17,902Updated this week
- ☆167Jul 5, 2023Updated 2 years ago
- Train very large language models in Jax.☆209Oct 21, 2023Updated 2 years ago
- ☆39Apr 5, 2024Updated 2 years ago
- Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"☆251Jun 6, 2025Updated 10 months ago
- Non official implementation of the Linear Recurrent Unit (LRU, Orvieto et al. 2023)☆62Sep 3, 2025Updated 7 months ago
- Long Range Arena for Benchmarking Efficient Transformers☆788Dec 16, 2023Updated 2 years ago
- Blog post☆17Feb 16, 2024Updated 2 years ago
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch☆102Feb 25, 2023Updated 3 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆93Jan 25, 2024Updated 2 years ago
- Fast, general, and tested differentiable structured prediction in PyTorch☆1,125Apr 20, 2022Updated 3 years ago
- Silly twitter torch implementations.☆46Oct 14, 2022Updated 3 years ago
- Elegant easy-to-use neural networks + scientific computing in JAX. https://docs.kidger.site/equinox/☆2,845Apr 5, 2026Updated last week
- Parallel Associative Scan for Language Models☆18Jan 8, 2024Updated 2 years ago
- A simple and efficient Mamba implementation in pure PyTorch and MLX.☆1,450Jan 26, 2026Updated 2 months ago
- ☆10Jun 27, 2024Updated last year