[ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
☆965Nov 16, 2025Updated 7 months ago
Alternatives and similar repositories for Samba
Users that are interested in Samba are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Schedule-Free Optimization in PyTorch☆2,307Jun 18, 2026Updated last week
- Implementation for MatMul-free LM.☆3,071Dec 2, 2025Updated 6 months ago
- PyTorch implementation of models from the Zamba2 series.☆194Jan 23, 2025Updated last year
- HGRN2: Gated Linear RNNs with State Expansion☆57Aug 20, 2024Updated last year
- 🚀 Efficient implementations for emerging model architectures☆5,249Updated this week
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"☆255Jun 6, 2025Updated last year
- Mamba SSM architecture☆18,481Jun 15, 2026Updated 2 weeks ago
- Annotated version of the Mamba paper☆501Feb 27, 2024Updated 2 years ago
- Implementation of Diffusion Transformer (DiT) in JAX☆317Jun 11, 2024Updated 2 years ago
- Minimalistic large language model 3D-parallelism training☆2,729May 26, 2026Updated last month
- Accelerated First Order Parallel Associative Scan☆198Jan 7, 2026Updated 5 months ago
- Efficient Triton Kernels for LLM Training☆6,456Updated this week
- A PyTorch native platform for training generative AI models☆5,466Updated this week
- Tile primitives for speedy kernels