Simple, minimal implementation of the Mamba SSM in one pytorch file. Using logcumsumexp (Heisen sequence).
☆132Oct 18, 2024Updated last year
Alternatives and similar repositories for mamba-tiny
Users that are interested in mamba-tiny are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Notes on the Mamba and the S4 model (Mamba: Linear-Time Sequence Modeling with Selective State Spaces)☆181Jan 7, 2024Updated 2 years ago
- Simple, minimal implementation of the Mamba SSM in one file of PyTorch.☆2,940Mar 8, 2024Updated 2 years ago
- A simple and efficient Mamba implementation in pure PyTorch and MLX.☆1,450Jan 26, 2026Updated 2 months ago
- PyTorch implementation of Structured State Space for Sequence Modeling (S4), based on Annotated S4.☆90Mar 1, 2024Updated 2 years ago
- ☆35Nov 22, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Target speaker automatic speech recognition (TS-ASR)☆13Oct 14, 2023Updated 2 years ago
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆18Mar 15, 2024Updated 2 years ago
- Mamba SSM architecture☆17,902Updated this week
- My personal toolbox for doing datascience (especially deep learning) in python.☆18Mar 21, 2020Updated 6 years ago
- Minimal JAX implementation unifying Diffusion and Flow Matching algorithms as alternative strategies for transporting data distributions.☆63Dec 19, 2025Updated 3 months ago
- Code implementing "Efficient Parallelization of a Ubiquitious Sequential Computation" (Heinsen, 2023)☆98Dec 5, 2024Updated last year
- ☆40Jan 5, 2024Updated 2 years ago
- Generative Modeling via Drifting in MLX☆42Feb 6, 2026Updated 2 months ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆93Jan 25, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Annotated version of the Mamba paper☆500Feb 27, 2024Updated 2 years ago
- Evaluating the Mamba architecture on the Othello game☆49Apr 25, 2024Updated last year
- Deep learning library implemented from scratch in numpy. Mixtral, Mamba, LLaMA, GPT, ResNet, and other experiments.☆54Apr 12, 2024Updated 2 years ago
- ☆22Apr 22, 2024Updated last year
- Official PyTorch Implementation of "The Hidden Attention of Mamba Models"☆232Oct 16, 2025Updated 5 months ago
- Simple voice activity detection (VAD) algorithm in Python☆15Aug 10, 2023Updated 2 years ago
- [NeurIPS 2024] Official implementation of the paper "MambaLRP: Explaining Selective State Space Sequence Models" 🐍☆46Nov 6, 2024Updated last year
- ☆18Oct 26, 2024Updated last year
- Implementation of a modular, high-performance, and simplistic mamba for high-speed applications☆40Nov 11, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Paper implementation☆52Apr 8, 2025Updated last year
- Preprocessed data of SignDiff: Learning Diffusion Models for American Sign Language Production☆18May 1, 2025Updated 11 months ago
- ☆11Oct 11, 2023Updated 2 years ago
- Code for Fooling Contrastive Language-Image Pre-trainined Models with CLIPMasterPrints☆15Jan 25, 2026Updated 2 months ago
- ☆15Jul 24, 2022Updated 3 years ago
- Transformer from scratch with einsum method☆11Jul 8, 2021Updated 4 years ago
- minimal diffusion model for self-study☆27Jul 8, 2023Updated 2 years ago
- Collect papers about Mamba (a selective state space model).☆15Aug 6, 2024Updated last year
- ☆12Dec 22, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Experiments in Joint Embedding Predictive Architectures (JEPAs).☆48Jan 5, 2024Updated 2 years ago
- RWKV-X is a Linear Complexity Hybrid Language Model based on the RWKV architecture, integrating Sparse Attention to improve the model's l…☆56Mar 31, 2026Updated last week
- Official repository for "Boosting Adversarial Transferability using Dynamic Cues " (ICLR 2023)☆20Aug 24, 2023Updated 2 years ago
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling☆956Nov 16, 2025Updated 4 months ago
- Mamba for Multivariate Time Series Forecasting☆87May 2, 2025Updated 11 months ago
- This is a port of Mistral-7B model in JAX☆33Jul 1, 2024Updated last year
- Code for "Is Mamba Effective for Time Series Forecasting?"☆378May 20, 2025Updated 10 months ago