Zyphra / BlackMambaView external linksLinks
Code repository for Black Mamba
☆261Feb 8, 2024Updated 2 years ago
Alternatives and similar repositories for BlackMamba
Users that are interested in BlackMamba are comparing it to the libraries listed below
Sorting:
- GoldFinch and other hybrid transformer components☆45Jul 20, 2024Updated last year
- Implementation of MoE Mamba from the paper: "MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts" in Pytorch and Ze…☆120Jan 31, 2026Updated 2 weeks ago
- PyTorch implementation of models from the Zamba2 series.☆186Jan 23, 2025Updated last year
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆93Jan 25, 2024Updated 2 years ago
- Implementation of the Mamba SSM with hf_integration.☆55Aug 31, 2024Updated last year
- Some preliminary explorations of Mamba's context scaling.☆218Feb 8, 2024Updated 2 years ago
- A repository for DenseSSMs☆89Apr 11, 2024Updated last year
- Beyond Language Models: Byte Models are Digital World Simulators☆334Jun 6, 2024Updated last year
- Annotated version of the Mamba paper☆496Feb 27, 2024Updated last year
- This is the code that went into our practical dive using mamba as information extraction☆57Dec 22, 2023Updated 2 years ago
- Mamba-Chat: A chat LLM based on the state-space model architecture 🐍☆940Mar 3, 2024Updated last year
- ☆82Apr 16, 2024Updated last year
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters☆132Dec 3, 2024Updated last year
- Linear Attention Sequence Parallelism (LASP)☆88Jun 4, 2024Updated last year
- Token Omission Via Attention☆128Oct 13, 2024Updated last year
- Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"☆248Jun 6, 2025Updated 8 months ago
- PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"☆206Jan 17, 2026Updated 3 weeks ago
- Mamba SSM architecture☆17,186Jan 12, 2026Updated last month
- ☆23Jan 27, 2025Updated last year
- ☆15Apr 26, 2025Updated 9 months ago
- See https://github.com/cuda-mode/triton-index/ instead!☆11May 8, 2024Updated last year
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…☆15Nov 11, 2024Updated last year
- Cascade Speculative Drafting☆32Apr 2, 2024Updated last year
- A simple and efficient Mamba implementation in pure PyTorch and MLX.☆1,429Jan 26, 2026Updated 2 weeks ago
- A novel implementation of fusing ViT with Mamba into a fast, agile, and high performance Multi-Modal Model. Powered by Zeta, the simplest…☆461Jan 19, 2026Updated 3 weeks ago
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling☆944Nov 16, 2025Updated 2 months ago
- Reference implementation of Megalodon 7B model☆528May 17, 2025Updated 8 months ago
- ☆51Jan 28, 2024Updated 2 years ago
- Simple, minimal implementation of the Mamba SSM in one file of PyTorch.☆2,918Mar 8, 2024Updated last year
- ☆13Dec 15, 2025Updated last month
- Auto-Video maker handling many AI's☆11Mar 18, 2024Updated last year
- Decoding of the speech envelope from EEG using the VLAAI deep neural network☆15Sep 28, 2022Updated 3 years ago
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"☆86Mar 21, 2024Updated last year
- ☆208Jan 14, 2026Updated last month
- seqax = sequence modeling + JAX☆170Jul 23, 2025Updated 6 months ago
- Convolutions for Sequence Modeling☆911Jun 13, 2024Updated last year
- Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates☆473Apr 21, 2024Updated last year
- Evaluating the Mamba architecture on the Othello game☆49Apr 25, 2024Updated last year
- Graph-Mamba: Towards Long-Range Graph Sequence Modelling with Selective State Spaces☆335Feb 2, 2024Updated 2 years ago