Implementation of the Mamba SSM with hf_integration.
☆55Aug 31, 2024Updated last year
Alternatives and similar repositories for mamba-hf
Users that are interested in mamba-hf are comparing it to the libraries listed below
Sorting:
- The PanGenome Graph Builder☆16Jul 17, 2024Updated last year
- Template for creating a BioCypher-driven knowledge graph☆13Jan 15, 2026Updated last month
- Implementation of a modular, high-performance, and simplistic mamba for high-speed applications☆40Nov 11, 2024Updated last year
- Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling☆215Jan 30, 2026Updated last month
- Implementation of MambaByte in "MambaByte: Token-free Selective State Space Model" in Pytorch and Zeta☆125Feb 6, 2026Updated 3 weeks ago
- ☆31Dec 29, 2023Updated 2 years ago
- Mamba-Chat: A chat LLM based on the state-space model architecture 🐍☆942Mar 3, 2024Updated last year
- Inference of Mamba and Mamba2 models in pure C☆197Jan 22, 2026Updated last month
- Implementation of a simple linear regression algorithm in MAMBA☆10Feb 12, 2020Updated 6 years ago
- A Next.js chat app to use Llama 2 locally using node-llama-cpp☆12Oct 27, 2024Updated last year
- an implementation of paper"Retentive Network: A Successor to Transformer for Large Language Models" https://arxiv.org/pdf/2307.08621.pdf☆11Jul 25, 2023Updated 2 years ago
- Code repository for Black Mamba☆262Feb 8, 2024Updated 2 years ago
- Opensource, personal & local chat interface for language models.☆13Jun 24, 2024Updated last year
- PanGenome Graph Building with the first 100 assemblies from the 1000G ONT Sequencing Consortium☆12Apr 5, 2025Updated 10 months ago
- Fine-tuning, DPO, RLHF, RLAIF on LLMs - Qwen3, Zephyr 7B GPTQ with 4-Bit Quantization, Mistral-7B-GPTQ☆15Jul 5, 2025Updated 7 months ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆93Jan 25, 2024Updated 2 years ago
- Implementation of mamba with rust☆92Mar 9, 2024Updated last year
- Deep learning library implemented from scratch in numpy. Mixtral, Mamba, LLaMA, GPT, ResNet, and other experiments.☆54Apr 12, 2024Updated last year
- Annotated version of the Mamba paper☆497Feb 27, 2024Updated 2 years ago
- Official Documentation for DSPy Library☆21Updated this week
- Fast approximate inference on a single GPU with sparsity aware offloading☆39Jan 4, 2024Updated 2 years ago
- ☆35Nov 22, 2024Updated last year
- Simple, minimal implementation of the Mamba SSM in one file of PyTorch.☆2,921Mar 8, 2024Updated last year
- Training GPTs to solve interaction nets☆18Aug 14, 2024Updated last year
- pangenome analyses for complete genomes of great apes (and gibbon)☆20Oct 12, 2024Updated last year
- Implementation of MoE Mamba from the paper: "MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts" in Pytorch and Ze…☆120Jan 31, 2026Updated last month
- Minimal AlphaZero in PyTorch, trained on Connect4 on a 6x6 board.☆21Aug 12, 2022Updated 3 years ago
- QLoRA with Enhanced Multi GPU Support☆38Aug 8, 2023Updated 2 years ago
- Implementing the BitNet model in Rust☆45Apr 18, 2024Updated last year
- Rust implementation of VG handle graph☆19Dec 18, 2023Updated 2 years ago
- ☆14Jul 26, 2023Updated 2 years ago
- A composite GitHub Action to login to the HuggingFace Hub☆15Feb 4, 2023Updated 3 years ago
- Multiple sequence alignment of long tandem repeats☆25Jan 9, 2026Updated last month
- Low-Rank adapter extraction for fine-tuned transformers models☆180May 2, 2024Updated last year
- A Keras like abstraction layer on top of the Rust ML framework candle☆23Jun 16, 2024Updated last year
- PG-SCUnK mesure quality of Pan-Genome Graphs using Single Copy and Universal k-mers☆23Feb 16, 2026Updated last week
- Generative AI web UI and server☆22May 23, 2023Updated 2 years ago
- A simple implementation of [Mamba: Linear-Time Sequence Modeling with Selective State Spaces](https://arxiv.org/abs/2312.00752)☆22Jan 22, 2024Updated 2 years ago
- Collection of autoregressive model implementation☆85Updated this week