LegallyCoder/mamba-hf

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/LegallyCoder/mamba-hf)

LegallyCoder / mamba-hf

Implementation of the Mamba SSM with hf_integration.

☆55

Alternatives and similar repositories for mamba-hf

Users that are interested in mamba-hf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kyegomez / MambaByte
View on GitHub
Implementation of MambaByte in "MambaByte: Token-free Selective State Space Model" in Pytorch and Zeta
☆128Updated this week
kyegomez / MambaTransformer
View on GitHub
Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling
☆225Jul 13, 2026Updated last week
kyegomez / SimpleMamba
View on GitHub
Implementation of a modular, high-performance, and simplistic mamba for high-speed applications
☆41Nov 11, 2024Updated last year
kroggen / mamba.c
View on GitHub
Inference of Mamba, Mamba2 and Mamba3 models in pure C
☆202Mar 18, 2026Updated 4 months ago
midrender / mamba-chat
View on GitHub
Mamba-Chat: A chat LLM based on the state-space model architecture 🐍
☆944Mar 3, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
RL10x / RetNet
View on GitHub
an implementation of paper"Retentive Network: A Successor to Transformer for Large Language Models" https://arxiv.org/pdf/2307.08621.pdf
☆11Jul 25, 2023Updated 2 years ago
Maluuba / FigureQA
View on GitHub
☆27Nov 5, 2019Updated 6 years ago
bayjarvis / llm
View on GitHub
Fine-tuning, DPO, RLHF, RLAIF on LLMs - Qwen3, Zephyr 7B GPTQ with 4-Bit Quantization, Mistral-7B-GPTQ
☆15Jul 5, 2025Updated last year
Zyphra / BlackMamba
View on GitHub
Code repository for Black Mamba
☆265Feb 8, 2024Updated 2 years ago
floapfel / MAMBA-implementations
View on GitHub
Implementation of a simple linear regression algorithm in MAMBA
☆10Feb 12, 2020Updated 6 years ago
ramakanth-pasunuru / video-dialogue
View on GitHub
Dataset and models for paper "Game-Based Video-Context Dialogue (EMNLP 2018)"
☆19Oct 25, 2018Updated 7 years ago
Ammar-Alnagar / Ammar-Alnagar
View on GitHub
☆13Apr 10, 2026Updated 3 months ago
biocypher / project-template
View on GitHub
Template for creating a BioCypher-driven knowledge graph
☆13Jan 15, 2026Updated 6 months ago
flawedmatrix / mamba-ssm
View on GitHub
Implementation of mamba with rust
☆96Mar 9, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
TalnUPF / ConceptExtraction
View on GitHub
☆11Aug 15, 2023Updated 2 years ago
The-Swarm-Corporation / Mamba-R1
View on GitHub
Mamba R1 represents a novel architecture that combines the efficiency of Mamba's state space models with the scalability of Mixture of Ex…
☆25Oct 13, 2025Updated 9 months ago
johnma2006 / mamba-minimal
View on GitHub
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
☆2,963Mar 8, 2024Updated 2 years ago
radarFudan / mamba-minimal-jax
View on GitHub
☆36Nov 22, 2024Updated last year
kyegomez / MambaFormer
View on GitHub
Implementation of MambaFormer in Pytorch ++ Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learnin…
☆21Updated this week
johnma2006 / candle
View on GitHub
Deep learning library implemented from scratch in numpy. Mixtral, Mamba, LLaMA, GPT, ResNet, and other experiments.
☆56Apr 12, 2024Updated 2 years ago
dubssieg / pancat
View on GitHub
Pangenome graphs visualisation, distance computing, reconstruction of sequences and other utility functions
☆38Apr 28, 2026Updated 2 months ago
RobertCsordas / moe_attention
View on GitHub
Official repository for the paper "SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention"
☆101Sep 30, 2024Updated last year
buttercutter / Mamba_SSM
View on GitHub
A simple implementation of [Mamba: Linear-Time Sequence Modeling with Selective State Spaces](https://arxiv.org/abs/2312.00752)
☆23Jan 22, 2024Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
prateekstark / retnet
View on GitHub
☆14Jul 26, 2023Updated 2 years ago
LAION-AI / math_problems-step-by-step_solutions
View on GitHub
Here we provide and collect many functions to generate math problem and step by step solutions for LLM training
☆19Jun 21, 2023Updated 3 years ago
vv9k / AIrtifex
View on GitHub
Generative AI web UI and server
☆22May 23, 2023Updated 3 years ago
syncdoth / RetNet
View on GitHub
Huggingface compatible implementation of RetNet (Retentive Networks, https://arxiv.org/pdf/2307.08621.pdf) including parallel, recurrent,…
☆227Mar 12, 2024Updated 2 years ago
not-lain / pxia
View on GitHub
minimalistic AI library that resembles HF's transformers
☆13Dec 31, 2024Updated last year
Oxen-AI / mamba-dive
View on GitHub
This is the code that went into our practical dive using mamba as information extraction
☆57Dec 22, 2023Updated 2 years ago
GAIR-NLP / Entropy-ABF
View on GitHub
Official implementation for 'Extending LLMs’ Context Window with 100 Samples'
☆82Jan 18, 2024Updated 2 years ago
dai-dao / Grounded-Language-Learning-in-Pytorch
View on GitHub
Implementation of Grounded Language Learning in a 3D Simulated World (DeepMind)
☆34Jul 22, 2017Updated 8 years ago
LLM360 / amber-train
View on GitHub
Pre-training code for Amber 7B LLM
☆175May 10, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
osbm / huggingface-login
View on GitHub
A composite GitHub Action to login to the HuggingFace Hub
☆15Feb 4, 2023Updated 3 years ago
srush / annotated-mamba
View on GitHub
Annotated version of the Mamba paper
☆501Feb 27, 2024Updated 2 years ago
joey00072 / ohara
View on GitHub
Collection of autoregressive model implementation
☆84Jun 10, 2026Updated last month
anthonymartin / RKDO-recursive-kl-divergence-optimization
View on GitHub
☆16Jun 4, 2025Updated last year
lucidrains / MEGABYTE-pytorch
View on GitHub
Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch
☆655Dec 27, 2024Updated last year
kyegomez / MoE-Mamba
View on GitHub
Implementation of MoE Mamba from the paper: "MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts" in Pytorch and Ze…
☆132Jul 13, 2026Updated last week
ffaltings / InteractiveTextGeneration
View on GitHub
☆34Mar 25, 2023Updated 3 years ago