midrender/mamba-chat

Mamba-Chat: A chat LLM based on the state-space model architecture 🐍

☆944

Alternatives and similar repositories for mamba-chat

Users who are interested in mamba-chat are comparing it to the repositories listed below.

state-spaces / mamba
Mamba SSM architecture
☆18,628 · Jul 7, 2026 · Updated 2 weeks ago
johnma2006 / mamba-minimal
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
☆2,963 · Mar 8, 2024 · Updated 2 years ago
LegallyCoder / mamba-hf
Implementation of the Mamba SSM with hf_integration.
☆55 · Aug 31, 2024 · Updated last year
Oxen-AI / mamba-dive
This is the code that went into our practical dive using mamba as information extraction
☆57 · Dec 22, 2023 · Updated 2 years ago
kroggen / mamba.c
Inference of Mamba, Mamba2 and Mamba3 models in pure C
☆202 · Mar 18, 2026 · Updated 4 months ago
jzhang38 / LongMamba
Some preliminary explorations of Mamba's context scaling.
☆220 · Feb 8, 2024 · Updated 2 years ago
alxndrTL / mamba.py
A simple and efficient Mamba implementation in pure PyTorch and MLX.
☆1,470 · May 3, 2026 · Updated 2 months ago
geronimi73 / mamba
☆31 · Dec 29, 2023 · Updated 2 years ago
microsoft / Samba
[ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
☆966 · Nov 16, 2025 · Updated 8 months ago
Zyphra / BlackMamba
Code repository for Black Mamba
☆265 · Feb 8, 2024 · Updated 2 years ago
jzhang38 / TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
☆9,014 · May 3, 2024 · Updated 2 years ago
srush / annotated-mamba
Annotated version of the Mamba paper
☆501 · Feb 27, 2024 · Updated 2 years ago
BlinkDL / RWKV-LM
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)…
☆14,629 · Updated this week
lucidrains / self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
☆1,411 · Apr 11, 2024 · Updated 2 years ago
togethercomputer / stripedhyena
Repository for StripedHyena, a state-of-the-art beyond Transformer architecture
☆434 · Mar 7, 2024 · Updated 2 years ago
kyegomez / MultiModalMamba
A novel implementation of fusing ViT with Mamba into a fast, agile, and high performance Multi-Modal Model. Powered by Zeta, the simplest…
☆473 · Jul 13, 2026 · Updated last week
arcee-ai / mergekit
Tools for merging pretrained large language models.
☆7,246 · Jun 17, 2026 · Updated last month
pengzhangzhi / Awesome-Mamba
Awesome list of papers that extend Mamba to various applications.
☆141 · Jun 4, 2026 · Updated last month
jquesnelle / yarn
YaRN: Efficient Context Window Extension of Large Language Models
☆1,737 · Apr 17, 2024 · Updated 2 years ago
NousResearch / StripedHyenaTrainer
☆67 · Dec 8, 2023 · Updated 2 years ago
kyegomez / MambaByte
Implementation of MambaByte in "MambaByte: Token-free Selective State Space Model" in Pytorch and Zeta
☆128 · Jul 13, 2026 · Updated last week
axolotl-ai-cloud / axolotl
Go ahead and axolotl questions
☆12,219 · Updated this week
kyegomez / MambaTransformer
Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling
☆225 · Jul 13, 2026 · Updated last week
mit-han-lab / streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
☆7,248 · Jul 11, 2024 · Updated 2 years ago
llm-random / llm-random
☆212 · Jun 17, 2026 · Updated last month
state-spaces / s4
Structured state space sequence models
☆2,911 · Jul 17, 2024 · Updated 2 years ago
huggingface / alignment-handbook
Robust recipes to align language models with human and AI preferences
☆5,639 · May 26, 2026 · Updated last month
AnswerDotAI / fsdp_qlora
Training LLMs with QLoRA + FSDP
☆1,549 · Nov 9, 2024 · Updated last year
tanaymeh / mamba-train
A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM
☆62 · Apr 8, 2024 · Updated 2 years ago
kyegomez / Jamba
PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"
☆219 · Jul 13, 2026 · Updated last week
intel / intel-extension-for-transformers
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Pl…
☆2,177 · Oct 8, 2024 · Updated last year
SebastianBodza / EnsembleForecasting
Using multiple LLMs for ensemble Forecasting
☆16 · Jan 17, 2024 · Updated 2 years ago
Lightning-AI / litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
☆13,491 · Updated this week
meta-pytorch / torchtune
PyTorch native post-training library
☆5,784 · Updated this week
AvivBick / awesome-ssm-ml
Reading list for research topics in state-space models
☆366 · May 18, 2026 · Updated 2 months ago
XueFuzhao / OpenMoE
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
☆1,691 · Mar 8, 2024 · Updated 2 years ago
radarFudan / mamba-minimal-jax
☆36 · Nov 22, 2024 · Updated last year
uclaml / SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
☆1,247 · May 8, 2024 · Updated 2 years ago
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆119 · Jan 7, 2024 · Updated 2 years ago