state-spaces / mambaLinks

Mamba SSM architecture

☆16,573

Alternatives and similar repositories for mamba

Users that are interested in mamba are comparing it to the libraries listed below

Sorting:

johnma2006 / mamba-minimal
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
☆2,887Updated last year
hustvl / Vim
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
☆3,686Updated 9 months ago
alxndrTL / mamba.py
A simple and efficient Mamba implementation in pure PyTorch and MLX.
☆1,372Updated 11 months ago
Dao-AILab / flash-attention
Fast and memory-efficient exact attention
☆20,804Updated last week
MzeroMiko / VMamba
VMamba: Visual State Space Models，code is based on mamba
☆2,926Updated 8 months ago
state-spaces / s4
Structured state space sequence models
☆2,792Updated last year
KindXiaoming / pykan
Kolmogorov Arnold Networks
☆16,018Updated 10 months ago
facebookresearch / DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
☆8,091Updated last year
microsoft / LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
☆13,010Updated 11 months ago
Blealtan / efficient-kan
An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).
☆4,522Updated last year
test-time-training / ttt-lm-pytorch
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
☆1,282Updated last year
Jamie-Stirling / RetNet
An implementation of "Retentive Network: A Successor to Transformer for Large Language Models"
☆1,211Updated 2 years ago
microsoft / torchscale
Foundation Architecture for (M)LLMs
☆3,121Updated last year
fla-org / flash-linear-attention
🚀 Efficient implementations of state-of-the-art linear attention models
☆3,937Updated this week
lucidrains / x-transformers
A concise but complete full-attention transformer with a set of promising experimental features from various papers
☆5,706Updated 3 weeks ago
google-research / vision_transformer
☆12,053Updated 8 months ago
facebookresearch / dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
☆11,980Updated 3 months ago
yyyujintang / Awesome-Mamba-Papers
Awesome Papers related to Mamba.
☆1,377Updated last year
facebookresearch / xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
☆10,131Updated 2 weeks ago
facebookresearch / ijepa
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised…
☆3,134Updated last year
huggingface / peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
☆20,157Updated last week
NVlabs / MambaVision
[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
☆1,902Updated 4 months ago
artidoro / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆10,778Updated last year
NX-AI / xlstm
Official repository of the xLSTM.
☆2,036Updated 3 weeks ago
huggingface / accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i…
☆9,329Updated this week
BlinkDL / RWKV-LM
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)…
☆14,177Updated 2 weeks ago
openai / transformer-debugger
☆4,110Updated last year
google-research / big_vision
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
☆3,247Updated 6 months ago
meta-pytorch / gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
☆6,162Updated 3 months ago
huggingface / trl
Train transformer language models with reinforcement learning.
☆16,473Updated this week