johnma2006 / mamba-minimal
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
β2,700Updated 10 months ago
Alternatives and similar repositories for mamba-minimal:
Users that are interested in mamba-minimal are comparing it to the libraries listed below
- A simple and efficient Mamba implementation in pure PyTorch and MLX.β1,112Updated last month
- Mamba SSM architectureβ13,857Updated last week
- π Efficient implementations of state-of-the-art linear attention models in Pytorch and Tritonβ1,822Updated this week
- Collection of papers on state-space modelsβ572Updated this week
- Awesome Papers related to Mamba.β1,292Updated 3 months ago
- Structured state space sequence modelsβ2,541Updated 6 months ago
- Mamba-Chat: A chat LLM based on the state-space model architecture πβ919Updated 10 months ago
- Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden Statesβ1,110Updated 6 months ago
- [ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Modelβ3,188Updated 2 months ago
- Schedule-Free Optimization in PyTorchβ2,069Updated last month
- An implementation of "Retentive Network: A Successor to Transformer for Large Language Models"β1,175Updated last year
- Meta-Transformer for Unified Multimodal Learningβ1,562Updated last year
- Vector (and Scalar) Quantization, in Pytorchβ2,863Updated this week
- This project extends the idea of the innovative architecture of Kolmogorov-Arnold Networks (KAN) to the Convolutional Layers, changing thβ¦β818Updated 2 months ago
- [Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and it's Applicationsβ663Updated last month
- VMamba: Visual State Space ModelsοΌcode is based on mambaβ2,368Updated 3 months ago
- A family of open-sourced Mixture-of-Experts (MoE) Large Language Modelsβ1,427Updated 10 months ago
- Annotated version of the Mamba paperβ470Updated 11 months ago
- π¦ Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorchβ2,087Updated 2 months ago
- Official repository of the xLSTM.β1,653Updated 2 weeks ago
- An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).β4,202Updated 5 months ago
- β714Updated 8 months ago
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"β6,757Updated 7 months ago
- A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includesβ¦β1,856Updated 3 weeks ago
- Foundation Architecture for (M)LLMsβ3,039Updated 9 months ago
- Build high-performance AI models with modular building blocksβ459Updated this week
- A concise but complete full-attention transformer with a set of promising experimental features from various papersβ5,005Updated last week
- PyTorch code and models for V-JEPA self-supervised learning from video.β2,747Updated 5 months ago
- Open weights language model from Google DeepMind, based on Griffin.β614Updated 6 months ago
- A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language modelsβ671Updated last year