HITESHLPATEL / Mamba-PapersLinks
Awesome Mamba Papers: A Curated Collection of Research Papers , Tutorials & Blogs
☆25Updated last year
Alternatives and similar repositories for Mamba-Papers
Users that are interested in Mamba-Papers are comparing it to the libraries listed below
Sorting:
- HGRN2: Gated Linear RNNs with State Expansion☆54Updated 9 months ago
- Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"☆55Updated 2 months ago
- Video descriptions of research papers relating to foundation models and scaling☆31Updated 2 years ago
- [NeurIPS 2024] Official implementation of the paper "MambaLRP: Explaining Selective State Space Sequence Models".☆38Updated 7 months ago
- A repository for DenseSSMs☆87Updated last year
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"☆59Updated 6 months ago
- More dimensions = More fun☆22Updated 10 months ago
- Implementation of Infini-Transformer in Pytorch☆111Updated 5 months ago
- Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"☆137Updated 4 months ago
- User-friendly implementation of the Mixture-of-Sparse-Attention (MoSA). MoSA selects distinct tokens for each head with expert choice rou…☆17Updated last month
- State Space Models☆67Updated last year
- Explorations into improving ViTArc with Slot Attention☆41Updated 7 months ago
- ☆43Updated 4 months ago
- Implementation of Agent Attention in Pytorch☆90Updated 10 months ago
- Implementation of MoE Mamba from the paper: "MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts" in Pytorch and Ze…☆105Updated 2 months ago
- MambaFormer in-context learning experiments and implementation for https://arxiv.org/abs/2402.04248☆54Updated 11 months ago
- Official PyTorch Implementation for Paper "No More Adam: Learning Rate Scaling at Initialization is All You Need"☆51Updated 4 months ago
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind☆63Updated 8 months ago
- ☆51Updated 11 months ago
- ☆10Updated last year
- Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch☆51Updated 6 months ago
- Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis"☆55Updated 9 months ago
- Implementation of Zorro, Masked Multimodal Transformer, in Pytorch☆96Updated last year
- Distributed Optimization Infra for learning CLIP models☆26Updated 8 months ago
- Code for the paper "Cottention: Linear Transformers With Cosine Attention"☆17Updated 7 months ago
- Official implementation of ECCV24 paper: POA☆24Updated 9 months ago
- Trying out the Mamba architecture on small examples (cifar-10, shakespeare char level etc.)☆46Updated last year
- ☆23Updated 8 months ago
- [ICLR 2025] Official Code Release for Explaining Modern Gated-Linear RNNs via a Unified Implicit Attention Formulation☆42Updated 3 months ago
- ☆22Updated 5 months ago