HITESHLPATEL / Mamba-PapersLinks
Awesome Mamba Papers: A Curated Collection of Research Papers , Tutorials & Blogs
☆25Updated last year
Alternatives and similar repositories for Mamba-Papers
Users that are interested in Mamba-Papers are comparing it to the libraries listed below
Sorting:
- Video descriptions of research papers relating to foundation models and scaling☆31Updated 2 years ago
- PyTorch implementation of "From Sparse to Soft Mixtures of Experts"☆58Updated last year
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch☆103Updated last year
- More dimensions = More fun☆22Updated 11 months ago
- PyTorch implementation of Soft MoE by Google Brain in "From Sparse to Soft Mixtures of Experts" (https://arxiv.org/pdf/2308.00951.pdf)☆74Updated last year
- Pytorch Implementation of the paper: "Learning to (Learn at Test Time): RNNs with Expressive Hidden States"☆25Updated 2 weeks ago
- Implementation of "PaLM2-VAdapter:" from the multi-modal model paper: "PaLM2-VAdapter: Progressively Aligned Language Model Makes a Stron…☆16Updated 8 months ago
- Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis"☆56Updated 10 months ago
- Implementation of Infini-Transformer in Pytorch☆111Updated 6 months ago
- Implementation of Agent Attention in Pytorch☆90Updated last year
- Distributed Optimization Infra for learning CLIP models☆26Updated 9 months ago
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind☆65Updated 10 months ago
- Code for “Pretrained Language Models as Visual Planners for Human Assistance”☆61Updated 2 years ago
- Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"☆140Updated 5 months ago
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"☆101Updated 10 months ago
- ☆24Updated 2 years ago
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto☆56Updated last year
- This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision☆36Updated 2 years ago
- A repository for DenseSSMs☆87Updated last year
- ☆49Updated last week
- Implementation of Zorro, Masked Multimodal Transformer, in Pytorch☆97Updated last year
- Code for the paper "Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers" [ICCV 2025]☆73Updated 3 weeks ago
- Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch☆89Updated last year
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…☆21Updated last year
- [ACL 2023] PuMer: Pruning and Merging Tokens for Efficient Vision Language Models☆32Updated 9 months ago
- HGRN2: Gated Linear RNNs with State Expansion☆55Updated 10 months ago
- WorldSense benchmark for grounded reasoning in language models☆19Updated last year
- [CVPR 2023] HierVL Learning Hierarchical Video-Language Embeddings☆46Updated last year
- ☆22Updated 6 months ago
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"☆59Updated 7 months ago