kyegomez / MultiModalMambaLinks
A novel implementation of fusing ViT with Mamba into a fast, agile, and high performance Multi-Modal Model. Powered by Zeta, the simplest AI framework ever.
☆447Updated 2 months ago
Alternatives and similar repositories for MultiModalMamba
Users that are interested in MultiModalMamba are comparing it to the libraries listed below
Sorting:
- Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"☆550Updated 5 months ago
- Code repository for Black Mamba☆246Updated last year
- Build high-performance AI models with modular building blocks☆519Updated this week
- Mamba-Chat: A chat LLM based on the state-space model architecture 🐍☆924Updated last year
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling☆876Updated last month
- Open weights language model from Google DeepMind, based on Griffin.☆639Updated last week
- Implementation of DoRA☆294Updated 11 months ago
- 👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]☆618Updated last year
- PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention…☆289Updated last year
- Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling☆193Updated last month
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI☆1,384Updated last year
- PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"☆619Updated last year
- LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills☆744Updated last year
- Embed arbitrary modalities (images, audio, documents, etc) into large language models.☆184Updated last year
- HPT - Open Multimodal LLMs from HyperGAI☆316Updated last year
- ☆447Updated last year
- ☆864Updated last year
- ☆707Updated last year
- Inference code for Mistral and Mixtral hacked up into original Llama implementation☆371Updated last year
- LLaVA-Interactive-Demo☆372Updated 10 months ago
- This is the official repository for the LENS (Large Language Models Enhanced to See) system.☆350Updated last year
- Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States☆411Updated 9 months ago
- Implementation of Recurrent Memory Transformer, Neurips 2022 paper, in Pytorch☆408Updated 4 months ago
- ☆613Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆231Updated 7 months ago
- ☆412Updated last year
- Reference implementation of Megalodon 7B model☆520Updated 2 weeks ago
- ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Expert…☆1,450Updated 2 months ago
- [ICLR2025 Spotlight🔥] Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters☆559Updated 3 months ago
- Official code for "TOAST: Transfer Learning via Attention Steering"☆188Updated last year