kyegomez / MultiModalMamba
A novel implementation of fusing ViT with Mamba into a fast, agile, and high performance Multi-Modal Model. Powered by Zeta, the simplest AI framework ever.
☆438Updated last week
Related projects ⓘ
Alternatives and complementary repositories for MultiModalMamba
- Code repository for Black Mamba☆232Updated 9 months ago
- Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"☆803Updated 3 months ago
- This repository provides the code and model checkpoints of the research paper: Scalable Pre-training of Large Autoregressive Image Model…☆697Updated 6 months ago
- Build high-performance AI models with modular building blocks☆420Updated last week
- Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters☆335Updated last week
- Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"☆537Updated 6 months ago
- run paligemma in real time☆122Updated 6 months ago
- PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"☆526Updated 10 months ago
- PyTorch implementation of models from the Zamba2 series.☆158Updated this week
- ☆228Updated 2 months ago
- Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling☆169Updated last week
- ☆700Updated 8 months ago
- 👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]☆577Updated 8 months ago
- PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"☆137Updated last week
- PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention…☆280Updated 6 months ago
- [CVPR 2024] OneLLM: One Framework to Align All Modalities with Language☆590Updated 3 weeks ago
- ☆175Updated this week
- HPT - Open Multimodal LLMs from HyperGAI☆312Updated 5 months ago
- Open weights language model from Google DeepMind, based on Griffin.☆607Updated 4 months ago
- ☆448Updated 7 months ago
- ☆461Updated 3 months ago
- ☆292Updated 5 months ago
- [NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models☆174Updated this week
- LLaVA-Interactive-Demo☆352Updated 3 months ago
- Annotated version of the Mamba paper☆457Updated 8 months ago
- Implementation of DoRA☆283Updated 5 months ago
- Janus-Series: Unified Multimodal Understanding and Generation Models☆1,084Updated last week
- ☆470Updated 2 months ago