kyegomez / MultiModalMamba
A novel implementation of fusing ViT with Mamba into a fast, agile, and high performance Multi-Modal Model. Powered by Zeta, the simplest AI framework ever.
☆430Updated last week
Related projects: ⓘ
- Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"☆778Updated last month
- Build high-performance AI models with modular building blocks☆377Updated this week
- Code repository for Black Mamba☆218Updated 7 months ago
- ☆544Updated this week
- PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"☆502Updated 8 months ago
- This repository provides the code and model checkpoints of the research paper: Scalable Pre-training of Large Autoregressive Image Model…☆680Updated 4 months ago
- LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills☆692Updated 7 months ago
- HPT - Open Multimodal LLMs from HyperGAI☆309Updated 3 months ago
- 👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]☆564Updated 6 months ago
- Mamba-Chat: A chat LLM based on the state-space model architecture 🐍☆897Updated 6 months ago
- PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention…☆271Updated 4 months ago
- Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"☆530Updated 4 months ago
- Embed arbitrary modalities (images, audio, documents, etc) into large language models.☆170Updated 5 months ago
- LLaVA-Interactive-Demo☆344Updated last month
- ☆694Updated 6 months ago
- From scratch implementation of a vision language model in pure PyTorch☆149Updated 4 months ago
- Quick exploration into fine tuning florence 2☆250Updated last month
- Reference implementation of Megalodon 7B model☆503Updated 5 months ago
- Code for Quiet-STaR☆478Updated last month
- [CVPR 2024] OneLLM: One Framework to Align All Modalities with Language☆553Updated last week
- ☆557Updated 7 months ago
- ☆428Updated last month
- ☆409Updated 10 months ago
- MLX-VLM is a package for running Vision LLMs locally on your Mac using MLX.☆187Updated this week
- Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch☆452Updated last month
- PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"☆120Updated last week
- Implementation of DoRA☆278Updated 3 months ago
- [ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation☆570Updated 3 weeks ago
- Famous Vision Language Models and Their Architectures☆295Updated last week