MIO-Team / MIO
MIO: A Foundation Model on Multimodal Tokens
☆21Updated last month
Alternatives and similar repositories for MIO:
Users that are interested in MIO are comparing it to the libraries listed below
- ☆15Updated last week
- Pytorch Implementation of the Model from "MIRASOL3B: A MULTIMODAL AUTOREGRESSIVE MODEL FOR TIME-ALIGNED AND CONTEXTUAL MODALITIES"☆26Updated 2 months ago
- UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model☆20Updated 5 months ago
- ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer☆22Updated 2 weeks ago
- DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models☆71Updated last month
- Language Quantized AutoEncoders☆95Updated last year
- Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch☆64Updated 2 years ago
- [CVPR2024] ModaVerse: Efficiently Transforming Modalities with LLMs☆28Updated 6 months ago
- An official pytorch implementation of AAAI 2024 paper "Latent Space Editing in Transformer-based Flow Matching"☆34Updated 9 months ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆33Updated 6 months ago
- ☆41Updated last year
- LMM which strictly superset LLM embedded☆37Updated 2 months ago
- Official Repository of IJCAI 2024 Paper: "BATON: Aligning Text-to-Audio Model with Human Preference Feedback"☆18Updated 2 weeks ago
- The open source implementation of the cross attention mechanism from the paper: "JOINTLY TRAINING LARGE AUTOREGRESSIVE MULTIMODAL MODELS"☆25Updated 10 months ago
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆40Updated 9 months ago
- ☆35Updated 6 months ago
- PyTorch implementation of StableMask (ICML'24)☆12Updated 6 months ago
- ☆21Updated 3 months ago
- The official code for paper "EasyGen: Easing Multimodal Generation with a Bidirectional Conditional Diffusion Model and LLMs"☆73Updated last month
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning"☆65Updated 11 months ago
- code based for rectified flow☆30Updated this week
- [Under Review] Official PyTorch implementation code for realizing the technical part of Phantom of Latent representing equipped with enla…☆49Updated 3 months ago
- Implementation of the paper: "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning" in pytorch☆13Updated this week
- A project for tri-modal LLM benchmarking and instruction tuning.☆17Updated 2 months ago
- Implementation of Qformer from BLIP2 in Zeta Lego blocks.☆34Updated 2 months ago
- The official implementation of MAGVLT: Masked Generative Vision-and-Language Transformer (CVPR'23)☆26Updated 11 months ago
- ☆78Updated last year
- SMILE: A Multimodal Dataset for Understanding Laughter☆13Updated last year
- Official implementation for the paper "A Cheaper and Better Diffusion Language Model with Soft-Masked Noise"☆53Updated last year
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign)☆59Updated 3 months ago