facebookresearch / chameleon
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
☆1,955Updated 7 months ago
Alternatives and similar repositories for chameleon:
Users that are interested in chameleon are comparing it to the libraries listed below
- Cambrian-1 is a family of multimodal LLMs with a vision-centric design.☆1,877Updated 4 months ago
- Next-Token Prediction is All You Need☆2,042Updated last week
- Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation☆733Updated 7 months ago
- This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.☆1,245Updated 4 months ago
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation☆1,623Updated 7 months ago
- ☆3,591Updated last month
- 4M: Massively Multimodal Masked Modeling☆1,701Updated 2 weeks ago
- Training Large Language Model to Reason in a Continuous Latent Space☆998Updated 2 months ago
- VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and clou…☆3,043Updated this week
- Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"☆857Updated last month
- Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI☆982Updated last week
- Muon is Scalable for LLM Training☆974Updated 3 weeks ago
- An Open Large Reasoning Model for Real-World Solutions☆1,475Updated 3 weeks ago
- Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks☆2,063Updated this week
- AllenAI's post-training codebase☆2,827Updated this week
- Recipes to scale inference-time compute of open models☆1,044Updated last month
- ☆1,347Updated 4 months ago
- Code for BLT research paper☆1,436Updated this week
- Codebase for Aria - an Open Multimodal Native MoE☆1,025Updated 2 months ago
- Official PyTorch implementation for "Large Language Diffusion Models"☆1,313Updated last week
- Pretraining code for a large-scale depth-recurrent language model☆697Updated last week
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.☆1,266Updated last month
- LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning☆1,910Updated 2 months ago
- GPT4V-level open-source multi-modal model based on Llama3-8B☆2,315Updated 3 weeks ago
- Minimalistic large language model 3D-parallelism training☆1,715Updated this week
- Mixture-of-Experts for Large Vision-Language Models☆2,127Updated 3 months ago
- ☆602Updated last year
- A family of open-sourced Mixture-of-Experts (MoE) Large Language Models☆1,481Updated last year
- VideoSys: An easy and efficient system for video generation☆1,944Updated 2 weeks ago
- MINT-1T: A one trillion token multimodal interleaved dataset.☆804Updated 7 months ago