Open reproduction of MUSE for fast text2image generation.
☆359 Jun 1, 2024 Updated last year
Alternatives and similar repositories for open-muse
Users that are interested in open-muse are comparing it to the libraries listed below.
- An in-context conditioning version of MUSE with pre-trained checkpoints. ☆116 Jun 4, 2023 Updated 2 years ago
- Fast and controllable text-to-image model. ☆41 Jun 16, 2023 Updated 2 years ago
- Official Jax Implementation of MaskGIT ☆558 Nov 18, 2022 Updated 3 years ago
- ☆88 Jan 4, 2024 Updated 2 years ago
- Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models ☆556 Apr 6, 2024 Updated last year
- Code for instruction-tuning Stable Diffusion. ☆248 Feb 16, 2024 Updated 2 years ago
- MoVQGAN - model for the image encoding and reconstruction ☆264 Oct 31, 2023 Updated 2 years ago
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation ☆1,941 Aug 15, 2024 Updated last year
- Emu Series: Generative Multimodal Models from BAAI ☆1,772 Jan 12, 2026 Updated 2 months ago
- Official implementation of SEED-LLaMA (ICLR 2024). ☆642 Sep 21, 2024 Updated last year
- Official JAX implementation of MAGVIT: Masked Generative Video Transformer ☆995 Jan 17, 2024 Updated 2 years ago
- This repo contains the code for 1D tokenizer and generator ☆1,129 Mar 20, 2025 Updated last year
- SEED-Voken: A Series of Powerful Visual Tokenizers ☆998 Nov 25, 2025 Updated 3 months ago
- DataComp: In search of the next generation of multimodal datasets ☆771 Apr 28, 2025 Updated 10 months ago
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ? ☆147 Feb 11, 2025 Updated last year
- [ICLR2025] Halton Scheduler for Masked Generative Image Transformer ☆282 Oct 28, 2025 Updated 4 months ago
- LaVIT: Empower the Large Language Model to Understand and Generate Visual Content ☆605 Oct 6, 2024 Updated last year
- Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models" ☆414 Mar 25, 2024 Updated last year
- Simple large-scale training of stable diffusion with multi-node support. ☆133 May 8, 2023 Updated 2 years ago
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Layout Control with Cross-Attention Guidance". ☆42 May 24, 2023 Updated 2 years ago
- [NeurIPS 2024] OmniTokenizer: one model and one weight for image-video joint tokenization. ☆323 Jul 9, 2024 Updated last year
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis ☆3,281 Oct 31, 2024 Updated last year
- Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch ☆33 Dec 15, 2023 Updated 2 years ago
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion" ☆1,476 May 31, 2023 Updated 2 years ago
- An unofficial implementation of both ViT-VQGAN and RQ-VAE in Pytorch ☆322 Apr 7, 2025 Updated 11 months ago
- Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers" ☆1,120 Dec 22, 2025 Updated 3 months ago
- An open source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", an all-new multi modal … ☆365 Dec 15, 2023 Updated 2 years ago
- SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training ☆190 Jan 27, 2025 Updated last year
- Lumina-T2X is a unified framework for Text to Any Modality Generation ☆2,253 Feb 16, 2025 Updated last year
- Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral). ☆100 Feb 11, 2025 Updated last year
- A suite of image and video neural tokenizers ☆1,716 Feb 11, 2025 Updated last year
- [ICCV 2023] Online Clustered Codebook ☆184 Sep 19, 2024 Updated last year
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis ☆651 May 24, 2024 Updated last year
- Unofficial implementation of [StyleDrop](https://arxiv.org/abs/2306.00983) ☆586 Aug 23, 2023 Updated 2 years ago
- VideoSys: An easy and efficient system for video generation ☆2,020 Aug 27, 2025 Updated 6 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis ☆86 Jul 16, 2024 Updated last year
- Open source implementation and models of One-step Diffusion with Distribution Matching Distillation ☆182 May 26, 2024 Updated last year
- Consistency Distilled Diff VAE ☆2,213 Nov 7, 2023 Updated 2 years ago
- Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraini… ☆643 Oct 16, 2025 Updated 5 months ago