huggingface / open-muse
Open reproduction of MUSE for fast text2image generation.
☆332 Updated 5 months ago
Related projects
Alternatives and complementary repositories for open-muse
- Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models" ☆379 Updated 7 months ago
- Large-scale text-video dataset. 10 million captioned short videos. ☆602 Updated 3 months ago
- Simple large-scale training of stable diffusion with multi-node support. ☆126 Updated last year
- Huggingface-compatible SDXL Unet implementation that is readily hackable ☆401 Updated last year
- ☆442 Updated 9 months ago
- Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis ☆310 Updated last year
- Official Jax Implementation of MaskGIT ☆449 Updated 2 years ago
- Easily create large video datasets from video URLs ☆546 Updated 3 months ago
- Implementation of MagViT2 Tokenizer in Pytorch ☆564 Updated last month
- [NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training" ☆298 Updated 5 months ago
- MoVQGAN - model for the image encoding and reconstruction ☆197 Updated last year
- 🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models". ☆433 Updated 10 months ago
- PyTorch implementation of InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions. ☆393 Updated 6 months ago
- Official implementation of SEED-LLaMA (ICLR 2024). ☆579 Updated 2 months ago
- AlignProp uses direct reward backpropagation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp… ☆242 Updated 2 weeks ago
- Better Aligning Text-to-Image Models with Human Preference. ICCV 2023 ☆266 Updated last year
- An open source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", an all-new multi modal … ☆362 Updated 11 months ago
- [ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model ☆389 Updated last week
- Get hundreds of millions of image+url pairs from the crawling at home dataset and preprocess them ☆206 Updated 5 months ago
- Subject-Diffusion: Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning ☆284 Updated 4 months ago
- Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions" ☆371 Updated 2 months ago
- An unofficial implementation of both ViT-VQGAN and RQ-VAE in Pytorch ☆286 Updated last year
- Official Pytorch Implementation of Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models ☆192 Updated last year
- An in-context conditioning version of MUSE with pre-trained checkpoints. ☆111 Updated last year
- Code for instruction-tuning Stable Diffusion. ☆212 Updated 9 months ago
- Official implementation of "Controlling Text-to-Image Diffusion by Orthogonal Finetuning". ☆281 Updated 3 weeks ago
- Densely Captioned Images (DCI) dataset repository. ☆159 Updated 4 months ago
- LaVIT: Empower the Large Language Model to Understand and Generate Visual Content ☆532 Updated last month
- ☆156 Updated last year
- [SIGGRAPH Asia 2024] ReVersion: Diffusion-Based Relation Inversion from Images ☆468 Updated last month