An in-context conditioning version of MUSE with pre-trained checkpoints.
☆116 · Jun 4, 2023 · Updated 2 years ago
Alternatives and similar repositories for MUSE-Pytorch
Users who are interested in MUSE-Pytorch are comparing it to the libraries listed below.
- Open reproduction of MUSE for fast text2image generation. ☆359 · Jun 1, 2024 · Updated last year
- Unofficial implementation of [StyleDrop](https://arxiv.org/abs/2306.00983) ☆586 · Aug 23, 2023 · Updated 2 years ago
- This is an unofficial PyTorch implementation of StyleDrop: Text-to-Image Generation in Any Style. ☆226 · Jul 11, 2023 · Updated 2 years ago
- [CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale ☆214 · Feb 27, 2024 · Updated 2 years ago
- Paper List for In-context Learning 🌷 ☆19 · Jan 3, 2023 · Updated 3 years ago
- ☆88 · Jan 4, 2024 · Updated 2 years ago
- recipe for training fully-featured self supervised image jepa models ☆12 · Jun 4, 2025 · Updated 9 months ago
- A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models". ☆1,099 · Mar 25, 2023 · Updated 2 years ago
- API to extract data from wikiHow ☆17 · Jul 10, 2021 · Updated 4 years ago
- Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (NeurIPS2022) ☆84 · Nov 2, 2022 · Updated 3 years ago
- Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models" ☆414 · Mar 25, 2024 · Updated last year
- [NeurIPS 2022] code for "Visual Concepts Tokenization" ☆23 · Oct 10, 2022 · Updated 3 years ago
- ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral) ☆543 · Jan 8, 2024 · Updated 2 years ago
- Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models ☆356 · Jul 4, 2023 · Updated 2 years ago
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion" ☆1,476 · May 31, 2023 · Updated 2 years ago
- unofficial pytorch implementation of HiFi-GAN with fast MISR. ☆15 · Mar 21, 2023 · Updated 3 years ago
- Test-Time Training on Video Streams ☆69 · Jul 24, 2023 · Updated 2 years ago
- Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers" ☆1,120 · Dec 22, 2025 · Updated 3 months ago
- [CVPR'23] Video Probabilistic Diffusion Models in Projected Latent Space ☆324 · May 14, 2024 · Updated last year
- Official Jax Implementation of MaskGIT ☆558 · Nov 18, 2022 · Updated 3 years ago
- ☆64 · Jul 1, 2023 · Updated 2 years ago
- A PyTorch implementation of MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis ☆576 · Mar 10, 2023 · Updated 3 years ago
- Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023 ☆1,337 · Aug 10, 2023 · Updated 2 years ago
- [IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort ☆149 · Mar 5, 2026 · Updated 2 weeks ago
- Official PyTorch Implementation for Shape-Guided Diffusion with Inside-Outside Attention, WACV 2024 ☆38 · Aug 19, 2023 · Updated 2 years ago
- Code release for Deep Incubation (https://arxiv.org/abs/2212.04129) ☆92 · Mar 16, 2023 · Updated 3 years ago
- Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models ☆312 · Dec 28, 2023 · Updated 2 years ago
- The Official Implementation of Neural View Synthesis and Matching for Semi-Supervised Few-Shot Learning of 3D Pose [NIPS 2021](https://ar… ☆20 · Dec 7, 2021 · Updated 4 years ago
- This repo contains the code for 1D tokenizer and generator ☆1,129 · Mar 20, 2025 · Updated last year
- Phonemes and durations labeling based on whisper small ☆11 · Jul 7, 2024 · Updated last year
- Exploiting unlabeled data with vision and language models for object detection, ECCV 2022 ☆94 · Jan 16, 2024 · Updated 2 years ago
- [NeurIPS 2023] Text data, code and pre-trained models for paper "Improving CLIP Training with Language Rewrites" ☆288 · Jan 14, 2024 · Updated 2 years ago
- [CVPR 2023] RILS: Masked Visual Reconstruction in Language Semantic Space (https://arxiv.org/abs/2301.06958) ☆44 · Sep 5, 2023 · Updated 2 years ago
- code release of research paper "Exploring Long-Sequence Masked Autoencoders" ☆100 · Oct 14, 2022 · Updated 3 years ago
- MoVQGAN - model for the image encoding and reconstruction ☆264 · Oct 31, 2023 · Updated 2 years ago
- Emu Series: Generative Multimodal Models from BAAI ☆1,772 · Jan 12, 2026 · Updated 2 months ago
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Structured Diffusion Guidance for Compositional Text… ☆120 · Mar 29, 2023 · Updated 2 years ago
- ☆180 · Nov 14, 2025 · Updated 4 months ago
- ☆190 · Dec 17, 2024 · Updated last year