☆6,882Mar 3, 2024Updated 2 years ago
Alternatives and similar repositories for instruct-pix2pix
Users that are interested in instruct-pix2pix are comparing it to the libraries listed below
Sorting:
- ☆3,441May 14, 2024Updated last year
- Let us control diffusion models!☆33,663Feb 25, 2024Updated 2 years ago
- T2I-Adapter☆3,797Jun 21, 2024Updated last year
- Zero-shot Image-to-Image Translation [SIGGRAPH 2023]☆1,143Oct 16, 2024Updated last year
- 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.☆32,873Feb 26, 2026Updated last week
- ☆3,049Feb 27, 2023Updated 3 years ago
- Using Low-rank adaptation to quickly fine-tune diffusion models.☆7,526Mar 22, 2024Updated last year
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.☆6,480Jun 28, 2024Updated last year
- ☆7,843Apr 14, 2024Updated last year
- Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)☆1,971Dec 1, 2025Updated 3 months ago
- High-Resolution Image Synthesis with Latent Diffusion Models☆13,864Feb 29, 2024Updated 2 years ago
- Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.☆8,807Dec 10, 2023Updated 2 years ago
- LAVIS - A One-stop Library for Language-Vision Intelligence☆11,177Nov 18, 2024Updated last year
- Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions"☆1,560Dec 26, 2023Updated 2 years ago
- Official implementation of AnimateDiff.☆12,038Jul 31, 2024Updated last year
- Open-Set Grounded Text-to-Image Generation☆2,196Mar 6, 2024Updated last year
- Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion☆7,753Dec 8, 2022Updated 3 years ago
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"☆8,382May 31, 2024Updated last year
- VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models☆5,032Jan 9, 2026Updated last month
- Generative Models by Stability AI☆26,943Dec 16, 2025Updated 2 months ago
- Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and …☆17,431Sep 5, 2024Updated last year
- An open source implementation of CLIP.☆13,430Updated this week
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆24,500Aug 12, 2024Updated last year
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆3,281Oct 31, 2024Updated last year
- A latent text-to-image diffusion model☆72,575Jun 18, 2024Updated last year
- Taming Transformers for High-Resolution Image Synthesis☆6,438Jul 30, 2024Updated last year
- The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoi…☆53,551Sep 18, 2024Updated last year
- [ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation☆4,377Oct 25, 2023Updated 2 years ago
- Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference☆4,615Jun 14, 2024Updated last year
- Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)☆3,026Dec 5, 2023Updated 2 years ago
- [ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators☆4,244May 6, 2023Updated 2 years ago
- [ICML'23] StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis☆1,201Apr 7, 2023Updated 2 years ago
- Instruct-NeRF2NeRF: Editing 3D Scenes with Instructions (ICCV 2023)☆850Feb 12, 2024Updated 2 years ago
- Official repo for consistency models.☆6,477Mar 22, 2024Updated last year
- Official Pytorch Implementation for “Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation” (CVPR 2023)☆994Jun 19, 2023Updated 2 years ago
- text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆12,449Nov 4, 2025Updated 4 months ago
- Inpaint anything using Segment Anything and inpainting models.☆7,599Feb 29, 2024Updated 2 years ago
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image☆32,642Feb 18, 2026Updated 2 weeks ago
- [ICCV 2023] Consistent Image Synthesis and Editing☆840Aug 19, 2024Updated last year