tencent-ailab / IP-AdapterLinks
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
☆6,343Updated last year
Alternatives and similar repositories for IP-Adapter
Users that are interested in IP-Adapter are comparing it to the libraries listed below
Sorting:
- T2I-Adapter☆3,771Updated last year
- Nightly release of ControlNet 1.1☆5,124Updated last year
- Transparent Image Layer Diffusion using Latent Transparency☆2,180Updated last year
- Official implementation of AnimateDiff.☆11,919Updated last year
- AnimateDiff for AUTOMATIC1111 Stable Diffusion WebUI☆3,387Updated last year
- "Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)☆2,625Updated 2 years ago
- ☆6,788Updated last month
- Improved AnimateDiff for ComfyUI and Advanced Sampling Support☆3,331Updated 4 months ago
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥☆1,997Updated last year
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors☆2,974Updated last year
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆3,236Updated last year
- ☆5,633Updated 7 months ago
- PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation☆1,879Updated last year
- Segment Anything for Stable Diffusion WebUI☆3,523Updated last year
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)☆1,837Updated 10 months ago
- Character Animation (AnimateAnyone, Face Reenactment)☆3,460Updated last year
- [WIP] Layer Diffusion for WebUI (via Forge)☆4,101Updated last year
- Open-Set Grounded Text-to-Image Generation☆2,180Updated last year
- Auto detecting, masking and inpainting with detection model.☆4,634Updated last week
- FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)☆1,889Updated 11 months ago
- Image to prompt with BLIP and CLIP☆2,925Updated last year
- Official repository of In-Context LoRA for Diffusion Transformers☆2,036Updated 11 months ago
- Using Low-rank adaptation to quickly fine-tune diffusion models.☆7,487Updated last year
- A general fine-tuning kit geared toward image/video/audio diffusion models.☆2,639Updated this week
- ☆2,228Updated last year
- Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference☆4,592Updated last year
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment☆3,494Updated 4 months ago
- More relighting!☆8,317Updated 9 months ago
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models☆3,147Updated 11 months ago
- Lora beYond Conventional methods, Other Rank adaptation Implementations for Stable diffusion.☆2,438Updated last month