ToTheBeginning / PuLID
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
☆3,325Updated last week
Alternatives and similar repositories for PuLID
Users that are interested in PuLID are comparing it to the libraries listed below
Sorting:
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥☆1,911Updated 7 months ago
- Official repository of In-Context LoRA for Diffusion Transformers☆1,850Updated 4 months ago
- Dead simple FLUX LoRA training UI with LOW VRAM support☆2,528Updated last month
- A minimal and universal controller for FLUX.1.☆1,545Updated 3 weeks ago
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion☆2,146Updated 2 months ago
- ☆2,045Updated 6 months ago
- ComfyUI nodes for LivePortrait☆1,965Updated 9 months ago
- Kolors Team☆4,381Updated 6 months ago
- ☆1,515Updated 6 months ago
- A general fine-tuning kit geared toward diffusion models.☆2,303Updated 2 weeks ago
- [ICLR 2025] CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2)…☆1,378Updated 2 months ago
- ☆2,324Updated 8 months ago
- ☆1,606Updated last month
- Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis☆1,504Updated 9 months ago
- The ultimate training toolkit for finetuning diffusion models☆4,695Updated this week
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance☆2,347Updated 7 months ago
- ☆5,082Updated last month
- MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation☆2,529Updated 2 months ago
- The best OSS video generation models☆3,144Updated 4 months ago
- SUPIR upscaling wrapper for ComfyUI☆1,928Updated last month
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.☆5,940Updated 10 months ago
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,188Updated 3 months ago
- [CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text☆1,552Updated last month
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer☆4,126Updated last week
- ☆1,484Updated 3 months ago
- HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo☆1,416Updated last month
- [CVPR 2025] Learning Flow Fields in Attention for Controllable Person Image Generation☆1,517Updated 3 months ago
- Unofficial implementation of InstantID for ComfyUI☆1,419Updated 11 months ago
- LTX-Video Support for ComfyUI☆1,780Updated this week
- PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation☆1,804Updated 6 months ago