ToTheBeginning / PuLID
Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
β1,619Updated this week
Related projects: β
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation π₯β1,597Updated 2 months ago
- A general fine-tuning kit geared toward diffusion models.β1,534Updated this week
- Lumina-T2X is a unified framework for Text to Any Modality Generationβ2,020Updated last month
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priorsβ2,401Updated last week
- ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignmentβ1,048Updated 2 months ago
- [ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion β¦β1,364Updated last month
- CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Eβ¦β721Updated 2 weeks ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)β1,651Updated last week
- Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesisβ1,353Updated last month
- Unofficial implementation of InstantID for ComfyUIβ1,311Updated 3 months ago
- β1,347Updated this week
- PixArt-Ξ£: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generationβ1,604Updated last month
- Unofficial implementation of PhotoMaker for ComfyUIβ781Updated 3 months ago
- πIMAGDressingπ: Interactive Modular Apparel Generation for Virtual Dressingβ954Updated 3 weeks ago
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidanceβ1,595Updated last week
- [ECCV 2024] OMG: Occlusion-friendly Personalized Multi-concept Generation In Diffusion Modelsβ614Updated 2 months ago
- ComfyUI nodes for LivePortraitβ1,438Updated last month
- MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generationβ2,108Updated last month
- StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Textβ1,342Updated 2 weeks ago
- β1,334Updated 3 weeks ago
- β778Updated last week
- Dead simple FLUX LoRA training UI with LOW VRAM supportβ613Updated this week
- V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.β2,182Updated 2 months ago
- β1,173Updated this week
- SUPIR upscaling wrapper for ComfyUIβ1,468Updated last month
- πΊ An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusionβ1,155Updated 3 weeks ago
- Unofficial Implementation of Animate Anyone by Novita AIβ735Updated 3 months ago
- Create images of a given character in different posesβ551Updated 3 months ago
- Official Code for MotionCtrl [SIGGRAPH 2024]β1,263Updated last month
- Layer Diffuse custom nodesβ1,427Updated 3 weeks ago