Zheng-Chong / CatVTONLinks
[ICLR 2025] CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) and 3) Simplified Inference (< 8G VRAM for 1024X768 resolution).
β1,529Updated 8 months ago
Alternatives and similar repositories for CatVTON
Users that are interested in CatVTON are comparing it to the libraries listed below
Sorting:
- [AAAI 2025]πIMAGDressingπ: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation β¦β1,300Updated last month
- Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesisβ1,536Updated last year
- Official repository of In-Context LoRA for Diffusion Transformersβ2,029Updated 10 months ago
- β591Updated 8 months ago
- Official implementation of "FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on"β596Updated 9 months ago
- [CVPR2024] StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-Onβ1,232Updated last month
- [CVPR 2025] Learning Flow Fields in Attention for Controllable Person Image Generationβ1,622Updated 2 months ago
- ComfyUI adaptation of IDM-VTON for virtual try-on.β549Updated last year
- β1,324Updated 6 months ago
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidanceβ2,469Updated 3 months ago
- [ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformerβ1,815Updated 4 months ago
- A repository for organizing papers, codes and other resources related to Virtual Try-on Modelsβ346Updated 3 weeks ago
- ViViD: Video Virtual Try-on using Diffusion Modelsβ553Updated last year
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignmentβ3,488Updated 3 months ago
- β2,217Updated last year
- β783Updated 11 months ago
- πΊ An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusionβ2,228Updated 8 months ago
- StoryMaker: Towards consistent characters in text-to-image generationβ713Updated 11 months ago
- β1,657Updated last year
- [ICCV 2025] π₯π₯ UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioningβ1,325Updated 2 months ago
- PyTorch implementation of "TryOnDiffusion: A Tale of Two UNets", a virtual try-on diffusion-based network by Googleβ355Updated last year
- Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"β1,555Updated 4 months ago
- β1,529Updated 3 months ago
- β1,041Updated 5 months ago
- Nodes for image juxtaposition for Flux in ComfyUIβ1,389Updated 10 months ago
- ComfyUI nodes for LivePortraitβ2,087Updated last year
- Phantom: Subject-Consistent Video Generation via Cross-Modal Alignmentβ1,451Updated 2 months ago
- πΉ A more flexible framework that can generate videos at any resolution and creates videos from images.β1,513Updated last week
- ComfyUI custom node that simply integrates the OOTDiffusion.β467Updated last year
- unofficial implementation of Comfyui magic clothingβ588Updated last year