Zheng-Chong / CatVTONLinks
[ICLR 2025] CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) and 3) Simplified Inference (< 8G VRAM for 1024X768 resolution).
β1,522Updated 7 months ago
Alternatives and similar repositories for CatVTON
Users that are interested in CatVTON are comparing it to the libraries listed below
Sorting:
- Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesisβ1,534Updated last year
- [AAAI 2025]πIMAGDressingπ: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation β¦β1,293Updated 2 weeks ago
- β592Updated 7 months ago
- Official repository of In-Context LoRA for Diffusion Transformersβ2,023Updated 10 months ago
- [CVPR 2025] Learning Flow Fields in Attention for Controllable Person Image Generationβ1,619Updated last month
- [CVPR2024] StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-Onβ1,228Updated last week
- Official implementation of "FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on"β594Updated 8 months ago
- ComfyUI adaptation of IDM-VTON for virtual try-on.β548Updated last year
- [ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformerβ1,797Updated 3 months ago
- ViViD: Video Virtual Try-on using Diffusion Modelsβ550Updated last year
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidanceβ2,457Updated 2 months ago
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignmentβ3,481Updated 2 months ago
- πΊ An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusionβ2,218Updated 7 months ago
- A repository for organizing papers, codes and other resources related to Virtual Try-on Modelsβ337Updated last week
- β1,316Updated 6 months ago
- β779Updated 10 months ago
- Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"β1,554Updated 4 months ago
- πΉ A more flexible framework that can generate videos at any resolution and creates videos from images.β1,477Updated this week
- PyTorch implementation of "TryOnDiffusion: A Tale of Two UNets", a virtual try-on diffusion-based network by Googleβ355Updated last year
- [ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.β753Updated 10 months ago
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation π₯β1,969Updated last year
- StoryMaker: Towards consistent characters in text-to-image generationβ713Updated 10 months ago
- [ICCV 2025] π₯π₯ UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioningβ1,315Updated last month
- ComfyUI nodes for LivePortraitβ2,081Updated last year
- β2,205Updated 11 months ago
- ComfyUI custom node that simply integrates the OOTDiffusion.β465Updated last year
- β1,644Updated 11 months ago
- Nodes for image juxtaposition for Flux in ComfyUIβ1,385Updated 9 months ago
- β2,511Updated last year
- β1,524Updated 2 months ago