Zheng-Chong / CatVTONLinks
[ICLR 2025] CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) and 3) Simplified Inference (< 8G VRAM for 1024X768 resolution).
β1,539Updated 9 months ago
Alternatives and similar repositories for CatVTON
Users that are interested in CatVTON are comparing it to the libraries listed below
Sorting:
- Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesisβ1,534Updated last year
- [AAAI 2025]πIMAGDressingπ: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation β¦β1,309Updated 2 months ago
- Official repository of In-Context LoRA for Diffusion Transformersβ2,034Updated 11 months ago
- β594Updated 8 months ago
- [CVPR2024] StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-Onβ1,236Updated last month
- [CVPR 2025] Learning Flow Fields in Attention for Controllable Person Image Generationβ1,625Updated 2 months ago
- Official implementation of "FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on"β600Updated 9 months ago
- β1,329Updated 7 months ago
- ComfyUI adaptation of IDM-VTON for virtual try-on.β556Updated last year
- [ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformerβ1,840Updated 4 months ago
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignmentβ3,495Updated 4 months ago
- A repository for organizing papers, codes and other resources related to Virtual Try-on Modelsβ359Updated last month
- πΊ An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusionβ2,236Updated 8 months ago
- ViViD: Video Virtual Try-on using Diffusion Modelsβ555Updated last year
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidanceβ2,477Updated 2 weeks ago
- β784Updated last year
- β2,226Updated last year
- πΉ A more flexible framework that can generate videos at any resolution and creates videos from images.β1,557Updated this week
- [ICCV 2025] π₯π₯ UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioningβ1,335Updated 2 months ago
- β1,670Updated last year
- StoryMaker: Towards consistent characters in text-to-image generationβ717Updated last year
- Phantom: Subject-Consistent Video Generation via Cross-Modal Alignmentβ1,455Updated 2 months ago
- PyTorch implementation of "TryOnDiffusion: A Tale of Two UNets", a virtual try-on diffusion-based network by Googleβ359Updated last year
- Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"β1,556Updated 5 months ago
- β1,532Updated 3 months ago
- ComfyUI nodes for LivePortraitβ2,098Updated last year
- β1,044Updated 6 months ago
- ComfyUI custom node that simply integrates the OOTDiffusion.β469Updated last year
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation π₯β1,992Updated last year
- Nodes for image juxtaposition for Flux in ComfyUIβ1,389Updated 10 months ago