Zheng-Chong / CatVTONLinks
[ICLR 2025] CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) and 3) Simplified Inference (< 8G VRAM for 1024X768 resolution).
☆1,501Updated 6 months ago
Alternatives and similar repositories for CatVTON
Users that are interested in CatVTON are comparing it to the libraries listed below
Sorting:
- [AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation …☆1,284Updated last month
- Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis☆1,533Updated last year
- ☆586Updated 6 months ago
- Official repository of In-Context LoRA for Diffusion Transformers☆2,015Updated 9 months ago
- [CVPR2024] StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On☆1,217Updated 8 months ago
- ComfyUI adaptation of IDM-VTON for virtual try-on.☆547Updated last year
- ViViD: Video Virtual Try-on using Diffusion Models☆548Updated last year
- Official implementation of "FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on"☆588Updated 7 months ago
- [CVPR 2025] Learning Flow Fields in Attention for Controllable Person Image Generation☆1,606Updated last week
- ☆1,297Updated 5 months ago
- [ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer☆1,769Updated 2 months ago
- ☆774Updated 10 months ago
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance☆2,453Updated last month
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion☆2,207Updated 6 months ago
- [ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning☆1,292Updated last week
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment☆3,469Updated last month
- Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"☆1,546Updated 3 months ago
- Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment☆1,413Updated last week
- 📹 A more flexible framework that can generate videos at any resolution and creates videos from images.☆1,421Updated this week
- StoryMaker: Towards consistent characters in text-to-image generation☆710Updated 9 months ago
- A repository for organizing papers, codes and other resources related to Virtual Try-on Models☆318Updated last week
- Diffusion-based Portrait and Animal Animation☆833Updated 6 months ago
- [ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.☆752Updated 9 months ago
- ☆1,034Updated 4 months ago
- [ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting…☆976Updated last year
- ☆2,192Updated 10 months ago
- ComfyUI nodes for LivePortrait☆2,070Updated last year
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥☆1,961Updated last year
- ☆617Updated 2 months ago
- PyTorch implementation of "TryOnDiffusion: A Tale of Two UNets", a virtual try-on diffusion-based network by Google☆348Updated 11 months ago