Zheng-Chong / CatVTON
[ICLR 2025] CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters in total), 2) Parameter-Efficient Training (49.57M trainable parameters) and 3) Simplified Inference (< 8G VRAM for 1024×768 resolution).
★1,328 · Updated 2 months ago
Alternatives and similar repositories for CatVTON:
Users who are interested in CatVTON are comparing it to the repositories listed below:
- Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis ★1,496 · Updated 8 months ago
- [AAAI 2025] IMAGDressing: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation … ★1,221 · Updated last month
- Official repository of In-Context LoRA for Diffusion Transformers ★1,823 · Updated 4 months ago
- ★494 · Updated last month
- Official implementation of "FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on" ★517 · Updated 2 months ago
- ComfyUI adaptation of IDM-VTON for virtual try-on. ★496 · Updated 8 months ago
- [CVPR2024] StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On ★1,144 · Updated 3 months ago
- A minimal and universal controller for FLUX.1. ★1,485 · Updated last week
- ViViD: Video Virtual Try-on using Diffusion Models ★522 · Updated 10 months ago
- ★999 · Updated this week
- ★1,476 · Updated 2 months ago
- An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion ★2,131 · Updated last month
- ★673 · Updated 5 months ago
- A more flexible framework that can generate videos at any resolution and creates videos from images. ★914 · Updated last week
- Unofficial ComfyUI implementation of Magic Clothing ★562 · Updated 7 months ago
- [CVPR 2025] Learning Flow Fields in Attention for Controllable Person Image Generation ★1,488 · Updated 2 months ago
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance ★2,317 · Updated 7 months ago
- Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling" ★1,482 · Updated 3 months ago
- Nodes for image juxtaposition for Flux in ComfyUI ★1,265 · Updated 3 months ago
- StoryMaker: Towards consistent characters in text-to-image generation ★688 · Updated 4 months ago
- [ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model. ★736 · Updated 4 months ago
- ComfyUI custom node that simply integrates OOTDiffusion. ★455 · Updated 9 months ago
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment ★3,287 · Updated 4 months ago
- ★2,018 · Updated 5 months ago
- UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning ★876 · Updated last week
- ★535 · Updated last week
- ★1,499 · Updated 5 months ago
- Memory-Guided Diffusion for Expressive Talking Video Generation ★793 · Updated 3 months ago
- A pipeline parallel training script for diffusion models. ★937 · Updated this week
- ComfyUI nodes to use segment-anything-2 ★867 · Updated last month