Zheng-Chong / CatVTON
CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) and 3) Simplified Inference (< 8G VRAM for 1024X768 resolution).
β1,086Updated 3 weeks ago
Alternatives and similar repositories for CatVTON:
Users that are interested in CatVTON are comparing it to the libraries listed below
- [AAAI 2025]πIMAGDressingπ: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation β¦β1,127Updated 3 weeks ago
- Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesisβ1,446Updated 5 months ago
- β338Updated last month
- Official repository of In-Context LoRA for Diffusion Transformersβ1,480Updated 3 weeks ago
- ViViD: Video Virtual Try-on using Diffusion Modelsβ492Updated 6 months ago
- [ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.β696Updated last month
- A minimal and universal controller for FLUX.1.β1,100Updated this week
- β1,272Updated this week
- ComfyUI adaptation of IDM-VTON for virtual try-on.β448Updated 4 months ago
- StoryMaker: Towards consistent characters in text-to-image generationβ628Updated last month
- [CVPR2024] StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-Onβ1,085Updated 2 weeks ago
- unofficial implementation of Comfyui magic clothingβ540Updated 4 months ago
- β560Updated last month
- ComfyUI custom node that simply integrates the OOTDiffusion.β432Updated 6 months ago
- Nodes for image juxtaposition for Flux in ComfyUIβ1,000Updated last week
- [ECCV 2024] OMG: Occlusion-friendly Personalized Multi-concept Generation In Diffusion Modelsβ673Updated 6 months ago
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidanceβ2,092Updated 3 months ago
- β742Updated 2 months ago
- β1,279Updated 2 months ago
- [CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translationβ750Updated 7 months ago
- Diffusion-based Portrait and Animal Animationβ608Updated this week
- You can using EchoMimic in ComfyUIβ498Updated this week
- Learning Flow Fields in Attention for Controllable Person Image Generationβ944Updated last week
- πΊ An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusionβ1,680Updated this week
- πΉ A more flexible CogVideoX that can generate videos at any resolution and creates videos from images.β613Updated last month
- Official implementation of Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image Restorationβ585Updated 3 months ago
- Stable-Hair: Real-World Hair Transfer via Diffusion Modelβ403Updated 2 months ago
- Dead simple FLUX LoRA training UI with LOW VRAM supportβ1,726Updated last week
- β385Updated this week
- [ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code!β793Updated last month