Zheng-Chong / CatVTON
CatVTON is a simple and efficient virtual try-on diffusion model with 1) a Lightweight Network (899.06M parameters in total), 2) Parameter-Efficient Training (49.57M trainable parameters), and 3) Simplified Inference (< 8G VRAM at 1024×768 resolution).
☆933 · Updated last month
Related projects
Alternatives and complementary repositories for CatVTON
- Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis ☆1,408 · Updated 3 months ago
- 👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexib… ☆1,040 · Updated last month
- Official repository of In-Context LoRA for Diffusion Transformers ☆818 · Updated this week
- StoryMaker: Towards consistent characters in text-to-image generation ☆562 · Updated last week
- ComfyUI adaptation of IDM-VTON for virtual try-on. ☆415 · Updated 3 months ago
- [ECCV 2024] OMG: Occlusion-friendly Personalized Multi-concept Generation In Diffusion Models ☆637 · Updated 4 months ago
- [ECCV 2024 Oral🔥] Arc2Face: A Foundation Model for ID-Consistent Human Faces ☆600 · Updated last month
- Unofficial ComfyUI implementation of Magic Clothing ☆517 · Updated 2 months ago
- ☆377 · Updated 2 months ago
- [CVPR 2024] StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On ☆1,032 · Updated last month
- ☆897 · Updated this week
- [ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model. ☆629 · Updated 3 months ago
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion ☆1,490 · Updated this week
- ComfyUI custom node that simply integrates the OOTDiffusion. ☆409 · Updated 4 months ago
- ☆404 · Updated 2 weeks ago
- 📹 A more flexible CogVideoX that can generate videos at any resolution and creates videos from images. ☆464 · Updated this week
- [ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code! ☆771 · Updated 3 months ago
- [CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation ☆731 · Updated 5 months ago
- PuLID-Flux ComfyUI implementation ☆395 · Updated last month
- ☆335 · Updated 3 months ago
- ☆1,119 · Updated 3 weeks ago
- ☆755 · Updated 2 weeks ago
- ☆1,625 · Updated last week
- PuLID native implementation for ComfyUI ☆704 · Updated last month
- [ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting… ☆663 · Updated 2 months ago
- ViViD: Video Virtual Try-on using Diffusion Models ☆468 · Updated 5 months ago
- Concept Sliders for Precise Control of Diffusion Models ☆973 · Updated 2 months ago
- Diffusers wrapper to run Kwai-Kolors model ☆561 · Updated last month
- Dead simple FLUX LoRA training UI with LOW VRAM support ☆1,325 · Updated this week
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance ☆1,899 · Updated last month