Zheng-Chong / CatVTON
CatVTON is a simple and efficient virtual try-on diffusion model with 1) a lightweight network (899.06M parameters in total), 2) parameter-efficient training (49.57M trainable parameters), and 3) simplified inference (< 8 GB VRAM at 1024×768 resolution).
☆888 Updated 2 weeks ago
Related projects
Alternatives and complementary repositories for CatVTON
- Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis ☆1,404 Updated 3 months ago
- 👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing ☆1,030 Updated 3 weeks ago
- [CVPR2024] StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On ☆1,020 Updated 3 weeks ago
- ComfyUI adaptation of IDM-VTON for virtual try-on. ☆409 Updated 2 months ago
- [ECCV 2024 Oral🔥] Arc2Face: A Foundation Model for ID-Consistent Human Faces ☆595 Updated last month
- [ECCV 2024] OMG: Occlusion-friendly Personalized Multi-concept Generation In Diffusion Models ☆634 Updated 4 months ago
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion ☆1,280 Updated this week
- ComfyUI custom node that simply integrates the OOTDiffusion. ☆406 Updated 3 months ago
- ViViD: Video Virtual Try-on using Diffusion Models ☆460 Updated 4 months ago
- StoryMaker: Towards consistent characters in text-to-image generation ☆549 Updated last month
- Unofficial ComfyUI implementation of Magic Clothing ☆510 Updated 2 months ago
- [ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting… ☆643 Updated 2 months ago
- [ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code! ☆761 Updated 2 months ago
- Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA ☆1,395 Updated last month
- Bring portraits to life in real time! ONNX/TensorRT support! Real-time portrait animation! ☆509 Updated last month
- Inference for the Microsoft Florence2 VLM ☆733 Updated this week
- 📹 A more flexible CogVideoX that can generate videos at any resolution and creates videos from images. ☆430 Updated this week
- A Versatile and Robust SDXL-ControlNet Model for Adaptable Line Art Conditioning ☆487 Updated 5 months ago
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥 ☆1,656 Updated last month
- [ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model. ☆622 Updated 3 months ago
- PuLID native implementation for ComfyUI ☆684 Updated last month
- Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" (TMLR 2024) ☆501 Updated last week
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance ☆1,870 Updated last month
- ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment ☆1,086 Updated 3 months ago
- Concept Sliders for Precise Control of Diffusion Models ☆971 Updated last month
- ComfyUI nodes to use segment-anything-2 ☆631 Updated last month