HiDream-ai / SPM-Diff
[ICLR 2025] Official Implementation of SPM-Diff: Incorporating Visual Correspondence into Diffusion Model for Virtual Try-On
☆13 Updated 2 months ago
Alternatives and similar repositories for SPM-Diff:
Users who are interested in SPM-Diff are comparing it to the repositories listed below.
- ☆21 Updated 4 months ago
- Blending Custom Photos with Video Diffusion Transformers ☆46 Updated 3 months ago
- PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation ☆32 Updated 6 months ago
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing ☆57 Updated 2 months ago
- [arXiv'25] AnyCharV: Bootstrap Controllable Character Video Generation with Fine-to-Coarse Guidance ☆34 Updated 2 months ago
- TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation ☆30 Updated 5 months ago
- Code for Paper 'Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach' ☆17 Updated 7 months ago
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis ☆40 Updated this week
- Conceptrol: Concept Control of Zero-shot Personalized Image Generation ☆38 Updated last month
- AAAI 2025: Anywhere: A Multi-Agent Framework for User-Guided, Reliable, and Diverse Foreground-Conditioned Image Generation ☆38 Updated 11 months ago
- MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance ☆25 Updated 4 months ago
- ☆30 Updated 4 months ago
- FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation ☆57 Updated 2 weeks ago
- ☆25 Updated last month
- Official code for CustAny: Customizing Anything from A Single Example. Accepted by CVPR2025 (Oral) ☆42 Updated last month
- [ECCV 2024] IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation ☆54 Updated 7 months ago
- An official pytorch implementation of "MoLE: Enhancing Human-centric Text-to-image Diffusion via Mixture of Low-rank Experts" ☆31 Updated 5 months ago
- Code for full fine-tuning of the Mochi model with FSDP (and CP) ☆17 Updated 3 weeks ago
- ☆38 Updated 8 months ago
- ☆15 Updated last month
- ☆40 Updated 7 months ago
- ☆48 Updated 4 months ago
- Official repo for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing ☆59 Updated this week
- We introduce OpenStory++, a large-scale open-domain dataset focusing on enabling MLLMs to perform storytelling generation tasks. ☆15 Updated 8 months ago
- RepText: Rendering Visual Text via Replicating 🔥 ☆68 Updated last week
- Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers ☆116 Updated 3 months ago
- ☆26 Updated this week
- Official code of "LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer" ☆46 Updated last month
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method ☆27 Updated last year
- ☆20 Updated 7 months ago