alibaba / mm-diff
MM-Diff: High-Fidelity Image Personalization via Multi-Modal Condition Integration
โ25Updated 10 months ago
Alternatives and similar repositories for mm-diff:
Users that are interested in mm-diff are comparing it to the libraries listed below
- โ29Updated 5 months ago
- experimental implementation of Consistoryโ19Updated 9 months ago
- [NeurIPS 2024] ๐ซCoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matchingโ156Updated 5 months ago
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Compositionโ147Updated 2 months ago
- [ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user provided control signals. ไธไธชๆฏๆ็จๆท่ช็ฑ่พๅ ฅๆงโฆโ123Updated 9 months ago
- A collection of resources on personalized image generation.โ113Updated 2 weeks ago
- [CVPR 2025] Official implementation of the paper "SmartEraser: Remove Anything from Images using Masked-Region Guidance".โ104Updated 3 weeks ago
- The official implementation of the paper titled "StableV2V: Stablizing Shape Consistency in Video-to-Video Editing".โ151Updated 4 months ago
- [Arxiv 2024] Edicho: Consistent Image Editing in the Wildโ114Updated 3 months ago
- โ48Updated 3 months ago
- โ85Updated last month
- Official Repo for Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generationโ29Updated last year
- โ110Updated last year
- [ECCV2024] Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Modelsโ42Updated 9 months ago
- X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillationโ60Updated 3 weeks ago
- Pytorch Implementation of "SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation"(CVPR 2024)โ113Updated 8 months ago
- [CVPR2024] Official code for Drag Your Noise: Interactive Point-based Editing via Diffusion Semantic Propagationโ86Updated last year
- UniCombine: Unified Multi-Conditional Combination with Diffusion Transformerโ75Updated last month
- Conceptrol: Concept Control of Zero-shot Personalized Image Generationโ32Updated 3 weeks ago
- Official code for K-LoRA (CVPR 2025)โ99Updated last month
- InstantStyle-Plus: Style Transfer with Content-Preserving in Text-to-Image Generation ๐ฅโ114Updated 9 months ago
- โ91Updated 9 months ago
- โ21Updated last month
- Official implementation of "IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation".โ56Updated 7 months ago
- PosterMaker [CVPR 2025] https://poster-maker.github.io/โ28Updated this week
- โ24Updated 10 months ago
- โ114Updated 6 months ago
- code for "MVOC:atraining-free multiple video object composition method with diffusion models"โ21Updated 9 months ago
- โ27Updated 6 months ago
- Consistency Distillation with Target Timestep Selection and Decoupled Guidanceโ77Updated 3 months ago