OPPO-Mente-Lab / X2I
X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation
☆60 Updated 3 weeks ago
Alternatives and similar repositories for X2I:
Users who are interested in X2I are comparing it to the repositories listed below:
- Consistency Distillation with Target Timestep Selection and Decoupled Guidance ☆77 Updated 3 months ago
- [NeurIPS 2024] 💫 CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching ☆156 Updated 5 months ago
- Conceptrol: Concept Control of Zero-shot Personalized Image Generation ☆32 Updated 3 weeks ago
- VisualCloze: A universal image generation framework that can support a wide range of in-domain tasks and generalize to unseen ones. ☆167 Updated this week
- Official code of "Edit Transfer: Learning Image Editing via Vision In-Context Relations" ☆72 Updated last week
- ☆110 Updated last year
- [CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization ☆201 Updated 2 weeks ago
- ☆87 Updated 2 weeks ago
- [ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user-provided control signals. One that supports free user input of control… ☆123 Updated 9 months ago
- ☆91 Updated 9 months ago
- [Arxiv'25] BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing ☆82 Updated last month
- [Arxiv 2024] Edicho: Consistent Image Editing in the Wild ☆114 Updated 3 months ago
- Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers ☆115 Updated 3 months ago
- ☆48 Updated 3 months ago
- EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM ☆54 Updated 2 weeks ago
- Subjects200K dataset ☆107 Updated 3 months ago
- Official implementation of "IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation". ☆56 Updated 7 months ago
- Blending Custom Photos with Video Diffusion Transformers ☆47 Updated 3 months ago
- ☆48 Updated 4 months ago
- UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer ☆75 Updated last month
- experimental implementation of Consistory ☆19 Updated 9 months ago
- ☆60 Updated 9 months ago
- an unofficial implementation of dreamtuner ☆24 Updated last year
- Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up". ☆204 Updated 2 weeks ago
- [ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance ☆254 Updated last week
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis ☆33 Updated 3 weeks ago
- Official code for VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control ☆184 Updated 3 months ago
- Official Repo for Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation ☆29 Updated last year
- PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation ☆32 Updated 5 months ago
- [CVPR 2025] Official implementation of the paper "SmartEraser: Remove Anything from Images using Masked-Region Guidance". ☆104 Updated 3 weeks ago