OPPO-Mente-Lab / X2I
X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation
☆66 Updated last month
Alternatives and similar repositories for X2I
Users who are interested in X2I are comparing it to the repositories listed below.
- Conceptrol: Concept Control of Zero-shot Personalized Image Generation ☆38 Updated last month
- [NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching ☆158 Updated 5 months ago
- Consistency Distillation with Target Timestep Selection and Decoupled Guidance ☆78 Updated 4 months ago
- ☆95 Updated last month
- Official code of "Edit Transfer: Learning Image Editing via Vision In-Context Relations" ☆76 Updated last month
- ☆50 Updated 4 months ago
- Subjects200K dataset ☆110 Updated 3 months ago
- ☆48 Updated 4 months ago
- ☆46 Updated this week
- [CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization ☆206 Updated last month
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers ☆53 Updated 6 months ago
- [ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user-provided control signals. ☆123 Updated 10 months ago
- Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers ☆116 Updated 4 months ago
- Official implementation of "IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation". ☆55 Updated 7 months ago
- [ICML 2025] Official PyTorch implementation of paper "Ultra-Resolution Adaptation with Ease". ☆95 Updated last week
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis ☆40 Updated this week
- An unofficial implementation of dreamtuner ☆24 Updated last year
- Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up". ☆203 Updated last month
- [Arxiv'25] BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing ☆88 Updated last month
- RepText: Rendering Visual Text via Replicating 🔥 ☆68 Updated last week
- Blending Custom Photos with Video Diffusion Transformers ☆46 Updated 3 months ago
- Experimental implementation of Consistory ☆19 Updated 9 months ago
- [CVPR 2025] PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation ☆29 Updated 2 months ago
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing ☆57 Updated 2 months ago
- [Arxiv 2024] Edicho: Consistent Image Editing in the Wild ☆117 Updated 3 months ago
- UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer ☆80 Updated last month
- EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM ☆55 Updated last month
- ☆110 Updated last year
- ☆167 Updated 10 months ago
- [CVPR 2025] Official Implementation of MotionPro: A Precise Motion Controller for Image-to-Video Generation ☆55 Updated last month