OPPO-Mente-Lab / X2ILinks
Official code for ICCV 205 paper, X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation
☆74Updated 2 weeks ago
Alternatives and similar repositories for X2I
Users that are interested in X2I are comparing it to the libraries listed below
Sorting:
- Consistency Distillation with Target Timestep Selection and Decoupled Guidance☆83Updated 6 months ago
- [ICCV 2025] VisualCloze: A universal image generation framework that can support a wide range of in-domain tasks and generalize to unseen…☆246Updated last month
- Official code of "Edit Transfer: Learning Image Editing via Vision In-Context Relations"☆80Updated last month
- Conceptrol: Concept Control of Zero-shot Personalized Image Generation☆39Updated 3 months ago
- ImgEdit: A Unified Image Editing Dataset and Benchmark☆138Updated 2 weeks ago
- Subjects200K dataset☆114Updated 5 months ago
- Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".☆207Updated 3 months ago
- [Arxiv'25] BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing☆9Updated 3 months ago
- ☆111Updated 3 weeks ago
- EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM☆61Updated 3 months ago
- Official Implementation: Training-Free Efficient Video Generation via Dynamic Token Carving☆214Updated 2 weeks ago
- ☆50Updated 6 months ago
- [NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching☆159Updated 7 months ago
- [ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance☆284Updated last week
- [ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing☆156Updated 2 weeks ago
- UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer☆93Updated 2 weeks ago
- ☆112Updated last year
- ☆50Updated 6 months ago
- RepText: Rendering Visual Text via Replicating 🔥☆119Updated last month
- [ICCV 2025] MagicMirror: ID-Preserved Video Generation in Video Diffusion Transformers☆118Updated 2 weeks ago
- [CVPR 2025] Official Implementation of MotionPro: A Precise Motion Controller for Image-to-Video Generation☆109Updated last month
- Official code for VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control☆187Updated 6 months ago
- experimental implementation of Consistory☆20Updated last year
- Official implementation of HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance☆67Updated last month
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis☆54Updated 2 months ago
- Pytorch Implementation of "SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation"(CVPR 2024)☆123Updated 11 months ago
- EEdit⚡: Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing☆42Updated 3 months ago
- The official implementation of the paper titled "StableV2V: Stablizing Shape Consistency in Video-to-Video Editing".☆158Updated 7 months ago
- [ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user provided control signals. 一个支持用户自由输入控…☆127Updated last year
- Blending Custom Photos with Video Diffusion Transformers☆47Updated 5 months ago