OPPO-Mente-Lab / X2ILinks
X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation
☆72Updated 2 months ago
Alternatives and similar repositories for X2I
Users that are interested in X2I are comparing it to the libraries listed below
Sorting:
- Official code of "Edit Transfer: Learning Image Editing via Vision In-Context Relations"☆76Updated last month
- Conceptrol: Concept Control of Zero-shot Personalized Image Generation☆38Updated 2 months ago
- Consistency Distillation with Target Timestep Selection and Decoupled Guidance☆81Updated 5 months ago
- RepText: Rendering Visual Text via Replicating 🔥☆77Updated last month
- [NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching☆158Updated 6 months ago
- Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers☆117Updated 4 months ago
- Subjects200K dataset☆111Updated 4 months ago
- ☆50Updated 5 months ago
- [Arxiv'25] BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing☆90Updated 2 months ago
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis☆47Updated 3 weeks ago
- ☆97Updated 2 months ago
- Official repo for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing☆129Updated 3 weeks ago
- [Arxiv 2024] Edicho: Consistent Image Editing in the Wild☆118Updated 4 months ago
- UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer☆87Updated 3 weeks ago
- [Few-Step Student Surpasses Teacher Diffusion] Learning Few-Step Diffusion Models by Trajectory Distribution Matching☆40Updated 2 months ago
- Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".☆206Updated last month
- EEdit⚡: Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing☆27Updated 2 months ago
- [ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user provided control signals. 一个支持用户自由输入控…☆125Updated 10 months ago
- ☆95Updated 10 months ago
- Official Repo for Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation☆28Updated last year
- The official implementation of the paper titled "StableV2V: Stablizing Shape Consistency in Video-to-Video Editing".☆155Updated 5 months ago
- ☆50Updated 5 months ago
- Official Implementation: Training-Free Efficient Video Generation via Dynamic Token Carving☆170Updated this week
- ☆111Updated last year
- [ICML 2025] Official PyTorch implementation of paper "Ultra-Resolution Adaptation with Ease".☆96Updated last month
- ☆87Updated this week
- [CVPR 2025] Official implementation of the paper "SmartEraser: Remove Anything from Images using Masked-Region Guidance".☆120Updated 2 months ago
- VisualCloze: A universal image generation framework that can support a wide range of in-domain tasks and generalize to unseen ones. (🔥 �…☆229Updated 2 weeks ago
- ☆246Updated this week
- 【CVPR 2025 Oral】Official Repo for Paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea"☆128Updated 2 months ago