OPPO-Mente-Lab / X2I
X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation
☆55Updated this week
Alternatives and similar repositories for X2I:
Users who are interested in X2I are comparing it to the repositories listed below
- Official Repo for Paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea"☆65Updated this week
- Consistency Distillation with Target Timestep Selection and Decoupled Guidance☆74Updated 2 months ago
- Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".☆199Updated last month
- [ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user-provided control signals. A model that lets users freely input control…☆124Updated 8 months ago
- [Arxiv 2024] Edicho: Consistent Image Editing in the Wild☆114Updated 2 months ago
- An unofficial implementation of DreamTuner☆24Updated last year
- [NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching☆152Updated 4 months ago
- Experimental implementation of ConsiStory☆19Updated 8 months ago
- ☆46Updated 3 months ago
- ☆49Updated 3 months ago
- Blending Custom Photos with Video Diffusion Transformers☆46Updated 2 months ago
- UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer☆52Updated last week
- [Arxiv'25] BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing☆69Updated last week
- PyTorch implementation of "SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation" (CVPR 2024)☆111Updated 8 months ago
- ☆108Updated last year
- Official PyTorch implementation of paper "Ultra-Resolution Adaptation with Ease".☆85Updated this week
- Official code of "Edit Transfer: Learning Image Editing via Vision In-Context Relations"☆62Updated last week
- Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer"☆136Updated last week
- Conceptrol: Concept Control of Zero-shot Personalized Image Generation☆26Updated this week
- InstantUnify: Integrates Multimodal LLM into Diffusion Models 🔥☆39Updated 7 months ago
- FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation☆43Updated last month
- Official code for VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control☆182Updated 3 months ago
- Official implementation of "IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation".☆54Updated 6 months ago
- [CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization☆192Updated this week
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers☆50Updated 5 months ago
- [ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation☆177Updated last month
- Subjects200K dataset☆103Updated 2 months ago
- [CVPR 2025] Official implementation of the paper "SmartEraser: Remove Anything from Images using Masked-Region Guidance".☆96Updated last week
- Accelerating Diffusion Transformers with Token-wise Feature Caching☆115Updated 2 weeks ago
- ☆27Updated 5 months ago